Sample records for conclusions cluster analysis

  1. An integrated bioinformatics approach to improve two-color microarray quality-control: impact on biological conclusions.

    PubMed

    van Haaften, Rachel I M; Luceri, Cristina; van Erk, Arie; Evelo, Chris T A

    2009-06-01

    Omics technology used for large-scale measurements of gene expression is rapidly evolving. This work pointed out the need for extensive bioinformatics analyses for array quality assessment before and after gene expression clustering and pathway analysis. A study focused on the effect of red wine polyphenols on rat colon mucosa was used to test the impact of quality control and normalisation steps on the biological conclusions. The integration of data visualization, pathway analysis and clustering revealed an artifact problem that was solved with an adapted normalisation. We propose a possible point-to-point standard analysis procedure, based on a combination of clustering and data visualization, for the analysis of microarray data.

  2. The detection methods of dynamic objects

    NASA Astrophysics Data System (ADS)

    Knyazev, N. L.; Denisova, L. A.

    2018-01-01

    The article applies cluster analysis methods to the task of aircraft detection, based on distributing a selection of navigation parameters into groups (clusters). A modified cluster analysis method is suggested that searches for and detects objects, then iteratively combines them into clusters and counts the clusters, in order to increase the accuracy of aircraft detection. The operation of the method and the features of its implementation are considered. In conclusion, the efficiency of the proposed method for accurate cluster analysis in target finding is shown.

  3. Challenges in microarray class discovery: a comprehensive examination of normalization, gene selection and clustering

    PubMed Central

    2010-01-01

    Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involves decisions on how to handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Results We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieve the highest adjusted Rand index. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that background correction is preferable, in particular if the gene selection is successful. However, this is an area that needs to be studied further in order to draw any general conclusions. Conclusions The choice of cluster analysis, and in particular gene selection, has a large impact on the ability to cluster individuals correctly based on expression profiles. Normalization has a positive effect, but the relative performance of different normalizations is an area that needs more research. In summary, although clustering, gene selection and normalization are considered standard methods in bioinformatics, our comprehensive analysis shows that selecting the right methods, and the right combinations of methods, is far from trivial and that much is still unexplored in what is considered to be the most basic analysis of genomic data. PMID:20937082
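
    The record above scores each clustering against known classes with the adjusted Rand index. As a rough illustration of that measure only (not the authors' pipeline), the following Python sketch computes it from pair counts; the function name adjusted_rand_index and the toy labels are assumptions made for this example.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand index between two flat labelings of the same items."""
    if len(labels_true) != len(labels_pred):
        raise ValueError("labelings must have the same length")
    n = len(labels_true)
    if n < 2:
        # No pairs of items exist, so the labelings trivially agree.
        return 1.0

    # Contingency counts: items sharing each (known class, cluster) pair.
    pair_counts = Counter(zip(labels_true, labels_pred))
    true_counts = Counter(labels_true)
    pred_counts = Counter(labels_pred)

    # Number of item pairs placed together under each view.
    together_both = sum(comb(c, 2) for c in pair_counts.values())
    together_true = sum(comb(c, 2) for c in true_counts.values())
    together_pred = sum(comb(c, 2) for c in pred_counts.values())

    expected = together_true * together_pred / comb(n, 2)
    maximum = (together_true + together_pred) / 2
    if maximum == expected:
        # Degenerate case, e.g. both labelings use a single cluster.
        return 1.0
    return (together_both - expected) / (maximum - expected)


# Toy data (hypothetical): cluster names differ, the grouping is identical.
known_classes = ["A", "A", "B", "B"]
print(adjusted_rand_index(known_classes, [1, 1, 0, 0]))
# A clustering that splits every known class scores below zero.
print(adjusted_rand_index(known_classes, [0, 1, 0, 1]))
```

    The index is 1 for identical groupings regardless of how clusters are named, is near 0 for agreement at chance level, and can be negative, which is why it suits comparisons across many method combinations.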

  4. Person mobility in the design and analysis of cluster-randomized cohort prevention trials.

    PubMed

    Vuchinich, Sam; Flay, Brian R; Aber, Lawrence; Bickman, Leonard

    2012-06-01

    Person mobility is an inescapable fact of life for most cluster-randomized (e.g., schools, hospitals, clinics, cities, states) cohort prevention trials. Mobility rates are an important substantive consideration in estimating the effects of an intervention. In cluster-randomized trials, mobility rates are often correlated with ethnicity, poverty and other variables associated with disparity. This raises the possibility that estimated intervention effects may generalize to only the least mobile segments of a population and, thus, create a threat to external validity. Such mobility can also create threats to the internal validity of conclusions from randomized trials. Researchers must decide how to deal with persons who leave study clusters during a trial (dropouts), persons and clusters that do not comply with an assigned intervention, and persons who enter clusters during a trial (late entrants), in addition to the persons who remain for the duration of a trial (stayers). Statistical techniques alone cannot solve the key issues of internal and external validity raised by the phenomenon of person mobility. This commentary presents a systematic, Campbellian-type analysis of person mobility in cluster-randomized cohort prevention trials. It describes four approaches for dealing with dropouts, late entrants and stayers with respect to data collection, analysis and generalizability. The questions at issue are: 1) From whom should data be collected at each wave of data collection? 2) Which cases should be included in the analyses of an intervention effect? and 3) To what populations can trial results be generalized? The conclusions lead to recommendations for the design and analysis of future cluster-randomized cohort prevention trials.

  5. Characterizing Heterogeneity within Head and Neck Lesions Using Cluster Analysis of Multi-Parametric MRI Data

    PubMed Central

    Borri, Marco; Schmidt, Maria A.; Powell, Ceri; Koh, Dow-Mu; Riddell, Angela M.; Partridge, Mike; Bhide, Shreerang A.; Nutting, Christopher M.; Harrington, Kevin J.; Newbold, Katie L.; Leach, Martin O.

    2015-01-01

    Purpose To describe a methodology, based on cluster analysis, to partition multi-parametric functional imaging data into groups (or clusters) of similar functional characteristics, with the aim of characterizing functional heterogeneity within head and neck tumour volumes. To evaluate the performance of the proposed approach on a set of longitudinal MRI data, analysing the evolution of the obtained sub-sets with treatment. Material and Methods The cluster analysis workflow was applied to a combination of dynamic contrast-enhanced and diffusion-weighted imaging MRI data from a cohort of squamous cell carcinoma of the head and neck patients. Cumulative distributions of voxels, containing pre and post-treatment data and including both primary tumours and lymph nodes, were partitioned into k clusters (k = 2, 3 or 4). Principal component analysis and cluster validation were employed to investigate data composition and to independently determine the optimal number of clusters. The evolution of the resulting sub-regions with induction chemotherapy treatment was assessed relative to the number of clusters. Results The clustering algorithm was able to separate clusters which significantly reduced in voxel number following induction chemotherapy from clusters with a non-significant reduction. Partitioning with the optimal number of clusters (k = 4), determined with cluster validation, produced the best separation between reducing and non-reducing clusters. Conclusion The proposed methodology was able to identify tumour sub-regions with distinct functional properties, independently separating clusters which were affected differently by treatment. This work demonstrates that unsupervised cluster analysis, with no prior knowledge of the data, can be employed to provide a multi-parametric characterization of functional heterogeneity within tumour volumes. PMID:26398888

  6. Obstructive Sleep Apnea: A Cluster Analysis at Time of Diagnosis

    PubMed Central

    Grillet, Yves; Richard, Philippe; Stach, Bruno; Vivodtzev, Isabelle; Timsit, Jean-Francois; Lévy, Patrick; Tamisier, Renaud; Pépin, Jean-Louis

    2016-01-01

    Background The classification of obstructive sleep apnea is on the basis of sleep study criteria that may not adequately capture disease heterogeneity. Improved phenotyping may improve prognosis prediction and help select therapeutic strategies. Objectives: This study used cluster analysis to investigate the clinical clusters of obstructive sleep apnea. Methods An ascending hierarchical cluster analysis was performed on baseline symptoms, physical examination, risk factor exposure and co-morbidities from 18,263 participants in the OSFP (French national registry of sleep apnea). The probability for criteria to be associated with a given cluster was assessed using odds ratios, determined by univariate logistic regression. Results: Six clusters were identified, in which patients varied considerably in age, sex, symptoms, obesity, co-morbidities and environmental risk factors. The main significant differences between clusters were minimally symptomatic versus sleepy obstructive sleep apnea patients, lean versus obese, and among obese patients different combinations of co-morbidities and environmental risk factors. Conclusions Our cluster analysis identified six distinct clusters of obstructive sleep apnea. Our findings underscore the high degree of heterogeneity that exists within obstructive sleep apnea patients regarding clinical presentation, risk factors and consequences. This may help in both research and clinical practice for validating new prevention programs, in diagnosis and in decisions regarding therapeutic strategies. PMID:27314230

  7. Cluster analysis of spontaneous preterm birth phenotypes identifies potential associations among preterm birth mechanisms

    PubMed Central

    Esplin, M Sean; Manuck, Tracy A.; Varner, Michael W.; Christensen, Bryce; Biggio, Joseph; Bukowski, Radek; Parry, Samuel; Zhang, Heping; Huang, Hao; Andrews, William; Saade, George; Sadovsky, Yoel; Reddy, Uma M.; Ilekis, John

    2015-01-01

    Objective We sought to employ an innovative tool based on common biological pathways to identify specific phenotypes among women with spontaneous preterm birth (SPTB), in order to enhance investigators' ability to identify and highlight common mechanisms and underlying genetic factors responsible for SPTB. Study Design A secondary analysis of a prospective case-control multicenter study of SPTB. All cases delivered a preterm singleton at SPTB ≤34.0 weeks gestation. Each woman was assessed for the presence of underlying SPTB etiologies. A hierarchical cluster analysis was used to identify groups of women with homogeneous phenotypic profiles. One of the phenotypic clusters was selected for candidate gene association analysis using VEGAS software. Results 1028 women with SPTB were assigned phenotypes. Hierarchical clustering of the phenotypes revealed five major clusters. Cluster 1 (N=445) was characterized by maternal stress, cluster 2 (N=294) by premature membrane rupture, cluster 3 (N=120) by familial factors, and cluster 4 (N=63) by maternal comorbidities. Cluster 5 (N=106) was multifactorial, characterized by infection (INF), decidual hemorrhage (DH) and placental dysfunction (PD). These three phenotypes were highly correlated by Chi-square analysis [PD and DH (p<2.2e-6); PD and INF (p=6.2e-10); INF and DH (p=0.0036)]. Gene-based testing identified the INS (insulin) gene as significantly associated with cluster 3 of SPTB. Conclusion We identified 5 major clusters of SPTB based on a phenotype tool and hierarchical clustering. There was significant correlation between several of the phenotypes. The INS gene was associated with familial factors underlying SPTB. PMID:26070700

  8. Tobacco, Marijuana, and Alcohol Use in University Students: A Cluster Analysis

    PubMed Central

    Primack, Brian A.; Kim, Kevin H.; Shensa, Ariel; Sidani, Jaime E.; Barnett, Tracey E.; Switzer, Galen E.

    2012-01-01

    Objective Segmentation of populations may facilitate development of targeted substance abuse prevention programs. We aimed to partition a national sample of university students according to profiles based on substance use. Participants We used 2008–2009 data from the National College Health Assessment from the American College Health Association. Our sample consisted of 111,245 individuals from 158 institutions. Method We partitioned the sample using cluster analysis according to current substance use behaviors. We examined the association of cluster membership with individual and institutional characteristics. Results Cluster analysis yielded six distinct clusters. Three individual factors—gender, year in school, and fraternity/sorority membership—were the most strongly associated with cluster membership. Conclusions In a large sample of university students, we were able to identify six distinct patterns of substance abuse. It may be valuable to target specific populations of college-aged substance users based on individual factors. However, comprehensive intervention will require a multifaceted approach. PMID:22686360

  9. Visualizing nD Point Clouds as Topological Landscape Profiles to Guide Local Data Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Oesterling, Patrick; Heine, Christian; Weber, Gunther H.

    2012-05-04

    Analyzing high-dimensional point clouds is a classical challenge in visual analytics. Traditional techniques, such as projections or axis-based techniques, suffer from projection artifacts, occlusion, and visual complexity. We propose to split data analysis into two parts to address these shortcomings. First, a structural overview phase abstracts data by its density distribution. This phase performs topological analysis to support accurate and non-overlapping presentation of the high-dimensional cluster structure as a topological landscape profile. Utilizing a landscape metaphor, it presents clusters and their nesting as hills whose height, width, and shape reflect cluster coherence, size, and stability, respectively. A second local analysis phase utilizes this global structural knowledge to select individual clusters or point sets for further, localized data analysis. Focusing on structural entities significantly reduces visual clutter in established geometric visualizations and permits a clearer, more thorough data analysis. In conclusion, this analysis complements the global topological perspective and enables the user to study subspaces or geometric properties, such as shape.

  10. A comparison of visual search strategies of elite and non-elite tennis players through cluster analysis.

    PubMed

    Murray, Nicholas P; Hunfalvay, Melissa

    2017-02-01

    Considerable research has documented that successful performance in interceptive tasks (such as return of serve in tennis) is based on the performers' capability to capture appropriate anticipatory information prior to the flight path of the approaching object. Athletes of higher skill tend to fixate on different locations in the playing environment prior to initiation of a skill than their lesser skilled counterparts. The purpose of this study was to examine visual search behaviour strategies of elite (world ranked) tennis players and non-ranked competitive tennis players (n = 43) utilising cluster analysis. The results of hierarchical (Ward's method) and nonhierarchical (k-means) cluster analyses revealed three different clusters. The clustering method distinguished visual behaviour of high-, middle- and low-ranked players. Specifically, high-ranked players demonstrated longer mean fixation duration and lower variation of visual search than middle- and low-ranked players. In conclusion, the results demonstrated that cluster analysis is a useful tool for detecting and analysing the areas of interest for use in experimental analysis of expertise and to distinguish visual search variables among participants.

  11. Phylogenetic relationship of Ornithobacterium rhinotracheale strains.

    PubMed

    DE Oca-Jimenez, Roberto Montes; Vega-Sanchez, Vicente; Morales-Erasto, Vladimir; Salgado-Miranda, Celene; Blackall, Patrick J; Soriano-Vargas, Edgardo

    2018-04-10

    The bacterium Ornithobacterium rhinotracheale is associated with respiratory disease in wild birds and poultry. In this study, the phylogenetic analysis of nine reference strains of O. rhinotracheale belonging to serovars A to I, and eight Mexican isolates belonging to serovar A, was performed. The analysis was extended to include available sequences from another 23 strains available in the public domain. The analysis showed that the 40 sequences formed six clusters, I to VI. All eight Mexican field isolates were placed in cluster I. One of the reference strains appears to present genetic diversity not previously recognized and was placed in a new genetic cluster. In conclusion, the phylogenetic analysis of O. rhinotracheale strains, based on the 16S rRNA gene, is a suitable tool for epidemiologic studies.

  12. Clustering of health-related behaviors among early and mid-adolescents in Tuscany: results from a representative cross-sectional study

    PubMed Central

    Lazzeri, Giacomo; Panatto, Donatella; Domnich, Alexander; Arata, Lucia; Pammolli, Andrea; Simi, Rita; Giacchi, Mariano Vincenzo; Amicizia, Daniela; Gasparini, Roberto

    2018-01-01

    Background A huge amount of literature suggests that adolescents’ health-related behaviors tend to occur in clusters, and the understanding of such behavioral clustering may have direct implications for the effective tailoring of health-promotion interventions. Despite the usefulness of analyzing clustering, Italian data on this topic are scant. This study aimed to evaluate the clustering patterns of health-related behaviors. Methods The present study is based on data from the Health Behaviors in School-aged Children (HBSC) study conducted in Tuscany in 2010, which involved 3291 11-, 13- and 15-year olds. To aggregate students’ data on 22 health-related behaviors, factor analysis and subsequent cluster analysis were performed. Results Factor analysis revealed eight factors, which were dubbed in accordance with their main traits: ‘Alcohol drinking’, ‘Smoking’, ‘Physical activity’, ‘Screen time’, ‘Signs & symptoms’, ‘Healthy eating’, ‘Violence’ and ‘Sweet tooth’. These factors explained 67% of variance and underwent cluster analysis. A six-cluster k-means solution was established with a 93.8% level of classification validity. The between-cluster differences in both mean age and gender distribution were highly statistically significant. Conclusions Health-compromising behaviors are common among Tuscan teens and occur in distinct clusters. These results may be used by schools, health-promotion authorities and other stakeholders to design and implement tailored preventive interventions in Tuscany. PMID:27908972

  13. Cluster randomised trials in the medical literature: two bibliometric surveys

    PubMed Central

    Bland, J Martin

    2004-01-01

    Background Several reviews of published cluster randomised trials have reported that about half did not take clustering into account in the analysis, which was thus incorrect and potentially misleading. In this paper I ask whether cluster randomised trials are increasing in both number and quality of reporting. Methods Computer search for papers on cluster randomised trials since 1980, hand search of trial reports published in selected volumes of the British Medical Journal over 20 years. Results There has been a large increase in the numbers of methodological papers and of trial reports using the term 'cluster random' in recent years, with about equal numbers of each type of paper. The British Medical Journal contained more such reports than any other journal. In this journal there was a corresponding increase over time in the number of trials where subjects were randomised in clusters. In 2003 all reports showed awareness of the need to allow for clustering in the analysis. In 1993 and before clustering was ignored in most such trials. Conclusion Cluster trials are becoming more frequent and reporting is of higher quality. Perhaps statistician pressure works. PMID:15310402

  14. Functional clustering of time series gene expression data by Granger causality

    PubMed Central

    2012-01-01

    Background A common approach for time series gene expression data analysis includes the clustering of genes with similar expression patterns throughout time. Clustered gene expression profiles point to the joint contribution of groups of genes to a particular cellular process. However, since genes belong to intricate networks, other features, besides comparable expression patterns, should provide additional information for the identification of functionally similar genes. Results In this study we perform gene clustering through the identification of Granger causality between and within sets of time series gene expression data. Granger causality is based on the idea that the cause of an event cannot come after its consequence. Conclusions This kind of analysis can be used as a complementary approach for functional clustering, wherein genes would be clustered not solely based on their expression similarity but on their topological proximity built according to the intensity of Granger causality among them. PMID:23107425
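
    The idea behind Granger causality in the record above can be illustrated with a simple pairwise test: past values of one series "Granger-cause" another if they reduce its prediction error beyond what the series' own past achieves. The Python sketch below is a single-lag, two-series simplification on synthetic data, not the set-based method of the paper; the names least_squares_rss and granger_f_statistic, the lag of 1, and the simulated series are all assumptions made for this example.

```python
import random


def least_squares_rss(rows, targets):
    """Residual sum of squares of an ordinary least-squares fit."""
    k = len(rows[0])
    # Augmented normal-equation system [X'X | X'y].
    a = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
         + [sum(r[i] * t for r, t in zip(rows, targets))]
         for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(k):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [v - factor * p for v, p in zip(a[r], a[col])]
    beta = [a[i][k] / a[i][i] for i in range(k)]
    return sum((t - sum(b * v for b, v in zip(beta, r))) ** 2
               for r, t in zip(rows, targets))


def granger_f_statistic(cause, effect):
    """F statistic for 'cause helps predict effect', using one lag."""
    targets = effect[1:]
    steps = range(1, len(effect))
    # Restricted model: effect[t] from its own previous value only.
    restricted = [[1.0, effect[t - 1]] for t in steps]
    # Unrestricted model: additionally uses the previous value of cause.
    unrestricted = [[1.0, effect[t - 1], cause[t - 1]] for t in steps]
    rss_r = least_squares_rss(restricted, targets)
    rss_u = least_squares_rss(unrestricted, targets)
    dof = len(targets) - 3  # observations minus unrestricted parameters
    return (rss_r - rss_u) / (rss_u / dof)


# Synthetic example (hypothetical data): y follows x with a one-step delay.
rng = random.Random(0)
x = [rng.gauss(0, 1) for _ in range(200)]
y = [0.0] + [0.8 * x[t - 1] + rng.gauss(0, 0.1) for t in range(1, 200)]
f_xy = granger_f_statistic(x, y)  # x -> y: expected to be large
f_yx = granger_f_statistic(y, x)  # y -> x: expected to be small
print(f_xy > f_yx)
```

    In a clustering setting, pairwise statistics of this kind could serve as edge weights between genes, in the spirit of the topological proximity the abstract describes; a real analysis would need lag selection and multiple-testing control.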

  15. Micro-heterogeneity versus clustering in binary mixtures of ethanol with water or alkanes.

    PubMed

    Požar, Martina; Lovrinčević, Bernarda; Zoranić, Larisa; Primorać, Tomislav; Sokolić, Franjo; Perera, Aurélien

    2016-08-24

    Ethanol is a hydrogen bonding liquid. When mixed in small concentrations with water or alkanes, it forms aggregate structures reminiscent of, respectively, the direct and inverse micellar aggregates found in emulsions, albeit at much smaller sizes. At higher concentrations, micro-heterogeneous mixing with segregated domains is found. We examine how different statistical methods, namely correlation function analysis, structure factor analysis and cluster distribution analysis, can describe efficiently these morphological changes in these mixtures. In particular, we explain how the neat alcohol pre-peak of the structure factor evolves into the domain pre-peak under mixing conditions, and how this evolution differs whether the co-solvent is water or alkane. This study clearly establishes the heuristic superiority of the correlation function/structure factor analysis to study the micro-heterogeneity, since cluster distribution analysis is insensitive to domain segregation. Correlation functions detect the domains, with a clear structure factor pre-peak signature, while the cluster techniques detect the cluster hierarchy within domains. The main conclusion is that, in micro-segregated mixtures, the domain structure is a more fundamental statistical entity than the underlying cluster structures. These findings could help better understand comparatively the radiation scattering experiments, which are sensitive to domains, versus the spectroscopy-NMR experiments, which are sensitive to clusters.

  16. Changing cluster composition in cluster randomised controlled trials: design and analysis considerations

    PubMed Central

    2014-01-01

    Background There are many methodological challenges in the conduct and analysis of cluster randomised controlled trials, but one that has received little attention is that of post-randomisation changes to cluster composition. To illustrate this, we focus on the issue of cluster merging, considering the impact on the design, analysis and interpretation of trial outcomes. Methods We explored the effects of merging clusters on study power using standard methods of power calculation. We assessed the potential impacts on study findings of both homogeneous cluster merges (involving clusters randomised to the same arm of a trial) and heterogeneous merges (involving clusters randomised to different arms of a trial) by simulation. To determine the impact on bias and precision of treatment effect estimates, we applied standard methods of analysis to different populations under analysis. Results Cluster merging produced a systematic reduction in study power. This effect depended on the number of merges and was most pronounced when variability in cluster size was at its greatest. Simulations demonstrate that the impact on analysis was minimal when cluster merges were homogeneous, with impact on study power being balanced by a change in observed intracluster correlation coefficient (ICC). We found a decrease in study power when cluster merges were heterogeneous, and the estimate of treatment effect was attenuated. Conclusions Examples of cluster merges found in previously published reports of cluster randomised trials were typically homogeneous rather than heterogeneous. Simulations demonstrated that trial findings in such cases would be unbiased. However, simulations also showed that any heterogeneous cluster merges would introduce bias that would be hard to quantify, as well as having negative impacts on the precision of estimates obtained. Further methodological development is warranted to better determine how to analyse such trials appropriately. Interim recommendations include avoidance of cluster merges where possible, discontinuation of clusters following heterogeneous merges, allowance for potential loss of clusters and additional variability in cluster size in the original sample size calculation, and use of appropriate ICC estimates that reflect cluster size. PMID:24884591
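
    The link between cluster merging, cluster-size variability and power in the record above can be made concrete with the commonly used design-effect approximation for unequal cluster sizes, 1 + ((cv^2 + 1) * mean size - 1) * ICC, where cv is the coefficient of variation of cluster size. The Python sketch below applies that textbook approximation with the ICC held fixed; it is not the authors' simulation, and the function names, cluster sizes and ICC value are assumptions chosen for illustration.

```python
def design_effect(cluster_sizes, icc):
    """Approximate design effect allowing for unequal cluster sizes.

    Uses 1 + ((cv^2 + 1) * mean_size - 1) * icc. The term
    (cv^2 + 1) * mean_size equals sum(m^2) / sum(m) when cv is
    computed from the population standard deviation.
    """
    total = sum(cluster_sizes)
    size_term = sum(m * m for m in cluster_sizes) / total
    return 1 + (size_term - 1) * icc


def effective_sample_size(cluster_sizes, icc):
    """Number of independent observations the clustered sample is worth."""
    return sum(cluster_sizes) / design_effect(cluster_sizes, icc)


# Hypothetical trial arm: ten clusters of 20 participants, ICC = 0.05.
before_merge = [20] * 10
# Two of those clusters merge into one cluster of 40.
after_merge = [40] + [20] * 8

for label, sizes in (("before", before_merge), ("after", after_merge)):
    print(label,
          round(design_effect(sizes, 0.05), 3),
          round(effective_sample_size(sizes, 0.05), 1))
```

    With the same 200 participants, the merge raises the design effect and lowers the effective sample size, which is one way to see the loss of power. The abstract notes that for homogeneous merges this loss can be offset by a change in the observed ICC, an effect this fixed-ICC sketch does not model.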

  17. A generalized analysis of hydrophobic and loop clusters within globular protein sequences

    PubMed Central

    Eudes, Richard; Le Tuan, Khanh; Delettré, Jean; Mornon, Jean-Paul; Callebaut, Isabelle

    2007-01-01

    Background Hydrophobic Cluster Analysis (HCA) is an efficient way to compare highly divergent sequences through the implicit secondary structure information directly derived from hydrophobic clusters. However, its efficiency and application are currently limited by the need for user expertise. In order to help the analysis of HCA plots, we report here the structural preferences of hydrophobic cluster species, which are frequently encountered in globular domains of proteins. These species are characterized only by their hydrophobic/non-hydrophobic dichotomy. This analysis has been extended to loop-forming clusters, using an appropriate loop alphabet. Results The structural behavior of hydrophobic cluster species, which are typical of protein globular domains, was investigated within banks of experimental structures, considered at different levels of sequence redundancy. The 294 most frequent hydrophobic cluster species were analyzed with regard to their association with the different secondary structures (frequencies of association with secondary structures and secondary structure propensities). Hydrophobic cluster species are predominantly associated with regular secondary structures, and a large part (60%) reveals preferences for α-helices or β-strands. Moreover, the analysis of the hydrophobic cluster amino acid composition generally allows for finer prediction of the regular secondary structure associated with the considered cluster within a cluster species. We also investigated the behavior of loop-forming clusters, using a "PGDNS" alphabet. These loop clusters do not overlap with hydrophobic clusters and are highly associated with coils. Finally, the structural information contained in the hydrophobic structural words, as deduced from experimental structures, was compared to the PSI-PRED predictions, revealing that β-strands and especially α-helices are generally over-predicted within the limits of typical β and α hydrophobic clusters. Conclusion The dictionary of hydrophobic clusters described here can help the HCA user to interpret and compare the HCA plots of globular protein sequences, and provides an original fundamental insight into the structural bricks of protein folds. Moreover, the novel loop cluster analysis brings additional information for secondary structure prediction on the whole sequence through a generalized cluster analysis (GCA), and not only on regular secondary structures. Such information lays the foundations for developing a new and original tool for secondary structure prediction. PMID:17210072

  18. Astrophysical properties of star clusters in the Magellanic Clouds homogeneously estimated by ASteCA

    NASA Astrophysics Data System (ADS)

    Perren, G. I.; Piatti, A. E.; Vázquez, R. A.

    2017-06-01

    Aims: We seek to produce a homogeneous catalog of astrophysical parameters of 239 resolved star clusters, located in the Small and Large Magellanic Clouds, observed in the Washington photometric system. Methods: The cluster sample was processed with the recently introduced Automated Stellar Cluster Analysis (ASteCA) package, which ensures both an automatized and a fully reproducible treatment, together with a statistically based analysis of their fundamental parameters and associated uncertainties. The fundamental parameters determined for each cluster with this tool, via a color-magnitude diagram (CMD) analysis, are metallicity, age, reddening, distance modulus, and total mass. Results: We generated a homogeneous catalog of structural and fundamental parameters for the studied cluster sample and performed a detailed internal error analysis along with a thorough comparison with values taken from 26 published articles. We studied the distribution of cluster fundamental parameters in both Clouds and obtained their age-metallicity relationships. Conclusions: The ASteCA package can be applied to an unsupervised determination of fundamental cluster parameters, which is a task of increasing relevance as more data becomes available through upcoming surveys. A table with the estimated fundamental parameters for the 239 clusters analyzed is only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/602/A89

  19. Development and optimization of SPECT gated blood pool cluster analysis for the prediction of CRT outcome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lalonde, Michel; Wassenaar, Richard; Wells, R. Glenn

    2014-07-15

    Purpose: Phase analysis of single photon emission computed tomography (SPECT) radionuclide angiography (RNA) has been investigated for its potential to predict the outcome of cardiac resynchronization therapy (CRT). However, phase analysis may be limited in its ability to predict CRT outcome, as valuable information may be lost by assuming that time-activity curves (TAC) follow a simple sinusoidal shape. A new method, cluster analysis, is proposed which directly evaluates the TACs and may lead to a better understanding of dyssynchrony patterns and CRT outcome. Cluster analysis algorithms were developed and optimized to maximize their ability to predict CRT response. Methods: About 49 patients (N = 27 ischemic etiology) received a SPECT RNA scan as well as positron emission tomography (PET) perfusion and viability scans prior to undergoing CRT. A semiautomated algorithm sampled the left ventricle wall to produce 568 TACs from SPECT RNA data. The TACs were then subjected to two different cluster analysis techniques, K-means and normal average, where several input metrics were also varied to determine the optimal settings for the prediction of CRT outcome. Each TAC was assigned to a cluster group based on the comparison criteria, and global and segmental cluster size and scores were used as measures of dyssynchrony and used to predict response to CRT. A repeated random twofold cross-validation technique was used to train and validate the cluster algorithm. Receiver operating characteristic (ROC) analysis was used to calculate the area under the curve (AUC) and compare results to those obtained for SPECT RNA phase analysis and PET scar size analysis methods. Results: Using the normal average cluster analysis approach, the septal wall produced statistically significant results for predicting CRT results in the ischemic population (ROC AUC = 0.73; p < 0.05 vs. equal chance ROC AUC = 0.50) with an optimal operating point of 71% sensitivity and 60% specificity. Cluster analysis results were similar to SPECT RNA phase analysis (ROC AUC = 0.78, p = 0.73 vs cluster AUC; sensitivity/specificity = 59%/89%) and PET scar size analysis (ROC AUC = 0.73, p = 1.0 vs cluster AUC; sensitivity/specificity = 76%/67%). Conclusions: A SPECT RNA cluster analysis algorithm was developed for the prediction of CRT outcome. Cluster analysis produced results equivalent to those obtained from Fourier and scar analysis.
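
    The ROC quantities quoted in this record (AUC, and sensitivity/specificity at an operating point) can be illustrated with a minimal sketch. The scores, the threshold and the function names below are hypothetical and chosen only for illustration; this is not the study's algorithm or data. The AUC is computed as the probability that a randomly chosen responder scores higher than a randomly chosen non-responder, with ties counted as one half.

```python
def roc_auc(scores_pos, scores_neg):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


def sensitivity_specificity(scores_pos, scores_neg, threshold):
    """Classify score >= threshold as 'responder' and report (sensitivity, specificity)."""
    true_pos = sum(1 for s in scores_pos if s >= threshold)
    true_neg = sum(1 for s in scores_neg if s < threshold)
    return true_pos / len(scores_pos), true_neg / len(scores_neg)


# Hypothetical dyssynchrony scores (assumption: higher score = more likely responder).
responders = [0.9, 0.8, 0.6, 0.4]
non_responders = [0.7, 0.5, 0.3, 0.2]

auc = roc_auc(responders, non_responders)
sens, spec = sensitivity_specificity(responders, non_responders, threshold=0.6)
print(auc, sens, spec)
```

    An AUC of 0.50 corresponds to chance-level ranking, which is the reference value the record compares against.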

  20. Bayesian network meta-analysis for cluster randomized trials with binary outcomes.

    PubMed

    Uhlmann, Lorenz; Jensen, Katrin; Kieser, Meinhard

    2017-06-01

    Network meta-analysis is becoming a common approach to combine direct and indirect comparisons of several treatment arms. In recent research, there have been various developments and extensions of the standard methodology. Simultaneously, cluster randomized trials are experiencing an increased popularity, especially in the field of health services research, where, for example, medical practices are the units of randomization but the outcome is measured at the patient level. Combination of the results of cluster randomized trials is challenging. In this tutorial, we examine and compare different approaches for the incorporation of cluster randomized trials in a (network) meta-analysis. Furthermore, we provide practical insight on the implementation of the models. In simulation studies, it is shown that some of the examined approaches lead to unsatisfying results. However, there are alternatives which are suitable to combine cluster randomized trials in a network meta-analysis as they are unbiased and reach accurate coverage rates. In conclusion, the methodology can be extended in such a way that an adequate inclusion of the results obtained in cluster randomized trials becomes feasible. Copyright © 2016 John Wiley & Sons, Ltd.

  1. Classification of Forefoot Plantar Pressure Distribution in Persons with Diabetes: A Novel Perspective for the Mechanical Management of Diabetic Foot?

    PubMed Central

    Deschamps, Kevin; Matricali, Giovanni Arnoldo; Roosen, Philip; Desloovere, Kaat; Bruyninckx, Herman; Spaepen, Pieter; Nobels, Frank; Tits, Jos; Flour, Mieke; Staes, Filip

    2013-01-01

    Background The aim of this study was to identify groups of subjects with similar patterns of forefoot loading and verify if specific groups of patients with diabetes could be isolated from non-diabetics. Methodology/Principal Findings Ninety-seven patients with diabetes and 33 control participants between 45 and 70 years were prospectively recruited in two Belgian Diabetic Foot Clinics. Barefoot plantar pressure measurements were recorded and subsequently analysed using a semi-automatic total mapping technique. K-means cluster analysis was applied to the relative regional impulses of six forefoot segments in order to pursue a classification for the control group separately, the diabetic group separately and both groups together. Cluster analysis led to identification of three distinct groups when considering only the control group. For the diabetic group, and the computation considering both groups together, four distinct groups were isolated. Compared to the cluster analysis of the control group an additional forefoot loading pattern was identified. This group comprised diabetic feet only. The relevance of the reported clusters was supported by ANOVA statistics indicating significant differences between different regions of interest and different clusters. Conclusions/Significance A new era seems to be emerging in diabetic foot medicine, one that embraces the classification of diabetic patients according to their biomechanical profile. Classification of the plantar pressure distribution has the potential to provide a means to determine mechanical interventions for the prevention and/or treatment of the diabetic foot. PMID:24278219
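
    The K-means step described in this record can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the four "feet" are invented vectors of relative impulses over six forefoot segments (each summing to 1), the initial centroids are picked by hand, and the function names are hypothetical.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_vector(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return tuple(sum(column) / n for column in zip(*points))


def kmeans(points, initial_centroids, max_iter=100):
    """Plain Lloyd iteration: assign each point to its nearest centroid, then update centroids."""
    centroids = list(initial_centroids)
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(len(centroids)), key=lambda k: squared_distance(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = mean_vector(members)
    return labels, centroids


# Hypothetical relative regional impulses for six forefoot segments (medial to lateral).
feet = [
    (0.30, 0.25, 0.20, 0.10, 0.10, 0.05),  # medially loaded
    (0.28, 0.27, 0.20, 0.10, 0.10, 0.05),  # medially loaded
    (0.05, 0.10, 0.10, 0.20, 0.25, 0.30),  # laterally loaded
    (0.05, 0.10, 0.12, 0.18, 0.27, 0.28),  # laterally loaded
]

labels, centroids = kmeans(feet, initial_centroids=[feet[0], feet[2]])
print(labels)
```

    In practice the number of clusters and the initialisation strongly affect the result, so a real analysis would typically repeat the procedure from several starting points.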

  2. Sputum neutrophils are associated with more severe asthma phenotypes using cluster analysis

    PubMed Central

    Moore, Wendy C.; Hastie, Annette T.; Li, Xingnan; Li, Huashi; Busse, William W.; Jarjour, Nizar N.; Wenzel, Sally E.; Peters, Stephen P.; Meyers, Deborah A.; Bleecker, Eugene R.

    2013-01-01

    Background Clinical cluster analysis from the Severe Asthma Research Program (SARP) identified five asthma subphenotypes that represent the severity spectrum of early onset allergic asthma, late onset severe asthma and severe asthma with COPD characteristics. Analysis of induced sputum from a subset of SARP subjects showed four sputum inflammatory cellular patterns. Subjects with concurrent increases in eosinophils (≥2%) and neutrophils (≥40%) had characteristics of very severe asthma. Objective To better understand interactions between inflammation and clinical subphenotypes we integrated inflammatory cellular measures and clinical variables in a new cluster analysis. Methods Participants in SARP at three clinical sites who underwent sputum induction were included in this analysis (n=423). Fifteen variables including clinical characteristics and blood and sputum inflammatory cell assessments were selected by factor analysis for unsupervised cluster analysis. Results Four phenotypic clusters were identified. Cluster A (n=132) and B (n=127) subjects had mild-moderate early onset allergic asthma with paucigranulocytic or eosinophilic sputum inflammatory cell patterns. In contrast, these inflammatory patterns were present in only 7% of Cluster C (n=117) and D (n=47) subjects who had moderate-severe asthma with frequent health care utilization despite treatment with high doses of inhaled or oral corticosteroids, and in Cluster D, reduced lung function. The majority of these subjects (>83%) had sputum neutrophilia either alone or with concurrent sputum eosinophilia. Baseline lung function and sputum neutrophils were the most important variables determining cluster assignment. Conclusion This multivariate approach identified four asthma subphenotypes representing the severity spectrum from mild-moderate allergic asthma with minimal or eosinophilic predominant sputum inflammation to moderate-severe asthma with neutrophilic predominant or mixed granulocytic inflammation. PMID:24332216

  3. Analysis of candidates for interacting galaxy clusters. I. A1204 and A2029/A2033

    NASA Astrophysics Data System (ADS)

    Gonzalez, Elizabeth Johana; de los Rios, Martín; Oio, Gabriel A.; Lang, Daniel Hernández; Tagliaferro, Tania Aguirre; Domínguez R., Mariano J.; Castellón, José Luis Nilo; Cuevas L., Héctor; Valotto, Carlos A.

    2018-04-01

    Context. Merging galaxy clusters allow for the study of different mass components, dark and baryonic, separately. Also, their occurrence makes it possible to test the ΛCDM scenario, which can be used to put constraints on the self-interacting cross-section of the dark-matter particle. Aim. It is necessary to perform a homogeneous analysis of these systems. Hence, based on a recently presented sample of candidates for interacting galaxy clusters, we present the analysis of two of these cataloged systems. Methods: In this work, the first of a series devoted to characterizing galaxy clusters in merger processes, we perform a weak lensing analysis of clusters A1204 and A2029/A2033 to derive the total masses of each identified interacting structure together with a dynamical study based on a two-body model. We also describe the gas and the mass distributions in the field through a lensing and an X-ray analysis. This is the first of a series of works which will analyze this type of system in order to characterize it. Results: Neither merging cluster candidate shows evidence of having had a recent merger event. Nevertheless, there is dynamical evidence that these systems could be interacting or could interact in the future. Conclusions: It is necessary to include more constraints in order to improve the methodology of classifying merging galaxy clusters. Characterization of these clusters is important in order to properly understand the nature of these systems and their connection with dynamical studies.

  4. MMPI-2: Cluster Analysis of Personality Profiles in Perinatal Depression—Preliminary Evidence

    PubMed Central

    Grillo, Alessandra; Lauriola, Marco; Giacchetti, Nicoletta

    2014-01-01

    Background. To assess personality characteristics of women who develop perinatal depression. Methods. The study started with a screening of a sample of 453 women in their third trimester of pregnancy, who were administered a survey data form, the Edinburgh Postnatal Depression Scale (EPDS) and the Minnesota Multiphasic Personality Inventory 2 (MMPI-2). A clinical group of subjects with perinatal depression (PND, 55 subjects) was selected; clinical and validity scales of MMPI-2 were used as predictors in a hierarchical cluster analysis. Results. The analysis identified three clusters of personality profiles: two “clinical” clusters (1 and 3) and an “apparently common” one (cluster 2). The first cluster (39.5%) collects structures of personality with prevalent obsessive or dependent functioning tending to develop a “psychasthenic” depression; the third cluster (13.95%) includes women with prevalent borderline functioning tending to develop “dysphoric” depression; the second cluster (46.5%) shows a normal profile with a “defensive” attitude, probably due to the presence of defense mechanisms or to the fear of stigma. Conclusion. Characteristics of personality have a key role in clinical manifestations of perinatal depression; it is important to detect them to identify mothers at risk and to plan targeted therapeutic interventions. PMID:25574499

  5. Unequal cluster sizes in stepped-wedge cluster randomised trials: a systematic review

    PubMed Central

    Morris, Tom; Gray, Laura

    2017-01-01

    Objectives To investigate the extent to which cluster sizes vary in stepped-wedge cluster randomised trials (SW-CRT) and whether any variability is accounted for during the sample size calculation and analysis of these trials. Setting Any, not limited to healthcare settings. Participants Any taking part in an SW-CRT published up to March 2016. Primary and secondary outcome measures The primary outcome is the variability in cluster sizes, measured by the coefficient of variation (CV) in cluster size. Secondary outcomes include the difference between the cluster sizes assumed during the sample size calculation and those observed during the trial, any reported variability in cluster sizes and whether the methods of sample size calculation and methods of analysis accounted for any variability in cluster sizes. Results Of the 101 included SW-CRTs, 48% mentioned that the included clusters were known to vary in size, yet only 13% of these accounted for this during the calculation of the sample size. However, 69% of the trials did use a method of analysis appropriate for when clusters vary in size. Full trial reports were available for 53 trials. The CV was calculated for 23 of these: the median CV was 0.41 (IQR: 0.22–0.52). Actual cluster sizes could be compared with those assumed during the sample size calculation for 14 (26%) of the trial reports; the cluster sizes were between 29% and 480% of that which had been assumed. Conclusions Cluster sizes often vary in SW-CRTs. Reporting of SW-CRTs also remains suboptimal. The effect of unequal cluster sizes on the statistical power of SW-CRTs needs further exploration and methods appropriate to studies with unequal cluster sizes need to be employed. PMID:29146637
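
    The primary outcome in this record, the coefficient of variation (CV) of cluster size, is the standard deviation of the cluster sizes divided by their mean. A minimal sketch follows; the cluster sizes are invented, the function names are hypothetical, and the use of the sample (n − 1) standard deviation is an assumption, since the record does not say which estimator the review used.

```python
import statistics


def coefficient_of_variation(cluster_sizes):
    """CV = sample standard deviation / mean of the cluster sizes."""
    return statistics.stdev(cluster_sizes) / statistics.mean(cluster_sizes)


def percent_of_assumed(observed_size, assumed_size):
    """Observed cluster size expressed as a percentage of the size assumed at design time."""
    return 100.0 * observed_size / assumed_size


# Hypothetical cluster sizes from a stepped-wedge trial.
sizes = [20, 30, 40, 50, 60]
cv = coefficient_of_variation(sizes)
print(round(cv, 3))

# Hypothetical comparison with a design assumption of 40 participants per cluster.
print([percent_of_assumed(s, 40) for s in sizes])
```

    A CV of 0 means all clusters are the same size; larger values indicate greater imbalance, which generally reduces statistical power relative to an equal-size design.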

  6. Topic modeling for cluster analysis of large biological and medical datasets

    PubMed Central

    2014-01-01

    Background The big data moniker is nowhere better deserved than to describe the ever-increasing prodigiousness and complexity of biological and medical datasets. New methods are needed to generate and test hypotheses, foster biological interpretation, and build validated predictors. Although multivariate techniques such as cluster analysis may allow researchers to identify groups, or clusters, of related variables, the accuracies and effectiveness of traditional clustering methods diminish for large and hyper dimensional datasets. Topic modeling is an active research field in machine learning and has been mainly used as an analytical tool to structure large textual corpora for data mining. Its ability to reduce high dimensionality to a small number of latent variables makes it suitable as a means for clustering or overcoming clustering difficulties in large biological and medical datasets. Results In this study, three topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, are proposed and tested on the cluster analysis of three large datasets: Salmonella pulsed-field gel electrophoresis (PFGE) dataset, lung cancer dataset, and breast cancer dataset, which represent various types of large biological or medical datasets. All three methods are shown to improve the efficacy/effectiveness of clustering results on the three datasets in comparison to traditional methods. A preferable cluster analysis method emerged for each of the three datasets on the basis of replicating known biological truths. Conclusion Topic modeling could be advantageously applied to the large datasets of biological or medical research. The three proposed topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, yield clustering improvements for the three different data types. Clusters more efficaciously represent truthful groupings and subgroupings in the data than traditional methods, suggesting that topic model-based methods could provide an analytic advancement in the analysis of large biological or medical datasets. PMID:25350106

  7. Sleep, Dietary, and Exercise Behavioral Clusters among Truck Drivers with Obesity: Implications for Interventions

    PubMed Central

    Olson, Ryan; Thompson, Sharon V.; Wipfli, Brad; Hanson, Ginger; Elliot, Diane L.; Anger, W. Kent; Bodner, Todd; Hammer, Leslie B.; Hohn, Elliot; Perrin, Nancy A.

    2015-01-01

    Objective Our objectives were to describe a sample of truck drivers, identify clusters of drivers with similar patterns in behaviors affecting energy balance (sleep, diet, and exercise), and test for cluster differences in health and psychosocial factors. Methods Participants’ (n=452, BMI M=37.2, 86.4% male) self-reported behaviors were dichotomized prior to hierarchical cluster analysis, which identified groups with similar behavior co-variation. Cluster differences were tested with generalized estimating equations. Results Five behavioral clusters were identified that differed significantly in age, smoking status, diabetes prevalence, lost work days, stress, and social support, but not in BMI. Cluster 2, characterized by the best sleep quality, had significantly lower lost workdays and stress than other clusters. Conclusions Weight management interventions for drivers should explicitly address sleep, and may be maximally effective after establishing socially supportive work environments that reduce stress exposures. PMID:26949883
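
    The approach in this record (dichotomize behaviors, then apply hierarchical cluster analysis) can be sketched as below. This is an illustration under assumptions rather than the study's procedure: the record does not state the distance or linkage used, so Hamming distance and average linkage are assumed here, and the five "drivers" and all function names are hypothetical.

```python
from itertools import combinations


def hamming(a, b):
    """Number of positions at which two binary behavior profiles differ."""
    return sum(x != y for x, y in zip(a, b))


def average_linkage(c1, c2, data):
    """Mean pairwise distance between the members of two clusters (lists of row indices)."""
    total = sum(hamming(data[i], data[j]) for i in c1 for j in c2)
    return total / (len(c1) * len(c2))


def agglomerate(data, n_clusters):
    """Bottom-up clustering: repeatedly merge the two closest clusters until n_clusters remain."""
    clusters = [[i] for i in range(len(data))]
    while len(clusters) > n_clusters:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: average_linkage(clusters[pair[0]], clusters[pair[1]], data),
        )
        clusters[a] = clusters[a] + clusters[b]  # a < b, so deleting b keeps index a valid
        del clusters[b]
    return [sorted(c) for c in clusters]


# Hypothetical dichotomized behaviors: (good sleep, healthy diet, regular exercise), 1 = yes.
drivers = [
    (1, 1, 1),
    (1, 1, 0),
    (0, 0, 0),
    (0, 0, 1),
    (1, 1, 1),
]

groups = agglomerate(drivers, n_clusters=2)
print(groups)
```

    Cutting the merge sequence at a different number of clusters gives coarser or finer groupings, which is why hierarchical methods are often used for exploratory work.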

  8. Manipulating measurement scales in medical statistical analysis and data mining: A review of methodologies

    PubMed Central

    Marateb, Hamid Reza; Mansourian, Marjan; Adibi, Peyman; Farina, Dario

    2014-01-01

    Background: selecting the correct statistical test and data mining method depends highly on the measurement scale of data, type of variables, and purpose of the analysis. Different measurement scales are studied in detail, and statistical comparison, modeling, and data mining methods are studied using several medical examples. We have presented two ordinal-variable clustering examples, as ordinal variables are more challenging to analyze, using Wisconsin Breast Cancer Data (WBCD). Ordinal-to-Interval scale conversion example: a breast cancer database of nine 10-level ordinal variables for 683 patients was analyzed by two ordinal-scale clustering methods. The performance of the clustering methods was assessed by comparison with the gold standard groups of malignant and benign cases that had been identified by clinical tests. Results: the sensitivity and accuracy of the two clustering methods were 98% and 96%, respectively. Their specificity was comparable. Conclusion: by using an appropriate clustering algorithm based on the measurement scale of the variables in the study, high performance can be achieved. Moreover, descriptive and inferential statistics in addition to modeling approach must be selected based on the scale of the variables. PMID:24672565

  9. Noninvasive Analysis of the Sputum Transcriptome Discriminates Clinical Phenotypes of Asthma

    PubMed Central

    Yan, Xiting; Chu, Jen-Hwa; Gomez, Jose; Koenigs, Maria; Holm, Carole; He, Xiaoxuan; Perez, Mario F.; Zhao, Hongyu; Mane, Shrikant; Martinez, Fernando D.; Ober, Carole; Nicolae, Dan L.; Barnes, Kathleen C.; London, Stephanie J.; Gilliland, Frank; Weiss, Scott T.; Raby, Benjamin A.; Cohn, Lauren

    2015-01-01

    Rationale: The airway transcriptome includes genes that contribute to the pathophysiologic heterogeneity seen in individuals with asthma. Objectives: We analyzed sputum gene expression for transcriptomic endotypes of asthma (TEA), gene signatures that discriminate phenotypes of disease. Methods: Gene expression in the sputum and blood of patients with asthma was measured using Affymetrix microarrays. Unsupervised clustering analysis based on pathways from the Kyoto Encyclopedia of Genes and Genomes was used to identify TEA clusters. Logistic regression analysis of matched blood samples defined an expression profile in the circulation to determine the TEA cluster assignment in a cohort of children with asthma to replicate clinical phenotypes. Measurements and Main Results: Three TEA clusters were identified. TEA cluster 1 had the most subjects with a history of intubation (P = 0.05), a lower prebronchodilator FEV1 (P = 0.006), a higher bronchodilator response (P = 0.03), and higher exhaled nitric oxide levels (P = 0.04) compared with the other TEA clusters. TEA cluster 2, the smallest cluster, had the most subjects that were hospitalized for asthma (P = 0.04). TEA cluster 3, the largest cluster, had normal lung function, low exhaled nitric oxide levels, and lower inhaled steroid requirements. Evaluation of TEA clusters in children confirmed that TEA clusters 1 and 2 are associated with a history of intubation (P = 5.58 × 10⁻⁶) and hospitalization (P = 0.01), respectively. Conclusions: There are common patterns of gene expression in the sputum and blood of children and adults that are associated with near-fatal, severe, and milder asthma. PMID:25763605

  10. Clusters of Insomnia Disorder: An Exploratory Cluster Analysis of Objective Sleep Parameters Reveals Differences in Neurocognitive Functioning, Quantitative EEG, and Heart Rate Variability

    PubMed Central

    Miller, Christopher B.; Bartlett, Delwyn J.; Mullins, Anna E.; Dodds, Kirsty L.; Gordon, Christopher J.; Kyle, Simon D.; Kim, Jong Won; D'Rozario, Angela L.; Lee, Rico S.C.; Comas, Maria; Marshall, Nathaniel S.; Yee, Brendon J.; Espie, Colin A.; Grunstein, Ronald R.

    2016-01-01

    Study Objectives: To empirically derive and evaluate potential clusters of Insomnia Disorder through cluster analysis from polysomnography (PSG). We hypothesized that clusters would differ on neurocognitive performance, sleep-onset measures of quantitative (q)-EEG and heart rate variability (HRV). Methods: Research volunteers with Insomnia Disorder (DSM-5) completed a neurocognitive assessment and overnight PSG measures of total sleep time (TST), wake time after sleep onset (WASO), and sleep onset latency (SOL) were used to determine clusters. Results: From 96 volunteers with Insomnia Disorder, cluster analysis derived at least two clusters from objective sleep parameters: Insomnia with normal objective sleep duration (I-NSD: n = 53) and Insomnia with short sleep duration (I-SSD: n = 43). At sleep onset, differences in HRV between I-NSD and I-SSD clusters suggest attenuated parasympathetic activity in I-SSD (P < 0.05). Preliminary work suggested three clusters by retaining the I-NSD and splitting the I-SSD cluster into two: I-SSD A (n = 29): defined by high WASO and I-SSD B (n = 14): a second I-SSD cluster with high SOL and medium WASO. The I-SSD B cluster performed worse than I-SSD A and I-NSD for sustained attention (P ≤ 0.05). In an exploratory analysis, q-EEG revealed reduced spectral power also in I-SSD B before (Delta, Alpha, Beta-1) and after sleep-onset (Beta-2) compared to I-SSD A and I-NSD (P ≤ 0.05). Conclusions: Two insomnia clusters derived from cluster analysis differ in sleep onset HRV. Preliminary data suggest evidence for three clusters in insomnia with differences for sustained attention and sleep-onset q-EEG. Clinical Trial Registration: Insomnia 100 sleep study: Australia New Zealand Clinical Trials Registry (ANZCTR) identification number 12612000049875. URL: https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=347742. 
Citation: Miller CB, Bartlett DJ, Mullins AE, Dodds KL, Gordon CJ, Kyle SD, Kim JW, D'Rozario AL, Lee RS, Comas M, Marshall NS, Yee BJ, Espie CA, Grunstein RR. Clusters of Insomnia Disorder: an exploratory cluster analysis of objective sleep parameters reveals differences in neurocognitive functioning, quantitative EEG, and heart rate variability. SLEEP 2016;39(11):1993–2004. PMID:27568796

  11. Structural parameters of young star clusters: fractal analysis

    NASA Astrophysics Data System (ADS)

    Hetem, A.

    2017-07-01

    A unified view of star formation in the Universe demands detailed and in-depth studies of young star clusters. This work is related to our previous study of fractal statistics estimated for a sample of young stellar clusters (Gregorio-Hetem et al. 2015, MNRAS 448, 2504). The structural properties can lead to significant conclusions about the early stages of cluster formation: 1) virial conditions can be used to distinguish warm collapsed; 2) bound or unbound behaviour can lead to conclusions about expansion; and 3) fractal statistics are correlated to the dynamical evolution and age. The technique of error bar estimation most used in the literature is to adopt inferential methods (like bootstrap) to estimate deviation and variance, which are valid only for an artificially generated cluster. In this paper, we expanded the number of studied clusters, in order to enhance the investigation of the cluster properties and dynamic evolution. The structural parameters were compared with fractal statistics and reveal that the clusters' radial density profiles show a tendency for the mean separation of the stars to increase with the average surface density. The sample can be divided into two groups showing different dynamic behaviour, but they have the same dynamic evolution, since all objects in the sample were found to be expanding, and their substructures do not seem to have been completely erased. These results are in agreement with the simulations adopting low surface densities and supervirial conditions.

  12. The NGC 7742 star cluster luminosity function: a population analysis revisited

    NASA Astrophysics Data System (ADS)

    de Grijs, Richard; Ma, Chao

    2018-02-01

    We re-examine the properties of the star cluster population in the circumnuclear starburst ring in the face-on spiral galaxy NGC 7742, whose young cluster mass function has been reported to exhibit significant deviations from the canonical power law. We base our reassessment on the clusters’ luminosities (an observational quantity) rather than their masses (a derived quantity), and confirm conclusively that the galaxy’s starburst-ring clusters, and particularly the youngest subsample with $\log(t\,\mathrm{yr}^{-1}) \leq 7.2$, show evidence of a turnover in the cluster luminosity function well above the 90% completeness limit adopted to ensure the reliability of our results. This confirmation emphasizes the unique conundrum posed by this unusual cluster population.

  13. Predicting the points of interaction of small molecules in the NF-κB pathway

    PubMed Central

    2011-01-01

    Background The similarity property principle has been used extensively in drug discovery to identify small compounds that interact with specific drug targets. Here we show it can be applied to identify the interactions of small molecules within the NF-κB signalling pathway. Results Clusters that contain compounds with a predominant interaction within the pathway were created, which were then used to predict the interaction of compounds not included in the clustering analysis. Conclusions The technique successfully predicted the points of interactions of compounds that are known to interact with the NF-κB pathway. The method was also shown to be successful when compounds for which the interaction points were unknown were included in the clustering analysis. PMID:21342508

  14. Network module detection: Affinity search technique with the multi-node topological overlap measure

    PubMed Central

    Li, Ai; Horvath, Steve

    2009-01-01

    Background Many clustering procedures only allow the user to input a pairwise dissimilarity or distance measure between objects. We propose a clustering method that can input a multi-point dissimilarity measure d(i1, i2, ..., iP) where the number of points P can be larger than 2. The work is motivated by gene network analysis where clusters correspond to modules of highly interconnected nodes. Here, we define modules as clusters of network nodes with high multi-node topological overlap. The topological overlap measure is a robust measure of interconnectedness which is based on shared network neighbors. In previous work, we have shown that the multi-node topological overlap measure yields biologically meaningful results when used as input of network neighborhood analysis. Findings We adapt network neighborhood analysis for the use of module detection. We propose the Module Affinity Search Technique (MAST), which is a generalized version of the Cluster Affinity Search Technique (CAST). MAST can accommodate a multi-node dissimilarity measure. Clusters grow around user-defined or automatically chosen seeds (e.g. hub nodes). We propose both local and global cluster growth stopping rules. We use several simulations and a gene co-expression network application to argue that the MAST approach leads to biologically meaningful results. We compare MAST with hierarchical clustering and partitioning around medoid clustering. Conclusion Our flexible module detection method is implemented in the MTOM software which can be downloaded from the following webpage: PMID:19619323
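
    The topological overlap idea mentioned in this record (interconnectedness based on shared network neighbors) can be sketched for the ordinary pairwise case. The code below uses the commonly cited pairwise definition for an unweighted network, TOM(i, j) = (l_ij + a_ij) / (min(k_i, k_j) + 1 − a_ij), where a_ij is the adjacency, l_ij the number of shared neighbors and k_i the degree. It does not reproduce the multi-node measure or the MAST procedure of the record; the example network and function name are hypothetical.

```python
def topological_overlap(adj):
    """Pairwise topological overlap matrix for a symmetric 0/1 adjacency matrix with zero diagonal."""
    n = len(adj)
    degree = [sum(adj[i][u] for u in range(n) if u != i) for i in range(n)]
    tom = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            # Neighbors shared by i and j, excluding the two nodes themselves.
            shared = sum(adj[i][u] * adj[u][j] for u in range(n) if u != i and u != j)
            value = (shared + adj[i][j]) / (min(degree[i], degree[j]) + 1 - adj[i][j])
            tom[i][j] = tom[j][i] = value
    return tom


# Hypothetical network: nodes 0, 1, 2 form a triangle; node 3 hangs off node 2.
adjacency = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 0],
]

tom = topological_overlap(adjacency)
print(tom[0][1], tom[0][3])
```

    A dissimilarity suitable for clustering is then 1 − TOM, so that nodes sharing most of their neighbors end up close together.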

  15. A formal concept analysis approach to consensus clustering of multi-experiment expression data

    PubMed Central

    2014-01-01

    Background Presently, with the increasing number and complexity of available gene expression datasets, the combination of data from multiple microarray studies addressing a similar biological question is gaining importance. The analysis and integration of multiple datasets are expected to yield more reliable and robust results since they are based on a larger number of samples and the effects of the individual study-specific biases are diminished. This is supported by recent studies suggesting that important biological signals are often preserved or enhanced by multiple experiments. An approach to combining data from different experiments is the aggregation of their clusterings into a consensus or representative clustering solution which increases the confidence in the common features of all the datasets and reveals the important differences among them. Results We propose a novel generic consensus clustering technique that applies a Formal Concept Analysis (FCA) approach for the consolidation and analysis of clustering solutions derived from several microarray datasets. These datasets are initially divided into groups of related experiments with respect to a predefined criterion. Subsequently, a consensus clustering algorithm is applied to each group resulting in a clustering solution per group. These solutions are pooled together and further analysed by employing FCA which allows extracting valuable insights from the data and generating a gene partition over all the experiments. In order to validate the FCA-enhanced approach two consensus clustering algorithms are adapted to incorporate the FCA analysis. Their performance is evaluated on gene expression data from a multi-experiment study examining the global cell-cycle control of fission yeast. The FCA results derived from both methods demonstrate that, although both algorithms optimize different clustering characteristics, FCA is able to overcome and diminish these differences and preserve some relevant biological signals. Conclusions The proposed FCA-enhanced consensus clustering technique is a general approach to the combination of clustering algorithms with FCA for deriving clustering solutions from multiple gene expression matrices. The experimental results presented herein demonstrate that it is a robust data integration technique able to produce a good-quality clustering solution that is representative for the whole set of expression matrices. PMID:24885407

  16. Predicting healthcare outcomes in prematurely born infants using cluster analysis.

    PubMed

    MacBean, Victoria; Lunt, Alan; Drysdale, Simon B; Yarzi, Muska N; Rafferty, Gerrard F; Greenough, Anne

    2018-05-23

    Prematurely born infants are at high risk of respiratory morbidity following neonatal unit discharge, though prediction of outcomes is challenging. We have tested the hypothesis that cluster analysis would identify discrete groups of prematurely born infants with differing respiratory outcomes during infancy. A total of 168 infants (median (IQR) gestational age 33 (31-34) weeks) were recruited in the neonatal period from consecutive births in a tertiary neonatal unit. The baseline characteristics of the infants were used to classify them into hierarchical agglomerative clusters. Rates of viral lower respiratory tract infections (LRTIs) were recorded for 151 infants in the first year after birth. Infants could be classified according to birth weight (BW) and duration of neonatal invasive mechanical ventilation (MV) into three clusters. Cluster one (MV ≤5 days) had few LRTIs. Clusters two and three (both MV ≥6 days, but BW ≥ or < 882 g respectively) had significantly higher LRTI rates. Cluster two had a higher proportion of infants experiencing respiratory syncytial virus LRTIs (P = 0.01) and cluster three a higher proportion of rhinovirus LRTIs (P < 0.001). CONCLUSIONS: Readily available clinical data allowed classification of prematurely born infants into one of three distinct groups with differing subsequent respiratory morbidity in infancy. © 2018 Wiley Periodicals, Inc.

  17. Symptom clusters and quality of life among patients with advanced heart failure

    PubMed Central

    Yu, Doris SF; Chan, Helen YL; Leung, Doris YP; Hui, Elsie; Sit, Janet WH

    2016-01-01

    Objectives To identify symptom clusters among patients with advanced heart failure (HF) and the independent relationships with their quality of life (QoL). Methods This is a secondary data analysis of a cross-sectional study which interviewed 119 patients with advanced HF in the geriatric unit of a regional hospital in Hong Kong. The symptom profile and QoL were assessed by using the Edmonton Symptom Assessment Scale (ESAS) and the McGill QoL Questionnaire. Exploratory factor analysis was used to identify the symptom clusters. Hierarchical regression analysis was used to examine the independent relationships with their QoL, after adjusting for the effects of age, gender, and comorbidities. Results The patients were at an advanced age (82.9 ± 6.5 years). Three distinct symptom clusters were identified: they were the distress cluster (including shortness of breath, anxiety, and depression), the decondition cluster (fatigue, drowsiness, nausea, and reduced appetite), and the discomfort cluster (pain, and sense of generalized discomfort). These three symptom clusters accounted for 63.25% of variance of the patients' symptom experience. The small to moderate correlations between these symptom clusters indicated that they were rather independent of one another. After adjusting for age, gender and comorbidities, the distress (β = −0.635, P < 0.001), the decondition (β = −0.148, P = 0.01), and the discomfort (β = −0.258, P < 0.001) symptom clusters independently predicted their QoL. Conclusions This study identified the distinctive symptom clusters among patients with advanced HF. The results shed light on the need to develop palliative care interventions for optimizing the symptom control for this life-limiting disease. PMID:27403150

  18. Cluster Analysis of Clinical Data Identifies Fibromyalgia Subgroups

    PubMed Central

    Docampo, Elisa; Collado, Antonio; Escaramís, Geòrgia; Carbonell, Jordi; Rivera, Javier; Vidal, Javier; Alegre, José

    2013-01-01

    Introduction Fibromyalgia (FM) is mainly characterized by widespread pain and multiple accompanying symptoms, which hinder FM assessment and management. In order to reduce FM heterogeneity we classified clinical data into simplified dimensions that were used to define FM subgroups. Material and Methods 48 variables were evaluated in 1,446 Spanish FM cases fulfilling 1990 ACR FM criteria. A partitioning analysis was performed to find groups of variables similar to each other. Similarities between variables were identified and the variables were grouped into dimensions. This was performed in a subset of 559 patients, and cross-validated in the remaining 887 patients. For each sample and dimension, a composite index was obtained based on the weights of the variables included in the dimension. Finally, a clustering procedure was applied to the indexes, resulting in FM subgroups. Results Variables clustered into three independent dimensions: “symptomatology”, “comorbidities” and “clinical scales”. Only the two first dimensions were considered for the construction of FM subgroups. Resulting scores classified FM samples into three subgroups: low symptomatology and comorbidities (Cluster 1), high symptomatology and comorbidities (Cluster 2), and high symptomatology but low comorbidities (Cluster 3), showing differences in measures of disease severity. Conclusions We have identified three subgroups of FM samples in a large cohort of FM by clustering clinical data. Our analysis stresses the importance of family and personal history of FM comorbidities. Also, the resulting patient clusters could indicate different forms of the disease, relevant to future research, and might have an impact on clinical assessment. PMID:24098674

  19. A Cluster Analysis of Bronchial Asthma Patients with Depressive Symptoms.

    PubMed

    Seino, Yo; Hasegawa, Takashi; Koya, Toshiyuki; Sakagami, Takuro; Mashima, Ichiro; Shimizu, Natsue; Muramatsu, Yoshiyuki; Muramatsu, Kumiko; Suzuki, Eiichi; Kikuchi, Toshiaki

    2018-03-09

    Objective Whether or not depression affects the control or severity of asthma is unclear. We performed a cluster analysis of asthma patients with depressive symptoms to clarify their characteristics. Methods and subjects Multiple medical institutions in Niigata Prefecture, Japan, were surveyed in 2014. We recorded the age, disease duration, body mass index (BMI), medications, and surveyed asthma control status and severity, as well as depressive symptoms and adherence to treatment using questionnaires. A hierarchical cluster analysis was performed on the group of patients assessed as having depression. Results Of 2,273 patients, 128 were assessed as being positive for depressive symptoms (DS[+]). Thirty-three were excluded because of missing data, and the remaining 95 DS[+] patients were classified into 3 clusters (A, B, and C). The patients in cluster A (n=19) were elderly, had severe, poorly controlled asthma, and demonstrated possible adherence barriers; those in cluster B (n=26) were elderly with a low BMI and had no significant adherence barriers but had severe, poorly controlled asthma; and those in cluster C (n=50) were younger, with a high BMI, no significant adherence barriers, well-controlled asthma, and few were severely affected. The scores for depressive symptoms were not significantly different between clusters. Conclusion About half of the patients in the DS[+] group had severe, poorly controlled asthma, and these clusters were able to be distinguished by their ASK-12 score, which reflects adherence barriers. The control status and severity of asthma may also be related to the age, disease duration, and BMI in the DS[+] group.

  20. Integrated Copy Number and Expression Analysis Identifies Profiles of Whole-Arm Chromosomal Alterations and Subgroups with Favorable Outcome in Ovarian Clear Cell Carcinomas

    PubMed Central

    Uehara, Yuriko; Oda, Katsutoshi; Ikeda, Yuji; Koso, Takahiro; Tsuji, Shingo; Yamamoto, Shogo; Asada, Kayo; Sone, Kenbun; Kurikawa, Reiko; Makii, Chinami; Hagiwara, Otoe; Tanikawa, Michihiro; Maeda, Daichi; Hasegawa, Kosei; Nakagawa, Shunsuke; Wada-Hiraike, Osamu; Kawana, Kei; Fukayama, Masashi; Fujiwara, Keiichi; Yano, Tetsu; Osuga, Yutaka; Fujii, Tomoyuki; Aburatani, Hiroyuki

    2015-01-01

    Ovarian clear cell carcinoma (CCC) is generally associated with chemoresistance and poor clinical outcome, even with early diagnosis; whereas high-grade serous carcinomas (SCs) and endometrioid carcinomas (ECs) are commonly chemosensitive at advanced stages. Although an integrated genomic analysis of SC has been performed, conclusive views on copy number and expression profiles for CCC are still limited. In this study, we performed single nucleotide polymorphism analysis with 57 epithelial ovarian cancers (31 CCCs, 14 SCs, and 12 ECs) and microarray expression analysis with 55 cancers (25 CCCs, 16 SCs, and 14 ECs). We then evaluated PIK3CA mutations and ARID1A expression in CCCs. SNP array analysis classified 13% of CCCs into a cluster with high frequency and focal range of copy number alterations (CNAs), significantly lower than for SCs (93%, P < 0.01) and ECs (50%, P = 0.017). The ratio of whole-arm to all CNAs was higher in CCCs (46.9%) than SCs (21.7%; P < 0.0001). SCs with loss of heterozygosity (LOH) of BRCA1 (85%) also had LOH of NF1 and TP53, and LOH of BRCA2 (62%) coexisted with LOH of RB1 and TP53. Microarray analysis classified CCCs into three clusters. One cluster (CCC-2, n = 10) showed more favorable prognosis than the CCC-1 and CCC-3 clusters (P = 0.041). Coexistent alterations of PIK3CA and ARID1A were more common in CCC-1 and CCC-3 (7/11, 64%) than in CCC-2 (0/10, 0%; P < 0.01). Being in cluster CCC-2 was an independent favorable prognostic factor in CCC. In conclusion, CCC was characterized by a high ratio of whole-arm CNAs; whereas CNAs in SC were mainly focal, but preferentially caused LOH of well-known tumor suppressor genes. As such, expression profiles might be useful for sub-classification of CCC, and might provide useful information on prognosis. PMID:26043110

  1. Network visualization of conformational sampling during molecular dynamics simulation.

    PubMed

    Ahlstrom, Logan S; Baker, Joseph Lee; Ehrlich, Kent; Campbell, Zachary T; Patel, Sunita; Vorontsov, Ivan I; Tama, Florence; Miyashita, Osamu

    2013-11-01

    Effective data reduction methods are necessary for uncovering the inherent conformational relationships present in large molecular dynamics (MD) trajectories. Clustering algorithms provide a means to interpret the conformational sampling of molecules during simulation by grouping trajectory snapshots into a few subgroups, or clusters, but the relationships between the individual clusters may not be readily understood. Here we show that network analysis can be used to visualize the dominant conformational states explored during simulation as well as the connectivity between them, providing a more coherent description of conformational space than traditional clustering techniques alone. We compare the results of network visualization against 11 clustering algorithms and principal component conformer plots. Several MD simulations of proteins undergoing different conformational changes demonstrate the effectiveness of networks in reaching functional conclusions. Copyright © 2013 Elsevier Inc. All rights reserved.

  2. Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins

    PubMed Central

    2013-01-01

    Background Biomarker discovery datasets created using mass spectrum protein profiling of complex mixtures of proteins contain many peaks that represent the same protein with different charge states. Correlated variables such as these can confound the statistical analyses of proteomic data. Previously we developed an algorithm that clustered mass spectrum peaks that were biologically or technically correlated. Here we demonstrate an algorithm that clusters correlated technical aliases only. Results In this paper, we propose a preprocessing algorithm that can be used for grouping technical aliases in mass spectrometry protein profiling data. The stringency of the variance allowed for clustering is customizable, thereby affecting the number of peaks that are clustered. Subsequent analysis of the clusters, instead of individual peaks, helps reduce difficulties associated with technically-correlated data, and can aid more efficient biomarker identification. Conclusions This software can be used to pre-process and thereby decrease the complexity of protein profiling proteomics data, thus simplifying the subsequent analysis of biomarkers by decreasing the number of tests. The software is also a practical tool for identifying which features to investigate further by purification, identification and confirmation. PMID:24010718

  3. An Empirical Taxonomy of Hospital Governing Board Roles

    PubMed Central

    Lee, Shoou-Yih D; Alexander, Jeffrey A; Wang, Virginia; Margolin, Frances S; Combes, John R

    2008-01-01

    Objective To develop a taxonomy of governing board roles in U.S. hospitals. Data Sources 2005 AHA Hospital Governance Survey, 2004 AHA Annual Survey of Hospitals, and Area Resource File. Study Design A governing board taxonomy was developed using cluster analysis. Results were validated and reviewed by industry experts. Differences in hospital and environmental characteristics across clusters were examined. Data Extraction Methods One-thousand three-hundred thirty-four hospitals with complete information on the study variables were included in the analysis. Principal Findings Five distinct clusters of hospital governing boards were identified. Statistical tests showed that the five clusters had high internal reliability and high internal validity. Statistically significant differences in hospital and environmental conditions were found among clusters. Conclusions The developed taxonomy provides policy makers, health care executives, and researchers a useful way to describe and understand hospital governing board roles. The taxonomy may also facilitate valid and systematic assessment of governance performance. Further, the taxonomy could be used as a framework for governing boards themselves to identify areas for improvement and direction for change. PMID:18355260

  4. Symptom clusters in women with breast cancer: an analysis of data from social media and a research study

    PubMed Central

    Marshall, Sarah A.; Yang, Christopher C.; Ping, Qing; Zhao, Mengnan; Avis, Nancy E.

    2016-01-01

    Purpose User-generated content on social media sites, such as health-related online forums, offers researchers a tantalizing amount of information, but concerns regarding scientific application of such data remain. This paper compares and contrasts symptom cluster patterns derived from messages on a breast cancer forum with those from a symptom checklist completed by breast cancer survivors participating in a research study. Methods Over 50,000 messages generated by 12,991 users of the breast cancer forum on MedHelp.org were transformed into a standard form and examined for the co-occurrence of 25 symptoms. The k-medoid clustering method was used to determine appropriate placement of symptoms within clusters. Findings were compared with a similar analysis of a symptom checklist administered to 653 breast cancer survivors participating in a research study. Results The following clusters were identified using forum data: menopausal/psychological, pain/fatigue, gastrointestinal, and miscellaneous. Study data generated the clusters: menopausal, pain, fatigue/sleep/gastrointestinal, psychological, and increased weight/appetite. Although the clusters are somewhat different, many symptoms that clustered together in the social media analysis remained together in the analysis of the study participants. Density of connections between symptoms, as reflected by rates of co-occurrence and similarity, was higher in the study data. Conclusions The copious amount of data generated by social media outlets can augment findings from traditional data sources. When different sources of information are combined, areas of overlap and discrepancy can be detected, perhaps giving researchers a more accurate picture of reality. However, data derived from social media must be used carefully and with understanding of its limitations. PMID:26476836
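
The co-occurrence step described in this abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the function names `cooccurrence` and `jaccard`, the naive substring matching, and the toy messages are not taken from the study, and the k-medoid clustering that the authors applied afterwards is not reproduced.

```python
from collections import Counter
from itertools import combinations


def cooccurrence(messages, symptoms):
    """Count how many messages mention each symptom and each symptom pair.

    Matching is a naive lowercase substring test (an assumption; the study
    transformed messages into a standard form first).
    """
    single = Counter()
    pair = Counter()
    for text in messages:
        lowered = text.lower()
        # Sort so each pair is always stored under the same key order.
        present = sorted(s for s in symptoms if s in lowered)
        single.update(present)
        pair.update(combinations(present, 2))
    return single, pair


def jaccard(a, b, single, pair):
    """Jaccard similarity of two symptoms based on message co-occurrence."""
    both = pair[tuple(sorted((a, b)))]
    either = single[a] + single[b] - both
    return both / either if either else 0.0


# Hypothetical toy data, not from the forum or the research study.
messages = [
    "Pain and fatigue all week",
    "fatigue, nausea",
    "pain with fatigue again",
    "hot flashes",
]
symptoms = ["pain", "fatigue", "nausea"]
single, pair = cooccurrence(messages, symptoms)
print(jaccard("pain", "fatigue", single, pair))
```

A similarity matrix built this way could then be converted to distances and passed to a medoid-based clustering routine; that step is left out because the abstract does not give its settings.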

  5. The effects of co-morbidity in defining major depression subtypes associated with long-term course and severity.

    PubMed

    Wardenaar, K J; van Loo, H M; Cai, T; Fava, M; Gruber, M J; Li, J; de Jonge, P; Nierenberg, A A; Petukhova, M V; Rose, S; Sampson, N A; Schoevers, R A; Wilcox, M A; Alonso, J; Bromet, E J; Bunting, B; Florescu, S E; Fukao, A; Gureje, O; Hu, C; Huang, Y Q; Karam, A N; Levinson, D; Medina Mora, M E; Posada-Villa, J; Scott, K M; Taib, N I; Viana, M C; Xavier, M; Zarkov, Z; Kessler, R C

    2014-11-01

    Although variation in the long-term course of major depressive disorder (MDD) is not strongly predicted by existing symptom subtype distinctions, recent research suggests that prediction can be improved by using machine learning methods. However, it is not known whether these distinctions can be refined by added information about co-morbid conditions. The current report presents results on this question. Data came from 8261 respondents with lifetime DSM-IV MDD in the World Health Organization (WHO) World Mental Health (WMH) Surveys. Outcomes included four retrospectively reported measures of persistence/severity of course (years in episode; years in chronic episodes; hospitalization for MDD; disability due to MDD). Machine learning methods (regression tree analysis; lasso, ridge and elastic net penalized regression) followed by k-means cluster analysis were used to augment previously detected subtypes with information about prior co-morbidity to predict these outcomes. Predicted values were strongly correlated across outcomes. Cluster analysis of predicted values found three clusters with consistently high, intermediate or low values. The high-risk cluster (32.4% of cases) accounted for 56.6-72.9% of high persistence, high chronicity, hospitalization and disability. This high-risk cluster had both higher sensitivity and likelihood ratio positive (LR+; relative proportions of cases in the high-risk cluster versus other clusters having the adverse outcomes) than in a parallel analysis that excluded measures of co-morbidity as predictors. Although the results using the retrospective data reported here suggest that useful MDD subtyping distinctions can be made with machine learning and clustering across multiple indicators of illness persistence/severity, replication with prospective data is needed to confirm this preliminary conclusion.

  6. Analysis of a hyperdeformed band of 152(66)Dy86 on the basis of a structure with two revolving clusters, each with a previously unrecognized two-tiered structure.

    PubMed

    Pauling, L

    1994-02-01

    Analysis on the basis of the two-revolving-cluster model has been made of a cascade of 11 gamma-rays constituting a hyperdeformed band of 152(66)Dy86 (or possibly 153Dy) reported by Galindo-Uribarri et al. [Galindo-Uribarri, A., et al. (1993) Phys. Rev. Lett. 73, 231-234], leading to the conclusions that the band extends from values K approximately 82-104 for the angular-momentum quantum number, that the moment of inertia is approximately 5650 Da.fm2, that the composition of the central sphere is p40n50 and that of each of the clusters is p13n18, that each of the clusters consists of two tiers of spherons, and that the radii of revolution of the inner and outer tiers have values of about 8.00 and 11.20 fm, respectively.

  7. Analysis of a hyperdeformed band of 152(66)Dy86 on the basis of a structure with two revolving clusters, each with a previously unrecognized two-tiered structure.

    PubMed Central

    Pauling, L

    1994-01-01

    Analysis on the basis of the two-revolving-cluster model has been made of a cascade of 11 gamma-rays constituting a hyperdeformed band of 152(66)Dy86 (or possibly 153Dy) reported by Galindo-Uribarri et al. [Galindo-Uribarri, A., et al. (1993) Phys. Rev. Lett. 73, 231-234], leading to the conclusions that the band extends from values K approximately 82-104 for the angular-momentum quantum number, that the moment of inertia is approximately 5650 Da.fm2, that the composition of the central sphere is p40n50 and that of each of the clusters is p13n18, that each of the clusters consists of two tiers of spherons, and that the radii of revolution of the inner and outer tiers have values of about 8.00 and 11.20 fm, respectively. PMID:11607453

  8. Objective and Perceived Weight: Associations with Risky Adolescent Sexual Behavior

    PubMed Central

    Akers, Aletha Y.; Cohen, Elan D.; Marshal, Michael P.; Roebuck, Geoff; Yu, Lan; Hipwell, Alison E.

    2016-01-01

    CONTEXT Studies have shown that obesity is associated with increased sexual risk-taking, particularly among adolescent females, but the relationships between obesity, perceived weight and sexual risk behaviors are poorly understood. METHODS Integrative data analysis was performed that combined baseline data from the 1994–1995 National Longitudinal Study of Adolescent Health (from 17,606 respondents in grades 7–12) and the 1997 National Longitudinal Survey of Youth (from 7,752 respondents aged 12–16). Using six sexual behaviors measured in both data sets (age at first intercourse, various measures of contraceptive use and number of partners), cluster analysis was conducted that identified five distinct behavior clusters. Multivariate ordinal logistic regression analysis examined associations between adolescents’ weight status (categorized as underweight, normal-weight, overweight or obese) and weight perception and their cluster membership. RESULTS Among males, being underweight, rather than normal-weight, was negatively associated with membership in increasingly risky clusters (odds ratio, 0.5), as was the perception of being overweight, as opposed to about the right weight (0.8). However, being overweight was positively associated with males’ membership in increasingly risky clusters (1.3). Among females, being obese, rather than normal-weight, was negatively correlated with membership in increasingly risky clusters (0.8), while the perception of being overweight was positively correlated with such membership (1.1). CONCLUSIONS Both objective and subjective assessments of weight are associated with the clustering of risky sexual behaviors among adolescents, and these behavioral patterns differ by gender. PMID:27608419

  9. Examination of Previously Published Data to Identify Patterns in the Social Representation of “Loud Music” in Young Adults Across Countries

    PubMed Central

    Manchaiah, Vinaya; Zhao, Fei; Oladeji, Susan; Ratinaud, Pierre

    2018-01-01

    Purpose: The current study was aimed at understanding the patterns in the social representation of loud music reported by young adults in different countries. Materials and Methods: The study included a sample of 534 young adults (18–25 years) from India, Iran, Portugal, United Kingdom, and United States. Participants were recruited using convenience sampling, and data were collected using the free association task. Participants were asked to provide up to five words or phrases that come to mind when thinking about “loud music.” The data were first analyzed using qualitative content analysis. This was followed by quantitative cluster analysis and chi-square analysis. Results: The content analysis suggested 19 main categories of responses related to loud music. The cluster analysis resulted in four main clusters, namely: (1) emotional oriented perception; (2) problem oriented perception; (3) music and enjoyment oriented perception; and (4) positive emotional and recreation-oriented perception. Country of origin was associated with the likelihood of participants being in each of these clusters. Conclusion: The current study highlights the differences and similarities in young adults’ perception of loud music. These results may have implications for hearing health education to facilitate healthy listening habits. PMID:29457602

  10. Effect of Dust Coagulation Dynamics on the Geometry of Aggregates

    NASA Technical Reports Server (NTRS)

    Nakamura, R.

    1996-01-01

    The master equation gives a more fundamental description of stochastic coagulation processes than the popular Smoluchowski equation. In order to examine the effect of the dynamics on the geometry of the resulting aggregates, we study the master equation with a rigorous Monte Carlo algorithm. It is found that the Cluster-Cluster aggregation model is a good approximation of orderly growth and that the aggregates have fluffy structures with a fractal dimension of approx. 2. A scaling analysis of Smoluchowski's equation also supports this conclusion.

  11. A New Classification of Diabetic Gait Pattern Based on Cluster Analysis of Biomechanical Data

    PubMed Central

    Sawacha, Zimi; Guarneri, Gabriella; Avogaro, Angelo; Cobelli, Claudio

    2010-01-01

    Background The diabetic foot, one of the most serious complications of diabetes mellitus and a major risk factor for plantar ulceration, is determined mainly by peripheral neuropathy. Neuropathic patients exhibit decreased stability while standing as well as during dynamic conditions. A new methodology for diabetic gait pattern classification based on cluster analysis has been proposed that aims to identify groups of subjects with similar patterns of gait and verify if three-dimensional gait data are able to distinguish diabetic gait patterns from those of control subjects. Method The gait of 20 nondiabetic individuals and 46 diabetes patients with and without peripheral neuropathy was analyzed [mean age 59.0 (2.9) and 61.1 (4.4) years, mean body mass index (BMI) 24.0 (2.8) and 26.3 (2.0)]. K-means cluster analysis was applied to classify the subjects' gait patterns through the analysis of their ground reaction forces, joints and segments (trunk, hip, knee, ankle) angles, and moments. Results Cluster analysis classification led to definition of four well-separated clusters: one aggregating just neuropathic subjects, one aggregating both neuropathics and non-neuropathics, one including only diabetes patients, and one including either controls or diabetic and neuropathic subjects. Conclusions Cluster analysis was useful in grouping subjects with similar gait patterns and provided evidence that there were subgroups that might otherwise not be observed if a group ensemble was presented for any specific variable. In particular, we observed the presence of neuropathic subjects with a gait similar to the controls and diabetes patients with a long disease duration with a gait as altered as the neuropathic one. PMID:20920432
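
K-means clustering, the method named in this abstract, can be sketched in a few lines. This is a generic textbook version (Lloyd's algorithm), not the authors' pipeline: the function name `kmeans`, the choice of the first `k` points as starting centroids, and the two-feature toy data are all assumptions made for illustration.

```python
import math


def kmeans(points, k, iters=100):
    """Plain Lloyd's k-means on a list of equal-length numeric tuples."""
    # Deterministic start (an assumption): the first k points are the centroids.
    centroids = [tuple(p) for p in points[:k]]
    labels = None
    for _ in range(iters):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: each centroid moves to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


# Hypothetical two-feature measurements; not data from the study.
points = [(1.0, 1.0), (9.0, 9.0), (1.2, 0.8), (8.8, 9.2), (0.9, 1.1), (9.1, 8.9)]
labels, centroids = kmeans(points, k=2)
print(labels)
```

In practice the features would be standardized first and the run repeated from several random starts, since k-means is sensitive to both scaling and initialization.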

  12. Using Fuzzy Clustering for Real-time Space Flight Safety

    NASA Technical Reports Server (NTRS)

    Lee, Charles; Haskell, Richard E.; Hanna, Darrin; Alena, Richard L.

    2004-01-01

    To ensure space flight safety, it is necessary to monitor myriad sensor readings on the ground and in flight. Since a space shuttle has many sensors, monitoring data and drawing conclusions from information contained within the data in real time is challenging. The nature of the information can be critical to the success of the mission and safety of the crew and therefore must be processed with minimal data-processing time. Data analysis algorithms could be used to synthesize sensor readings and compare data associated with normal operation with data that contain fault patterns to draw conclusions. Detecting abnormal operation during early stages in the transition from safe to unsafe operation requires a large amount of historical data that can be categorized into different classes (non-risk, risk). Even though 40 years of the shuttle flight program have accumulated volumes of historical data, these data do not comprehensively represent all possible fault patterns, since fault patterns are usually unknown before the fault occurs. This paper presents a method that uses a similarity measure between fuzzy clusters to detect possible faults in real time. A clustering technique based on a fuzzy equivalence relation is used to characterize temporal data. Data collected during an initial time period are separated into clusters. These clusters are characterized by their centroids. Clusters formed during subsequent time periods are either merged with an existing cluster or added to the cluster list. The resulting list of cluster centroids, called a cluster group, characterizes the behavior of a particular set of temporal data. The degree to which new clusters formed in a subsequent time period are similar to the cluster group is characterized by a similarity measure, q. This method is applied to downlink data from Columbia flights. The results show that this technique can detect an unexpected fault that was not present in the training data set.
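
The merge-or-add bookkeeping of a cluster group described in this abstract can be sketched as follows. This is a simplified stand-in, not the paper's method: it swaps the fuzzy equivalence relation for a plain Euclidean distance threshold, and the class name `ClusterGroup`, the `threshold` parameter, and the definition of q as the fraction of new centroids that match an existing one are all assumptions made for illustration.

```python
import math


class ClusterGroup:
    """Running list of cluster centroids built up over successive time periods."""

    def __init__(self, threshold):
        self.threshold = threshold  # hypothetical merge distance
        self.centroids = []         # one tuple per known cluster
        self.counts = []            # number of centroids merged into each cluster

    def _nearest(self, c):
        """Return (index, distance) of the closest stored centroid."""
        if not self.centroids:
            return None, math.inf
        i = min(range(len(self.centroids)),
                key=lambda j: math.dist(c, self.centroids[j]))
        return i, math.dist(c, self.centroids[i])

    def update(self, new_centroids):
        """Merge or add each new centroid; return q, the fraction that matched."""
        matched = 0
        for c in new_centroids:
            i, d = self._nearest(c)
            if i is not None and d <= self.threshold:
                # Merge: fold the new centroid into the running mean.
                n = self.counts[i]
                self.centroids[i] = tuple(
                    (n * a + b) / (n + 1) for a, b in zip(self.centroids[i], c)
                )
                self.counts[i] = n + 1
                matched += 1
            else:
                # No close match: record it as a new cluster.
                self.centroids.append(tuple(c))
                self.counts.append(1)
        return matched / len(new_centroids) if new_centroids else 1.0


group = ClusterGroup(threshold=1.0)
group.update([(0.0, 0.0), (5.0, 5.0)])         # initial period: builds the group
q_normal = group.update([(0.2, 0.0), (5.0, 5.2)])
q_fault = group.update([(0.1, 0.0), (20.0, 20.0)])
print(q_normal, q_fault)
```

Under these assumptions a drop in q flags a time period whose clusters do not resemble anything seen so far, which is the role the abstract assigns to the similarity measure.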

  13. Cluster Subcutaneous Allergen Specific Immunotherapy for the Treatment of Allergic Rhinitis: A Systematic Review and Meta-Analysis

    PubMed Central

    Sun, Yueqi; Luo, Xi; Li, Huabin

    2014-01-01

    Background Although allergen specific immunotherapy (SIT) represents the only immune-modifying and curative option available for patients with allergic rhinitis (AR), the optimal schedule for specific subcutaneous immunotherapy (SCIT) is still unknown. The objective of this study is to systematically assess the efficacy and safety of cluster SCIT for patients with AR. Methods By searching PubMed, EMBASE and the Cochrane clinical trials database from 1980 through May 10th, 2013, we collected and analyzed the randomized controlled trials (RCTs) of cluster SCIT to assess its efficacy and safety. Results Eight trials involving 567 participants were included in this systematic review. Our meta-analysis showed that cluster SCIT has a similar effect in reducing both rhinitis symptoms and the requirement for anti-allergic medication compared with conventional SCIT, but when cluster SCIT was compared with placebo, no statistically significant difference was found in the reduction of symptom scores or medication scores. Some caution is required in this interpretation as there was significant heterogeneity between studies. Data relating to the Rhinoconjunctivitis Quality of Life Questionnaire (RQLQ) in 3 included studies were analyzed, and they consistently point to the efficacy of cluster SCIT in improving quality of life compared to placebo. To assess the safety of cluster SCIT, meta-analysis showed that no differences existed in the incidence of either local adverse reaction or systemic adverse reaction between the cluster group and control group. Conclusion Based on the current limited evidence, we still could not conclude affirmatively that cluster SCIT was a safe and efficacious option for the treatment of AR patients. Further large-scale, well-designed RCTs on this topic are still needed. PMID:24489740

  14. Infrared spectroscopy reveals both qualitative and quantitative differences in equine subchondral bone during maturation

    NASA Astrophysics Data System (ADS)

    Kobrina, Yevgeniya; Isaksson, Hanna; Sinisaari, Miikka; Rieppo, Lassi; Brama, Pieter A.; van Weeren, René; Helminen, Heikki J.; Jurvelin, Jukka S.; Saarakkala, Simo

    2010-11-01

    The collagen phase in bone is known to undergo major changes during growth and maturation. The objective of this study is to clarify whether Fourier transform infrared (FTIR) microspectroscopy, coupled with cluster analysis, can detect quantitative and qualitative changes in the collagen matrix of subchondral bone in horses during maturation and growth. Equine subchondral bone samples (n = 29) from the proximal joint surface of the first phalanx are prepared from two sites subjected to different loading conditions. Three age groups are studied: newborn (0 days old), immature (5 to 11 months old), and adult (6 to 10 years old) horses. Spatial collagen content and collagen cross-link ratio are quantified from the spectra. Additionally, normalized second derivative spectra of samples are clustered using the k-means clustering algorithm. In quantitative analysis, collagen content in the subchondral bone increases rapidly between the newborn and immature horses. The collagen cross-link ratio increases significantly with age. In qualitative analysis, clustering is able to separate newborn and adult samples into two different groups. The immature samples display some nonhomogeneity. In conclusion, this is the first study showing that FTIR spectral imaging combined with clustering techniques can detect quantitative and qualitative changes in the collagen matrix of subchondral bone during growth and maturation.

  15. Supervised group Lasso with applications to microarray data analysis

    PubMed Central

    Ma, Shuangge; Song, Xiao; Huang, Jian

    2007-01-01

    Background A tremendous amount of effort has been devoted to identifying genes for diagnosis and prognosis of diseases using microarray gene expression data. It has been demonstrated that gene expression data have cluster structure, where the clusters consist of co-regulated genes which tend to have coordinated functions. However, most available statistical methods for gene selection do not take into consideration the cluster structure. Results We propose a supervised group Lasso approach that takes into account the cluster structure in gene expression data for gene selection and predictive model building. For gene expression data without biological cluster information, we first divide genes into clusters using the K-means approach and determine the optimal number of clusters using the Gap method. The supervised group Lasso consists of two steps. In the first step, we identify important genes within each cluster using the Lasso method. In the second step, we select important clusters using the group Lasso. Tuning parameters are determined using V-fold cross validation at both steps to allow for further flexibility. Prediction performance is evaluated using leave-one-out cross validation. We apply the proposed method to disease classification and survival analysis with microarray data. Conclusion We analyze four microarray data sets using the proposed approach: two cancer data sets with binary cancer occurrence as outcomes and two lymphoma data sets with survival outcomes. The results show that the proposed approach is capable of identifying a small number of influential gene clusters and important genes within those clusters, and has better prediction performance than existing methods. PMID:17316436

  16. Dynamics of cD Clusters of Galaxies. 4; Conclusion of a Survey of 25 Abell Clusters

    NASA Technical Reports Server (NTRS)

    Oegerle, William R.; Hill, John M.; Fisher, Richard R. (Technical Monitor)

    2001-01-01

    We present the final results of a spectroscopic study of a sample of cD galaxy clusters. The goal of this program has been to study the dynamics of the clusters, with emphasis on determining the nature and frequency of cD galaxies with peculiar velocities. Redshifts measured with the MX Spectrometer have been combined with those obtained from the literature to obtain typically 50 - 150 observed velocities in each of 25 galaxy clusters containing a central cD galaxy. We present a dynamical analysis of the final 11 clusters to be observed in this sample. All 25 clusters are analyzed in a uniform manner to test for the presence of substructure, and to determine peculiar velocities and their statistical significance for the central cD galaxy. These peculiar velocities were used to determine whether or not the central cD galaxy is at rest in the cluster potential well. We find that 30 - 50% of the clusters in our sample possess significant subclustering (depending on the cluster radius used in the analysis), which is in agreement with other studies of non-cD clusters. Hence, the dynamical state of cD clusters is not different than other present-day clusters. After careful study, four of the clusters appear to have a cD galaxy with a significant peculiar velocity. Dressler-Shectman tests indicate that three of these four clusters have statistically significant substructure within 1.5 h_75^-1 Mpc of the cluster center. The dispersion of the cD peculiar velocities is 164 +41/-34 km/s around the mean cluster velocity. This represents a significant detection of peculiar cD velocities, but at a level which is far below the mean velocity dispersion for this sample of clusters. The picture that emerges is one in which cD galaxies are nearly at rest with respect to the cluster potential well, but have small residual velocities due to subcluster mergers.

  17. A Systems Biology Approach for Identifying Hepatotoxicant Groups Based on Similarity in Mechanisms of Action and Chemical Structure.

    PubMed

    Hebels, Dennie G A J; Rasche, Axel; Herwig, Ralf; van Westen, Gerard J P; Jennen, Danyel G J; Kleinjans, Jos C S

    2016-01-01

    When evaluating compound similarity, addressing multiple sources of information to reach conclusions about common pharmaceutical and/or toxicological mechanisms of action is a crucial strategy. In this chapter, we describe a systems biology approach that incorporates analyses of hepatotoxicant data for 33 compounds from three different sources: a chemical structure similarity analysis based on the 3D Tanimoto coefficient, a chemical structure-based protein target prediction analysis, and a cross-study/cross-platform meta-analysis of in vitro and in vivo human and rat transcriptomics data derived from public resources (i.e., the diXa data warehouse). Hierarchical clustering of the outcome scores of the separate analyses did not result in a satisfactory grouping of compounds considering their known toxic mechanism as described in literature. However, a combined analysis of multiple data types may hypothetically compensate for missing or unreliable information in any of the single data types. We therefore performed an integrated clustering analysis of all three data sets using the R-based tool iClusterPlus. This indeed improved the grouping results. The compound clusters that were formed by means of iClusterPlus represent groups that show similar gene expression while simultaneously integrating a similarity in structure and protein targets, which corresponds much better with the known mechanism of action of these toxicants. Using an integrative systems biology approach may thus overcome the limitations of the separate analyses when grouping liver toxicants sharing a similar mechanism of toxicity.
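
For readers unfamiliar with the Tanimoto coefficient mentioned above, here is a minimal sketch of its common set-based form, |A ∩ B| / |A ∪ B|, applied to hypothetical fingerprint bit sets. The chapter uses a 3D variant of the coefficient; this simpler form and the example fingerprints are stand-ins chosen for illustration.

```python
def tanimoto(a, b):
    """Tanimoto (Jaccard) similarity between two collections of 'on' bits."""
    a, b = set(a), set(b)
    union = len(a | b)
    if union == 0:
        return 1.0  # two empty fingerprints: treated as identical (a convention assumed here)
    return len(a & b) / union


if __name__ == "__main__":
    # Hypothetical fingerprints: indices of structural features present.
    compound_x = {1, 2, 3}
    compound_y = {2, 3, 4}
    print(tanimoto(compound_x, compound_y))  # 2 shared bits out of 4 distinct bits
```

A pairwise matrix of such scores is the kind of input that a hierarchical clustering step can consume after conversion to distances (1 minus similarity).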

  18. Recombination-enhanced surface expansion of clusters in intense soft x-ray laser pulses

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rupp, Daniela; Flückiger, Leonie; Adolph, Marcus

    Here, we studied the nanoplasma formation and explosion dynamics of single large xenon clusters in ultrashort, intense x-ray free-electron laser pulses via ion spectroscopy. The simultaneous measurement of single-shot diffraction images enabled a single-cluster analysis that is free from any averaging over the cluster size and laser intensity distributions. The measured charge state-resolved ion energy spectra show narrow distributions with peak positions that scale linearly with final ion charge state. These two distinct signatures are attributed to highly efficient recombination that eventually leads to the dominant formation of neutral atoms in the cluster. The measured mean ion energies exceed the value expected without recombination by more than an order of magnitude, indicating that the energy release resulting from electron-ion recombination constitutes a previously unnoticed nanoplasma heating process. This conclusion is supported by results from semiclassical molecular dynamics simulations.

  19. Recombination-enhanced surface expansion of clusters in intense soft x-ray laser pulses

    DOE PAGES

    Rupp, Daniela; Flückiger, Leonie; Adolph, Marcus; ...

    2016-10-07

    Here, we studied the nanoplasma formation and explosion dynamics of single large xenon clusters in ultrashort, intense x-ray free-electron laser pulses via ion spectroscopy. The simultaneous measurement of single-shot diffraction images enabled a single-cluster analysis that is free from any averaging over the cluster size and laser intensity distributions. The measured charge state-resolved ion energy spectra show narrow distributions with peak positions that scale linearly with final ion charge state. These two distinct signatures are attributed to highly efficient recombination that eventually leads to the dominant formation of neutral atoms in the cluster. The measured mean ion energies exceed the value expected without recombination by more than an order of magnitude, indicating that the energy release resulting from electron-ion recombination constitutes a previously unnoticed nanoplasma heating process. This conclusion is supported by results from semiclassical molecular dynamics simulations.

  20. A framework to spatially cluster air pollution monitoring sites in US based on the PM2.5 composition

    PubMed Central

    Austin, Elena; Coull, Brent A.; Zanobetti, Antonella; Koutrakis, Petros

    2013-01-01

    Background Heterogeneity in the response to PM2.5 is hypothesized to be related to differences in particle composition across monitoring sites which reflect differences in source types as well as climatic and topographic conditions impacting different geographic locations. Identifying spatial patterns in particle composition is a multivariate problem that requires novel methodologies. Objectives Use cluster analysis methods to identify spatial patterns in PM2.5 composition. Verify that the resulting clusters are distinct and informative. Methods 109 monitoring sites with 75% reported speciation data during the period 2003–2008 were selected. These sites were categorized based on their average PM2.5 composition over the study period using k-means cluster analysis. The obtained clusters were validated and characterized based on their physico-chemical characteristics, geographic locations, emissions profiles, population density and proximity to major emission sources. Results Overall 31 clusters were identified. These include 21 clusters with 2 or more sites which were further grouped into 4 main types using hierarchical clustering. The resulting groupings are chemically meaningful and represent broad differences in emissions. The remaining clusters, encompassing single sites, were characterized based on their particle composition and geographic location. Conclusions The framework presented here provides a novel tool which can be used to identify and further classify sites based on their PM2.5 composition. The solution presented is fairly robust and yielded groupings that were meaningful in the context of air-pollution research. PMID:23850585

  1. The Technical and Biological Reproducibility of Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry (MALDI-TOF MS) Based Typing: Employment of Bioinformatics in a Multicenter Study

    PubMed Central

    Oberle, Michael; Wohlwend, Nadia; Jonas, Daniel; Maurer, Florian P.; Jost, Geraldine; Tschudin-Sutter, Sarah; Vranckx, Katleen; Egli, Adrian

    2016-01-01

    Background The technical, biological, and inter-center reproducibility of matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI TOF MS) typing data has not yet been explored. The aim of this study is to compare typing data from multiple centers employing bioinformatics using bacterial strains from two past outbreaks and non-related strains. Material/Methods Participants received twelve extended spectrum betalactamase-producing E. coli isolates and followed the same standard operating procedure (SOP) including a full-protein extraction protocol. All laboratories provided visually read spectra via flexAnalysis (Bruker, Germany). Raw data from each laboratory allowed calculating the technical and biological reproducibility between centers using BioNumerics (Applied Maths NV, Belgium). Results Technical and biological reproducibility ranged between 96.8–99.4% and 47.6–94.4%, respectively. The inter-center reproducibility showed a comparable clustering among identical isolates. Principal component analysis indicated a higher tendency to cluster within the same center. Therefore, we used a discriminant analysis, which completely separated the clusters. Next, we defined a reference center and performed a statistical analysis to identify specific peaks to identify the outbreak clusters. Finally, we used a classifier algorithm and a linear support vector machine on the determined peaks as classifier. A validation showed that within the set of the reference center, the identification of the cluster was 100% correct with a large contrast between the score with the correct cluster and the next best scoring cluster. Conclusions Based on the sufficient technical and biological reproducibility of MALDI-TOF MS based spectra, detection of specific clusters is possible from spectra obtained from different centers. However, we believe that a shared SOP and a bioinformatics approach are required to make the analysis robust and reliable. PMID:27798637

  2. NeatMap--non-clustering heat map alternatives in R.

    PubMed

    Rajaram, Satwik; Oono, Yoshi

    2010-01-22

    The clustered heat map is the most popular means of visualizing genomic data. It compactly displays a large amount of data in an intuitive format that facilitates the detection of hidden structures and relations in the data. However, it is hampered by its use of cluster analysis which does not always respect the intrinsic relations in the data, often requiring non-standardized reordering of rows/columns to be performed post-clustering. This sometimes leads to uninformative and/or misleading conclusions. Often it is more informative to use dimension-reduction algorithms (such as Principal Component Analysis and Multi-Dimensional Scaling) which respect the topology inherent in the data. Yet, despite their proven utility in the analysis of biological data, they are not as widely used. This is at least partially due to the lack of user-friendly visualization methods with the visceral impact of the heat map. NeatMap is an R package designed to meet this need. NeatMap offers a variety of novel plots (in 2 and 3 dimensions) to be used in conjunction with these dimension-reduction techniques. Like the heat map, but unlike traditional displays of such results, it allows the entire dataset to be displayed while visualizing relations between elements. It also allows superimposition of cluster analysis results for mutual validation. NeatMap is shown to be more informative than the traditional heat map with the help of two well-known microarray datasets. NeatMap thus preserves many of the strengths of the clustered heat map while addressing some of its deficiencies. It is hoped that NeatMap will spur the adoption of non-clustering dimension-reduction algorithms.

  3. Psychosocial Clusters and their Associations with Well-Being and Health: An Empirical Strategy for Identifying Psychosocial Predictors Most Relevant to Racially/Ethnically Diverse Women’s Health

    PubMed Central

    Jabson, Jennifer M.; Bowen, Deborah; Weinberg, Janice; Kroenke, Candyce; Luo, Juhua; Messina, Catherine; Shumaker, Sally; Tindle, Hilary A.

    2016-01-01

    BACKGROUND Strategies for identifying the most relevant psychosocial predictors in studies of racial/ethnic minority women’s health are limited because they largely exclude cultural influences and they assume that psychosocial predictors are independent. This paper proposes and tests an empirical solution. METHODS Hierarchical cluster analysis, conducted with data from 140,652 Women’s Health Initiative participants, identified clusters among individual psychosocial predictors. Multivariable analyses tested associations between clusters and health outcomes. RESULTS A Social Cluster and a Stress Cluster were identified. The Social Cluster was positively associated with well-being and inversely associated with chronic disease index, and the Stress Cluster was inversely associated with well-being and positively associated with chronic disease index. As hypothesized, the magnitude of association between clusters and outcomes differed by race/ethnicity. CONCLUSIONS By identifying psychosocial clusters and their associations with health, we have taken an important step toward understanding how individual psychosocial predictors interrelate and how empirically formed Stress and Social clusters relate to health outcomes. This study has also demonstrated important insight about differences in associations between these psychosocial clusters and health among racial/ethnic minorities. These differences could signal the best pathways for intervention modification and tailoring. PMID:27279761

  4. Validating clustering of molecular dynamics simulations using polymer models

    PubMed Central

    2011-01-01

    Background Molecular dynamics (MD) simulation is a powerful technique for sampling the meta-stable and transitional conformations of proteins and other biomolecules. Computational data clustering has emerged as a useful, automated technique for extracting conformational states from MD simulation data. Despite extensive application, relatively little work has been done to determine if the clustering algorithms are actually extracting useful information. A primary goal of this paper therefore is to provide such an understanding through a detailed analysis of data clustering applied to a series of increasingly complex biopolymer models. Results We develop a novel series of models using basic polymer theory that have intuitive, clearly-defined dynamics and exhibit the essential properties that we are seeking to identify in MD simulations of real biomolecules. We then apply spectral clustering, an algorithm particularly well-suited for clustering polymer structures, to our models and MD simulations of several intrinsically disordered proteins. Clustering results for the polymer models provide clear evidence that the meta-stable and transitional conformations are detected by the algorithm. The results for the polymer models also help guide the analysis of the disordered protein simulations by comparing and contrasting the statistical properties of the extracted clusters. Conclusions We have developed a framework for validating the performance and utility of clustering algorithms for studying molecular biopolymer simulations that utilizes several analytic and dynamic polymer models which exhibit well-behaved dynamics including: meta-stable states, transition states, helical structures, and stochastic dynamics. We show that spectral clustering is robust to anomalies introduced by structural alignment and that different structural classes of intrinsically disordered proteins can be reliably discriminated from the clustering results. To our knowledge, our framework is the first to utilize model polymers to rigorously test the utility of clustering algorithms for studying biopolymers. PMID:22082218

  5. XCluSim: a visual analytics tool for interactively comparing multiple clustering results of bioinformatics data

    PubMed Central

    2015-01-01

    Background Though cluster analysis has become a routine analytic task for bioinformatics research, it is still arduous for researchers to assess the quality of a clustering result. To select the best clustering method and its parameters for a dataset, researchers have to run multiple clustering algorithms and compare them. However, such a comparison task with multiple clustering results is cognitively demanding and laborious. Results In this paper, we present XCluSim, a visual analytics tool that enables users to interactively compare multiple clustering results based on the Visual Information Seeking Mantra. We build a taxonomy for categorizing existing techniques of clustering results visualization in terms of the Gestalt principles of grouping. Using the taxonomy, we choose the most appropriate interactive visualizations for presenting individual clustering results from different types of clustering algorithms. The efficacy of XCluSim is shown through case studies with a bioinformatician. Conclusions Compared to other relevant tools, XCluSim enables users to compare multiple clustering results in a more scalable manner. Moreover, XCluSim supports diverse clustering algorithms and dedicated visualizations and interactions for different types of clustering results, allowing more effective exploration of details on demand. Through case studies with a bioinformatics researcher, we received positive feedback on the functionalities of XCluSim, including its ability to help identify stably clustered items across multiple clustering results. PMID:26328893

  6. Functional Interference Clusters in Cancer Patients With Bone Metastases: A Secondary Analysis of RTOG 9714

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chow, Edward; James, Jennifer; Barsevick, Andrea

    Purpose: To explore the relationships (clusters) among the functional interference items in the Brief Pain Inventory (BPI) in patients with bone metastases. Methods: Patients enrolled in the Radiation Therapy Oncology Group (RTOG) 9714 bone metastases study were eligible. Patients were assessed at baseline and 4, 8, and 12 weeks after randomization for the palliative radiotherapy with the BPI, which consists of seven functional items: general activity, mood, walking ability, normal work, relations with others, sleep, and enjoyment of life. Principal component analysis with varimax rotation was used to determine the clusters between the functional items at baseline and the follow-up. Cronbach's alpha was used to determine the consistency and reliability of each cluster at baseline and follow-up. Results: There were 448 male and 461 female patients, with a median age of 67 years. There were two functional interference clusters at baseline, which accounted for 71% of the total variance. The first cluster (physical interference) included normal work and walking ability, which accounted for 58% of the total variance. The second cluster (psychosocial interference) included relations with others and sleep, which accounted for 13% of the total variance. The Cronbach's alpha statistics were 0.83 and 0.80, respectively. The functional clusters changed at week 12 in responders but persisted through week 12 in nonresponders. Conclusion: Palliative radiotherapy is effective in reducing bone pain. Functional interference component clusters exist in patients treated for bone metastases. These clusters changed over time in this study, possibly attributable to treatment. Further research is needed to examine these effects.
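
The Cronbach's alpha values reported above measure how consistently the items within a cluster move together. A minimal sketch of the usual formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), is given below. The scores are invented for illustration and the function name is an assumption; nothing here reproduces the study's data.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a scale.

    items: list of k lists, one per item, each holding one score per
    respondent (all lists the same length, at least two respondents).
    """
    k = len(items)
    if k < 2:
        raise ValueError("need at least two items")
    # Total score per respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


if __name__ == "__main__":
    # Hypothetical 0-10 interference ratings from four patients.
    normal_work = [1, 2, 3, 4]
    walking = [2, 1, 4, 3]
    print(cronbach_alpha([normal_work, walking]))
```

Values near 1 indicate that the items rise and fall together across patients; the formula is undefined when every respondent has the same total score.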

  7. Characterizing Suicide in Toronto: An Observational Study and Cluster Analysis

    PubMed Central

    Sinyor, Mark; Schaffer, Ayal; Streiner, David L

    2014-01-01

    Objective: To determine whether people who have died from suicide in a large epidemiologic sample form clusters based on demographic, clinical, and psychosocial factors. Method: We conducted a coroner’s chart review for 2886 people who died in Toronto, Ontario, from 1998 to 2010, and whose death was ruled as suicide by the Office of the Chief Coroner of Ontario. A cluster analysis using known suicide risk factors was performed to determine whether suicide deaths separate into distinct groups. Clusters were compared according to person- and suicide-specific factors. Results: Five clusters emerged. Cluster 1 had the highest proportion of females and nonviolent methods, and all had depression and a past suicide attempt. Cluster 2 had the highest proportion of people with a recent stressor and violent suicide methods, and all were married. Cluster 3 had mostly males between the ages of 20 and 64, and all had either experienced recent stressors, suffered from mental illness, or had a history of substance abuse. Cluster 4 had the youngest people and the highest proportion of deaths by jumping from height, few were married, and nearly one-half had bipolar disorder or schizophrenia. Cluster 5 had all unmarried people with no prior suicide attempts, and were the least likely to have an identified mental illness and most likely to leave a suicide note. Conclusions: People who die from suicide assort into different patterns of demographic, clinical, and death-specific characteristics. Identifying and studying subgroups of suicides may advance our understanding of the heterogeneous nature of suicide and help to inform development of more targeted suicide prevention strategies. PMID:24444321

  8. Association of Interleukin-1 gene clusters polymorphisms with primary open-angle glaucoma: a meta-analysis.

    PubMed

    Li, Junhua; Feng, Yifan; Sung, Mi Sun; Lee, Tae Hee; Park, Sang Woo

    2017-11-28

    Previous studies have associated polymorphisms of the Interleukin-1 (IL-1) gene cluster with the risk of primary open-angle glaucoma (POAG). However, the results were not consistent. Here, we performed a meta-analysis to evaluate the role of IL-1 gene cluster polymorphisms in POAG susceptibility. PubMed, EMBASE and Cochrane Library (up to July 15, 2017) were searched by two independent investigators. All case-control studies investigating the association between single-nucleotide polymorphisms (SNPs) of the IL-1 gene cluster and POAG risk were included. Odds ratios (ORs) with 95% confidence intervals (CIs) were calculated to quantify the strength of any association that had been examined in at least two studies. Five studies on IL-1β rs16944 (c. -511C > T) (1053 cases and 986 controls), 4 studies on IL-1α rs1800587 (c. -889C > T) (822 cases and 714 controls), and 4 studies on IL-1β rs1143634 (c. +3953C > T) (798 cases and 730 controls) were included. The results suggest that none of the three SNPs was associated with POAG risk. Stratification analyses indicated that rs1143634 has a suggestive association with high tension glaucoma (HTG) under dominant (P = 0.03), heterozygote (P = 0.04) and allelic models (P = 0.02); however, the weak association was nullified after Bonferroni adjustment for multiple tests. Based on the current meta-analysis, we found a lack of association between the three IL-1 SNPs and POAG. However, given the low statistical power, this conclusion should be interpreted with caution, and further well-designed studies with large sample sizes are required to validate it.
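
As a reminder of what an odds ratio with a 95% confidence interval means for a single case-control table, here is a minimal sketch using the log-odds (Woolf) approximation. It covers one 2x2 table only; the pooling across studies that a meta-analysis performs is not shown, and the counts are invented for illustration.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and approximate 95% CI for a 2x2 table.

    a: cases carrying the allele      b: cases without it
    c: controls carrying the allele   d: controls without it
    All four counts must be positive.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    log_or = math.log(odds_ratio)
    return odds_ratio, math.exp(log_or - z * se), math.exp(log_or + z * se)


if __name__ == "__main__":
    # Hypothetical counts; a CI that spans 1 indicates no significant association.
    print(odds_ratio_ci(20, 80, 10, 90))
```

An interval that includes 1, as in this made-up example, is the single-study analogue of the "no association" finding described above.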

  9. Modeling the Movement of Homicide by Type to Inform Public Health Prevention Efforts

    PubMed Central

    Grady, Sue; Pizarro, Jesenia M.; Melde, Chris

    2015-01-01

    Objectives. We modeled the spatiotemporal movement of hotspot clusters of homicide by motive in Newark, New Jersey, to investigate whether different homicide types have different patterns of clustering and movement. Methods. We obtained homicide data from the Newark Police Department Homicide Unit’s investigative files from 1997 through 2007 (n = 560). We geocoded the address at which each homicide victim was found and recorded the date of and the motive for the homicide. We used cluster detection software to model the spatiotemporal movement of statistically significant homicide clusters by motive, using census tract and month of occurrence as the spatial and temporal units of analysis. Results. Gang-motivated homicides showed evidence of clustering and diffusion through Newark. Additionally, gang-motivated homicide clusters overlapped to a degree with revenge and drug-motivated homicide clusters. Escalating dispute and nonintimate familial homicides clustered; however, there was no evidence of diffusion. Intimate partner and robbery homicides did not cluster. Conclusions. By tracking how homicide types diffuse through communities and determining which places have ongoing or emerging homicide problems by type, we can better inform the deployment of prevention and intervention efforts. PMID:26270315

  10. Statistical detection of geographic clusters of resistant Escherichia coli in a regional network with WHONET and SaTScan

    PubMed Central

    Park, Rachel; O'Brien, Thomas F.; Huang, Susan S.; Baker, Meghan A.; Yokoe, Deborah S.; Kulldorff, Martin; Barrett, Craig; Swift, Jamie; Stelling, John

    2016-01-01

    Objectives While antimicrobial resistance threatens the prevention, treatment, and control of infectious diseases, systematic analysis of routine microbiology laboratory test results worldwide can alert to new threats and promote timely response. This study explores statistical algorithms for recognizing geographic clustering of multi-resistant microbes within a healthcare network and monitoring the dissemination of new strains over time. Methods Escherichia coli antimicrobial susceptibility data from a three-year period stored in WHONET were analyzed across ten facilities in a healthcare network utilizing SaTScan's spatial multinomial model with two models for defining geographic proximity. We explored geographic clustering of multi-resistance phenotypes within the network and changes in clustering over time. Results Geographic clusters identified with the latitude/longitude and the non-parametric facility-grouping geographic models were similar, while the latter offers greater flexibility and generalizability. Iterative application of the clustering algorithms suggested the possible recognition of the initial appearance of invasive E. coli ST131 in the clinical database of a single hospital and subsequent dissemination to others. Conclusion Systematic analysis of routine antimicrobial resistance susceptibility test results supports the recognition of geographic clustering of microbial phenotypic subpopulations with WHONET and SaTScan, and iterative application of these algorithms can detect the initial appearance in and dissemination across a region prompting early investigation, response, and containment measures. PMID:27530311

  11. Identification of Clinical Phenotypes in Idiopathic Interstitial Pneumonia with Pulmonary Emphysema.

    PubMed

    Sato, Suguru; Tanino, Yoshinori; Misa, Kenichi; Fukuhara, Naoko; Nikaido, Takefumi; Uematsu, Manabu; Fukuhara, Atsuro; Wang, Xintao; Ishida, Takashi; Munakata, Mitsuru

    2016-01-01

    Objective Since the term "combined pulmonary fibrosis and emphysema" (CPFE) was first proposed, the co-existence of pulmonary fibrosis and pulmonary emphysema (PE) has drawn considerable attention. However, conflicting results on the clinical characteristics of patients with both pulmonary fibrosis and PE have been published because of the lack of an exact definition of CPFE. The goal of this study was thus to clarify the clinical characteristics and phenotypes of idiopathic interstitial pneumonia (IIP) with PE. Methods We retrospectively analyzed IIP patients who had been admitted to our hospital. Their chest high-resolution computed tomography images were classified into two groups according to the presence of PE. We then performed a cluster analysis to identify the phenotypes of IIP patients with PE. Results Forty-four (53.7%) out of 82 patients had at least mild emphysema in their bilateral lungs. The cluster analysis separated the IIP patients with PE into three clusters. The overall survival rate of one cluster that consisted of mainly idiopathic pulmonary fibrosis (IPF) patients was significantly worse than those of the other clusters. Conclusion Three different phenotypes can be identified in IIP patients with PE, and IPF with PE is a distinct clinical phenotype with a poor prognosis.

  12. Prediction of chemotherapeutic response in bladder cancer using k-means clustering of DCE-MRI pharmacokinetic parameters

    PubMed Central

    Nguyen, Huyen T.; Jia, Guang; Shah, Zarine K.; Pohar, Kamal; Mortazavi, Amir; Zynger, Debra L.; Wei, Lai; Yang, Xiangyu; Clark, Daniel; Knopp, Michael V.

    2015-01-01

    Purpose To apply k-means clustering of two pharmacokinetic parameters derived from 3T DCE-MRI to predict chemotherapeutic response in bladder cancer at the mid-cycle time-point. Materials and Methods With the pre-determined number of 3 clusters, k-means clustering was performed on non-dimensionalized Amp and kep estimates of each bladder tumor. Three cluster volume fractions (VFs) were calculated for each tumor at baseline and mid-cycle. The changes of three cluster VFs from baseline to mid-cycle were correlated with the tumor’s chemotherapeutic response. Receiver-operating-characteristics curve analysis was used to evaluate the performance of each cluster VF change as a biomarker of chemotherapeutic response in bladder cancer. Results k-means clustering partitioned each bladder tumor into cluster 1 (low kep and low Amp), cluster 2 (low kep and high Amp), cluster 3 (high kep and low Amp). The changes of all three cluster VFs were found to be associated with bladder tumor response to chemotherapy. The VF change of cluster 2 presented with the highest area-under-the-curve value (0.96) and the highest sensitivity/specificity/accuracy (96%/100%/97%) with a selected cutoff value. Conclusion k-means clustering of the two DCE-MRI pharmacokinetic parameters can characterize the complex microcirculatory changes within a bladder tumor to enable early prediction of the tumor’s chemotherapeutic response. PMID:24943272
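
The procedure described above (k-means with three clusters on two per-voxel parameters, then the fraction of tumor volume falling in each cluster) can be sketched in a few lines. This is a plain Lloyd's-algorithm illustration with caller-supplied starting centers and made-up (kep, Amp) pairs; the initialization, scaling, and data are assumptions, not the study's implementation.

```python
def kmeans(points, centers, iters=50):
    """Lloyd's algorithm for 2-D points with caller-supplied initial centers.

    Returns (labels, centers), where labels[i] is the index of the center
    nearest to points[i].
    """
    centers = [tuple(c) for c in centers]
    labels = []
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(
                range(len(centers)),
                key=lambda j: (p[0] - centers[j][0]) ** 2 + (p[1] - centers[j][1]) ** 2,
            )
            for p in points
        ]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for j, old in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(
                    (
                        sum(m[0] for m in members) / len(members),
                        sum(m[1] for m in members) / len(members),
                    )
                )
            else:
                new_centers.append(old)  # keep an empty cluster's center in place
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers


def volume_fractions(labels, k):
    """Fraction of voxels assigned to each of the k clusters."""
    return [labels.count(j) / len(labels) for j in range(k)]


if __name__ == "__main__":
    # Hypothetical normalized (kep, Amp) values for eight voxels of one tumor.
    voxels = [(0.1, 0.1), (0.2, 0.1), (0.1, 0.9), (0.2, 0.8),
              (0.9, 0.1), (0.8, 0.2), (0.9, 0.2), (0.85, 0.15)]
    labels, _ = kmeans(voxels, [(0, 0), (0, 1), (1, 0)])
    print(volume_fractions(labels, 3))  # [0.25, 0.25, 0.5]
```

Repeating the calculation at baseline and mid-cycle and subtracting the two fraction vectors gives the per-cluster VF change that the abstract evaluates as a response biomarker.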

  13. Cluster Analysis of Velocity Field Derived from Dense GNSS Network of Japan

    NASA Astrophysics Data System (ADS)

    Takahashi, A.; Hashimoto, M.

    2015-12-01

    Dense GNSS networks have been widely used to observe crustal deformation. Simpson et al. (2012) and Savage and Simpson (2013) conducted cluster analyses of the GNSS velocity field in the San Francisco Bay Area and the Mojave Desert, respectively. They successfully found velocity discontinuities and showed the advantage of cluster analysis for classifying a GNSS velocity field. Because strike-slip events are dominant in the western United States, the geometry there is simple. However, the Japanese Islands are tectonically complicated due to subduction of oceanic plates. There are many types of crustal deformation, such as slow slip events and large postseismic deformation. We propose a modified clustering method for the GNSS velocity field in Japan to separate time-variant and static crustal deformation. Our modification is to perform cluster analysis every several months or years and then qualify cluster member similarity. If a GNSS station moved differently from its neighboring GNSS stations, the station will not belong to the cluster that includes its surrounding stations. With this method, time-variant phenomena were distinguished. We applied our method to GNSS data of Japan from 1996 to 2015. From the analyses, the following conclusions were derived. First, the cluster boundaries are consistent with known active faults. For example, the Arima-Takatsuki-Hanaore fault system and the Shimane-Tottori segment proposed by Nishimura (2015) are recognized without using prior information. Second, the method improves the detectability of time-variable phenomena, such as a slow slip event in the northern part of the Hokkaido region detected by Ohzono et al. (2015). The last is the classification of postseismic deformation caused by large earthquakes. The result suggested velocity discontinuities in postseismic deformation of the Tohoku-oki earthquake. This result implies that postseismic deformation does not decay continuously in proportion to distance from the epicenter.

  14. Investigation of defect clusters in ion-irradiated Ni and NiCo using diffuse X-ray scattering and electron microscopy

    DOE PAGES

    Olsen, Raina J.; Jin, Ke; Lu, Chenyang; ...

    2015-11-23

    The nature of defect clusters in Ni and Ni$_{50}$Co$_{50}$ (NiCo) irradiated at room temperature with 2–16 MeV Ni ions is studied using asymptotic diffuse X-ray scattering and transmission electron microscopy (TEM). Analysis of the scattering data provides separate size distributions for vacancy and interstitial type defect clusters, showing that both types of defect clusters have a smaller size and higher density in NiCo than in Ni. Diffuse scattering results show good quantitative agreement with TEM results for cluster sizes greater than 4 nm diameter, but find that the majority of vacancy clusters are under 2 nm in NiCo, which, if not detected, would lead to the conclusion that defect density was actually lower in the alloy. Interstitial dislocation loops and stacking fault tetrahedra (SFTs) are identified by TEM. Lastly, comparison of diffuse scattering lineshapes to those calculated for dislocation loops and SFTs indicates that most of the vacancy clusters are SFTs.

  15. What is your patient’s cognitive profile? Three distinct subgroups of cognitive function in persons with heart failure

    PubMed Central

    Hawkins, Misty A.W.; Schaefer, Julie T.; Gunstad, John; Dolansky, Mary A.; Redle, Joseph D.; Josephson, Richard; Moore, Shirley M.; Hughes, Joel W.

    2014-01-01

    Purpose To determine whether patients with heart failure (HF) have distinct profiles of cognitive impairment. Background Cognitive impairment is common in HF. Recent work found three cognitive profiles in HF patients— (1) intact, (2) impaired, and (3) memory-impaired. We examined the reproducibility of these profiles and clarified mechanisms. Methods HF patients (68.6±9.7years; N=329) completed neuropsychological testing. Composite scores were created for cognitive domains and used to identify clusters via agglomerative-hierarchical cluster analysis. Results A 3-cluster solution emerged. Cluster 1 (n=109) had intact cognition. Cluster 2 (n=123) was impaired across all domains. Cluster 3 (n=97) had impaired memory only. Clusters differed in age, race, education, SES, IQ, BMI, and diabetes (ps ≤.026) but not in mood, anxiety, cardiovascular, or pulmonary disease (ps≥.118). Conclusions We replicated three distinct patterns of cognitive function in persons with HF. These profiles may help providers offer tailored care to patients with different cognitive and clinical needs. PMID:25510559

  16. Spatio-Temporal Analysis of Smear-Positive Tuberculosis in the Sidama Zone, Southern Ethiopia

    PubMed Central

    Dangisso, Mesay Hailu; Datiko, Daniel Gemechu; Lindtjørn, Bernt

    2015-01-01

    Background Tuberculosis (TB) is a disease of public health concern, with a varying distribution across settings depending on socio-economic status, HIV burden, availability and performance of the health system. Ethiopia is a country with a high burden of TB, with regional variations in TB case notification rates (CNRs). However, TB program reports are often compiled and reported at higher administrative units that do not show the burden at lower units, so there is limited information about the spatial distribution of the disease. We therefore aim to assess the spatial distribution and presence of spatio-temporal clustering of the disease in different geographic settings over 10 years in the Sidama Zone in southern Ethiopia. Methods Retrospective space–time and spatial analyses were carried out at the kebele level (the lowest administrative unit within a district) to identify spatial and space-time clusters of smear-positive pulmonary TB (PTB). Scan statistics, Global Moran’s I, and Getis-Ord Gi* statistics were all used to help analyze the spatial distribution and clusters of the disease across settings. Results A total of 22,545 smear-positive PTB cases notified over 10 years were used for spatial analysis. In a purely spatial analysis, we identified the most likely cluster of smear-positive PTB in 192 kebeles in eight districts (RR= 2, p<0.001), with 12,155 observed and 8,668 expected cases. The Gi* statistic also identified the clusters in the same areas, and the spatial clusters showed stability in most areas in each year during the study period. The space-time analysis also detected the most likely cluster in 193 kebeles in the same eight districts (RR= 1.92, p<0.001), with 7,584 observed and 4,738 expected cases in 2003-2012. Conclusion The study found variations in CNRs and significant spatio-temporal clusters of smear-positive PTB in the Sidama Zone. 
The findings can be used to guide TB control programs to devise effective TB control strategies for the geographic areas characterized by the highest CNRs. Further studies are required to understand the factors associated with clustering based on individual level locations and investigation of cases. PMID:26030162
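    As a rough illustration of one of the statistics named in the abstract above, Global Moran's I can be sketched in a few lines. This is a minimal sketch of the standard formula, not the study's actual workflow: the helper `chain_adjacency`, the binary neighbour weights, and the example rates are all hypothetical, and no significance test is included.

```python
def morans_i(values, weights):
    """Global Moran's I for values x_i and a spatial weight matrix w_ij.

    I = (n / W) * sum_ij w_ij (x_i - m)(x_j - m) / sum_i (x_i - m)^2,
    where m is the mean of x and W is the sum of all weights.
    """
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    total_weight = sum(sum(row) for row in weights)
    cross = sum(
        weights[i][j] * dev[i] * dev[j]
        for i in range(n)
        for j in range(n)
    )
    sum_sq = sum(d * d for d in dev)
    return (n / total_weight) * (cross / sum_sq)


def chain_adjacency(n):
    """Hypothetical layout: n areas in a row, each adjacent to its neighbours."""
    w = [[0.0] * n for _ in range(n)]
    for i in range(n - 1):
        w[i][i + 1] = 1.0
        w[i + 1][i] = 1.0
    return w


# Made-up notification rates for four areas in a row.
smooth_gradient = [1.0, 2.0, 3.0, 4.0]   # similar values sit next to each other
checkerboard = [1.0, 4.0, 1.0, 4.0]      # neighbours are always dissimilar

print(morans_i(smooth_gradient, chain_adjacency(4)))  # positive autocorrelation
print(morans_i(checkerboard, chain_adjacency(4)))     # negative autocorrelation
```

    Values of I above zero suggest that similar rates tend to be neighbours; values below zero suggest the opposite. Real analyses would use weights built from actual area boundaries and a permutation or normal-approximation test.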

  17. Spatial Hotspot Analysis of Acute Myocardial Infarction Events in an Urban Population: A Correlation Study of Health Problems and Industrial Installation

    PubMed Central

    NAMAYANDE, Motahareh Sadat; NEJADKOORKI, Farhad; NAMAYANDE, Seyedeh Mahdieh; DEHGHAN, Hamidreza

    2016-01-01

    Background: The current study’s objectives were to find any spatial patterns and hotspots of cardiovascular events and to perform a correlation study of any relationship between cardiovascular disease (CVD) and the location of industrial installations. Methods: We used the 2013 Acute Myocardial Infarction (AMI) hospital admission records, for admissions due to CVDs, of three main hospitals in Yazd, Yazd Province, Iran, and searched for a possible correlation between industries as point-source pollutants and the non-random distribution of AMI events. Results: The MI incidence rate in Yazd was 531 per 100,000 person-years among men, 458 per 100,000 person-years among women and 783 per 100,000 person-years in total. We applied a GIS hotspot analysis to determine feasible clusters, and two sets of clusters were observed. The mean age of the 56 AMI events that occurred in the cluster cells was 62.21±14.75 yr. Age and sex, as the main confounders of AMI, were evaluated in the cluster areas in comparison with other areas. We observed no significant difference regarding sex (59% in cluster cells versus 55% in total for men) or age (62.21±14.7 in cluster cells versus 63.28±13.98 in total for men). Conclusion: We found AMI event clusters in proximity to industrial installations, specifically a steel industry. There could also be an association between road-related pollutants and the observed sets of clusters, given the proximity of rather crowded highways to the event clusters. PMID:27057527

  18. Fingerprint analysis of Hibiscus mutabilis L. leaves based on ultra performance liquid chromatography with photodiode array detector combined with similarity analysis and hierarchical clustering analysis methods

    PubMed Central

    Liang, Xianrui; Ma, Meiling; Su, Weike

    2013-01-01

    Background: A method for chemical fingerprint analysis of Hibiscus mutabilis L. leaves was developed based on ultra performance liquid chromatography with photodiode array detector (UPLC-PAD) combined with similarity analysis (SA) and hierarchical clustering analysis (HCA). Materials and Methods: Ten batches of Hibiscus mutabilis L. leaf samples were collected from different regions of China. UPLC-PAD was employed to collect chemical fingerprints of Hibiscus mutabilis L. leaves. Results: The relative standard deviations (RSDs) of the relative retention times (RRT) and relative peak areas (RPA) of 10 characteristic peaks (one of which was identified as rutin) in the precision, repeatability and stability tests were less than 3%, and the fingerprint analysis method was validated as suitable for Hibiscus mutabilis L. leaves. Conclusions: Similarity analysis, based on the correlation coefficients between each pair of fingerprints, showed qualitatively abundant diversity of chemical constituents in the 10 batches of Hibiscus mutabilis L. leaf samples from different locations. Moreover, the HCA method clustered the samples into four classes, and the HCA dendrogram showed the close or distant relations among the 10 samples, which was consistent with the SA result to some extent. PMID:23930008
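    The similarity analysis described above rests on correlation coefficients between pairs of fingerprints. The sketch below shows one way such a pairwise matrix could be computed; it assumes the Pearson coefficient (the abstract does not say which coefficient was used), and the batch names and relative peak areas are invented for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two vectors of equal length >= 2")
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def similarity_matrix(fingerprints):
    """Pairwise correlations between named fingerprints (dict of dicts)."""
    names = list(fingerprints)
    return {
        a: {b: pearson(fingerprints[a], fingerprints[b]) for b in names}
        for a in names
    }


# Hypothetical relative peak areas for three batches (five peaks each).
batches = {
    "batch_A": [1.00, 0.42, 0.31, 0.18, 0.07],
    "batch_B": [1.00, 0.40, 0.33, 0.20, 0.05],
    "batch_C": [1.00, 0.10, 0.75, 0.02, 0.50],
}

for name, row in similarity_matrix(batches).items():
    print(name, {other: round(r, 3) for other, r in row.items()})
```

    The resulting matrix (or one minus it, as a distance) is also the usual input to a hierarchical clustering step such as the HCA mentioned in the abstract.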

  19. Clustering of health-related behaviors, health outcomes and demographics in Dutch adolescents: a cross-sectional study

    PubMed Central

    2013-01-01

    Background Recent studies show several health-related behaviors to cluster in adolescents. This has important implications for public health. Interrelated behaviors have been shown to be most effectively targeted by multimodal interventions addressing wider-ranging improvements in lifestyle instead of via separate interventions targeting individual behaviors. However, few previous studies have taken into account a broad, multi-disciplinary range of health-related behaviors and connected these behavioral patterns to health-related outcomes. This paper presents an analysis of the clustering of a broad range of health-related behaviors with relevant demographic factors and several health-related outcomes in adolescents. Methods Self-report questionnaire data were collected from a sample of 2,690 Dutch high school adolescents. Behavioral patterns were derived via Principal Components Analysis. Subsequently a Two-Step Cluster Analysis was used to identify groups of adolescents with similar behavioral patterns and health-related outcomes. Results Four distinct behavioral patterns describe the analyzed individual behaviors: 1- risk-prone behavior, 2- bully behavior, 3- problematic screen time use, and 4- sedentary behavior. Subsequent cluster analysis identified four clusters of adolescents. Multi-problem behavior was associated with problematic physical and psychosocial health outcomes, in contrast to adolescents exhibiting relatively few unhealthy behaviors. These associations were relatively independent of demographics such as ethnicity, gender and socio-economic status. Conclusions The results show that health-related behaviors tend to cluster, indicating that specific behavioral patterns underlie individual health behaviors. In addition, specific patterns of health-related behaviors were associated with specific health outcomes and demographic factors. 
In general, unhealthy behavior on account of multiple health-related behaviors was associated with both poor psychosocial and physical health. These findings have significant meaning for future public health programs, which should be more tailored with use of such knowledge on behavioral clustering via e.g. Transfer Learning. PMID:24305509

  20. Phylodynamic Analysis Reveals CRF01_AE Dissemination between Japan and Neighboring Asian Countries and the Role of Intravenous Drug Use in Transmission

    PubMed Central

    Shiino, Teiichiro; Hattori, Junko; Yokomaku, Yoshiyuki; Iwatani, Yasumasa; Sugiura, Wataru

    2014-01-01

    Background One major circulating HIV-1 subtype in Southeast Asian countries is CRF01_AE, but little is known about its epidemiology in Japan. We conducted a molecular phylodynamic study of patients newly diagnosed with CRF01_AE from 2003 to 2010. Methods Plasma samples from patients registered in the Japanese Drug Resistance HIV-1 Surveillance Network were analyzed for protease-reverse transcriptase sequences; all sequences underwent subtyping and phylogenetic analysis using distance-matrix-based, maximum likelihood and Bayesian coalescent Markov Chain Monte Carlo (MCMC) phylogenetic inferences. Transmission clusters were identified using the interior branch test and depth-first searches for sub-tree partitions. Times of most recent common ancestor (tMRCAs) of significant clusters were estimated using Bayesian MCMC analysis. Results Among 3618 patients registered in our network, 243 were infected with CRF01_AE. The majority of individuals with CRF01_AE were Japanese, predominantly male, and reported heterosexual contact as their risk factor. We found 5 large clusters with ≥5 members and 25 small clusters consisting of pairs of individuals with highly related CRF01_AE strains. The earliest cluster showed a tMRCA of 1996, and consisted of individuals whose known risk was heterosexual contact. The other four large clusters showed later tMRCAs between 2000 and 2002 with members including intravenous drug users (IVDU) and non-Japanese, but not men who have sex with men (MSM). In contrast, small clusters included a high frequency of individuals reporting MSM risk factors. Phylogenetic analysis also showed that some individuals were infected with HIV strains spread in East and South-eastern Asian countries. Conclusions Introduction of CRF01_AE viruses into Japan is estimated to have occurred in the 1990s. CRF01_AE spread via heterosexual behavior, then among persons connected with non-Japanese, IVDU, and MSM. 
Phylogenetic analysis demonstrated that some viral variants are largely restricted to Japan, while others have a broad geographic distribution. PMID:25025900

  1. Alternative Sigma Factor Over-Expression Enables Heterologous Expression of a Type II Polyketide Biosynthetic Pathway in Escherichia coli

    PubMed Central

    Stevens, David Cole; Conway, Kyle R.; Pearce, Nelson; Villegas-Peñaranda, Luis Roberto; Garza, Anthony G.; Boddy, Christopher N.

    2013-01-01

    Background Heterologous expression of bacterial biosynthetic gene clusters is currently an indispensable tool for characterizing biosynthetic pathways. Development of an effective, general heterologous expression system that can be applied to bioprospecting from metagenomic DNA will enable the discovery of a wealth of new natural products. Methodology We have developed a new Escherichia coli-based heterologous expression system for polyketide biosynthetic gene clusters. We have demonstrated that over-expression of the alternative sigma factor σ54 directly and positively regulates heterologous expression of the oxytetracycline biosynthetic gene cluster in E. coli. Bioinformatics analysis indicates that σ54 promoters are present in nearly 70% of polyketide and non-ribosomal peptide biosynthetic pathways. Conclusions We have demonstrated a new mechanism for heterologous expression of the oxytetracycline polyketide biosynthetic pathway, where high-level pleiotropic sigma factors from the heterologous host directly and positively regulate transcription of the non-native biosynthetic gene cluster. Our bioinformatics analysis is consistent with the hypothesis that heterologous expression mediated by the alternative sigma factor σ54 may be a viable method for the production of additional polyketide products. PMID:23724102

  2. On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms.

    PubMed

    Chen, Chunlei; He, Li; Zhang, Huixiang; Zheng, Hao; Wang, Lei

    2017-01-01

    Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering place high demands on the computing power of the hardware platform. Parallel computing is a common solution to meet this demand. Moreover, the General Purpose Graphic Processing Unit (GPGPU) is a promising parallel computing device. Nevertheless, incremental clustering algorithms face a dilemma between clustering accuracy and parallelism when they are powered by GPGPU. We formally analyzed the cause of this dilemma. First, we formalized concepts relevant to incremental clustering, such as evolving granularity. Second, we formally proved two theorems. The first theorem proves the relation between clustering accuracy and evolving granularity. Additionally, this theorem analyzes the upper and lower bounds of different-to-same mis-affiliation. Fewer occurrences of such mis-affiliation mean higher accuracy. The second theorem reveals the relation between parallelism and evolving granularity. Smaller work-depth means superior parallelism. Through the proofs, we conclude that the accuracy of an incremental clustering algorithm is negatively related to evolving granularity while parallelism is positively related to the granularity. These contradictory relations cause the dilemma. Finally, we validated the relations through a demo algorithm. Experimental results verified the theoretical conclusions.

  3. [Bibliometrics and visualization analysis of land use regression models in ambient air pollution research].

    PubMed

    Zhang, Y J; Zhou, D H; Bai, Z P; Xue, F X

    2018-02-10

    Objective: To quantitatively analyze the current status and development trends of land use regression (LUR) models in ambient air pollution studies. Methods: Relevant literature from the PubMed database before June 30, 2017 was analyzed, using the Bibliographic Items Co-occurrence Matrix Builder (BICOMB 2.0). Keyword co-occurrence networks, cluster mapping and timeline mapping were generated, using the CiteSpace 5.1.R5 software. Relevant literature identified in three Chinese databases was also reviewed. Results: Four hundred sixty-four relevant papers were retrieved from the PubMed database. The number of papers published showed an annual increase, in line with the growing trend of the index. Most papers were published in the journal Environmental Health Perspectives. Results from the co-word cluster analysis identified five clusters: cluster#0 consisted of birth cohort studies related to the health effects of prenatal exposure to air pollution; cluster#1 referred to land use regression modeling and exposure assessment; cluster#2 was related to the epidemiology of traffic exposure; cluster#3 dealt with the exposure to ultrafine particles and related health effects; cluster#4 described the exposure to black carbon and related health effects. Data from timeline mapping indicated that cluster#0 and #1 were the main research areas while cluster#3 and #4 were the up-and-coming hot areas of research. Ninety-four relevant papers were retrieved from the Chinese databases, with most of them related to studies on modeling. Conclusion: In order to better assess the health-related risks of ambient air pollution, and to best inform preventative public health intervention policies, application of LUR models to environmental epidemiology studies in China should be encouraged.

  4. Radiomics of CT Features May Be Nonreproducible and Redundant: Influence of CT Acquisition Parameters.

    PubMed

    Berenguer, Roberto; Pastor-Juan, María Del Rosario; Canales-Vázquez, Jesús; Castro-García, Miguel; Villas, María Victoria; Legorburo, Francisco Mansilla; Sabater, Sebastià

    2018-04-24

    Purpose To identify the reproducible and nonredundant radiomics features (RFs) for computed tomography (CT). Materials and Methods Two phantoms were used to test RF reproducibility by using test-retest analysis, by changing the CT acquisition parameters (hereafter, intra-CT analysis), and by comparing five different scanners with the same CT parameters (hereafter, inter-CT analysis). Reproducible RFs were selected by using the concordance correlation coefficient (as a measure of the agreement between variables) and the coefficient of variation (defined as the ratio of the standard deviation to the mean). Redundant features were grouped by using hierarchical cluster analysis. Results A total of 177 RFs including intensity, shape, and texture features were evaluated. The test-retest analysis showed that 91% (161 of 177) of the RFs were reproducible according to concordance correlation coefficient. Reproducibility of intra-CT RFs, based on coefficient of variation, ranged from 89.3% (151 of 177) to 43.1% (76 of 177) where the pitch factor and the reconstruction kernel were modified, respectively. Reproducibility of inter-CT RFs, based on coefficient of variation, also showed large material differences, from 85.3% (151 of 177; wood) to only 15.8% (28 of 177; polyurethane). Ten clusters were identified after the hierarchical cluster analysis and one RF per cluster was chosen as representative. Conclusion Many RFs were redundant and nonreproducible. If all the CT parameters are fixed except field of view, tube voltage, and milliamperage, then the information provided by the analyzed RFs can be summarized in only 10 RFs (each representing a cluster) because of redundancy. © RSNA, 2018 Online supplemental material is available for this article.
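    The two reproducibility measures named in the abstract above can be written down compactly. The sketch below uses Lin's concordance correlation coefficient (with population, 1/n, moments) and the coefficient of variation as the ratio of standard deviation to mean. It is only an illustration: the feature values are made up, the sample standard deviation is an assumption, and the abstract does not state the acceptance thresholds the authors applied.

```python
from statistics import fmean, stdev


def concordance_correlation(x, y):
    """Lin's concordance correlation coefficient for paired measurements.

    CCC = 2*s_xy / (s_x^2 + s_y^2 + (mean_x - mean_y)^2), using 1/n moments.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two paired series of equal length >= 2")
    n = len(x)
    mx, my = fmean(x), fmean(y)
    sxx = sum((a - mx) ** 2 for a in x) / n
    syy = sum((b - my) ** 2 for b in y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    return 2 * sxy / (sxx + syy + (mx - my) ** 2)


def coefficient_of_variation(values):
    """Ratio of the (sample) standard deviation to the mean."""
    return stdev(values) / fmean(values)


# Hypothetical test-retest values of one radiomics feature on a phantom.
scan_1 = [10.2, 11.5, 9.8, 12.1, 10.9]
scan_2 = [10.4, 11.3, 9.9, 12.4, 10.7]
print("CCC:", round(concordance_correlation(scan_1, scan_2), 3))

# Hypothetical values of the same feature across five acquisition settings.
across_settings = [10.2, 10.8, 9.6, 11.1, 10.4]
print("CV:", round(coefficient_of_variation(across_settings), 3))
```

    Unlike the Pearson coefficient, the CCC is penalized by a systematic offset between the two series, which is why it is used as a measure of agreement and not only of association.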

  5. Mass profile and dynamical status of the z ~ 0.8 galaxy cluster LCDCS 0504

    NASA Astrophysics Data System (ADS)

    Guennou, L.; Biviano, A.; Adami, C.; Limousin, M.; Lima Neto, G. B.; Mamon, G. A.; Ulmer, M. P.; Gavazzi, R.; Cypriano, E. S.; Durret, F.; Clowe, D.; LeBrun, V.; Allam, S.; Basa, S.; Benoist, C.; Cappi, A.; Halliday, C.; Ilbert, O.; Johnston, D.; Jullo, E.; Just, D.; Kubo, J. M.; Márquez, I.; Marshall, P.; Martinet, N.; Maurogordato, S.; Mazure, A.; Murphy, K. J.; Plana, H.; Rostagni, F.; Russeil, D.; Schirmer, M.; Schrabback, T.; Slezak, E.; Tucker, D.; Zaritsky, D.; Ziegler, B.

    2014-06-01

    Context. Constraints on the mass distribution in high-redshift clusters of galaxies are currently not very strong. Aims: We aim to constrain the mass profile, M(r), and dynamical status of the z ~ 0.8 LCDCS 0504 cluster of galaxies that is characterized by prominent giant gravitational arcs near its center. Methods: Our analysis is based on deep X-ray, optical, and infrared imaging as well as optical spectroscopy, collected with various instruments, which we complemented with archival data. We modeled the mass distribution of the cluster with three different mass density profiles, whose parameters were constrained by the strong lensing features of the inner cluster region, by the X-ray emission from the intracluster medium, and by the kinematics of 71 cluster members. Results: We obtain consistent M(r) determinations from three methods based on kinematics (dispersion-kurtosis, caustics, and MAMPOSSt), out to the cluster virial radius, ≃1.3 Mpc and beyond. The mass profile inferred by the strong lensing analysis in the central cluster region is slightly higher than, but still consistent with, the kinematics estimate. On the other hand, the X-ray based M(r) is significantly lower than the kinematics and strong lensing estimates. Theoretical predictions from ΛCDM cosmology for the concentration-mass relation agree with our observational results, when taking into account the uncertainties in the observational and theoretical estimates. There appears to be a central deficit in the intracluster gas mass fraction compared with nearby clusters. Conclusions: Despite the relaxed appearance of this cluster, the determinations of its mass profile by different probes show substantial discrepancies, the origin of which remains to be determined. 
The extension of a dynamical analysis similar to that of other clusters of the DAFT/FADA survey with multiwavelength data of sufficient quality will allow shedding light on the possible systematics that affect the determination of mass profiles of high-z clusters, which is possibly related to our incomplete understanding of intracluster baryon physics. Table 2 is available in electronic form at http://www.aanda.org

  6. Cluster randomised crossover trials with binary data and unbalanced cluster sizes: application to studies of near-universal interventions in intensive care.

    PubMed

    Forbes, Andrew B; Akram, Muhammad; Pilcher, David; Cooper, Jamie; Bellomo, Rinaldo

    2015-02-01

    Cluster randomised crossover trials have been utilised in recent years in the health and social sciences. Methods for analysis have been proposed; however, for binary outcomes, these have received little assessment of their appropriateness. In addition, methods for determination of sample size are currently limited to balanced cluster sizes both between clusters and between periods within clusters. This article aims to extend this work to unbalanced situations and to evaluate the properties of a variety of methods for analysis of binary data, with a particular focus on the setting of potential trials of near-universal interventions in intensive care to reduce in-hospital mortality. We derive a formula for sample size estimation for unbalanced cluster sizes, and apply it to the intensive care setting to demonstrate the utility of the cluster crossover design. We conduct a numerical simulation of the design in the intensive care setting and for more general configurations, and we assess the performance of three cluster summary estimators and an individual-data estimator based on binomial-identity-link regression. For settings similar to the intensive care scenario involving large cluster sizes and small intra-cluster correlations, the sample size formulae developed and analysis methods investigated are found to be appropriate, with the unweighted cluster summary method performing well relative to the more optimal but more complex inverse-variance weighted method. More generally, we find that the unweighted and cluster-size-weighted summary methods perform well, with the relative efficiency of each largely determined systematically from the study design parameters. Performance of individual-data regression is adequate with small cluster sizes but becomes inefficient for large, unbalanced cluster sizes. 
When outcome prevalences are 6% or less and the within-cluster-within-period correlation is 0.05 or larger, all methods display sub-nominal confidence interval coverage, with the less prevalent the outcome the worse the coverage. As with all simulation studies, conclusions are limited to the configurations studied. We confined attention to detecting intervention effects on an absolute risk scale using marginal models and did not explore properties of binary random effects models. Cluster crossover designs with binary outcomes can be analysed using simple cluster summary methods, and sample size in unbalanced cluster size settings can be determined using relatively straightforward formulae. However, caution needs to be applied in situations with low prevalence outcomes and moderate to high intra-cluster correlations. © The Author(s) 2014.
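    To make the idea of a cluster summary analysis concrete, the sketch below computes an unweighted and a cluster-size-weighted estimate of the risk difference from per-cluster, per-period counts. It is a simplified illustration under stated assumptions: each cluster contributes one intervention and one control period, the summary is the within-cluster difference in proportions, and the counts are invented. The inverse-variance weighted and regression estimators discussed in the abstract are not shown, and no confidence interval is computed.

```python
def cluster_differences(clusters):
    """Within-cluster risk differences and total cluster sizes.

    Each cluster is (events_intervention, n_intervention,
                     events_control, n_control).
    """
    diffs, sizes = [], []
    for events_int, n_int, events_ctl, n_ctl in clusters:
        diffs.append(events_int / n_int - events_ctl / n_ctl)
        sizes.append(n_int + n_ctl)
    return diffs, sizes


def unweighted_estimate(clusters):
    """Simple mean of the within-cluster risk differences."""
    diffs, _ = cluster_differences(clusters)
    return sum(diffs) / len(diffs)


def size_weighted_estimate(clusters):
    """Mean of the risk differences weighted by total cluster size."""
    diffs, sizes = cluster_differences(clusters)
    return sum(d * s for d, s in zip(diffs, sizes)) / sum(sizes)


# Hypothetical in-hospital deaths and admissions for three units.
icus = [
    (40, 500, 50, 480),   # large unit
    (12, 150, 15, 160),   # medium unit
    (3, 40, 5, 45),       # small unit
]
print("unweighted:   ", round(unweighted_estimate(icus), 4))
print("size-weighted:", round(size_weighted_estimate(icus), 4))
```

    With strongly unbalanced cluster sizes the two estimates can differ noticeably, which is one reason the choice of weighting matters in this design.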

  7. Differentiation of Recurrent Glioblastoma from Delayed Radiation Necrosis by Using Voxel-based Multiparametric Analysis of MR Imaging Data.

    PubMed

    Yoon, Ra Gyoung; Kim, Ho Sung; Koh, Myeong Ju; Shim, Woo Hyun; Jung, Seung Chai; Kim, Sang Joon; Kim, Jeong Hoon

    2017-10-01

    Purpose To assess a volume-weighted voxel-based multiparametric (MP) clustering method as an imaging biomarker to differentiate recurrent glioblastoma from delayed radiation necrosis. Materials and Methods The institutional review board approved this retrospective study and waived the informed consent requirement. Seventy-five patients with pathologic analysis-confirmed recurrent glioblastoma (n = 42) or radiation necrosis (n = 33) who presented with enlarged contrast material-enhanced lesions at magnetic resonance (MR) imaging after they completed concurrent chemotherapy and radiation therapy were enrolled. The diagnostic performance of the total MP cluster score was determined by using the area under the receiver operating characteristic curve (AUC) with cross-validation and compared with those of single parameter measurements (10% histogram cutoffs of apparent diffusion coefficient [ADC10] or 90% histogram cutoffs of normalized cerebral blood volume and initial time-signal intensity AUC). Results Receiver operating characteristic curve analysis showed that an AUC for differentiating recurrent glioblastoma from delayed radiation necrosis was highest in the total MP cluster score and lowest for ADC10 for both readers. The total MP cluster score had significantly better diagnostic accuracy than any single parameter (corrected P = .001-.039 for reader 1; corrected P = .005-.041 for reader 2). The total MP cluster score was the best predictor of recurrent glioblastoma (cross-validated AUCs, 0.942-0.946 for both readers), with a sensitivity of 95.2% for reader 1 and 97.6% for reader 2. Conclusion Quantitative analysis with volume-weighted voxel-based MP clustering appears to be superior to the use of single imaging parameters to differentiate recurrent glioblastoma from delayed radiation necrosis. © RSNA, 2017 Online supplemental material is available for this article.

  8. Networking between community health programs: a case study outlining the effectiveness, barriers and enablers

    PubMed Central

    2012-01-01

    Background In India, since the 1990s, there has been a burgeoning of NGOs involved in providing primary health care. This has resulted in a complex NGO-Government interface which is difficult for lone NGOs to navigate. The Uttarakhand Cluster, India, links such small community health programs together to build NGO capacity, increase visibility and better link to the government schemes and the formal healthcare system. This research, undertaken between 1998 and 2011, aims to examine barriers and facilitators to such linking, or clustering, and the effectiveness of this clustering approach. Methods Interviews, indicator surveys and participant observation were used to document the process and explore the enablers, the barriers and the effectiveness of networks improving community health. Results The analysis revealed that when activating, framing, mobilising and synthesizing the Uttarakhand Cluster, key brokers and network players were important in bridging between organisations. The ties (or relationships) that held the cluster together included homophily around common faith, common friendships and geographical location and common mission. Self interest whereby members sought funds, visibility, credibility, increased capacity and access to trainings was also a commonly identified motivating factor for networking. Barriers to network synthesizing included lack of funding, poor communication, limited time and lack of human resources. Risk aversion and mistrust remained significant barriers to overcome for such a network. Conclusions In conclusion, specific enabling factors allowed the clustering approach to be effective at increasing access to resources, creating collaborative opportunities and increasing visibility, credibility and confidence of the cluster members. 
These findings add to knowledge regarding social network formation and collaboration, and such knowledge will assist in the conceptualisation, formation and success of potential health networks in India and other developing world countries. PMID:22812627

  9. Spatial Clustering of Occupational Injuries in Communities

    PubMed Central

    Friedman, Lee; Chin, Brian; Madigan, Dana

    2015-01-01

    Objectives. Using the social-ecological model, we hypothesized that the home residences of injured workers would be clustered predictably and geographically. Methods. We linked health care and publicly available datasets by home zip code for traumatically injured workers in Illinois from 2000 to 2009. We calculated numbers and rates of injuries, determined the spatial relationships, and developed 3 models. Results. Among the 23 200 occupational injuries, 80% of cases were located in 20% of zip codes and clustered in 10 locations. After component analysis, numbers and clusters of injuries correlated directly with immigrants; injury rates inversely correlated with urban poverty. Conclusions. Traumatic occupational injuries were clustered spatially by home location of the affected workers and in a predictable way. This put an inequitable burden on communities and provided evidence for the possible value of community-based interventions for prevention of occupational injuries. Work should be included in health disparities research. Stakeholders should determine whether and how to intervene at the community level to prevent occupational injuries. PMID:25905838

  10. WordCluster: detecting clusters of DNA words and genomic elements

    PubMed Central

    2011-01-01

    Background Many k-mers (or DNA words) and genomic elements are known to be spatially clustered in the genome. Well established examples are the genes, TFBSs, CpG dinucleotides, microRNA genes and ultra-conserved non-coding regions. Currently, no algorithm exists to find these clusters in a statistically comprehensible way. The detection of clustering often relies on densities and sliding-window approaches or arbitrarily chosen distance thresholds. Results We introduce here an algorithm to detect clusters of DNA words (k-mers), or any other genomic element, based on the distance between consecutive copies and an assigned statistical significance. We implemented the method in a web server connected to a MySQL backend, which also determines the co-localization with gene annotations. We demonstrate the usefulness of this approach by detecting the clusters of CAG/CTG (cytosine contexts that can be methylated in undifferentiated cells), showing that the degree of methylation varies drastically between the inside and outside of the clusters. As another example, we used WordCluster to search for statistically significant clusters of olfactory receptor (OR) genes in the human genome. Conclusions WordCluster seems to predict biologically meaningful clusters of DNA words (k-mers) and genomic entities. The implementation of the method in a web server is available at http://bioinfo2.ugr.es/wordCluster/wordCluster.php, including additional features like the detection of co-localization with gene regions or the annotation enrichment tool for functional analysis of overlapped genes. PMID:21261981
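    The notion of grouping consecutive copies of a word by the distance between them can be sketched as follows. This is not the WordCluster algorithm: the abstract does not describe how the distance criterion is chosen or how statistical significance is assigned, so `max_gap` and `min_copies` here are hypothetical, hand-picked parameters and no p-value is computed. The toy sequence is also invented.

```python
def find_word_positions(sequence, word):
    """Start positions of every (possibly overlapping) copy of word."""
    positions = []
    start = sequence.find(word)
    while start != -1:
        positions.append(start)
        start = sequence.find(word, start + 1)
    return positions


def chain_by_gap(positions, max_gap, min_copies=2):
    """Group sorted positions into runs whose consecutive gaps are <= max_gap."""
    clusters, current = [], []
    for pos in positions:
        if current and pos - current[-1] > max_gap:
            if len(current) >= min_copies:
                clusters.append(current)
            current = []
        current.append(pos)
    if len(current) >= min_copies:
        clusters.append(current)
    return clusters


# Toy sequence: a CAG repeat, a long spacer, then two nearby CAG copies.
toy = "CAGCAGCAG" + "T" * 20 + "CAGTCAG"
hits = find_word_positions(toy, "CAG")
print(hits)
print(chain_by_gap(hits, max_gap=5))
```

    A statistically grounded method would replace the fixed `max_gap` with a threshold derived from the expected distance distribution and would attach a significance value to each resulting cluster.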

  11. Cluster analysis of bone microarchitecture from high resolution peripheral quantitative computed tomography demonstrates two separate phenotypes associated with high fracture risk in men and women.

    PubMed

    Edwards, M H; Robinson, D E; Ward, K A; Javaid, M K; Walker-Bone, K; Cooper, C; Dennison, E M

    2016-07-01

    Osteoporosis is a major healthcare problem which is conventionally assessed by dual energy X-ray absorptiometry (DXA). New technologies such as high resolution peripheral quantitative computed tomography (HRpQCT) also predict fracture risk. HRpQCT measures a number of bone characteristics that may inform specific patterns of bone deficits. We used cluster analysis to define different bone phenotypes and their relationships to fracture prevalence and areal bone mineral density (BMD). 177 men and 159 women, in whom fracture history was determined by self-report and vertebral fracture assessment, underwent HRpQCT of the distal radius and femoral neck DXA. Five clusters were derived with two clusters associated with elevated fracture risk. "Cluster 1" contained 26 women (50.0% fractured) and 30 men (50.0% fractured) with a lower mean cortical thickness and cortical volumetric BMD, and in men only, a mean total and trabecular area more than the sex-specific cohort mean. "Cluster 2" contained 20 women (50.0% fractured) and 14 men (35.7% fractured) with a lower mean trabecular density and trabecular number than the sex-specific cohort mean. Logistic regression showed fracture rates in these clusters to be significantly higher than the lowest fracture risk cluster [5] (p<0.05). Mean femoral neck areal BMD was significantly lower than cluster 5 in women in cluster 1 and 2 (p<0.001 for both), and in men, in cluster 2 (p<0.001) but not 1 (p=0.220). In conclusion, this study demonstrates two distinct high risk clusters in both men and women which may differ in etiology and response to treatment. As cluster 1 in men does not have low areal BMD, these men may not be identified as high risk by conventional DXA alone. Copyright © 2016. Published by Elsevier Inc.

  12. Prediction models for clustered data: comparison of a random intercept and standard regression model

    PubMed Central

    2013-01-01

    Background When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Methods Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. Results The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. 
Conclusion The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters. PMID:23414436

  14. Modeling Uncertainties in EEG Microstates: Analysis of Real and Imagined Motor Movements Using Probabilistic Clustering-Driven Training of Probabilistic Neural Networks

    PubMed Central

    Dinov, Martin; Leech, Robert

    2017-01-01

    Part of the process of EEG microstate estimation involves clustering EEG channel data at the global field power (GFP) maxima, very commonly using a modified K-means approach. Clustering has also been done deterministically, despite there being uncertainties in multiple stages of the microstate analysis, including the GFP peak definition, the clustering itself and in the post-clustering assignment of microstates back onto the EEG timecourse of interest. We perform a fully probabilistic microstate clustering and labeling, to account for these sources of uncertainty using the closest probabilistic analog to KM called Fuzzy C-means (FCM). We train softmax multi-layer perceptrons (MLPs) using the KM and FCM-inferred cluster assignments as target labels, to then allow for probabilistic labeling of the full EEG data instead of the usual correlation-based deterministic microstate label assignment typically used. We assess the merits of the probabilistic analysis vs. the deterministic approaches in EEG data recorded while participants perform real or imagined motor movements from a publicly available data set of 109 subjects. Though FCM group template maps that are almost topographically identical to KM were found, there is considerable uncertainty in the subsequent assignment of microstate labels. In general, imagined motor movements are less predictable on a time point-by-time point basis, possibly reflecting the more exploratory nature of the brain state during imagined, compared to during real motor movements. We find that some relationships may be more evident using FCM than using KM and propose that future microstate analysis should preferably be performed probabilistically rather than deterministically, especially in situations such as with brain computer interfaces, where both training and applying models of microstates need to account for uncertainty. 
Probabilistic neural network-driven microstate assignment has a number of advantages that we have discussed, which are likely to be further developed and exploited in future studies. In conclusion, probabilistic clustering and a probabilistic neural network-driven approach to microstate analysis is likely to better model and reveal details and the variability hidden in current deterministic and binarized microstate assignment and analyses. PMID:29163110

  15. Exploring the application of latent class cluster analysis for investigating pedestrian crash injury severities in Switzerland.

    PubMed

    Sasidharan, Lekshmi; Wu, Kun-Feng; Menendez, Monica

    2015-12-01

    One of the major challenges in traffic safety analyses is the heterogeneous nature of safety data, due to the sundry factors involved in it. This heterogeneity often leads to difficulties in interpreting results and conclusions due to unrevealed relationships. Understanding the underlying relationship between injury severities and influential factors is critical for the selection of appropriate safety countermeasures. A method commonly employed to address systematic heterogeneity is to focus on any subgroup of data based on the research purpose. However, this need not ensure homogeneity in the data. In this paper, latent class cluster analysis is applied to identify homogenous subgroups for a specific crash type-pedestrian crashes. The manuscript employs data from police reported pedestrian (2009-2012) crashes in Switzerland. The analyses demonstrate that dividing pedestrian severity data into seven clusters helps in reducing the systematic heterogeneity of the data and to understand the hidden relationships between crash severity levels and socio-demographic, environmental, vehicle, temporal, traffic factors, and main reason for the crash. The pedestrian crash injury severity models were developed for the whole data and individual clusters, and were compared using receiver operating characteristics curve, for which results favored clustering. Overall, the study suggests that latent class clustered regression approach is suitable for reducing heterogeneity and revealing important hidden relationships in traffic safety analyses. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. Methods for sample size determination in cluster randomized trials

    PubMed Central

    Rutterford, Clare; Copas, Andrew; Eldridge, Sandra

    2015-01-01

    Background: The use of cluster randomized trials (CRTs) is increasing, along with the variety in their design and analysis. The simplest approach for their sample size calculation is to calculate the sample size assuming individual randomization and inflate this by a design effect to account for randomization by cluster. The assumptions of a simple design effect may not always be met; alternative or more complicated approaches are required. Methods: We summarise a wide range of sample size methods available for cluster randomized trials. For those familiar with sample size calculations for individually randomized trials but with less experience in the clustered case, this manuscript provides formulae for a wide range of scenarios with associated explanation and recommendations. For those with more experience, comprehensive summaries are provided that allow quick identification of methods for a given design, outcome and analysis method. Results: We present first those methods applicable to the simplest two-arm, parallel group, completely randomized design followed by methods that incorporate deviations from this design such as: variability in cluster sizes; attrition; non-compliance; or the inclusion of baseline covariates or repeated measures. The paper concludes with methods for alternative designs. Conclusions: There is a large amount of methodology available for sample size calculations in CRTs. This paper gives the most comprehensive description of published methodology for sample size calculation and provides an important resource for those designing these trials. PMID:26174515
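
The simplest approach mentioned above, inflating an individually randomized sample size by a design effect, can be sketched in Python as follows. The sketch assumes equal cluster sizes and the standard simple design effect 1 + (m - 1) × ICC, where m is the cluster size and ICC the intra-cluster correlation coefficient; the abstract itself does not state a formula, and the input values are hypothetical.

```python
import math


def design_effect(cluster_size, icc):
    """Simple design effect for equal cluster sizes: 1 + (m - 1) * ICC."""
    return 1.0 + (cluster_size - 1) * icc


def inflate_sample_size(n_individual, cluster_size, icc):
    """Return (participants per arm, clusters per arm) after inflation.

    The number of clusters is rounded up, so the participant count is
    rounded up to a whole number of clusters.
    """
    n_inflated = n_individual * design_effect(cluster_size, icc)
    clusters = math.ceil(n_inflated / cluster_size)
    return clusters * cluster_size, clusters


# Hypothetical inputs: 200 participants per arm under individual
# randomization, clusters of 20, ICC of 0.05.
deff = design_effect(20, 0.05)
participants, clusters = inflate_sample_size(200, 20, 0.05)
print(deff, participants, clusters)
```

As the abstract notes, the assumptions behind this simple design effect may not hold, for example when cluster sizes vary or attrition occurs, in which case the more elaborate methods the paper reviews are needed.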

  17. The Impact of Clinical, Demographic and Risk Factors on Rates of HIV Transmission: A Population-based Phylogenetic Analysis in British Columbia, Canada

    PubMed Central

    Poon, Art F. Y.; Joy, Jeffrey B.; Woods, Conan K.; Shurgold, Susan; Colley, Guillaume; Brumme, Chanson J.; Hogg, Robert S.; Montaner, Julio S. G.; Harrigan, P. Richard

    2015-01-01

    Background. The diversification of human immunodeficiency virus (HIV) is shaped by its transmission history. We therefore used a population based province wide HIV drug resistance database in British Columbia (BC), Canada, to evaluate the impact of clinical, demographic, and behavioral factors on rates of HIV transmission. Methods. We reconstructed molecular phylogenies from 27 296 anonymized bulk HIV pol sequences representing 7747 individuals in BC—about half the estimated HIV prevalence in BC. Infections were grouped into clusters based on phylogenetic distances, as a proxy for variation in transmission rates. Rates of cluster expansion were reconstructed from estimated dates of HIV seroconversion. Results. Our criteria grouped 4431 individuals into 744 clusters largely separated with respect to risk factors, including large established clusters predominated by injection drug users and more-recently emerging clusters comprising men who have sex with men. The mean log10 viral load of an individual's phylogenetic neighborhood (composed of 5 other individuals with shortest phylogenetic distances) increased their odds of appearing in a cluster by >2-fold per log10 viruses per milliliter. Conclusions. Hotspots of ongoing HIV transmission can be characterized in near real time by the secondary analysis of HIV resistance genotypes, providing an important potential resource for targeting public health initiatives for HIV prevention. PMID:25312037

  18. Racial/Ethnic Differences Moderate Associations of Coping Strategies and Posttraumatic Stress Disorder Symptom Clusters among Women Experiencing Partner Violence: A Multigroup Path Analysis

    PubMed Central

    Weiss, Nicole H.; Johnson, Clinesha D.; Contractor, Ateka; Peasant, Courtney; Swan, Suzanne C.; Sullivan, Tami P.

    2017-01-01

    Background Past research underscores the key role of coping strategies in the development, maintenance, and exacerbation of posttraumatic stress disorder (PTSD) symptoms. The goal of the current study was to extend existing literature by examining whether race/ethnicity moderates the relations among coping strategies (social support, problem-solving, avoidance) and PTSD symptom clusters (intrusion, avoidance, numbing, arousal). Methods Participants were 369 community women (134 African Americans, 131 Latinas, 104 Whites) who reported bidirectional aggression with a current male partner. Multigroup path analysis was utilized to test the moderating role of race/ethnicity in a model linking coping strategies to PTSD symptom clusters. Results The strength and direction of relations among coping strategies and PTSD symptom clusters varied as a function of race/ethnicity. Greater social support coping was related to more arousal symptoms for Latinas and Whites. Greater problem-solving coping was related to fewer arousal symptoms for Latinas. Greater avoidance coping was related to more symptoms across many of the PTSD clusters for African Americans, Latinas, and Whites, however, these relations were strongest for African Americans. Conclusion Results provide support for the moderating role of race/ethnicity in the relations among coping strategies and PTSD symptom clusters, and highlight potential targets for culturally-informed PTSD treatments. PMID:27575609

  19. Conversion events in gene clusters

    PubMed Central

    2011-01-01

    Background Gene clusters containing multiple similar genomic regions in close proximity are of great interest for biomedical studies because of their associations with inherited diseases. However, such regions are difficult to analyze due to their structural complexity and their complicated evolutionary histories, reflecting a variety of large-scale mutational events. In particular, conversion events can mislead inferences about the relationships among these regions, as traced by traditional methods such as construction of phylogenetic trees or multi-species alignments. Results To correct the distorted information generated by such methods, we have developed an automated pipeline called CHAP (Cluster History Analysis Package) for detecting conversion events. We used this pipeline to analyze the conversion events that affected two well-studied gene clusters (α-globin and β-globin) and three gene clusters for which comparative sequence data were generated from seven primate species: CCL (chemokine ligand), IFN (interferon), and CYP2abf (part of cytochrome P450 family 2). CHAP is freely available at http://www.bx.psu.edu/miller_lab. Conclusions These studies reveal the value of characterizing conversion events in the context of studying gene clusters in complex genomes. PMID:21798034

  20. Computational cluster validation for microarray data analysis: experimental assessment of Clest, Consensus Clustering, Figure of Merit, Gap Statistics and Model Explorer.

    PubMed

    Giancarlo, Raffaele; Scaturro, Davide; Utro, Filippo

    2008-10-29

    Inferring cluster structure in microarray datasets is a fundamental task for the so-called -omic sciences. It is also a fundamental question in Statistics, Data Analysis and Classification, in particular with regard to the prediction of the number of clusters in a dataset, usually established via internal validation measures. Despite the wealth of internal measures available in the literature, new ones have been recently proposed, some of them specifically for microarray data. We consider five such measures: Clest, Consensus (Consensus Clustering), FOM (Figure of Merit), Gap (Gap Statistics) and ME (Model Explorer), in addition to the classic WCSS (Within Cluster Sum-of-Squares) and KL (Krzanowski and Lai index). We perform extensive experiments on six benchmark microarray datasets, using both Hierarchical and K-means clustering algorithms, and we provide an analysis assessing both the intrinsic ability of a measure to predict the correct number of clusters in a dataset and its merit relative to the other measures. We pay particular attention both to precision and speed. Moreover, we also provide various fast approximation algorithms for the computation of Gap, FOM and WCSS. The main result is a hierarchy of those measures in terms of precision and speed, highlighting some of their merits and limitations not reported before in the literature. Based on our analysis, we draw several conclusions for the use of those internal measures on microarray data. We report the main ones. Consensus is by far the best performer in terms of predictive power and remarkably algorithm-independent. Unfortunately, on large datasets, it may be of no use because of its non-trivial computer time demand (weeks on a state of the art PC). FOM is the second best performer although, quite surprisingly, it may not be competitive in this scenario: it has essentially the same predictive power of WCSS but it is from 6 to 100 times slower in time, depending on the dataset. 
The approximation algorithms for the computation of FOM, Gap and WCSS perform very well, i.e., they are faster while still granting a very close approximation of FOM and WCSS. The approximation algorithm for the computation of Gap deserves to be singled out since it has a predictive power far better than Gap, it is competitive with the other measures, but it is at least two orders of magnitude faster in time with respect to Gap. Another important novel conclusion that can be drawn from our analysis is that all the measures we have considered show severe limitations on large datasets, either due to computational demand (Consensus, as already mentioned, Clest and Gap) or to lack of precision (all of the other measures, including their approximations). The software and datasets are available under the GNU GPL on the supplementary material web page.
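
For readers unfamiliar with WCSS, the classic baseline measure named above, a minimal Python sketch follows. It assumes the standard definition (the sum of squared Euclidean distances from each point to the centroid of its cluster); the abstract does not spell out a formula, and the toy points and labels are made up. It is not the authors' software and does not implement their approximation algorithms.

```python
def wcss(points, labels):
    """Within-cluster sum of squares for a given partition.

    Sums, over all points, the squared Euclidean distance to the centroid
    of the cluster the point is assigned to.
    """
    groups = {}
    for point, label in zip(points, labels):
        groups.setdefault(label, []).append(point)

    total = 0.0
    for members in groups.values():
        dim = len(members[0])
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        total += sum((p[d] - centroid[d]) ** 2 for p in members for d in range(dim))
    return total


# Toy data (hypothetical): two tight pairs of points, far apart.
toy_points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
one_cluster = wcss(toy_points, [0, 0, 0, 0])
two_clusters = wcss(toy_points, [0, 0, 1, 1])
print(one_cluster, two_clusters)
```

Because WCSS can only decrease as clusters are added, it is typically read by looking for the number of clusters after which the decrease levels off rather than by taking its minimum.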

  1. Representation of Tinnitus in the US Newspaper Media and in Facebook Pages: Cross-Sectional Analysis of Secondary Data

    PubMed Central

    Ratinaud, Pierre; Andersson, Gerhard

    2018-01-01

    Background When people with health conditions begin to manage their health issues, one important issue that emerges is the question as to what exactly do they do with the information that they have obtained through various sources (eg, news media, social media, health professionals, friends, and family). The information they gather helps form their opinions and, to some degree, influences their attitudes toward managing their condition. Objective This study aimed to understand how tinnitus is represented in the US newspaper media and in Facebook pages (ie, social media) using text pattern analysis. Methods This was a cross-sectional study based upon secondary analyses of publicly available data. The 2 datasets (ie, text corpuses) analyzed in this study were generated from US newspaper media during 1980-2017 (downloaded from the database US Major Dailies by ProQuest) and Facebook pages during 2010-2016. The text corpuses were analyzed using the Iramuteq software using cluster analysis and chi-square tests. Results The newspaper dataset had 432 articles. The cluster analysis resulted in 5 clusters, which were named as follows: (1) brain stimulation (26.2%), (2) symptoms (13.5%), (3) coping (19.8%), (4) social support (24.2%), and (5) treatment innovation (16.4%). A time series analysis of clusters indicated a change in the pattern of information presented in newspaper media during 1980-2017 (eg, more emphasis on cluster 5, focusing on treatment inventions). The Facebook dataset had 1569 texts. The cluster analysis resulted in 7 clusters, which were named as: (1) diagnosis (21.9%), (2) cause (4.1%), (3) research and development (13.6%), (4) social support (18.8%), (5) challenges (11.1%), (6) symptoms (21.4%), and (7) coping (9.2%). A time series analysis of clusters indicated no change in information presented in Facebook pages on tinnitus during 2011-2016. 
Conclusions The study highlights the specific aspects about tinnitus that the US newspaper media and Facebook pages focus on, as well as how these aspects change over time. These findings can help health care providers better understand the presuppositions that tinnitus patients may have. More importantly, the findings can help public health experts and health communication experts in tailoring health information about tinnitus to promote self-management, as well as assisting in appropriate choices of treatment for those living with tinnitus. PMID:29739734

  2. A spatial cluster analysis of tractor overturns in Kentucky from 1960 to 2002

    USGS Publications Warehouse

    Saman, D.M.; Cole, H.P.; Odoi, A.; Myers, M.L.; Carey, D.I.; Westneat, S.C.

    2012-01-01

    Background: Agricultural tractor overturns without rollover protective structures are the leading cause of farm fatalities in the United States. To our knowledge, no studies have incorporated the spatial scan statistic in identifying high-risk areas for tractor overturns. The aim of this study was to determine whether tractor overturns cluster in certain parts of Kentucky and identify factors associated with tractor overturns. Methods: A spatial statistical analysis using Kulldorff's spatial scan statistic was performed to identify county clusters at greatest risk for tractor overturns. A regression analysis was then performed to identify factors associated with tractor overturns. Results: The spatial analysis revealed a cluster of higher than expected tractor overturns in four counties in northern Kentucky (RR = 2.55) and 10 counties in eastern Kentucky (RR = 1.97). Higher rates of tractor overturns were associated with steeper average percent slope of pasture land by county (p = 0.0002) and a greater percent of total tractors with less than 40 horsepower by county (p<0.0001). Conclusions: This study reveals that geographic hotspots of tractor overturns exist in Kentucky and identifies factors associated with overturns. This study provides policymakers a guide to targeted county-level interventions (e.g., roll-over protective structures promotion interventions) with the intention of reducing tractor overturns in the highest risk counties in Kentucky. © 2012 Saman et al.

  3. Dietary patterns, insulin sensitivity and inflammation in older adults

    PubMed Central

    Anderson, Amy L.; Harris, Tamara B.; Tylavsky, Frances A.; Perry, Sara E.; Houston, Denise K.; Lee, Jung Sun; Kanaya, Alka M.; Sahyoun, Nadine R.

    2011-01-01

    Background/Objectives Several studies have linked dietary patterns to insulin sensitivity and systemic inflammation, which affect risk of multiple chronic diseases. The purpose of this study was to investigate the dietary patterns of a cohort of older adults, and examine relationships of dietary patterns with markers of insulin sensitivity and systemic inflammation. Subjects/Methods The Health, Aging and Body Composition (Health ABC) Study is a prospective cohort study of 3075 older adults. In Health ABC, multiple indicators of glucose metabolism and systemic inflammation were assessed. Food intake was estimated with a modified Block food frequency questionnaire (FFQ). In this study, dietary patterns of 1751 participants with complete data were derived by cluster analysis. Results Six clusters were identified, including a ‘Healthy foods’ cluster, characterized by higher intake of lowfat dairy products, fruit, whole grains, poultry, fish and vegetables. In the main analysis, the ‘Healthy foods’ cluster had significantly lower fasting insulin and HOMA-IR than the ‘Breakfast cereal’ and ‘High-fat dairy products’ clusters, and lower fasting glucose than the ‘High-fat dairy products’ cluster (P ≤ 0.05). No differences were found in 2-hour glucose. With respect to inflammation, the ‘Healthy foods’ cluster had lower IL-6 than the ‘Sweets and desserts’ and ‘High-fat dairy products’ clusters, and no differences were seen in CRP or TNF-α. Conclusions A dietary pattern high in lowfat dairy products, fruit, whole grains, poultry, fish and vegetables may be associated with greater insulin sensitivity and lower systemic inflammation in older adults. PMID:21915138

  4. Scholarly Research Program in Fuel Analysis and Combustion Research

    DTIC Science & Technology

    1993-02-01

    Public reporting burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing ...Thermal Oxidative Flask Test 45 9. Advanced Fuel System Configuration Descent Condition 57 10. TGPGC for n-Alkane Mixture 63 11. Hierarchical Cluster ...will include all analytical data, data analysis conclusions, recommendations and rationale. 16 Task: 05 Title: Development of Test Cell Assemblies for

  5. Worldwide Topology of the Scientific Subject Profile: A Macro Approach in the Country Level

    PubMed Central

    Moya-Anegón, Félix; Herrero-Solana, Víctor

    2013-01-01

    Background Models for the production of knowledge and systems of innovation and science are key elements for characterizing a country in view of its scientific thematic profile. With regard to scientific output and publication in journals of international visibility, the countries of the world may be classified into three main groups according to their thematic bias. Methodology/Principal Findings This paper aims to classify the countries of the world in several broad groups, described in terms of behavioural models that attempt to sum up the characteristics of their systems of knowledge and innovation. We perceive three clusters in our analysis: 1) the biomedical cluster, 2) the basic science & engineering cluster, and 3) the agricultural cluster. The countries are conceptually associated with the clusters via Principal Component Analysis (PCA), and a Multidimensional Scaling (MDS) map with all the countries is presented. Conclusions/Significance As we have seen, insofar as scientific output and publication in journals of international visibility is concerned, the countries of the world may be classified into three main groups according to their thematic profile. These groups can be described in terms of behavioral models that attempt to sum up the characteristics of their systems of knowledge and innovation. PMID:24349467

  6. BiCluE - Exact and heuristic algorithms for weighted bi-cluster editing of biomedical data

    PubMed Central

    2013-01-01

    Background The explosion of biological data has dramatically reformed today's biology research. The biggest challenge to biologists and bioinformaticians is the integration and analysis of large quantities of data to provide meaningful insights. One major problem is the combined analysis of data of different types. Bi-cluster editing, as a special case of clustering, which partitions two different types of data simultaneously, might be used for several biomedical scenarios. However, the underlying algorithmic problem is NP-hard. Results Here we contribute BiCluE, a software package designed to solve the weighted bi-cluster editing problem. It implements (1) an exact algorithm based on fixed-parameter tractability and (2) a polynomial-time greedy heuristic based on solving the hardest part, edge deletions, first. We evaluated its performance on artificial graphs. Afterwards, as an example, we applied our implementation to real-world biomedical data, GWAS data in this case. BiCluE generally works on any kind of data that can be modeled as (weighted or unweighted) bipartite graphs. Conclusions To our knowledge, this is the first software package solving the weighted bi-cluster editing problem. BiCluE as well as the supplementary results are available online at http://biclue.mpi-inf.mpg.de. PMID:24565035

  7. Identifying clusters of falls-related hospital admissions to inform population targets for prioritising falls prevention programmes

    PubMed Central

    Finch, Caroline F; Stephan, Karen; Shee, Anna Wong; Hill, Keith; Haines, Terry P; Clemson, Lindy; Day, Lesley

    2015-01-01

    Background There has been limited research investigating the relationship between injurious falls and hospital resource use. The aims of this study were to identify clusters of community-dwelling older people in the general population who are at increased risk of being admitted to hospital following a fall and how those clusters differed in their use of hospital resources. Methods Analysis of routinely collected hospital admissions data relating to 45 374 fall-related admissions in Victorian community-dwelling older adults aged ≥65 years that occurred during 2008/2009 to 2010/2011. Fall-related admission episodes were identified based on being admitted from a private residence to hospital with a principal diagnosis of injury (International Classification of Diseases (ICD)-10-AM codes S00 to T75) and having a first external cause of a fall (ICD-10-AM codes W00 to W19). A cluster analysis was performed to identify homogeneous groups using demographic details of patients and information on the presence of comorbidities. Hospital length of stay (LOS) was compared across clusters using competing risks regression. Results Clusters based on area of residence, demographic factors (age, gender, marital status, country of birth) and the presence of comorbidities were identified. Clusters representing hospitalised fallers with comorbidities were associated with longer LOS compared with other cluster groups. Clusters delineated by demographic factors were also associated with increased LOS. Conclusions All patients with comorbidity, and older women without comorbidities, stay in hospital longer following a fall and hence consume a disproportionate share of hospital resources. These findings have important implications for the targeting of falls prevention interventions for community-dwelling older people. PMID:25618735

  8. Exploring syndrome differentiation using non-negative matrix factorization and cluster analysis in patients with atopic dermatitis.

    PubMed

    Yun, Younghee; Jung, Wonmo; Kim, Hyunho; Jang, Bo-Hyoung; Kim, Min-Hee; Noh, Jiseong; Ko, Seong-Gyu; Choi, Inhwa

    2017-08-01

    Syndrome differentiation (SD) results in a diagnostic conclusion based on a cluster of concurrent symptoms and signs, including pulse form and tongue color. In Korea, there is a strong interest in the standardization of Traditional Medicine (TM). In order to standardize TM treatment, standardization of SD should be given priority. The aim of this study was to explore the SD, or symptom clusters, of patients with atopic dermatitis (AD) using non-negative factorization methods and k-means clustering analysis. We screened 80 patients and enrolled 73 eligible patients. One TM dermatologist evaluated the symptoms/signs using an existing clinical dataset from patients with AD. This dataset was designed to collect 15 dermatologic and 18 systemic symptoms/signs associated with AD. Non-negative matrix factorization was used to decompose the original data into a matrix with three features and a weight matrix. The point of intersection of the three coordinates from each patient was placed in three-dimensional space. With five clusters, the silhouette score reached 0.484, and this was the best silhouette score obtained from two to nine clusters. Patients were clustered according to the varying severity of concurrent symptoms/signs. Through the distribution of the null hypothesis generated by 10,000 permutation tests, we found significant cluster-specific symptoms/signs from the confidence intervals in the upper and lower 2.5% of the distribution. Patients in each cluster showed differences in symptoms/signs and severity. In a clinical situation, SD and treatment are based on the practitioners' observations and clinical experience. SD, identified through informatics, can contribute to development of standardized, objective, and consistent SD for each disease. Copyright © 2017. Published by Elsevier Ltd.
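
    The abstract above picks the number of clusters by comparing silhouette scores across candidate solutions (two to nine clusters). The following is a minimal sketch of that selection step, not the authors' code. It assumes Euclidean distance and that a label list is already available for each candidate k; the names silhouette_score and best_labeling and the toy data are hypothetical.

```python
import math


def silhouette_score(points, labels):
    """Mean silhouette over all points, using Euclidean distance."""
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    total = 0.0
    for i, lab in enumerate(labels):
        own = [j for j in clusters[lab] if j != i]
        if not own:
            continue  # a singleton cluster contributes 0 by convention
        # a: mean distance to the other members of the point's own cluster
        a = sum(math.dist(points[i], points[j]) for j in own) / len(own)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


def best_labeling(points, candidates):
    """candidates maps k -> label list; return (k, score) with the highest score."""
    scores = {k: silhouette_score(points, labs) for k, labs in candidates.items()}
    k = max(scores, key=scores.get)
    return k, scores[k]


# Toy data (hypothetical): two well-separated groups on a line.
toy_points = [(0.0,), (1.0,), (10.0,), (11.0,)]
toy_candidates = {2: [0, 0, 1, 1], 3: [0, 0, 1, 2]}
print(best_labeling(toy_points, toy_candidates))
```

    In practice the label lists would come from running k-means once per candidate k on the factorized data; that step is omitted here to keep the sketch short.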

  9. A Cluster Analytic Approach to Identifying Predictors and Moderators of Psychosocial Treatment for Bipolar Depression: Results from STEP-BD

    PubMed Central

    Deckersbach, Thilo; Peters, Amy T.; Sylvia, Louisa G.; Gold, Alexandra K.; da Silva Magalhaes, Pedro Vieira; Henry, David B.; Frank, Ellen; Otto, Michael W.; Berk, Michael; Dougherty, Darin D.; Nierenberg, Andrew A.; Miklowitz, David J.

    2016-01-01

    Background We sought to address how predictors and moderators of psychotherapy for bipolar depression – identified individually in prior analyses – can inform the development of a metric for prospectively classifying treatment outcome in intensive psychotherapy (IP) versus collaborative care (CC) adjunctive to pharmacotherapy in the Systematic Treatment Enhancement Program (STEP-BD) study. Methods We conducted post-hoc analyses on 135 STEP-BD participants using cluster analysis to identify subsets of participants with similar clinical profiles and investigated this combined metric as a moderator and predictor of response to IP. We used agglomerative hierarchical cluster analyses and k-means clustering to determine the content of the clinical profiles. Logistic regression and Cox proportional hazard models were used to evaluate whether the resulting clusters predicted or moderated likelihood of recovery or time until recovery. Results The cluster analysis yielded a two-cluster solution: 1) “less-recurrent/severe” and 2) “chronic/recurrent.” Rates of recovery in IP were similar for less-recurrent/severe and chronic/recurrent participants. Less-recurrent/severe patients were more likely than chronic/recurrent patients to achieve recovery in CC (p = .040, OR = 4.56). IP yielded a faster recovery for chronic/recurrent participants, whereas CC led to recovery sooner in the less-recurrent/severe cluster (p = .034, OR = 2.62). Limitations Cluster analyses require list-wise deletion of cases with missing data so we were unable to conduct analyses on all STEP-BD participants. Conclusions A well-powered, parametric approach can distinguish patients based on illness history and provide clinicians with symptom profiles of patients that confer differential prognosis in CC vs. IP. PMID:27289316

  10. Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.

    2004-08-06

    Background The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. Results We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Conclusions Measuring conservation of sequence features closely linked to function - such as binding-site clustering - makes better use of comparative sequence data than commonly used methods that examine only sequence identity.

  11. Spatial distribution and cluster analysis of risky sexual behaviours and STDs reported by Chinese adults in Guangzhou, China: a representative population-based study

    PubMed Central

    Chen, Wen; Zhou, Fangjing; Hall, Brian J; Wang, Yu; Latkin, Carl; Ling, Li; Tucker, Joseph D

    2016-01-01

    Objectives To assess associations between residence location, risky sexual behaviours and sexually transmitted diseases (STDs) among adults living in Guangzhou, China. Methods Data were obtained from 751 Chinese adults aged 18–59 years in Guangzhou, China, using stratified random sampling, and were analysed using spatial epidemiological methods. Face-to-face household interviews were conducted to collect self-report data on risky sexual behaviours and diagnosed STDs. Kulldorff’s spatial scan statistic was used to detect the spatial distribution and clusters of risky sexual behaviours and STDs. The presence and location of statistically significant clusters were mapped in the study areas using ArcGIS software. Results The prevalence of self-reported risky sexual behaviours was between 5.1% and 50.0%. The self-reported lifetime prevalence of diagnosed STDs was 7.06%. Anal intercourse clustered in an area located along the border within the rural–urban continuum (p=0.001). High-rate clusters for alcohol or other drug use before sex (p=0.008) and for migrants who had lived in Guangzhou <1 year (p=0.007) overlapped this cluster. Excess cases of unprotected sex (p=0.031) overlapped the cluster for college students (p<0.001). Five of nine (55.6%) students who had sexual experience during the last 12 months were located in the cluster of unprotected sex. Conclusions Short-term migrants and college students reported more risky sexual behaviours. Programmes to increase safer sex within these communities to reduce the risk of STDs are warranted in Guangzhou. Spatial analysis identified geographical clusters of risky sexual behaviours, which is critical for optimising surveillance and targeting control measures for these locations in the future. PMID:26843400
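
    The spatial scan statistic named above ranks candidate zones by a likelihood ratio. The sketch below shows that ranking step under the Poisson model and for high-rate clusters only. The abstract does not say which probability model the study used, so the model choice is an assumption, as are the function names and the toy zones. Expected counts are assumed to be scaled so that they sum to the total number of cases; the Monte Carlo step that yields p-values is not shown.

```python
import math


def poisson_scan_llr(cases_in, expected_in, total_cases):
    """Log-likelihood ratio of one candidate zone (Poisson model, high rates only).

    cases_in:     observed cases inside the zone
    expected_in:  expected cases inside the zone under the null hypothesis
    total_cases:  observed cases in the whole study area
    """
    c, e, big_c = cases_in, expected_in, total_cases
    if c <= e:
        return 0.0  # no excess inside the zone, so it is not a high-rate candidate
    llr = c * math.log(c / e)
    if big_c > c:
        llr += (big_c - c) * math.log((big_c - c) / (big_c - e))
    return llr


def most_likely_cluster(zones, total_cases):
    """zones is a list of (name, cases_in, expected_in); return the top-scoring zone."""
    scored = [(poisson_scan_llr(c, e, total_cases), name) for name, c, e in zones]
    llr, name = max(scored)
    return name, llr


# Hypothetical candidate zones for illustration only.
example_zones = [("zone A", 20, 10.0), ("zone B", 12, 11.0), ("zone C", 5, 8.0)]
print(most_likely_cluster(example_zones, total_cases=100))
```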

  12. Identification of Urban Leprosy Clusters

    PubMed Central

    Paschoal, José Antonio Armani; Paschoal, Vania Del'Arco; Nardi, Susilene Maria Tonelli; Rosa, Patrícia Sammarco; Ismael, Manuela Gallo y Sanches; Sichieri, Eduvaldo Paulo

    2013-01-01

    Overpopulation of urban areas results from constant migrations that cause disordered urban growth, constituting clusters defined as sets of people or activities concentrated in relatively small physical spaces that often involve precarious conditions. Aim. Using residential grouping, the aim was to identify possible clusters of individuals in São José do Rio Preto, Sao Paulo, Brazil, who have or have had leprosy. Methods. A population-based, descriptive, ecological study using the MapInfo and CrimeStat techniques, geoprocessing, and space-time analysis evaluated the location of 425 people treated for leprosy between 1998 and 2010. Clusters were defined as concentrations of at least 8 people with leprosy; a distance of up to 300 meters between residences was adopted. Additionally, the year of starting treatment and the clinical forms of the disease were analyzed. Results. Ninety-eight (23.1%) of 425 geocoded cases were located within one of ten clusters identified in this study, and 129 cases (30.3%) were in the region of a second-order cluster, an area considered of high risk for the disease. Conclusion. This study identified ten clusters of leprosy cases in the city and identified an area of high risk for the appearance of new cases of the disease. PMID:24288467
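
    The cluster definition above (at least 8 people, with up to 300 meters between residences) can be illustrated with a simple distance-threshold grouping. This is a sketch only: the study used MapInfo and CrimeStat, and their exact procedure is not described in the abstract. The code assumes planar coordinates in meters and links residences transitively (single linkage); the function name threshold_clusters and the toy coordinates are hypothetical.

```python
import math


def threshold_clusters(coords, max_dist=300.0, min_size=8):
    """Group points linked by chains of neighbours no more than max_dist apart.

    Returns a list of clusters (lists of point indices) with at least min_size members.
    """
    parent = list(range(len(coords)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Link every pair of residences that lies within the distance threshold.
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            if math.dist(coords[i], coords[j]) <= max_dist:
                parent[find(i)] = find(j)

    groups = {}
    for i in range(len(coords)):
        groups.setdefault(find(i), []).append(i)
    return [members for members in groups.values() if len(members) >= min_size]


# Toy data (hypothetical): eight residences 100 m apart, plus three distant ones.
homes = [(i * 100.0, 0.0) for i in range(8)]
homes += [(10000.0 + i * 100.0, 0.0) for i in range(3)]
print(threshold_clusters(homes))
```

    The pairwise loop is quadratic in the number of cases, which is acceptable for a few hundred geocoded residences; a spatial index would be the usual choice for larger datasets.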

  13. Hemodynamic Response to Interictal Epileptiform Discharges Addressed by Personalized EEG-fNIRS Recordings

    PubMed Central

    Pellegrino, Giovanni; Machado, Alexis; von Ellenrieder, Nicolas; Watanabe, Satsuki; Hall, Jeffery A.; Lina, Jean-Marc; Kobayashi, Eliane; Grova, Christophe

    2016-01-01

    Objective: We aimed to study the hemodynamic response (HR) to Interictal Epileptic Discharges (IEDs) using patient-specific and prolonged simultaneous ElectroEncephaloGraphy (EEG) and functional Near InfraRed Spectroscopy (fNIRS) recordings. Methods: The epileptic generator was localized using Magnetoencephalography source imaging. fNIRS montage was tailored for each patient, using an algorithm to optimize the sensitivity to the epileptic generator. Optodes were glued using collodion to achieve prolonged acquisition with high quality signal. fNIRS data analysis was handled with no a priori constraint on HR time course, averaging fNIRS signals to similar IEDs. Cluster-permutation analysis was performed on 3D reconstructed fNIRS data to identify significant spatio-temporal HR clusters. Standard (GLM with fixed HRF) and cluster-permutation EEG-fMRI analyses were performed for comparison purposes. Results: fNIRS detected HR to IEDs for 8/9 patients. It mainly consisted of oxy-hemoglobin increases (seven patients), followed by oxy-hemoglobin decreases (six patients). HR was lateralized in six patients and lasted from 8.5 to 30 s. Standard EEG-fMRI analysis detected an HR in 4/9 patients (4/9 without enough IEDs, 1/9 unreliable result). The cluster-permutation EEG-fMRI analysis restricted to the region investigated by fNIRS showed additional strong and non-canonical BOLD responses starting earlier than the IEDs and lasting up to 30 s. Conclusions: (i) EEG-fNIRS is suitable to detect the HR to IEDs and can outperform EEG-fMRI because of prolonged recordings and greater chance to detect IEDs; (ii) cluster-permutation analysis unveils additional HR features underestimated when imposing a canonical HR function; (iii) the HR is often bilateral and lasts up to 30 s. PMID:27047325

  14. Do Sexually Oriented Massage Parlors Cluster in Specific Neighborhoods? A Spatial Analysis of Indoor Sex Work in Los Angeles and Orange Counties, California

    PubMed Central

    Kim, Anna J.; Takahashi, Lois; Wiebe, Douglas J.

    2015-01-01

    Objective Social determinants of health may be substantially affected by spatial factors, which together may explain the persistence of health inequities. Clustering of possible sources of negative health and social outcomes points to a spatial focus for future interventions. We analyzed the spatial clustering of sex work businesses in Southern California to examine where and why they cluster. We explored economic and legal factors as possible explanations of clustering. Methods We manually coded data from a website used by paying members to post reviews of female massage parlor workers. We identified clusters of sexually oriented massage parlor businesses using spatial autocorrelation tests. We conducted spatial regression using census tract data to identify predictors of clustering. Results A total of 889 venues were identified. Clusters of tracts having higher-than-expected numbers of sexually oriented massage parlors (“hot spots”) were located outside downtowns. These hot spots were characterized by a higher proportion of adult males, a higher proportion of households below the federal poverty level, and a smaller average household size. Conclusion Sexually oriented massage parlors in Los Angeles and Orange counties cluster in particular neighborhoods. More research is needed to ascertain the causal factors of such clusters and how interventions can be designed to leverage these spatial factors. PMID:26327731

  15. Application of Factor Analysis on the Financial Ratios of Indian Cement Industry and Validation of the Results by Cluster Analysis

    NASA Astrophysics Data System (ADS)

    De, Anupam; Bandyopadhyay, Gautam; Chakraborty, B. N.

    2010-10-01

    Financial ratio analysis is an important and commonly used tool for analyzing the financial health of a firm. A large number of financial ratios, which can be categorized into different groups, are used for this analysis. However, to reduce the number of ratios used for financial analysis and to regroup them on the basis of empirical evidence, researchers have applied Factor Analysis successfully over the last three decades. In this study, Factor Analysis was applied to audited financial data of Indian cement companies for a period of 10 years. The sample companies are listed on Indian stock exchanges (BSE and NSE). Factor Analysis, conducted over 44 variables (financial ratios) grouped in 7 categories, resulted in 11 underlying categories (factors). Each factor is named according to its factor loadings and constituent variables (ratios). Representative ratios are identified for each factor. To validate the results of the Factor Analysis and to reach a final conclusion regarding the representative ratios, Cluster Analysis was performed.

  16. Cholera Epidemic in Guinea-Bissau (2008): The Importance of “Place”

    PubMed Central

    Luquero, Francisco J.; Banga, Cunhate Na; Remartínez, Daniel; Palma, Pedro Pablo; Baron, Emanuel; Grais, Rebeca F.

    2011-01-01

    Background As resources are limited when responding to cholera outbreaks, knowledge about where to orient interventions is crucial. We describe the cholera epidemic affecting Guinea-Bissau in 2008 focusing on the geographical spread in order to guide prevention and control activities. Methodology/Principal Findings We conducted two studies: 1) a descriptive analysis of the cholera epidemic in Guinea-Bissau focusing on its geographical spread (country level and within the capital); and 2) a cross-sectional study to measure the prevalence of houses with at least one cholera case in the most affected neighbourhood of the capital (Bairro Bandim) to detect clustering of households with cases (cluster analysis). All cholera cases attending the cholera treatment centres in Guinea-Bissau who fulfilled a modified World Health Organization clinical case definition during the epidemic were included in the descriptive study. For the cluster analysis, a sample of houses was selected from a satellite photo (Google Earth™); 140 houses (and the four closest houses) were assessed from the 2,202 identified structures. We applied K-functions and Kernel smoothing to detect clustering. We confirmed the clustering using Kulldorff's spatial scan statistic. A total of 14,222 cases and 225 deaths were reported in the country (AR = 0.94%, CFR = 1.64%). The most affected regions were Biombo, Bijagos and Bissau (the capital). Bairro Bandim was the most affected neighborhood of the capital (AR = 4.0). We found at least one case in 22.7% of the houses (95%CI: 19.5–26.2) in this neighborhood. The cluster analysis identified two areas within Bairro Bandim at highest risk: a market and an intersection where runoff accumulates waste (p<0.001). Conclusions/Significance Our analysis allowed for the identification of the most affected regions in Guinea-Bissau during the 2008 cholera outbreak, and the most affected areas within the capital. This information was essential for making decisions on where to reinforce treatment and to guide control and prevention activities. PMID:21572530

  17. Regional health care planning: a methodology to cluster facilities using community utilization patterns

    PubMed Central

    2013-01-01

    Background Community-based health care planning and regulation necessitates grouping facilities and areal units into regions of similar health care use. Limited research has explored the methodologies used in creating these regions. We offer a new methodology that clusters facilities based on similarities in patient utilization patterns and geographic location. Our case study focused on Hospital Groups in Michigan, the allocation units used for predicting future inpatient hospital bed demand in the state’s Bed Need Methodology. The scientific, practical, and political concerns that were considered throughout the formulation and development of the methodology are detailed. Methods The clustering methodology employs a 2-step K-means + Ward’s clustering algorithm to group hospitals. The final number of clusters is selected using a heuristic that integrates both a statistical-based measure of cluster fit and characteristics of the resulting Hospital Groups. Results Using recent hospital utilization data, the clustering methodology identified 33 Hospital Groups in Michigan. Conclusions Despite being developed within the politically charged climate of Certificate of Need regulation, we have provided an objective, replicable, and sustainable methodology to create Hospital Groups. Because the methodology is built upon theoretically sound principles of clustering analysis and health care service utilization, it is highly transferable across applications and suitable for grouping facilities or areal units. PMID:23964905

  18. On the Accuracy and Parallelism of GPGPU-Powered Incremental Clustering Algorithms

    PubMed Central

    He, Li; Zheng, Hao; Wang, Lei

    2017-01-01

    Incremental clustering algorithms play a vital role in various applications such as massive data analysis and real-time data processing. Typical application scenarios of incremental clustering place high demands on the computing power of the hardware platform. Parallel computing is a common solution to meet this demand. Moreover, the General Purpose Graphic Processing Unit (GPGPU) is a promising parallel computing device. Nevertheless, incremental clustering algorithms face a dilemma between clustering accuracy and parallelism when they are powered by GPGPUs. We formally analyzed the cause of this dilemma. First, we formalized concepts relevant to incremental clustering, such as evolving granularity. Second, we formally proved two theorems. The first theorem proves the relation between clustering accuracy and evolving granularity. Additionally, this theorem analyzes the upper and lower bounds of different-to-same mis-affiliation. Fewer occurrences of such mis-affiliation mean higher accuracy. The second theorem reveals the relation between parallelism and evolving granularity. Smaller work-depth means superior parallelism. Through the proofs, we conclude that the accuracy of an incremental clustering algorithm is negatively related to evolving granularity while parallelism is positively related to the granularity. Thus the contradictory relations cause the dilemma. Finally, we validated the relations through a demo algorithm. Experiment results verified theoretical conclusions. PMID:29123546

  19. International linkage of two food-borne hepatitis A clusters through traceback of mussels, the Netherlands, 2012.

    PubMed

    Boxman, Ingeborg L A; Verhoef, Linda; Vennema, Harry; Ngui, Siew-Lin; Friesema, Ingrid H M; Whiteside, Chris; Lees, David; Koopmans, Marion

    2016-01-01

    This report describes an outbreak investigation starting with two closely related suspected food-borne clusters of Dutch hepatitis A cases, nine primary cases in total, with an unknown source in the Netherlands. The hepatitis A virus (HAV) genotype IA sequences of both clusters were highly similar (459/460 nt) and were not reported earlier. Food questionnaires and a case-control study revealed an association with consumption of mussels. Analysis of mussel supply chains identified the most likely production area. International enquiries led to identification of a cluster of patients near this production area with identical HAV sequences with onsets predating the first Dutch cluster of cases. The most likely source for this cluster was a case who returned from an endemic area in Central America, and a subsequent household cluster from which treated domestic sewage was discharged into the suspected mussel production area. Notably, mussels from this area were also consumed by a separate case in the United Kingdom sharing an identical strain with the second Dutch cluster. In conclusion, a small number of patients in a non-endemic area led to geographically dispersed hepatitis A outbreaks with food as vehicle. This link would have gone unnoticed without sequence analyses and international collaboration.

  20. Persistent molecular superfluid response in doped para-hydrogen clusters.

    PubMed

    Raston, P L; Jäger, W; Li, H; Le Roy, R J; Roy, P-N

    2012-06-22

    Direct observation of superfluid response in para-hydrogen (p-H(2)) remains a challenge because of the need for a probe that would not induce localization and a resultant reduction in superfluid fraction. Earlier work [H. Li, R. J. Le Roy, P.-N. Roy, and A. R. W. McKellar, Phys. Rev. Lett. 105, 133401 (2010)] has shown that carbon dioxide can probe the effective inertia of p-H(2) although larger clusters show a lower superfluid response due to localization. It is shown here that the lighter carbon monoxide probe molecule allows one to measure the effective inertia of p-H(2) clusters while maintaining a maximum superfluid response with respect to dopant rotation. Microwave spectroscopy and a theoretical analysis based on Feynman path-integral simulations are used to support this conclusion.

  1. A Population-Based Analysis of Application of WHO Nomenclature in Pathology Reports of Pulmonary Neuroendocrine Tumors.

    PubMed

    Derks, Jules L; van Suylen, Robert Jan; Thunnissen, Erik; den Bakker, Michael A; Smit, Egbert F; Groen, Harry J M; Speel, Ernst J M; Dingemans, Anne-Marie C

    2016-04-01

    Pulmonary neuroendocrine tumors (pNETs) are difficult to classify. We performed a population-based analysis to investigate the application of pNET nomenclature in daily pathology practice. Conclusions from pathology reports (2003-2012) describing carcinoids, (large cell) neuroendocrine carcinomas (NECs), and carcinomas with neuroendocrine features/differentiation were retrieved from the Dutch Pathology Registry by queries on location and diagnosis and screened for terminology. Cases with a nonpulmonary or unknown origin and small cell lung cancer were excluded. Diagnoses were clustered into subgroups and the retrieved terminology was compared with the 2015 World Health Organization (WHO) diagnoses. By means of an online questionnaire, interpretation of the non-WHO nomenclature retrieved from pathology reports was evaluated (by 35 physicians and 19 pathologists). A total of 3216 unique pathology report conclusions with 55 different pNET diagnoses (n = 3052) and 20 uncertain diagnoses (n = 164) were analyzed. Non-WHO nomenclature was used in 15% of diagnoses (n = 488). Diagnoses could be clustered into carcinoids (n = 1086), NEC (n = 1316), carcinomas with neuroendocrine features/differentiation (n = 624), and unspecified pNETs (n = 26). Non-WHO nomenclature within these clusters was found for 7% of carcinoids, 20% of NECs, 13% of carcinomas with neuroendocrine features/differentiation, and 100% of unspecified pNETs and was observed more often in conclusions regarding biopsy or cytological specimens (62% and 12%) compared with resection specimens (26%). Analysis of the questionnaire results revealed that 4 of 19 diagnoses based on non-WHO nomenclature were uniformly interpreted (>50% agreement) by physicians, as were 10 of 19 diagnoses by pathologists. In 15% of pNETs other than small cell lung cancer, a non-WHO nomenclature diagnosis was provided, more frequently on the basis of smaller specimens. The interpretation was different between physicians and pathologists. Application of uniform nomenclature among all clinicians is advocated. Copyright © 2016 International Association for the Study of Lung Cancer. Published by Elsevier Inc. All rights reserved.

  2. Near-infrared spectroscopy of candidate red supergiant stars in clusters

    NASA Astrophysics Data System (ADS)

    Messineo, Maria; Zhu, Qingfeng; Ivanov, Valentin D.; Figer, Donald F.; Davies, Ben; Menten, Karl M.; Kudritzki, Rolf P.; Chen, C.-H. Rosie

    2014-11-01

    Context. Clear identifications of Galactic young stellar clusters farther than a few kpc from the Sun are rare, despite the large number of candidate clusters. Aims: We aim to improve the selection of candidate clusters rich in massive stars with a multiwavelength analysis of photometric Galactic data that range from optical to mid-infrared wavelengths. Methods: We present a photometric and spectroscopic analysis of five candidate stellar clusters, which were selected as overdensities with bright stars (Ks < 7 mag) in GLIMPSE and 2MASS images. Results: A total of 48 infrared spectra were obtained. The combination of photometry and spectroscopy yielded six new red supergiant stars with masses from 10 M⊙ to 15 M⊙. Two red supergiants are located at Galactic coordinates (l,b) = (16.7°, -0.63°) and at a distance of about 3.9 kpc; four other red supergiants are members of a cluster at Galactic coordinates (l,b) = (49.3°, +0.72°) and at a distance of ~7.0 kpc. Conclusions: Spectroscopic analysis of the brightest stars of detected overdensities and studies of interstellar extinction along their lines of sight are fundamental to distinguish regions of low extinction from actual stellar clusters. The census of young star clusters containing red supergiants is incomplete; in the existing all-sky near-infrared surveys, they can be identified as overdensities of bright stars with infrared color-magnitude diagrams characterized by gaps. Based on observations collected at the European Southern Observatory (ESO Programme 60.A-9700(E), and 089.D-0876), and on observations collected at the UKIRT telescope (programme ID H243NS). MM is currently employed by the MPIfR. Part of this work was performed at RIT (2009), at ESA (2010), and at the MPIfR. Tables 3, 4, and 6 are available in electronic form at http://www.aanda.org

  3. Cluster analysis and quality assessment of logged water at an irrigation project, eastern Saudi Arabia.

    PubMed

    Hussain, Mahbub; Ahmed, Syed Munaf; Abderrahman, Walid

    2008-01-01

    A multivariate statistical technique, cluster analysis, was used to assess the logged surface water quality at an irrigation project at Al-Fadhley, Eastern Province, Saudi Arabia. The principal idea behind using the technique was to utilize all available hydrochemical variables in the quality assessment including trace elements and other ions which are not considered in conventional techniques for water quality assessments like Stiff and Piper diagrams. Furthermore, the area belongs to an irrigation project where water contamination associated with the use of fertilizers, insecticides and pesticides is expected. This quality assessment study was carried out on a total of 34 surface/logged water samples. To gain a greater insight in terms of the seasonal variation of water quality, 17 samples were collected in each of the summer and winter seasons. The collected samples were analyzed for a total of 23 water quality parameters including pH, TDS, conductivity, alkalinity, sulfate, chloride, bicarbonate, nitrate, phosphate, bromide, fluoride, calcium, magnesium, sodium, potassium, arsenic, boron, copper, cobalt, iron, lithium, manganese, molybdenum, nickel, selenium, mercury and zinc. Cluster analysis in both Q and R modes was used. Q-mode analysis resulted in three distinct water types for both the summer and winter seasons. Q-mode analysis also showed the spatial as well as temporal variation in water quality. R-mode cluster analysis led to the conclusion that there are two major sources of contamination for the surface/shallow groundwater in the area: fertilizers, micronutrients, pesticides, and insecticides used in agricultural activities, and non-point natural sources.
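
    Q-mode analysis groups samples and R-mode analysis groups variables, so the two modes differ only in which axis of the data matrix is compared. The sketch below builds the distance matrix for each mode from a samples-by-variables table. It assumes z-score standardization and Euclidean distance, neither of which is stated in the abstract, and all function names and the toy values are hypothetical. Either matrix could then be passed to a hierarchical clustering routine.

```python
import math


def transpose(matrix):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(col) for col in zip(*matrix)]


def zscore_columns(matrix):
    """Standardize each column (variable) to zero mean and unit variance."""
    standardized = []
    for col in transpose(matrix):
        mean = sum(col) / len(col)
        sd = math.sqrt(sum((x - mean) ** 2 for x in col) / len(col))
        standardized.append([(x - mean) / sd if sd else 0.0 for x in col])
    return transpose(standardized)


def pairwise_distances(rows):
    """Euclidean distance between every pair of rows."""
    return [[math.dist(a, b) for b in rows] for a in rows]


def q_mode_distances(data):
    """Sample-to-sample distances (rows of the table)."""
    return pairwise_distances(zscore_columns(data))


def r_mode_distances(data):
    """Variable-to-variable distances (columns of the table)."""
    return pairwise_distances(transpose(zscore_columns(data)))


# Toy table (hypothetical): 3 water samples by 2 measured parameters.
table = [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]]
print(len(q_mode_distances(table)), len(r_mode_distances(table)))
```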

  4. Spatial analysis of malaria in Anhui province, China

    PubMed Central

    Zhang, Wenyi; Wang, Liping; Fang, Liqun; Ma, Jiaqi; Xu, Youfu; Jiang, Jiafu; Hui, Fengming; Wang, Jianjun; Liang, Song; Yang, Hong; Cao, Wuchun

    2008-01-01

    Background Malaria has re-emerged in Anhui Province, China, and this province was the most seriously affected by malaria during 2005–2006. It is necessary to understand the spatial distribution of malaria cases and to identify highly endemic areas for future public health planning and resource allocation in Anhui Province. Methods The annual average incidence at the county level was calculated using malaria cases reported between 2000 and 2006 in Anhui Province. GIS-based spatial analyses were conducted to detect spatial distribution and clustering of malaria incidence at the county level. Results The spatial distribution of malaria cases in Anhui Province from 2000 to 2006 was mapped at the county level to show crude incidence, excess hazard and spatial smoothed incidence. Spatial cluster analysis suggested 10 and 24 counties were at increased risk for malaria (P < 0.001) with the maximum spatial cluster sizes at < 50% and < 25% of the total population, respectively. Conclusion The application of GIS, together with spatial statistical techniques, provides a means to quantify explicit malaria risks and to further identify environmental factors responsible for the re-emerged malaria risks. Future public health planning and resource allocation in Anhui Province should be focused on the maximum spatial cluster region. PMID:18847489

  5. Symptom clusters and treatment time delay in Korean patients with ST-elevation myocardial infarction on admission.

    PubMed

    Kim, Hee-Sook; Eun, Sang Jun; Hwang, Jin Yong; Lee, Kun-Sei; Cho, Sung-Il

    2018-05-01

    Most patients with acute myocardial infarction (AMI) experience more than one symptom at onset. Although symptoms are an important early indicator, patients and physicians may have difficulty interpreting symptoms and detecting AMI at an early stage. This study aimed to identify symptom clusters among Korean patients with ST-elevation myocardial infarction (STEMI), to examine the relationship between symptom clusters and patient-related variables, and to investigate the influence of symptom clusters on treatment time delay (decision time [DT], onset-to-balloon time [OTB]). This was a prospective multicenter study with a descriptive design that used face-to-face interviews. A total of 342 patients with STEMI were included in this study. To identify symptom clusters, two-step cluster analysis was performed using SPSS software. Multinomial logistic regression to explore factors related to each cluster and multiple logistic regression to determine the effect of symptom clusters on treatment time delay were conducted. Three symptom clusters were identified: cluster 1 (classic MI; characterized by chest pain); cluster 2 (stress symptoms; sweating and chest pain); and cluster 3 (multiple symptoms; dizziness, sweating, chest pain, weakness, and dyspnea). Compared with patients in clusters 2 and 3, those in cluster 1 were more likely to have diabetes or prior MI. Patients in clusters 2 and 3, who predominantly showed other symptoms in addition to chest pain, had a significantly shorter DT and OTB than those in cluster 1. In conclusion, to decrease treatment time delay, it seems important that patients and clinicians recognize symptom clusters, rather than relying on chest pain alone. Further research is necessary to translate our findings into clinical practice and to improve patient education and public education campaigns.

  6. Off-road truck-related accidents in U.S. mines

    PubMed Central

    Dindarloo, Saeid R.; Pollard, Jonisha P.; Siami-Irdemoosa, Elnaz

    2016-01-01

    Introduction Off-road trucks are one of the major sources of equipment-related accidents in the U.S. mining industries. A systematic analysis of all off-road truck-related accidents, injuries, and illnesses, which are reported and published by the Mine Safety and Health Administration (MSHA), is expected to provide practical insights for identifying the accident patterns and trends in the available raw database. Therefore, appropriate safety management measures can be administered and implemented based on these accident patterns/trends. Methods A hybrid clustering-classification methodology using K-means clustering and gene expression programming (GEP) is proposed for the analysis of severe and non-severe off-road truck-related injuries at U.S. mines. Using the GEP sub-model, a small subset of the 36 recorded attributes was found to be correlated to the severity level. Results Given the set of specified attributes, the clustering sub-model was able to cluster the accident records into 5 distinct groups. For instance, the first cluster contained accidents related to minerals processing mills and coal preparation plants (91%). More than two-thirds of the victims in this cluster had less than 5 years of job experience. This cluster was associated with the highest percentage of severe injuries (22 severe accidents, 3.4%). Almost 50% of all accidents in this cluster occurred at stone operations. Similarly, the other four clusters were characterized to highlight important patterns that can be used to determine areas of focus for safety initiatives. Conclusions The identified clusters of accidents may play a vital role in the prevention of severe injuries in mining. Further research into the cluster attributes and identified patterns will be necessary to determine how these factors can be mitigated to reduce the risk of severe injuries. 
Practical application Analyzing injury data using data mining techniques provides some insight into attributes that are associated with high accuracies for predicting injury severity. PMID:27620937

  7. Classification of patients based on their evaluation of hospital outcomes: cluster analysis following a national survey in Norway

    PubMed Central

    2013-01-01

    Background A general trend towards positive patient-reported evaluations of hospitals could be taken as a sign that most patients form a homogeneous, reasonably pleased group, and consequently that there is little need for quality improvement. The objective of this study was to explore this assumption by identifying and statistically validating clusters of patients based on their evaluation of outcomes related to overall satisfaction, malpractice and benefit of treatment. Methods Data were collected using a national patient-experience survey of 61 hospitals in the 4 health regions in Norway during spring 2011. Postal questionnaires were mailed to 23,420 patients after their discharge from hospital. Cluster analysis was performed to identify response clusters of patients, based on their responses to single items about overall patient satisfaction, benefit of treatment and perception of malpractice. Results Cluster analysis identified six response groups, including one cluster with systematically poorer evaluation across outcomes (18.5% of patients) and one small outlier group (5.3%) with very poor scores across all outcomes. One-way ANOVA with post-hoc tests showed that most differences between the six response groups on the three outcome items were significant. The response groups were significantly associated with nine patient-experience indicators (p < 0.001), and all groups were significantly different from each of the other groups on a majority of the patient-experience indicators. Clusters were significantly associated with age, education, self-perceived health, gender, and the extent to which patients wrote open comments in the questionnaire. Conclusions The study identified five response clusters with distinct patient-reported outcome scores, in addition to a heterogeneous outlier group with very poor scores across all outcomes. 
The outlier group and the cluster with systematically poorer evaluation across outcomes comprised almost one-quarter of all patients, clearly demonstrating the need to tailor quality initiatives and improve patient-perceived quality in hospitals. More research on patient clustering in patient evaluation is needed, as well as standardization of methodology to increase comparability across studies. PMID:23433450

  8. Stable clustering and the resolution of dissipationless cosmological N-body simulations

    NASA Astrophysics Data System (ADS)

    Benhaiem, David; Joyce, Michael; Sylos Labini, Francesco

    2017-10-01

    The determination of the resolution of cosmological N-body simulations, i.e. the range of scales in which quantities measured in them represent accurately the continuum limit, is an important open question. We address it here using scale-free models, for which self-similarity provides a powerful tool to control resolution. Such models also provide a robust testing ground for the so-called stable clustering approximation, which gives simple predictions for them. Studying large N-body simulations of such models with different force smoothing, we find that these two issues are in fact very closely related: our conclusion is that the accuracy of two-point statistics in the non-linear regime starts to degrade strongly around the scale at which their behaviour deviates from that predicted by the stable clustering hypothesis. Physically the association of the two scales is in fact simple to understand: stable clustering fails to be a good approximation when there are strong interactions of structures (in particular merging) and it is precisely such non-linear processes which are sensitive to fluctuations at the smaller scales affected by discretization. Resolution may be further degraded if the short distance gravitational smoothing scale is larger than the scale to which stable clustering can propagate. We examine in detail the very different conclusions of studies by Smith et al. and Widrow et al. and find that the strong deviations from stable clustering reported by these works are the results of over-optimistic assumptions about scales resolved accurately by the measured power spectra, and the reliance on Fourier space analysis. We emphasize the much poorer resolution obtained with the power spectrum compared to the two-point correlation function.

  9. Replicating cluster subtypes for the prevention of adolescent smoking and alcohol use

    PubMed Central

    Babbin, Steven F.; Velicer, Wayne F.; Paiva, Andrea L.; Brick, Leslie Ann D.; Redding, Colleen A.

    2015-01-01

    Introduction Substance abuse interventions tailored to the individual level have produced effective outcomes for a wide variety of behaviors. One approach to enhancing tailoring involves using cluster analysis to identify prevention subtypes that represent different attitudes about substance use. This study applied this approach to better understand tailored interventions for smoking and alcohol prevention. Methods Analyses were performed on a sample of sixth graders from 20 New England middle schools involved in a 36-month tailored intervention study. Most adolescents reported being in the Acquisition Precontemplation (aPC) stage at baseline: not smoking or not drinking and not planning to start in the next six months. For smoking (N= 4059) and alcohol (N= 3973), each sample was randomly split into five subsamples. Cluster analysis was performed within each subsample based on three variables: Pros and Cons (from Decisional Balance Scales), and Situational Temptations. Results Across all subsamples for both smoking and alcohol, the following four clusters were identified: (1) Most Protected (MP; low Pros, high Cons, low Temptations); (2) Ambivalent (AM; high Pros, average Cons and Temptations); (3) Risk Denial (RD; average Pros, low Cons, average Temptations); and (4) High Risk (HR; high Pros, low Cons, and very high Temptations). Conclusions Finding the same four clusters within aPC for both smoking and alcohol, replicating the results across the five subsamples, and demonstrating hypothesized relations among the clusters with additional external validity analyses provide strong evidence of the robustness of these results. These clusters demonstrate evidence of validity and can provide a basis for tailoring interventions. PMID:25222849

  10. Characterization of HIV Transmission in South-East Austria

    PubMed Central

    Kessler, Harald H.; Haas, Bernhard; Stelzl, Evelyn; Weninger, Karin; Little, Susan J.; Mehta, Sanjay R.

    2016-01-01

    To gain deeper insight into the epidemiology of HIV-1 transmission in South-East Austria we performed a retrospective analysis of 259 HIV-1 partial pol sequences obtained from unique individuals newly diagnosed with HIV infection in South-East Austria from 2008 through 2014. After quality filtering, putative transmission linkages were inferred when two sequences were ≤1.5% genetically different. Multiple linkages were resolved into putative transmission clusters. Further phylogenetic analyses were performed using BEAST v1.8.1. Finally, we investigated putative links between the 259 sequences from South-East Austria and all publicly available HIV polymerase sequences in the Los Alamos National Laboratory HIV sequence database. We found that 45.6% (118/259) of the sampled sequences were genetically linked with at least one other sequence from South-East Austria forming putative transmission clusters. Clustering individuals were more likely to be men who have sex with men (MSM; p<0.001), infected with subtype B (p<0.001) or subtype F (p = 0.02). Among clustered males who reported only heterosexual (HSX) sex as an HIV risk, 47% clustered closely with MSM (either as pairs or within larger MSM clusters). One hundred and seven of the 259 sequences (41.3%) from South-East Austria had at least one putative inferred linkage with sequences from a total of 69 other countries. In conclusion, analysis of HIV-1 sequences from newly diagnosed individuals residing in South-East Austria revealed a high degree of national and international clustering mainly within MSM. Interestingly, we found that a high number of heterosexual males clustered within MSM networks, suggesting either linkage between risk groups or misrepresentation of sexual risk behaviors by subjects. PMID:26967154
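
    The linkage rule described in this abstract (two sequences are linked when they are ≤1.5% genetically different, and multiple linkages are resolved into clusters) can be sketched as a threshold graph followed by connected components. This is a minimal illustration only: the function names are hypothetical, the toy sequences are invented, and a plain proportion of differing sites (p-distance) stands in for whatever genetic distance measure the authors actually used, which the abstract does not specify.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Fraction of differing sites between two aligned, equal-length sequences.

    Assumption: simple p-distance; the abstract does not name the distance model.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def transmission_clusters(seqs: dict, threshold: float = 0.015) -> list:
    """Link pairs with distance <= threshold, then merge links into clusters."""
    parent = {name: name for name in seqs}

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(seqs, 2):
        if p_distance(seqs[a], seqs[b]) <= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in seqs:
        groups.setdefault(find(name), set()).add(name)
    # Unlinked singletons are not reported as clusters.
    return [members for members in groups.values() if len(members) > 1]


# Toy alignment (hypothetical data): s1-s2 and s2-s3 differ by 1%, s1-s3 by 2%.
toy = {
    "s1": "A" * 200,
    "s2": "C" * 2 + "A" * 198,
    "s3": "C" * 4 + "A" * 196,
    "s4": "G" * 20 + "A" * 180,
}
clusters = transmission_clusters(toy)
```

    In the toy data s1 and s3 are not directly linked, yet they land in the same cluster through s2, which is what resolving multiple linkages into one cluster means under this reading.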

  11. Characterization of HIV Transmission in South-East Austria.

    PubMed

    Hoenigl, Martin; Chaillon, Antoine; Kessler, Harald H; Haas, Bernhard; Stelzl, Evelyn; Weninger, Karin; Little, Susan J; Mehta, Sanjay R

    2016-01-01

    To gain deeper insight into the epidemiology of HIV-1 transmission in South-East Austria we performed a retrospective analysis of 259 HIV-1 partial pol sequences obtained from unique individuals newly diagnosed with HIV infection in South-East Austria from 2008 through 2014. After quality filtering, putative transmission linkages were inferred when two sequences were ≤1.5% genetically different. Multiple linkages were resolved into putative transmission clusters. Further phylogenetic analyses were performed using BEAST v1.8.1. Finally, we investigated putative links between the 259 sequences from South-East Austria and all publicly available HIV polymerase sequences in the Los Alamos National Laboratory HIV sequence database. We found that 45.6% (118/259) of the sampled sequences were genetically linked with at least one other sequence from South-East Austria forming putative transmission clusters. Clustering individuals were more likely to be men who have sex with men (MSM; p<0.001), infected with subtype B (p<0.001) or subtype F (p = 0.02). Among clustered males who reported only heterosexual (HSX) sex as an HIV risk, 47% clustered closely with MSM (either as pairs or within larger MSM clusters). One hundred and seven of the 259 sequences (41.3%) from South-East Austria had at least one putative inferred linkage with sequences from a total of 69 other countries. In conclusion, analysis of HIV-1 sequences from newly diagnosed individuals residing in South-East Austria revealed a high degree of national and international clustering mainly within MSM. Interestingly, we found that a high number of heterosexual males clustered within MSM networks, suggesting either linkage between risk groups or misrepresentation of sexual risk behaviors by subjects.

  12. Degree-based statistic and center persistency for brain connectivity analysis.

    PubMed

    Yoo, Kwangsun; Lee, Peter; Chung, Moo K; Sohn, William S; Chung, Sun Ju; Na, Duk L; Ju, Daheen; Jeong, Yong

    2017-01-01

    Brain connectivity analyses have been widely performed to investigate the organization and functioning of the brain, or to observe changes in neurological or psychiatric conditions. However, connectivity analysis inevitably introduces the problem of mass-univariate hypothesis testing. Although several cluster-wise correction methods have been suggested to address this problem and shown to provide high sensitivity, these approaches fundamentally have two drawbacks: the lack of spatial specificity (localization power) and the arbitrariness of an initial cluster-forming threshold. In this study, we propose a novel method, degree-based statistic (DBS), performing cluster-wise inference. DBS is designed to overcome the above-mentioned two shortcomings. From a network perspective, a few brain regions are of critical importance and considered to play pivotal roles in network integration. Regarding this notion, DBS defines a cluster as a set of edges of which one ending node is shared. This definition enables the efficient detection of clusters and their center nodes. Furthermore, a new measure of a cluster, center persistency (CP), was introduced. The efficiency of DBS with a known "ground truth" simulation was demonstrated. DBS was then applied to two experimental datasets and successfully detected the persistent clusters. In conclusion, by adopting a graph theoretical concept of degrees and borrowing the concept of persistence from algebraic topology, DBS could sensitively identify clusters with centric nodes that would play pivotal roles in an effect of interest. DBS is potentially widely applicable to variable cognitive or clinical situations and allows us to obtain statistically reliable and easily interpretable results. Hum Brain Mapp 38:165-181, 2017. © 2016 Wiley Periodicals, Inc.
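
    The cluster definition given in this abstract (a set of edges that share one ending node) can be illustrated with a few lines of Python. This is only a sketch of that definition, not the published DBS procedure: the edge statistics, the threshold value, and the function names are hypothetical, and neither center persistency nor the statistical inference step is covered.

```python
from collections import defaultdict


def center_clusters(edge_stats: dict, threshold: float) -> dict:
    """Group supra-threshold edges by the node they share.

    edge_stats maps an edge (u, v) to a test statistic (hypothetical values).
    Returns {center node: set of edges attached to it}.
    """
    clusters = defaultdict(set)
    for (u, v), stat in edge_stats.items():
        if stat >= threshold:
            # Each surviving edge belongs to the candidate cluster of both ends.
            clusters[u].add((u, v))
            clusters[v].add((u, v))
    return dict(clusters)


def largest_center(clusters: dict) -> str:
    """Center node with the highest degree among surviving edges."""
    return max(clusters, key=lambda node: len(clusters[node]))


# Hypothetical edge statistics between four regions.
stats = {("A", "B"): 3.1, ("A", "C"): 2.8, ("A", "D"): 2.9, ("B", "C"): 0.4}
found = center_clusters(stats, threshold=2.5)
center = largest_center(found)
```

    Here region A keeps three supra-threshold edges while B, C and D keep one each, so A is the natural center node of the largest cluster.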

  13. Planck 2015 results: XXIV. Cosmology from Sunyaev-Zeldovich cluster counts

    DOE PAGES

    Ade, P. A. R.; Aghanim, N.; Arnaud, M.; ...

    2016-09-20

    In this work, we present cluster counts and corresponding cosmological constraints from the Planck full mission data set. Our catalogue consists of 439 clusters detected via their Sunyaev-Zeldovich (SZ) signal down to a signal-to-noise ratio of 6, and is more than a factor of 2 larger than the 2013 Planck cluster cosmology sample. The counts are consistent with those from 2013 and yield compatible constraints under the same modelling assumptions. Taking advantage of the larger catalogue, we extend our analysis to the two-dimensional distribution in redshift and signal-to-noise. We use mass estimates from two recent studies of gravitational lensing of background galaxies by Planck clusters to provide priors on the hydrostatic bias parameter, (1-b). In addition, we use lensing of cosmic microwave background (CMB) temperature fluctuations by Planck clusters as an independent constraint on this parameter. These various calibrations imply constraints on the present-day amplitude of matter fluctuations in varying degrees of tension with those from the Planck analysis of primary fluctuations in the CMB; for the lowest estimated values of (1-b) the tension is mild, only a little over one standard deviation, while it remains substantial (3.7σ) for the largest estimated value. We also examine constraints on extensions to the base flat ΛCDM model by combining the cluster and CMB constraints. The combination appears to favour non-minimal neutrino masses, but this possibility does little to relieve the overall tension because it simultaneously lowers the implied value of the Hubble parameter, thereby exacerbating the discrepancy with most current astrophysical estimates. In conclusion, improving the precision of cluster mass calibrations from the current 10%-level to 1% would significantly strengthen these combined analyses and provide a stringent test of the base ΛCDM model.

  14. Planck 2015 results: XXIV. Cosmology from Sunyaev-Zeldovich cluster counts

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ade, P. A. R.; Aghanim, N.; Arnaud, M.

    In this work, we present cluster counts and corresponding cosmological constraints from the Planck full mission data set. Our catalogue consists of 439 clusters detected via their Sunyaev-Zeldovich (SZ) signal down to a signal-to-noise ratio of 6, and is more than a factor of 2 larger than the 2013 Planck cluster cosmology sample. The counts are consistent with those from 2013 and yield compatible constraints under the same modelling assumptions. Taking advantage of the larger catalogue, we extend our analysis to the two-dimensional distribution in redshift and signal-to-noise. We use mass estimates from two recent studies of gravitational lensing of background galaxies by Planck clusters to provide priors on the hydrostatic bias parameter, (1-b). In addition, we use lensing of cosmic microwave background (CMB) temperature fluctuations by Planck clusters as an independent constraint on this parameter. These various calibrations imply constraints on the present-day amplitude of matter fluctuations in varying degrees of tension with those from the Planck analysis of primary fluctuations in the CMB; for the lowest estimated values of (1-b) the tension is mild, only a little over one standard deviation, while it remains substantial (3.7σ) for the largest estimated value. We also examine constraints on extensions to the base flat ΛCDM model by combining the cluster and CMB constraints. The combination appears to favour non-minimal neutrino masses, but this possibility does little to relieve the overall tension because it simultaneously lowers the implied value of the Hubble parameter, thereby exacerbating the discrepancy with most current astrophysical estimates. In conclusion, improving the precision of cluster mass calibrations from the current 10%-level to 1% would significantly strengthen these combined analyses and provide a stringent test of the base ΛCDM model.

  15. Integrating participatory community mobilization processes to improve dengue prevention: an eco-bio-social scaling up of local success in Machala, Ecuador

    PubMed Central

    Mitchell-Foster, Kendra; Ayala, Efraín Beltrán; Breilh, Jaime; Spiegel, Jerry; Wilches, Ana Arichabala; Leon, Tania Ordóñez; Delgado, Jefferson Adrian

    2015-01-01

    Background This project investigates the effectiveness and feasibility of scaling-up an eco-bio-social approach for implementing an integrated community-based approach for dengue prevention in comparison with existing insecticide-based and emerging biolarvicide-based programs in an endemic setting in Machala, Ecuador. Methods An integrated intervention strategy (IIS) for dengue prevention (an elementary school-based dengue education program, and clean patio and safe container program) was implemented in 10 intervention clusters from November 2012 to November 2013 using a randomized controlled cluster trial design (20 clusters: 10 intervention, 10 control; 100 households per cluster with 1986 total households). Current existing dengue prevention programs served as the control treatment in comparison clusters. Pupa per person index (PPI) is used as the main outcome measure. Particular attention was paid to social mobilization and empowerment with IIS. Results Overall, IIS was successful in reducing PPI levels in intervention communities versus control clusters, with intervention clusters in the six paired clusters that followed the study design experiencing a greater reduction of PPI compared to controls (2.2 OR, 95% CI: 1.2 to 4.7). Analysis of individual cases demonstrates that consideration for contextualizing programs and strategies to local neighborhoods can be very effective in reducing PPI for dengue transmission risk reduction. Conclusions In the rapidly evolving political climate for dengue control in Ecuador, integration of successful social mobilization and empowerment strategies with existing and emerging biolarvicide-based government dengue prevention and control programs is promising in reducing PPI and dengue transmission risk in southern coastal communities like Machala. However, more profound analysis of social determination of health is called for to assess sustainability prospects. PMID:25604763

  16. Clustering determines the dynamics of complex contagions in multiplex networks

    NASA Astrophysics Data System (ADS)

    Zhuang, Yong; Arenas, Alex; Yaǧan, Osman

    2017-01-01

    We present the mathematical analysis of generalized complex contagions in a class of clustered multiplex networks. The model is intended to understand spread of influence, or any other spreading process implying a threshold dynamics, in setups of interconnected networks with significant clustering. The contagion is assumed to be general enough to account for a content-dependent linear threshold model, where each link type has a different weight (for spreading influence) that may depend on the content (e.g., product, rumor, political view) that is being spread. Using the generating functions formalism, we determine the conditions, probability, and expected size of the emergent global cascades. This analysis provides a generalization of previous approaches and is especially useful in problems related to spreading and percolation. The results present nontrivial dependencies between the clustering coefficient of the networks and its average degree. In particular, several phase transitions are shown to occur depending on these descriptors. Generally speaking, our findings reveal that increasing clustering decreases the probability of having global cascades and their size, however, this tendency changes with the average degree. There exists a certain average degree from which on clustering favors the probability and size of the contagion. By comparing the dynamics of complex contagions over multiplex networks and their monoplex projections, we demonstrate that ignoring link types and aggregating network layers may lead to inaccurate conclusions about contagion dynamics, particularly when the correlation of degrees between layers is high.
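
    A content-dependent linear threshold rule of the kind this abstract mentions, where each link type carries its own weight, can be sketched as below. The abstract does not state the activation rule explicitly, so the weighted-fraction form used here is an assumption, and the link-type names, weights, and threshold are all hypothetical.

```python
def becomes_active(active: dict, total: dict, weights: dict, threshold: float) -> bool:
    """Decide whether a node adopts, given its neighbours per link type.

    active[t]  : number of active neighbours reached through link type t
    total[t]   : number of all neighbours reached through link type t
    weights[t] : content-dependent weight of link type t (assumed form)

    The node activates when the weighted fraction of active neighbours
    reaches the threshold.
    """
    weighted_active = sum(weights[t] * active.get(t, 0) for t in total)
    weighted_total = sum(weights[t] * total[t] for t in total)
    if weighted_total == 0:
        return False  # an isolated node cannot be influenced
    return weighted_active / weighted_total >= threshold


# Hypothetical node: 4 friendship links (2 active), 6 coworker links (1 active).
total = {"friend": 4, "coworker": 6}
active = {"friend": 2, "coworker": 1}
weights = {"friend": 1.0, "coworker": 0.5}

adopts_low = becomes_active(active, total, weights, threshold=0.3)
adopts_high = becomes_active(active, total, weights, threshold=0.4)
```

    With these numbers the weighted active fraction is 2.5 / 7, so the node adopts at a threshold of 0.3 but not at 0.4. Setting every weight to 1 recovers the aggregated (monoplex) view in which link types are ignored.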

  17. A Web-Based Multidrug-Resistant Organisms Surveillance and Outbreak Detection System with Rule-Based Classification and Clustering

    PubMed Central

    Tseng, Yi-Ju; Wu, Jung-Hsuan; Ping, Xiao-Ou; Lin, Hui-Chi; Chen, Ying-Yu; Shang, Rung-Ji; Chen, Ming-Yuan; Lai, Feipei

    2012-01-01

    Background The emergence and spread of multidrug-resistant organisms (MDROs) are causing a global crisis. Combating antimicrobial resistance requires prevention of transmission of resistant organisms and improved use of antimicrobials. Objectives To develop a Web-based information system for automatic integration, analysis, and interpretation of the antimicrobial susceptibility of all clinical isolates that incorporates rule-based classification and cluster analysis of MDROs and implements control chart analysis to facilitate outbreak detection. Methods Electronic microbiological data from a 2200-bed teaching hospital in Taiwan were classified according to predefined criteria of MDROs. The numbers of organisms, patients, and incident patients in each MDRO pattern were presented graphically to describe spatial and time information in a Web-based user interface. Hierarchical clustering with 7 upper control limits (UCL) was used to detect suspicious outbreaks. The system’s performance in outbreak detection was evaluated based on vancomycin-resistant enterococcal outbreaks determined by a hospital-wide prospective active surveillance database compiled by infection control personnel. Results The optimal UCL for MDRO outbreak detection was the upper 90% confidence interval (CI) using germ criterion with clustering (area under ROC curve (AUC) 0.93, 95% CI 0.91 to 0.95), upper 85% CI using patient criterion (AUC 0.87, 95% CI 0.80 to 0.93), and one standard deviation using incident patient criterion (AUC 0.84, 95% CI 0.75 to 0.92). The performance indicators of each UCL were statistically significantly higher with clustering than those without clustering in germ criterion (P < .001), patient criterion (P = .04), and incident patient criterion (P < .001). Conclusion This system automatically identifies MDROs and accurately detects suspicious outbreaks of MDROs based on the antimicrobial susceptibility of all clinical isolates. PMID:23195868
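
    One of the upper control limits (UCL) reported in this abstract is one standard deviation above the baseline. A minimal sketch of that kind of control-chart check follows. The weekly counts are invented, the function names are hypothetical, and the clustering step and the confidence-interval-based limits used in the study are not reproduced here.

```python
from statistics import mean, stdev


def upper_control_limit(baseline: list, k: float = 1.0) -> float:
    """UCL as baseline mean plus k sample standard deviations."""
    return mean(baseline) + k * stdev(baseline)


def flag_outbreaks(baseline: list, observed: list, k: float = 1.0) -> list:
    """Return indices of observation periods whose count exceeds the UCL."""
    ucl = upper_control_limit(baseline, k)
    return [i for i, count in enumerate(observed) if count > ucl]


# Hypothetical weekly counts of incident patients with one MDRO pattern.
baseline_counts = [2, 3, 2, 4, 3, 2, 3, 3]
new_counts = [3, 4, 2, 6]

ucl = upper_control_limit(baseline_counts)
suspicious_weeks = flag_outbreaks(baseline_counts, new_counts)
```

    Raising `k` trades sensitivity for specificity, which is the trade-off the study evaluated when comparing candidate limits by area under the ROC curve.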

  18. Clusters of Midlife Women by Physical Activity and Their Racial/Ethnic Differences

    PubMed Central

    Im, Eun-Ok; Ko, Young; Chee, Eunice; Chee, Wonshik; Mao, Jun James

    2016-01-01

    Objective The purpose of this study was to identify clusters of midlife women by physical activity and to determine racial/ethnic differences in physical activities in each cluster. Methods This was a secondary analysis of the data from 542 women (157 Non-Hispanic [NH] Whites, 127 Hispanics, 135 NH African Americans, and 123 NH Asians) in a larger Internet study on midlife women’s attitudes toward physical activity. The instruments included the Barriers to Health Activities Scale, the Physical Activity Assessment Inventory, the Questions on Attitudes toward Physical Activity, Subjective Norm, Perceived Behavioral Control, and Behavioral Intention, and the Kaiser Physical Activity Survey. The data were analyzed using hierarchical cluster analyses, ANOVA, and multinomial logistic analyses. Results A three-cluster solution was adopted: Cluster 1 (high active living and sports/exercise activity group; 48%), Cluster 2 (high household/caregiving and occupational activity group; 27%), and Cluster 3 (low active living and sports/exercise activity group; 26%). There were significant racial/ethnic differences in occupational activities of Clusters 1 and 3 (all p<.01). Compared with Cluster 1, Cluster 2 tended to have lower family income, less access to health care, higher unemployment, higher perceived barriers scores, and lower social influences scores (all p<.01). Compared with Cluster 1, Cluster 3 tended to have greater obesity, less access to health care, higher perceived barriers scores, more negative attitudes toward physical activity, and lower self-efficacy scores (all p<.01). Conclusions Midlife women’s unique patterns of physical activity and their associated factors need to be considered in future intervention development. PMID:27846052

  19. Do recreation motivations and wilderness involvement relate to support for wilderness management? A segmentation analysis

    Treesearch

    Troy E. Hall; Erin Seekamp; David Cole

    2010-01-01

    Surveys show relatively little support for use restrictions to protect wilderness experiences. However, such conclusions based on aggregate data could hide important differences among visitors. Visitors with more wilderness-dependent trip motives were hypothesized to be more supportive of use restrictions. Using survey data from visitors to 13 wildernesses, cluster...

  20. Clustering of diet- and activity-related parenting practices: cross-sectional findings of the INPACT study

    PubMed Central

    2013-01-01

    Background Various diet- and activity-related parenting practices are positive determinants of child dietary and activity behaviour, including home availability, parental modelling and parental policies. There is evidence that parenting practices cluster within the dietary domain and within the activity domain. This study explores whether diet- and activity-related parenting practices cluster across the dietary and activity domain. Also examined is whether the clusters are related to child and parental background characteristics. Finally, to indicate the relevance of the clusters in influencing child dietary and activity behaviour, we examined whether clusters of parenting practices are related to these behaviours. Methods Data were used from 1480 parent–child dyads participating in the Dutch IVO Nutrition and Physical Activity Child cohorT (INPACT). Parents of children aged 8–11 years completed questionnaires at home assessing their diet- and activity-related parenting practices, child and parental background characteristics, and child dietary and activity behaviours. Principal component analysis (PCA) was used to identify clusters of parenting practices. Backward regression analysis was used to examine the relationship between child and parental background characteristics with cluster scores, and partial correlations to examine associations between cluster scores and child dietary and activity behaviours. Results PCA revealed five clusters of parenting practices: 1) high visibility and accessibility of screens and unhealthy food, 2) diet- and activity-related rules, 3) low availability of unhealthy food, 4) diet- and activity-related positive modelling, and 5) positive modelling on sports and fruit. Low parental education was associated with unhealthy cluster 1, while high(er) education was associated with healthy clusters 2, 3 and 5. 
Separate clusters were related to both child dietary and activity behaviour in the hypothesized directions: healthy clusters were positively related to obesity-reducing behaviours and negatively to obesity-inducing behaviours. Conclusion Parenting practices cluster across the dietary and activity domain. Parental education can be seen as an indicator of a broader parental context in which clusters of parenting practices operate. Separate clusters are related to both child dietary and activity behaviour. Interventions that focus on clusters of parenting practices to assist parents (especially low-educated parents) in changing their child’s dietary and activity behaviour seem justified. PMID:23531232

  1. Identifying Likely Transmission Pathways within a 10-Year Community Outbreak of Tuberculosis by High-Depth Whole Genome Sequencing

    PubMed Central

    Sadsad, Rosemarie; Martinez, Elena; Jelfs, Peter; Hill-Cawthorne, Grant A.; Gilbert, Gwendolyn L.; Marais, Ben J.; Sintchenko, Vitali

    2016-01-01

    Background Improved tuberculosis control and the need to contain the spread of drug-resistant strains provide a strong rationale for exploring tuberculosis transmission dynamics at the population level. Whole-genome sequencing provides optimal strain resolution, facilitating detailed mapping of potential transmission pathways. Methods We sequenced 22 isolates from a Mycobacterium tuberculosis cluster in New South Wales, Australia, identified during routine 24-locus mycobacterial interspersed repetitive unit typing. Following high-depth paired-end sequencing using the Illumina HiSeq 2000 platform, two independent pipelines were employed for analysis, both employing read mapping onto reference genomes as well as de novo assembly, to control biases in variant detection. In addition to single-nucleotide polymorphisms, the analyses also sought to identify insertions, deletions and structural variants. Results Isolates were highly similar, with a distance of 13 variants between the most distant members of the cluster. The most sensitive analysis classified the 22 isolates into 18 groups. Four of the isolates did not appear to share a recent common ancestor with the largest clade; another four isolates had an uncertain ancestral relationship with the largest clade. Conclusion Whole genome sequencing, with analysis of single-nucleotide polymorphisms, insertions, deletions, structural variants and subpopulations, enabled the highest possible level of discrimination between cluster members, clarifying likely transmission pathways and exposing the complexity of strain origin. The analysis provides a basis for targeted public health intervention and enhanced classification of future isolates linked to the cluster. PMID:26938641

  2. Maternal Characteristics and Incidence of Overweight/Obesity in Children: A 13-Year Follow-up Study in an Eastern Mediterranean Population.

    PubMed

    Jalali-Farahani, Sara; Amiri, Parisa; Abbasi, Behnood; Karimi, Mehrdad; Cheraghi, Leila; Daneshpour, Maryam Sadat; Azizi, Fereidoun

    2017-05-01

    Objectives To investigate clustering of parental sociobehavioral factors and their relationship with the incidence of overweight and obesity in Iranian children. Methods Demographics, body weight, and certain medical characteristics of the parents of 2999 children were used to categorize parents by cluster; children's weights were assessed for each cluster. Specifically, survival analysis and Cox regression models were used to test the effect of parental clustering on the incidence of childhood overweight and obesity. Results Maternal metabolic syndrome, education level, age, body weight status, and paternal age had important roles in distinguishing clusters with low, moderate, and high risk. Crude incidence rates (per 10,000 person-years) of overweight and obesity were 416.8 (95% confidence interval (CI) 388.2-447.5) and 114.7 (95% CI 101.2-129.9), respectively. Children of parents with certain constellations of demographic and medical characteristics were 37.0 and 41.0% more likely to become overweight and obese, respectively. Conclusions for Practice The current study demonstrated the vital role of maternal characteristics in distinguishing familial clusters, which could be used to predict the incidence of overweight and obesity in children.
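
    The crude incidence rates above are expressed as events per 10,000 person-years with a 95% confidence interval. A minimal sketch of that arithmetic, assuming a log-normal approximation for the interval (the abstract does not state which interval method was used) and hypothetical event counts:

```python
import math

def incidence_rate(events, person_years, per=10_000):
    """Crude rate per `per` person-years with an approximate 95% CI.

    Assumes the log of the rate is roughly normal with standard error
    1 / sqrt(events); this is an illustrative choice, not the study's method.
    """
    rate = events / person_years * per
    factor = math.exp(1.96 / math.sqrt(events))
    return rate, rate / factor, rate * factor

# Hypothetical follow-up: 100 new cases over 5,000 person-years.
rate, lower, upper = incidence_rate(100, 5_000)
```

    Because the interval is built on the log scale, it is asymmetric around the rate, which matches the shape of the intervals reported in the abstract.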

  3. Assessment of hybridization among wild and cultivated Vigna unguiculata subspecies revealed by arbitrarily primed polymerase chain reaction analysis

    PubMed Central

    Vijaykumar, Archana; Saini, Ajay; Jawali, Narendra

    2012-01-01

    Background and aims Intra-species hybridization and incompletely homogenized ribosomal RNA repeat units have earlier been reported in 21 accessions of Vigna unguiculata from six subspecies using internal transcribed spacer (ITS) and 5S intergenic spacer (IGS) analyses. However, the relationships among these accessions were not clear from these analyses. We therefore assessed intra-species hybridization in the same set of accessions. Methodology Arbitrarily primed polymerase chain reaction (AP-PCR) analysis was carried out using 12 primers. The PCR products were resolved on agarose gels and the DNA fragments were scored manually. Genetic relationships were inferred by TREECON software using unweighted paired group method with arithmetic averages (UPGMA) cluster analysis evaluated by bootstrapping and compared with previous analyses based on ITS and 5S IGS. Principal results A total of 202 (86 %) fragments were found to be polymorphic and used for generating a genetic distance matrix. Twenty-one V. unguiculata accessions were grouped into three main clusters. The cultivated subspecies (var. unguiculata) and most of its wild progenitors (var. spontanea) were placed in cluster I along with ssp. pubescens and ssp. stenophylla. Whereas var. spontanea were grouped with ssp. alba and ssp. tenuis accessions in cluster II, ssp. alba and ssp. baoulensis were included in cluster III. Close affinities of ssp. unguiculata, ssp. alba and ssp. tenuis suggested inter-subspecies hybridization. Conclusions Multi-locus AP-PCR analysis reveals that intra-species hybridization is prevalent among V. unguiculata subspecies and suggests that grouping of accessions from two different subspecies is not solely due to the similarity in the ITS and 5S IGS regions but also due to other regions of the genome. PMID:22619698
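
    UPGMA, as used above, is average-linkage agglomerative clustering on a pairwise distance matrix. The following sketch shows the idea on manually scored band profiles (1 = fragment present, 0 = absent). The Jaccard distance and the four accession profiles are assumptions for illustration; the abstract does not name the distance measure, and TREECON itself is not used here.

```python
def jaccard_distance(a, b):
    """1 - shared bands / bands present in either profile (0/1 lists)."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return 1.0 - both / either if either else 0.0

def upgma(profiles):
    """Average-linkage clustering; returns merges as (members_a, members_b, distance)."""
    names = sorted(profiles)
    clusters = {i: (name,) for i, name in enumerate(names)}
    dist = {}
    for i in clusters:
        for j in clusters:
            if i < j:
                dist[(i, j)] = jaccard_distance(profiles[names[i]], profiles[names[j]])
    merges = []
    next_id = len(names)
    while len(clusters) > 1:
        # Pick the closest pair of current clusters.
        (i, j), d = min(dist.items(), key=lambda kv: kv[1])
        merges.append((clusters[i], clusters[j], d))
        size_i, size_j = len(clusters[i]), len(clusters[j])
        merged = clusters.pop(i) + clusters.pop(j)
        # Distance to the merged cluster is the size-weighted mean of the
        # distances to its two parts (the "arithmetic averages" in UPGMA).
        new_dist = {}
        for k in clusters:
            d_ik = dist[(min(i, k), max(i, k))]
            d_jk = dist[(min(j, k), max(j, k))]
            new_dist[(k, next_id)] = (size_i * d_ik + size_j * d_jk) / (size_i + size_j)
        dist = {p: v for p, v in dist.items() if i not in p and j not in p}
        dist.update(new_dist)
        clusters[next_id] = merged
        next_id += 1
    return merges

# Hypothetical band scores for four accessions across six fragments.
profiles = {
    "acc1": [1, 1, 1, 0, 0, 0],
    "acc2": [1, 1, 1, 1, 0, 0],
    "acc3": [0, 0, 0, 1, 1, 1],
    "acc4": [0, 0, 1, 1, 1, 1],
}
merges = upgma(profiles)
```

    Bootstrapping, as mentioned in the abstract, would repeat this procedure on fragments resampled with replacement and count how often each grouping recurs.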

  4. Classification of Cowpox Viruses into Several Distinct Clades and Identification of a Novel Lineage

    PubMed Central

    Franke, Annika; Pfaff, Florian; Jenckel, Maria; Hoffmann, Bernd; Höper, Dirk; Antwerpen, Markus; Meyer, Hermann; Beer, Martin; Hoffmann, Donata

    2017-01-01

    Cowpox virus (CPXV) was considered a uniform species within the genus Orthopoxvirus (OPV). Previous phylogenetic analysis indicated that CPXV is polyphyletic and isolates may cluster into different clades, with two of these clades showing genetic similarities to either variola (VARV) or vaccinia viruses (VACV). Further analyses were initiated to assess both the genetic diversity and the evolutionary background of circulating CPXVs. Here we report the full-length sequences of 20 CPXV strains isolated from different animal species and humans in Germany. A phylogenetic analysis of altogether 83 full-length OPV genomes confirmed the polyphyletic character of the species CPXV and suggested at least four different clades. The German isolates from this study mainly clustered into two CPXV-like clades, and VARV- and VACV-like strains were not observed. A single strain, isolated from a cotton-top tamarin, clustered distantly from all other CPXVs and might represent a novel and unique evolutionary lineage. The classification of CPXV strains into clades roughly followed their geographic origin, with the highest clade diversity so far observed for Germany. Furthermore, we found evidence for recombination between OPV clades without significant disruption of the observed clustering. In conclusion, this analysis markedly expands the number of available CPXV full-length sequences, confirms the co-circulation of several CPXV clades in Germany, and provides the first data about a new evolutionary CPXV lineage. PMID:28604604

  5. Clustering More than Two Million Biomedical Publications: Comparing the Accuracies of Nine Text-Based Similarity Approaches

    PubMed Central

    Boyack, Kevin W.; Newman, David; Duhon, Russell J.; Klavans, Richard; Patek, Michael; Biberstine, Joseph R.; Schijvenaars, Bob; Skupin, André; Ma, Nianli; Börner, Katy

    2011-01-01

    Background We investigate the accuracy of different similarity approaches for clustering over two million biomedical documents. Clustering large sets of text documents is important for a variety of information needs and applications such as collection management and navigation, summary and analysis. The few comparisons of clustering results from different similarity approaches have focused on small literature sets and have given conflicting results. Our study was designed to seek a robust answer to the question of which similarity approach would generate the most coherent clusters of a biomedical literature set of over two million documents. Methodology We used a corpus of 2.15 million recent (2004-2008) records from MEDLINE, and generated nine different document-document similarity matrices from information extracted from their bibliographic records, including titles, abstracts and subject headings. The nine approaches were comprised of five different analytical techniques with two data sources. The five analytical techniques are cosine similarity using term frequency-inverse document frequency vectors (tf-idf cosine), latent semantic analysis (LSA), topic modeling, and two Poisson-based language models – BM25 and PMRA (PubMed Related Articles). The two data sources were a) MeSH subject headings, and b) words from titles and abstracts. Each similarity matrix was filtered to keep the top-n highest similarities per document and then clustered using a combination of graph layout and average-link clustering. Cluster results from the nine similarity approaches were compared using (1) within-cluster textual coherence based on the Jensen-Shannon divergence, and (2) two concentration measures based on grant-to-article linkages indexed in MEDLINE. 
Conclusions PubMed's own related article approach (PMRA) generated the most coherent and most concentrated cluster solution of the nine text-based similarity approaches tested, followed closely by the BM25 approach using titles and abstracts. Approaches using only MeSH subject headings were not competitive with those based on titles and abstracts. PMID:21437291
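
    One of the pipelines described above (tf-idf cosine similarity followed by a top-n filter per document) can be sketched in a few lines. This is a toy version under stated assumptions: whitespace tokenization, a plain tf × log(N/df) weighting, and four made-up documents. The BM25, PMRA, LSA and topic-model variants, and the graph-layout clustering step, are not shown.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({t: c * math.log(n_docs / df[t]) for t, c in tf.items()})
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def top_n_similar(docs, n):
    """For each document, keep the indices of its n most similar documents."""
    vectors = tfidf_vectors(docs)
    result = []
    for i, u in enumerate(vectors):
        scored = [(cosine(u, v), j) for j, v in enumerate(vectors) if j != i]
        scored.sort(key=lambda s: (-s[0], s[1]))
        # Drop zero similarities so unrelated documents are never linked.
        result.append([j for score, j in scored[:n] if score > 0.0])
    return result

# Hypothetical mini-corpus standing in for titles and abstracts.
docs = [
    "gene expression clustering microarray",
    "microarray gene expression normalization",
    "tuberculosis transmission genotype clustering",
    "tuberculosis genotype outbreak sequencing",
]
neighbours = top_n_similar(docs, 1)
```

    Keeping only the top-n entries per row is what makes a similarity matrix over millions of documents sparse enough to cluster.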

  6. Gene expression profiles of breast biopsies from healthy women identify a group with claudin-low features

    PubMed Central

    2011-01-01

    Background Increased understanding of the variability in normal breast biology will enable us to identify mechanisms of breast cancer initiation and the origin of different subtypes, and to better predict breast cancer risk. Methods Gene expression patterns in breast biopsies from 79 healthy women referred to breast diagnostic centers in Norway were explored by unsupervised hierarchical clustering and supervised analyses, such as gene set enrichment analysis and gene ontology analysis and comparison with previously published genelists and independent datasets. Results Unsupervised hierarchical clustering identified two separate clusters of normal breast tissue based on gene-expression profiling, regardless of clustering algorithm and gene filtering used. Comparison of the expression profile of the two clusters with several published gene lists describing breast cells revealed that the samples in cluster 1 share characteristics with stromal cells and stem cells, and to a certain degree with mesenchymal cells and myoepithelial cells. The samples in cluster 1 also share many features with the newly identified claudin-low breast cancer intrinsic subtype, which also shows characteristics of stromal and stem cells. More women belonging to cluster 1 have a family history of breast cancer and there is a slight overrepresentation of nulliparous women in cluster 1. Similar findings were seen in a separate dataset consisting of histologically normal tissue from both breasts harboring breast cancer and from mammoplasty reductions. Conclusion This is the first study to explore the variability of gene expression patterns in whole biopsies from normal breasts and identified distinct subtypes of normal breast tissue. 
Further studies are needed to determine the specific cell contribution to the variation in the biology of normal breasts, how the clusters identified relate to breast cancer risk and their possible link to the origin of the different molecular subtypes of breast cancer. PMID:22044755

  7. Bacterial community comparisons by taxonomy-supervised analysis independent of sequence alignment and clustering

    PubMed Central

    Sul, Woo Jun; Cole, James R.; Jesus, Ederson da C.; Wang, Qiong; Farris, Ryan J.; Fish, Jordan A.; Tiedje, James M.

    2011-01-01

    High-throughput sequencing of 16S rRNA genes has increased our understanding of microbial community structure, but now even higher-throughput methods to the Illumina scale allow the creation of much larger datasets with more samples and orders-of-magnitude more sequences that swamp current analytic methods. We developed a method capable of handling these larger datasets on the basis of assignment of sequences into an existing taxonomy using a supervised learning approach (taxonomy-supervised analysis). We compared this method with a commonly used clustering approach based on sequence similarity (taxonomy-unsupervised analysis). We sampled 211 different bacterial communities from various habitats and obtained ∼1.3 million 16S rRNA sequences spanning the V4 hypervariable region by pyrosequencing. Both methodologies gave similar ecological conclusions in that β-diversity measures calculated by using these two types of matrices were significantly correlated to each other, as were the ordination configurations and hierarchical clustering dendrograms. In addition, our taxonomy-supervised analyses were also highly correlated with phylogenetic methods, such as UniFrac. The taxonomy-supervised analysis has the advantages that it is not limited by the exhaustive computation required for the alignment and clustering necessary for the taxonomy-unsupervised analysis, is more tolerant of sequencing errors, and allows comparisons when sequences are from different regions of the 16S rRNA gene. With the tremendous expansion in 16S rRNA data acquisition underway, the taxonomy-supervised approach offers the potential to provide more rapid and extensive community comparisons across habitats and samples. PMID:21873204
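
    In the taxonomy-supervised approach, each community becomes a table of read counts per taxon, and β-diversity is computed directly from those tables without aligning or clustering sequences. The sketch below uses Bray-Curtis dissimilarity as one common β-diversity measure; the abstract does not say which measures were used, and the sample names and counts are hypothetical.

```python
def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two {taxon: read_count} tables."""
    taxa = set(a) | set(b)
    shared = sum(min(a.get(t, 0), b.get(t, 0)) for t in taxa)
    total = sum(a.values()) + sum(b.values())
    return 1.0 - 2.0 * shared / total if total else 0.0

# Hypothetical phylum-level counts after classifying reads into a taxonomy.
samples = {
    "soil_1": {"Acidobacteria": 40, "Proteobacteria": 50, "Firmicutes": 10},
    "soil_2": {"Acidobacteria": 35, "Proteobacteria": 55, "Firmicutes": 10},
    "gut_1": {"Firmicutes": 70, "Bacteroidetes": 30},
}

d_soil = bray_curtis(samples["soil_1"], samples["soil_2"])
d_cross = bray_curtis(samples["soil_1"], samples["gut_1"])
```

    The resulting pairwise matrix can feed the same ordination or hierarchical clustering steps that a sequence-similarity (OTU) matrix would.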

  8. Obesigenic families: parents’ physical activity and dietary intake patterns predict girls’ risk of overweight

    PubMed Central

    Davison, K Krahnstoever; Birch, L Lipps

    2008-01-01

    OBJECTIVE To determine whether obesigenic families can be identified based on mothers’ and fathers’ dietary and activity patterns. METHODS A total of 197 girls and their parents were assessed when girls were 5 y old; 192 families were reassessed when girls were 7 y old. Measures of parents’ physical activity and dietary intake were obtained and entered into a cluster analysis to assess whether distinct family clusters could be identified. Girls’ skinfold thickness and body mass index (BMI) were also assessed and were used to examine the predictive validity of the clusters. RESULTS Obesigenic and non-obesigenic family clusters were identified. Mothers and fathers in the obesigenic cluster reported high levels of dietary intake and low levels of physical activity, while mothers and fathers in the non-obesigenic cluster reported low levels of dietary intake and high levels of activity. Girls from families in the obesigenic cluster had significantly higher BMI and skinfold thickness values at age 7 and showed significantly greater increases in BMI and skinfold thickness from ages 5 to 7 y than girls from non-obesigenic families; differences were reduced but not eliminated after controlling for parents’ BMI. CONCLUSIONS Obesigenic families, defined in terms of parents’ activity and dietary patterns, can be used to predict children’s risk of obesity. PMID:12187395

  9. Improved Test Planning and Analysis Through the Use of Advanced Statistical Methods

    NASA Technical Reports Server (NTRS)

    Green, Lawrence L.; Maxwell, Katherine A.; Glass, David E.; Vaughn, Wallace L.; Barger, Weston; Cook, Mylan

    2016-01-01

    The goal of this work is, through computational simulations, to provide statistically-based evidence to convince the testing community that a distributed testing approach is superior to a clustered testing approach for most situations. For clustered testing, numerous, repeated test points are acquired at a limited number of test conditions. For distributed testing, only one or a few test points are requested at many different conditions. The statistical techniques of Analysis of Variance (ANOVA), Design of Experiments (DOE) and Response Surface Methods (RSM) are applied to enable distributed test planning, data analysis and test augmentation. The D-Optimal class of DOE is used to plan an optimally efficient single- and multi-factor test. The resulting simulated test data are analyzed via ANOVA and a parametric model is constructed using RSM. Finally, ANOVA can be used to plan a second round of testing to augment the existing data set with new data points. The use of these techniques is demonstrated through several illustrative examples. To date, many thousands of comparisons have been performed and the results strongly support the conclusion that the distributed testing approach outperforms the clustered testing approach.

  10. Investigating the effects of climate variations on bacillary dysentery incidence in northeast China using ridge regression and hierarchical cluster analysis

    PubMed Central

    Huang, Desheng; Guan, Peng; Guo, Junqiao; Wang, Ping; Zhou, Baosen

    2008-01-01

    Background The effects of climate variations on bacillary dysentery incidence have recently gained more attention. However, the multi-collinearity among meteorological factors affects the accuracy of correlation with bacillary dysentery incidence. Methods As a remedy, a modified method combining ridge regression and hierarchical cluster analysis was proposed for investigating the effects of climate variations on bacillary dysentery incidence in northeast China. Results All weather indicators (temperatures, precipitation, evaporation and relative humidity) showed a positive correlation with the monthly incidence of bacillary dysentery, while air pressure had a negative correlation with the incidence. Ridge regression and hierarchical cluster analysis showed that during 1987–1996, relative humidity, temperatures and air pressure affected the transmission of bacillary dysentery. During this period, all meteorological factors were divided into three categories. Relative humidity and precipitation belonged to one class, temperature indexes and evaporation belonged to another class, and air pressure was the third class. Conclusion Meteorological factors have affected the transmission of bacillary dysentery in northeast China. Bacillary dysentery prevention and control would benefit from giving more consideration to local climate variations. PMID:18816415

  11. Gene duplications in prokaryotes can be associated with environmental adaptation

    PubMed Central

    2010-01-01

    Background Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Results Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Conclusions Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism. 
Paralogs and singletons dominate different categories of functional classification, where paralogs in particular seem to be associated with processes involving interaction with the environment. PMID:20961426

  12. Molecular clustering of patients with diabetes and pulmonary tuberculosis: A systematic review and meta-analysis

    PubMed Central

    Blanco-Guillot, Francles; Delgado-Sánchez, Guadalupe; Mongua-Rodríguez, Norma; Cruz-Hervert, Pablo; Ferreyra-Reyes, Leticia; Ferreira-Guerrero, Elizabeth; Yanes-Lane, Mercedes; Montero-Campos, Rogelio; Bobadilla-del-Valle, Miriam; Torres-González, Pedro; Ponce-de-León, Alfredo; Sifuentes-Osornio, José; Garcia-Garcia, Lourdes

    2017-01-01

    Introduction Many studies have explored the relationship between diabetes mellitus (DM) and tuberculosis (TB) demonstrating increased risk of TB among patients with DM and poor prognosis of patients suffering from the association of DM/TB. Owing to a paucity of studies addressing this question, it remains unclear whether patients with DM and TB are more likely than TB patients without DM to be grouped into molecular clusters defined according to the genotype of the infecting Mycobacterium tuberculosis bacillus. That is, whether there is convincing molecular epidemiological evidence for TB transmission among DM patients. Objective: We performed a systematic review and meta-analysis to quantitatively evaluate the propensity for patients with DM and pulmonary TB (PTB) to cluster according to the genotype of the infecting M. tuberculosis bacillus. Materials and methods We conducted a systematic search in MEDLINE and LILACS from 1990 to June, 2016 with the following combinations of key words “tuberculosis AND transmission” OR “tuberculosis diabetes mellitus” OR “Mycobacterium tuberculosis molecular epidemiology” OR “RFLP-IS6110” OR “Spoligotyping” OR “MIRU-VNTR”. Studies were included if they met the following criteria: (i) studies based on populations from defined geographical areas; (ii) use of genotyping by IS6110- restriction fragment length polymorphism (RFLP) analysis and spoligotyping or mycobacterial interspersed repetitive unit-variable number of tandem repeats (MIRU-VNTR) or other amplification methods to identify molecular clustering; (iii) genotyping and analysis of 50 or more cases of PTB; (iv) study duration of 11 months or more; (v) identification of quantitative risk factors for molecular clustering including DM; (vi) > 60% coverage of the study population; and (vii) patients with PTB confirmed bacteriologically. 
The exclusion criteria were: (i) extrapulmonary TB; (ii) TB caused by nontuberculous mycobacteria; (iii) patients with PTB and HIV; (iv) pediatric PTB patients; (v) TB in closed environments (e.g. prisons, elderly homes, etc.); (vi) diabetes insipidus and (vii) outbreak reports. The Hartung-Knapp-Sidik-Jonkman method was used to estimate the odds ratio (OR) of the association between DM and molecular clustering of TB cases. To evaluate the degree of heterogeneity, a statistical Q test was done. Publication bias was examined with the Begg and Egger tests. Review Manager 5.3.5, CMA v.3 (Biostat) and the R software package were used. Results Selection criteria were met by six articles, which included 4076 patients with PTB, of whom 13% had DM. Twenty-seven percent of the cases were clustered. The largest share of cases (48%) was reported in a study in China with 31% clustering. The highest incidence of TB occurred in two studies from China. The global OR for molecular clustering was 0.84 (95% CI 0.40–1.72). The heterogeneity between studies was moderate (I² = 55%, p = 0.05), although there was no publication bias (Begg's test p = 0.353 and Egger's test p = 0.429). Conclusion There were very few studies meeting our selection criteria. The wide confidence interval indicates that there is not enough evidence to draw conclusions about the association. Clustering of patients with DM in TB transmission chains should be investigated in areas where both diseases are prevalent, with a focus on specific contexts. PMID:28902922
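
    The pooled odds ratio, Cochran's Q test and the I² statistic reported above can be illustrated with a simple inverse-variance calculation on the log odds ratio scale. This sketch uses a fixed-effect pool and a normal-approximation interval; it is deliberately simpler than the Hartung-Knapp-Sidik-Jonkman method named in the abstract, and the 2x2 tables are hypothetical.

```python
import math

def log_odds_ratio(a, b, c, d):
    """Log OR and its standard error from a 2x2 table.

    a, b: DM patients clustered / not clustered
    c, d: non-DM patients clustered / not clustered
    """
    return math.log((a * d) / (b * c)), math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)

def pool_fixed_effect(tables):
    """Inverse-variance pooled OR, 95% CI, Cochran's Q and I^2 (percent)."""
    effects = [log_odds_ratio(*t) for t in tables]
    weights = [1.0 / se ** 2 for _, se in effects]
    pooled = sum(w * y for w, (y, _) in zip(weights, effects)) / sum(weights)
    q = sum(w * (y - pooled) ** 2 for w, (y, _) in zip(weights, effects))
    df = len(tables) - 1
    # I^2 is the share of total variation attributed to heterogeneity.
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    se_pooled = math.sqrt(1.0 / sum(weights))
    ci = (math.exp(pooled - 1.96 * se_pooled), math.exp(pooled + 1.96 * se_pooled))
    return math.exp(pooled), ci, q, i2

# Hypothetical studies (not the six articles in the review).
studies = [(10, 20, 30, 40), (30, 10, 20, 40), (15, 15, 20, 20)]
pooled_or, ci, q, i2 = pool_fixed_effect(studies)
```

    A confidence interval that spans 1, as in the review's 0.40–1.72, means the pooled estimate is compatible with both a protective and a harmful association.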

  13. Analysis of indoor air pollutants checklist using environmetric technique for health risk assessment of sick building complaint in nonindustrial workplace

    PubMed Central

    Syazwan, AI; Rafee, B Mohd; Juahir, Hafizan; Azman, AZF; Nizar, AM; Izwyn, Z; Syahidatussyakirah, K; Muhaimin, AA; Yunos, MA Syafiq; Anita, AR; Hanafiah, J Muhamad; Shaharuddin, MS; Ibthisham, A Mohd; Hasmadi, I Mohd; Azhar, MN Mohamad; Azizan, HS; Zulfadhli, I; Othman, J; Rozalini, M; Kamarul, FT

    2012-01-01

    Purpose To analyze and characterize a multidisciplinary, integrated indoor air quality checklist for evaluating the health risk of building occupants in a nonindustrial workplace setting. Design A cross-sectional study based on a participatory occupational health program conducted by the National Institute of Occupational Safety and Health (Malaysia) and Universiti Putra Malaysia. Method A modified version of the indoor environmental checklist published by the Department of Occupational Health and Safety, based on the literature and discussion with occupational health and safety professionals, was used in the evaluation process. Summated scores were given according to the cluster analysis and principal component analysis in the characterization of risk. Environmetric techniques were used to classify the risk of variables in the checklist. Identification of the possible source of item pollutants was also evaluated using a semiquantitative approach. Result Hierarchical agglomerative cluster analysis resulted in the grouping of factorial components into three clusters (high complaint, moderate-high complaint, moderate complaint), which were further analyzed by discriminant analysis. From this, 15 major variables that influence indoor air quality were determined. Principal component analysis of each cluster revealed that the main factors influencing the high complaint group were fungal-related problems, chemical indoor dispersion, detergent, renovation, thermal comfort, and location of fresh air intake. The moderate-high complaint group showed significant high loading on ventilation, air filters, and smoking-related activities. The moderate complaint group showed high loading on dampness, odor, and thermal comfort. Conclusion This semiquantitative assessment, which graded risk from low to high based on the intensity of the problem, shows promising and reliable results. 
It should be used as an important tool in the preliminary assessment of indoor air quality and as a categorizing method for further IAQ investigations and complaints procedures. PMID:23055779

  14. Classification and discrimination of pediatric patients undergoing open heart surgery with and without methylprednisolone treatment by cytomics

    NASA Astrophysics Data System (ADS)

    Bocsi, Jozsef; Mittag, Anja; Pierzchalski, Arkadiusz; Osmancik, Pavel; Dähnert, Ingo; Tárnok, Attila

    2011-02-01

    Introduction: Methylprednisolone (MP) is frequently administered preoperatively in children undergoing open heart surgery. The aim of this medication is to inhibit overshooting immune responses. Earlier studies demonstrated cellular and humoral immunological changes in pediatric patients undergoing heart surgeries with and without MP administration. Here, in a retrospective study, we investigated the modulation of the cellular immune response by MP. The aim was to identify suitable parameters characterizing MP effects by cluster analysis. Methods: Blood samples were analysed from two age-matched groups with surgical correction of septum defects. The group without MP treatment consisted of 10 patients; MP was administered to 21 patients (median dose: 11 mg/kg) before cardiopulmonary bypass (CPB). EDTA anticoagulated blood was obtained 24 h preoperatively, after anesthesia, at CPB start and end (CPB2), 4 h, 24 h and 48 h after surgery, at discharge and at out-patient follow-up (8.2; 3.3-12.2 months after surgery; median and IQR). Flow cytometry showed the biggest MP-relevant changes at CPB2 and 4 h postoperatively. These time points were used for clustering analysis. Classification was made by discriminant analysis and cluster analysis by means of Genes@work software. Results & conclusion: 146 parameters were obtained from analysis. Cross-validation revealed several parameters able to discriminate between MP groups and to identify immune modulation. MP administration resulted in a delayed activation of monocytes, an increased ratio of neutrophils and reduced T-lymphocyte counts. Cluster analysis demonstrated that classification of patients is possible based on the identified cytomics parameters. Further investigation of these parameters might help to understand the MP effects in pediatric open heart surgery.

  15. Cluster analysis of the clinical histories of cattle affected with bovine anaemia associated with Theileria orientalis Ikeda type infection.

    PubMed

    Lawrence, K E; Forsyth, S F; Vaatstra, B L; McFadden, Amj; Pulford, D J; Govindaraju, K; Pomroy, W E

    2017-11-01

    AIM To determine the most commonly used words in the clinical histories of animals naturally infected with Theileria orientalis Ikeda type; whether these words differed between cases categorised by age, farm type or haematocrit (HCT), and if there was any clustering of the common words in relation to these categories. METHODS Clinical histories were transcribed for 605 cases of bovine anaemia associated with T. orientalis (TABA), that were submitted to laboratories with blood samples which tested positive for T. orientalis Ikeda type infection by PCR analysis, between October 2012 and November 2014. χ² tests were used to determine whether the proportion of submissions for each word was similar across the categories of HCT (normal, moderate anaemia or severe anaemia), farm type (dairy or beef) and age (young or old). Correspondence analysis (CA) was carried out on a contingency table of the frequency of the 28 most commonly used history words, cross-tabulated by age categories (young, old or unknown). Agglomerative hierarchical clustering, using Ward's method, was then performed on the coordinates from the correspondence analysis. RESULTS The six most commonly used history words were jaundice (204/605), lethargic (162/605), pale mucous membranes (161/605), cow (151/605), anaemia (147/605), and off milk (115/605). The proportion of cases with some history words differed between categories of age, farm type and HCT. The cluster analysis indicated that the recorded history words were grouped in two main clusters. The first included the words weight loss, tachycardia, pale mucous membranes, anaemia, lethargic and thin, and was associated with adult (p<0.001), severe anaemia (p<0.001) and dairy (p<0.001). The second cluster included the words deaths, ill-thrift, calves, calf and diarrhoea, and was associated with young (p<0.001), normal HCT (p<0.001), beef (p<0.001) and moderate anaemia (p<0.001). 
CONCLUSIONS AND CLINICAL RELEVANCE Cluster analysis of words recorded in clinical histories submitted with blood samples from cases of TABA indicates that two potentially different disease syndromes were associated with T. orientalis Ikeda type infection. One was consistent with the affected cattle suffering from a severe regenerative extravascular haemolytic anaemia, the second displaying as ill thrift and diarrhoea, particularly in young beef cattle.
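
    The first analytical step above, testing whether the proportion of histories containing a given word differs between categories, is a Pearson chi-squared test on a contingency table. A minimal sketch of the statistic, assuming a hypothetical 2x2 table (word present or absent, by farm type) with non-zero row and column totals; the counts are not from the study:

```python
def chi_square(table):
    """Pearson chi-squared statistic and degrees of freedom for an r x c table.

    Assumes every row and column total is non-zero.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if word use were independent of the category.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof

# Hypothetical counts: rows = word present / absent, columns = dairy / beef.
table = [
    [30, 10],
    [20, 40],
]
stat, dof = chi_square(table)
```

    With one degree of freedom, a statistic above 3.841 corresponds to p < 0.05, so a table like this one would indicate that the word is used at different rates on dairy and beef farms.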

  16. The X-CLASS-redMaPPer galaxy cluster comparison. I. Identification procedures

    NASA Astrophysics Data System (ADS)

    Sadibekova, T.; Pierre, M.; Clerc, N.; Faccioli, L.; Gastaud, R.; Le Fevre, J.-P.; Rozo, E.; Rykoff, E.

    2014-11-01

    Context. This paper is the first in a series undertaking a comprehensive correlation analysis between optically selected and X-ray-selected cluster catalogues. The rationale of the project is to develop a holistic picture of galaxy clusters utilising optical and X-ray-cluster-selected catalogues with well-understood selection functions. Aims: Unlike most of the X-ray/optical cluster correlations to date, the present paper focuses on the non-matching objects in either waveband. We investigate how the differences observed between the optical and X-ray catalogues may stem from (1) a shortcoming of the detection algorithms; (2) dispersion in the X-ray/optical scaling relations; or (3) substantial intrinsic differences between the cluster populations probed in the X-ray and optical bands. The aim is to inventory and elucidate these effects in order to account for selection biases in the further determination of X-ray/optical cluster scaling relations. Methods: We correlated the X-CLASS serendipitous cluster catalogue extracted from the XMM archive with the redMaPPer optical cluster catalogue derived from the Sloan Digital Sky Survey (DR8). We performed a detailed and, in large part, interactive analysis of the matching output from the correlation. The overlap between the two catalogues has been accurately determined and possible cluster positional errors were manually recovered. The final samples comprise 270 and 355 redMaPPer and X-CLASS clusters, respectively. X-ray cluster matching rates were analysed as a function of optical richness. In the second step, the redMaPPer clusters were correlated with the entire X-ray catalogue, containing point and uncharacterised sources (down to a few 10⁻¹⁵ erg s⁻¹ cm⁻² in the [0.5-2] keV band). A stacking analysis was performed for the remaining undetected optical clusters. Results: We find that all rich (λ ≥ 80) clusters are detected in X-rays out to z = 0.6. 
Below this redshift, the richness threshold for X-ray detection steadily decreases with redshift. Likewise, all X-ray bright clusters are detected by redMaPPer. After correcting for obvious pipeline shortcomings (about 10% of the cases both in optical and X-ray), ~50% of the redMaPPer clusters (down to a richness of 20) are found to coincide with an X-CLASS cluster; when considering X-ray sources of any type, this fraction increases to ~80%; for the remaining objects, the stacking analysis finds a weak signal within 0.5 Mpc around the cluster optical centres. The fraction of clusters totally dominated by AGN-type emission appears to be a few percent. Conversely, ~40% of the X-CLASS clusters are identified with a redMaPPer cluster (down to a richness of 20) - part of the non-matches being due to the X-CLASS sample extending further out than redMaPPer (z< 1.5 vs. z< 0.6), but extending the correlation down to a richness of 5 raises the matching rate to ~65%. Conclusions: This state-of-the-art study involving two well-validated cluster catalogues has shown itself to be complex, and it points to a number of issues inherent to blind cross-matching, owing both to pipeline shortcomings and peculiar cluster properties. These can only be accounted for after a manual check. The combined X-ray and optical scaling relations will be presented in a subsequent article.
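
    The blind positional cross-matching discussed in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the tuple layout (name, RA, Dec in degrees) and the fixed matching radius are assumptions made for illustration, and a real analysis would likely use a redshift- or size-dependent radius followed by the manual checks the abstract describes.

```python
import math

def angular_sep_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula); inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    h = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(h))))

def cross_match(cat_a, cat_b, radius_deg):
    """Map each source in cat_a to the nearest cat_b source within radius_deg.

    Catalogues are lists of (name, ra_deg, dec_deg) tuples (hypothetical layout).
    Sources with no counterpart inside the radius map to None.
    """
    matches = {}
    for name_a, ra_a, dec_a in cat_a:
        best_name, best_sep = None, radius_deg
        for name_b, ra_b, dec_b in cat_b:
            sep = angular_sep_deg(ra_a, dec_a, ra_b, dec_b)
            if sep <= best_sep:
                best_name, best_sep = name_b, sep
        matches[name_a] = best_name
    return matches

def unmatched(matches):
    """Names of sources left without a counterpart (the non-matching objects)."""
    return [name for name, partner in matches.items() if partner is None]
```

    The brute-force double loop is adequate for catalogues of a few hundred objects, as here; larger catalogues would normally call for a spatial index.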

  17. A model of the evaporation of binary-fuel clusters of drops

    NASA Technical Reports Server (NTRS)

    Harstad, K.; Bellan, J.

    1991-01-01

    A formulation has been developed to describe the evaporation of dense or dilute clusters of binary-fuel drops. The binary fuel is assumed to be made of a solute and a solvent whose volatility is much lower than that of the solute. Convective flow effects, inducing a circulatory motion inside the drops, are taken into account, as well as turbulence external to the cluster volume. Results obtained with this model show that, similar to the conclusions for single isolated drops, the evaporation of the volatile is controlled by liquid mass diffusion when the cluster is dilute. In contrast, when the cluster is dense, the evaporation of the volatile is controlled by surface layer stripping, that is, by the regression rate of the drop, which is in fact controlled by the evaporation rate of the solvent. These conclusions are in agreement with existing experimental observations. Parametric studies show that these conclusions remain valid with changes in ambient temperature, initial slip velocity between drops and gas, initial drop size, initial cluster size, initial liquid mass fraction of the solute, and various combinations of solvent and solute. The implications of these results for computationally intensive combustor calculations are discussed.

  18. Variable number of tandem repeats and pulsed-field gel electrophoresis cluster analysis of enterohemorrhagic Escherichia coli serovar O157 strains.

    PubMed

    Yokoyama, Eiji; Uchimura, Masako

    2007-11-01

    Ninety-five enterohemorrhagic Escherichia coli serovar O157 strains, including 30 strains isolated from 13 intrafamily outbreaks and 14 strains isolated from 3 mass outbreaks, were studied by pulsed-field gel electrophoresis (PFGE) and variable number of tandem repeats (VNTR) typing, and the resulting data were subjected to cluster analysis. Cluster analysis of the VNTR typing data revealed that 57 (60.0%) of 95 strains, including all epidemiologically linked strains, formed clusters with at least 95% similarity. Cluster analysis of the PFGE patterns revealed that 67 (70.5%) of 95 strains, including all but 1 of the epidemiologically linked strains, formed clusters with 90% similarity. The number of epidemiologically unlinked strains forming clusters was significantly less by VNTR cluster analysis than by PFGE cluster analysis. The congruence value between PFGE and VNTR cluster analysis was low and did not show an obvious correlation. With two-step cluster analysis, the number of clustered epidemiologically unlinked strains by PFGE cluster analysis that were divided by subsequent VNTR cluster analysis was significantly higher than the number by VNTR cluster analysis that were divided by subsequent PFGE cluster analysis. These results indicate that VNTR cluster analysis is more efficient than PFGE cluster analysis as an epidemiological tool to trace the transmission of enterohemorrhagic E. coli O157.

  19. Structural and chemical orders in Ni64.5Zr35.5 metallic glass by molecular dynamics simulation

    DOE PAGES

    Tang, L.; Wen, T. Q.; Wang, N.; ...

    2018-03-06

    The atomic structure of Ni64.5Zr35.5 metallic glass has been investigated by molecular dynamics (MD) simulations. The calculated structure factors from the MD glassy sample at room temperature agree well with the X-ray diffraction (XRD) and neutron diffraction (ND) experimental data. Using the pairwise cluster alignment and clique analysis methods, we show that there are three types of dominant short-range order (SRO) motifs around Ni atoms in the glass sample of Ni64.5Zr35.5, i.e., Mixed-Icosahedron(ICO)-Cube, Twined-Cube and icosahedron-like clusters. Furthermore, chemical order and medium-range order (MRO) analysis show that the Mixed-ICO-Cube and Twined-Cube clusters exhibit the characteristics of the crystalline B2 phase. In conclusion, our simulation results suggest that the weak glass-forming ability (GFA) of Ni64.5Zr35.5 can be attributed to the competition between the glass forming ICO SRO and the crystalline Mixed-ICO-Cube and Twined-Cube motifs.

  20. Structural and chemical orders in Ni64.5Zr35.5 metallic glass by molecular dynamics simulation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tang, L.; Wen, T. Q.; Wang, N.

    The atomic structure of Ni64.5Zr35.5 metallic glass has been investigated by molecular dynamics (MD) simulations. The calculated structure factors from the MD glassy sample at room temperature agree well with the X-ray diffraction (XRD) and neutron diffraction (ND) experimental data. Using the pairwise cluster alignment and clique analysis methods, we show that there are three types of dominant short-range order (SRO) motifs around Ni atoms in the glass sample of Ni64.5Zr35.5, i.e., Mixed-Icosahedron(ICO)-Cube, Twined-Cube and icosahedron-like clusters. Furthermore, chemical order and medium-range order (MRO) analysis show that the Mixed-ICO-Cube and Twined-Cube clusters exhibit the characteristics of the crystalline B2 phase. In conclusion, our simulation results suggest that the weak glass-forming ability (GFA) of Ni64.5Zr35.5 can be attributed to the competition between the glass forming ICO SRO and the crystalline Mixed-ICO-Cube and Twined-Cube motifs.

  1. Helicobacter pylori with the Intact dupA Cluster is more Virulent than the Strains with the Incomplete dupA Cluster.

    PubMed

    Wang, Ming-yi; Shao, Chen; Li, Jie; Yang, Ya-Chao; Wang, Shao-bo; Hao, Jun-ling; Wu, Chun-mei; Gao, Xiao-zhong; Shao, Shi-he

    2015-07-01

    The duodenal ulcer promoting gene (dupA), located in the plasticity region of Helicobacter pylori (H. pylori), is predicted to form a type IV secretory system (T4SS) with vir genes around dupA. In this study, we investigated the association between the dupA cluster status and the virulence of H. pylori in a littoral region of Northeast China. Two hundred and sixty-two H. pylori strains isolated from patients with chronic gastritis were examined to evaluate the dupA cluster status, cag PAI genes and vacA genotype using PCR and Western blot. Histopathologic evaluations of biopsy specimens were performed to analyse the association between the dupA cluster and the inflammatory response. IL-8 production in gastric mucosa and from GES-1 cells co-cultured with H. pylori was measured to analyse the association between the dupA cluster status and IL-8 production. We found that gastric mucosal inflammatory cell infiltration was significantly higher in patients with dupA-positive H. pylori, including H. pylori with complete dupA cluster (2.71 ± 0.79) and incomplete dupA cluster (2.09 ± 0.61) than in patients with dupA-negative strain (1.73 ± 0.60, p < 0.01), whereas no significant difference in the gastric mucosal atrophy was found according to the status of dupA cluster. Gastric mucosal IL-8 levels were higher in the complete dupA cluster group than in other groups (p < 0.01), and IL-8 production from GES-1 cells was also significantly higher in strains with a complete dupA cluster (1527.9 ± 180.0 pg/ml) than in those with an incomplete dupA cluster (1229.4 ± 75.3 pg/ml, p < 0.01) or dupA-negative strains (1201.9 ± 92.3 pg/ml, p < 0.01). In conclusion, the complete dupA cluster in H. pylori is associated with inflammatory cell infiltration and IL-8 secretion, and H. pylori strains with a complete dupA cluster seem to be more virulent than strains with an incomplete dupA cluster or dupA-negative strains.

  2. Patterns of comorbidity in community-dwelling older people hospitalised for fall-related injury: A cluster analysis

    PubMed Central

    2011-01-01

    Background Community-dwelling older people aged 65+ years sustain falls frequently; these can result in physical injuries necessitating medical attention including emergency department care and hospitalisation. Certain health conditions and impairments have been shown to contribute independently to the risk of falling or experiencing a fall injury, suggesting that individuals with these conditions or impairments should be the focus of falls prevention. Since older people commonly have multiple conditions/impairments, knowledge about which conditions/impairments coexist in at-risk individuals would be valuable in the implementation of a targeted prevention approach. The objective of this study was therefore to examine the prevalence and patterns of comorbidity in this population group. Methods We analysed hospitalisation data from Victoria, Australia's second most populous state, to estimate the prevalence of comorbidity in patients hospitalised at least once between 2005-6 and 2007-8 for treatment of acute fall-related injuries. In patients with two or more comorbid conditions (multicomorbidity) we used an agglomerative hierarchical clustering method to cluster comorbidity variables and identify constellations of conditions. Results More than one in four patients had at least one comorbid condition and among patients with comorbidity one in three had multicomorbidity (range 2-7). The prevalence of comorbidity varied by gender, age group, ethnicity and injury type; it was also associated with a significant increase in the average cumulative length of stay per patient. The cluster analysis identified five distinct, biologically plausible clusters of comorbidity: cardiopulmonary/metabolic, neurological, sensory, stroke and cancer. The cardiopulmonary/metabolic cluster was the largest cluster among the clusters identified. 
Conclusions The consequences of comorbidity clustering in terms of falls and/or injury outcomes of hospitalised patients should be investigated by future studies. Our findings have particular relevance for falls prevention strategies, clinical practice and planning of follow-up services for these patients. PMID:21851627

  3. Geotemporal Analysis of Neisseria meningitidis Clones in the United States: 2000–2005

    PubMed Central

    Wiringa, Ann E.; Shutt, Kathleen A.; Marsh, Jane W.; Cohn, Amanda C.; Messonnier, Nancy E.; Zansky, Shelley M.; Petit, Susan; Farley, Monica M.; Gershman, Ken; Lynfield, Ruth; Reingold, Arthur; Schaffner, William; Thompson, Jamie; Brown, Shawn T.; Lee, Bruce Y.; Harrison, Lee H.

    2013-01-01

    Background The detection of meningococcal outbreaks relies on serogrouping and epidemiologic definitions. Advances in molecular epidemiology have improved the ability to distinguish unique Neisseria meningitidis strains, enabling the classification of isolates into clones. Around 98% of meningococcal cases in the United States are believed to be sporadic. Methods Meningococcal isolates from 9 Active Bacterial Core surveillance sites throughout the United States from 2000 through 2005 were classified according to serogroup, multilocus sequence typing, and outer membrane protein (porA, porB, and fetA) genotyping. Clones were defined as isolates that were indistinguishable according to this characterization. Case data were aggregated to the census tract level and all non-singleton clones were assessed for non-random spatial and temporal clustering using retrospective space-time analyses with a discrete Poisson probability model. Results Among 1,062 geocoded cases with available isolates, 438 unique clones were identified, 78 of which had ≥2 isolates. 702 cases were attributable to non-singleton clones, accounting for 66.0% of all geocoded cases. 32 statistically significant clusters comprised of 107 cases (10.1% of all geocoded cases) were identified. Clusters had the following attributes: included 2 to 11 cases; 1 day to 33 months duration; radius of 0 to 61.7 km; and attack rate of 0.7 to 57.8 cases per 100,000 population. Serogroups represented among the clusters were: B (n = 12 clusters, 45 cases), C (n = 11 clusters, 27 cases), and Y (n = 9 clusters, 35 cases); 20 clusters (62.5%) were caused by serogroups represented in meningococcal vaccines that are commercially available in the United States. Conclusions Around 10% of meningococcal disease cases in the U.S. could be assigned to a geotemporal cluster. 
Molecular characterization of isolates, combined with geotemporal analysis, is a useful tool for understanding the spread of virulent meningococcal clones and patterns of transmission in populations. PMID:24349182
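
    The abstract does not say which test statistic was used with the discrete Poisson model. A commonly used choice for retrospective space-time scans is Kulldorff's log-likelihood ratio, sketched below purely as an assumption; the function name and argument names are hypothetical. For a candidate space-time cylinder with c observed cases and e expected cases out of C total cases, the statistic is c ln(c/e) + (C - c) ln((C - c)/(C - e)) when c > e, and 0 otherwise.

```python
import math

def poisson_scan_llr(observed, expected, total_cases):
    """Log-likelihood ratio for one candidate cluster under a discrete Poisson model.

    observed    -- cases inside the candidate space-time cylinder
    expected    -- cases expected inside it under the null (must be < total_cases)
    total_cases -- all cases in the study region and period
    Only excesses are scored: the statistic is 0 when observed <= expected.
    """
    if observed <= expected:
        return 0.0
    llr = observed * math.log(observed / expected)
    outside = total_cases - observed
    if outside > 0:
        llr += outside * math.log(outside / (total_cases - expected))
    return llr
```

    In a full scan the maximum of this statistic over all candidate cylinders is compared with its distribution under Monte Carlo replications of the null hypothesis; that step is omitted here.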

  4. A singular value decomposition approach for improved taxonomic classification of biological sequences

    PubMed Central

    2011-01-01

    Background Singular value decomposition (SVD) is a powerful technique for information retrieval; it helps uncover relationships between elements that are not prima facie related. SVD was initially developed to reduce the time needed for information retrieval and analysis of very large data sets in the complex internet environment. Since information retrieval from large-scale genome and proteome data sets has a similar level of complexity, SVD-based methods could also facilitate data analysis in this research area. Results We found that SVD applied to amino acid sequences demonstrates relationships and provides a basis for producing clusters and cladograms, demonstrating evolutionary relatedness of species that correlates well with Linnaean taxonomy. The choice of a reasonable number of singular values is crucial for SVD-based studies. We found that fewer singular values are needed to produce biologically significant clusters when SVD is employed. Subsequently, we developed a method to determine the lowest number of singular values and fewest clusters needed to guarantee biological significance; this system was developed and validated by comparison with Linnaean taxonomic classification. Conclusions By using SVD, we can reduce uncertainty concerning the appropriate rank value necessary to perform accurate information retrieval analyses. In tests, clusters that we developed with SVD perfectly matched what was expected based on Linnaean taxonomy. PMID:22369633

  5. Identification and Characterization of Unique Subgroups of Chronic Pain Individuals with Dispositional Personality Traits.

    PubMed

    Mehta, S; Rice, D; McIntyre, A; Getty, H; Speechley, M; Sequeira, K; Shapiro, A P; Morley-Forster, P; Teasell, R W

    2016-01-01

    Objective. The current study attempted to identify and characterize distinct chronic pain (CP) subgroups based on their level of dispositional personality traits. The secondary objective was to compare the difference among the subgroups in mood, coping, and disability. Methods. Individuals with chronic pain were assessed for demographic, psychosocial, and personality measures. A two-step cluster analysis was conducted in order to identify distinct subgroups of patients based on their level of personality traits. Differences in clinical outcomes were compared using the multivariate analysis of variance based on cluster membership. Results. In 229 participants, three clusters were formed. No significant difference was seen among the clusters on patient demographic factors including age, sex, relationship status, duration of pain, and pain intensity. Those with high levels of dispositional personality traits had greater levels of mood impairment compared to the other two groups (p < 0.05). A significant difference in disability was seen between the subgroups. Conclusions. The study identified a high risk group of CP individuals whose level of personality traits significantly correlated with impaired mood and coping. Use of pharmacological treatment alone may not be successful in improving clinical outcomes among these individuals. Instead, a more comprehensive treatment involving psychological treatments may be important in managing the personality traits that interfere with recovery.

  6. Progressive myoclonic epilepsies

    PubMed Central

    Michelucci, Roberto; Canafoglia, Laura; Striano, Pasquale; Gambardella, Antonio; Magaudda, Adriana; Tinuper, Paolo; La Neve, Angela; Ferlazzo, Edoardo; Gobbi, Giuseppe; Giallonardo, Anna Teresa; Capovilla, Giuseppe; Visani, Elisa; Panzica, Ferruccio; Avanzini, Giuliano; Tassinari, Carlo Alberto; Bianchi, Amedeo; Zara, Federico

    2014-01-01

    Objective: To define the clinical spectrum and etiology of progressive myoclonic epilepsies (PMEs) in Italy using a database developed by the Genetics Commission of the Italian League against Epilepsy. Methods: We collected clinical and laboratory data from patients referred to 25 Italian epilepsy centers regardless of whether a positive causative factor was identified. PMEs of undetermined origins were grouped using 2-step cluster analysis. Results: We collected clinical data from 204 patients, including 77 with a diagnosis of Unverricht-Lundborg disease and 37 with a diagnosis of Lafora body disease; 31 patients had PMEs due to rarer genetic causes, mainly neuronal ceroid lipofuscinoses. Two more patients had celiac disease. Despite extensive investigation, we found no definitive etiology for 57 patients. Cluster analysis indicated that these patients could be grouped into 2 clusters defined by age at disease onset, age at myoclonus onset, previous psychomotor delay, seizure characteristics, photosensitivity, associated signs other than those included in the cardinal definition of PME, and pathologic MRI findings. Conclusions: Information concerning the distribution of different genetic causes of PMEs may provide a framework for an updated diagnostic workup. Phenotypes of the patients with PME of undetermined cause varied widely. The presence of separate clusters suggests that novel forms of PME are yet to be clinically and genetically characterized. PMID:24384641

  7. Phylogenetic Evidence for Lateral Gene Transfer in the Intestine of Marine Iguanas

    PubMed Central

    Nelson, David M.; Cann, Isaac K. O.; Altermann, Eric; Mackie, Roderick I.

    2010-01-01

    Background Lateral gene transfer (LGT) appears to promote genotypic and phenotypic variation in microbial communities in a range of environments, including the mammalian intestine. However, the extent and mechanisms of LGT in intestinal microbial communities of non-mammalian hosts remain poorly understood. Methodology/Principal Findings We sequenced two fosmid inserts obtained from a genomic DNA library derived from an agar-degrading enrichment culture of marine iguana fecal material. The inserts harbored 16S rRNA genes that place the organism from which they originated within Clostridium cluster IV, a well documented group that inhabits the mammalian intestinal tract. However, sequence analysis indicates that 52% of the protein-coding genes on the fosmids have top BLASTX hits to bacterial species that are not members of Clostridium cluster IV, and phylogenetic analysis suggests that at least 10 of 44 coding genes on the fosmids may have been transferred from Clostridium cluster XIVa to cluster IV. The fosmids encoded four transposase-encoding genes and an integrase-encoding gene, suggesting their involvement in LGT. In addition, several coding genes likely involved in sugar transport were probably acquired through LGT. Conclusion Our phylogenetic evidence suggests that LGT may be common among phylogenetically distinct members of the phylum Firmicutes inhabiting the intestinal tract of marine iguanas. PMID:20520734

  8. Characteristics of HIV-infected U.S. Army soldiers linked in molecular transmission clusters, 2001-2012

    PubMed Central

    Jagodzinski, Linda L.; Liu, Ying; Pham, Peter T.; Kijak, Gustavo H.; Tovanabutra, Sodsai; McCutchan, Francine E.; Scoville, Stephanie L.; Cersovsky, Steven B.; Michael, Nelson L.; Scott, Paul T.; Peel, Sheila A.

    2017-01-01

    Objective Recent surveillance data suggest the United States (U.S.) Army HIV epidemic is concentrated among men who have sex with men. To identify potential targets for HIV prevention strategies, the relationship between demographic and clinical factors and membership within transmission clusters based on baseline pol sequences of HIV-infected Soldiers from 2001 through 2012 was analyzed. Methods We conducted a retrospective analysis of baseline partial pol sequences, demographic and clinical characteristics available for all Soldiers in active service and newly-diagnosed with HIV-1 infection from January 1, 2001 through December 31, 2012. HIV-1 subtype designations and transmission clusters were identified from phylogenetic analysis of sequences. Univariate and multivariate logistic regression models were used to evaluate and adjust for the association between characteristics and cluster membership. Results Among 518 of 995 HIV-infected Soldiers with available partial pol sequences, 29% were members of a transmission cluster. Assignment to a southern U.S. region at diagnosis and year of diagnosis were independently associated with cluster membership after adjustment for other significant characteristics (p<0.10) of age, race, year of diagnosis, region of duty assignment, sexually transmitted infections, last negative HIV test, antiretroviral therapy, and transmitted drug resistance. Subtyping of the pol fragment indicated HIV-1 subtype B infection predominated (94%) among HIV-infected Soldiers. Conclusion These findings identify areas to explore as HIV prevention targets in the U.S. Army. An increased frequency of current force testing may be justified, especially among Soldiers assigned to duty in installations with high local HIV prevalence such as southern U.S. states. PMID:28759645

  9. Universal dynamical properties preclude standard clustering in a large class of biochemical data.

    PubMed

    Gomez, Florian; Stoop, Ralph L; Stoop, Ruedi

    2014-09-01

    Clustering of chemical and biochemical data based on observed features is a central cognitive step in the analysis of chemical substances, in particular in combinatorial chemistry, or of complex biochemical reaction networks. Often, for reasons unknown to the researcher, this step produces disappointing results. Once the sources of the problem are known, improved clustering methods might revitalize the statistical approach of compound and reaction search and analysis. Here, we present a generic mechanism that may be at the origin of many clustering difficulties. The variety of dynamical behaviors that can be exhibited by complex biochemical reactions on variation of the system parameters are fundamental system fingerprints. In parameter space, shrimp-like or swallow-tail structures separate parameter sets that lead to stable periodic dynamical behavior from those leading to irregular behavior. We work out the genericity of this phenomenon and demonstrate novel examples for their occurrence in realistic models of biophysics. Although we elucidate the phenomenon by considering the emergence of periodicity in dependence on system parameters in a low-dimensional parameter space, the conclusions from our simple setting are shown to continue to be valid for features in a higher-dimensional feature space, as long as the feature-generating mechanism is not too extreme and the dimension of this space is not too high compared with the amount of available data. For online versions of super-paramagnetic clustering see http://stoop.ini.uzh.ch/research/clustering. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  10. Pinpointing clusters of apparently sporadic cases of Legionnaires' disease.

    PubMed Central

    Bhopal, R. S.; Diggle, P.; Rowlingson, B.

    1992-01-01

    OBJECTIVES--To test the hypothesis that many non-outbreak cases of legionnaires' disease are not sporadic and to attempt to pinpoint cases clustering in space and time. DESIGN--Descriptive study of a case series, 1978-86. SETTING--15 health boards in Scotland. PATIENTS--203 probable cases of non-outbreak, non-travel, community acquired legionnaires' disease in patients resident in Scotland. MAIN MEASURES--Date of onset of disease and postcode and health board of residence of cases. RESULTS--Space-time clustering was present and numerous groups of cases were identified, all but two being newly recognised. Nine cases occurred during three months within two postcodes in Edinburgh, and an outbreak was probably missed. In several places cases occurred in one area over a prolonged period--for example, nine cases in postcode districts G11.5 and G12.8 in Glasgow during five years (estimated mean annual incidence of community acquired, non-outbreak, non-travel legionnaires' disease of 146 per million residents v 4.8 per million for Scotland). Statistical analysis showed that the space-time clustering of cases in the Glasgow and Edinburgh areas was unusual (p = 0.036, p = 0.068 respectively). CONCLUSION--Future surveillance requires greater awareness that clusters can be overlooked; case searching whenever a case is identified; collection of complete information particularly of date of onset of the disease and address or postcode; ongoing analysis for space-time clustering; and an accurate yet workable definition of sporadic cases. Other researchers should re-examine their data on apparently sporadic infection. PMID:1586784

  11. A taxonomy of epithelial human cancer and their metastases

    PubMed Central

    2009-01-01

    Background Microarray technology has allowed the molecular characterization of many different cancer sites. This technology has the potential to individualize therapy and to discover new drug targets. However, due to technological differences and issues in standardized sample collection no study has evaluated the molecular profile of epithelial human cancer in a large number of samples and tissues. Additionally, it has not yet been extensively investigated whether metastases resemble their tissue of origin or tissue of destination. Methods We studied the expression profiles of a series of 1566 primary tumors and 178 metastases by unsupervised hierarchical clustering. The clustering profile was subsequently investigated and correlated with clinico-pathological data. Statistical enrichment of clinico-pathological annotations of groups of samples was investigated using Fisher exact test. Gene set enrichment analysis (GSEA) and DAVID functional enrichment analysis were used to investigate the molecular pathways. Kaplan-Meier survival analysis and log-rank tests were used to investigate prognostic significance of gene signatures. Results Large clusters corresponding to breast, gastrointestinal, ovarian and kidney primary tissues emerged from the data. Chromophobe renal cell carcinoma clustered together with follicular differentiated thyroid carcinoma, which supports recent morphological descriptions of thyroid follicular carcinoma-like tumors in the kidney and suggests that they represent a subtype of chromophobe carcinoma. We also found an expression signature identifying primary tumors of squamous cell histology in multiple tissues. Next, a subset of ovarian tumors enriched with endometrioid histology clustered together with endometrium tumors, confirming that they share their etiopathogenesis, which strongly differs from serous ovarian tumors. In addition, the clustering of colon and breast tumors correlated with clinico-pathological characteristics. 
Moreover, a signature was developed based on our unsupervised clustering of breast tumors and this was predictive for disease-specific survival in three independent studies. Next, the metastases from ovarian, breast, lung and vulva cluster with their tissue of origin while metastases from colon showed a bimodal distribution. A significant part clusters with tissue of origin while the remaining tumors cluster with the tissue of destination. Conclusion Our molecular taxonomy of epithelial human cancer indicates surprising correlations over tissues. This may have a significant impact on the classification of many cancer sites and may guide pathologists, both in research and daily practice. Moreover, these results based on unsupervised analysis yielded a signature predictive of clinical outcome in breast cancer. Additionally, we hypothesize that metastases from gastrointestinal origin either remember their tissue of origin or adapt to the tissue of destination. More specifically, colon metastases in the liver show strong evidence for such a bimodal tissue specific profile. PMID:20017941

  12. SSR analysis of genetic diversity and structure of the germplasm of faba bean (Vicia faba L.).

    PubMed

    El-Esawi, Mohamed A

    Assessing the diversity and genetic structure of faba bean (Vicia faba L.) germplasm is essential to improve the quality and yield of this economically important crop. In this study, simple sequence repeats (SSRs) were utilized to evaluate the diversity and structure of 35 faba bean genotypes originating from three different geographical regions (Northern Africa, Eastern Africa, and Near East). All 15 SSR loci generated a total of 100 alleles. The allele number per locus varied from 4 to 11, with a mean of 6.67. The expected heterozygosity (He) of SSR loci ranged between 0.51 and 0.81, with a mean of 0.63. The PIC value also varied from 0.44 to 0.78, with an average of 0.58. The expected heterozygosity of 22 faba bean genotypes was higher than the observed one. Interestingly, AMOVA analysis showed that much of the variability resided within accessions (79.2%). A highly significant difference among regions was also evidenced, and represented 5.3% of the total variation. Moreover, cluster analysis divided the 35 faba bean genotypes into two main clusters. The first main cluster comprised all faba bean genotypes originating from the Near East region, whereas the second main cluster comprised all the genotypes originating from the Northern and Eastern Africa regions, indicating that the Northern and Eastern African faba bean genotypes were more closely related to each other than to the Near East genotypes. Structure analysis also revealed that the 35 faba bean genotypes might be assigned to two populations, in complete accordance with cluster analysis data. In conclusion, this study showed high levels of diversity in the analysed genotypes of faba bean, and could be utilized in future breeding programmes to develop new cultivars of high yield. Copyright © 2017 Académie des sciences. Published by Elsevier Masson SAS. All rights reserved.
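
    The per-locus statistics quoted in this abstract can be illustrated with a short sketch. The abstract does not give its formulas, so the code assumes the standard definitions: expected heterozygosity He = 1 - sum(p_i^2) and the polymorphism information content PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2, where p_i are the allele frequencies at one locus. Function names are hypothetical and the inputs in the usage note are not data from the study.

```python
def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2) for the allele frequencies at a single locus."""
    return 1.0 - sum(p * p for p in freqs)

def polymorphism_information_content(freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2 for a single locus."""
    cross = 0.0
    for i in range(len(freqs)):
        for j in range(i + 1, len(freqs)):
            cross += 2.0 * freqs[i] ** 2 * freqs[j] ** 2
    return expected_heterozygosity(freqs) - cross

def mean_alleles_per_locus(total_alleles, n_loci):
    """Average number of alleles per locus, rounded to two decimals."""
    return round(total_alleles / n_loci, 2)
```

    As a consistency check on the reported figures, mean_alleles_per_locus(100, 15) gives 6.67, matching the mean stated in the abstract. PIC is never larger than He at the same locus, which agrees with the reported ranges (PIC 0.44 to 0.78 against He 0.51 to 0.81).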

  13. Structure and substructure analysis of DAFT/FADA galaxy clusters in the [0.4-0.9] redshift range

    NASA Astrophysics Data System (ADS)

    Guennou, L.; Adami, C.; Durret, F.; Lima Neto, G. B.; Ulmer, M. P.; Clowe, D.; LeBrun, V.; Martinet, N.; Allam, S.; Annis, J.; Basa, S.; Benoist, C.; Biviano, A.; Cappi, A.; Cypriano, E. S.; Gavazzi, R.; Halliday, C.; Ilbert, O.; Jullo, E.; Just, D.; Limousin, M.; Márquez, I.; Mazure, A.; Murphy, K. J.; Plana, H.; Rostagni, F.; Russeil, D.; Schirmer, M.; Slezak, E.; Tucker, D.; Zaritsky, D.; Ziegler, B.

    2014-01-01

    Context. The DAFT/FADA survey is based on the study of ~90 rich (masses found in the literature >2 × 10^14 M⊙) and moderately distant clusters (redshifts 0.4 < z < 0.9), all with HST imaging data available. This survey has two main objectives: to constrain dark energy (DE) using weak lensing tomography on galaxy clusters and to build a database (deep multi-band imaging allowing photometric redshift estimates, spectroscopic data, X-ray data) of rich distant clusters to study their properties. Aims: We analyse the structures of all the clusters in the DAFT/FADA survey for which XMM-Newton and/or a sufficient number of galaxy redshifts in the cluster range are available, with the aim of detecting substructures and evidence for merging events. These properties are discussed in the framework of standard cold dark matter (ΛCDM) cosmology. Methods: In X-rays, we analysed the XMM-Newton data available, fit a β-model, and subtracted it to identify residuals. We used Chandra data, when available, to identify point sources. In the optical, we applied a Serna & Gerbal (SG) analysis to clusters with at least 15 spectroscopic galaxy redshifts available in the cluster range. We discuss the substructure detection efficiencies of both methods. Results: XMM-Newton data were available for 32 clusters, for which we derive the X-ray luminosity and a global X-ray temperature for 25 of them. For 23 clusters we were able to fit the X-ray emissivity with a β-model and subtract it to detect substructures in the X-ray gas. A dynamical analysis based on the SG method was applied to the clusters having at least 15 spectroscopic galaxy redshifts in the cluster range: 18 X-ray clusters and 11 clusters with no X-ray data. The choice of a minimum number of 15 redshifts implies that only major substructures will be detected. Ten substructures were detected both in X-rays and by the SG method. 
Most of the substructures detected both in X-rays and with the SG method are probably at their first cluster pericentre approach and are relatively recent infalls. We also find hints of a decreasing X-ray gas density profile core radius with redshift. Conclusions: The percentage of mass included in substructures was found to be roughly constant with redshift values of 5-15%, in agreement both with the general CDM framework and with the results of numerical simulations. Galaxies in substructures show the same general behaviour as regular cluster galaxies; however, in substructures, there is a deficiency of both late type and old stellar population galaxies. Late type galaxies with recent bursts of star formation seem to be missing in the substructures close to the bottom of the host cluster potential well. However, our sample would need to be increased to allow a more robust analysis. Tables 1, 2, 4 and Appendices A-C are available in electronic form at http://www.aanda.org
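
    The fit-and-subtract step described in the Methods can be sketched in one dimension. This is only an illustration under stated assumptions: it uses the standard surface-brightness β-model S(r) = S0 (1 + (r/rc)^2)^(0.5 - 3β) on a radial profile, whereas the authors worked on XMM-Newton images; the grid-search fit, the function names and the synthetic profile are all hypothetical.

```python
def beta_model(r, s0, rc, beta):
    """Standard beta-model surface brightness at projected radius r."""
    return s0 * (1.0 + (r / rc) ** 2) ** (0.5 - 3.0 * beta)


def fit_beta_model(radii, profile, rc_grid, beta_grid):
    """Crude least-squares fit: grid over (rc, beta), closed-form S0."""
    best = None
    for rc in rc_grid:
        for beta in beta_grid:
            shape = [beta_model(r, 1.0, rc, beta) for r in radii]
            # For fixed shape, the best amplitude is a linear least-squares solution.
            s0 = sum(p * m for p, m in zip(profile, shape)) / sum(m * m for m in shape)
            sse = sum((p - s0 * m) ** 2 for p, m in zip(profile, shape))
            if best is None or sse < best[0]:
                best = (sse, s0, rc, beta)
    return best[1:]


def residuals(radii, profile, params):
    """Observed profile minus fitted model; positive excesses hint at substructure."""
    s0, rc, beta = params
    return [p - beta_model(r, s0, rc, beta) for r, p in zip(radii, profile)]


# Synthetic, noise-free profile (hypothetical units) to exercise the fit.
radii = [10.0 * k for k in range(1, 31)]
clean = [beta_model(r, 100.0, 50.0, 0.6) for r in radii]
params = fit_beta_model(radii, clean, [30.0, 40.0, 50.0, 60.0, 70.0],
                        [0.4, 0.5, 0.6, 0.7, 0.8])
```

    In practice a proper optimiser and a two-dimensional model would replace the grid search; the point here is only that substructure candidates are whatever remains after the smooth model is removed.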

  14. Geovisual analytics to enhance spatial scan statistic interpretation: an analysis of U.S. cervical cancer mortality

    PubMed Central

    Chen, Jin; Roth, Robert E; Naito, Adam T; Lengerich, Eugene J; MacEachren, Alan M

    2008-01-01

    Background Kulldorff's spatial scan statistic and its software implementation – SaTScan – are widely used for detecting and evaluating geographic clusters. However, two issues make using the method and interpreting its results non-trivial: (1) the method lacks cartographic support for understanding the clusters in geographic context and (2) results from the method are sensitive to parameter choices related to cluster scaling (abbreviated as scaling parameters), but the system provides no direct support for making these choices. We employ both established and novel geovisual analytics methods to address these issues and to enhance the interpretation of SaTScan results. We demonstrate our geovisual analytics approach in a case study analysis of cervical cancer mortality in the U.S. Results We address the first issue by providing an interactive visual interface to support the interpretation of SaTScan results. Our research to address the second issue prompted a broader discussion about the sensitivity of SaTScan results to parameter choices. Sensitivity has two components: (1) the method can identify clusters that, while being statistically significant, have heterogeneous contents comprised of both high-risk and low-risk locations and (2) the method can identify clusters that are unstable in location and size as the spatial scan scaling parameter is varied. To investigate cluster result stability, we conducted multiple SaTScan runs with systematically selected parameters. The results, when scanning a large spatial dataset (e.g., U.S. data aggregated by county), demonstrate that no single spatial scan scaling value is known to be optimal to identify clusters that exist at different scales; instead, multiple scans that vary the parameters are necessary. We introduce a novel method of measuring and visualizing reliability that facilitates identification of homogeneous clusters that are stable across analysis scales. 
Finally, we propose a logical approach to proceed through the analysis of SaTScan results. Conclusion The geovisual analytics approach described in this manuscript facilitates the interpretation of spatial cluster detection methods by providing cartographic representation of SaTScan results and by providing visualization methods and tools that support selection of SaTScan parameters. Our methods distinguish between heterogeneous and homogeneous clusters and assess the stability of clusters across analytic scales. Method We analyzed the cervical cancer mortality data for the United States aggregated by county between 2000 and 2004. We ran SaTScan on the dataset fifty times with different parameter choices. Our geovisual analytics approach couples SaTScan with our visual analytic platform, allowing users to interactively explore and compare SaTScan results produced by different parameter choices. The Standardized Mortality Ratio and reliability scores are visualized for all the counties to identify stable, homogeneous clusters. We evaluated our analysis result by comparing it to that produced by other independent techniques including the Empirical Bayes Smoothing and Kafadar spatial smoother methods. The geovisual analytics approach introduced here is developed and implemented in our Java-based Visual Inquiry Toolkit. PMID:18992163
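
    One plausible reading of the reliability idea above is sketched here: for each location, take the fraction of scan runs in which it falls inside a reported cluster. The abstract does not give the authors' exact formula, so this definition, the function name and the toy county identifiers are assumptions for illustration only.

```python
def reliability_scores(runs, locations):
    """Fraction of runs in which each location lies inside a reported cluster.

    runs: list of sets, one per scan run, holding the location ids that the
          run placed inside any significant cluster.
    """
    n_runs = len(runs)
    return {loc: sum(loc in clustered for clustered in runs) / n_runs
            for loc in locations}


# Hypothetical output of four scan runs with different scaling parameters.
runs = [{"A", "B"}, {"A"}, {"A", "C"}, {"A", "B"}]
scores = reliability_scores(runs, ["A", "B", "C", "D"])
```

    A location with a score near 1 is flagged regardless of the scaling parameter, while intermediate scores mark locations whose cluster membership depends on the parameter choice.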

  15. Gastrointestinal Fibroblasts Have Specialized, Diverse Transcriptional Phenotypes: A Comprehensive Gene Expression Analysis of Human Fibroblasts

    PubMed Central

    Ishii, Genichiro; Aoyagi, Kazuhiko; Sasaki, Hiroki; Ochiai, Atsushi

    2015-01-01

    Background Fibroblasts are the principal stromal cells that exist in whole organs and play vital roles in many biological processes. Although the functional diversity of fibroblasts has been estimated, a comprehensive analysis of fibroblasts from the whole body has not been performed and their transcriptional diversity has not been sufficiently explored. The aim of this study was to elucidate the transcriptional diversity of human fibroblasts within the whole body. Methods Global gene expression analysis was performed on 63 human primary fibroblasts from 13 organs. Of these, 32 fibroblasts from gastrointestinal organs (gastrointestinal fibroblasts: GIFs) were obtained from a pair of 2 anatomical sites: the submucosal layer (submucosal fibroblasts: SMFs) and the subperitoneal layer (subperitoneal fibroblasts: SPFs). Using hierarchical clustering analysis, we elucidated identifiable subgroups of fibroblasts and analyzed the transcriptional character of each subgroup. Results In unsupervised clustering, 2 major clusters that separate GIFs and non-GIFs were observed. Organ- and anatomical site-dependent clusters within GIFs were also observed. The signature genes that discriminated GIFs from non-GIFs, SMFs from SPFs, and the fibroblasts of one organ from another organ consisted of genes associated with transcriptional regulation, signaling ligands, and extracellular matrix remodeling. Conclusions GIFs are characteristic fibroblasts with specific gene expressions from transcriptional regulation, signaling ligands, and extracellular matrix remodeling related genes. In addition, the anatomical site- and organ-dependent diversity of GIFs was also discovered. These features of GIFs contribute to their specific physiological function and homeostatic maintenance, and create a functional diversity of the gastrointestinal tract. PMID:26046848

  16. White Matter Tract Integrity in Alzheimer's Disease vs. Late Onset Bipolar Disorder and Its Correlation with Systemic Inflammation and Oxidative Stress Biomarkers.

    PubMed

    Besga, Ariadna; Chyzhyk, Darya; Gonzalez-Ortega, Itxaso; Echeveste, Jon; Graña-Lecuona, Marina; Graña, Manuel; Gonzalez-Pinto, Ana

    2017-01-01

    Background: Late Onset Bipolar Disorder (LOBD) is the development of Bipolar Disorder (BD) at an age above 50 years. It is often difficult to differentiate from other aging dementias, such as Alzheimer's Disease (AD), because they share cognitive and behavioral impairment symptoms. Objectives: We look for white matter (WM) tract voxel clusters showing significant differences when comparing AD vs. LOBD, and their correlations with systemic blood plasma biomarkers (inflammatory, neurotrophic factors, and oxidative stress). Materials: A sample of healthy controls (HC) (n = 19), AD patients (n = 35), and LOBD patients (n = 24) was recruited at the Alava University Hospital. Blood plasma samples were obtained at recruitment time and analyzed to extract the inflammatory, oxidative stress, and neurotrophic factors. Several modalities of MRI were acquired for each subject. Methods: Fractional anisotropy (FA) coefficients are obtained from diffusion weighted imaging (DWI). Tract based spatial statistics (TBSS) finds FA skeleton clusters of WM tract voxels showing significant differences for all possible contrasts between HC, AD, and LOBD. An ANOVA F-test over all contrasts is carried out. Results of the F-test are used to mask TBSS detected clusters for the AD > LOBD and LOBD > AD contrasts to select the image clusters used for correlation analysis. Finally, Pearson's correlation coefficients between FA values at cluster sites and systemic blood plasma biomarker values are computed. Results: The TBSS contrasts masked by the ANOVA F-test identified strongly significant clusters in the forceps minor, inferior longitudinal fasciculus, inferior fronto-occipital fasciculus, and cingulum gyrus. The correlation analysis of these tract clusters found strong negative correlation of AD with the nerve growth factor (NGF) and brain derived neurotrophic factor (BDNF) blood biomarkers. Negative correlation of AD and positive correlation of LOBD with inflammation biomarker IL6 was also found. Conclusion: The tract atlas localizations of the TBSS voxel clusters are consistent with greater behavioral impairment and mood disorders in LOBD than in AD. Correlation analysis confirms that neurotrophic factors (i.e., NGF, BDNF) play a major role in AD while being absent in LOBD pathophysiology. Also, correlation results of IL1 and IL6 suggest stronger inflammatory effects in LOBD than in AD.

  17. Kinematic evidence of satellite galaxy populations in the potential wells of first-ranked cluster galaxies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cowie, L.L.; Hu, E.M.

    1986-06-01

    The velocities of 38 centrally positioned galaxies (r much less than 100 kpc) were measured relative to the velocity of the first-ranked galaxy in 14 rich clusters. Analysis of the velocity distribution function of this sample and of previous data shows that the population cannot be fit by a single Gaussian. An adequate fit is obtained if 60 percent of the objects lie in a Gaussian with sigma = 250 km/s and the remainder in a population with sigma = 1400 km/s. All previous data sets are individually consistent with this conclusion. This suggests that there is a bound population of galaxies in the potential well of the central galaxy in addition to the normal population of the cluster core. This is taken as supporting evidence for the galactic cannibalism model of cD galaxy formation. 14 references.
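
    The two-population fit quoted above can be written down as a Gaussian mixture. The sketch below uses the weights and dispersions from the abstract (60 percent at sigma = 250 km/s, the remainder at sigma = 1400 km/s) and assumes, for illustration, that both components are centred on zero relative velocity; the function names are hypothetical.

```python
import math


def gaussian_pdf(v, sigma):
    """Zero-mean Gaussian density in velocity v (km/s)."""
    return math.exp(-0.5 * (v / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))


def mixture_pdf(v, weight=0.6, sigma_narrow=250.0, sigma_broad=1400.0):
    """Density of the two-component velocity distribution."""
    return (weight * gaussian_pdf(v, sigma_narrow)
            + (1.0 - weight) * gaussian_pdf(v, sigma_broad))


def narrow_membership(v, weight=0.6, sigma_narrow=250.0, sigma_broad=1400.0):
    """Probability that a galaxy at relative velocity v belongs to the
    narrow (bound) component, by Bayes' rule on the mixture."""
    narrow = weight * gaussian_pdf(v, sigma_narrow)
    return narrow / mixture_pdf(v, weight, sigma_narrow, sigma_broad)
```

    Under these assumptions a galaxy at zero relative velocity is attributed to the narrow component with probability of about 0.89, while one at 1000 km/s is almost certainly a member of the broad cluster-core population.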

  18. Kinematic evidence of satellite galaxy populations in the potential wells of first-ranked cluster galaxies

    NASA Technical Reports Server (NTRS)

    Cowie, L. L.; Hu, E. M.

    1986-01-01

    The velocities of 38 centrally positioned galaxies (r much less than 100 kpc) were measured relative to the velocity of the first-ranked galaxy in 14 rich clusters. Analysis of the velocity distribution function of this sample and of previous data shows that the population cannot be fit by a single Gaussian. An adequate fit is obtained if 60 percent of the objects lie in a Gaussian with sigma = 250 km/s and the remainder in a population with sigma = 1400 km/s. All previous data sets are individually consistent with this conclusion. This suggests that there is a bound population of galaxies in the potential well of the central galaxy in addition to the normal population of the cluster core. This is taken as supporting evidence for the galactic cannibalism model of cD galaxy formation.

  19. The Integrated Cluster Finder for the ARCHES project

    NASA Astrophysics Data System (ADS)

    Mints, Alexey; Schwope, Axel; Rosen, Simon; Pineau, François-Xavier; Carrera, Francisco

    2017-01-01

    Context. Clusters of galaxies are important for cosmology and astrophysics. They may be discovered through either the summed optical/IR radiation originating from their member galaxies or via X-ray emission originating from the hot intracluster medium. X-ray samples are not affected by projection effects but a redshift determination typically needs optical and infrared follow-up to then infer X-ray temperatures and luminosities. Aims: We want to confirm serendipitously discovered X-ray emitting cluster candidates and measure their cosmological redshift through the analysis and exploration of multi-wavelength photometric catalogues. Methods: We developed a tool, the Integrated Cluster Finder (ICF), to search for clusters by determining overdensities of potential member galaxies in optical and infrared catalogues. Based on a spectroscopic meta-catalogue we calibrated colour-redshift relations that combine optical (SDSS) and IR data (UKIDSS, WISE). The tool is used to quantify the overdensity of galaxies against the background via a modified redMaPPer technique and to quantify the confidence of a cluster detection. Results: Cluster finding results are compared to reference catalogues found in the literature. The results agree to within 95-98%. The tool is used to confirm 488 out of 830 cluster candidates drawn from 3XMMe in the footprint of the SDSS and CFHT catalogues. Conclusions: The ICF is a flexible and highly efficient tool to search for galaxy clusters in multiple catalogues and is freely available to the community. It may be used to identify the cluster content in future X-ray catalogues from XMM-Newton and eventually from eROSITA.

  20. Geospatial Distribution and Clustering of Chlamydia trachomatis in Communities Undergoing Mass Azithromycin Treatment

    PubMed Central

    Yohannan, Jithin; He, Bing; Wang, Jiangxia; Greene, Gregory; Schein, Yvette; Mkocha, Harran; Munoz, Beatriz; Quinn, Thomas C.; Gaydos, Charlotte; West, Sheila K.

    2014-01-01

    Purpose. We detected spatial clustering of households with Chlamydia trachomatis infection (CI) and active trachoma (AT) in villages undergoing mass treatment with azithromycin (MDA) over time. Methods. We obtained global positioning system (GPS) coordinates for all households in four villages in Kongwa District, Tanzania. Every 6 months for a period of 42 months, our team examined all children under 10 for AT, and tested for CI with ocular swabbing and Amplicor. Villages underwent four rounds of annual MDA. We classified households as having ≥1 child with CI (or AT) or having 0 children with CI (or AT). We calculated the difference in the K function between households with and without CI or AT to detect clustering at each time point. Results. Between 918 and 991 households were included over the 42 months of this analysis. At baseline, 306 households (32.59%) had ≥1 child with CI, which declined to 73 households (7.50%) at 42 months. We observed borderline clustering of households with CI at 12 months after one round of MDA and statistically significant clustering with growing cluster sizes between 18 and 24 months after two rounds of MDA. Clusters diminished in size at 30 months after 3 rounds of MDA. Active trachoma did not cluster at any time point. Conclusions. This study demonstrates that CI clusters after multiple rounds of MDA. Clusters of infection may increase in size if the annual antibiotic pressure is removed. The absence of growth after the three rounds suggests the start of control of transmission. PMID:24906862
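
    The difference-in-K statistic mentioned in the Methods can be sketched as follows. This is a minimal version, assuming Ripley's K estimated without edge correction and a known study-area size; the coordinates are hypothetical, and the random-labelling test normally used to judge significance is not shown.

```python
import math


def k_function(points, radius, area):
    """Naive Ripley's K at one radius (no edge correction)."""
    n = len(points)
    pairs = 0
    for i in range(n):
        for j in range(n):
            if i != j and math.dist(points[i], points[j]) <= radius:
                pairs += 1
    return area * pairs / (n * (n - 1))


def k_difference(cases, controls, radius, area):
    """Positive values suggest cases are more clustered than controls."""
    return k_function(cases, radius, area) - k_function(controls, radius, area)


# Hypothetical household coordinates: tightly grouped cases, dispersed controls.
cases = [(0, 0), (0, 1), (1, 0), (1, 1)]
controls = [(0, 0), (0, 10), (10, 0), (10, 10)]
diff = k_difference(cases, controls, radius=2.0, area=100.0)
```

    Comparing households with and without infection, rather than testing cases alone, controls for the fact that households themselves are not uniformly spread across a village.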

  1. Global, local and focused geographic clustering for case-control data with residential histories

    PubMed Central

    Jacquez, Geoffrey M; Kaufmann, Andy; Meliker, Jaymie; Goovaerts, Pierre; AvRuskin, Gillian; Nriagu, Jerome

    2005-01-01

    Background This paper introduces a new approach for evaluating clustering in case-control data that accounts for residential histories. Although many statistics have been proposed for assessing local, focused and global clustering in health outcomes, few, if any, exist for evaluating clusters when individuals are mobile. Methods Local, global and focused tests for residential histories are developed based on sets of matrices of nearest neighbor relationships that reflect the changing topology of cases and controls. Exposure traces are defined that account for the latency between exposure and disease manifestation, and that use exposure windows whose duration may vary. Several of the methods so derived are applied to evaluate clustering of residential histories in a case-control study of bladder cancer in south eastern Michigan. These data are still being collected and the analysis is conducted for demonstration purposes only. Results Statistically significant clustering of residential histories of cases was found but is likely due to delayed reporting of cases by one of the hospitals participating in the study. Conclusion Data with residential histories are preferable when causative exposures and disease latencies occur on a long enough time span that human mobility matters. To analyze such data, methods are needed that take residential histories into account. PMID:15784151

  2. Tardigrade workbench: comparing stress-related proteins, sequence-similar and functional protein clusters as well as RNA elements in tardigrades

    PubMed Central

    2009-01-01

    Background Tardigrades represent an animal phylum with extraordinary resistance to environmental stress. Results To gain insights into their stress-specific adaptation potential, major clusters of related and similar proteins are identified, as well as specific functional clusters delineated comparing all tardigrades and individual species (Milnesium tardigradum, Hypsibius dujardini, Echiniscus testudo, Tulinus stephaniae, Richtersius coronifer) and functional elements in tardigrade mRNAs are analysed. We find that 39.3% of the total sequences clustered in 58 clusters of more than 20 proteins. Among these are ten tardigrade specific as well as a number of stress-specific protein clusters. Tardigrade-specific functional adaptations include strong protein, DNA- and redox protection, maintenance and protein recycling. Specific regulatory elements regulate tardigrade mRNA stability such as lox P DICE elements whereas 14 other RNA elements of higher eukaryotes are not found. Further features of tardigrade specific adaptation are rapidly identified by sequence and/or pattern search on the web-tool tardigrade analyzer http://waterbear.bioapps.biozentrum.uni-wuerzburg.de. The work-bench offers nucleotide pattern analysis for promoter and regulatory element detection (tardigrade specific; nrdb) as well as rapid COG search for function assignments including species-specific repositories of all analysed data. Conclusion Different protein clusters and regulatory elements implicated in tardigrade stress adaptations are analysed including unpublished tardigrade sequences. PMID:19821996

  3. Intracluster light at the Frontier - II. The Frontier Fields Clusters

    NASA Astrophysics Data System (ADS)

    Montes, Mireia; Trujillo, Ignacio

    2018-02-01

    Multiwavelength deep observations are a key tool to understand the origin of the diffuse light in clusters of galaxies: the intracluster light (ICL). For this reason, we take advantage of the Hubble Frontier Fields (HFF) survey to investigate the properties of the stellar populations of the ICL of its six massive intermediate redshift (0.3 < z < 0.6) clusters. We carry out this analysis out to a radial distance of ~120 kpc from the brightest cluster galaxy. We found that the average metallicity of the ICL is [Fe/H]_ICL ~ -0.5, compatible with the value of the outskirts of the Milky Way. The mean stellar ages of the ICL are between 2 and 6 Gyr younger than the most massive galaxies of the clusters. Those results suggest that the ICL of these massive (>10^15 M⊙) clusters is formed by the stripping of MW-like objects that have been accreted at z < 1, in agreement with current simulations. We do not find any significant increase in the fraction of light of the ICL with cosmic time, although the redshift range explored is too narrow to derive any strong conclusion. When exploring the slope of the stellar mass density profile, we found that the ICL of the HFF clusters follows the shape of their underlying dark matter haloes, in agreement with the idea that the ICL is the result of the stripping of galaxies at recent times.

  4. Spatial Analysis of Hemorrhagic Fever with Renal Syndrome in Zibo City, China, 2009–2012

    PubMed Central

    Wang, Ling; Yang, Shuxia; Zhang, Ling; Cao, Haixia; Zhang, Yan; Hu, Haodong; Zhai, Shenyong

    2013-01-01

    Background Hemorrhagic fever with renal syndrome (HFRS) is highly endemic in mainland China, where human cases account for 90% of the total global cases. Zibo City is one of the most seriously affected areas in Shandong Province, China, with the HFRS incidence increasing sharply from 2009 to 2012. However, the hotspots of HFRS in Zibo remained unclear. Thus, a spatial analysis was conducted to explore the spatial, spatial-temporal and seasonal patterns of HFRS in Zibo from 2009 to 2012, and to provide guidance for formulating regional prevention and control strategies. Methods The study was based on the reported cases of HFRS from the National Notifiable Disease Surveillance System. Annualized incidence maps and seasonal incidence maps were produced to analyze the spatial and seasonal distribution of HFRS in Zibo City. Then spatial scan statistics and space-time scan statistics were conducted to identify clusters of HFRS. Results There were 200 cases reported in Zibo City during the 4-year study period. One most likely cluster and one secondary cluster for high incidence of HFRS were identified by the space-time analysis. The most likely cluster was found in Yiyuan County from October to December 2012. The human infections in the fall and winter reflected a seasonal characteristic pattern of Hantaan virus (HTNV) transmission. The secondary cluster was detected at the center of Zibo from May to June 2009, presenting a seasonal characteristic of Seoul virus (SEOV) transmission. Conclusion To control and prevent HFRS in Zibo City, a comprehensive preventive strategy should be implemented in the southern areas of Zibo in autumn and in the northern areas of Zibo in spring. PMID:23840719
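
    The way a scan statistic ranks candidate clusters can be sketched with Kulldorff's Poisson log-likelihood ratio for a high-rate window. The total of 200 cases comes from the abstract; the candidate windows, their observed and expected counts, and the function names are hypothetical, and the Monte Carlo step that turns the maximum ratio into a p-value is not shown.

```python
import math


def poisson_llr(observed, expected, total):
    """Log-likelihood ratio for a window with `observed` cases where
    `expected` would occur under the null; zero unless there is an excess."""
    if observed <= expected:
        return 0.0
    inside = observed * math.log(observed / expected)
    outside = 0.0
    if total > observed:
        outside = (total - observed) * math.log((total - observed) / (total - expected))
    return inside + outside


def most_likely_cluster(windows, total):
    """Return the name of the window with the largest log-likelihood ratio."""
    return max(windows, key=lambda name: poisson_llr(*windows[name], total))


# Hypothetical candidate windows: name -> (observed cases, expected cases).
windows = {"window_a": (12, 15.0), "window_b": (25, 18.0), "window_c": (40, 20.0)}
best = most_likely_cluster(windows, total=200)
```

    The window with the highest ratio is reported as the most likely cluster, and the remaining non-overlapping windows with significant ratios are the secondary clusters.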

  5. Delayed inflammatory mRNA and protein expression after spinal cord injury

    PubMed Central

    2011-01-01

    Background Spinal cord injury (SCI) induces secondary tissue damage that is associated with inflammation. We have previously demonstrated that inflammation-related gene expression after SCI occurs in two waves - an initial cluster that is acutely and transiently up-regulated within 24 hours, and a more delayed cluster that peaks between 72 hours and 7 days. Here we extend the microarray analysis of these gene clusters up to 6 months post-SCI. Methods Adult male rats were subjected to mild, moderate or severe spinal cord contusion injury at T9 using a well-characterized weight-drop model. Tissue from the lesion epicenter was obtained 4 hours, 24 hours, 7 days, 28 days, 3 months or 6 months post-injury and processed for microarray analysis and protein expression. Results Anchor gene analysis using C1qB revealed a cluster of genes that showed elevated expression through 6 months post-injury, including galectin-3, p22PHOX, gp91PHOX, CD53 and progranulin. The expression of these genes occurred primarily in microglia/macrophage cells and was confirmed at the protein level using both immunohistochemistry and western blotting. As p22PHOX and gp91PHOX are components of the NADPH oxidase enzyme, enzymatic activity and its role in SCI were assessed and NADPH oxidase activity was found to be significantly up-regulated through 6 months post-injury. Further, treating rats with the nonspecific, irreversible NADPH oxidase inhibitor diphenylene iodinium (DPI) reduced both lesion volume and expression of chronic gene cluster proteins one month after trauma. Conclusions These data demonstrate that inflammation-related genes are chronically up-regulated after SCI and may contribute to further tissue loss. PMID:21975064

  6. Parity among interpretation methods of MLEE patterns and disparity among clustering methods in epidemiological typing of Candida albicans.

    PubMed

    Boriollo, Marcelo Fabiano Gomes; Rosa, Edvaldo Antonio Ribeiro; Gonçalves, Reginaldo Bruno; Höfling, José Francisco

    2006-03-01

    The typing of C. albicans by MLEE (multilocus enzyme electrophoresis) is dependent on the interpretation of enzyme electrophoretic patterns, and the study of the epidemiological relationships of these yeasts can be conducted by cluster analysis. Therefore, the aims of the present study were to first determine the discriminatory power of genetic interpretation (deduction of the allelic composition of diploid organisms) and numerical interpretation (mere determination of the presence and absence of bands) of MLEE patterns, and then to determine the concordance (Pearson product-moment correlation coefficient) and similarity (Jaccard similarity coefficient) of the groups of strains generated by three cluster analysis models, and the discriminatory power of such models as well [model A: genetic interpretation, genetic distance matrix of Nei (d(ij)) and UPGMA dendrogram; model B: genetic interpretation, Dice similarity matrix (S(D1)) and UPGMA dendrogram; model C: numerical interpretation, Dice similarity matrix (S(D2)) and UPGMA dendrogram]. MLEE was found to be a powerful and reliable tool for the typing of C. albicans due to its high discriminatory power (>0.9). Discriminatory power indicated that numerical interpretation is a method capable of discriminating a greater number of strains (47 versus 43 subtypes), but also pointed to model B as a method capable of providing a greater number of groups, suggesting its use for the typing of C. albicans by MLEE and cluster analysis. Very good agreement was only observed between the elements of the matrices S(D1) and S(D2), but a large majority of the groups generated in the three UPGMA dendrograms showed similarity S(J) between 4.8% and 75%, suggesting disparities in the conclusions obtained by the cluster assays.
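
    The similarity coefficients named above, and a discriminatory-power index, can be sketched briefly. The Dice and Jaccard definitions are standard; treating "discriminatory power" as the Hunter-Gaston form of Simpson's diversity index is an assumption, since the abstract does not name the formula, and the band patterns and type counts below are hypothetical.

```python
def dice(a, b):
    """Dice coefficient between two band sets: 2|A and B| / (|A| + |B|)."""
    return 2.0 * len(a & b) / (len(a) + len(b))


def jaccard(a, b):
    """Jaccard coefficient between two sets: |A and B| / |A or B|."""
    return len(a & b) / len(a | b)


def discriminatory_power(type_counts):
    """Probability that two strains drawn without replacement differ in type:
    D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1))."""
    n = sum(type_counts)
    same = sum(c * (c - 1) for c in type_counts)
    return 1.0 - same / (n * (n - 1))


# Hypothetical presence/absence band patterns for two strains.
strain_x = {1, 2, 3, 4}
strain_y = {3, 4, 5}
```

    Dice weights shared bands twice and therefore never falls below Jaccard for the same pair, which is one reason dendrograms built from different coefficients can group strains differently.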

  7. Characterization of genome sequences and clinical features of coxsackievirus A6 strains collected in Hyogo, Japan in 1999-2013.

    PubMed

    Ogi, Miki; Yano, Yoshihiko; Chikahira, Masatsugu; Takai, Denshi; Oshibe, Tomohiro; Arashiro, Takeshi; Hanaoka, Nozomu; Fujimoto, Tsuguto; Hayashi, Yoshitake

    2017-08-01

    Coxsackievirus A6 (CV-A6) is an enterovirus, which is known to cause herpangina. However, since 2009 it has frequently been isolated from children with hand, foot, and mouth disease (HFMD). In Japan, CV-A6 has been linked to HFMD outbreaks in 2011 and 2013. In this study, the full-length genome sequences of CV-A6 strains were analyzed to identify associations with clinical manifestations. A total of 5,612 children (0-17 years old) with suspected enterovirus infection between 1999 and 2013 in Hyogo Prefecture, Japan, were enrolled. Enterovirus infection was confirmed with reverse transcriptase-PCR in 753 children (791 samples), 127 of whom (133 samples) were positive for CV-A6 based on the direct sequencing of the VP4 region. The complete genomes of CV-A6 from 22 positive patients with different clinical manifestations were investigated. A phylogenetic analysis divided these 22 strains into two clusters based on the VP1 region; cluster I contained strains collected in 1999-2009 and mostly related to herpangina, and cluster II contained strains collected in 2011-2013 and related to HFMD outbreaks. Based on the full-length polyprotein analysis, the amino acid differences between the strains in cluster I and II were 97.7 ± 0.28%. Amino acid differences were detected in 17 positions within the polyprotein. Strains collected in 1999-2009 and those in 2011-2013 were separately clustered by phylogenetic analysis based on 5'UTR and 3Dpol region, as well as VP1 region. In conclusion, HFMD outbreaks caused by CV-A6 have recently been frequent in Japan, and the accumulation of genomic changes might be associated with the clinical course. © 2017 Wiley Periodicals, Inc.

  8. Bagging Voronoi classifiers for clustering spatial functional data

    NASA Astrophysics Data System (ADS)

    Secchi, Piercesare; Vantini, Simone; Vitelli, Valeria

    2013-06-01

    We propose a bagging strategy based on random Voronoi tessellations for the exploration of geo-referenced functional data, suitable for different purposes (e.g., classification, regression, dimensional reduction, …). Urged by an application to environmental data contained in the Surface Solar Energy database, we focus in particular on the problem of clustering functional data indexed by the sites of a spatial finite lattice. We thus illustrate our strategy by implementing a specific algorithm whose rationale is to (i) replace the original data set with a reduced one, composed by local representatives of neighborhoods covering the entire investigated area; (ii) analyze the local representatives; (iii) repeat the previous analysis many times for different reduced data sets associated to randomly generated different sets of neighborhoods, thus obtaining many different weak formulations of the analysis; (iv) finally, bag together the weak analyses to obtain a conclusive strong analysis. Through an extensive simulation study, we show that this new procedure - which does not require an explicit model for spatial dependence - is statistically and computationally efficient.
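
    Steps (i) to (iv) above can be sketched on a toy problem. This is a heavily simplified illustration, not the authors' algorithm: it assumes a one-dimensional lattice, one scalar value per site instead of a function, and a two-group "weak analysis" that splits local representatives at the midpoint of their range (which also sidesteps label matching between replicates). All names and data are hypothetical.

```python
import random


def bagged_voronoi_labels(values, n_nuclei=8, n_reps=51, seed=0):
    """Two-group labelling of lattice sites by bagging over random Voronoi cells."""
    rng = random.Random(seed)
    n = len(values)
    votes = [0] * n
    for _ in range(n_reps):  # (iii) repeat with a fresh random tessellation
        nuclei = rng.sample(range(n), n_nuclei)
        # (i) assign each site to its nearest nucleus and average within cells
        cell_of = [min(nuclei, key=lambda c: abs(site - c)) for site in range(n)]
        cells = {}
        for site, c in enumerate(cell_of):
            cells.setdefault(c, []).append(values[site])
        reps = {c: sum(v) / len(v) for c, v in cells.items()}
        # (ii) weak analysis: split representatives at the midpoint of their range
        threshold = (min(reps.values()) + max(reps.values())) / 2.0
        for site, c in enumerate(cell_of):
            votes[site] += int(reps[c] > threshold)
    # (iv) bag the weak analyses by majority vote
    return [int(2 * v > n_reps) for v in votes]


# Toy data: 20 low-valued sites followed by 20 high-valued sites, with noise.
_rng = random.Random(1)
values = ([_rng.gauss(0.0, 1.0) for _ in range(20)]
          + [_rng.gauss(10.0, 1.0) for _ in range(20)])
labels = bagged_voronoi_labels(values)
```

    Averaging within random neighbourhoods exploits spatial dependence without modelling it explicitly, and the vote across many tessellations smooths out the arbitrary cell boundaries of any single replicate.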

  9. Puma (Puma concolor) epididymal sperm morphometry

    PubMed Central

    Cucho, Hernán; Alarcón, Virgilio; Ordóñez, César; Ampuero, Enrique; Meza, Aydee; Soler, Carles

    2016-01-01

    The Andean puma (Puma concolor) has not been widely studied, particularly in reference to its semen characteristics. The aim of the present study was to define the morphometry of puma sperm heads and classify their subpopulations by cluster analysis. Samples were recovered postmortem from two epididymides from one animal and prepared for morphological observation after staining with the Hemacolor kit. Morphometric data were obtained from 581 spermatozoa using a CASA-Morph system, rendering 13 morphometric parameters. The principal component (PC) analysis was performed followed by cluster analysis for the establishment of subpopulations. Two PC components were obtained, the first related to size and the second to shape. Three subpopulations were observed, corresponding to elongated and intermediate-size sperm heads and acrosomes, to large heads with large acrosomes, and to small heads with short acrosomes. In conclusion, puma spermatozoa showed no uniform sperm morphology but three clear subpopulations. These results should be used for future work in the establishment of an adequate germplasm bank of this species. PMID:27678466

  10. Puma (Puma concolor) epididymal sperm morphometry.

    PubMed

    Cucho, Hernán; Alarcón, Virgilio; Ordóñez, César; Ampuero, Enrique; Meza, Aydee; Soler, Carles

    2016-01-01

    The Andean puma (Puma concolor) has not been widely studied, particularly in reference to its semen characteristics. The aim of the present study was to define the morphometry of puma sperm heads and classify their subpopulations by cluster analysis. Samples were recovered postmortem from two epididymides from one animal and prepared for morphological observation after staining with the Hemacolor kit. Morphometric data were obtained from 581 spermatozoa using a CASA-Morph system, rendering 13 morphometric parameters. The principal component (PC) analysis was performed followed by cluster analysis for the establishment of subpopulations. Two PC components were obtained, the first related to size and the second to shape. Three subpopulations were observed, corresponding to elongated and intermediate-size sperm heads and acrosomes, to large heads with large acrosomes, and to small heads with short acrosomes. In conclusion, puma spermatozoa showed no uniform sperm morphology but three clear subpopulations. These results should be used for future work in the establishment of an adequate germplasm bank of this species.

  11. Influence of diet, menstruation and genetic factors on iron status: a cross-sectional study in Spanish women of childbearing age.

    PubMed

    Blanco-Rojo, Ruth; Toxqui, Laura; López-Parra, Ana M; Baeza-Richer, Carlos; Pérez-Granados, Ana M; Arroyo-Pardo, Eduardo; Vaquero, M Pilar

    2014-03-06

    The aim of this study was to investigate the combined influence of diet, menstruation and genetic factors on iron status in Spanish menstruating women (n = 142). Dietary intake was assessed by a 72-h detailed dietary report and menstrual blood loss by a questionnaire, to determine a Menstrual Blood Loss Coefficient (MBLC). Five selected SNPs were genotyped: rs3811647, rs1799852 (Tf gene); rs1375515 (CACNA2D3 gene); and rs1800562 and rs1799945 (HFE gene, mutations C282Y and H63D, respectively). Iron biomarkers were determined and cluster analysis was performed. Differences among clusters in dietary intake, menstrual blood loss parameters and genotype frequencies distribution were studied. A categorical regression was performed to identify factors associated with cluster belonging. Three clusters were identified: women with poor iron status close to developing iron deficiency anemia (Cluster 1, n = 26); women with mild iron deficiency (Cluster 2, n = 59) and women with normal iron status (Cluster 3, n = 57). Three independent factors, red meat consumption, MBLC and mutation C282Y, were included in the model that better explained cluster belonging (R2 = 0.142, p < 0.001). In conclusion, the combination of high red meat consumption, low menstrual blood loss and the HFE C282Y mutation may protect from iron deficiency in women of childbearing age. These findings could be useful to implement adequate strategies to prevent iron deficiency anemia.

  12. Descriptive Epidemiology of Typhoid Fever during an Epidemic in Harare, Zimbabwe, 2012

    PubMed Central

    Polonsky, Jonathan A.; Martínez-Pino, Isabel; Nackers, Fabienne; Chonzi, Prosper; Manangazira, Portia; Van Herp, Michel; Maes, Peter; Porten, Klaudia; Luquero, Francisco J.

    2014-01-01

    Background Typhoid fever remains a significant public health problem in developing countries. In October 2011, a typhoid fever epidemic was declared in Harare, Zimbabwe - the fourth enteric infection epidemic since 2008. To orient control activities, we described the epidemiology and spatiotemporal clustering of the epidemic in Dzivaresekwa and Kuwadzana, the two most affected suburbs of Harare. Methods A typhoid fever case-patient register was analysed to describe the epidemic. To explore clustering, we constructed a dataset comprising GPS coordinates of case-patient residences and randomly sampled residential locations (spatial controls). The scale and significance of clustering were explored with Ripley K functions. Cluster locations were determined by a random labelling technique and confirmed using Kulldorff's spatial scan statistic. Principal Findings We analysed data from 2570 confirmed and suspected case-patients, and found significant spatiotemporal clustering of typhoid fever in two non-overlapping areas, which appeared to be linked to environmental sources. Peak relative risk was more than six times greater than in areas lying outside the cluster ranges. Clusters were identified in similar geographical ranges by both random labelling and Kulldorff's spatial scan statistic. The spatial scale at which typhoid fever clustered was highly localised, with significant clustering at distances up to 4.5 km and peak levels at approximately 3.5 km. The epicentre of infection transmission shifted from one cluster to the other during the course of the epidemic. Conclusions This study demonstrated highly localised clustering of typhoid fever during an epidemic in an urban African setting, and highlights the importance of spatiotemporal analysis for making timely decisions about targeting prevention and control activities and reinforcing treatment during epidemics. This approach should be integrated into existing surveillance systems to facilitate early detection of epidemics and identify their spatial range. PMID:25486292
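    As a rough illustration of the Ripley K function mentioned in the Methods, the sketch below computes the naive estimator K(r) = A / (n(n-1)) × (number of ordered pairs of points within distance r). It assumes planar coordinates and applies no edge correction, and it does not reproduce the case-control random labelling used in the study. The function name `ripley_k` and the toy coordinates are hypothetical.

```python
import math


def ripley_k(points, r, area):
    """Naive Ripley K estimate at distance r (no edge correction)."""
    n = len(points)
    close_pairs = sum(
        1
        for i, p in enumerate(points)
        for j, q in enumerate(points)
        if i != j and math.dist(p, q) <= r  # ordered pairs, self-pairs excluded
    )
    return area * close_pairs / (n * (n - 1))


# Toy pattern: the four corners of a unit square (area 1).
corners = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
k_at_1 = ripley_k(corners, 1.0, 1.0)
```

    Under complete spatial randomness K(r) is approximately πr², so estimates well above that value at a given r point to clustering at that spatial scale.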

  13. STAR FORMATION AND SUPERCLUSTER ENVIRONMENT OF 107 NEARBY GALAXY CLUSTERS

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cohen, Seth A.; Hickox, Ryan C.; Wegner, Gary A.

    We analyze the relationship between star formation (SF), substructure, and supercluster environment in a sample of 107 nearby galaxy clusters using data from the Sloan Digital Sky Survey. Previous works have investigated the relationships between SF and cluster substructure, and cluster substructure and supercluster environment, but definitive conclusions relating all three of these variables have remained elusive. We find an inverse relationship between cluster SF fraction (f_SF) and supercluster environment density, calculated using the galaxy luminosity density field at a smoothing length of 8 h^-1 Mpc (D8). The slope of f_SF versus D8 is −0.008 ± 0.002. The f_SF of clusters located in low-density large-scale environments, 0.244 ± 0.011, is higher than for clusters located in high-density supercluster cores, 0.202 ± 0.014. We also divide superclusters, according to their morphology, into filament- and spider-type systems. The inverse relationship between cluster f_SF and large-scale density is dominated by filament- rather than spider-type superclusters. In high-density cores of superclusters, we find a higher f_SF in spider-type superclusters, 0.229 ± 0.016, than in filament-type superclusters, 0.166 ± 0.019. Using principal component analysis, we confirm these results and the direct correlation between cluster substructure and SF. These results indicate that cluster SF is affected by the dynamical age of the cluster (younger systems exhibit higher amounts of SF), the large-scale density of the supercluster environment (high-density core regions exhibit lower amounts of SF), and supercluster morphology (spider-type superclusters exhibit higher amounts of SF at high densities).

  14. Developing Appropriate Methods for Cost-Effectiveness Analysis of Cluster Randomized Trials

    PubMed Central

    Gomes, Manuel; Ng, Edmond S.-W.; Nixon, Richard; Carpenter, James; Thompson, Simon G.

    2012-01-01

    Aim. Cost-effectiveness analyses (CEAs) may use data from cluster randomized trials (CRTs), where the unit of randomization is the cluster, not the individual. However, most studies use analytical methods that ignore clustering. This article compares alternative statistical methods for accommodating clustering in CEAs of CRTs. Methods. Our simulation study compared the performance of statistical methods for CEAs of CRTs with 2 treatment arms. The study considered a method that ignored clustering—seemingly unrelated regression (SUR) without a robust standard error (SE)—and 4 methods that recognized clustering—SUR and generalized estimating equations (GEEs), both with robust SE, a “2-stage” nonparametric bootstrap (TSB) with shrinkage correction, and a multilevel model (MLM). The base case assumed CRTs with moderate numbers of balanced clusters (20 per arm) and normally distributed costs. Other scenarios included CRTs with few clusters, imbalanced cluster sizes, and skewed costs. Performance was reported as bias, root mean squared error (rMSE), and confidence interval (CI) coverage for estimating incremental net benefits (INBs). We also compared the methods in a case study. Results. Each method reported low levels of bias. Without the robust SE, SUR gave poor CI coverage (base case: 0.89 v. nominal level: 0.95). The MLM and TSB performed well in each scenario (CI coverage, 0.92–0.95). With few clusters, the GEE and SUR (with robust SE) had coverage below 0.90. In the case study, the mean INBs were similar across all methods, but ignoring clustering underestimated statistical uncertainty and the value of further research. Conclusions. MLMs and the TSB are appropriate analytical methods for CEAs of CRTs with the characteristics described. SUR and GEE are not recommended for studies with few clusters. PMID:22016450

  15. Diffuse light and building history of the galaxy cluster Abell 2667

    NASA Astrophysics Data System (ADS)

    Covone, G.; Adami, C.; Durret, F.; Kneib, J.-P.; Lima Neto, G. B.; Slezak, E.

    2006-12-01

    Aims. We searched for diffuse intracluster light in the galaxy cluster Abell 2667 (z=0.233) from HST images in three broad-band filters. Methods. We applied an iterative multi-scale wavelet analysis and reconstruction technique to these images, which allows stars and galaxies to be subtracted from the original images. Results. We detect a zone of diffuse emission southwest of the cluster center (DS1) and a second faint object (ComDif) within DS1. Another diffuse source (DS2) may be detected at lower confidence level northeast of the center. These sources of diffuse light contribute 10-15% of the total visible light in the cluster. Whether they are independent entities or part of the very elliptical external envelope of the central galaxy remains unclear. Deep VLT VIMOS integral field spectroscopy reveals a faint continuum at the positions of DS1 and ComDif but does not allow a redshift to be computed, so we cannot conclude whether these sources are part of the central galaxy or not. A hierarchical substructure detection method reveals the presence of several galaxy pairs and groups defining a similar direction to the one drawn by the DS1 - central galaxy - DS2 axis. The analysis of archive XMM-Newton and Chandra observations shows X-ray emission elongated in the same direction. The X-ray temperature map shows the presence of a cool core, a broad cool zone stretching from north to south, and hotter regions towards the northeast, southwest, and northwest. This might suggest shock fronts along these directions produced by infalling material, even if uncertainties remain quite large on the temperature determination far from the center. Conclusions. These various data are consistent with a picture in which diffuse sources are concentrations of tidal debris and harassed matter expelled from infalling galaxies by tidal stripping and undergoing an accretion process onto the central cluster galaxy; as such, they are expected to be found along the main infall directions. Note, however, that the limited signal to noise of the various data and the apparent lack of large numbers of well-defined independent tidal tails, besides the one named ComDif, preclude definitive conclusions on this scenario.

  16. A cluster-analytic approach towards multidimensional health-related behaviors in adolescents: the MoMo-Study

    PubMed Central

    2012-01-01

    Background Although knowledge on single health-related behaviors and their association with health parameters is available, research on multiple health-related behaviors is needed to understand the interactions among these behaviors. The aims of the study were (a) to identify typical health-related behavior patterns in German adolescents focusing on physical activity, media use and dietary behavior; (b) to describe the socio-demographic correlates of the identified clusters and (c) to study their association with overweight. Methods Within the framework of the German Health Interview and Examination Survey for Children and Adolescents (KiGGS) and the “Motorik-Modul” (MoMo), 1,643 German adolescents (11–17 years) completed a questionnaire assessing the amount and type of weekly physical activity in sports clubs and during leisure time, weekly use of television, computer and console games and the frequency and amount of food consumption. From these data, the three indices ‘physical activity’, ‘media use’ and ‘healthy nutrition’ were derived and included in a cluster analysis conducted with Ward’s Method and K-means analysis. Chi-square tests were performed to identify socio-demographic correlates of the clusters as well as their association with overweight. Results Four stable clusters representing typical health-related behavior patterns were identified: Cluster 1 (16.2%)—high scores in physical activity index and average scores in media use index and healthy nutrition index; cluster 2 (34.6%)—high healthy nutrition score and below average scores in the other two indices; cluster 3 (18.4%)—low physical activity score, low healthy nutrition score and very high media use score; cluster 4 (30.5%)—below average scores on all three indices. Boys were overrepresented in clusters 1 and 3, and the relative number of adolescents with low socio-economic status as well as overweight was significantly higher than average in cluster 3. Conclusions Meaningful and stable clusters of health-related behavior were identified. These results confirm findings of another youth study, hence supporting the assumption that these clusters represent typical behavior patterns of adolescents. These results are particularly relevant for the characterization of target groups for primary prevention of lifestyle diseases. PMID:23273134

  17. VEGF-Induced Expression of miR-17–92 Cluster in Endothelial Cells Is Mediated by ERK/ELK1 Activation and Regulates Angiogenesis

    PubMed Central

    Chamorro-Jorganes, Aránzazu; Lee, Monica Y.; Araldi, Elisa; Landskroner-Eiger, Shira; Fernández-Fuertes, Marta; Sahraei, Mahnaz; Quiles del Rey, Maria; van Solingen, Coen; Yu, Jun; Fernández-Hernando, Carlos; Sessa, William C.

    2016-01-01

    Rationale: Several lines of evidence indicate that the regulation of microRNA (miRNA) levels by different stimuli may contribute to the modulation of stimulus-induced responses. The miR-17–92 cluster has been linked to tumor development and angiogenesis, but its role in vascular endothelial growth factor–induced endothelial cell (EC) functions is unclear and its regulation is unknown. Objective: The purpose of this study was to elucidate the mechanism by which VEGF regulates the expression of miR-17–92 cluster in ECs and determine its contribution to the regulation of endothelial angiogenic functions, both in vitro and in vivo. This was done by analyzing the effect of postnatal inactivation of miR-17–92 cluster in the endothelium (miR-17–92 iEC-KO mice) on developmental retinal angiogenesis, VEGF-induced ear angiogenesis, and tumor angiogenesis. Methods and Results: Here, we show that Erk/Elk1 activation on VEGF stimulation of ECs is responsible for Elk-1-mediated transcription activation (chromatin immunoprecipitation analysis) of the miR-17–92 cluster. Furthermore, we demonstrate that VEGF-mediated upregulation of the miR-17–92 cluster in vitro is necessary for EC proliferation and angiogenic sprouting. Finally, we provide genetic evidence that miR-17–92 iEC-KO mice have blunted physiological retinal angiogenesis during development and diminished VEGF-induced ear angiogenesis and tumor angiogenesis. Computational analysis and rescue experiments show that PTEN (phosphatase and tensin homolog) is a target of the miR-17–92 cluster and is a crucial mediator of miR-17-92–induced EC proliferation. However, the angiogenic transcriptional program is reduced when miR-17–92 is inhibited. Conclusions: Taken together, our results indicate that VEGF-induced miR-17–92 cluster expression contributes to the angiogenic switch of ECs and participates in the regulation of angiogenesis. PMID:26472816

  18. Hypersexuality and high sexual desire: exploring the structure of problematic sexuality.

    PubMed

    Carvalho, Joana; Štulhofer, Aleksandar; Vieira, Armando L; Jurin, Tanja

    2015-06-01

    The concept of hypersexuality has been accompanied by fierce debates and conflicting conclusions about its nature. One of the central questions under discussion is a potential overlap between hypersexuality and high sexual desire. With the relevant research in its early phase, the structure of hypersexuality remains largely unknown. The aim of the present study was to systematically explore the overlap between problematic sexuality and high sexual desire. A community online survey was carried out in Croatia in 2014. The data were first cluster analyzed (by gender) based on sexual desire, sexual activity, perceived lack of control over one's sexuality, and negative behavioral consequences. Participants in the meaningful clusters were then compared for psychosocial characteristics. To complement cluster analysis (CA), multigroup confirmatory factor analysis (CFA) of the same four constructs was carried out. Indicators representing the proposed structure of hypersexuality were included: sexual desire, frequency of sexual activity, lack of control over one's sexuality, and negative behavioral outcomes. Psychosocial characteristics such as religiosity, attitudes toward pornography, and general psychopathology were also evaluated. CA pointed to the existence of two meaningful clusters, one representing problematic sexuality, that is, lack of control over one's sexuality and negative outcomes (control/consequences cluster), and the other reflecting high sexual desire and frequent sexual activity (desire/activity cluster). Compared with the desire/activity cluster, individuals from the control/consequences cluster reported more psychopathology and were characterized by more traditional attitudes. Complementing the CA findings, CFA pointed to two distinct latent dimensions: problematic sexuality and high sexual desire/activity. Our study supports the distinctiveness of hypersexuality and high sexual desire/activity, suggesting that problematic sexuality might be more associated with the perceived lack of personal control over sexuality and moralistic attitudes than with high levels of sexual desire and activity. © 2015 International Society for Sexual Medicine.

  19. Athletic groin pain (part 2): a prospective cohort study on the biomechanical evaluation of change of direction identifies three clusters of movement patterns

    PubMed Central

    Franklyn-Miller, A; Richter, C; King, E; Gore, S; Moran, K; Strike, S; Falvey, E C

    2017-01-01

    Background Athletic groin pain (AGP) is prevalent in sports involving repeated accelerations, decelerations, kicking and change-of-direction movements. Clinical and radiological examinations lack the ability to assess pathomechanics of AGP, but three-dimensional biomechanical movement analysis may be an important innovation. Aim The primary aim was to describe and analyse movements used by patients with AGP during a maximum effort change-of-direction task. The secondary aim was to determine if specific anatomical diagnoses were related to a distinct movement strategy. Methods 322 athletes with a current symptom of chronic AGP participated. Structured and standardised clinical assessments and radiological examinations were performed on all participants. Additionally, each participant performed multiple repetitions of a planned maximum effort change-of-direction task during which whole body kinematics were recorded. Kinematic and kinetic data were examined using continuous waveform analysis techniques in combination with a subgroup design that used gap statistic and hierarchical clustering. Results Three subgroups (clusters) were identified. Kinematic and kinetic measures of the clusters differed strongly in patterns observed in thorax, pelvis, hip, knee and ankle. Cluster 1 (40%) was characterised by increased ankle eversion, external rotation and knee internal rotation and greater knee work. Cluster 2 (15%) was characterised by increased hip flexion, pelvis contralateral drop, thorax tilt and increased hip work. Cluster 3 (45%) was characterised by high ankle dorsiflexion, thorax contralateral drop, ankle work and prolonged ground contact time. No correlation was observed between movement clusters and clinically palpated location of the participant's pain. Conclusions We identified three distinct movement strategies among athletes with long-standing groin pain during a maximum effort change-of-direction task. These movement strategies were not related to clinical assessment findings but highlighted targets for rehabilitation in response to possible propagative mechanisms. Trial registration number NCT02437942, pre results. PMID:28209597

  20. Deep Learning Nuclei Detection in Digitized Histology Images by Superpixels

    PubMed Central

    Sornapudi, Sudhir; Stanley, Ronald Joe; Stoecker, William V.; Almubarak, Haidar; Long, Rodney; Antani, Sameer; Thoma, George; Zuna, Rosemary; Frazier, Shelliane R.

    2018-01-01

    Background: Advances in image analysis and computational techniques have facilitated automatic detection of critical features in histopathology images. Detection of nuclei is critical for squamous epithelium cervical intraepithelial neoplasia (CIN) classification into normal, CIN1, CIN2, and CIN3 grades. Methods: In this study, a deep learning (DL)-based nuclei segmentation approach is investigated based on gathering localized information through the generation of superpixels using a simple linear iterative clustering algorithm and training with a convolutional neural network. Results: The proposed approach was evaluated on a dataset of 133 digitized histology images and achieved an overall nuclei detection (object-based) accuracy of 95.97%, with demonstrated improvement over imaging-based and clustering-based benchmark techniques. Conclusions: The proposed DL-based nuclei segmentation method with superpixel analysis has shown improved segmentation results in comparison to state-of-the-art methods. PMID:29619277

  1. Clusters of Healthy and Unhealthy Eating Behaviors are Associated with Body Mass Index Among Adults

    PubMed Central

    Heerman, William J.; Jackson, Natalie; Hargreaves, Margaret; Mulvaney, Shelagh A.; Schlundt, David; Wallston, Kenneth A.; Rothman, Russell L.

    2017-01-01

    Objective To identify eating styles from 6 eating behaviors and test their association with Body Mass Index (BMI) among adults. Design Cross-sectional analysis of self-report survey data. Setting 12 primary care and specialty clinics in 5 states. Participants 11,776 adult patients consented to participate; 9,977 completed survey questions. Variables measured Frequency of eating healthy food; frequency of eating unhealthy food; breakfast frequency; frequency of snacking; overall diet quality; and problem eating behaviors. The primary dependent variable was BMI, calculated from self-reported height and weight data. Analysis K-means cluster analysis of eating behaviors was used to determine eating styles. A categorical variable representing each eating style cluster was entered in a multivariate linear regression predicting BMI, controlling for covariates. Results Four eating styles were identified and defined by healthy vs. unhealthy diet patterns and engagement in problem eating behaviors. Each group had significantly higher average BMI than the healthy eating style: healthy with problem eating behaviors (β=1.9, p<0.001), unhealthy (β=2.5, p<0.001), and unhealthy with problem eating behaviors (β=5.1, p<0.001). Conclusions Future attempts to improve eating styles should address not only the consumption of healthy foods, but also snacking behaviors and the emotional component of eating. PMID:28363804
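    For readers unfamiliar with the K-means step named in the Analysis, the following is a minimal sketch of Lloyd's algorithm using only the Python standard library. It is illustrative only: the study clustered six eating-behavior variables from survey data, whereas the toy points, the function name `kmeans` and its parameters are hypothetical.

```python
import math
import random


def kmeans(points, k, n_iter=100, seed=0):
    """Lloyd's algorithm: returns (labels, centroids) for a list of tuples."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initialise from k distinct observations
    labels = None
    for _ in range(n_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids


# Two well-separated toy groups of respondents in a 2-D behaviour space.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
labels, centroids = kmeans(points, k=2)
```

    In an analysis like the one described, the resulting cluster label would then enter the regression as a categorical predictor; the number of clusters k has to be chosen separately.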

  2. Clustering XCO2 temporal change to assess CO2 exchanging strength of biosphere-atmosphere with GOSAT observations

    NASA Astrophysics Data System (ADS)

    He, Zhonghua; Lei, Liping; Bie, Nian; Yang, Shaoyuan; Wu, Changjiang; Zeng, Zhao-Cheng

    2017-04-01

    The temporal change of atmospheric carbon dioxide (CO2) concentration, which is closely related to local CO2 uptake and emission, including biospheric exchange and anthropogenic emission, provides important information for identifying regions of carbon sources and sinks. Satellite observations of CO2 have long been used to detect changes in CO2 concentration. In this study, we used gridded data of the column-averaged CO2 dry air mole fraction (XCO2), with a spatial resolution of 1 degree and a temporal resolution of 3 days, from 1 June 2009 to 31 May 2014 over the land area of 30° - 60° N to cluster the temporal change characteristics of the Greenhouse Gases Observing Satellite (GOSAT) XCO2 retrievals. The gridded data were derived using a gap-filling method based on spatio-temporal geostatistics. The clustering method is a K-means algorithm adjusted for time-series data containing gaps. As a result, the types and number of clusters are specified based on the temporal characteristics of XCO2 by using the optimal clustering parameters. The biospheric absorption and surface emission of atmospheric CO2 are discussed through analysis of the differing yearly increase and seasonal amplitude of XCO2 in each cluster, combined with correlation analysis against the vegetation index from the Moderate-resolution Imaging Spectroradiometer (MODIS) and fossil fuel CO2 emission data from the Open-source Data Inventory for Anthropogenic CO2 (ODIAC). Regions of strong or weak biosphere-atmosphere exchange, or of significant disturbance from anthropogenic activities, can be identified. In conclusion, gap-filled XCO2 from satellite observations can support analysis of atmospheric CO2, which results from coupled biosphere-atmosphere processes, through its spatio-temporal characteristics and its relationship with other remote sensing parameters, e.g. MODIS products related to biospheric photosynthetic or respiration activities.

  3. Associations between a Genetic Risk Score for Clinical CAD and Early Stage Lesions in the Coronary Artery and the Aorta

    PubMed Central

    Herrington, David M.

    2016-01-01

    Objective The correlation between the extent of fatty streaks, more advanced atherosclerotic lesions, and community rates of coronary artery disease (CAD) is substantially higher for the coronary artery compared to the aorta. We sought to determine whether a genetic basis contributes to these differences. Approach and Results We conducted a cluster analysis of 6 subclinical atherosclerosis phenotypes documented in 564 white participants of the Pathobiological Determinants of Atherosclerosis in Youth study including the extent of fatty streaks and raised lesions in the coronary artery (CF and CR), thoracic aorta (TF and TR), and abdominal aorta (AF and AR) followed by a genetic association analysis of the same phenotypes. Our cluster analysis grouped all raised lesions and fatty streaks in the coronary into one cluster (CF, CR, TR, and AR) and the fatty streaks in the aorta into a second cluster (TF and AF). We found a genetic risk score of high-risk alleles at 57 susceptibility loci for CAD to be variably associated with the phenotypes in the first cluster (OR: 1.30, p = 0.009 for being in the top quartile of degree of involvement of CF; 1.34, p = 0.005 for CR; 1.25, p = 0.11 for TR; and 1.19, p = 0.08 for AR) but not at all with the phenotypes in the second cluster (OR: 1.01, p = 0.95 for TF and 0.98, p = 0.82 for AF). Conclusions The genetic determinants of fatty streaks in the aorta do not appear to overlap substantially with the genetic determinants of fatty streaks in the coronary as well as raised lesions in both the coronary and the aorta. These findings may explain why a larger fraction of fatty streaks in the aorta are less likely to progress to raised lesions compared to the coronary artery. PMID:27861582

  4. Cardiometabolic Risk Clustering in Spinal Cord Injury: Results of Exploratory Factor Analysis

    PubMed Central

    2013-01-01

    Background: Evidence suggests an elevated prevalence of cardiometabolic risks among persons with spinal cord injury (SCI); however, the unique clustering of risk factors in this population has not been fully explored. Objective: The purpose of this study was to describe unique clustering of cardiometabolic risk factors differentiated by level of injury. Methods: One hundred twenty-one subjects (mean 37 ± 12 years; range, 18–73) with chronic C5 to T12 motor complete SCI were studied. Assessments included medical histories, anthropometrics and blood pressure, and fasting serum lipids, glucose, insulin, and hemoglobin A1c (HbA1c). Results: The most common cardiometabolic risk factors were overweight/obesity, high levels of low-density lipoprotein (LDL-C), and low levels of high-density lipoprotein (HDL-C). Risk clustering was found in 76.9% of the population. Exploratory principal component factor analysis using varimax rotation revealed a 3–factor model in persons with paraplegia (65.4% variance) and a 4–factor solution in persons with tetraplegia (73.3% variance). The differences between groups were emphasized by the varied composition of the extracted factors: Lipid Profile A (total cholesterol [TC] and LDL-C), Body Mass-Hypertension Profile (body mass index [BMI], systolic blood pressure [SBP], and fasting insulin [FI]); Glycemic Profile (fasting glucose and HbA1c), and Lipid Profile B (TG and HDL-C). BMI and SBP formed a separate factor only in persons with tetraplegia. Conclusions: Although the majority of the population with SCI has risk clustering, the composition of the risk clusters may be dependent on level of injury, based on a factor analysis group comparison. This is clinically plausible and relevant as tetraplegics tend to be hypo- to normotensive and more sedentary, resulting in lower HDL-C and a greater propensity toward impaired carbohydrate metabolism. PMID:23960702

  5. Transcriptional regulation of gene expression clusters in motor neurons following spinal cord injury

    PubMed Central

    2010-01-01

    Background Spinal cord injury leads to neurological dysfunctions affecting the motor, sensory as well as the autonomic systems. Increased excitability of motor neurons has been implicated in injury-induced spasticity, where the reappearance of self-sustained plateau potentials in the absence of modulatory inputs from the brain correlates with the development of spasticity. Results Here we examine the dynamic transcriptional response of motor neurons to spinal cord injury as it evolves over time to unravel common gene expression patterns and their underlying regulatory mechanisms. For this we use a rat-tail-model with complete spinal cord transection causing injury-induced spasticity, where gene expression profiles are obtained from labeled motor neurons extracted with laser microdissection 0, 2, 7, 21 and 60 days post injury. Consensus clustering identifies 12 gene clusters with distinct time expression profiles. Analysis of these gene clusters identifies early immunological/inflammatory and late developmental responses as well as a regulation of genes relating to neuron excitability that support the development of motor neuron hyper-excitability and the reappearance of plateau potentials in the late phase of the injury response. Transcription factor motif analysis identifies differentially expressed transcription factors involved in the regulation of each gene cluster, shaping the expression of the identified biological processes and their associated genes underlying the changes in motor neuron excitability. Conclusions This analysis provides important clues to the underlying mechanisms of transcriptional regulation responsible for the increased excitability observed in motor neurons in the late chronic phase of spinal cord injury suggesting alternative targets for treatment of spinal cord injury. 
Several transcription factors were identified as potential regulators of gene clusters containing elements related to motor neuron hyper-excitability, the manipulation of which potentially could be used to alter the transcriptional response to prevent the motor neurons from entering a state of hyper-excitability. PMID:20534130

  6. Addressing the complexity of water chemistry in environmental fate modeling for engineered nanoparticles.

    PubMed

    Sani-Kast, Nicole; Scheringer, Martin; Slomberg, Danielle; Labille, Jérôme; Praetorius, Antonia; Ollivier, Patrick; Hungerbühler, Konrad

    2015-12-01

    Engineered nanoparticle (ENP) fate models developed to date - aimed at predicting ENP concentration in the aqueous environment - have limited applicability because they employ constant environmental conditions along the modeled system or a highly specific environmental representation; both approaches do not show the effects of spatial and/or temporal variability. To address this conceptual gap, we developed a novel modeling strategy that: 1) incorporates spatial variability in environmental conditions in an existing ENP fate model; and 2) analyzes the effect of a wide range of randomly sampled environmental conditions (representing variations in water chemistry). This approach was employed to investigate the transport of nano-TiO2 in the Lower Rhône River (France) under numerous sets of environmental conditions. The predicted spatial concentration profiles of nano-TiO2 were then grouped according to their similarity by using cluster analysis. The analysis resulted in a small number of clusters representing groups of spatial concentration profiles. All clusters show nano-TiO2 accumulation in the sediment layer, supporting results from previous studies. Analysis of the characteristic features of each cluster demonstrated a strong association between the water conditions in regions close to the ENP emission source and the cluster membership of the corresponding spatial concentration profiles. In particular, water compositions favoring heteroaggregation between the ENPs and suspended particulate matter resulted in clusters of low variability. These conditions are, therefore, reliable predictors of the eventual fate of the modeled ENPs. The conclusions from this study are also valid for ENP fate in other large river systems. Our results, therefore, shift the focus of future modeling and experimental research of ENP environmental fate to the water characteristic in regions near the expected ENP emission sources. 
Under conditions favoring heteroaggregation in these regions, the fate of the ENPs can be readily predicted. Copyright © 2014 Elsevier B.V. All rights reserved.

  7. The cosmological analysis of X-ray cluster surveys. III. 4D X-ray observable diagrams

    NASA Astrophysics Data System (ADS)

    Pierre, M.; Valotti, A.; Faccioli, L.; Clerc, N.; Gastaud, R.; Koulouridis, E.; Pacaud, F.

    2017-11-01

    Context. Despite compelling theoretical arguments, the use of clusters as cosmological probes is, in practice, frequently questioned because of the many uncertainties surrounding cluster-mass estimates. Aims: Our aim is to develop a fully self-consistent cosmological approach of X-ray cluster surveys, exclusively based on observable quantities rather than masses. This procedure is justified given the possibility to directly derive the cluster properties via ab initio modelling, either analytically or by using hydrodynamical simulations. In this third paper, we evaluate the method on cluster toy-catalogues. Methods: We model the population of detected clusters in the count-rate - hardness-ratio - angular size - redshift space and compare the corresponding four-dimensional diagram with theoretical predictions. The best cosmology+physics parameter configuration is determined using a simple minimisation procedure; errors on the parameters are estimated by averaging the results from ten independent survey realisations. The method allows a simultaneous fit of the cosmological parameters of the cluster evolutionary physics and of the selection effects. Results: When using information from the X-ray survey alone plus redshifts, this approach is shown to be as accurate as the modelling of the mass function for the cosmological parameters and to perform better for the cluster physics, for a similar level of assumptions on the scaling relations. It enables the identification of degenerate combinations of parameter values. Conclusions: Given the considerably shorter computer times involved for running the minimisation procedure in the observed parameter space, this method appears to clearly outperform traditional mass-based approaches when X-ray survey data alone are available.

  8. A two-stage method for microcalcification cluster segmentation in mammography by deformable models

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Arikidis, N.; Kazantzi, A.; Skiadopoulos, S.

    Purpose: Segmentation of microcalcification (MC) clusters in x-ray mammography is a difficult task for radiologists. Accurate segmentation is prerequisite for quantitative image analysis of MC clusters and subsequent feature extraction and classification in computer-aided diagnosis schemes. Methods: In this study, a two-stage semiautomated segmentation method of MC clusters is investigated. The first stage is targeted to accurate and time efficient segmentation of the majority of the particles of a MC cluster, by means of a level set method. The second stage is targeted to shape refinement of selected individual MCs, by means of an active contour model. Both methods are applied in the framework of a rich scale-space representation, provided by the wavelet transform at integer scales. Segmentation reliability of the proposed method in terms of inter and intraobserver agreements was evaluated in a case sample of 80 MC clusters originating from the digital database for screening mammography, corresponding to 4 morphology types (punctate: 22, fine linear branching: 16, pleomorphic: 18, and amorphous: 24) of MC clusters, assessing radiologists’ segmentations quantitatively by two distance metrics (Hausdorff distance, HDIST_cluster; average of minimum distance, AMINDIST_cluster) and the area overlap measure (AOM_cluster). The effect of the proposed segmentation method on MC cluster characterization accuracy was evaluated in a case sample of 162 pleomorphic MC clusters (72 malignant and 90 benign). Ten MC cluster features, targeted to capture morphologic properties of individual MCs in a cluster (area, major length, perimeter, compactness, and spread), were extracted and a correlation-based feature selection method yielded a feature subset to feed in a support vector machine classifier. 
Classification performance of the MC cluster features was estimated by means of the area under receiver operating characteristic curve (Az ± Standard Error) utilizing tenfold cross-validation methodology. A previously developed B-spline active rays segmentation method was also considered for comparison purposes. Results: Interobserver and intraobserver segmentation agreements (median and [25%, 75%] quartile range) were substantial with respect to the distance metrics HDIST_cluster (2.3 [1.8, 2.9] and 2.5 [2.1, 3.2] pixels) and AMINDIST_cluster (0.8 [0.6, 1.0] and 1.0 [0.8, 1.2] pixels), while moderate with respect to AOM_cluster (0.64 [0.55, 0.71] and 0.59 [0.52, 0.66]). The proposed segmentation method (Az = 0.80 ± 0.04) significantly outperformed (Mann-Whitney U-test, p < 0.05) the B-spline active rays segmentation method (0.69 ± 0.04), suggesting the significance of the proposed semiautomated method. Conclusions: Results indicate a reliable semiautomated segmentation method for MC clusters offered by deformable models, which could be utilized in MC cluster quantitative image analysis.
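
    The three agreement measures named in this record (Hausdorff distance, average of minimum distance, area overlap) can be illustrated with a minimal sketch. The definitions below are the common textbook ones and are an assumption; the record does not give the exact formulas used, and the toy pixel sets are hypothetical.

```python
import math

def min_distances(src, dst):
    """For each point in src, the Euclidean distance to its nearest point in dst."""
    return [min(math.dist(p, q) for q in dst) for p in src]

def hausdorff(a, b):
    """Symmetric Hausdorff distance: the worst nearest-neighbour distance in either direction."""
    return max(max(min_distances(a, b)), max(min_distances(b, a)))

def average_min_distance(a, b):
    """Mean nearest-neighbour distance, pooled over both directions (assumed definition)."""
    d = min_distances(a, b) + min_distances(b, a)
    return sum(d) / len(d)

def area_overlap(mask_a, mask_b):
    """Area overlap as intersection over union of two pixel sets (assumed definition)."""
    return len(mask_a & mask_b) / len(mask_a | mask_b)

# Hypothetical data: two 3x3 pixel segmentations of one particle, offset by one column.
seg_a = {(r, c) for r in range(3) for c in range(3)}
seg_b = {(r, c) for r in range(3) for c in range(1, 4)}

print(hausdorff(seg_a, seg_b))             # worst-case disagreement, in pixels
print(average_min_distance(seg_a, seg_b))  # typical disagreement, in pixels
print(area_overlap(seg_a, seg_b))          # fraction of shared area
```

    For simplicity the sketch applies the distance metrics to whole pixel sets; a contour-based evaluation would pass only boundary pixels to the same functions.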

  9. Robust fiber clustering of cerebral fiber bundles in white matter

    NASA Astrophysics Data System (ADS)

    Yao, Xufeng; Wang, Yongxiong; Zhuang, Songlin

    2014-11-01

    Diffusion tensor imaging fiber tracking (DTI-FT) has been widely accepted in the diagnosis and treatment of brain diseases. During the rendering pipeline of specific fiber tracts, the image noise and low resolution of DTI would lead to false propagations. In this paper, we propose a robust fiber clustering (FC) approach to diminish false fibers from one fiber tract. Our algorithm consists of three steps. Firstly, the optimized fiber assignment continuous tracking (FACT) is implemented to reconstruct one fiber tract; and then each curved fiber in the fiber tract is mapped to a point by kernel principal component analysis (KPCA); finally, the point clouds of fiber tract are clustered by hierarchical clustering which could distinguish false fibers from true fibers in one tract. In our experiment, the corticospinal tract (CST) in one case of human data in vivo was used to validate our method. Our method showed reliable capability in decreasing the false fibers in one tract. In conclusion, our method could effectively optimize the visualization of fiber bundles and would help a lot in the field of fiber evaluation.
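
    The final step of this record (clustering the embedded points to separate false fibers from true ones) can be sketched as follows. The record does not state the linkage criterion or the rule for labelling a cluster as false, so single linkage, the distance threshold, the 2-D points, and the "keep the largest cluster" rule are all assumptions made for illustration.

```python
import math

def single_linkage_clusters(points, threshold):
    """Cut a single-linkage hierarchy at `threshold`.

    This equals the connected components of the graph that joins every
    pair of points whose Euclidean distance is at most `threshold`.
    """
    parent = list(range(len(points)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving keeps trees shallow
            i = parent[i]
        return i

    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            if math.dist(points[i], points[j]) <= threshold:
                parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(points)):
        clusters.setdefault(find(i), []).append(i)
    # Largest cluster first
    return sorted(clusters.values(), key=len, reverse=True)

# Hypothetical embedding: four fibers close together plus two stray ones.
embedded = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (2.0, 2.0), (-3.0, 1.0)]
kept, *discarded = single_linkage_clusters(embedded, threshold=0.5)
print("kept fibers:", kept)
print("discarded fibers:", discarded)
```

    In practice the points would come from the KPCA embedding of each fiber, and the threshold would need tuning per tract.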

  10. Impact of a Participatory Intervention with Women’s Groups on Psychological Distress among Mothers in Rural Bangladesh: Secondary Analysis of a Cluster-Randomised Controlled Trial

    PubMed Central

    Clarke, Kelly; Azad, Kishwar; Kuddus, Abdul; Shaha, Sanjit; Nahar, Tasmin; Aumon, Bedowra Haq; Hossen, Mohammed Munir; Beard, James; Costello, Anthony; Houweling, Tanja A. J.; Prost, Audrey; Fottrell, Edward

    2014-01-01

    Background Perinatal common mental disorders (PCMDs) are a major cause of disability among women and disproportionately affect lower income countries. Interventions to address PCMDs are urgently needed in these settings, and group-based and peer-led approaches are potential strategies to increase access to mental health interventions. Participatory women’s health groups led by local women previously reduced postpartum psychological distress in eastern India. We assessed the effect of a similar intervention on postpartum psychological distress in rural Bangladesh. Method We conducted a secondary analysis of data from a cluster-randomised controlled trial with 18 clusters and an estimated population of 532,996. Nine clusters received an intervention comprising monthly meetings during which women’s groups worked through a participatory learning and action cycle to develop strategies for improving women’s and children’s health. There was one group for every 309 individuals in the population, 810 groups in total. Mothers in nine control clusters had access to usual perinatal care. Postpartum psychological distress was measured with the 20-item Self Reporting Questionnaire (SRQ-20) between six and 52 weeks after delivery, during the months of January to April, in 2010 and 2011. Results We analysed outcomes for 6275 mothers. Although the cluster mean SRQ-20 score was lower in the intervention arm (mean 5.2, standard deviation 1.8) compared to control (5.3, 1.2), the difference was not significant (β 1.44, 95% CI 0.28, 3.08). Conclusions Despite promising results in India, participatory women’s groups focused on women’s and children’s health had no significant effect on postpartum psychological distress in rural Bangladesh. PMID:25329470

  11. The Grism Lens-Amplified Survey from Space (GLASS). V. Extent and Spatial Distribution of Star Formation in z ~ 0.5 Cluster Galaxies

    NASA Astrophysics Data System (ADS)

    Vulcani, Benedetta; Treu, Tommaso; Schmidt, Kasper B.; Poggianti, Bianca M.; Dressler, Alan; Fontana, Adriano; Bradač, Marusa; Brammer, Gabriel B.; Hoag, Austin; Huang, Kuan-Han; Malkan, Matthew; Pentericci, Laura; Trenti, Michele; von der Linden, Anja; Abramson, Louis; He, Julie; Morris, Glenn

    2015-12-01

    We present the first study of the spatial distribution of star formation in z ~ 0.5 cluster galaxies. The analysis is based on data taken with the Wide Field Camera 3 as part of the Grism Lens-Amplified Survey from Space (GLASS). We illustrate the methodology by focusing on two clusters (MACS 0717.5+3745 and MACS 1423.8+2404) with different morphologies (one relaxed and one merging) and use foreground and background galaxies as a field control sample. The cluster+field sample consists of 42 galaxies with stellar masses in the range 10^8-10^11 M⊙ and star formation rates in the range 1-20 M⊙ yr^-1. Both in clusters and in the field, Hα is more extended than the rest-frame UV continuum in 60% of the cases, consistent with diffuse star formation and inside-out growth. In ~20% of the cases, the Hα emission appears more extended in cluster galaxies than in the field, pointing perhaps to ionized gas being stripped and/or star formation being enhanced at large radii. The peak of the Hα emission and that of the continuum are offset by less than 1 kpc. We investigate trends with the hot gas density as traced by the X-ray emission, and with the surface mass density as inferred from gravitational lens models, and find no conclusive results. The diversity of morphologies and sizes observed in Hα illustrates the complexity of the environmental processes that regulate star formation. Upcoming analysis of the full GLASS data set will increase our sample size by almost an order of magnitude, verifying and strengthening the inference from this initial data set.

  12. An off-axis galaxy cluster merger: Abell 0141

    NASA Astrophysics Data System (ADS)

    Caglar, Turgay

    2018-04-01

    We present structural analysis results of Abell 0141 (z = 0.23) based on X-ray data. The X-ray luminosity map demonstrates that Abell 0141 (A0141) is a bimodal galaxy cluster, whose two components are separated on the sky by ~0.65 Mpc with an elongation along the north-south direction. The optical galaxy density map also demonstrates this bimodality. We estimate sub-cluster ICM temperatures of 5.17^{+0.20}_{-0.19} keV for A0141N and 5.23^{+0.24}_{-0.23} keV for A0141S. We obtain X-ray morphological parameters w = 0.034 ± 0.004, c = 0.113 ± 0.004, and w = 0.039 ± 0.004, c = 0.104 ± 0.005 for A0141N and A0141S, respectively. The resulting X-ray morphological parameters indicate that both sub-clusters are moderately disturbed non-cool core structures. We find a slight brightness jump in the bridge region, and yet, there is still an absence of strong X-ray emitting gas between sub-clusters. We discover a significant hot spot (~10 keV) between sub-clusters, and a Mach number M = 1.69^{+0.40}_{-0.37} is obtained by using the temperature jump condition. However, we did not find direct evidence for shock-heating between sub-clusters. We estimate the sub-clusters' central entropies as K_0 > 100 keV cm^2, which indicates that the sub-clusters are not cool cores. We find some evidence that the system undergoes an off-axis collision; however, the cores of the sub-clusters have not yet been destroyed. Due to the orientation of X-ray tails of sub-clusters, we suggest that the northern sub-cluster moves toward the south-west, and the southern sub-cluster moves toward the north-east. In conclusion, we are witnessing an earlier phase of close core passage between sub-clusters.
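
    The "temperature jump condition" mentioned in this record is usually taken to mean the Rankine-Hugoniot relation between the post-shock and pre-shock gas temperatures. The sketch below assumes that relation for an ideal monatomic gas (adiabatic index 5/3); the record does not state which temperatures entered the authors' calculation, so only the quoted Mach number is used here.

```python
import math

def temperature_ratio(mach):
    """Post-shock over pre-shock temperature for gamma = 5/3 (Rankine-Hugoniot)."""
    m2 = mach ** 2
    return (5 * m2 - 1) * (m2 + 3) / (16 * m2)

def mach_from_temperature_ratio(ratio):
    """Invert the relation: solve 5x^2 + (14 - 16r)x - 3 = 0 for x = M^2, take the positive root."""
    b = 16 * ratio - 14
    return math.sqrt((b + math.sqrt(b * b + 60)) / 10)

# Temperature jump implied by the Mach number quoted in the record
ratio = temperature_ratio(1.69)
print(f"T_post / T_pre = {ratio:.2f}")
print(f"recovered Mach number = {mach_from_temperature_ratio(ratio):.2f}")
```

    Under these assumptions a Mach number of 1.69 corresponds to a temperature jump of roughly a factor of 1.7.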

  13. Substructures in Clusters of Galaxies

    NASA Astrophysics Data System (ADS)

    Lehodey, Brigitte Tome

    2000-01-01

    This dissertation presents two methods for the detection of substructures in clusters of galaxies and the results of their application to a group of four clusters. In chapters 2 and 3, we recall the main properties of clusters of galaxies and give the definition of substructures. We also try to show why the study of substructures in clusters of galaxies is so important for cosmology. Chapters 4 and 5 describe these two methods. The first one, the adaptive kernel, is applied to the study of the spatial and kinematical distribution of the cluster galaxies. The second one, the MVM (Multiscale Vision Model), is applied to analyse the cluster diffuse X-ray emission, i.e., the intracluster gas distribution. At the end of these two chapters, we also present the results of the application of these methods to our sample of clusters. In chapter 6, we draw the conclusions from the comparison of the results we obtain with each method. In the last chapter, we present the main conclusions of this work and try to point out possible developments. We close with two appendices in which we detail some questions raised in this work that are not directly linked to the problem of substructure detection.

  14. Type 2 diabetes mellitus: distribution of genetic markers in Kazakh population

    PubMed Central

    Sikhayeva, Nurgul; Talzhanov, Yerkebulan; Iskakova, Aisha; Dzharmukhanov, Jarkyn; Nugmanova, Raushan; Zholdybaeva, Elena; Ramanculov, Erlan

    2018-01-01

    Background Ethnic differences exist in the frequencies of genetic variations that contribute to the risk of common disease. This study aimed to analyse the distribution of several genes, previously associated with susceptibility to type 2 diabetes and obesity-related phenotypes, in a Kazakh population. Methods A total of 966 individuals belonging to the Kazakh ethnicity were recruited from an outpatient clinic. We genotyped 41 common single nucleotide polymorphisms (SNPs) previously associated with type 2 diabetes in other ethnic groups and 31 of these were in Hardy–Weinberg equilibrium. The obtained allele frequencies were further compared to publicly available data from other ethnic populations. Allele frequencies for other (compared) populations were pooled from the haplotype map (HapMap) database. Principal component analysis (PCA), cluster analysis, and multidimensional scaling (MDS) were used for the analysis of genetic relationship between the populations. Results Comparative analysis of allele frequencies of the studied SNPs showed significant differentiation among the studied populations. The Kazakh population was grouped with Asian populations according to the cluster analysis and with the Caucasian populations according to PCA. According to MDS, results of the current study show that the Kazakh population holds an intermediate position between Caucasian and Asian populations. Conclusion A high percentage of population differentiation was observed between Kazakh and world populations. The Kazakh population was clustered with Caucasian populations, and this result may indicate a significant Caucasian component in the Kazakh gene pool. PMID:29551892

  15. Floral and Vegetative Morphometrics of Five Pleurothallis (Orchidaceae) Species: Correlation with Taxonomy, Phylogeny, Genetic Variability and Pollination Systems

    PubMed Central

    BORBA, EDUARDO L.; SHEPHERD, GEORGE J.; BERG, CÁSSIO VAN DEN; SEMIR, JOÃO

    2002-01-01

    Morphometric analyses of vegetative and floral characters were conducted in 21 populations of five Pleurothallis (Orchidaceae) species occurring in Brazilian ‘campo rupestre’ vegetation. A phylogenetic analysis of this species group was also carried out using nuclear ribosomal DNA internal transcribed spacers (ITS1 and ITS2). Results of the ordination and cluster analyses agree with species’ delimitation revealed by taxonomic and allozyme studies. The groups formed in ordination analysis correspond to the pollinator groups determined in a previous pollination study. Relationships among the species in the cluster analysis using only vegetative characters are similar to those found in a previous allozyme study, but those indicated by cluster analysis using only floral characters differ. These results support the hypothesis that floral similarities are due to convergence driven by similar pollination mechanisms, and therefore floral traits may not be good indicators of phylogenetic relationships in this group. The results of the phylogenetic analysis support this conclusion to some extent. There is no correlation between genetic (allozyme) and morphological variability in the populations nor in the way this variability is distributed among conspecific populations. We describe a new subspecies of Pleurothallis ochreata based on differences in vegetative and chemical characters as well as geographic distribution. Absence of differentiation in floral characters, attraction of the same pollinator species, interfertility and genetic similarity support the argument for subspecific rather than specific status. PMID:12197519

  16. Hybrid Collaborative Learning for Classification and Clustering in Sensor Networks

    NASA Technical Reports Server (NTRS)

    Wagstaff, Kiri L.; Sosnowski, Scott; Lane, Terran

    2012-01-01

    Traditionally, nodes in a sensor network simply collect data and then pass it on to a centralized node that archives, distributes, and possibly analyzes the data. However, analysis at the individual nodes could enable faster detection of anomalies or other interesting events as well as faster responses, such as sending out alerts or increasing the data collection rate. There is an additional opportunity for increased performance if learners at individual nodes can communicate with their neighbors. In previous work, methods were developed by which classification algorithms deployed at sensor nodes can communicate information about event labels to each other, building on prior work with co-training, self-training, and active learning. The idea of collaborative learning was extended to function for clustering algorithms as well, similar to ideas from penta-training and consensus clustering. However, collaboration between these learner types had not been explored. A new protocol was developed by which classifiers and clusterers can share key information about their observations and conclusions as they learn. This is an active collaboration in which learners of either type can query their neighbors for information that they then use to re-train or re-learn the concept they are studying. The protocol also supports broadcasts from the classifiers and clusterers to the rest of the network to announce new discoveries. Classifiers observe an event and assign it a label (type). Clusterers instead group observations into clusters without assigning them a label, and they collaborate in terms of pairwise constraints between two events [same-cluster (must-link) or different-cluster (cannot-link)]. Fundamentally, these two learner types speak different languages. 
To bridge this gap, the new communication protocol provides four types of exchanges: hybrid queries for information, hybrid "broadcasts" of learned information, each specified for classifiers-to-clusterers, and clusterers-to-classifiers. The new capability has the potential to greatly expand the in situ analysis abilities of sensor networks. Classifiers seeking to categorize incoming data into different types of events can operate in tandem with clusterers that are sensitive to the occurrence of new kinds of events not known to the classifiers. In contrast to current approaches that treat these operations as independent components, a hybrid collaborative learning system can enable them to learn from each other.
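
    One way to picture the classifier-to-clusterer direction of such an exchange is to translate event labels into the pairwise constraints a clusterer understands. This is only an illustrative sketch, not the protocol described in the record: the function name, event identifiers, and labels are all hypothetical.

```python
from itertools import combinations

def labels_to_constraints(labeled_events):
    """Turn classifier labels into pairwise clustering constraints.

    `labeled_events` maps an event id to the label a classifier gave it.
    Two events with the same label yield a must-link pair; two events
    with different labels yield a cannot-link pair.
    """
    must_link, cannot_link = [], []
    for (a, label_a), (b, label_b) in combinations(sorted(labeled_events.items()), 2):
        (must_link if label_a == label_b else cannot_link).append((a, b))
    return must_link, cannot_link

# Hypothetical observations shared by a classifier node
events = {"e1": "type_a", "e2": "type_a", "e3": "type_b"}
must_link, cannot_link = labels_to_constraints(events)
print("must-link:", must_link)
print("cannot-link:", cannot_link)
```

    The reverse direction is harder, since a cluster carries no label; a clusterer can only tell a classifier which events it believes belong together.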

  17. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ogden, K; O’Dwyer, R; Bradford, T

    Purpose: To reduce differences in features calculated from MRI brain scans acquired at different field strengths with or without Gadolinium contrast. Methods: Brain scans were processed for 111 epilepsy patients to extract hippocampus and thalamus features. Scans were acquired on 1.5 T scanners with Gadolinium contrast (group A), 1.5 T scanners without Gd (group B), and 3.0 T scanners without Gd (group C). A total of 72 features were extracted. Features were extracted from original scans and from scans where the image pixel values were rescaled to the mean of the hippocampi and thalami values. For each data set, cluster analysis was performed on the raw feature set and for feature sets with normalization (conversion to Z scores). Two methods of normalization were used: the first was over all values of a given feature, and the second by normalizing within the patient group membership. The clustering software was configured to produce 3 clusters. Group fractions in each cluster were calculated. Results: For features calculated from both the non-rescaled and rescaled data, cluster membership was identical for both the non-normalized and normalized data sets. Cluster 1 was comprised entirely of Group A data, Cluster 2 contained data from all three groups, and Cluster 3 contained data from only groups 1 and 2. For the categorically normalized data sets there was a more uniform distribution of group data in the three Clusters. A less pronounced effect was seen in the rescaled image data features. Conclusion: Image rescaling and feature renormalization can have a significant effect on the results of clustering analysis. These effects are also likely to influence the results of supervised machine learning algorithms. It may be possible to partly remove the influence of scanner field strength and the presence of Gadolinium based contrast in feature extraction for radiomics applications.
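
    The two normalization schemes contrasted in this record (Z scores over all values of a feature versus Z scores within each acquisition group) can be sketched as below. The feature values and the constant offset between groups are hypothetical, chosen only to show why within-group normalization can remove a scanner-related shift.

```python
from statistics import mean, pstdev

def z_scores(values):
    """Z scores over all values (assumes the values are not all identical)."""
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]

def z_scores_within_groups(values, groups):
    """Z scores computed separately inside each group, returned in input order."""
    out = [0.0] * len(values)
    for g in set(groups):
        idx = [i for i, x in enumerate(groups) if x == g]
        for i, z in zip(idx, z_scores([values[i] for i in idx])):
            out[i] = z
    return out

# Hypothetical feature: group A carries a constant offset of +10 relative to group B.
values = [11.0, 12.0, 13.0, 1.0, 2.0, 3.0]
groups = ["A", "A", "A", "B", "B", "B"]

global_z = z_scores(values)                       # offset survives: A positive, B negative
group_z = z_scores_within_groups(values, groups)  # offset removed: A and B look alike
print(global_z)
print(group_z)
```

    With global normalization a clustering algorithm would still separate the two groups by the offset alone; after within-group normalization the two groups have identical profiles.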

  18. Recent TB transmission, clustering and predictors of large clusters in London, 2010–2012: results from first 3 years of universal MIRU-VNTR strain typing

    PubMed Central

    Hamblion, Esther L; Le Menach, Arnaud; Anderson, Laura F; Lalor, Maeve K; Brown, Tim; Abubakar, Ibrahim; Anderson, Charlotte; Maguire, Helen; Anderson, Sarah R

    2016-01-01

    Background The incidence of TB has doubled in the last 20 years in London. A better understanding of risk groups for recent transmission is required to effectively target interventions. We investigated the molecular epidemiological characteristics of TB cases to estimate the proportion of cases due to recent transmission, and identify predictors for belonging to a cluster. Methods The study population included all culture-positive TB cases in London residents, notified between January 2010 and December 2012, strain typed using 24-loci multiple interspersed repetitive units-variable number tandem repeats. Multivariable logistic regression analysis was performed to assess the risk factors for clustering using sociodemographic and clinical characteristics of cases and for cluster size based on the characteristics of the first two cases. Results There were 10 147 cases of which 5728 (57%) were culture confirmed and 4790 isolates (84%) were typed. 2194 (46%) were clustered in 570 clusters, and the estimated proportion attributable to recent transmission was 34%. Clustered cases were more likely to be UK born, have pulmonary TB, a previous diagnosis, a history of substance abuse or alcohol abuse and imprisonment, be of white, Indian, black-African or Caribbean ethnicity. The time between notification of the first two cases was more likely to be <90 days in large clusters. Conclusions Up to a third of TB cases in London may be due to recent transmission. Resources should be directed to the timely investigation of clusters involving cases with risk factors, particularly those with a short period between the first two cases, to interrupt onward transmission of TB. PMID:27417280
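
    The reported percentages can be checked with a short calculation. The record does not name the estimator behind the 34% figure; the sketch assumes the widely used "n minus one" method, in which one case per cluster is treated as the source and the remaining clustered cases as recent transmission. The counts are taken from the record.

```python
def recent_transmission_fraction(clustered_cases, clusters, typed_cases):
    """n-minus-one estimate: subtract one presumed source case per cluster."""
    return (clustered_cases - clusters) / typed_cases

typed, clustered, clusters = 4790, 2194, 570

clustered_share = clustered / typed
recent_share = recent_transmission_fraction(clustered, clusters, typed)
print(f"clustered: {clustered_share:.0%}, recent transmission: {recent_share:.0%}")
```

    Under this assumption the calculation reproduces both the 46% clustered and the 34% recent-transmission figures quoted in the record.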

  19. Intercenter Differences in Bronchopulmonary Dysplasia or Death Among Very Low Birth Weight Infants

    PubMed Central

    Walsh, Michele; Bobashev, Georgiy; Das, Abhik; Levine, Burton; Carlo, Waldemar A.; Higgins, Rosemary D.

    2011-01-01

    OBJECTIVES: To determine (1) the magnitude of clustering of bronchopulmonary dysplasia (36 weeks) or death (the outcome) across centers of the Eunice Kennedy Shriver National Institute of Child and Human Development National Research Network, (2) the infant-level variables associated with the outcome and estimate their clustering, and (3) the center-specific practices associated with the differences and build predictive models. METHODS: Data on neonates with a birth weight of <1250 g from the cluster-randomized benchmarking trial were used to determine the magnitude of clustering of the outcome according to alternating logistic regression by using pairwise odds ratio and predictive modeling. Clinical variables associated with the outcome were identified by using multivariate analysis. The magnitude of clustering was then evaluated after correction for infant-level variables. Predictive models were developed by using center-specific and infant-level variables for data from 2001–2004 and projected to 2006. RESULTS: In 2001–2004, clustering of bronchopulmonary dysplasia/death was significant (pairwise odds ratio: 1.3; P < .001) and increased in 2006 (pairwise odds ratio: 1.6; overall incidence: 52%; range across centers: 32%–74%); center rates were relatively stable over time. Variables that varied according to center and were associated with increased risk of outcome included lower body temperature at NICU admission, use of prophylactic indomethacin, specific drug therapy on day 1, and lack of endotracheal intubation. Center differences remained significant even after correction for clustered variables. CONCLUSION: Bronchopulmonary dysplasia/death rates demonstrated moderate clustering according to center. Clinical variables associated with the outcome were also clustered. Center differences after correction of clustered variables indicate presence of as-yet unmeasured center variables. PMID:21149431

  20. Chemometrics-based Approach in Analysis of Arnicae flos

    PubMed Central

    Zheleva-Dimitrova, Dimitrina Zh.; Balabanova, Vessela; Gevrenova, Reneta; Doichinova, Irini; Vitkova, Antonina

    2015-01-01

    Introduction: Arnica montana flowers have a long history as herbal medicines for external use on injuries and rheumatic complaints. Objective: To investigate Arnicae flos of cultivated accessions from Bulgaria, Poland, Germany, Finland, and Pharmacy store for phenolic derivatives and sesquiterpene lactones (STLs). Materials and Methods: Samples of Arnica from nine origins were prepared by ultrasound-assisted extraction with 80% methanol for phenolic compounds analysis. Subsequent reverse-phase high-performance liquid chromatography (HPLC) separation of the analytes was performed using gradient elution and ultraviolet detection at 280 and 310 nm (phenolic acids), and 360 nm (flavonoids). Total STLs were determined in chloroform extracts by solid-phase extraction-HPLC at 225 nm. The HPLC generated chromatographic data were analyzed using principal component analysis (PCA) and hierarchical clustering (HC). Results: The highest total amount of phenolic acids was found in the sample from Botanical Garden at Joensuu University, Finland (2.36 mg/g dw). Astragalin, isoquercitrin, and isorhamnetin 3-glucoside were the main flavonol glycosides being present up to 3.37 mg/g (astragalin). Three well-defined clusters were distinguished by PCA and HC. Cluster C1 comprised of the German and Finnish accessions characterized by the highest content of flavonols. Cluster C2 included the Bulgarian and Polish samples presenting a low content of flavonoids. Cluster C3 consisted only of one sample from a pharmacy store. Conclusion: A validated HPLC method for simultaneous determination of phenolic acids, flavonoid glycosides, and aglycones in A. montana flowers was developed. The PCA loading plot showed that quercetin, kaempferol, and isorhamnetin can be used to distinguish different Arnica accessions. 
SUMMARY A principal component analysis (PCA) on 13 phenolic compounds and total amount of sesquiterpene lactones in Arnicae flos collection tended to cluster the studied 9 accessions into three main groups. The profiles obtained demonstrated that the samples from Germany and Finland are characterized by greater amounts of phenolic derivatives than the Bulgarian and Polish ones. The PCA loading plot showed that quercetin, kaempferol and isorhamnetin can be used to distinguish different arnica accessions. PMID:27013791

  1. Hybrid cloud and cluster computing paradigms for life science applications

    PubMed Central

    2010-01-01

    Background Clouds and MapReduce have shown themselves to be a broadly useful approach to scientific computing especially for parallel data intensive applications. However they have limited applicability to some areas such as data mining because MapReduce has poor performance on problems with an iterative structure present in the linear algebra that underlies much data analysis. Such problems can be run efficiently on clusters using MPI leading to a hybrid cloud and cluster environment. This motivates the design and implementation of an open source Iterative MapReduce system Twister. Results Comparisons of Amazon, Azure, and traditional Linux and Windows environments on common applications have shown encouraging performance and usability comparisons in several important non iterative cases. These are linked to MPI applications for final stages of the data analysis. Further we have released the open source Twister Iterative MapReduce and benchmarked it against basic MapReduce (Hadoop) and MPI in information retrieval and life sciences applications. Conclusions The hybrid cloud (MapReduce) and cluster (MPI) approach offers an attractive production environment while Twister promises a uniform programming environment for many Life Sciences applications. Methods We used commercial clouds Amazon and Azure and the NSF resource FutureGrid to perform detailed comparisons and evaluations of different approaches to data intensive computing. Several applications were developed in MPI, MapReduce and Twister in these different environments. PMID:21210982

  2. TECHNOLOGICAL INNOVATION IN NEUROSURGERY: A QUANTITATIVE STUDY

    PubMed Central

    Marcus, Hani J; Hughes-Hallett, Archie; Kwasnicki, Richard M; Darzi, Ara; Yang, Guang-Zhong; Nandi, Dipankar

    2015-01-01

    Object Technological innovation within healthcare may be defined as the introduction of a new technology that initiates a change in clinical practice. Neurosurgery is a particularly technologically intensive surgical discipline, and new technologies have preceded many of the major advances in operative neurosurgical technique. The aim of the present study was to quantitatively evaluate technological innovation in neurosurgery using patents and peer-reviewed publications as metrics of technology development and clinical translation respectively. Methods A patent database was searched between 1960 and 2010 using the search terms “neurosurgeon” OR “neurosurgical” OR “neurosurgery”. The top 50 performing patent codes were then grouped into technology clusters. Patent and publication growth curves were then generated for these technology clusters. A top performing technology cluster was then selected as an exemplar for more detailed analysis of individual patents. Results In all, 11,672 patents and 208,203 publications relating to neurosurgery were identified. The top performing technology clusters over the 50 years were: image guidance devices, clinical neurophysiology devices, neuromodulation devices, operating microscopes and endoscopes. Image guidance and neuromodulation devices demonstrated a highly correlated rapid rise in patents and publications, suggesting they are areas of technology expansion. In-depth analysis of neuromodulation patents revealed that the majority of high performing patents were related to Deep Brain Stimulation (DBS). Conclusions Patent and publication data may be used to quantitatively evaluate technological innovation in neurosurgery. PMID:25699414

  3. Analysis of Chromobacterium sp. natural isolates from different Brazilian ecosystems

    PubMed Central

    Lima-Bittencourt, Cláudia I; Astolfi-Filho, Spartaco; Chartone-Souza, Edmar; Santos, Fabrício R; Nascimento, Andréa MA

    2007-01-01

    Background Chromobacterium violaceum is a free-living bacterium able to survive under diverse environmental conditions. In this study we evaluate the genetic and physiological diversity of Chromobacterium sp. isolates from three Brazilian ecosystems: Brazilian Savannah (Cerrado), Atlantic Rain Forest and Amazon Rain Forest. We have analyzed the diversity with molecular approaches (16S rRNA gene sequences and amplified ribosomal DNA restriction analysis) and phenotypic surveys of antibiotic resistance and biochemistry profiles. Results In general, the clusters based on physiological profiles included isolates from two or more geographical locations indicating that they are not restricted to a single ecosystem. The isolates from Brazilian Savannah presented greater physiologic diversity and their biochemical profile was the most variable of all groupings. The isolates recovered from Amazon and Atlantic Rain Forests presented the most similar biochemical characteristics to the Chromobacterium violaceum ATCC 12472 strain. Clusters based on biochemical profiles were congruent with clusters obtained by the 16S rRNA gene tree. According to the phylogenetic analyses, isolates from the Amazon Rain Forest and Savannah displayed a closer relationship to the Chromobacterium violaceum ATCC 12472. Furthermore, 16S rRNA gene tree revealed a good correlation between phylogenetic clustering and geographic origin. Conclusion The physiological analyses clearly demonstrate the high biochemical versatility found in the C. violaceum genome and molecular methods allowed detection of the intra- and inter-population diversity of isolates from three Brazilian ecosystems. PMID:17584942

  4. Molecular Characterization of Cryptosporidium spp., Giardia duodenalis, and Enterocytozoon bieneusi in Captive Wildlife at Zhengzhou Zoo, China.

    PubMed

    Li, Junqiang; Qi, Meng; Chang, Yankai; Wang, Rongjun; Li, Tongyi; Dong, Haiju; Zhang, Longxian

    2015-01-01

    Cryptosporidium spp., Giardia duodenalis, and Enterocytozoon bieneusi are common gastrointestinal protists in humans and animals. Two hundred and three fecal specimens from 80 wildlife species were collected in Zhengzhou Zoo and their genomic DNA extracted. Three intestinal pathogens were characterized with a DNA sequence analysis of different loci. Cryptosporidium felis, C. baileyi, and avian genotype III were identified in three specimens (1.5%), the manul, red-crowned crane, and cockatiel, respectively. Giardia duodenalis was also found in five specimens (2.5%) firstly: assemblage B in a white-cheeked gibbon and beaver, and assemblage F in a Chinese leopard and two Siberian tigers, respectively. Thirteen genotypes of E. bieneusi (seven previously reported genotypes and six new genotypes) were detected in 32 specimens (15.8%), of which most were reported for the first time. A phylogenetic analysis of E. bieneusi showed that five genotypes (three known and two new) clustered in group 1; three known genotypes clustered in group 2; one known genotype clustered in group 4; and the remaining four genotypes clustered in a new group. In conclusion, zoonotic Cryptosporidium spp., G. duodenalis, and E. bieneusi are maintained in wildlife and transmitted between them. Zoonotic disease outbreaks of these infectious agents possibly originate in wildlife reservoirs. © 2015 The Author(s) Journal of Eukaryotic Microbiology © 2015 International Society of Protistologists.

  5. Amplified fragment length polymorphism of Streptococcus suis strains correlates with their profile of virulence-associated genes and clinical background.

    PubMed

    Rehm, Thomas; Baums, Christoph G; Strommenger, Birgit; Beyerbach, Martin; Valentin-Weigand, Peter; Goethe, Ralph

    2007-01-01

    Amplified fragment length polymorphism (AFLP) typing was applied to 116 Streptococcus suis isolates with different clinical backgrounds (invasive/pneumonia/carrier/human) and with known profiles of virulence-associated genes (cps1, -2, -7 and -9, as well as mrp, epf and sly). A dendrogram was generated that allowed identification of two clusters (A and C) with different subclusters (A1, A2, C1 and C2) and two heterogeneous groups of strains (B and D). For comparison, three strains from each AFLP subcluster and group were subjected to multilocus sequence typing (MLST) analysis. The closest relationship and lowest diversity were found for patterns clustering within AFLP subcluster A1, which corresponded with sequence type (ST) complex 1. Strains within subcluster A1 were mainly invasive cps1 and mrp+ epf+ (or epf*) sly+ cps2+ strains of porcine or human origin. A new finding of this study was the clustering of invasive mrp* cps9 isolates within subcluster A2. MLST analysis suggested that A2 correlates with a single ST complex (ST87). In contrast to A1 and A2, subclusters C1 and C2 contained mainly pneumonia isolates of genotype cps7 or cps2 and epf- sly-. In conclusion, this study demonstrates that AFLP allows identification of clusters of S. suis strains with clinical relevance.

  6. Spatial autocorrelation analysis of health care hotspots in Taiwan in 2006

    PubMed Central

    2009-01-01

    Background Spatial analytical techniques and models are often used in epidemiology to identify spatial anomalies (hotspots) in disease regions. These analytical approaches can be used to not only identify the location of such hotspots, but also their spatial patterns. Methods In this study, we utilize spatial autocorrelation methodologies, including Global Moran's I and Local Getis-Ord statistics, to describe and map spatial clusters, and areas in which these are situated, for the 20 leading causes of death in Taiwan. In addition, we use the fit to a logistic regression model to test the characteristics of similarity and dissimilarity by gender. Results Gender is compared in efforts to formulate the common spatial risk. The mean found by local spatial autocorrelation analysis is utilized to identify spatial cluster patterns. There is naturally great interest in discovering the relationship between the leading causes of death and well-documented spatial risk factors. For example, in Taiwan, we found the geographical distribution of clusters where there is a prevalence of tuberculosis to closely correspond to the location of aboriginal townships. Conclusions Cluster mapping helps to clarify issues such as the spatial aspects of both internal and external correlations for leading health care events. This is of great aid in assessing spatial risk factors, which in turn facilitates the planning of the most advantageous types of health care policies and implementation of effective health care services. PMID:20003460
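    The Global Moran's I statistic named in the record above can be illustrated with a short sketch. This is not the authors' code: the function name morans_i and the toy weight matrix line_weights are hypothetical, and the matrix assumes binary adjacency between four regions arranged in a line. The statistic is I = (n / W) * sum_ij(w_ij * d_i * d_j) / sum_i(d_i^2), where d_i is the deviation of region i from the mean and W is the sum of all weights.

```python
def morans_i(values, weights):
    """Global Moran's I for per-region values and a square spatial weight matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]            # deviation of each region from the mean
    w_sum = sum(sum(row) for row in weights)    # W: total of all spatial weights
    cross = sum(
        weights[i][j] * dev[i] * dev[j]
        for i in range(n)
        for j in range(n)
    )
    return (n / w_sum) * (cross / sum(d * d for d in dev))


# Hypothetical example: four regions in a row, neighbours share a border.
line_weights = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
clustered = morans_i([1, 1, 5, 5], line_weights)    # similar values sit together
alternating = morans_i([1, 5, 1, 5], line_weights)  # neighbours always differ
```

    In this toy case the clustered layout gives a positive value and the alternating layout a negative one, which is the sense in which the statistic summarises spatial clustering.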

  7. Clustering Heart Rate Dynamics Is Associated with β-Adrenergic Receptor Polymorphisms: Analysis by Information-Based Similarity Index

    PubMed Central

    Yang, Albert C.; Tsai, Shih-Jen; Hong, Chen-Jee; Wang, Cynthia; Chen, Tai-Jui; Liou, Ying-Jay; Peng, Chung-Kang

    2011-01-01

    Background Genetic polymorphisms in the gene encoding the β-adrenergic receptors (β-AR) have a pivotal role in the functions of the autonomic nervous system. Using heart rate variability (HRV) as an indicator of autonomic function, we present a bottom-up genotype–phenotype analysis to investigate the association between β-AR gene polymorphisms and heart rate dynamics. Methods A total of 221 healthy Han Chinese adults (59 males and 162 females, aged 33.6±10.8 years, range 19 to 63 years) were recruited and genotyped for three common β-AR polymorphisms: β1-AR Ser49Gly, β2-AR Arg16Gly and β2-AR Gln27Glu. Each subject underwent two hours of electrocardiogram monitoring at rest. We applied an information-based similarity (IBS) index to measure the pairwise dissimilarity of heart rate dynamics among study subjects. Results With the aid of agglomerative hierarchical cluster analysis, we categorized subjects into major clusters, which were found to have significantly different distributions of β2-AR Arg16Gly genotype. Furthermore, the non-randomness index, a nonlinear HRV measure derived from the IBS method, was significantly lower in Arg16 homozygotes than in Gly16 carriers. The non-randomness index was negatively correlated with parasympathetic-related HRV variables and positively correlated with those HRV indices reflecting a sympathovagal shift toward sympathetic activity. Conclusions We demonstrate a bottom-up categorization approach combining the IBS method and hierarchical cluster analysis to detect subgroups of subjects with HRV phenotypes associated with β-AR polymorphisms. Our results provide evidence that β2-AR polymorphisms are significantly associated with the acceleration/deceleration pattern of heart rate oscillation, reflecting the underlying mode of autonomic nervous system control. PMID:21573230

  8. Typology of adults diagnosed with mental disorders based on socio-demographics and clinical and service use characteristics

    PubMed Central

    2011-01-01

    Background Mental disorder is a leading cause of morbidity worldwide. Its cost and negative impact on productivity are substantial. Consequently, improving mental health-care system efficiency - especially service utilisation - is a priority. Few studies have explored the use of services by specific subgroups of persons with mental disorder; a better understanding of these individuals is key to improving service planning. This study develops a typology of individuals, diagnosed with mental disorder in a 12-month period, based on their individual characteristics and use of services within a Canadian urban catchment area of 258,000 persons served by a psychiatric hospital. Methods From among the 2,443 people who took part in the survey, 406 (17%) experienced at least one episode of mental disorder (as per the Composite International Diagnostic Interview (CIDI)) in the 12 months pre-interview. These individuals were selected for cluster analysis. Results Analysis yielded four user clusters: people who experienced mainly anxiety disorder; depressive disorder; alcohol and/or drug disorder; and multiple mental and dependence disorder. Two clusters were more closely associated with females and anxiety or depressive disorders. In the two other clusters, males were over-represented compared with the sample as a whole, namely, substance abuses with or without concomitant mental disorder. Clusters with the greatest number of mental disorders per subject used a greater number of mental health-care services. Conversely, clusters associated exclusively with dependence disorders used few services. Conclusion The study found considerable heterogeneity among socio-demographic characteristics, number of disorders, and number of health-care services used by individuals with mental or dependence disorders. Cluster analysis revealed important differences in service use with regard to gender and age. 
It reinforces the relevance of developing targeted programs for subgroups of individuals with mental and/or dependence disorders. Strategies aimed at changing low service users' attitude (youths and males) or instituting specialised programs for that particular clientele should be promoted. Finally, as concomitant disorders are frequent among individuals with mental disorder, psychological services and/or addiction programs must be prioritised as components of integrated services when planning treatment. PMID:21507251

  9. Family and community violence of schoolchildren from the city of São Gonçalo, Rio de Janeiro, Brazil.

    PubMed

    Pinto, Liana Wernersbach; Gonçalves de Assis, Simone

    2013-06-01

    This descriptive study aimed to investigate the association between violence in the family, school and community experienced by school children/adolescents of the city of São Gonçalo (RJ), Brazil. Questionnaires were administered to the mothers/guardians to assess violence in the family and school and to children to check their perceptions of community violence. Multiple correspondence analysis and cluster analysis, two exploratory descriptive techniques, were employed. Data from 280 schoolchildren were analyzed. A total of 43.9% of mothers reported that their children had been physically abused in their homes. With regard to children's/adolescents' perception of community violence, 93.2% said they had experienced or witnessed these events in their communities. For both sexes there was the formation of a cluster of categories with the presence of violence among siblings, presence of severe physical assault and verbal assault committed by parents. Among girls, the presence of violence in the school formed a cluster with the highest category of violence in the community. In conclusion, it should be emphasized that public policies aimed at dealing with violence should expand their scope to the various forms of violence affecting children.

  10. Cultivar identification and genetic relationship of pineapple (Ananas comosus) cultivars using SSR markers.

    PubMed

    Lin, Y S; Kuan, C S; Weng, I S; Tsai, C C

    2015-11-25

    The genetic relationships among 27 pineapple [Ananas comosus (L.) Merr.] cultivars and lines were examined using 16 simple sequence repeat (SSR) markers. The number of alleles per locus of the SSR markers ranged from 2 to 6 (average 3.19), for a total of 51 alleles. Similarity coefficients were calculated on the basis of 51 amplified bands. A dendrogram was created according to the 16 SSR markers by the unweighted pair-group method. The banding patterns obtained from the SSR primers allowed most of the cultivars and lines to be distinguished, with the exception of vegetative clones. According to the dendrogram, the 27 pineapple cultivars and lines were clustered into three main clusters and four individual clusters. As expected, the dendrogram showed that derived cultivars and lines are closely related to their parental cultivars; the genetic relationships between pineapple cultivars agree with the genealogy of their breeding history. In addition, the analysis showed that there is no obvious correlation between SSR markers and morphological characters. In conclusion, SSR analysis is an efficient method for pineapple cultivar identification and can offer valuable informative characters to identify pineapple cultivars in Taiwan.

  11. Transcriptomic markers meet the real world: finding diagnostic signatures of corticosteroid treatment in commercial beef samples

    PubMed Central

    2012-01-01

    Background The use of growth-promoters in beef cattle, despite the EU ban, remains a frequent practice. The use of transcriptomic markers has already been proposed to identify indirect evidence of anabolic hormone treatment. So far, such an approach has been tested in experimentally treated animals. Here, for the first time commercial samples were analyzed. Results Quantitative determination of Dexamethasone (DEX) residues in the urine collected at the slaughterhouse was performed by Liquid Chromatography-Mass Spectrometry (LC-MS). DNA-microarray technology was used to obtain transcriptomic profiles of skeletal muscle in commercial samples and negative controls. LC-MS confirmed the presence of low level of DEX residues in the urine of the commercial samples suspect for histological classification. Principal Component Analysis (PCA) on microarray data identified two clusters of samples. One cluster included negative controls and a subset of commercial samples, while a second cluster included part of the specimens collected at the slaughterhouse together with positives for corticosteroid treatment based on thymus histology and LC-MS. Functional analysis of the differentially expressed genes (3961) between the two groups provided further evidence that animals clustering with positive samples might have been treated with corticosteroids. These suspect samples could be reliably classified with a specific classification tool (Prediction Analysis of Microarray) using just two genes. Conclusions Despite broad variation observed in gene expression profiles, the present study showed that DNA-microarrays can be used to find transcriptomic signatures of putative anabolic treatments and that gene expression markers could represent a useful screening tool. PMID:23110699

  12. Major depressive disorder subtypes to predict long-term course

    PubMed Central

    van Loo, Hanna M.; Cai, Tianxi; Gruber, Michael J.; Li, Junlong; de Jonge, Peter; Petukhova, Maria; Rose, Sherri; Sampson, Nancy A.; Schoevers, Robert A.; Wardenaar, Klaas J.; Wilcox, Marsha A.; Al-Hamzawi, Ali Obaid; Andrade, Laura Helena; Bromet, Evelyn J.; Bunting, Brendan; Fayyad, John; Florescu, Silvia E.; Gureje, Oye; Hu, Chiyi; Huang, Yueqin; Levinson, Daphna; Medina-Mora, Maria Elena; Nakane, Yoshibumi; Posada-Villa, Jose; Scott, Kate M.; Xavier, Miguel; Zarkov, Zahari; Kessler, Ronald C.

    2016-01-01

    Background Variation in course of major depressive disorder (MDD) is not strongly predicted by existing subtype distinctions. A new subtyping approach is considered here. Methods Two data mining techniques, ensemble recursive partitioning and Lasso generalized linear models (GLMs) followed by k-means cluster analysis, are used to search for subtypes based on index episode symptoms predicting subsequent MDD course in the World Mental Health (WMH) Surveys. The WMH surveys are community surveys in 16 countries. Lifetime DSM-IV MDD was reported by 8,261 respondents. Retrospectively reported outcomes included measures of persistence (number of years with an episode; number of years with an episode lasting most of the year) and severity (hospitalization for MDD; disability due to MDD). Results Recursive partitioning found significant clusters defined by the conjunctions of early onset, suicidality, and anxiety (irritability, panic, nervousness-worry-anxiety) during the index episode. GLMs found additional associations involving a number of individual symptoms. Predicted values of the four outcomes were strongly correlated. Cluster analysis of these predicted values found three clusters having consistently high, intermediate, or low predicted scores across all outcomes. The high-risk cluster (30.0% of respondents) accounted for 52.9-69.7% of high persistence and severity and was most strongly predicted by index episode severe dysphoria, suicidality, anxiety, and early onset. A total symptom count, in comparison, was not a significant predictor. Conclusions Despite being based on retrospective reports, results suggest that useful MDD subtyping distinctions can be made using data mining methods. Further studies are needed to test and expand these results with prospective data. PMID:24425049

  13. Psychological Factors Predict Local and Referred Experimental Muscle Pain: A Cluster Analysis in Healthy Adults

    PubMed Central

    Lee, Jennifer E.; Watson, David; Frey-Law, Laura A.

    2012-01-01

    Background Recent studies suggest an underlying three- or four-factor structure explains the conceptual overlap and distinctiveness of several negative emotionality and pain-related constructs. However, the validity of these latent factors for predicting pain has not been examined. Methods A cohort of 189 (99F; 90M) healthy volunteers completed eight self-report negative emotionality and pain-related measures (Eysenck Personality Questionnaire-Revised; Positive and Negative Affect Schedule; State-Trait Anxiety Inventory; Pain Catastrophizing Scale; Fear of Pain Questionnaire; Somatosensory Amplification Scale; Anxiety Sensitivity Index; Whiteley Index). Using principal axis factoring, three primary latent factors were extracted: General Distress; Catastrophic Thinking; and Pain-Related Fear. Using these factors, individuals clustered into three subgroups of high, moderate, and low negative emotionality responses. Experimental pain was induced via intramuscular acidic infusion into the anterior tibialis muscle, producing local (infusion site) and/or referred (anterior ankle) pain and hyperalgesia. Results Pain outcomes differed between clusters (multivariate analysis of variance and multinomial regression), with individuals in the highest negative emotionality cluster reporting the greatest local pain (p = 0.05), mechanical hyperalgesia (pressure pain thresholds; p = 0.009) and greater odds (2.21 OR) of experiencing referred pain compared to the lowest negative emotionality cluster. Conclusion Our results provide support for three latent psychological factors explaining the majority of the variance between several pain-related psychological measures, and that individuals in the high negative emotionality subgroup are at increased risk for (1) acute local muscle pain; (2) local hyperalgesia; and (3) referred pain using a standardized nociceptive input. PMID:23165778

  14. Serratia marcescens Bacteremia: Nosocomial Cluster Following Narcotic Diversion.

    PubMed

    Schuppener, Leah M; Pop-Vicas, Aurora E; Brooks, Erin G; Duster, Megan N; Crnich, Christopher J; Sterkel, Alana K; Webb, Aaron P; Safdar, Nasia

    2017-09-01

    OBJECTIVE To describe the investigation and control of a cluster of Serratia marcescens bacteremia in a 505-bed tertiary-care center. METHODS Cluster cases were defined as all patients with S. marcescens bacteremia between March 2 and April 7, 2014, who were found to have identical or related blood isolates determined by molecular typing with pulsed-field gel electrophoresis. Cases were compared using bivariate analysis with controls admitted at the same time and to the same service as the cases, in a 4:1 ratio. RESULTS In total, 6 patients developed S. marcescens bacteremia within 48 hours after admission within the above period. Of these, 5 patients had identical Serratia isolates determined by molecular typing, and were included in a case-control study. Exposure to the post-anesthesia care unit was a risk factor identified in bivariate analysis. Evidence of tampered opioid-containing syringes on several hospital units was discovered soon after the initial cluster case presented, and a full narcotic diversion investigation was conducted. A nurse working in the post-anesthesia care unit was identified as the employee responsible for the drug diversion and was epidemiologically linked to all 5 patients in the cluster. No further cases were identified once the implicated employee's job was terminated. CONCLUSION Illicit drug use by healthcare workers remains an important mechanism for the development of bloodstream infections in hospitalized patients. Active mechanisms and systems should remain in place to prevent, detect, and control narcotic drug diversions and associated patient harm in the healthcare setting. Infect Control Hosp Epidemiol 2017;38:1027-1031.

  15. Are There Subtypes of Panic Disorder? An Interpersonal Perspective

    PubMed Central

    Zilcha-Mano, Sigal; McCarthy, Kevin S.; Dinger, Ulrike; Chambless, Dianne L.; Milrod, Barbara L.; Kunik, Lauren; Barber, Jacques P.

    2015-01-01

    Objective Panic disorder (PD) is associated with significant personal, social, and economic costs. However, little is known about specific interpersonal dysfunctions that characterize the PD population. The current study systematically examined these interpersonal dysfunctions. Method The present analyses included 194 patients with PD out of a sample of 201 who were randomized to cognitive-behavioral therapy, panic-focused psychodynamic psychotherapy, or applied relaxation training. Interpersonal dysfunction was measured using the Inventory of Interpersonal Problems–Circumplex (Horowitz, Alden, Wiggins, & Pincus, 2000). Results Individuals with PD reported greater levels of interpersonal distress than that of a normative cohort (especially when PD was accompanied by agoraphobia), but lower than that of a cohort of patients with major depression. There was no single interpersonal profile that characterized PD patients. Symptom-based clusters (with versus without agoraphobia) could not be discriminated on core or central interpersonal problems. Rather, as revealed by cluster analysis based on the pathoplasticity framework, there were two empirically derived interpersonal clusters among PD patients which were not accounted for by symptom severity and were opposite in nature: domineering-intrusive and nonassertive. The empirically derived interpersonal clusters appear to be of clinical utility in predicting alliance development throughout treatment: While the domineering-intrusive cluster did not show any changes in the alliance throughout treatment, the non-assertive cluster showed a process of significant strengthening of the alliance. Conclusions Empirically derived interpersonal clusters in PD provide clinically useful and non-redundant information about individuals with PD. PMID:26030762

  16. A comparison of the near-infrared spectral features of early-type galaxies in the Coma Cluster, the Virgo cluster and the field

    NASA Technical Reports Server (NTRS)

    Houdashelt, Mark L.; Frogel, Jay A.

    1993-01-01

    Earlier researchers derived the relative distance between the Coma and Virgo clusters from color-magnitude relations of the early-type galaxies in each cluster. They found that the derived distance was color-dependent and concluded that the galaxies of similar luminosity in the two clusters differ in their red stellar populations. More recently, the color-dependence of the Coma-Virgo distance modulus has been called into question. However, because these two clusters differ so dramatically in their morphologies and kinematics, it is plausible that the star formation histories of the member galaxies also differed. If the conclusions of earlier researchers are indeed correct, then some signature of the resulting stellar population differences should appear in the near-infrared and/or infrared light of the respective galaxies. We have collected near-infrared spectra of 17 Virgo and 10 Coma early-type galaxies; this sample spans about four magnitudes in luminosity in each cluster. Seven field E/S0 galaxies have been observed for comparison. Pseudo-equivalent widths have been measured for all of the field galaxies, all but one of the Virgo members, and five of the Coma galaxies. The features examined are sensitive to the temperature, metallicity, and surface gravity of the reddest stars. A preliminary analysis of these spectral features has been performed, and, with a few notable exceptions, the measured pseudo-equivalent widths agree well with previously published values.

  17. Chaperone expression profiles correlate with distinct physiological states of Plasmodium falciparum in malaria patients

    PubMed Central

    2010-01-01

    Background Molecular chaperones have been shown to be important in the growth of the malaria parasite Plasmodium falciparum and inhibition of chaperone function by pharmacological agents has been shown to abrogate parasite growth. A recent study has demonstrated that clinical isolates of the parasite have distinct physiological states, one of which resembles environmental stress response showing up-regulation of specific molecular chaperones. Methods Chaperone networks operational in the distinct physiological clusters in clinical malaria parasites were constructed using cytoscape by utilizing their clinical expression profiles. Results Molecular chaperones show distinct profiles in the previously defined physiologically distinct states. Further, expression profiles of the chaperones from different cellular compartments correlate with specific patient clusters. While cluster 1 parasites, representing a starvation response, show up-regulation of organellar chaperones, cluster 2 parasites, which resemble active growth based on glycolysis, show up-regulation of cytoplasmic chaperones. Interestingly, cytoplasmic Hsp90 and its co-chaperones, previously implicated as drug targets in malaria, cluster in the same group. Detailed analysis of chaperone expression in the patient cluster 2 reveals up-regulation of the entire Hsp90-dependent pro-survival circuitries. In addition, cluster 2 also shows up-regulation of Plasmodium export element (PEXEL)-containing Hsp40s thought to have regulatory and host remodeling roles in the infected erythrocyte. Conclusion In all, this study demonstrates an intimate involvement of parasite-encoded chaperones, PfHsp90 in particular, in defining pathogenesis of malaria. PMID:20719001

  18. T-cell triggering thresholds are modulated by the number of antigen within individual T-cell receptor clusters

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Manz, Boryana N.; Jackson, Bryan L.; Petit, Rebecca S.

    2011-05-31

    T cells react to extremely small numbers of activating agonist peptides. Spatial organization of T-cell receptors (TCR) and their peptide-major histocompatibility complex (pMHC) ligands into microclusters is correlated with T-cell activation. In this study, we have designed an experimental strategy that enables control over the number of agonist peptides per TCR cluster, without altering the total number engaged by the cell. Supported membranes, partitioned with grids of barriers to lateral mobility, provide an effective way of limiting the total number of pMHC ligands that may be assembled within a single TCR cluster. Observations directly reveal that restriction of pMHC content within individual TCR clusters can decrease T-cell sensitivity for triggering initial calcium flux at fixed total pMHC density. Further analysis suggests that triggering thresholds are determined by the number of activating ligands available to individual TCR clusters, not by the total number encountered by the cell. Results from a series of experiments in which the overall agonist density and the maximum number of agonist per TCR cluster are independently varied in primary T cells indicate that the most probable minimal triggering unit for calcium signaling is at least four pMHC in a single cluster for this system. In conclusion, this threshold is unchanged by inclusion of coagonist pMHC, but costimulation of CD28 by CD80 can modulate the threshold lower.

  19. A highly efficient multi-core algorithm for clustering extremely large datasets

    PubMed Central

    2010-01-01

    Background In recent years, the demand for computational power in computational biology has increased due to rapidly growing data sets from microarray and other high-throughput technologies. This demand is likely to increase. Standard algorithms for analyzing data, such as cluster algorithms, need to be parallelized for fast processing. Unfortunately, most approaches for parallelizing algorithms largely rely on network communication protocols connecting and requiring multiple computers. One answer to this problem is to utilize the intrinsic capabilities in current multi-core hardware to distribute the tasks among the different cores of one computer. Results We introduce a multi-core parallelization of the k-means and k-modes cluster algorithms based on the design principles of transactional memory for clustering gene expression microarray type data and categorical SNP data. Our new shared memory parallel algorithms prove to be highly efficient. We demonstrate their computational power and show their utility in cluster stability and sensitivity analysis employing repeated runs with slightly changed parameters. Computation speed of our Java based algorithm was increased by a factor of 10 for large data sets while preserving computational accuracy compared to single-core implementations and a recently published network based parallelization. Conclusions Most desktop computers and even notebooks provide at least dual-core processors. Our multi-core algorithms show that using modern algorithmic concepts, parallelization makes it possible to perform even such laborious tasks as cluster sensitivity and cluster number estimation on the laboratory computer. PMID:20370922
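
    The idea of splitting a k-means iteration across workers on one machine can be sketched as follows. This is a minimal illustration and not the authors' Java implementation or their transactional-memory design: the function names, the toy data and the use of a thread pool are all assumptions made here. In CPython, threads will not give a real multi-core speedup, so the sketch shows only how the work is partitioned and merged.

```python
from concurrent.futures import ThreadPoolExecutor


def nearest(point, centers):
    """Index of the center closest to point (squared Euclidean distance)."""
    return min(range(len(centers)),
               key=lambda i: sum((p - c) ** 2 for p, c in zip(point, centers[i])))


def partial_sums(chunk, centers):
    """Assign one chunk of points; return per-cluster coordinate sums and counts."""
    dim = len(centers[0])
    sums = [[0.0] * dim for _ in centers]
    counts = [0] * len(centers)
    for point in chunk:
        i = nearest(point, centers)
        counts[i] += 1
        for d, value in enumerate(point):
            sums[i][d] += value
    return sums, counts


def kmeans_step(points, centers, workers=2):
    """One k-means iteration: chunked assignment, then a merged centroid update."""
    chunks = [points[w::workers] for w in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(partial_sums, chunks, [centers] * workers))
    new_centers = []
    for i, old in enumerate(centers):
        total = sum(counts[i] for _, counts in results)
        if total == 0:
            new_centers.append(list(old))  # keep an empty cluster's center unchanged
            continue
        new_centers.append([sum(sums[i][d] for sums, _ in results) / total
                            for d in range(len(old))])
    return new_centers


# Hypothetical toy data: two well-separated groups in 2-D.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
centers = [[0.0, 0.0], [10.0, 10.0]]
for _ in range(5):
    centers = kmeans_step(points, centers)
```

    Each worker touches only its own chunk and returns partial sums, so the only shared step is the final merge. Repeating the loop with slightly perturbed starting centers is one simple way to probe cluster stability of the kind the record mentions.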

  20. Strong incidence of Pseudomonas aeruginosa on bacterial rrs and ITS genetic structures of cystic fibrosis sputa

    PubMed Central

    Pages-Monteiro, Laurence; Marti, Romain; Commun, Carine; Alliot, Nolwenn; Bardel, Claire; Meugnier, Helene; Perouse-de-Montclos, Michele; Reix, Philippe; Durieu, Isabelle; Durupt, Stephane; Vandenesch, Francois; Freney, Jean; Cournoyer, Benoit; Doleans-Jordheim, Anne

    2017-01-01

    Cystic fibrosis (CF) lungs harbor a complex community of interacting microbes, including pathogens like Pseudomonas aeruginosa. Meta-taxogenomic analysis based on V5-V6 rrs PCR products of 52 P. aeruginosa-positive (Pp) and 52 P. aeruginosa-negative (Pn) pooled DNA extracts from CF sputa suggested positive associations between P. aeruginosa and Stenotrophomonas and Prevotella, but negative ones with Haemophilus, Neisseria and Burkholderia. Internal Transcribed Spacer analyses (RISA) from individual DNA extracts identified three significant genetic structures within the CF cohorts, and indicated an impact of P. aeruginosa. RISA clusters Ip and IIIp contained CF sputa with a P. aeruginosa prevalence above 93%, and of 24.2% in cluster IIp. Clusters Ip and IIIp showed lower RISA genetic diversity and richness than IIp. Highly similar cluster IIp RISA profiles were obtained from two patients harboring isolates of the same P. aeruginosa clone, suggesting convergent evolution in the structure of their microbiota. CF patients of cluster IIp had received significantly fewer antibiotics than patients of clusters Ip and IIIp but harbored the most resistant P. aeruginosa strains. Patients of cluster IIIp were older than those of Ip. The effects of P. aeruginosa on the RISA structures could not be fully dissociated from the above two confounding factors but several trends in these datasets support the conclusion of a strong incidence of P. aeruginosa on the genetic structure of CF lung microbiota. PMID:28282386

  1. Unsupervised consensus cluster analysis of [18F]-fluoroethyl-L-tyrosine positron emission tomography identified textural features for the diagnosis of pseudoprogression in high-grade glioma

    PubMed Central

    Kebir, Sied; Khurshid, Zain; Gaertner, Florian C.; Essler, Markus; Hattingen, Elke; Fimmers, Rolf; Scheffler, Björn; Herrlinger, Ulrich; Bundschuh, Ralph A.; Glas, Martin

    2017-01-01

    Rationale Timely detection of pseudoprogression (PSP) is crucial for the management of patients with high-grade glioma (HGG) but remains difficult. Textural features of O-(2-[18F]fluoroethyl)-L-tyrosine positron emission tomography (FET-PET) mirror tumor uptake heterogeneity; some of them may be associated with tumor progression. Methods Fourteen patients with HGG and suspected of PSP underwent FET-PET imaging. A set of 19 conventional and textural FET-PET features were evaluated and subjected to unsupervised consensus clustering. The final diagnosis of true progression vs. PSP was based on follow-up MRI using RANO criteria. Results Three robust clusters have been identified based on 10 predominantly textural FET-PET features. None of the patients with PSP fell into cluster 2, which was associated with high values for textural FET-PET markers of uptake heterogeneity. Three out of 4 patients with PSP were assigned to cluster 3 that was largely associated with low values of textural FET-PET features. By comparison, tumor-to-normal brain ratio (TNRmax) at the optimal cutoff 2.1 was less predictive of PSP (negative predictive value 57% for detecting true progression, p=0.07 vs. 75% with cluster 3, p=0.04). Principal Conclusions Clustering based on textural O-(2-[18F]fluoroethyl)-L-tyrosine PET features may provide valuable information in assessing the elusive phenomenon of pseudoprogression. PMID:28030820

  2. Spatial dynamics of invasion: the geometry of introduced species.

    PubMed

    Korniss, Gyorgy; Caraco, Thomas

    2005-03-07

    Many exotic species combine low probability of establishment at each introduction with rapid population growth once introduction does succeed. To analyse this phenomenon, we note that invaders often cluster spatially when rare, and consequently an introduced exotic's population dynamics should depend on locally structured interactions. Ecological theory for spatially structured invasion relies on deterministic approximations, and determinism does not address the observed uncertainty of the exotic-introduction process. We take a new approach to the population dynamics of invasion and, by extension, to the general question of invasibility in any spatial ecology. We apply the physical theory for nucleation of spatial systems to a lattice-based model of competition between plant species, a resident and an invader, and the analysis reaches conclusions that differ qualitatively from the standard ecological theories. Nucleation theory distinguishes between dynamics of single- and multi-cluster invasion. Low introduction rates and small system size produce single-cluster dynamics, where success or failure of introduction is inherently stochastic. Single-cluster invasion occurs only if the cluster reaches a critical size, typically preceded by a number of failed attempts. For this case, we identify the functional form of the probability distribution of time elapsing until invasion succeeds. Although multi-cluster invasion for sufficiently large systems exhibits spatial averaging and almost-deterministic dynamics of the global densities, an analytical approximation from nucleation theory, known as Avrami's law, describes our simulation results far better than standard ecological approximations.

  3. Ancient genomic architecture for mammalian olfactory receptor clusters

    PubMed Central

    Aloni, Ronny; Olender, Tsviya; Lancet, Doron

    2006-01-01

    Background Mammalian olfactory receptor (OR) genes reside in numerous genomic clusters of up to several dozen genes. Whole-genome sequence alignment nets of five mammals allow their comprehensive comparison, aimed at reconstructing the ancestral olfactory subgenome. Results We developed a new and general tool for genome-wide definition of genomic gene clusters conserved in multiple species. Syntenic orthologs, defined as gene pairs showing conservation of both genomic location and coding sequence, were subjected to a graph theory algorithm for discovering CLICs (clusters in conservation). When applied to ORs in five mammals, including the marsupial opossum, more than 90% of the OR genes were found within a framework of 48 multi-species CLICs, invoking a general conservation of gene order and composition. A detailed analysis of individual CLICs revealed multiple differences among species, interpretable through species-specific genomic rearrangements and reflecting complex mammalian evolutionary dynamics. One significant instance involves CLIC #1, which lacks a human member, implying the human-specific deletion of an OR cluster, whose mouse counterpart has been tentatively associated with isovaleric acid odorant detection. Conclusion The identified multi-species CLICs demonstrate that most of the mammalian OR clusters have a common ancestry, preceding the split between marsupials and placental mammals. However, only two of these CLICs were capable of incorporating chicken OR genes, parsimoniously implying that all other CLICs emerged subsequent to the avian-mammalian divergence. PMID:17010214

  4. The Gaia-ESO Survey: open clusters in Gaia-DR1 . A way forward to stellar age calibration

    NASA Astrophysics Data System (ADS)

    Randich, S.; Tognelli, E.; Jackson, R.; Jeffries, R. D.; Degl'Innocenti, S.; Pancino, E.; Re Fiorentin, P.; Spagna, A.; Sacco, G.; Bragaglia, A.; Magrini, L.; Prada Moroni, P. G.; Alfaro, E.; Franciosini, E.; Morbidelli, L.; Roccatagliata, V.; Bouy, H.; Bravi, L.; Jiménez-Esteban, F. M.; Jordi, C.; Zari, E.; Tautvaišiene, G.; Drazdauskas, A.; Mikolaitis, S.; Gilmore, G.; Feltzing, S.; Vallenari, A.; Bensby, T.; Koposov, S.; Korn, A.; Lanzafame, A.; Smiljanic, R.; Bayo, A.; Carraro, G.; Costado, M. T.; Heiter, U.; Hourihane, A.; Jofré, P.; Lewis, J.; Monaco, L.; Prisinzano, L.; Sbordone, L.; Sousa, S. G.; Worley, C. C.; Zaggia, S.

    2018-05-01

    Context. Determination and calibration of the ages of stars, which heavily rely on stellar evolutionary models, are very challenging, while representing a crucial aspect in many astrophysical areas. Aims: We describe the methodologies that, taking advantage of Gaia-DR1 and the Gaia-ESO Survey data, enable the comparison of observed open star cluster sequences with stellar evolutionary models. The final, long-term goal is the exploitation of open clusters as age calibrators. Methods: We perform a homogeneous analysis of eight open clusters using the Gaia-DR1 TGAS catalogue for bright members and information from the Gaia-ESO Survey for fainter stars. Cluster membership probabilities for the Gaia-ESO Survey targets are derived based on several spectroscopic tracers. The Gaia-ESO Survey also provides the cluster chemical composition. We obtain cluster parallaxes using two methods. The first one relies on the astrometric selection of a sample of bona fide members, while the other one fits the parallax distribution of a larger sample of TGAS sources. Ages and reddening values are recovered through a Bayesian analysis using the 2MASS magnitudes and three sets of standard models. Lithium depletion boundary (LDB) ages are also determined using literature observations and the same models employed for the Bayesian analysis. Results: For all but one cluster, parallaxes derived by us agree with those presented in Gaia Collaboration (2017, A&A, 601, A19), while a discrepancy is found for NGC 2516; we provide evidence supporting our own determination. Inferred cluster ages are robust against models and are generally consistent with literature values. Conclusions: The systematic parallax errors inherent in the Gaia DR1 data presently limit the precision of our results. Nevertheless, we have been able to place these eight clusters onto the same age scale for the first time, with good agreement between isochronal and LDB ages where there is overlap. 
Our approach appears promising and demonstrates the potential of combining Gaia and ground-based spectroscopic datasets. Based on observations collected with the FLAMES instrument at VLT/UT2 telescope (Paranal Observatory, ESO, Chile), for the Gaia-ESO Large Public Spectroscopic Survey (188.B-3002, 193.B-0936). Additional tables are only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/612/A99

  5. Tropical cyclone risk analysis: a decisive role of its track

    NASA Astrophysics Data System (ADS)

    Chelsea Nam, C.; Park, Doo-Sun R.; Ho, Chang-Hoi

    2016-04-01

    The tracks of 85 tropical cyclones (TCs) that made landfall in South Korea during the period 1979-2010 are classified into four clusters using a fuzzy c-means clustering method. The four clusters are characterized as 1) east-short, 2) east-long, 3) west-long, and 4) west-short, based on their moving routes around the Korean Peninsula. We conducted a risk comparison analysis for these four clusters regarding their hazards, exposure, and damages. Here, hazard parameters are calculated from two different sources independently, one from the best-track data (BT) and the other from the 60 weather stations over the country (WS). The results show distinct characteristics of the four clusters in terms of the hazard parameters and economic losses (EL), suggesting that there is a clear track-dependency in the overall TC risk. It appears that whether an "effective collision" occurred outweighs the intensity of the TC per se. The EL ranking did not agree with the BT parameters (maximum wind speed, central pressure, or storm radius), but matched the WS parameters (especially daily accumulated rainfall and TC-influenced period). The west-approaching TCs (i.e. west-long and west-short clusters) generally recorded larger EL than the east-approaching TCs (i.e. east-short and east-long clusters), although the east-long cluster is the strongest from the BT point of view. This can be explained through the spatial distribution of the WS parameters and the corresponding regional EL maps. West-approaching TCs brought heavy rainfall to the southern regions, helped by the topographic effect along their tracks and by their extended stay over the Korean Peninsula during extratropical transition, neither of which applied to the east-approaching TCs. On the other hand, some regions had EL that were not directly proportional to the hazards, and this is partly attributed to spatial disparity in wealth and vulnerability. 
Correlation analysis also revealed the importance of rainfall; daily accumulated rainfall is the hazard parameter most correlated with EL among all BT and WS parameters for all clusters except the east-short. The least-correlated hazard parameter is the storm radius, which showed significant correlations with EL only for the short clusters. In conclusion, this study suggests that the TC track is essential in determining how a TC causes damage in South Korea. Thus, damage warning and adaptation policies need to differ for different TC tracks, even though South Korea is relatively small compared to the average TC size.
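
    The fuzzy c-means method named in this record gives every item a graded membership in each cluster instead of a single hard label. The sketch below shows the two standard update rules on plain 2-D points. It is not the authors' track-clustering pipeline: the toy coordinates, the starting centers and the fuzzifier value m = 2 are assumptions chosen here for illustration.

```python
import math


def memberships(points, centers, m=2.0):
    """Membership of every point in every cluster; each row sums to 1."""
    table = []
    for p in points:
        dists = [math.dist(p, c) for c in centers]
        if 0.0 in dists:
            # A point sitting exactly on a center belongs fully to it.
            row = [1.0 if d == 0.0 else 0.0 for d in dists]
            total = sum(row)
            row = [r / total for r in row]
        else:
            power = 2.0 / (m - 1.0)
            row = [1.0 / sum((di / dk) ** power for dk in dists) for di in dists]
        table.append(row)
    return table


def update_centers(points, table, m=2.0):
    """New center of each cluster: mean of the points weighted by membership**m."""
    dim = len(points[0])
    centers = []
    for i in range(len(table[0])):
        weights = [row[i] ** m for row in table]
        total = sum(weights)
        centers.append([sum(w * p[d] for w, p in zip(weights, points)) / total
                        for d in range(dim)])
    return centers


# Hypothetical toy data: two groups of points, one on the left and one on the right.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
centers = [[1.0, 1.0], [9.0, 1.0]]
for _ in range(20):
    table = memberships(points, centers)
    centers = update_centers(points, table)
```

    A hard assignment, such as the four track types above, can then be read off by taking the cluster with the largest membership in each row of `table`.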

  6. Defining syndromes using cattle meat inspection data for syndromic surveillance purposes: a statistical approach with the 2005–2010 data from ten French slaughterhouses

    PubMed Central

    2013-01-01

    Background The slaughterhouse is a central processing point for food animals and thus a source of both demographic data (age, breed, sex) and health-related data (reason for condemnation and condemned portions) that are not available through other sources. Using these data for syndromic surveillance is therefore tempting. However many possible reasons for condemnation and condemned portions exist, making the definition of relevant syndromes challenging. The objective of this study was to determine a typology of cattle with at least one portion of the carcass condemned in order to define syndromes. Multiple factor analysis (MFA) in combination with clustering methods was performed using both health-related data and demographic data. Results Analyses were performed on 381,186 cattle with at least one portion of the carcass condemned among the 1,937,917 cattle slaughtered in ten French abattoirs. Results of the MFA and clustering methods led to 12 clusters considered as stable according to year of slaughter and slaughterhouse. One cluster was specific to a disease of public health importance (cysticercosis). Two clusters were linked to the slaughtering process (fecal contamination of heart or lungs and deterioration lesions). Two clusters respectively characterized by chronic liver lesions and chronic peritonitis could be linked to diseases of economic importance to farmers. Three clusters could be linked respectively to reticulo-pericarditis, fatty liver syndrome and farmer’s lung syndrome, which are related to both diseases of economic importance to farmers and herd management issues. Three clusters respectively characterized by arthritis, myopathy and Dark Firm Dry (DFD) meat could notably be linked to animal welfare issues. Finally, one cluster, characterized by bronchopneumonia, could be linked to both animal health and herd management issues. 
Conclusion The statistical approach of combining multiple factor analysis with cluster analysis showed its relevance for the detection of syndromes using available large and complex slaughterhouse data. The advantages of this statistical approach are to i) define groups of reasons for condemnation based on meat inspection data, ii) help grouping reasons for condemnation among a list of various possible reasons for condemnation for which a consensus among experts could be difficult to reach, iii) assign each animal to a single syndrome which allows the detection of changes in trends of syndromes to detect unusual patterns in known diseases and emergence of new diseases. PMID:23628140

  7. Malignant pleural mesothelioma and mesothelial hyperplasia: A new molecular tool for the differential diagnosis.

    PubMed

    Bruno, Rossella; Alì, Greta; Giannini, Riccardo; Proietti, Agnese; Lucchi, Marco; Chella, Antonio; Melfi, Franca; Mussi, Alfredo; Fontanini, Gabriella

    2017-01-10

    Malignant pleural mesothelioma (MPM) is a rare asbestos related cancer, aggressive and unresponsive to therapies. Histological examination of pleural lesions is the gold standard of MPM diagnosis, although it is sometimes hard to discriminate the epithelioid type of MPM from benign mesothelial hyperplasia (MH). This work aims to define a new molecular tool for the differential diagnosis of MPM, using the expression profile of 117 genes deregulated in this tumour. The gene expression analysis was performed by nanoString System on tumour tissues from 36 epithelioid MPM and 17 MH patients, and on 14 mesothelial pleural samples analysed in a blind way. Data analysis included raw nanoString data normalization, unsupervised cluster analysis by Pearson correlation, non-parametric Mann Whitney U-test and molecular classification by the Uncorrelated Shrunken Centroid (USC) Algorithm. The Mann-Whitney U-test found 35 genes upregulated and 31 downregulated in MPM. The unsupervised cluster analysis revealed two clusters, one composed only of MPM and one only of MH samples, thus revealing class-specific gene profiles. The Uncorrelated Shrunken Centroid algorithm identified two classifiers, one including 22 genes and the other 40 genes, able to properly classify all the samples as benign or malignant using gene expression data; both classifiers were also able to correctly determine, in a blind analysis, the diagnostic categories of all the 14 unknown samples. In conclusion, we delineated a diagnostic tool combining molecular data (gene expression) and computational analysis (USC algorithm), which can be applied in the clinical practice for the differential diagnosis of MPM.

  8. Characterizing decision-making and reward processing in bipolar disorder: A cluster analysis.

    PubMed

    Jiménez, E; Solé, B; Arias, B; Mitjans, M; Varo, C; Reinares, M; Bonnín, C M; Salagre, E; Ruíz, V; Torres, I; Tomioka, Y; Sáiz, P A; García-Portilla, M P; Burón, P; Bobes, J; Martínez-Arán, A; Torrent, C; Vieta, E; Benabarre, A

    2018-05-25

    The presence of abnormalities in emotional decision-making and reward processing among bipolar patients (BP) has been well rehearsed. These disturbances are not limited to acute phases and are common even during remission. In recent years, the existence of discrete cognitive profiles in this psychiatric population has been replicated. However, emotional decision making and reward processing domains have barely been studied. Therefore, our aim was to explore the existence of different profiles on the aforementioned cognitive dimensions in BP. The sample consisted of 126 euthymic BP. Main sociodemographic, clinical, functioning, and neurocognitive variables were gathered. A hierarchical-clustering technique was used to identify discrete neurocognitive profiles based on the performance in the Iowa Gambling Task. Afterward, the resulting clusters were compared using ANOVA or Chi-squared Test, as appropriate. Evidence for the existence of three different profiles was provided. Cluster 1 was mainly characterized by poor decision ability. Cluster 2 presented the lowest sensitivity to punishment. Finally, cluster 3 presented the best decision-making ability and the highest levels of punishment sensitivity. Comparison between the three clusters indicated that cluster 2 was the most functionally impaired group. The poorest outcomes in attention, executive function domains, and social cognition were also observed within the same group. In conclusion, similarly to that observed in "cold cognitive" domains, our results suggest the existence of three discrete cognitive profiles concerning emotional decision making and reward processing. Amongst all the indexes explored, low punishment sensitivity emerges as a potential correlate of poorer cognitive and functional outcomes in bipolar disorder. Copyright © 2018 Elsevier B.V. and ECNP. All rights reserved.

  9. Dietary BMAA Exposure in an Amyotrophic Lateral Sclerosis Cluster from Southern France

    PubMed Central

    Masseret, Estelle; Banack, Sandra; Boumédiène, Farid; Abadie, Eric; Brient, Luc; Pernet, Fabrice; Juntas-Morales, Raoul; Pageot, Nicolas; Metcalf, James; Cox, Paul; Camu, William

    2013-01-01

    Background Dietary exposure to the cyanotoxin BMAA is suspected to be the cause of amyotrophic lateral sclerosis in the Western Pacific Islands. In Europe and North America, this toxin has been identified in the marine environment of amyotrophic lateral sclerosis clusters but, to date, only a few dietary exposures have been described. Objectives We aimed to identify cluster(s) of amyotrophic lateral sclerosis in the Hérault district, a coastal district in Southern France, and to search, in the identified area(s), for the existence of a potential dietary source of BMAA. Methods A spatio-temporal cluster analysis was performed in the district, considering all incident amyotrophic lateral sclerosis cases identified from 1994 to 2009 by our expert center. We investigated the cluster area with serial collections of oysters and mussels that were subsequently analyzed blind for BMAA concentrations. Results We found one significant amyotrophic lateral sclerosis cluster (p = 0.0024), surrounding the Thau lagoon, the most important area of shellfish production and consumption along the French Mediterranean coast. BMAA was identified in mussels (1.8 µg/g to 6.0 µg/g) and oysters (0.6 µg/g to 1.6 µg/g). The highest concentrations of BMAA were measured during summer when the highest picocyanobacteria abundances were recorded. Conclusions While it is not possible to ascertain a direct link between shellfish consumption and the existence of this ALS cluster, these results add new data to the potential association of BMAA with sporadic amyotrophic lateral sclerosis, one of the most severe neurodegenerative disorders. PMID:24349504

  10. Zachary D. Barker: Final DHS HS-STEM Report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Barker, Z D

    Working at Lawrence Livermore National Laboratory (LLNL) this summer has provided a very unique and special experience for me. I feel that the research opportunities given to me have allowed me to significantly benefit my research group, the laboratory, the Department of Homeland Security, and the Department of Energy. The researchers in the Single Particle Aerosol Mass Spectrometry (SPAMS) group were very welcoming and clearly wanted me to get the most out of my time in Livermore. I feel that my research partner, Veena Venkatachalam of MIT, and I have been extremely productive in meeting our research goals throughout this summer, and have learned much about working in research at a national laboratory such as Lawrence Livermore. I have learned much about the technical aspects of research while working at LLNL, however I have also gained important experience and insight into how research groups at national laboratories function. I believe that this internship has given me valuable knowledge and experience which will certainly help my transition to graduate study and a career in engineering. My work with Veena Venkatachalam in the SPAMS group this summer has focused on two major projects. Initially, we were tasked with an analysis of data collected by the group this past spring in a large public environment. The SPAMS instrument was deployed for over two months, collecting information on many of the ambient air particles circulating through the area. Our analysis of the particle data collected during this deployment concerned several aspects, including finding groups, or clusters, of particles that seemed to appear more during certain times of day, analyzing the mass spectral data of clusters and comparing them with mass spectral data of known substances, and comparing the real-time detection capability of the SPAMS instrument with that of a commercially available biological detection instrument. 
This analysis was performed in support of a group report to the Department of Homeland Security on the results of the deployment. The analysis of the deployment data revealed some interesting applications of the SPAMS instrument to homeland security situations. Using software developed in-house by SPAMS group member Dr. Paul Steele, Veena and I were able to cluster a subset of data over a certain timeframe (ranging from a single hour to an entire week). The software used makes clusters based on the mass spectral characteristics of each particle in the data set, as well as other parameters. By looking more closely at the characteristics of individual clusters, including the mass spectra, conclusions could be made about what these particles are. This was achieved partially through examination and discussion of the mass spectral data with the members of the SPAMS group, as well as through comparison with known mass spectra collected from substances tested in the laboratory. In many cases, broad conclusions could be drawn about the identity of a cluster of particles.

  11. The XXL Survey. XII. Optical spectroscopy of X-ray-selected clusters and the frequency of AGN in superclusters

    NASA Astrophysics Data System (ADS)

    Koulouridis, E.; Poggianti, B.; Altieri, B.; Valtchanov, I.; Jaffé, Y.; Adami, C.; Elyiv, A.; Melnyk, O.; Fotopoulou, S.; Gastaldello, F.; Horellou, C.; Pierre, M.; Pacaud, F.; Plionis, M.; Sadibekova, T.; Surdej, J.

    2016-06-01

    Context. This article belongs to the first series of XXL publications. It presents multifibre spectroscopic observations of three 0.55 deg² fields in the XXL Survey, which were selected on the basis of their high density of X-ray-detected clusters. The observations were obtained with the AutoFib2+WYFFOS (AF2) wide-field fibre spectrograph mounted on the 4.2 m William Herschel Telescope. Aims: The paper first describes the scientific rationale, the preparation, the data reduction, and the results of the observations, and then presents a study of active galactic nuclei (AGN) within three superclusters. Methods: To determine the redshift of galaxy clusters and AGN, we assign high priority to a) the brightest cluster galaxies (BCGs), b) the most probable cluster galaxy candidates, and c) the optical counterparts of X-ray point-like sources. We use the outcome of the observations to study the projected (2D) and the spatial (3D) overdensity of AGN in three superclusters. Results: We obtained redshifts for 455 galaxies in total, 56 of which are counterparts of X-ray point-like sources. We were able to determine the redshift of the merging supercluster XLSSC-e, which consists of six individual clusters at z ~ 0.43, and we confirmed the redshift of supercluster XLSSC-d at z ~ 0.3. More importantly, we discovered a new supercluster, XLSSC-f, that comprises three galaxy clusters also at z ~ 0.3. We find a significant 2D overdensity of X-ray point-like sources only around the supercluster XLSSC-f. This result is also supported by the spatial (3D) analysis of XLSSC-f, where we find four AGN with compatible spectroscopic redshifts and possibly one more with compatible photometric redshift. In addition, we find two AGN (3D analysis) at the redshift of XLSSC-e, but no AGN in XLSSC-d. Comparing these findings with the optical galaxy overdensity we conclude that the total number of AGN in the area of the three superclusters significantly exceeds the field expectations. 
All of the AGN found have luminosities below 7 × 10⁴² erg s⁻¹. Conclusions: The difference in the AGN frequency between the three superclusters cannot be explained by the present study because of small number statistics. Further analysis of a larger number of superclusters within the 50 deg² of the XXL is needed before any conclusions on the effect of the supercluster environment on AGN can be reached. Based on observations obtained with XMM-Newton, an ESA science mission with instruments and contributions directly funded by ESA Member States and NASA. Based on observations obtained with the William Herschel telescope during semester 13B. The Master Catalogue is available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/592/A2

  12. Geographical Analysis of the Distribution and Spread of Human Rabies in China from 2005 to 2011

    PubMed Central

    Yin, Wenwu; Yu, Hongjie; Si, Yali; Li, Jianhui; Zhou, Yuanchun; Zhou, Xiaoyan; Magalhães, Ricardo J. Soares.

    2013-01-01

    Background Rabies is a significant public health problem in China in that it records the second highest case incidence globally. Surveillance data on canine rabies in China is lacking and human rabies notifications can be a useful indicator of areas where animal and human rabies control could be integrated. Previous spatial epidemiological studies lacked adequate spatial resolution to inform targeted rabies control decisions. We aimed to describe the spatiotemporal distribution of human rabies and model its geographical spread to provide an evidence base to inform future integrated rabies control strategies in China. Methods We geo-referenced a total of 17,760 human rabies cases in China from 2005 to 2011. In our spatial analyses we used Gaussian kernel density analysis, average nearest neighbor (ANN) distance, and Spatial Temporal Density-Based Spatial Clustering of Applications with Noise (ST-DBSCAN), and developed a model of rabies spatiotemporal spread. Findings Human rabies cases increased from 2005 to 2007 and decreased during 2008 to 2011, accompanied by a change in the spatial distribution. The ANN distance among human rabies cases increased between 2005 and 2011, and the degree of clustering of human rabies cases decreased during that period. A total of 480 clusters were detected by ST-DBSCAN, 89.4% of which were initiated before 2007. Most clusters were found in the south of China. The number and duration of clusters decreased significantly after 2008. Areas with the highest density of human rabies cases varied spatially each year, and some areas retained a high outbreak density for several years. Though a few places have recovered from human rabies, most affected places are still suffering from the disease. Conclusion Human rabies in mainland China is geographically clustered and its spatial extent changed during 2005 to 2011. The results provide a scientific basis for public health authorities in China to improve human rabies control and prevention programs. PMID:23991098
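
    The average nearest neighbor (ANN) statistic named in this abstract can be illustrated with a minimal sketch. It assumes planar coordinates and a known study-area size, and it uses the common definition in which the observed mean nearest-neighbor distance is divided by the value expected under complete spatial randomness, 0.5 / sqrt(n / A). The function name, the toy points, and the area are hypothetical; the abstract does not state which software or edge correction the authors used.

```python
import math

def average_nearest_neighbor(points, area):
    """Return (observed mean NN distance, expected distance, ratio).

    points -- list of (x, y) tuples in planar units (hypothetical input)
    area   -- size of the study area in squared units (hypothetical input)
    A ratio below 1 suggests clustering, above 1 suggests dispersion.
    """
    n = len(points)
    nearest = []
    for i, (xi, yi) in enumerate(points):
        # Distance from point i to its closest other point (brute force).
        nearest.append(min(
            math.hypot(xi - xj, yi - yj)
            for j, (xj, yj) in enumerate(points) if j != i
        ))
    observed = sum(nearest) / n
    expected = 0.5 / math.sqrt(n / area)  # mean NN distance under randomness
    return observed, expected, observed / expected

# Toy data: the same four-point pattern, tightly packed vs. spread out.
clustered = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1)]
dispersed = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0)]
ratio_clustered = average_nearest_neighbor(clustered, area=4.0)[2]
ratio_dispersed = average_nearest_neighbor(dispersed, area=4.0)[2]
```

    A rising ANN distance over successive years, as reported in the findings, would correspond to the ratio moving upward, that is, toward weaker clustering.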

  13. Evolution of coding and non-coding genes in HOX clusters of a marsupial

    PubMed Central

    2012-01-01

    Background The HOX gene clusters are thought to be highly conserved amongst mammals and other vertebrates, but the long non-coding RNAs have only been studied in detail in human and mouse. The sequencing of the kangaroo genome provides an opportunity to use comparative analyses to compare the HOX clusters of a mammal with a distinct body plan to those of other mammals. Results Here we report a comparative analysis of HOX gene clusters between an Australian marsupial of the kangaroo family and the eutherians. There was a strikingly high level of conservation of HOX gene sequence and structure and non-protein coding genes including the microRNAs miR-196a, miR-196b, miR-10a and miR-10b and the long non-coding RNAs HOTAIR, HOTAIRM1 and HOXA11AS that play critical roles in regulating gene expression and controlling development. By microRNA deep sequencing and comparative genomic analyses, two conserved microRNAs (miR-10a and miR-10b) were identified and one new candidate microRNA with typical hairpin precursor structure that is expressed in both fibroblasts and testes was found. The prediction of microRNA target analysis showed that several known microRNA targets, such as miR-10, miR-414 and miR-464, were found in the tammar HOX clusters. In addition, several novel and putative miRNAs were identified that originated from elsewhere in the tammar genome and that target the tammar HOXB and HOXD clusters. Conclusions This study confirms that the emergence of known long non-coding RNAs in the HOX clusters clearly predates the marsupial-eutherian divergence 160 Ma ago. It also identified a new potentially functional microRNA as well as conserved miRNAs. These non-coding RNAs may participate in the regulation of HOX genes to influence the body plan of this marsupial. PMID:22708672

  14. MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data

    PubMed Central

    Hartler, Jürgen; Thallinger, Gerhard G; Stocker, Gernot; Sturn, Alexander; Burkard, Thomas R; Körner, Erik; Rader, Robert; Schmidt, Andreas; Mechtler, Karl; Trajanoski, Zlatko

    2007-01-01

    Background The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches. Results We have developed the MAss SPECTRometry Analysis System (MASPECTRAS), a platform for management and analysis of proteomics LC-MS/MS data. MASPECTRAS is based on the Proteome Experimental Data Repository (PEDRo) relational database schema and follows the guidelines of the Proteomics Standards Initiative (PSI). Analysis modules include: 1) import and parsing of the results from the search engines SEQUEST, Mascot, Spectrum Mill, X! Tandem, and OMSSA; 2) peptide validation; 3) clustering of proteins based on Markov Clustering and multiple alignments; and 4) quantification using the Automated Statistical Analysis of Protein Abundance Ratios algorithm (ASAPRatio). The system provides customizable data retrieval and visualization tools, as well as export to PRoteomics IDEntifications public repository (PRIDE). MASPECTRAS is freely available at Conclusion Given the unique features and the flexibility due to the use of standard software technology, our platform represents a significant advance and could be of great interest to the proteomics community. PMID:17567892

  15. Use of Spatial Epidemiology and Hot Spot Analysis to Target Women Eligible for Prenatal Women, Infants, and Children Services

    PubMed Central

    Krawczyk, Christopher; Gradziel, Pat; Geraghty, Estella M.

    2014-01-01

    Objectives. We used a geographic information system and cluster analyses to determine locations in need of enhanced Special Supplemental Nutrition Program for Women, Infants, and Children (WIC) Program services. Methods. We linked documented births in the 2010 California Birth Statistical Master File with the 2010 data from the WIC Integrated Statewide Information System. Analyses focused on the density of pregnant women who were eligible for but not receiving WIC services in California’s 7049 census tracts. We used incremental spatial autocorrelation and hot spot analyses to identify clusters of WIC-eligible nonparticipants. Results. We detected clusters of census tracts with higher-than-expected densities, compared with the state mean density of WIC-eligible nonparticipants, in 21 of 58 (36.2%) California counties (P < .05). In subsequent county-level analyses, we located neighborhood-level clusters of higher-than-expected densities of eligible nonparticipants in Sacramento, San Francisco, Fresno, and Los Angeles Counties (P < .05). Conclusions. Hot spot analyses provided a rigorous and objective approach to determine the locations of statistically significant clusters of WIC-eligible nonparticipants. Results helped inform WIC program and funding decisions, including the opening of new WIC centers, and offered a novel approach for targeting public health services. PMID:24354821
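
    To make the idea of a hot spot analysis more concrete, the sketch below computes the Getis-Ord Gi* statistic, a widely used hot spot measure. It is only an illustration under stated assumptions: the abstract does not say which statistic or software was used, the weights here are binary with each location counted as its own neighbor, and the function name, the five toy "tracts", and their adjacency list are all hypothetical.

```python
import math

def getis_ord_gi_star(values, neighbors):
    """Return a Gi* z-score for each location.

    values    -- one numeric value per location (e.g. a density per tract)
    neighbors -- neighbors[i] lists the indices adjacent to i (hypothetical)
    Binary weights are assumed, and each location is its own neighbor.
    """
    n = len(values)
    mean = sum(values) / n
    s = math.sqrt(sum(v * v for v in values) / n - mean ** 2)
    scores = []
    for i in range(n):
        members = set(neighbors[i]) | {i}
        w_sum = len(members)     # sum of binary weights
        w_sq_sum = len(members)  # sum of squared binary weights (1**2 == 1)
        local_sum = sum(values[j] for j in members)
        numerator = local_sum - mean * w_sum
        denominator = s * math.sqrt((n * w_sq_sum - w_sum ** 2) / (n - 1))
        scores.append(numerator / denominator)
    return scores

# Toy example: five tracts in a row, with high densities at one end.
densities = [10, 9, 1, 1, 1]
adjacency = [[1], [0, 2], [1, 3], [2, 4], [3]]
z_scores = getis_ord_gi_star(densities, adjacency)
```

    Large positive z-scores flag locations surrounded by high values (candidate hot spots), and negative scores flag low-value surroundings; significance thresholds such as P < .05 are then applied to these scores.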

  16. Symptoms and Association with Health Outcomes in Relapsing-Remitting Multiple Sclerosis: Results of a US Patient Survey

    PubMed Central

    Williams, Angela E.; Vietri, Jeffrey T.

    2014-01-01

    Background. A variety of symptoms have been reported, but the prevalence of specific symptoms in relapsing-remitting multiple sclerosis (RRMS), how they are related to one another, and their impact on patient reported outcomes is not well understood. Objective. To describe how symptoms of RRMS cooccur and their impact on patient-reported outcomes. Methods. Individuals who reported a physician diagnosis of RRMS in a large general health survey in the United States indicated the symptoms they experience because of RRMS and completed validated scales, including the work productivity and activity impairment questionnaire and either the SF-12v2 or SF-36v2. Symptom clusters were identified through hierarchical cluster analysis, and the relationship between clusters and outcomes was assessed through regression. Results. Fatigue, difficulty walking, and numbness were the most commonly reported symptoms. Seven symptom clusters were identified, and several were significantly related to patient reported outcomes. Pain, muscle spasms, and stiffness formed a cluster strongly related to physical quality of life; depression was strongly related to mental quality of life and cognitive difficulty was associated with work impairment. Conclusions. Symptoms in RRMS show a strong relationship with quality of life and should be taken into consideration in treatment decisions and evaluation of treatment success. PMID:25328704

  17. Membership and Coronal Activity in the NGC 2232 and Cr 140 Open Clusters

    NASA Technical Reports Server (NTRS)

    Patten, Brian M.; Oliversen, Ronald J. (Technical Monitor)

    2001-01-01

    This is the second annual performance report for our grant "Membership and Coronal Activity in the NGC 2232 and Cr 140 Open Clusters." We propose to identify X-ray sources and extract net source counts in 8 archival ROSAT HRI images in the regions of the NGC 2232 and Cr 140 open clusters. These X-ray data will be combined with ground-based photometry and spectroscopy in order to identify G, K, and early-M type cluster members. At present, no members later than approximately F5 are known for either cluster. With ages of approximately 25 Myr and at a distance of just 320 - 360 pc, the combined late-type membership of the NGC 2232 and Cr 140 clusters will yield an almost unique sample of solar-type stars in the post-T Tauri/pre-main sequence phase of evolution. These stars will be used to assess the level and dispersion in coronal activity levels, as part of a probe of the importance of magnetic braking and the level of magnetic dynamo activity, for solar-type stars just before they reach the ZAMS. Over the past year we have successfully acquired all of the ground-based data necessary to support the analysis of the archival ROSAT X-ray data in the regions around both of these clusters. By the end of 2001 we expect to have completed the reduction and analysis of the ground-based photometry and spectroscopy and will begin the integration of these data with the ROSAT X-ray data. A certain amount of pressure to complete the work on NGC 2232 is coming from the SIRTF project, as this cluster may be a key component to a circumstellar disk evolution GTO program. We are only too happy to try to help and have worked to speed the analysis as much as possible. The primary activity to be undertaken in the next few months is the integration of the ground-based photometry and spectroscopy with the archival ROSAT X-ray data and then writing the paper summarizing our results. 
The most time consuming portion of this next phase is, of course, seeing the paper through publication in a peer-reviewed journal. Therefore, we have requested a no-cost extension to the grant to allow us to bring this project to a conclusion.

  18. Risk Profiles for Injurious Falls in People Over 60: A Population-Based Cohort Study

    PubMed Central

    Ek, Stina; Rizzuto, Debora; Fratiglioni, Laura; Johnell, Kristina; Xu, Weili

    2018-01-01

    Abstract Background Although falls in older adults are related to multiple risk factors, these factors have commonly been studied individually. We aimed to identify risk profiles for injurious falls in older adults by detecting clusters of established risk factors and quantifying their impact on fall risk. Methods Participants were 2,566 people, aged 60 years and older, from the population-based Swedish National Study on Aging and Care in Kungsholmen. An injurious fall was defined as hospitalization for, or receipt of outpatient care because of, a fall. Cluster analysis was used to identify aggregation of possible risk factors including chronic diseases, fall-risk increasing drugs (FRIDs), physical and cognitive impairments, and lifestyle-related factors. Associations between the clusters and injurious falls over 3, 5, and 10 years were estimated using flexible parametric survival models. Results Five clusters were identified including: a “healthy”, a “well-functioning with multimorbidity”, a “well-functioning, with multimorbidity and high FRID consumption”, a “physically and cognitively impaired”, and a “disabled” cluster. The risk of injurious falls for all groups was significantly higher than for the first cluster of healthy individuals in the reference category. Hazard ratios (95% confidence intervals) ranged from 1.71 (1.02–2.66) for the second cluster to 12.67 (7.38–21.75) for the last cluster over 3 years of follow-up. The highest risk was observed in the last two clusters with high burden of physical and cognitive impairments. Conclusion Risk factors for injurious falls tend to aggregate, representing different levels of risk for falls. Our findings can be useful to tailor and prioritize clinical and public health interventions. PMID:28605455

  19. Tiotropium might improve survival in subjects with COPD at high risk of mortality

    PubMed Central

    2014-01-01

    Background Inhaled therapies reduce risk of chronic obstructive pulmonary disease (COPD) exacerbations, but their effect on mortality is less well established. We hypothesized that heterogeneity in baseline mortality risk influenced the results of drug trials assessing mortality in COPD. Methods The 5706 patients with COPD from the Understanding Potential Long-term Impacts on Function with Tiotropium (UPLIFT®) study that had complete clinical information for variables associated with mortality (age, forced expiratory volume in 1 s, St George’s Respiratory Questionnaire, pack-years and body mass index) were classified by cluster analysis. Baseline risk of mortality between clusters, and impact of tiotropium were evaluated during the 4-yr follow up. Results Four clusters were identified, including low-risk (low mortality rate) patients (n = 2339; 41%; cluster 2), and high-risk patients (n = 1022; 18%; cluster 3), who had a 2.6- and a six-fold increase in all-cause and respiratory mortality compared with cluster 2, respectively. Tiotropium reduced exacerbations in all clusters, and reduced hospitalizations in high-risk patients (p < 0.05). The beneficial effect of tiotropium on all-cause mortality in the overall population (hazard ratio, 0.87; 95% confidence interval, 0.75–1.00, p = 0.054) was explained by a 21% reduction in cluster 3 (p = 0.07), with no effect in other clusters. Conclusions Large variations in baseline risks of mortality existed among patients in the UPLIFT® study. Inclusion of numerous low-risk patients may have reduced the ability to show beneficial effect on mortality. Future clinical trials should consider selective inclusion of high-risk patients. PMID:24913266

  20. Using Cluster Analysis to Examine Husband-Wife Decision Making

    ERIC Educational Resources Information Center

    Bonds-Raacke, Jennifer M.

    2006-01-01

    Cluster analysis has a rich history in many disciplines and although cluster analysis has been used in clinical psychology to identify types of disorders, its use in other areas of psychology has been less popular. The purpose of the current experiments was to use cluster analysis to investigate husband-wife decision making. Cluster analysis was…

  1. TU-CD-BRB-12: Radiogenomics of MRI-Guided Prostate Cancer Biopsy Habitats

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Stoyanova, R; Lynne, C; Abraham, S

    2015-06-15

    Purpose: Diagnostic prostate biopsies are subject to sampling bias. We hypothesize that quantitative imaging with multiparametric (MP)-MRI can more accurately direct targeted biopsies to index lesions associated with highest risk clinical and genomic features. Methods: Regionally distinct prostate habitats were delineated on MP-MRI (T2-weighted, perfusion and diffusion imaging). Directed biopsies were performed on 17 habitats from 6 patients using MRI-ultrasound fusion. Biopsy location was characterized with 52 radiographic features. Transcriptome-wide analysis of 1.4 million RNA probes was performed on RNA from each habitat. Genomics features with insignificant expression values (<0.25) and interquartile range <0.5 were filtered, leaving a total of 212 genes. Correlation between imaging features, genes and a 22-feature genomic classifier (GC), developed as a prognostic assay for metastasis after radical prostatectomy, was investigated. Results: High quality genomic data was derived from 17 (100%) biopsies. Using the 212 ‘unbiased’ genes, the samples clustered by patient origin in unsupervised analysis. When only prostate cancer related genomic features were used, hierarchical clustering revealed samples clustered by needle-biopsy Gleason score (GS). Similarly, principal component analysis of the imaging features found the primary source of variance segregated the samples into high (≥7) and low (6) GS. Pearson’s correlation analysis of genes with significant expression showed two main patterns of gene expression clustering prostate peripheral and transitional zone MRI features. Two-way hierarchical clustering of GC with radiomics features resulted in the expected groupings of high and low expressed genes in this metastasis signature. Conclusions: MP-MRI-targeted diagnostic biopsies can potentially improve risk stratification by directing pathological and genomic analysis to clinically significant index lesions. 
As determinant lesions are more reliably identified, targeting with radiotherapy should improve outcome. This is the first demonstration of a link between quantitative imaging features (radiomics) with genomic features in MRI-directed prostate biopsies. The research was supported by NIH- NCI R01 CA 189295 and R01 CA 189295; E Davicioni is partial owner of GenomeDx Biosciences, Inc. M Takhar, N Erho, L Lam, C Buerki and E Davicioni are current employees at GenomeDx Biosciences, Inc.

  2. "K"-Means May Perform as well as Mixture Model Clustering but May Also Be Much Worse: Comment on Steinley and Brusco (2011)

    ERIC Educational Resources Information Center

    Vermunt, Jeroen K.

    2011-01-01

    Steinley and Brusco (2011) presented the results of a huge simulation study aimed at evaluating cluster recovery of mixture model clustering (MMC) both for the situation where the number of clusters is known and is unknown. They derived rather strong conclusions on the basis of this study, especially with regard to the good performance of…

  3. Individual participant data meta-analyses should not ignore clustering

    PubMed Central

    Abo-Zaid, Ghada; Guo, Boliang; Deeks, Jonathan J.; Debray, Thomas P.A.; Steyerberg, Ewout W.; Moons, Karel G.M.; Riley, Richard David

    2013-01-01

    Objectives Individual participant data (IPD) meta-analyses often analyze their IPD as if coming from a single study. We compare this approach with analyses that rather account for clustering of patients within studies. Study Design and Setting Comparison of effect estimates from logistic regression models in real and simulated examples. Results The estimated prognostic effect of age in patients with traumatic brain injury is similar, regardless of whether clustering is accounted for. However, a family history of thrombophilia is found to be a diagnostic marker of deep vein thrombosis [odds ratio, 1.30; 95% confidence interval (CI): 1.00, 1.70; P = 0.05] when clustering is accounted for but not when it is ignored (odds ratio, 1.06; 95% CI: 0.83, 1.37; P = 0.64). Similarly, the treatment effect of nicotine gum on smoking cessation is severely attenuated when clustering is ignored (odds ratio, 1.40; 95% CI: 1.02, 1.92) rather than accounted for (odds ratio, 1.80; 95% CI: 1.29, 2.52). Simulations show models accounting for clustering perform consistently well, but downwardly biased effect estimates and low coverage can occur when ignoring clustering. Conclusion Researchers must routinely account for clustering in IPD meta-analyses; otherwise, misleading effect estimates and conclusions may arise. PMID:23651765
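
    The odds ratios, confidence intervals, and P values quoted in this abstract can be cross-checked with a short calculation. The sketch below assumes the usual normal approximation on the log-odds scale: the standard error is the width of the 95% CI on the log scale divided by 2 × 1.96, and the two-sided P value follows from z = ln(OR) / SE. The function name is hypothetical, and the inputs are the rounded figures from the abstract, so small discrepancies from the published P values are expected.

```python
import math

def p_value_from_ci(odds_ratio, lower, upper, z_crit=1.96):
    """Recover SE, z, and two-sided P from an odds ratio and its 95% CI.

    Assumes a symmetric Wald interval on the log-odds scale.
    """
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return se, z, p

# Family history of thrombophilia as a marker of deep vein thrombosis:
# clustering accounted for vs. clustering ignored (values from the abstract).
_, _, p_clustered = p_value_from_ci(1.30, 1.00, 1.70)
_, _, p_ignored = p_value_from_ci(1.06, 0.83, 1.37)
```

    With these rounded inputs the recovered P values come out near 0.05 and near 0.65 respectively, close to the reported P = 0.05 and P = 0.64.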

  4. Clinical interpretation of the Spinal Cord Injury Functional Index (SCI-FI)

    PubMed Central

    Fyffe, Denise; Kalpakjian, Claire Z.; Slavin, Mary; Kisala, Pamela; Ni, Pengsheng; Kirshblum, Steven C.; Tulsky, David S.; Jette, Alan M.

    2016-01-01

    Objective: To provide validation of functional ability levels for the Spinal Cord Injury – Functional Index (SCI-FI). Design: Cross-sectional. Setting: Inpatient rehabilitation hospital and community settings. Participants: A sample of 855 individuals with traumatic spinal cord injury enrolled in 6 rehabilitation centers participating in the National Spinal Cord Injury Model Systems Network. Interventions: Not Applicable. Main Outcome Measures: Spinal Cord Injury-Functional Index (SCI-FI). Results: Cluster analyses identified three distinct groups that represent low, mid-range and high SCI-FI functional ability levels. Comparison of clusters on personal and other injury characteristics suggested some significant differences between groups. Conclusions: These results strongly support the use of SCI-FI functional ability levels to document the perceived functional abilities of persons with SCI. Results of the cluster analysis suggest that the SCI-FI functional ability levels capture function by injury characteristics. Clinical implications regarding tracking functional activity trajectories during follow-up visits are discussed. PMID:26781769

  5. Evolution of massive stars in very young clusters and associations

    NASA Technical Reports Server (NTRS)

    Stothers, R. B.

    1985-01-01

    Statistics concerning the stellar content of young galactic clusters and associations which show well defined main sequence turnups have been analyzed in order to derive information about stellar evolution in high-mass galaxies. The analytical approach is semiempirical and uses natural spectroscopic groups of stars on the H-R diagram together with the stars' apparent magnitudes. The new approach does not depend on absolute luminosities and requires only the most basic elements of stellar evolution theory. The following conclusions are offered on the basis of the statistical analysis: (1) O-type main-sequence stars evolve to a spectral type of B1 during core hydrogen burning; (2) most O-type blue stragglers are newly formed massive stars burning core hydrogen; (3) supergiants lying redward of the main-sequence turnup are burning core helium; and (4) most Wolf-Rayet stars are burning core helium and originally had masses greater than 30-40 solar masses. The statistics of the natural spectroscopic stars in young galactic clusters and associations are given in a table.

  6. Clusters of Behaviors and Beliefs Predicting Adolescent Depression: Implications for Prevention

    PubMed Central

    Paunesku, David; Ellis, Justin; Fogel, Joshua; Kuwabara, Sachiko A; Gollan, Jackie; Gladstone, Tracy; Reinecke, Mark; Van Voorhees, Benjamin W.

    2009-01-01

    OBJECTIVE Risk factors for various disorders are known to cluster. However, the factor structure for behaviors and beliefs predicting depressive disorder in adolescents is not known. Knowledge of this structure can facilitate prevention planning. METHODS We used the National Longitudinal Study of Adolescent Health (AddHealth) data set to conduct an exploratory factor analysis to identify clusters of behaviors/experiences predicting the onset of major depressive disorder (MDD) at 1-year follow-up (N=4,791). RESULTS Four factors were identified: family/interpersonal relations, self-emancipation, avoidant problem solving/low self-worth, and religious activity. Strong family/interpersonal relations were the most significantly protective against depression at one year follow-up. Avoidant problem solving/low self-worth was not predictive of MDD on its own, but significantly amplified the risks associated with delinquency. CONCLUSION Depression prevention interventions should consider giving family relationships a more central role in their efforts. Programs teaching problem solving skills may be most appropriate for reducing MDD risk in delinquent youth. PMID:20502621

  7. Sampling methods for stellar masses and the mmax-Mecl relation in the starburst dwarf galaxy NGC 4214

    NASA Astrophysics Data System (ADS)

    Weidner, Carsten; Kroupa, Pavel; Pflamm-Altenburg, Jan

    2014-07-01

    It has been claimed in the recent literature that a non-trivial relation between the mass of the most-massive star, mmax, in a star cluster and its embedded star cluster mass (the mmax - Mecl relation) is falsified by observations of the most-massive stars and the Hα luminosity of young star clusters in the starburst dwarf galaxy NGC 4214. Here, it is shown by comparing the NGC 4214 results with observations from the Milky Way that NGC 4214 agrees very well with the predictions of the mmax - Mecl relation and with the integrated galactic stellar initial mass function theory. The difference in conclusions is based on a high degree of degeneracy between expectations from random sampling and those from the mmax - Mecl relation, but are also due to interpreting mmax as a truncation mass in a randomly sampled initial mass function. Additional analysis of galaxies with lower SFRs than those currently presented in the literature will be required to break this degeneracy.

  8. Spatio-Temporal Trends and Risk Factors for Shigella from 2001 to 2011 in Jiangsu Province, People's Republic of China

    PubMed Central

    Bao, Changjun; Hu, Jianli; Liu, Wendong; Liang, Qi; Wu, Ying; Norris, Jessie; Peng, Zhihang; Yu, Rongbin; Shen, Hongbing; Chen, Feng

    2014-01-01

    Objective This study aimed to describe the spatial and temporal trends of Shigella incidence rates in Jiangsu Province, People's Republic of China. It also intended to explore complex risk modes facilitating Shigella transmission. Methods County-level incidence rates were obtained for analysis using geographic information system (GIS) tools. Trend surface and incidence maps were established to describe geographic distributions. Spatio-temporal cluster analysis and autocorrelation analysis were used for detecting clusters. Based on the number of monthly Shigella cases, an autoregressive integrated moving average (ARIMA) model successfully established a time series model. A spatial correlation analysis and a case-control study were conducted to identify risk factors contributing to Shigella transmissions. Results The far southwestern and northwestern areas of Jiangsu were the most infected. A cluster was detected in southwestern Jiangsu (LLR = 11674.74, P<0.001). The time series model was established as ARIMA (1, 12, 0), which predicted well for cases from August to December, 2011. Highways and water sources potentially caused spatial variation in Shigella development in Jiangsu. The case-control study confirmed not washing hands before dinner (OR = 3.64) and not having access to a safe water source (OR = 2.04) as the main causes of Shigella in Jiangsu Province. Conclusion Improvement of sanitation and hygiene should be strengthened in economically developed counties, while access to a safe water supply in impoverished areas should be increased at the same time. PMID:24416167

  9. Hierarchical and Complex System Entropy Clustering Analysis Based Validation for Traditional Chinese Medicine Syndrome Patterns of Chronic Atrophic Gastritis.

    PubMed

    Zhang, Yin; Liu, Yue; Li, Yannan; Zhao, Xia; Zhuo, Lin; Zhou, Ajian; Zhang, Li; Su, Zeqi; Chen, Cen; Du, Shiyu; Liu, Daming; Ding, Xia

    2018-03-22

    Chronic atrophic gastritis (CAG) is the precancerous stage of gastric carcinoma. Traditional Chinese Medicine (TCM) has been widely used in treating CAG. This study aimed to reveal core pathogenesis of CAG by validating the TCM syndrome patterns and provide evidence for optimization of treatment strategies. This is a cross-sectional study conducted in 4 hospitals in China. Hierarchical clustering analysis (HCA) and complex system entropy clustering analysis (CSECA) were performed, respectively, to achieve syndrome pattern validation. Based on HCA, 15 common factors were assigned to 6 syndrome patterns: liver depression and spleen deficiency and blood stasis in the stomach collateral, internal harassment of phlegm-heat and blood stasis in the stomach collateral, phlegm-turbidity internal obstruction, spleen yang deficiency, internal harassment of phlegm-heat and spleen deficiency, and spleen qi deficiency. By CSECA, 22 common factors were assigned to 7 syndrome patterns: qi deficiency, qi stagnation, blood stasis, phlegm turbidity, heat, yang deficiency, and yin deficiency. Combination of qi deficiency, qi stagnation, blood stasis, phlegm turbidity, heat, yang deficiency, and yin deficiency may play a crucial role in CAG pathogenesis. In accord with this, treatment strategies by TCM herbal prescriptions should be targeted to regulating qi, activating blood, resolving turbidity, clearing heat, removing toxin, nourishing yin, and warming yang. Further explorations are needed to verify and expand the current conclusions.

  10. Covert checks by standardised patients of general practitioners' delivery of new periodic health examinations: clustered cross-sectional study from a consumer organisation

    PubMed Central

    Thaler, Kylie; Harris, Mark F

    2012-01-01

    Objective To assess if data collected by a consumer organisation are valid for a health service research study on physicians' performance in preventive care. To report first results of the analysis of physicians' performance, such as consultation time and guideline adherence in history taking. Design Secondary data analysis of a clustered cross-sectional direct observation survey. Setting General practitioners (GPs) in Vienna, Austria, visited unannounced by mystery shoppers (incognito standardised patients (ISPs)). Participants 21 randomly selected GPs were visited by two different ISPs each. 40 observation protocols were realised. Main outcome measures Robustness of sampling and data collection by the consumer organisation. GPs' consultation and waiting times, guideline adherence in history taking. Results The double stratified random sampling method was robust and representative of the mix of private and contracted GPs in Vienna. The clinical scenarios presented by the ISPs were valid and believable, and no GP realised the ISPs were not genuine patients. The average consultation time was 46 min (95% CI 37 to 54 min). Waiting times differed more than consultation times between private and contracted GPs. No differences between private and contracted GPs in terms of adherence to the evidence-based guidelines regarding history taking, including questions regarding alcohol use, were found. According to the analysis, 20% of the GPs took a perfect history (95% CI 9% to 39%). Conclusions The analysis of secondary data collected by a consumer organisation was a valid method for drawing conclusions about GPs' preventive practice. Initial results, like consultation times longer than anticipated, and the moderate quality of history taking encourage continuing the analysis on available clinical data. PMID:22872721

  11. Investigating the usefulness of a cluster-based trend analysis to detect visual field progression in patients with open-angle glaucoma.

    PubMed

    Aoki, Shuichiro; Murata, Hiroshi; Fujino, Yuri; Matsuura, Masato; Miki, Atsuya; Tanito, Masaki; Mizoue, Shiro; Mori, Kazuhiko; Suzuki, Katsuyoshi; Yamashita, Takehiro; Kashiwagi, Kenji; Hirasawa, Kazunori; Shoji, Nobuyuki; Asaoka, Ryo

    2017-12-01

    To investigate the usefulness of the Octopus (Haag-Streit) EyeSuite's cluster trend analysis in glaucoma. Ten visual fields (VFs) with the Humphrey Field Analyzer (Carl Zeiss Meditec), spanning 7.7 years on average were obtained from 728 eyes of 475 primary open angle glaucoma patients. Mean total deviation (mTD) trend analysis and EyeSuite's cluster trend analysis were performed on various series of VFs (from 1st to 10th: VF1-10 to 6th to 10th: VF6-10). The results of the cluster-based trend analysis, based on different lengths of VF series, were compared against mTD trend analysis. Cluster-based trend analysis and mTD trend analysis results were significantly associated in all clusters and with all lengths of VF series. Between 21.2% and 45.9% (depending on VF series length and location) of clusters were deemed to progress when the mTD trend analysis suggested no progression. On the other hand, 4.8% of eyes were observed to progress using the mTD trend analysis when cluster trend analysis suggested no progression in any two (or more) clusters. Whole field trend analysis can miss local VF progression. Cluster trend analysis appears as robust as mTD trend analysis and useful to assess both sectorial and whole field progression. Cluster-based trend analyses, in particular the definition of two or more progressing clusters, may help clinicians to detect glaucomatous progression in a timelier manner than using a whole field trend analysis, without significantly compromising specificity.

  12. Spatial Analysis of China Province-level Perinatal Mortality

    PubMed Central

    XIANG, Kun; SONG, Deyong

    2016-01-01

    Background: Using spatial analysis tools to determine the spatial patterns of China province-level perinatal mortality and using spatial econometric model to examine the impacts of health care resources and different socio-economic factors on perinatal mortality. Methods: The Global Moran’s I index is used to examine whether the spatial autocorrelation exists in selected regions and Moran’s I scatter plot to examine the spatial clustering among regions. Spatial econometric models are used to investigate the spatial relationships between perinatal mortality and contributing factors. Results: The overall Moran’s I index indicates that perinatal mortality displays positive spatial autocorrelation. Moran’s I scatter plot analysis implies that there is a significant clustering of mortality in both high-rate regions and low-rate regions. The spatial econometric models analyses confirm the existence of a direct link between perinatal mortality and health care resources, socio-economic factors. Conclusions: Since a positive spatial autocorrelation has been detected in China province-level perinatal mortality, the upgrading of regional economic development and medical service level will affect the mortality not only in region itself but also its adjacent regions. PMID:27398334
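
    The Global Moran's I statistic named in this abstract can be illustrated with a minimal sketch. The region values and the binary adjacency matrix below are hypothetical toy data, not figures from the study, and the function name is an assumption chosen for illustration. A value above zero suggests that similar rates sit next to each other; a value below zero suggests that neighbours tend to differ.

```python
def morans_i(values, weights):
    """Global Moran's I for `values` given a square spatial weight matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    # Sum of all spatial weights (the normalising constant W).
    w_sum = sum(sum(row) for row in weights)
    # Weighted cross-products of deviations between every pair of regions.
    num = sum(weights[i][j] * dev[i] * dev[j]
              for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / w_sum) * (num / den)


# Hypothetical toy data: four regions in a row, each adjacent to its neighbours.
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
clustered_rates = [10.0, 9.0, 2.0, 1.0]     # high values next to high values
alternating_rates = [10.0, 1.0, 10.0, 1.0]  # high values next to low values

i_clustered = morans_i(clustered_rates, adjacency)
i_alternating = morans_i(alternating_rates, adjacency)
print(i_clustered, i_alternating)
```

    In practice the weight matrix is often row-standardised and significance is assessed by permutation; both steps are left out of this sketch.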

  13. Insight on AV-45 binding in white and grey matter from histogram analysis: a study on early Alzheimer's disease patients and healthy subjects

    PubMed Central

    Nemmi, Federico; Saint-Aubert, Laure; Adel, Djilali; Salabert, Anne-Sophie; Pariente, Jérémie; Barbeau, Emmanuel; Payoux, Pierre; Péran, Patrice

    2014-01-01

    Purpose AV-45 amyloid biomarker is known to show uptake in white matter in patients with Alzheimer’s disease (AD) but also in healthy population. This binding, thought to be of a non-specific lipophilic nature, has not yet been investigated. The aim of this study was to determine the differential pattern of AV-45 binding in healthy and pathological populations in white matter. Methods We recruited 24 patients presenting with AD at early stage and 17 matched, healthy subjects. We used an optimized PET-MRI registration method and an approach based on intensity histogram using several indexes. We compared the results of the intensity histogram analyses with a more canonical approach based on target-to-cerebellum Standard Uptake Value (SUVr) in white and grey matters using MANOVA and discriminant analyses. A cluster analysis on white and grey matter histograms was also performed. Results White matter histogram analysis revealed significant differences between AD and healthy subjects, which were not revealed by SUVr analysis. However, white matter histograms were not decisive to discriminate groups, and indexes based on grey matter only showed better discriminative power than SUVr. The cluster analysis divided our sample into two clusters, showing different uptakes in grey but also in white matter. Conclusion These results demonstrate that AV-45 binding in white matter conveys subtle information not detectable using the SUVr approach. Although it is not better than standard SUVr to discriminate AD patients from healthy subjects, this information could reveal white matter modifications. PMID:24573658

  14. TMEM88, CCL14 and CLEC3B as prognostic biomarkers for prognosis and palindromia of human hepatocellular carcinoma.

    PubMed

    Zhang, Xin; Wan, Jin-Xiang; Ke, Zun-Ping; Wang, Feng; Chai, Hai-Xia; Liu, Jia-Qiang

    2017-07-01

    Hepatocellular carcinoma is one of the most mortal and prevalent cancers with increasing incidence worldwide. Elucidating genetic driver genes for prognosis and palindromia of hepatocellular carcinoma helps managing clinical decisions for patients. In this study, the high-throughput RNA sequencing data on platform IlluminaHiSeq of hepatocellular carcinoma were downloaded from The Cancer Genome Atlas with 330 primary hepatocellular carcinoma patient samples. Stable key genes with differential expressions were identified with which Kaplan-Meier survival analysis was performed using Cox proportional hazards test in R language. Driver genes influencing the prognosis of this disease were determined using clustering analysis. Functional analysis of driver genes was performed by literature search and Gene Set Enrichment Analysis. Finally, the selected driver genes were verified using external dataset GSE40873. A total of 5781 stable key genes were identified, including 156 genes definitely related to prognoses of hepatocellular carcinoma. Based on the significant key genes, samples were grouped into five clusters which were further integrated into high- and low-risk classes based on clinical features. TMEM88, CCL14, and CLEC3B were selected as driver genes which clustered high-/low-risk patients successfully (generally, p = 0.0005124445). Finally, survival analysis of the high-/low-risk samples from external database illustrated significant difference with p value 0.0198. In conclusion, TMEM88, CCL14, and CLEC3B genes were stable and available in predicting the survival and palindromia time of hepatocellular carcinoma. These genes could function as potential prognostic genes contributing to improve patients' outcomes and survival.

  15. Low Divergence of Clonorchis sinensis in China Based on Multilocus Analysis

    PubMed Central

    Sun, Jiufeng; Huang, Yan; Huang, Huaiqiu; Liang, Pei; Wang, Xiaoyun; Mao, Qiang; Men, Jingtao; Chen, Wenjun; Deng, Chuanhuan; Zhou, Chenhui; Lv, Xiaoli; Zhou, Juanjuan; Zhang, Fan; Li, Ran; Tian, Yanli; Lei, Huali; Liang, Chi; Hu, Xuchu; Xu, Jin; Li, Xuerong; Yu, Xinbing

    2013-01-01

    Clonorchis sinensis, an ancient parasite that infects a number of piscivorous mammals, attracts significant public health interest due to zoonotic exposure risks in Asia. The available studies are insufficient to reflect the prevalence, geographic distribution, and intraspecific genetic diversity of C. sinensis in endemic areas. Here, a multilocus analysis based on eight genes (ITS1, act, tub, ef-1a, cox1, cox3, nad4 and nad5 [4.986 kb]) was employed to explore the intra-species genetic construction of C. sinensis in China. Two hundred and fifty-six C. sinensis isolates were obtained from environmental reservoirs from 17 provinces of China. A total of 254 recognized Multilocus Types (MSTs) showed high diversity among these isolates using multilocus analysis. The comparison analysis of nuclear and mitochondrial phylogeny supports separate clusters in a nuclear dendrogram. Genetic differentiation analysis of three clusters (A, B, and C) showed low divergence within populations. Most isolates from clusters B and C are geographically limited to central China, while cluster A is extraordinarily genetically diverse. Further genetic analyses between different geographic distributions, water bodies and hosts support the low population divergence. The latter haplotype analyses were consistent with the phylogenetic and genetic differentiation results. A recombination network based on concatenated sequences showed a concentrated linkage recombination population in cox1, cox3, nad4 and nad5, with spatial structuring in ITS1. Coupled with the history record and archaeological evidence of C. sinensis infection in mummified desiccated feces, these data point to an ancient origin of C. sinensis in China. In conclusion, we present a likely phylogenetic structure of the C. sinensis population in mainland China, highlighting its possible tendency for biogeographic expansion. Meanwhile, ITS1 was found to be an effective marker for tracking C. sinensis infection worldwide. Thus, the present study improves our understanding of the global epidemiology and evolution of C. sinensis. PMID:23825605

  16. SU-F-R-33: Can CT and CBCT Be Used Simultaneously for Radiomics Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luo, R; Wang, J; Zhong, H

    2016-06-15

    Purpose: To investigate whether CBCT and CT can be used in radiomics analysis simultaneously. To establish a batch correction method for radiomics in two similar image modalities. Methods: Four sites including rectum, bladder, femoral head and lung were considered as region of interest (ROI) in this study. For each site, 10 treatment planning CT images were collected. In addition, 10 CBCT images from the same site of the same patient were acquired at the first radiotherapy fraction. 253 radiomics features, which were selected by our test-retest study at rectum cancer CT (ICC>0.8), were calculated for both CBCT and CT images in MATLAB. Simple scaling (z-score) and nonlinear correction methods were applied to the CBCT radiomics features. The Pearson Correlation Coefficient was calculated to analyze the correlation between radiomics features of CT and CBCT images before and after correction. Cluster analysis of mixed data (for each site, 5 CT and 5 CBCT data are randomly selected) was implemented to validate the feasibility to merge radiomics data from CBCT and CT. The consistency of clustering result and site grouping was verified by a chi-square test for different datasets respectively. Results: For simple scaling, 234 of the 253 features have correlation coefficient ρ>0.8, among which 154 features have ρ>0.9. For radiomics data after nonlinear correction, 240 of the 253 features have ρ>0.8, among which 220 features have ρ>0.9. Cluster analysis of mixed data shows that data of four sites was almost precisely separated for simple scaling (p=1.29 × 10⁻⁷, χ² test) and nonlinear correction (p=5.98 × 10⁻⁷, χ² test), which is similar to the cluster result of CT data (p=4.52 × 10⁻⁸, χ² test). Conclusion: Radiomics data from CBCT can be merged with those from CT by simple scaling or nonlinear correction for radiomics analysis.
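
    The two generic building blocks mentioned in this abstract, z-score scaling and the Pearson correlation coefficient, can be sketched as follows. The feature values are hypothetical (the CBCT series is simply a rescaled and shifted copy of the CT series), the helper names are assumptions, and the abstract does not state across which axis the authors scaled or correlated, so this is not a reconstruction of their pipeline. The sketch only shows that z-scoring puts two modalities on a common scale, which is what matters when the values are pooled for clustering.

```python
from statistics import mean, pstdev


def z_score(xs):
    """Centre a feature on zero and scale it to unit (population) variance."""
    m, s = mean(xs), pstdev(xs)
    return [(x - m) / s for x in xs]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical values of one feature for five patients in each modality.
ct_feature = [1.0, 2.0, 3.0, 4.0, 5.0]
cbct_feature = [12.0, 14.0, 16.0, 18.0, 20.0]  # same ordering, different scale

ct_scaled = z_score(ct_feature)
cbct_scaled = z_score(cbct_feature)
rho = pearson(ct_feature, cbct_feature)
print(ct_scaled, cbct_scaled, rho)
```

    Note that a linear rescaling of a single feature leaves its Pearson coefficient unchanged, so in this toy case the benefit of scaling shows up in the matching value ranges rather than in the correlation.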

  17. The Network Structure of Human Personality According to the NEO-PI-R: Matching Network Community Structure to Factor Structure

    PubMed Central

    Goekoop, Rutger; Goekoop, Jaap G.; Scholte, H. Steven

    2012-01-01

    Introduction Human personality is described preferentially in terms of factors (dimensions) found using factor analysis. An alternative and highly related method is network analysis, which may have several advantages over factor analytic methods. Aim To directly compare the ability of network community detection (NCD) and principal component factor analysis (PCA) to examine modularity in multidimensional datasets such as the neuroticism-extraversion-openness personality inventory revised (NEO-PI-R). Methods 434 healthy subjects were tested on the NEO-PI-R. PCA was performed to extract factor structures (FS) of the current dataset using both item scores and facet scores. Correlational network graphs were constructed from univariate correlation matrices of interactions between both items and facets. These networks were pruned in a link-by-link fashion while calculating the network community structure (NCS) of each resulting network using the Wakita Tsurumi clustering algorithm. NCSs were matched against FS and networks of best matches were kept for further analysis. Results At facet level, NCS showed a best match (96.2%) with a ‘confirmatory’ 5-FS. At item level, NCS showed a best match (80%) with the standard 5-FS and involved a total of 6 network clusters. Lesser matches were found with ‘confirmatory’ 5-FS and ‘exploratory’ 6-FS of the current dataset. Network analysis did not identify facets as a separate level of organization in between items and clusters. A small-world network structure was found in both item- and facet level networks. Conclusion We present the first optimized network graph of personality traits according to the NEO-PI-R: a ‘Personality Web’. Such a web may represent the possible routes that subjects can take during personality development. NCD outperforms PCA by producing plausible modularity at item level in non-standard datasets, and can identify the key roles of individual items and clusters in the network. PMID:23284713

  18. On the lithium dip in the metal poor open cluster NGC 2243

    NASA Astrophysics Data System (ADS)

    François, P.; Pasquini, L.; Biazzo, K.; Bonifacio, P.; Palsa, R.

    2014-05-01

    Lithium is a key element for studying the mixing mechanisms operating in stellar interiors. It can also be used to probe the chemical evolution of the Galaxy and the Big Bang nucleosynthesis. Measuring the abundance of Lithium in stars belonging to Open Clusters (hereafter OC) allows a detailed comparison with stellar evolutionary models. NGC 2243 is particularly interesting thanks to its relative low metallicity ([Fe/H]=-0.54 ± 0.10 dex). We performed a detailed analysis of high-resolution spectra obtained with the multi-object facility FLAMES at the VLT 8.2m telescope. Lithium abundance has been measured in 27 stars. We found a Li dip center of 1.06 M⊙, which is significantly smaller than that observed in solar metallicity and metal-rich clusters. This finding confirms and strengthens the conclusion that the mass of the stars in the Li dip strongly depends on stellar metallicity. The mean Li abundance of the cluster is log n(Li) = 2.70 dex, which is substantially higher than that observed in 47 Tuc. We derived an iron abundance of [Fe/H]=-0.54±0.10 dex for NGC 2243, in agreement (within the errors) with previous findings.

  19. Myeloid Clusters Are Associated with a Pro-Metastatic Environment and Poor Prognosis in Smoking-Related Early Stage Non-Small Cell Lung Cancer

    PubMed Central

    Zhang, Wang; Pal, Sumanta K.; Liu, Xueli; Yang, Chunmei; Allahabadi, Sachin; Bhanji, Shaira; Figlin, Robert A.; Yu, Hua; Reckamp, Karen L.

    2013-01-01

    Background This study aimed to understand the role of myeloid cell clusters in uninvolved regional lymph nodes from early stage non-small cell lung cancer patients. Methods Uninvolved regional lymph node sections from 67 patients with stage I–III resected non-small cell lung cancer were immunostained to detect myeloid clusters, STAT3 activity and occult metastasis. Anthracosis intensity, myeloid cluster infiltration associated with anthracosis and pSTAT3 level were scored and correlated with patient survival. Multivariate Cox regression analysis was performed with prognostic variables. Human macrophages were used for in vitro nicotine treatment. Results CD68+ myeloid clusters associated with anthracosis and with an immunosuppressive and metastasis-promoting phenotype and elevated overall STAT3 activity were observed in uninvolved lymph nodes. In patients with a smoking history, myeloid cluster score significantly correlated with anthracosis intensity and pSTAT3 level (P<0.01). Nicotine activated STAT3 in macrophages in long-term culture. CD68+ myeloid clusters correlated and colocalized with occult metastasis. Myeloid cluster score was an independent prognostic factor (P = 0.049) and was associated with survival by Kaplan-Meier estimate in patients with a history of smoking (P = 0.055). The combination of myeloid cluster score with either lymph node stage or pSTAT3 level defined two populations with a significant difference in survival (P = 0.024 and P = 0.004, respectively). Conclusions Myeloid clusters facilitate a pro-metastatic microenvironment in uninvolved regional lymph nodes and associate with occult metastasis in early stage non-small cell lung cancer. Myeloid cluster score is an independent prognostic factor for survival in patients with a history of smoking, and may present a novel method to inform therapy choices in the adjuvant setting. Further validation studies are warranted. PMID:23717691

  20. The Gaia-ESO Survey: the present-day radial metallicity distribution of the Galactic disc probed by pre-main-sequence clusters

    NASA Astrophysics Data System (ADS)

    Spina, L.; Randich, S.; Magrini, L.; Jeffries, R. D.; Friel, E. D.; Sacco, G. G.; Pancino, E.; Bonito, R.; Bravi, L.; Franciosini, E.; Klutsch, A.; Montes, D.; Gilmore, G.; Vallenari, A.; Bensby, T.; Bragaglia, A.; Flaccomio, E.; Koposov, S. E.; Korn, A. J.; Lanzafame, A. C.; Smiljanic, R.; Bayo, A.; Carraro, G.; Casey, A. R.; Costado, M. T.; Damiani, F.; Donati, P.; Frasca, A.; Hourihane, A.; Jofré, P.; Lewis, J.; Lind, K.; Monaco, L.; Morbidelli, L.; Prisinzano, L.; Sousa, S. G.; Worley, C. C.; Zaggia, S.

    2017-05-01

    Context. The radial metallicity distribution in the Galactic thin disc represents a crucial constraint for modelling disc formation and evolution. Open star clusters allow us to derive both the radial metallicity distribution and its evolution over time. Aims: In this paper we perform the first investigation of the present-day radial metallicity distribution based on [Fe/H] determinations in late type members of pre-main-sequence clusters. Because of their youth, these clusters are essential for tracing the current interstellar medium metallicity. Methods: We used the products of the Gaia-ESO Survey analysis of 12 young regions (age < 100 Myr), covering Galactocentric distances from 6.67 to 8.70 kpc. For the first time, we derived the metal content of star forming regions farther than 500 pc from the Sun. Median metallicities were determined through samples of reliable cluster members. For ten clusters the membership analysis is discussed in the present paper, while for the other two clusters (i.e. Chamaeleon I and Gamma Velorum) we adopted the members identified in our previous works. Results: All the pre-main-sequence clusters considered in this paper have close-to-solar or slightly sub-solar metallicities. The radial metallicity distribution traced by these clusters is almost flat, with the innermost star forming regions having [Fe/H] values that are 0.10-0.15 dex lower than the majority of the older clusters located at similar Galactocentric radii. Conclusions: This homogeneous study of the present-day radial metallicity distribution in the Galactic thin disc favours models that predict a flattening of the radial gradient over time. On the other hand, the decrease of the average [Fe/H] at young ages is not easily explained by the models. Our results reveal a complex interplay of several processes (e.g. star formation activity, initial mass function, supernova yields, gas flows) that controlled the recent evolution of the Milky Way. 
    Based on observations made with the ESO/VLT, at Paranal Observatory, under program 188.B-3002 (The Gaia-ESO Public Spectroscopic Survey). Full Table 1 is only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/601/A70

  1. Cancer clusters in the USA: What do the last twenty years of state and federal investigations tell us?

    PubMed Central

    Goodman, Michael; Naiman, Joshua S.; Goodman, Dina; LaKind, Judy S.

    2012-01-01

    Background Cancer clusters garner considerable public and legislative attention, and there is often an expectation that cluster investigations in a community will reveal a causal link to an environmental exposure. At a 1989 national conference on disease clusters, it was reported that cluster studies conducted in the 1970s and 1980s rarely, if ever, produced important findings. We seek to answer the question: Have cancer cluster investigations conducted by US health agencies in the past 20 years improved our understanding of cancer etiology, or informed cancer prevention and control? Methods We reviewed publicly available cancer cluster investigation reports since 1990, obtained from literature searches and by canvassing all 50 states and the District of Columbia. Investigations were categorized with respect to cancer type(s), hypothesized exposure, whether perceived clusters were confirmed (e.g. by elevated incidence), and conclusions about a link between cancer(s) of concern and hypothesized environmental exposure(s). Results We reviewed 428 investigations evaluating 567 cancers of concern. An increase in incidence was confirmed for 72 (13%) cancer categories (including the category “all sites”). Three of those were linked (with variable degree of certainty) to hypothesized exposures, but only one investigation revealed a clear cause. Conclusions It is fair to state that extensive efforts to find causes of community cancer clusters have not been successful. There are fundamental shortcomings to our current methods of investigating community cancer clusters. We recommend a multidisciplinary national dialogue on creative, innovative approaches to understanding when and why cancer and other chronic diseases cluster in space and time. PMID:22519802

  2. Identification of Common Differentially Expressed Genes in Urinary Bladder Cancer

    PubMed Central

    Zaravinos, Apostolos; Lambrou, George I.; Boulalas, Ioannis; Delakas, Dimitris; Spandidos, Demetrios A.

    2011-01-01

    Background Current diagnosis and treatment of urinary bladder cancer (BC) has shown great progress with the utilization of microarrays. Purpose Our goal was to identify common differentially expressed (DE) genes among clinically relevant subclasses of BC using microarrays. Methodology/Principal Findings BC samples and controls, both experimental and publicly available datasets, were analyzed by whole genome microarrays. We grouped the samples according to their histology and defined the DE genes in each sample individually, as well as in each tumor group. A dual analysis strategy was followed. First, experimental samples were analyzed and conclusions were formulated; and second, experimental sets were combined with publicly available microarray datasets and were further analyzed in search of common DE genes. The experimental dataset identified 831 genes that were DE in all tumor samples, simultaneously. Moreover, 33 genes were up-regulated and 85 genes were down-regulated in all 10 BC samples compared to the 5 normal tissues, simultaneously. Hierarchical clustering partitioned tumor groups in accordance with their histology. K-means clustering of all genes and all samples, as well as clustering of tumor groups, presented 49 clusters. K-means clustering of common DE genes in all samples revealed 24 clusters. Genes manifested various differential patterns of expression, based on PCA. YY1 and NFκB were among the most common transcription factors that regulated the expression of the identified DE genes. Chromosome 1 contained 32 DE genes, followed by chromosomes 2 and 11, which contained 25 and 23 DE genes, respectively. Chromosome 21 had the least number of DE genes. GO analysis revealed the prevalence of transport and binding genes in the common down-regulated DE genes; the prevalence of RNA metabolism and processing genes in the up-regulated DE genes; as well as the prevalence of genes responsible for cell communication and signal transduction in the DE genes that were down-regulated in T1-Grade III tumors and up-regulated in T2/T3-Grade III tumors. Combination of samples from all microarray platforms revealed 17 common DE genes (BMP4, CRYGD, DBH, GJB1, KRT83, MPZ, NHLH1, TACR3, ACTC1, MFAP4, SPARCL1, TAGLN, TPM2, CDC20, LHCGR, TM9SF1 and HCCS), 4 of which participate in numerous pathways. Conclusions/Significance The identification of the common DE genes among BC samples of different histology can provide further insight into the discovery of new putative markers. PMID:21483740

  3. Mammographic images segmentation based on chaotic map clustering algorithm

    PubMed Central

    2014-01-01

    Background This work investigates the applicability of a novel clustering approach to the segmentation of mammographic digital images. The chaotic map clustering algorithm is used to group together similar subsets of image pixels resulting in a medically meaningful partition of the mammography. Methods The image is divided into pixels subsets characterized by a set of conveniently chosen features and each of the corresponding points in the feature space is associated to a map. A mutual coupling strength between the maps depending on the associated distance between feature space points is subsequently introduced. On the system of maps, the simulated evolution through chaotic dynamics leads to its natural partitioning, which corresponds to a particular segmentation scheme of the initial mammographic image. Results The system provides a high recognition rate for small mass lesions (about 94% correctly segmented inside the breast) and the reproduction of the shape of regions with denser micro-calcifications in about 2/3 of the cases, while being less effective on identification of larger mass lesions. Conclusions We can summarize our analysis by asserting that due to the particularities of the mammographic images, the chaotic map clustering algorithm should not be used as the sole method of segmentation. It is rather the joint use of this method along with other segmentation techniques that could be successfully used for increasing the segmentation performance and for providing extra information for the subsequent analysis stages such as the classification of the segmented ROI. PMID:24666766

  4. Workplace cluster of Bell’s palsy in Lima, Peru

    PubMed Central

    2014-01-01

    Background We report on a workplace cluster of Bell’s palsy that occurred within a four-month period in 2011 among employees of a three-story office building in Lima, Peru and our investigation to determine the etiology and associated risk factors. Findings An outbreak investigation was conducted to identify possible common infectious or environmental exposures and included patient interviews, reviews of medical records, an epidemiologic survey, serological analysis for IgM and IgG antibodies to putative Bell’s palsy-inducing pathogens, and an environmental exposure assessment of the office building. Three cases of Bell’s palsy were reported among 65 at-risk employees, attack rate 4.6%. Although two patients had underlying risk factors, there was no clear association or common identifiable risk factor among all cases. Serologic analysis showed no evidence of recent infections, and air and water sample measures of all known chemical or neurotoxins were below maximum allowable concentrations for exposure. Conclusions An infection spread among workplace employees could not be excluded as a potential cause of this cluster; however, it was unlikely a pathogen commonly associated with individual cases of Bell’s palsy. Although a specific etiology was not identified among all cases, we believe this methodology will aid future outbreak investigations of Bell’s palsy and a better understanding of its etiology. While environmental assessments may be useful in their ability to ascertain the cause of clusters of Bell’s palsy, future investigations should prioritize focus on common infectious etiology. PMID:24885256

  5. A Highly Efficient Design Strategy for Regression with Outcome Pooling

    PubMed Central

    Mitchell, Emily M.; Lyles, Robert H.; Manatunga, Amita K.; Perkins, Neil J.; Schisterman, Enrique F.

    2014-01-01

    The potential for research involving biospecimens can be hindered by the prohibitive cost of performing laboratory assays on individual samples. To mitigate this cost, strategies such as randomly selecting a portion of specimens for analysis or randomly pooling specimens prior to performing laboratory assays may be employed. These techniques, while effective in reducing cost, are often accompanied by a considerable loss of statistical efficiency. We propose a novel pooling strategy based on the k-means clustering algorithm to reduce laboratory costs while maintaining a high level of statistical efficiency when predictor variables are measured on all subjects, but the outcome of interest is assessed in pools. We perform simulations motivated by the BioCycle study to compare this k-means pooling strategy with current pooling and selection techniques under simple and multiple linear regression models. While all of the methods considered produce unbiased estimates and confidence intervals with appropriate coverage, pooling under k-means clustering provides the most precise estimates, closely approximating results from the full data and losing minimal precision as the total number of pools decreases. The benefits of k-means clustering evident in the simulation study are then applied to an analysis of the BioCycle dataset. In conclusion, when the number of lab tests is limited by budget, pooling specimens based on k-means clustering prior to performing lab assays can be an effective way to save money with minimal information loss in a regression setting. PMID:25220822
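
    The pooling idea described in this abstract can be sketched under simplifying assumptions: a single predictor, a noise-free linear outcome, a plain Lloyd's k-means written out by hand, and pools whose assayed value equals the average outcome of their members. All names and data below are hypothetical, and the pool-size-weighted least-squares fit is a standard choice for averaged outcomes rather than the authors' stated estimator.

```python
def kmeans_1d(xs, k, iters=50):
    """Plain Lloyd's algorithm on a 1-D predictor; returns one label per x."""
    ordered = sorted(xs)
    # Deterministic start: spread the initial centres across the sorted values.
    centres = [ordered[(2 * c + 1) * len(xs) // (2 * k)] for c in range(k)]
    labels = [0] * len(xs)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: abs(x - centres[c])) for x in xs]
        for c in range(k):
            members = [x for x, lab in zip(xs, labels) if lab == c]
            if members:
                centres[c] = sum(members) / len(members)
    return labels


def make_pools(xs, ys, labels, k):
    """One (size, mean predictor, mean outcome) triple per non-empty cluster."""
    pools = []
    for c in range(k):
        idx = [i for i, lab in enumerate(labels) if lab == c]
        if idx:
            n = len(idx)
            pools.append((n, sum(xs[i] for i in idx) / n,
                          sum(ys[i] for i in idx) / n))
    return pools


def weighted_fit(pools):
    """Least-squares line through the pool means, weighted by pool size."""
    total = sum(n for n, _, _ in pools)
    x_bar = sum(n * x for n, x, _ in pools) / total
    y_bar = sum(n * y for n, _, y in pools) / total
    sxy = sum(n * (x - x_bar) * (y - y_bar) for n, x, y in pools)
    sxx = sum(n * (x - x_bar) ** 2 for n, x, _ in pools)
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


# Hypothetical data: nine subjects, outcome exactly 1 + 2 * predictor.
xs = [0.0, 0.2, 0.4, 3.0, 3.1, 3.3, 7.0, 7.2, 7.5]
ys = [1.0 + 2.0 * x for x in xs]

labels = kmeans_1d(xs, k=3)
pools = make_pools(xs, ys, labels, k=3)  # three assays instead of nine
slope, intercept = weighted_fit(pools)
print(pools, slope, intercept)
```

    Grouping subjects with similar predictor values keeps the pool means spread out along the predictor axis, which is the intuition for why this kind of pooling loses little precision compared with random pooling.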

  6. Galaxy CloudMan: delivering cloud compute clusters

    PubMed Central

    2010-01-01

    Background Widespread adoption of high-throughput sequencing has greatly increased the scale and sophistication of computational infrastructure needed to perform genomic research. An alternative to building and maintaining local infrastructure is “cloud computing”, which, in principle, offers on demand access to flexible computational infrastructure. However, cloud computing resources are not yet suitable for immediate “as is” use by experimental biologists. Results We present a cloud resource management system that makes it possible for individual researchers to compose and control an arbitrarily sized compute cluster on Amazon’s EC2 cloud infrastructure without any informatics requirements. Within this system, an entire suite of biological tools packaged by the NERC Bio-Linux team (http://nebc.nerc.ac.uk/tools/bio-linux) is available for immediate consumption. The provided solution makes it possible, using only a web browser, to create a completely configured compute cluster ready to perform analysis in less than five minutes. Moreover, we provide an automated method for building custom deployments of cloud resources. This approach promotes reproducibility of results and, if desired, allows individuals and labs to add or customize an otherwise available cloud system to better meet their needs. Conclusions The expected knowledge and associated effort with deploying a compute cluster in the Amazon EC2 cloud is not trivial. The solution presented in this paper eliminates these barriers, making it possible for researchers to deploy exactly the amount of computing power they need, combined with a wealth of existing analysis software, to handle the ongoing data deluge. PMID:21210983

  7. A highly efficient design strategy for regression with outcome pooling.

    PubMed

    Mitchell, Emily M; Lyles, Robert H; Manatunga, Amita K; Perkins, Neil J; Schisterman, Enrique F

    2014-12-10

    The potential for research involving biospecimens can be hindered by the prohibitive cost of performing laboratory assays on individual samples. To mitigate this cost, strategies such as randomly selecting a portion of specimens for analysis or randomly pooling specimens prior to performing laboratory assays may be employed. These techniques, while effective in reducing cost, are often accompanied by a considerable loss of statistical efficiency. We propose a novel pooling strategy based on the k-means clustering algorithm to reduce laboratory costs while maintaining a high level of statistical efficiency when predictor variables are measured on all subjects, but the outcome of interest is assessed in pools. We perform simulations motivated by the BioCycle study to compare this k-means pooling strategy with current pooling and selection techniques under simple and multiple linear regression models. While all of the methods considered produce unbiased estimates and confidence intervals with appropriate coverage, pooling under k-means clustering provides the most precise estimates, closely approximating results from the full data and losing minimal precision as the total number of pools decreases. The benefits of k-means clustering evident in the simulation study are then applied to an analysis of the BioCycle dataset. In conclusion, when the number of lab tests is limited by budget, pooling specimens based on k-means clustering prior to performing lab assays can be an effective way to save money with minimal information loss in a regression setting. Copyright © 2014 John Wiley & Sons, Ltd.
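
    The pooling idea described in this abstract can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a single scalar predictor, assumes a pooled assay returns the arithmetic mean of the individual outcomes in the pool, and fits a least-squares line to the pool means weighted by pool size. The function names `kmeans_1d` and `pooled_regression` are hypothetical.

```python
import random

def kmeans_1d(xs, k, iters=50, seed=0):
    """Plain Lloyd's algorithm on scalar predictors; returns one pool label per subject."""
    rng = random.Random(seed)
    centers = rng.sample(xs, k)  # assumption: start from k randomly chosen subjects
    labels = [0] * len(xs)
    for _ in range(iters):
        # Assign every subject to the nearest center.
        labels = [min(range(k), key=lambda j: (x - centers[j]) ** 2) for x in xs]
        # Move each center to the mean of its members (empty pools keep their center).
        for j in range(k):
            members = [x for x, lab in zip(xs, labels) if lab == j]
            if members:
                centers[j] = sum(members) / len(members)
    return labels

def pooled_regression(xs, ys, labels):
    """Weighted least squares of pool-mean outcome on pool-mean predictor (weights = pool size)."""
    pools = {}
    for x, y, lab in zip(xs, ys, labels):
        pools.setdefault(lab, []).append((x, y))
    sizes = [len(p) for p in pools.values()]
    xbar = [sum(x for x, _ in p) / len(p) for p in pools.values()]
    ybar = [sum(y for _, y in p) / len(p) for p in pools.values()]  # assumed assay result per pool
    total = sum(sizes)
    mx = sum(n * x for n, x in zip(sizes, xbar)) / total
    my = sum(n * y for n, y in zip(sizes, ybar)) / total
    sxx = sum(n * (x - mx) ** 2 for n, x in zip(sizes, xbar))
    sxy = sum(n * (x - mx) * (y - my) for n, x, y in zip(sizes, xbar, ybar))
    slope = sxy / sxx
    return my - slope * mx, slope  # (intercept, slope)
```

    Grouping subjects with similar predictor values keeps the pool means spread out along the predictor axis, which is the intuition behind the precision gain over random pooling that the abstract reports.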

  8. Subtypes of female juvenile offenders: a cluster analysis of the Millon Adolescent Clinical Inventory.

    PubMed

    Stefurak, Tres; Calhoun, Georgia B

    2007-01-01

    The current study sought to explore subtypes of adolescents within a sample of female juvenile offenders. Using the Millon Adolescent Clinical Inventory with 101 female juvenile offenders, a two-step cluster analysis was performed, beginning with a Ward's method hierarchical cluster analysis followed by a K-Means iterative partitioning cluster analysis. The results suggest an optimal three-cluster solution, with cluster profiles leading to the following group labels: Externalizing Problems, Depressed/Interpersonally Ambivalent, and Anxious Prosocial. Analyses along the factors of age, race, offense typology and offense chronicity were conducted to further understand the nature of the clusters found. Only the effect for race was significant, with the Anxious Prosocial and Depressed/Interpersonally Ambivalent clusters appearing disproportionately comprised of African American girls. To establish external validity, clusters were compared across scales of the Behavioral Assessment System for Children - Self Report of Personality, and corroborative distinctions between clusters were found here.
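
    The two-step procedure named in this abstract (Ward's hierarchical clustering followed by k-means) can be sketched as follows. This is a minimal pure-Python illustration, assuming the common convention that the Ward solution supplies the starting centroids for k-means; the abstract does not state how the two steps were linked, and the function names are hypothetical.

```python
def centroid(points):
    """Coordinate-wise mean of a list of equal-length points."""
    dim = len(points[0])
    return [sum(p[d] for p in points) / len(points) for d in range(dim)]

def sqdist(a, b):
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def ward_clusters(points, k):
    """Step 1: agglomerate until k clusters remain, always merging the pair with the
    smallest Ward cost n_a*n_b/(n_a+n_b) * ||c_a - c_b||^2 (naive exhaustive search)."""
    clusters = [[p] for p in points]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                a, b = clusters[i], clusters[j]
                cost = len(a) * len(b) / (len(a) + len(b)) * sqdist(centroid(a), centroid(b))
                if best is None or cost < best[0]:
                    best = (cost, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return clusters

def kmeans_refine(points, centers, iters=20):
    """Step 2: k-means passes started from the given centers; returns one label per point."""
    labels = []
    for _ in range(iters):
        labels = [min(range(len(centers)), key=lambda c: sqdist(p, centers[c])) for p in points]
        for c in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = centroid(members)
    return labels

def two_step(points, k):
    """Ward solution seeds the k-means refinement (assumed linkage between the steps)."""
    seeds = [centroid(c) for c in ward_clusters(points, k)]
    return kmeans_refine(points, seeds)
```

    Seeding k-means from a hierarchical solution removes the dependence on random starting centroids, which is one common reason for combining the two methods.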

  9. [Cluster analysis in biomedical researches].

    PubMed

    Akopov, A S; Moskovtsev, A A; Dolenko, S A; Savina, G D

    2013-01-01

    Cluster analysis is one of the most popular methods for the analysis of multi-parameter data. Cluster analysis reveals the internal structure of the data by grouping separate observations according to their degree of similarity. The review defines the basic concepts of cluster analysis and discusses the most popular clustering algorithms: k-means, hierarchical algorithms, and Kohonen network algorithms. Examples of the use of these algorithms in biomedical research are given.

  10. Microsatellites Reveal a High Population Structure in Triatoma infestans from Chuquisaca, Bolivia

    PubMed Central

    Pizarro, Juan Carlos; Gilligan, Lauren M.; Stevens, Lori

    2008-01-01

    Background For Chagas disease, the most serious infectious disease in the Americas, effective disease control depends on elimination of vectors through spraying with insecticides. Molecular genetic research can help vector control programs by identifying and characterizing vector populations and then developing effective intervention strategies. Methods and Findings The population genetic structure of Triatoma infestans (Hemiptera: Reduviidae), the main vector of Chagas disease in Bolivia, was investigated using a hierarchical sampling strategy. A total of 230 adults and nymphs from 23 localities throughout the department of Chuquisaca in Southern Bolivia were analyzed at ten microsatellite loci. Population structure, estimated using analysis of molecular variance (AMOVA) to estimate FST (infinite alleles model) and RST (stepwise mutation model), was significant between western and eastern regions within Chuquisaca and between insects collected in domestic and peri-domestic habitats. Genetic differentiation at three different hierarchical geographic levels was significant, even in the case of adjacent households within a single locality (RST = 0.14, FST = 0.07). On the largest geographic scale, among five communities up to 100 km apart, RST = 0.12 and FST = 0.06. Cluster analysis combined with assignment tests identified five clusters within the five communities. Conclusions Some houses are colonized by insects from several genetic clusters after spraying, whereas other households are colonized predominately by insects from a single cluster. Significant population structure, measured by both RST and FST, supports the hypothesis of poor dispersal ability and/or reduced migration of T. infestans. The high degree of genetic structure at small geographic scales, inferences from cluster analysis and assignment tests, and demographic data suggest reinfesting vectors are coming from nearby and from recrudescence (hatching of eggs that were laid before insecticide spraying). Suggestions for using these results in vector control strategies are made. PMID:18365033
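
    To make the FST values quoted in this abstract easier to interpret, the sketch below computes a simple heterozygosity-based fixation index, FST = (HT - HS) / HT, from allele frequencies. This is not the AMOVA procedure the authors used; it is a swapped-in textbook estimator that assumes equally weighted subpopulations and known allele frequencies, and the function name is hypothetical.

```python
def fst_from_frequencies(subpops):
    """Heterozygosity-based FST for one locus.

    subpops: list of allele-frequency lists, one per subpopulation, each summing to 1
    and listing the same alleles in the same order. Subpopulations are weighted equally
    (an assumption of this sketch).
    """
    n_alleles = len(subpops[0])
    # Mean frequency of each allele across subpopulations.
    p_bar = [sum(pop[a] for pop in subpops) / len(subpops) for a in range(n_alleles)]
    # Expected heterozygosity in the total population.
    h_t = 1 - sum(p * p for p in p_bar)
    # Mean expected heterozygosity within subpopulations.
    h_s = sum(1 - sum(p * p for p in pop) for pop in subpops) / len(subpops)
    if h_t == 0:
        raise ValueError("locus is monomorphic in the total population; FST is undefined")
    return (h_t - h_s) / h_t
```

    Under this definition, identical allele frequencies in every subpopulation give 0 and complete fixation of different alleles gives 1, so values such as 0.06 to 0.07 sit toward the low end of the scale while still indicating measurable differentiation.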

  11. Conformational and functional analysis of molecular dynamics trajectories by Self-Organising Maps

    PubMed Central

    2011-01-01

    Background Molecular dynamics (MD) simulations are powerful tools to investigate the conformational dynamics of proteins, which is often a critical element of their function. Identification of functionally relevant conformations is generally done by clustering the large ensemble of structures that are generated. Recently, Self-Organising Maps (SOMs) were reported to perform more accurately and provide more consistent results than traditional clustering algorithms in various data mining problems. We present a novel strategy to analyse and compare conformational ensembles of protein domains using a two-level approach that combines SOMs and hierarchical clustering. Results The conformational dynamics of the α-spectrin SH3 protein domain and six single mutants were analysed by MD simulations. The Cα's Cartesian coordinates of conformations sampled in the essential space were used as input data vectors for SOM training, then complete linkage clustering was performed on the SOM prototype vectors. A specific protocol to optimize a SOM for structural ensembles was proposed: the optimal SOM was selected by means of a Taguchi experimental design plan applied to different data sets, and the optimal sampling rate of the MD trajectory was selected. The proposed two-level approach was applied to single trajectories of the SH3 domain independently as well as to groups of them at the same time. The results demonstrated the potential of this approach in the analysis of large ensembles of molecular structures: the possibility of producing a topological mapping of the conformational space in a simple 2D visualisation, as well as of effectively highlighting differences in the conformational dynamics directly related to biological functions. Conclusions The use of a two-level approach combining SOMs and hierarchical clustering for conformational analysis of structural ensembles of proteins was proposed. It can easily be extended to other study cases and to conformational ensembles from other sources. PMID:21569575

  12. Investigation of spacial clustering of rare diseases: childhood malignancies in North Humberside.

    PubMed

    Alexander, F; Cartwright, R; McKinney, P A; Ricketts, T J

    1990-03-01

    The aims of the study were (1) to test for uniformity of distribution of childhood leukaemias and other malignancies; and (2) to consider the aetiological implications of unusual distributions. A test for spacial clustering was applied using a method which allows for unequal distribution of the population at risk and avoids using census data to provide population denominators. When clustering was identified, four possible aetiological links which had already been suggested to the Leukaemia Research Fund Centre were examined in a local area. The study was carried out in the Yorkshire Health Region in the north of England. 144 children under 15 years of age with a diagnosis of malignant disease known to the Yorkshire Regional Childhood Tumour Registry between 1974 and 1986 were included in the analysis. Of these 53 had leukaemias and nine had lymphomas. Significant localised clustering was found in North Humberside, though not in the whole of the Yorkshire Health Region. A number of clustered cases were identified, some of whom were in a post code sector, Hull 10, to the west of Kingston-upon-Hull, about which concern had been expressed since 1985. There was however no evidence that disease clustering was confined to this area. Four previously suggested hypotheses about causation in this particular area were examined but the results were negative or inconclusive. The identification of spacial clustering must be seen as only the first step in a series of investigations; it can only rarely lead to aetiological conclusions by itself, but it can motivate and target other investigations.

  13. Parenting practices are associated with fruit and vegetable consumption in pre-school children

    PubMed Central

    O’Connor, Teresia M; Hughes, Sheryl O; Watson, Kathy B; Baranowski, Tom; Nicklas, Theresa A; Fisher, Jennie O; Beltran, Alicia; Baranowski, Janice C; Qu, Haiyan; Shewchuk, Richard M

    2009-01-01

    Objective Parents may influence children’s fruit and vegetable (F&V) consumption in many ways, but research has focused primarily on counterproductive parenting practices, such as restriction and pressure to eat. The present study aimed to assess the association of diverse parenting practices to promote F&V and its consumption among pre-school children. Design An exploratory analysis was performed on cross-sectional data from 755 Head Start pre-school children and their parents collected in 2004–5. Data included parent practices to facilitate child F&V consumption (grouped into five categories); parent-reported dietary intake of their child over 3 d; and a number of potential correlates. K-means cluster analysis assigned parents to groups with similar use of the food parenting practice categories. Stepwise linear regression analyses investigated the association of parent clusters with children’s consumption of F&V, after controlling for potential confounding factors. Results A three-cluster solution provided the best fit (R2 = 0·62), with substantial differences in the use of parenting practices. The clusters were labelled Indiscriminate Food Parenting, Non-directive Food Parenting and Low-involved Food Parenting. Non-directive parents extensively used enhanced availability and teachable moments’ practices, but less firm discipline practices than the other clusters, and were significantly associated with child F&V intake (standardized β = 0·09, P < 0·1; final model R2 = 0·17) after controlling for confounders, including parental feeding styles. Conclusions Parents use a variety of parenting practices, beyond pressuring to eat and restrictive practices, to promote F&V intake in their young child. Evaluating the use of combinations of practices may provide a better understanding of parental influences on children’s F&V intake. PMID:19490734

  14. Wheat EST resources for functional genomics of abiotic stress

    PubMed Central

    Houde, Mario; Belcaid, Mahdi; Ouellet, François; Danyluk, Jean; Monroy, Antonio F; Dryanova, Ani; Gulick, Patrick; Bergeron, Anne; Laroche, André; Links, Matthew G; MacCarthy, Luke; Crosby, William L; Sarhan, Fathey

    2006-01-01

    Background Wheat is an excellent species to study freezing tolerance and other abiotic stresses. However, the sequence of the wheat genome has not been completely characterized due to its complexity and large size. To circumvent this obstacle and identify genes involved in cold acclimation and associated stresses, a large scale EST sequencing approach was undertaken by the Functional Genomics of Abiotic Stress (FGAS) project. Results We generated 73,521 quality-filtered ESTs from eleven cDNA libraries constructed from wheat plants exposed to various abiotic stresses and at different developmental stages. In addition, 196,041 ESTs for which tracefiles were available from the National Science Foundation wheat EST sequencing program and DuPont were also quality-filtered and used in the analysis. Clustering of the combined ESTs with d2_cluster and TGICL yielded a few large clusters containing several thousand ESTs that were refractory to routine clustering techniques. To resolve this problem, the sequence proximity and "bridges" were identified by an e-value distance graph to manually break clusters into smaller groups. Assembly of the resolved ESTs generated a 75,488 unique sequence set (31,580 contigs and 43,908 singletons/singlets). Digital expression analyses indicated that the FGAS dataset is enriched in stress-regulated genes compared to the other public datasets. Over 43% of the unique sequence set was annotated and classified into functional categories according to Gene Ontology. Conclusion We have annotated 29,556 different sequences, an almost 5-fold increase in annotated sequences compared to the available wheat public databases. Digital expression analysis combined with gene annotation helped in the identification of several pathways associated with abiotic stress. The genomic resources and knowledge developed by this project will contribute to a better understanding of the different mechanisms that govern stress tolerance in wheat and other cereals. PMID:16772040

  15. Analyzing polysemous concepts from a clinical perspective: Application to auditing concept categorization in the UMLS

    PubMed Central

    Mougin, Fleur; Bodenreider, Olivier; Burgun, Anita

    2015-01-01

    Objectives Polysemy is a frequent issue in biomedical terminologies. In the Unified Medical Language System (UMLS), polysemous terms are either represented as several independent concepts, or clustered into a single, multiply-categorized concept. The objective of this study is to analyze polysemous concepts in the UMLS through their categorization and hierarchical relations for auditing purposes. Methods We used the association of a concept with multiple Semantic Groups (SGs) as a surrogate for polysemy. We first extracted multi-SG (MSG) concepts from the UMLS Metathesaurus and characterized them in terms of the combinations of SGs with which they are associated. We then clustered MSG concepts in order to identify major types of polysemy. We also analyzed the inheritance of SGs in MSG concepts. Finally, we manually reviewed the categorization of the MSG concepts for auditing purposes. Results The 1208 MSG concepts in the Metathesaurus are associated with 30 distinct pairs of SGs. We created 75 semantically homogeneous clusters of MSG concepts, and 276 MSG concepts could not be clustered for lack of hierarchical relations. The clusters were characterized by the most frequent pairs of semantic types of their constituent MSG concepts. MSG concepts exhibit limited semantic compatibility with their parent and child concepts. A large majority of MSG concepts (92%) are adequately categorized. Examples of miscategorized concepts are presented. Conclusion This work is a systematic analysis and manual review of all concepts categorized by multiple SGs in the UMLS. The correctly-categorized MSG concepts do reflect polysemy in the UMLS Metathesaurus. The analysis of inheritance of SGs proved useful for auditing concept categorization in the UMLS. PMID:19303057

  16. Symptom Clusters and Quality of Life in Hospice Patients with Cancer

    PubMed Central

    Omran, Suha; Khader, Yousef; McMillan, Susan

    2017-01-01

    Background: Symptom control is an important part of palliative care and important to achieve optimal quality of life (QOL). Studies have shown that patients with advanced cancer suffer from diverse and often severe physical and psychological symptoms. The aim is to explore the influence of symptom clusters on QOL among patients with advanced cancer. Materials and Methods: 709 patients with advanced cancer were recruited to participate in a clinical trial focusing on symptom management and QOL. Patients were adults newly admitted to hospice home care in one of two hospices in southwest Florida, who could pass mental status screening. The instruments used for data collection were the Demographic Data Form, Memorial Symptom Assessment Scale (MSAS), and the Hospice Quality of Life Index-14. Results: Exploratory factor analysis and multiple regression were used to identify symptom clusters and their influence on QOL. The results revealed that the participants experienced multiple concurrent symptoms. There were four symptom clusters found among these cancer patients. Individual symptom distress scores that were the strongest predictors of QOL were: feeling pain; dry mouth; feeling drowsy; nausea; difficulty swallowing; worrying and feeling nervous. Conclusions: Patients with advanced cancer reported various concurrent symptoms, and these form symptom clusters of four main categories. The four symptom clusters have a negative influence on patients’ QOL and require specific care from different members of the hospice healthcare team. The results of this study should be used to guide health care providers’ symptom management. Proper attention to symptom clusters should be the basis for accurate planning of effective interventions to manage the symptom clusters experienced by advanced cancer patients. The health care provider needs to plan ahead for these symptoms and manage any concurrent symptoms for successful promotion of their patient’s QOL. PMID:28950683

  17. Symptom Clusters and Quality of Life in Hospice Patients with Cancer

    PubMed

    Omran, Suha; Khader, Yousef; McMillan, Susan

    2017-09-27

    Background: Symptom control is an important part of palliative care and important to achieve optimal quality of life (QOL). Studies have shown that patients with advanced cancer suffer from diverse and often severe physical and psychological symptoms. The aim is to explore the influence of symptom clusters on QOL among patients with advanced cancer. Materials and Methods: 709 patients with advanced cancer were recruited to participate in a clinical trial focusing on symptom management and QOL. Patients were adults newly admitted to hospice home care in one of two hospices in southwest Florida, who could pass mental status screening. The instruments used for data collection were the Demographic Data Form, Memorial Symptom Assessment Scale (MSAS), and the Hospice Quality of Life Index-14. Results: Exploratory factor analysis and multiple regression were used to identify symptom clusters and their influence on QOL. The results revealed that the participants experienced multiple concurrent symptoms. There were four symptom clusters found among these cancer patients. Individual symptom distress scores that were the strongest predictors of QOL were: feeling pain; dry mouth; feeling drowsy; nausea; difficulty swallowing; worrying and feeling nervous. Conclusions: Patients with advanced cancer reported various concurrent symptoms, and these form symptom clusters of four main categories. The four symptom clusters have a negative influence on patients’ QOL and require specific care from different members of the hospice healthcare team. The results of this study should be used to guide health care providers’ symptom management. Proper attention to symptom clusters should be the basis for accurate planning of effective interventions to manage the symptom clusters experienced by advanced cancer patients. The health care provider needs to plan ahead for these symptoms and manage any concurrent symptoms for successful promotion of their patient’s QOL.
Creative Commons Attribution License

  18. Performance of small cluster surveys and the clustered LQAS design to estimate local-level vaccination coverage in Mali

    PubMed Central

    2012-01-01

    Background Estimation of vaccination coverage at the local level is essential to identify communities that may require additional support. Cluster surveys can be used in resource-poor settings, when population figures are inaccurate. To be feasible, cluster samples need to be small, without losing robustness of results. The clustered LQAS (CLQAS) approach has been proposed as an alternative, as smaller sample sizes are required. Methods We explored (i) the efficiency of cluster surveys of decreasing sample size through bootstrapping analysis and (ii) the performance of CLQAS under three alternative sampling plans to classify local VC, using data from a survey carried out in Mali after mass vaccination against meningococcal meningitis group A. Results VC estimates provided by a 10 × 15 cluster survey design were reasonably robust. We used them to classify health areas in three categories and guide mop-up activities: i) health areas not requiring supplemental activities; ii) health areas requiring additional vaccination; iii) health areas requiring further evaluation. As sample size decreased (from 10 × 15 to 10 × 3), standard error of VC and ICC estimates were increasingly unstable. Results of CLQAS simulations were not accurate for most health areas, with an overall risk of misclassification greater than 0.25 in one health area out of three. It was greater than 0.50 in one health area out of two under two of the three sampling plans. Conclusions Small sample cluster surveys (10 × 15) are acceptably robust for classification of VC at local level. We do not recommend the CLQAS method as currently formulated for evaluating vaccination programmes. PMID:23057445
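
    The bootstrapping analysis mentioned in this abstract can be illustrated with a short sketch. It assumes a two-stage resampling scheme (clusters drawn with replacement, then a fixed number of children drawn without replacement inside each selected cluster); the abstract does not spell out the authors' exact scheme, and the function name `bootstrap_coverage` is hypothetical.

```python
import random
import statistics

def bootstrap_coverage(clusters, per_cluster, reps=1000, seed=0):
    """Bootstrap mean and standard error of vaccination coverage.

    clusters: list of clusters, each a list of 0/1 vaccination statuses.
    per_cluster: number of children kept per resampled cluster (e.g. 15 down to 3).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(reps):
        # Stage 1: resample whole clusters with replacement.
        sampled = [rng.choice(clusters) for _ in clusters]
        hits = total = 0
        for cluster in sampled:
            # Stage 2: subsample children within the cluster, without replacement.
            sub = rng.sample(cluster, per_cluster)
            hits += sum(sub)
            total += per_cluster
        estimates.append(hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

    Calling the function on the same survey data with `per_cluster` set to 15 and then to 3 gives a rough sense of how much the standard error grows as the within-cluster sample shrinks, which is the comparison the abstract describes.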

  19. The effect of cognitive appraisal for stressors on the oral health-related QOL of dry mouth patients

    PubMed Central

    2014-01-01

    Background Dry mouth is a very common symptom, and psychological factors have an influence on this symptom. Although the influence of emotional factors related to patients with oral dryness has been examined in previous studies, cognitive factors have not been examined thus far. Objective The purpose of this study was to examine the influence of cognitive factors on patients with oral dryness. Methods The participants were 106 patients complaining of oral dryness. They were required to complete a questionnaire measuring subjective oral dryness, oral-related QOL, cognition for stressors, and mood state. Results Correlational analyses revealed that OHIP-14 is significantly related to oral dryness, appraisal for effect, appraisal for threat, and commitment. These correlations were maintained even after controlling for the influence of depression and anxiety. Using oral dryness, appraisal for effect, appraisal for threat, and commitment, cluster analysis was done and three clusters (cluster-1, severe oral dryness; cluster-2, positive cognitive style; cluster-3, negative cognitive style) were extracted. The results of ANOVA showed that the group with severe oral dryness (cluster-1) had a significantly higher score on OHIP-14 than the other two groups. There was no significant difference between the groups with positive (cluster-2) and negative (cluster-3) cognitive style. Conclusion Although the group of patients with positive cognitive style complained of more severe oral dryness than the group with negative cognitive style, no significant difference was observed between these two groups in OHIP-14. These results indicate that cognitive factors would be a useful therapeutic target for the improvement of the oral-related QOL of patients with oral dryness. PMID:26019720

  20. Electronic medical records and physician stress in primary care: results from the MEMO Study

    PubMed Central

    Babbott, Stewart; Manwell, Linda Baier; Brown, Roger; Montague, Enid; Williams, Eric; Schwartz, Mark; Hess, Erik; Linzer, Mark

    2014-01-01

    Background Little has been written about physician stress that may be associated with electronic medical records (EMR). Objective We assessed relationships between the number of EMR functions, primary care work conditions, and physician satisfaction, stress and burnout. Design and participants 379 primary care physicians and 92 managers at 92 clinics from New York City and the upper Midwest participating in the 2001–5 Minimizing Error, Maximizing Outcome (MEMO) Study. A latent class analysis identified clusters of physicians within clinics with low, medium and high EMR functions. Main measures We assessed physician-reported stress, burnout, satisfaction, and intent to leave the practice, and predictors including time pressure during visits. We used a two-level regression model to estimate the mean response for each physician cluster to each outcome, adjusting for physician age, sex, specialty, work hours and years using the EMR. Effect sizes (ES) of these relationships were considered small (0.14), moderate (0.39), and large (0.61). Key results Compared to the low EMR cluster, physicians in the moderate EMR cluster reported more stress (ES 0.35, p=0.03) and lower satisfaction (ES −0.45, p=0.006). Physicians in the high EMR cluster indicated lower satisfaction than low EMR cluster physicians (ES −0.39, p=0.01). Time pressure was associated with significantly more burnout, dissatisfaction and intent to leave only within the high EMR cluster. Conclusions Stress may rise for physicians with a moderate number of EMR functions. Time pressure was associated with poor physician outcomes mainly in the high EMR cluster. Work redesign may address these stressors. PMID:24005796

  1. Pain Sensitivity Subgroups in Individuals With Spine Pain: Potential Relevance to Short-Term Clinical Outcome

    PubMed Central

    Bialosky, Joel E.; Robinson, Michael E.

    2014-01-01

    Background Cluster analysis can be used to identify individuals similar in profile based on response to multiple pain sensitivity measures. There are limited investigations into how empirically derived pain sensitivity subgroups influence clinical outcomes for individuals with spine pain. Objective The purposes of this study were: (1) to investigate empirically derived subgroups based on pressure and thermal pain sensitivity in individuals with spine pain and (2) to examine subgroup influence on 2-week clinical pain intensity and disability outcomes. Design A secondary analysis of data from 2 randomized trials was conducted. Methods Baseline and 2-week outcome data from 157 participants with low back pain (n=110) and neck pain (n=47) were examined. Participants completed demographic, psychological, and clinical information and were assessed using pain sensitivity protocols, including pressure (suprathreshold pressure pain) and thermal pain sensitivity (thermal heat threshold and tolerance, suprathreshold heat pain, temporal summation). A hierarchical agglomerative cluster analysis was used to create subgroups based on pain sensitivity responses. Differences in data for baseline variables, clinical pain intensity, and disability were examined. Results Three pain sensitivity cluster groups were derived: low pain sensitivity, high thermal static sensitivity, and high pressure and thermal dynamic sensitivity. There were differences in the proportion of individuals meeting a 30% change in pain intensity, where fewer individuals within the high pressure and thermal dynamic sensitivity group (adjusted odds ratio=0.3; 95% confidence interval=0.1, 0.8) achieved successful outcomes. Limitations Only 2-week outcomes are reported. Conclusions Distinct pain sensitivity cluster groups for individuals with spine pain were identified, with the high pressure and thermal dynamic sensitivity group showing worse clinical outcome for pain intensity. Future studies should aim to confirm these findings. PMID:24764070

  2. Deriving temperature, mass, and age of evolved stars from high-resolution spectra. Application to field stars and the open cluster IC 4651

    NASA Astrophysics Data System (ADS)

    Biazzo, K.; Pasquini, L.; Girardi, L.; Frasca, A.; da Silva, L.; Setiawan, J.; Marilli, E.; Hatzes, A. P.; Catalano, S.

    2007-12-01

    Aims: We test our capability of deriving stellar physical parameters of giant stars by analysing a sample of field stars and the well studied open cluster IC 4651 with different spectroscopic methods. Methods: The use of a technique based on line-depth ratios (LDRs) allows us to determine with high precision the effective temperature of the stars and to compare the results with those obtained with a classical LTE abundance analysis. Results: (i) For the field stars we find that the temperatures derived by means of the LDR method are in excellent agreement with those found by the spectral synthesis. This result is extremely encouraging because it shows that spectra can be used to firmly derive population characteristics (e.g., mass and age) of the observed stars. (ii) For the IC 4651 stars we use the determined effective temperature to derive the following results. a) The reddening E(B-V) of the cluster is 0.12±0.02, largely independent of the color-temperature calibration used. b) The age of the cluster is 1.2±0.2 Gyr. c) The typical mass of the analysed giant stars is 2.0±0.2 M⊙. Moreover, we find a systematic difference of about 0.2 dex in log g between spectroscopic and evolutionary values. Conclusions: We conclude that, in spite of known limitations, a classical spectroscopic analysis of giant stars may indeed result in very reliable stellar parameters. We caution that the quality of the agreement, on the other hand, depends on the details of the adopted spectroscopic analysis. Based on observations collected at the ESO telescopes at the Paranal and La Silla Observatories, Chile.

  3. Assessment of genetic diversity and phylogenetic relationships of Korean native chicken breeds using microsatellite markers

    PubMed Central

    Seo, Joo Hee; Lee, Jun Heon; Kong, Hong Sik

    2017-01-01

    Objective This study was conducted to investigate the basic information on genetic structure and characteristics of Korean Native chickens (NC) and foreign breeds through the analysis of the pure chicken populations and commercial chicken lines of the Hanhyup Company which are popular in the NC market, using the 20 microsatellite markers. Methods In this study, the genetic diversity and phylogenetic relationships of 445 NC from five different breeds (NC, Leghorn [LH], Cornish [CS], Rhode Island Red [RIR], and Hanhyup [HH] commercial line) were investigated by performing genotyping using 20 microsatellite markers. Results The highest genetic distance was observed between RIR and LH (18.9%), whereas the lowest genetic distance was observed between HH and NC (2.7%). In the principal coordinates analysis (PCoA) illustrated by the first component, LH was clearly separated from the other groups. The correspondence analysis showed close relationship among individuals belonging to the NC, CS, and HH lines. From the STRUCTURE program, the presence of 5 clusters was detected and it was found that the proportion of membership in the different clusters was almost comparable among the breeds with the exception of one breed (HH), although it was highest in LH (0.987) and lowest in CS (0.578). For the cluster 1 it was high in HH (0.582) and in CS (0.368), while for the cluster 4 it was relatively higher in HH (0.392) than other breeds. Conclusion Our study showed useful genetic diversity and phylogenetic relationship data that can be utilized for NC breeding and development by the commercial chicken industry to meet consumer demands. PMID:28335091

  4. Progressive colonization and restricted gene flow shape island-dependent population structure in Galápagos marine iguanas (Amblyrhynchus cristatus)

    PubMed Central

    2009-01-01

    Background Marine iguanas (Amblyrhynchus cristatus) inhabit the coastlines of large and small islands throughout the Galápagos archipelago, providing a rich system to study the spatial and temporal factors influencing the phylogeographic distribution and population structure of a species. Here, we analyze the microevolution of marine iguanas using the complete mitochondrial control region (CR) as well as 13 microsatellite loci representing more than 1200 individuals from 13 islands. Results CR data show that marine iguanas occupy three general clades: one that is widely distributed across the northern archipelago, and likely spread from east to west by way of the South Equatorial current, a second that is found mostly on the older eastern and central islands, and a third that is limited to the younger northern and western islands. Generally, the CR haplotype distribution pattern supports the colonization of the archipelago from the older, eastern islands to the younger, western islands. However, there are also signatures of recurrent, historical gene flow between islands after population establishment. Bayesian cluster analysis of microsatellite genotypes indicates the existence of twenty distinct genetic clusters generally following a one-cluster-per-island pattern. However, two well-differentiated clusters were found on the easternmost island of San Cristóbal, while nine distinct and highly intermixed clusters were found on youngest, westernmost islands of Isabela and Fernandina. High mtDNA and microsatellite genetic diversity were observed for populations on Isabela and Fernandina that may be the result of a recent population expansion and founder events from multiple sources. 
Conclusions While a past genetic study based on pure FST analysis suggested that marine iguana populations display high levels of nuclear (but not mitochondrial) gene flow due to male-biased dispersal, the results of our sex-biased dispersal tests and the finding of strong genetic differentiation between islands do not support this view. Therefore, our study is a nice example of how recently developed analytical tools such as Bayesian clustering analysis and DNA sequence-based demographic analyses can overcome potential biases introduced by simply relying on FST estimates from markers with different inheritance patterns. PMID:20028547

  5. Molecular Epidemiology of Pulmonary Tuberculosis in Belgrade, Central Serbia

    PubMed Central

    Vuković, Dragana; Rüsch-Gerdes, Sabine; Savić, Branislava; Niemann, Stefan

    2003-01-01

    In order to gain precise data on the actual epidemiology of tuberculosis (TB) in Belgrade, central Serbia, we conducted the molecular epidemiological investigation described herein. IS6110 restriction fragment length polymorphism (RFLP) typing of 176 Mycobacterium tuberculosis isolates was performed. These strains were obtained from 48.4% of all patients diagnosed with culture-proven pulmonary TB from April through September 1998 and from May through October 1999. Clusters containing strains with identical RFLP IS6110 patterns were assumed to have arisen from recent transmission. Of the 176 cases, 55 (31.2%) were grouped into 23 clusters ranging in size from two to six patients. Nearly 80% of clustered patients were directly interviewed, and transmission between family-unrelated contacts was found to be predominant in the study population. Classical contact investigation identified only 2 (3.6%) of the 55 clustered patients. The clustering of TB patients was not associated with any demographic or clinical characteristic other than infection with multidrug-resistant (MDR) M. tuberculosis strains. Nearly 70% of MDR strains were clustered, which indicates active transmission of MDR TB in Belgrade. However, this was not observed by conventional epidemiologic surveillance. In conclusion, the first molecular epidemiologic analysis of TB in the region revealed frequent recent transmission of TB and pointed out significant shortcomings of the current concept for conventional contact tracing. The results presented also demonstrate that transmission of MDR TB in Belgrade is not optimally controlled, and they provide support for the development of improved control strategies, including application of molecular methods. PMID:12958271
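
    The clustering rule described above (isolates with identical IS6110 RFLP patterns form a cluster) can be sketched as a simple grouping step. This is a minimal illustration, not the authors' software: the function names, the representation of a pattern as a tuple of band positions, and the toy isolates are all assumptions made for the example.

```python
from collections import defaultdict


def cluster_by_pattern(fingerprints):
    """Group isolate IDs whose fingerprint patterns are identical.

    `fingerprints` maps an isolate ID to a hashable pattern (here a tuple
    of band positions, a hypothetical encoding). Only groups containing
    two or more isolates count as clusters.
    """
    groups = defaultdict(list)
    for isolate_id, pattern in fingerprints.items():
        groups[pattern].append(isolate_id)
    return [sorted(ids) for ids in groups.values() if len(ids) >= 2]


def clustered_fraction(fingerprints):
    """Fraction of all isolates that fall into some cluster."""
    clusters = cluster_by_pattern(fingerprints)
    clustered = sum(len(members) for members in clusters)
    return clustered / len(fingerprints)


# Hypothetical toy data: seven isolates, two shared patterns.
toy = {
    "A": (1, 4, 7),
    "B": (1, 4, 7),
    "C": (2, 5),
    "D": (3, 6, 9),
    "E": (3, 6, 9),
    "F": (3, 6, 9),
    "G": (8,),
}
clusters = cluster_by_pattern(toy)
fraction = clustered_fraction(toy)
```

    With the study's own counts, the same ratio is 55 clustered cases out of 176, which is the roughly 31% reported in the abstract.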

  6. Early Environment and Neurobehavioral Development Predict Adult Temperament Clusters

    PubMed Central

    Congdon, Eliza; Service, Susan; Wessman, Jaana; Seppänen, Jouni K.; Schönauer, Stefan; Miettunen, Jouko; Turunen, Hannu; Koiranen, Markku; Joukamaa, Matti; Järvelin, Marjo-Riitta; Veijola, Juha; Mannila, Heikki; Paunio, Tiina; Freimer, Nelson B.

    2012-01-01

    Background Investigation of the environmental influences on human behavioral phenotypes is important for our understanding of the causation of psychiatric disorders. However, there are complexities associated with the assessment of environmental influences on behavior. Methods/Principal Findings We conducted a series of analyses using a prospective, longitudinal study of a nationally representative birth cohort from Finland (the Northern Finland 1966 Birth Cohort). Participants included a total of 3,761 male and female cohort members who were living in Finland at the age of 16 years and who had complete temperament scores. Our initial analyses (Wessman et al., in press) provide evidence in support of four stable and robust temperament clusters. Using these temperament clusters, as well as independent temperament dimensions for comparison, we conducted a data-driven analysis to assess the influence of a broad set of life course measures, assessed pre-natally, in infancy, and during adolescence, on adult temperament. Results Measures of early environment, neurobehavioral development, and adolescent behavior significantly predict adult temperament, classified by both cluster membership and temperament dimensions. Specifically, our results suggest that a relatively consistent set of life course measures is associated with adult temperament profiles, including maternal education, characteristics of the family’s location and residence, adolescent academic performance, and adolescent smoking. Conclusions Our finding that a consistent set of life course measures predicts temperament clusters indicates that these clusters represent distinct developmental temperament trajectories and that information about a subset of life course measures has implications for adult health outcomes. PMID:22815688

  7. 2013 multistate outbreaks of Cyclospora cayetanensis infections associated with fresh produce: focus on the Texas investigations.

    PubMed

    Abanyie, F; Harvey, R R; Harris, J R; Wiegand, R E; Gaul, L; Desvignes-Kendrick, M; Irvin, K; Williams, I; Hall, R L; Herwaldt, B; Gray, E B; Qvarnstrom, Y; Wise, M E; Cantu, V; Cantey, P T; Bosch, S; DA Silva, A J; Fields, A; Bishop, H; Wellman, A; Beal, J; Wilson, N; Fiore, A E; Tauxe, R; Lance, S; Slutsker, L; Parise, M

    2015-12-01

    The 2013 multistate outbreaks contributed to the largest annual number of reported US cases of cyclosporiasis since 1997. In this paper we focus on investigations in Texas. We defined an outbreak-associated case as laboratory-confirmed cyclosporiasis in a person with illness onset between 1 June and 31 August 2013, with no history of international travel in the previous 14 days. Epidemiological, environmental, and traceback investigations were conducted. Of the 631 cases reported in the multistate outbreaks, Texas reported the greatest number of cases, 270 (43%). More than 70 clusters were identified in Texas, four of which were further investigated. One restaurant-associated cluster of 25 case-patients was selected for a case-control study. Consumption of cilantro was most strongly associated with illness on meal date-matched analysis (matched odds ratio 19·8, 95% confidence interval 4·0-∞). All case-patients in the other three clusters investigated also ate cilantro. Traceback investigations converged on three suppliers in Puebla, Mexico. Cilantro was the vehicle of infection in the four clusters investigated; the temporal association of these clusters with the large overall increase in cyclosporiasis cases in Texas suggests cilantro was the vehicle of infection for many other cases. However, the paucity of epidemiological and traceback information does not allow for a conclusive determination; moreover, molecular epidemiological tools for cyclosporiasis that could provide more definitive linkage between case clusters are needed.
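
    To show how a matched odds ratio can have an unbounded upper confidence limit, the following sketch uses the simplest 1:1 matched-pair estimator (ratio of discordant pairs). This is an assumption made for illustration: the abstract does not state the matching ratio or the estimation method, and the pair counts below are invented, not the study's data.

```python
import math


def matched_odds_ratio(pairs):
    """Matched-pair odds ratio from (case_exposed, control_exposed) pairs.

    Only discordant pairs contribute: b counts pairs where the case was
    exposed and the control was not, c counts the reverse. The estimate
    is b / c, which is infinite when no pair has an exposed control and
    an unexposed case.
    """
    b = sum(1 for case, control in pairs if case and not control)
    c = sum(1 for case, control in pairs if control and not case)
    if c == 0:
        return math.inf if b > 0 else math.nan
    return b / c


# Hypothetical pairs: 6 + 2 discordant, 4 concordant.
pairs = (
    [(True, False)] * 6
    + [(False, True)] * 2
    + [(True, True)] * 3
    + [(False, False)] * 1
)
estimate = matched_odds_ratio(pairs)
```

    When almost every case ate the suspect item, the denominator count is at or near zero, so the data cannot rule out arbitrarily large odds ratios.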

  8. Epigenetic transgenerational inheritance of somatic transcriptomes and epigenetic control regions

    PubMed Central

    2012-01-01

    Background Environmentally induced epigenetic transgenerational inheritance of adult onset disease involves a variety of phenotypic changes, suggesting a general alteration in genome activity. Results Investigation of different tissue transcriptomes in male and female F3 generation vinclozolin versus control lineage rats demonstrated all tissues examined had transgenerational transcriptomes. The microarrays from 11 different tissues were compared with a gene bionetwork analysis. Although each tissue transgenerational transcriptome was unique, common cellular pathways and processes were identified between the tissues. A cluster analysis identified gene modules with coordinated gene expression and each had unique gene networks regulating tissue-specific gene expression and function. A large number of statistically significant over-represented clusters of genes were identified in the genome for both males and females. These gene clusters ranged from 2-5 megabases in size, and a number of them corresponded to the epimutations previously identified in sperm that transmit the epigenetic transgenerational inheritance of disease phenotypes. Conclusions Combined observations demonstrate that all tissues derived from the epigenetically altered germ line develop transgenerational transcriptomes unique to the tissue, but common epigenetic control regions in the genome may coordinately regulate these tissue-specific transcriptomes. This systems biology approach provides insight into the molecular mechanisms involved in the epigenetic transgenerational inheritance of a variety of adult onset disease phenotypes. PMID:23034163

  9. Active microbial soil communities in different agricultural managements

    NASA Astrophysics Data System (ADS)

    Landi, S.; Pastorelli, R.

    2009-04-01

    We studied the composition of active eubacterial microflora by RNA extraction from soil (bulk and rhizosphere) under different environmental impact managements, in a hilly basin in Gallura (Sardinia). We contrasted grassy vineyard, in which the soil had been in continuous contact with plant roots for a long period of time, with traditional tilled vineyard. Moreover, we compared permanent grassland, in which plants had been present for some years, with temporary grassland, in which varying plants had been present only during the respective growing seasons. Molecular analysis of the total population was carried out by Denaturing Gradient Gel Electrophoresis (DGGE) separation of amplified cDNA fragments obtained from 16S rRNA. In vineyards, UPGMA (Unweighted Pair Group Method with Arithmetic Mean) analysis produced separate clusters depending on soil management. In spring both clusters showed similarity over 70%, while in autumn the similarity increased to 84% and 90% for grassy and conventional tilled vineyard respectively. Permanent and temporary grassland joined in a single cluster in spring, while in autumn a partial separation was evident. The grassy vineyard, permanent grassland and temporary grassland showed higher richness and Shannon-Wiener diversity index values than the vineyard with conventional tillage, although the differences were not significant. In conclusion, the expected effect of the rhizosphere was visible: the grass cover positively influenced the diversity of the active microbial population.
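
    The diversity index named in this abstract has a short closed form, H = -sum(p_i * ln p_i), where p_i is the relative abundance of band i. The sketch below assumes that band intensities from a DGGE lane are used as abundances and that the natural logarithm is the base; neither choice is stated in the abstract, and the numbers are invented.

```python
import math


def shannon_index(abundances):
    """Shannon diversity H = -sum(p_i * ln(p_i)) over non-zero abundances."""
    total = sum(abundances)
    if total <= 0:
        raise ValueError("abundances must sum to a positive value")
    h = 0.0
    for a in abundances:
        if a > 0:
            p = a / total
            h -= p * math.log(p)
    return h


# Hypothetical band intensities for two lanes.
even_lane = [5, 5, 5, 5]      # four equally intense bands
skewed_lane = [17, 1, 1, 1]   # one dominant band
h_even = shannon_index(even_lane)
h_skewed = shannon_index(skewed_lane)
```

    A lane with evenly distributed bands reaches the maximum ln(number of bands), while a lane dominated by one band scores lower even though its richness (band count) is the same.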

  10. The interest of gait markers in the identification of subgroups among fibromyalgia patients

    PubMed Central

    2011-01-01

    Background Fibromyalgia (FM) is a heterogeneous syndrome and its classification into subgroups calls for broad-based discussion. FM subgrouping, which aims to adapt treatment according to different subgroups, relies in part on psychological and cognitive dysfunctions. Since motor control of gait is closely related to cognitive function, we hypothesized that gait markers could be of interest in the identification of FM patients' subgroups. This controlled study aimed at characterizing gait disorders in FM, and subgrouping FM patients according to gait markers such as stride frequency (SF), stride regularity (SR), and cranio-caudal power (CCP) which measures kinesia. Methods A multicentre, observational open trial enrolled patients with primary FM (44.1 ± 8.1 y), and matched controls (44.1 ± 7.3 y). Outcome measurements and gait analyses were available for 52 pairs. A 3-step statistical analysis was carried out. A preliminary single blind analysis using k-means clustering was performed as an initial validation of gait markers. Then, in order to quantify FM patients according to psychometric and gait variables, an open descriptive analysis comparing patients and controls was made, and correlations between gait variables and main outcomes were calculated. Finally, using cluster analysis, we described subgroups for each gait variable and looked for significant differences in self-reported assessments. Results SF was the most discriminating gait variable (73% of patients and controls). SF, SR, and CCP were different between patients and controls. There was a non-significant association between SF, FIQ and physical components from Short-Form 36 (p = 0.06). SR was correlated to FIQ (p = 0.01) and catastrophizing (p = 0.05) while CCP was correlated to pain (p = 0.01). The SF cluster identified 3 subgroups with a particular one characterized by normal SF, low pain, high activity and hyperkinesia. 
The SR cluster identified 2 distinct subgroups: the one with a reduced SR was distinguished by high FIQ, poor coping and altered affective status. Conclusion Gait analysis may provide additional information in the identification of subgroups among fibromyalgia patients. Gait analysis provided relevant information about physical and cognitive status, and pain behavior. Further studies are needed to better understand gait analysis implications in FM. PMID:22078002
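
    Clustering on a single gait variable, as described for stride frequency, can be illustrated with a one-dimensional k-means (Lloyd's algorithm). This is a minimal sketch under stated assumptions: the deterministic initialisation from evenly spaced order statistics is a choice made here for reproducibility, and the stride-frequency values are invented, not taken from the study.

```python
def kmeans_1d(values, k, max_iter=100):
    """Lloyd's k-means on scalar data.

    Returns (centroids, labels), where labels[i] is the index of the
    centroid assigned to values[i]. Centroids start at evenly spaced
    order statistics so the result is deterministic.
    """
    if not 1 <= k <= len(values):
        raise ValueError("k must be between 1 and the number of values")
    ordered = sorted(values)
    if k == 1:
        centroids = [sum(ordered) / len(ordered)]
    else:
        last = len(ordered) - 1
        centroids = [ordered[i * last // (k - 1)] for i in range(k)]
    labels = []
    for _ in range(max_iter):
        # Assignment step: nearest centroid for every value.
        labels = [
            min(range(k), key=lambda j: abs(v - centroids[j])) for v in values
        ]
        # Update step: mean of each cluster (keep old centroid if empty).
        updated = []
        for j in range(k):
            members = [v for v, lab in zip(values, labels) if lab == j]
            updated.append(sum(members) / len(members) if members else centroids[j])
        if updated == centroids:
            break
        centroids = updated
    return centroids, labels


# Hypothetical stride frequencies in Hz for eight participants.
stride_frequency = [0.80, 0.82, 0.85, 0.95, 0.97, 1.00, 1.10, 1.12]
centroids, labels = kmeans_1d(stride_frequency, k=3)
```

    The resulting labels split the toy values into low, intermediate and high stride-frequency groups, which mirrors the three-subgroup structure reported for SF.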

  11. EXPLORING FUNCTIONAL CONNECTIVITY IN FMRI VIA CLUSTERING.

    PubMed

    Venkataraman, Archana; Van Dijk, Koene R A; Buckner, Randy L; Golland, Polina

    2009-04-01

    In this paper we investigate the use of data driven clustering methods for functional connectivity analysis in fMRI. In particular, we consider the K-Means and Spectral Clustering algorithms as alternatives to the commonly used Seed-Based Analysis. To enable clustering of the entire brain volume, we use the Nyström Method to approximate the necessary spectral decompositions. We apply K-Means, Spectral Clustering and Seed-Based Analysis to resting-state fMRI data collected from 45 healthy young adults. Without placing any a priori constraints, both clustering methods yield partitions that are associated with brain systems previously identified via Seed-Based Analysis. Our empirical results suggest that clustering provides a valuable tool for functional connectivity analysis.
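
    For readers unfamiliar with the Seed-Based Analysis that the clustering methods are compared against, the core computation is a correlation between one seed time course and every other voxel's time course. The sketch below is a plain-Python illustration with invented signals and hypothetical voxel names; it does not reproduce the paper's preprocessing or thresholds.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length series."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("series must have equal length of at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


def seed_correlation_map(seed, voxels):
    """Correlate the seed time course with each named voxel time course."""
    return {name: pearson(seed, series) for name, series in voxels.items()}


# Hypothetical time courses (five time points each).
seed = [1, 2, 3, 4, 5]
voxels = {
    "v1": [2, 4, 6, 8, 10],   # rises with the seed
    "v2": [5, 4, 3, 2, 1],    # falls as the seed rises
    "v3": [1, 3, 2, 3, 1],    # unrelated to the seed
}
corr_map = seed_correlation_map(seed, voxels)
```

    Seed-based analysis requires choosing the seed in advance, whereas K-Means or Spectral Clustering partition all voxels without that prior choice, which is the contrast the abstract draws.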

  12. Risk, Resiliency and Coping in National Guard Families

    DTIC Science & Technology

    2015-10-01

    future research and larger quantitative studies of injury trajectory. CONCLUSION This study increases our understanding of risk, resilience and coping... Solomon et al. 2008). This finding is supported by other research that found emotional numbing behaviors associated with the avoidance cluster to be...and effectively, all four symptomatic clusters of PTSD within a relational context. Research suggests that each cluster impacts veterans’ intimate

  13. Three-dimensional visualization of cultural clusters in the 1878 yellow fever epidemic of New Orleans

    PubMed Central

    Curtis, Andrew J

    2008-01-01

    Background An epidemic may exhibit different spatial patterns with a change in geographic scale, with each scale having different conduits and impediments to disease spread. Mapping disease at each of these scales often reveals different cluster patterns. This paper will consider this change of geographic scale in an analysis of yellow fever deaths for New Orleans in 1878. Global clustering for the whole city will be followed by a focus on the French Quarter, then clusters of that area, and finally street-level patterns of a single cluster. The three-dimensional visualization capabilities of a GIS will be used as part of a cluster creation process that incorporates physical buildings in calculating mortality-to-mortality distance. Including nativity of the deceased will also capture cultural connection. Results Twenty-two yellow fever clusters were identified for the French Quarter. These generally mirror the results of other global cluster and density surfaces created for the entire epidemic in New Orleans. However, the addition of building-distance and a disease-specific time frame between deaths reveals that disease spread contains a cultural component. Same nativity mortality clusters emerge in a similar time frame irrespective of proximity. Italian nativity mortalities were far more densely grouped than any of the other cohorts. A final examination of mortalities for one of the nativity clusters reveals that further sub-division is present, and that this pattern would only be revealed at this scale (street level) of investigation. Conclusion Disease spread in an epidemic is complex, resulting from a combination of geographic distance, geographic distance with specific connection to the built environment, disease-specific time frame between deaths, impediments such as herd immunity, and social or cultural connection. 
This research has shown that cultural connection may be more important than simple proximity, which in turn might mean traditional quarantine measures should be re-evaluated. PMID:18721469

  14. A Proposal to Investigate Outstanding Problems in Astronomy

    NASA Technical Reports Server (NTRS)

    Ford, Holland

    2003-01-01

    During the past year the ACS science team has concentrated on analyzing ACS observations, writing papers, and disseminating our results to the astronomy community at conferences and workshops around the world. We also have put considerable effort in getting our results to the public via public lectures and through press releases. Taking a very broad view of our program, we are investigating the evolution of galaxies and clusters of galaxies from their birth, approximately one billion years after the beginning of the Universe, to the present. We have found and characterized a population of galaxies that are no more than 1.4 billion years old. These may well be the Universe's first generation of infant galaxies. Looking at the Universe 500,000 years later, we see what appears to be a cluster of galaxies just beginning to form (a proto-cluster) around a luminous radio galaxy. Moving forward in time and closer to the present, we are studying clusters of galaxies that are less than half the age of the Universe. Our observations and analysis lead us to the important conclusion that the elliptical galaxies in these clusters must have had their last significant star formation some three billion years earlier, which is about the time when the proto-cluster was forming. Coming still closer to home, we are observing nearby massive clusters of galaxies that are approximately 12 billion years old. The gravity from these large aggregates of dark and luminous matter is so strong it warps space-time itself, and makes the cluster act as a cosmic telescope that magnifies the distant galaxies behind the cluster. We used the magnified (or lensed) galaxies to map the distribution of the dominant matter within the clusters, which is the so-called dark matter (the matter is invisible, and its nature is unknown). We also are using these cosmic telescopes to study the distant lensed galaxies that would otherwise be too small and too faint to be seen even by Hubble and the ACS.

  15. Spatial cluster detection using dynamic programming

    PubMed Central

    2012-01-01

    Background The task of spatial cluster detection involves finding spatial regions where some property deviates from the norm or the expected value. In a probabilistic setting this task can be expressed as finding a region where some event is significantly more likely than usual. Spatial cluster detection is of interest in fields such as biosurveillance, mining of astronomical data, military surveillance, and analysis of fMRI images. In almost all such applications we are interested both in the question of whether a cluster exists in the data, and if it exists, we are interested in finding the most accurate characterization of the cluster. Methods We present a general dynamic programming algorithm for grid-based spatial cluster detection. The algorithm can be used for both Bayesian maximum a-posteriori (MAP) estimation of the most likely spatial distribution of clusters and Bayesian model averaging over a large space of spatial cluster distributions to compute the posterior probability of an unusual spatial clustering. The algorithm is explained and evaluated in the context of a biosurveillance application, specifically the detection and identification of Influenza outbreaks based on emergency department visits. A relatively simple underlying model is constructed for the purpose of evaluating the algorithm, and the algorithm is evaluated using the model and semi-synthetic test data. Results When compared to baseline methods, tests indicate that the new algorithm can improve MAP estimates under certain conditions: the greedy algorithm we compared our method to was found to be more sensitive to smaller outbreaks, while as the size of the outbreaks increases, in terms of area affected and proportion of individuals affected, our method overtakes the greedy algorithm in spatial precision and recall. The new algorithm performs on-par with baseline methods in the task of Bayesian model averaging. 
Conclusions We conclude that the dynamic programming algorithm performs on-par with other available methods for spatial cluster detection and point to its low computational cost and extendability as advantages in favor of further research and use of the algorithm. PMID:22443103

  16. A population of gamma-ray emitting globular clusters seen with the Fermi Large Area Telescope

    DOE PAGES

    Abdo, A. A.

    2010-11-24

    Context. Globular clusters with their large populations of millisecond pulsars (MSPs) are believed to be potential emitters of high-energy gamma-ray emission. The observation of this emission provides a powerful tool to assess the millisecond pulsar population of a cluster, is essential for understanding the importance of binary systems for the evolution of globular clusters, and provides complementary insights into magnetospheric emission processes. Aims. Our goal is to constrain the millisecond pulsar populations in globular clusters from analysis of gamma-ray observations. Methods. We use 546 days of continuous sky-survey observations obtained with the Large Area Telescope aboard the Fermi Gamma-ray Space Telescope to study the gamma-ray emission towards 13 globular clusters. Results. Steady point-like high-energy gamma-ray emission has been significantly detected towards 8 globular clusters. Five of them (47 Tucanae, Omega Cen, NGC 6388, Terzan 5, and M 28) show hard spectral power indices (0.7 < Γ < 1.4) and clear evidence for an exponential cut-off in the range 1.0 - 2.6 GeV, which is the characteristic signature of magnetospheric emission from MSPs. Three of them (M 62, NGC 6440 and NGC 6652) also show hard spectral indices (1.0 < Γ < 1.7), however the presence of an exponential cut-off cannot be unambiguously established. Three of them (Omega Cen, NGC 6388, NGC 6652) have no known radio or X-ray MSPs yet still exhibit MSP spectral properties. From the observed gamma-ray luminosities, we estimate the total number of MSPs that is expected to be present in these globular clusters. We show that our estimates of the MSP population correlate with the stellar encounter rate and we estimate 2600 - 4700 MSPs in Galactic globular clusters, commensurate with previous estimates. Conclusions. The observation of high-energy gamma-ray emission from globular clusters thus provides a reliable independent method to assess their millisecond pulsar populations.

  17. Analyzing spatial clustering and the spatiotemporal nature and trends of HIV/AIDS prevalence using GIS: the case of Malawi, 1994-2010

    PubMed Central

    2014-01-01

    Background Although local spatiotemporal analysis can improve understanding of geographic variation of the HIV epidemic, its drivers, and the search for targeted interventions, it is limited in sub-Saharan Africa. Despite recent declines, Malawi’s estimated 10.0% HIV prevalence (2011) remained among the highest globally. Using data on pregnant women in Malawi, this study 1) examines spatiotemporal trends in HIV prevalence 1994-2010, and 2) for 2010, identifies and maps the spatial variation/clustering of factors associated with HIV prevalence at district level. Methods Inverse distance weighting was used within ArcGIS Geographic Information Systems (GIS) software to generate continuous surfaces of HIV prevalence from point data (1994, 1996, 1999, 2001, 2003, 2005, 2007, and 2010) obtained from surveillance antenatal clinics. From the surfaces prevalence estimates were extracted at district level and the results mapped nationally. Spatial dependency (autocorrelation) and clustering of HIV prevalence were also analyzed. Correlation and multiple regression analyses were used to identify factors associated with HIV prevalence for 2010 and their spatial variation/clustering mapped and compared to HIV clustering. Results Analysis revealed wide spatial variation in HIV prevalence at regional, urban/rural, district and sub-district levels. However, prevalence was spatially leveling out within and across ‘sub-epidemics’ while declining significantly after 1999. Prevalence exhibited statistically significant spatial dependence nationally following initial (1995-1999) localized, patchy low/high patterns as the epidemic spread rapidly. Locally, HIV “hotspots” clustered among eleven southern districts/cities while a “coldspot” captured configurations of six central region districts. 
Preliminary multiple regression of 2010 HIV prevalence produced a model with four significant explanatory factors (adjusted R² = 0.688): mean distance to main roads, mean travel time to nearest transport, percentage that had ever taken an HIV test, and percentage attaining a senior primary education. Spatial clustering linked some factors to particular subsets of high HIV-prevalence districts. Conclusions Spatial analysis enhanced understanding of local spatiotemporal variation in HIV prevalence, possible underlying factors, and potential for differentiated spatial targeting of interventions. Findings suggest that intervention strategies should also emphasize improved access to health/HIV services, basic education, and syphilis management, particularly in rural hotspot districts, as further research is done on drivers at finer scale. PMID:24886573
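
    The inverse distance weighting step used to turn clinic point data into a continuous surface can be sketched in a few lines. This is an illustration of the general technique, not of the ArcGIS tool's options: the power parameter of 2, the planar coordinates and the prevalence values are assumptions, since the abstract does not report them.

```python
import math


def idw(samples, x, y, power=2.0):
    """Inverse-distance-weighted estimate at (x, y).

    `samples` is a list of (xi, yi, value) tuples. Each sample gets the
    weight 1 / distance**power; a query that coincides with a sample
    returns that sample's value directly.
    """
    numerator = 0.0
    denominator = 0.0
    for xi, yi, value in samples:
        distance = math.hypot(x - xi, y - yi)
        if distance == 0.0:
            return value
        weight = 1.0 / distance ** power
        numerator += weight * value
        denominator += weight
    if denominator == 0.0:
        raise ValueError("at least one sample is required")
    return numerator / denominator


# Hypothetical clinics: (x, y, HIV prevalence in percent).
clinics = [(0.0, 0.0, 10.0), (2.0, 0.0, 20.0)]
midpoint_estimate = idw(clinics, 1.0, 0.0)
at_clinic_estimate = idw(clinics, 0.0, 0.0)
```

    Evaluating such a function on a regular grid yields the continuous surface from which district-level estimates are then extracted; a larger power makes each estimate depend more strongly on the nearest clinic.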

  18. Comparison of Genetic Diversity between Chinese and American Soybean (Glycine max (L.)) Accessions Revealed by High-Density SNPs

    PubMed Central

    Liu, Zhangxiong; Li, Huihui; Wen, Zixiang; Fan, Xuhong; Li, Yinghui; Guan, Rongxia; Guo, Yong; Wang, Shuming; Wang, Dechun; Qiu, Lijuan

    2017-01-01

    Soybean is one of the most important economic crops for both China and the United States (US). The exchange of germplasm between these two countries has long been active. In order to investigate genetic relationships between Chinese and US soybean germplasm, 277 Chinese soybean accessions and 300 US soybean accessions from geographically diverse regions were analyzed using 5,361 SNP markers. The genetic diversity and the polymorphism information content (PIC) of the Chinese accessions was higher than that of the US accessions. Population structure analysis, principal component analysis, and cluster analysis all showed that the genetic basis of Chinese soybeans is distinct from that of the USA. The groupings observed in clustering analysis reflected the geographical origins of the accessions; this conclusion was validated with both genetic distance analysis and relative kinship analysis. FST-based and EigenGWAS statistical analysis revealed high genetic variation between the two subpopulations. Analysis of the 10 loci with the strongest selection signals showed that many loci were located in chromosome regions that have previously been identified as quantitative trait loci (QTL) associated with environmental-adaptation-related and yield-related traits. The pattern of diversity among the American and Chinese accessions should help breeders to select appropriate parental accessions to enhance the performance of future soybean cultivars. PMID:29250088
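
    The polymorphism information content (PIC) compared between the two germplasm collections is conventionally computed per marker from allele frequencies as PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2. The sketch below applies that standard formula to invented allele frequencies; the abstract does not say which software or averaging scheme was used, so this shows only the per-locus calculation.

```python
from itertools import combinations


def pic(freqs):
    """Polymorphism information content for one locus.

    `freqs` holds the allele frequencies at the locus and must sum to 1.
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term


# Hypothetical biallelic SNPs.
balanced_snp = pic([0.5, 0.5])    # both alleles equally common
rare_allele_snp = pic([0.9, 0.1])  # one rare allele
```

    For a biallelic SNP the value peaks at 0.375 when both alleles are equally frequent, so a higher mean PIC in one collection indicates more evenly distributed alleles across its markers.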

  19. ClusterViz: A Cytoscape APP for Cluster Analysis of Biological Network.

    PubMed

    Wang, Jianxin; Zhong, Jiancheng; Chen, Gang; Li, Min; Wu, Fang-xiang; Pan, Yi

    2015-01-01

    Cluster analysis of biological networks is one of the most important approaches for identifying functional modules and predicting protein functions. Furthermore, visualization of clustering results is crucial to uncover the structure of biological networks. In this paper, ClusterViz, an APP of Cytoscape 3 for cluster analysis and visualization, has been developed. In order to reduce complexity and enable extendibility for ClusterViz, we designed the architecture of ClusterViz based on the framework of Open Services Gateway Initiative. According to the architecture, the implementation of ClusterViz is partitioned into three modules: the interface of ClusterViz, clustering algorithms, and visualization and export. ClusterViz facilitates the comparison of the results of different algorithms for further related analysis. Three commonly used clustering algorithms, FAG-EC, EAGLE and MCODE, are included in the current version. Because the clustering algorithms module adopts an abstract algorithm interface, more clustering algorithms can be included in the future. To illustrate usability of ClusterViz, we provided three examples with detailed steps from the important scientific articles, which show that our tool has helped several research teams do their research work on the mechanism of the biological networks.

  20. Parasites as valuable stock markers for fisheries in Australasia, East Asia and the Pacific Islands.

    PubMed

    Lester, R J G; Moore, B R

    2015-01-01

    Over 30 studies in Australasia, East Asia and the Pacific Islands region have collected and analysed parasite data to determine the ranges of individual fish, many leading to conclusions about stock delineation. Parasites used as biological tags have included both those known to have long residence times in the fish and those thought to be relatively transient. In many cases the parasitological conclusions have been supported by other methods especially analysis of the chemical constituents of otoliths, and to a lesser extent, genetic data. In analysing parasite data, authors have applied multiple different statistical methodologies, including summary statistics, and univariate and multivariate approaches. Recently, a growing number of researchers have found non-parametric methods, such as analysis of similarities and cluster analysis, to be valuable. Future studies into the residence times, life cycles and geographical distributions of parasites together with more robust analytical methods will yield much important information to clarify stock structures in the area.

  1. Quantitative application of the primary progressive aphasia consensus criteria

    PubMed Central

    Wicklund, Meredith R.; Duffy, Joseph R.; Strand, Edythe A.; Machulda, Mary M.; Whitwell, Jennifer L.

    2014-01-01

    Objective: To determine how well the consensus criteria could classify subjects with primary progressive aphasia (PPA) using a quantitative speech and language battery that matches the test descriptions provided by the consensus criteria. Methods: A total of 105 participants with a neurodegenerative speech and language disorder were prospectively recruited and underwent neurologic, neuropsychological, and speech and language testing and MRI in this case-control study. Twenty-one participants with apraxia of speech without aphasia served as controls. Select tests from the speech and language battery were chosen for application of consensus criteria and cutoffs were employed to determine syndromic classification. Hierarchical cluster analysis was used to examine participants who could not be classified. Results: Of the 84 participants, 58 (69%) could be classified as agrammatic (27%), semantic (7%), or logopenic (35%) variants of PPA. The remaining 31% of participants could not be classified. Of the unclassifiable participants, 2 clusters were identified. The speech and language profile of the first cluster resembled mild logopenic PPA and the second cluster semantic PPA. Gray matter patterns of loss of these 2 clusters of unclassified participants also resembled mild logopenic and semantic variants. Conclusions: Quantitative application of consensus PPA criteria yields the 3 syndromic variants but leaves a large proportion unclassified. Therefore, the current consensus criteria need to be modified in order to improve sensitivity. PMID:24598709

  2. Identification of homogeneous genetic architecture of multiple genetically correlated traits by block clustering of genome-wide associations.

    PubMed

    Gupta, Mayetri; Cheung, Ching-Lung; Hsu, Yi-Hsiang; Demissie, Serkalem; Cupples, L Adrienne; Kiel, Douglas P; Karasik, David

    2011-06-01

    Genome-wide association studies (GWAS) using high-density genotyping platforms offer an unbiased strategy to identify new candidate genes for osteoporosis. It is imperative to be able to clearly distinguish signal from noise by focusing on the best phenotype in a genetic study. We performed GWAS of multiple phenotypes associated with fractures [bone mineral density (BMD), bone quantitative ultrasound (QUS), bone geometry, and muscle mass] with approximately 433,000 single-nucleotide polymorphisms (SNPs) and created a database of resulting associations. We performed analysis of GWAS data from 23 phenotypes by a novel modification of a block clustering algorithm followed by gene-set enrichment analysis. A data matrix of standardized regression coefficients was partitioned along both axes--SNPs and phenotypes. Each partition represents a distinct cluster of SNPs that have similar effects over a particular set of phenotypes. Application of this method to our data shows several SNP-phenotype connections. We found a strong cluster of association coefficients of high magnitude for 10 traits (BMD at several skeletal sites, ultrasound measures, cross-sectional bone area, and section modulus of femoral neck and shaft). These clustered traits were highly genetically correlated. Gene-set enrichment analyses indicated the augmentation of genes that cluster with the 10 osteoporosis-related traits in pathways such as aldosterone signaling in epithelial cells, role of osteoblasts, osteoclasts, and chondrocytes in rheumatoid arthritis, and Parkinson signaling. In addition to several known candidate genes, we also identified PRKCH and SCNN1B as potential candidate genes for multiple bone traits. In conclusion, our mining of GWAS results revealed the similarity of association results between bone strength phenotypes that may be attributed to pleiotropic effects of genes. 
This knowledge may prove helpful in identifying novel genes and pathways that underlie several correlated phenotypes, as well as in deciphering genetic and phenotypic modularity underlying osteoporosis risk. Copyright © 2011 American Society for Bone and Mineral Research.

  3. [A study on genotype of 271 mycobacterium tuberculosis isolates in 6 prefectures in Yunnan Province].

    PubMed

    Chen, L Y; Yang, X; Ru, H H; Yang, H J; Yan, S Q; Ma, L; Chen, J O; Yang, R; Xu, L

    2018-01-06

    Objective: To understand the genotype characteristics of Mycobacterium tuberculosis (MTB) isolates in Yunnan Province, and to provide molecular epidemiological evidence for the prevention and control of tuberculosis in Yunnan Province. Methods: MTB isolates were collected from 6 prefectures of Yunnan Province in 2014, and their genotypes were obtained using spoligotyping and multiple locus variable number of tandem repeats analysis (MLVA). The spoligotyping results were entered into the SITVITWEB database to obtain the Spoligotyping International Type (SIT) patterns and the sublineages of the MTB isolates. The genotyping patterns were clustered with BioNumerics (version 5.0). Results: A total of 271 MTB isolates from patients were collected from six prefectures in Yunnan Province. Of these patients, 196 (72.3%) were male. The mean age of the patients was (41.9±15.1) years. The largest share of MTB isolates came from Puer, 94 isolates in total (34.69%). Spoligotyping analysis revealed that 151 (55.72%) MTB isolates belonged to the Beijing genotype, while the other 120 (44.28%) were non-Beijing genotypes; the 40 genotypes consisted of 24 unique genotypes and 16 clusters. The 271 isolates were differentiated into 30 clusters (2 to 17 isolates per cluster) and 177 unique genotypes, showing a clustering rate of 23.62%. Beijing genotype strains showed a higher clustering rate than non-Beijing genotype strains (29.14% vs 16.67%). The HGI of 12-locus VNTR in total MTB strains, Beijing genotype strains and non-Beijing genotype strains was 0.993, 0.982 and 0.995, respectively. Conclusion: The Beijing genotype was the predominant genotype in Yunnan Province, and the Mycobacterium tuberculosis isolates showed high genetic diversity. 
The genotyping data reflect the potential recent ongoing transmission in some area, which highlights the urgent need for early diagnosis and treatment of the infectious TB cases, to cut off the transmission and avoid a large TB outbreak.
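The abstract above reports a clustering rate without stating how it was computed. As a minimal sketch, assuming the common "n-1" definition (clustered isolates minus number of clusters, divided by all isolates), the reported counts reproduce the 23.62% figure. The function name and the choice of definition are assumptions for illustration, not taken from the record.

```python
def clustering_rate(total_isolates: int, unique_genotypes: int, clusters: int) -> float:
    """Assumed "n-1" clustering rate: (clustered isolates - clusters) / total isolates."""
    # Isolates that share a genotype with at least one other isolate.
    clustered = total_isolates - unique_genotypes
    return (clustered - clusters) / total_isolates


# Counts quoted in the abstract: 271 isolates, 177 unique genotypes, 30 clusters.
rate = clustering_rate(total_isolates=271, unique_genotypes=177, clusters=30)
print(f"{rate:.2%}")  # prints 23.62%
```

Under this definition the numerator is 64 isolates (94 clustered isolates minus 30 clusters), which is also consistent with the reported subgroup rates (44 of 151 Beijing and 20 of 120 non-Beijing isolates).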

  4. Computer-aided detection of clustered microcalcifications in multiscale bilateral filtering regularized reconstructed digital breast tomosynthesis volume

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Samala, Ravi K., E-mail: rsamala@umich.edu; Chan, Heang-Ping; Lu, Yao

    Purpose: Develop a computer-aided detection (CADe) system for clustered microcalcifications in digital breast tomosynthesis (DBT) volume enhanced with multiscale bilateral filtering (MSBF) regularization. Methods: With Institutional Review Board approval and written informed consent, two-view DBT of 154 breasts, of which 116 had biopsy-proven microcalcification (MC) clusters and 38 were free of MCs, was imaged with a General Electric GEN2 prototype DBT system. The DBT volumes were reconstructed with MSBF-regularized simultaneous algebraic reconstruction technique (SART) that was designed to enhance MCs and reduce background noise while preserving the quality of other tissue structures. The contrast-to-noise ratio (CNR) of MCs was further improved with enhancement-modulated calcification response (EMCR) preprocessing, which combined multiscale Hessian response to enhance MCs by shape and bandpass filtering to remove the low-frequency structured background. MC candidates were then located in the EMCR volume using iterative thresholding and segmented by adaptive region growing. Two sets of potential MC objects, cluster centroid objects and MC seed objects, were generated and the CNR of each object was calculated. The number of candidates in each set was controlled based on the breast volume. Dynamic clustering around the centroid objects grouped the MC candidates to form clusters. Adaptive criteria were designed to reduce false positive (FP) clusters based on the size, CNR values and the number of MCs in the cluster, cluster shape, and cluster based maximum intensity projection. Free-response receiver operating characteristic (FROC) and jackknife alternative FROC (JAFROC) analyses were used to assess the performance and compare with that of a previous study. Results: Unpaired two-tailed t-test showed a significant increase (p < 0.0001) in the ratio of CNRs for MCs with and without MSBF regularization compared to similar ratios for FPs. 
For view-based detection, a sensitivity of 85% was achieved at an FP rate of 2.16 per DBT volume. For case-based detection, a sensitivity of 85% was achieved at an FP rate of 0.85 per DBT volume. JAFROC analysis showed a significant improvement in the performance of the current CADe system compared to that of our previous system (p = 0.003). Conclusions: MSBF-regularized SART reconstruction enhances MCs. The enhancement in the signals, in combination with properly designed adaptive threshold criteria, effective MC feature analysis, and false positive reduction techniques, leads to a significant improvement in the detection of clustered MCs in DBT.

  5. An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics

    PubMed Central

    2010-01-01

    Background Bioinformatics researchers are now confronted with analysis of ultra large-scale data sets, a problem that will only increase at an alarming rate in coming years. Recent developments in open source software, that is, the Hadoop project and associated software, provide a foundation for scaling to petabyte scale data warehouses on Linux clusters, providing fault-tolerant parallelized analysis on such data using a programming style named MapReduce. Description An overview is given of the current usage within the bioinformatics community of Hadoop, a top-level Apache Software Foundation project, and of associated open source software projects. The concepts behind Hadoop and the associated HBase project are defined, and current bioinformatics software that employ Hadoop is described. The focus is on next-generation sequencing, as the leading application area to date. Conclusions Hadoop and the MapReduce programming paradigm already have a substantial base in the bioinformatics community, especially in the field of next-generation sequencing analysis, and such use is increasing. This is due to the cost-effectiveness of Hadoop-based analysis on commodity Linux clusters, and in the cloud via data upload to cloud vendors who have implemented Hadoop/HBase; and due to the effectiveness and ease-of-use of the MapReduce method in parallelization of many data analysis algorithms. PMID:21210976
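The MapReduce programming style mentioned in the abstract above can be illustrated with a small single-process sketch. This is not Hadoop's API. The three function names, the k-mer counting task, and the two reads are made up for illustration, and a real cluster would distribute the map and reduce steps across nodes.

```python
from collections import defaultdict
from itertools import chain


def map_kmers(read, k=3):
    """Map step: emit a (k-mer, 1) pair for every k-mer in one read."""
    return [(read[i:i + k], 1) for i in range(len(read) - k + 1)]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(grouped):
    """Reduce step: sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


reads = ["ACGTAC", "CGTACG"]  # hypothetical sequencing reads
mapped = chain.from_iterable(map_kmers(read) for read in reads)
kmer_counts = reduce_counts(shuffle(mapped))
print(kmer_counts)
```

Because each read is mapped independently and each key is reduced independently, both steps can in principle run in parallel, which is the property the abstract credits for the method's ease of parallelization.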

  6. Genetic divergence through joint analysis of morphoagronomic and molecular characters in accessions of Jatropha curcas.

    PubMed

    Pestana-Caldas, C N; Silva, S A; Machado, E L; de Souza, D R; Cerqueira-Pereira, E C; Silva, M S

    2016-10-05

    The aim of this study was to investigate the genetic divergence between accessions of Jatropha curcas through joint analysis of morphoagronomic and molecular characters. To this end, we investigated 11 morphoagronomic characters and performed molecular genotyping, using 23 inter-simple sequence repeat (ISSR) primers in 46 accessions of J. curcas. We calculated the contribution of each character on divergence using analysis of variance. The grouping among accessions was performed using the Ward-MLM (modified location model) method, using morphoagronomic and molecular data, whereas the cophenetic correlation was obtained based on Gower's algorithm. There were significant differences in all growth-related characteristics: number of primary and secondary branches per plant, plant height, and stem diameter. For characters related to grain production, differences were found for number of fruit clusters per plant and number of inflorescence clusters per plant and average number of seeds per fruit. The greatest phenotypic variation was found in plant height (59.67- 222.33 cm), whereas the smallest variation was found in average number of seeds per fruit (0-2.90), followed by the number of fruit clusters per plant (0-8.67). In total, 94 polymorphic ISSR fragments were obtained. The genotypic grouping identified six groups, indicating that there is genetic divergence among the accessions. The most promising crossings for future hybridization were identified among accessions UFRB60 and UFVJC45, and UFRB61 and UFVJC18. In conclusion, the joint analysis of morphoagronomic characters and ISSR markers is an efficient method to assess the genetic divergence in J. curcas.

  7. A study on phenomenology of Dhat syndrome in men in a general medical setting

    PubMed Central

    Prakash, Sathya; Sharan, Pratap; Sood, Mamta

    2016-01-01

    Background: “Dhat syndrome” is believed to be a culture-bound syndrome of the Indian subcontinent. Although many studies have been performed, many have methodological limitations and there is a lack of agreement in many areas. Aims: The aim is to study the phenomenology of “Dhat syndrome” in men and to explore the possibility of subtypes within this entity. Settings and Design: It is a cross-sectional descriptive study conducted at a sex and marriage counseling clinic of a tertiary care teaching hospital in Northern India. Materials and Methods: An operational definition and assessment instrument for “Dhat syndrome” was developed after taking all concerned stakeholders into account and review of literature. It was applied on 100 patients along with socio-demographic profile, Hamilton Depression Rating Scale, Hamilton Anxiety Rating Scale, Mini International Neuropsychiatric Interview, and Postgraduate Institute Neuroticism Scale. Statistical Analysis: For statistical analysis, descriptive statistics, group comparisons, and Pearson's product moment correlations were carried out. Factor analysis and cluster analysis were done to determine the factor structure and subtypes of “Dhat syndrome.” Results: A diagnostic and assessment instrument for “Dhat syndrome” has been developed and the phenomenology in 100 patients has been described. Both the health beliefs scale and associated symptoms scale demonstrated a three-factor structure. The patients with “Dhat syndrome” could be categorized into three clusters based on severity. Conclusions: There appears to be a significant agreement among various stakeholders on the phenomenology of “Dhat syndrome” although some differences exist. “Dhat syndrome” could be subtyped into three clusters based on severity. PMID:27385844

  8. A Single Session of rTMS Enhances Small-Worldness in Writer's Cramp: Evidence from Simultaneous EEG-fMRI Multi-Modal Brain Graph.

    PubMed

    Bharath, Rose D; Panda, Rajanikant; Reddam, Venkateswara Reddy; Bhaskar, M V; Gohel, Suril; Bhardwaj, Sujas; Prajapati, Arvind; Pal, Pramod Kumar

    2017-01-01

    Background and Purpose: Repetitive transcranial magnetic stimulation (rTMS) induces widespread changes in brain connectivity. As the network topology differences induced by a single session of rTMS are less known, we undertook this study to ascertain whether the network alterations had a small-world morphology using multi-modal graph theory analysis of simultaneous EEG-fMRI. Method: Simultaneous EEG-fMRI was acquired in duplicate before (R1) and after (R2) a single session of rTMS in 14 patients with Writer's Cramp (WC). Whole brain neuronal and hemodynamic network connectivity were explored using graph theory measures, and clustering coefficient, path length and small-world index were calculated for EEG and resting state fMRI (rsfMRI). Multi-modal graph theory analysis was used to evaluate the correlation of EEG and fMRI clustering coefficients. Result: A single session of rTMS was found to increase the clustering coefficient and small-worldness significantly in both EEG and fMRI (p < 0.05). Multi-modal graph theory analysis revealed significant modulations in the fronto-parietal regions immediately after rTMS. The rsfMRI revealed additional modulations in several deep brain regions including cerebellum, insula and medial frontal lobe. Conclusion: Multi-modal graph theory analysis of simultaneous EEG-fMRI can supplement motor physiology methods in understanding the neurobiology of rTMS in vivo. Coinciding evidence from EEG and rsfMRI reports small-world morphology for the acute phase network hyper-connectivity, indicating that the changes ensuing low-frequency rTMS are probably not "noise".
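The graph measures named in the abstract above (clustering coefficient, path length, small-world index) can be sketched on a toy undirected graph. The abstract does not say which definitions or software were used. The sketch assumes the usual local clustering coefficient, breadth-first shortest paths, and a small-world index of the form (C / C_rand) / (L / L_rand), where the random-graph reference values must be supplied separately. The toy graph is made up.

```python
from collections import deque


def local_clustering(adj, node):
    """Fraction of a node's neighbour pairs that are themselves connected."""
    neighbours = adj[node]
    k = len(neighbours)
    if k < 2:
        return 0.0
    # Count each neighbour pair once (labels must be orderable, e.g. ints).
    links = sum(1 for u in neighbours for v in neighbours if u < v and v in adj[u])
    return 2.0 * links / (k * (k - 1))


def average_clustering(adj):
    """Mean of the local clustering coefficients over all nodes."""
    return sum(local_clustering(adj, n) for n in adj) / len(adj)


def average_path_length(adj):
    """Mean shortest-path length over all node pairs of a connected graph."""
    total, pairs = 0, 0
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:  # breadth-first search from this source
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs


def small_world_index(c, l, c_rand, l_rand):
    """Assumed definition: clustering ratio divided by path-length ratio."""
    return (c / c_rand) / (l / l_rand)


# Toy graph: triangle 0-1-2 plus a pendant node 3 attached to node 2.
toy = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
C = average_clustering(toy)
L = average_path_length(toy)
print(C, L)
```

An index well above 1 is usually read as small-world organisation: clustering much higher than in a comparable random graph at a similar path length.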

  9. Biochemical characterization and phylogenetic analysis based on 16S rRNA sequences for V-factor dependent members of Pasteurellaceae derived from laboratory rats.

    PubMed

    Hayashimoto, Nobuhito; Ueno, Masami; Tkakura, Akira; Itoh, Toshio

    2007-06-01

    Phylogenetic analysis based on 16S rRNA sequences with sequence data of some bacterial species of Pasteurellaceae related to rodents deposited in GenBank was performed along with biochemical characterization for the 20 strains of V-factor dependent members of Pasteurellaceae derived from laboratory rats to obtain basic information and to investigate the taxonomic positions. The results of biochemical tests for all strains were identical except for three tests, the ornithine decarboxylase test, and fermentation tests of D(+) mannose and D(+) xylose. The biochemical properties of 8 of 20 strains that showed negative results for the fermentation test of D(+) xylose agreed with those of Haemophilus parainfluenzae complex. By phylogenetic analysis, the strains were divided into two clusters that agreed with the results of the fermentation test of xylose (group I: negative reaction for xylose, group II: positive reaction for xylose). The clusters were independent of other bacterial species of Pasteurellaceae tested. The sequences of the strains in group I showed 99.7-99.8% similarity and the strains in group II showed 99.3-99.7% similarity. None of the strains in group I had a close relation with Haemophilus parainfluenzae by phylogenetic analysis, although they showed the same biochemical properties. In conclusion, the strains had characteristic biochemical properties and formed two independent groups within the "rodent cluster" of Pasteurellaceae that differed in the results of the fermentation test of xylose. Therefore, they seemed to be hitherto undescribed taxa in Pasteurellaceae.

  10. Cluster Analysis in Nursing Research: An Introduction, Historical Perspective, and Future Directions.

    PubMed

    Dunn, Heather; Quinn, Laurie; Corbridge, Susan J; Eldeirawi, Kamal; Kapella, Mary; Collins, Eileen G

    2017-05-01

    The use of cluster analysis in the nursing literature is limited to the creation of classifications of homogeneous groups and the discovery of new relationships. As such, it is important to provide clarity regarding its use and potential. The purpose of this article is to provide an introduction to distance-based, partitioning-based, and model-based cluster analysis methods commonly utilized in the nursing literature, provide a brief historical overview on the use of cluster analysis in nursing literature, and provide suggestions for future research. An electronic search included three bibliographic databases, PubMed, CINAHL and Web of Science. Key terms were cluster analysis and nursing. The use of cluster analysis in the nursing literature is increasing and expanding. The increased use of cluster analysis in the nursing literature is positioning this statistical method to result in insights that have the potential to change clinical practice.

  11. Cluster randomized trials utilizing primary care electronic health records: methodological issues in design, conduct, and analysis (eCRT Study)

    PubMed Central

    2014-01-01

    Background There is growing interest in conducting clinical and cluster randomized trials through electronic health records. This paper reports on the methodological issues identified during the implementation of two cluster randomized trials using the electronic health records of the Clinical Practice Research Datalink (CPRD). Methods Two trials were completed in primary care: one aimed to reduce inappropriate antibiotic prescribing for acute respiratory infection; the other aimed to increase physician adherence with secondary prevention interventions after first stroke. The paper draws on documentary records and trial datasets to report on the methodological experience with respect to research ethics and research governance approval, general practice recruitment and allocation, sample size calculation and power, intervention implementation, and trial analysis. Results We obtained research governance approvals from more than 150 primary care organizations in England, Wales, and Scotland. There were 104 CPRD general practices recruited to the antibiotic trial and 106 to the stroke trial, with the target number of practices being recruited within six months. Interventions were installed into practice information systems remotely over the internet. The mean number of participants per practice was 5,588 in the antibiotic trial and 110 in the stroke trial, with the coefficient of variation of practice sizes being 0.53 and 0.56 respectively. Outcome measures showed substantial correlations between the 12 months before and the 12 months after intervention, with coefficients ranging from 0.42 for diastolic blood pressure to 0.91 for proportion of consultations with antibiotics prescribed. Defining practice and participant eligibility for analysis requires careful consideration. Conclusions Cluster randomized trials may be performed efficiently in large samples from UK general practices using the electronic health records of a primary care database. 
The geographical dispersal of trial sites presents a difficulty for research governance approval and intervention implementation. Pretrial data analyses should inform trial design and analysis plans. Trial registration Current Controlled Trials ISRCTN 47558792 and ISRCTN 35701810 (both registered on 17 March 2010). PMID:24919485
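The mean cluster size and coefficient of variation reported in the abstract above are the inputs to a commonly used design-effect approximation for cluster randomized trials with unequal cluster sizes, DE = 1 + ((cv^2 + 1) * m - 1) * ICC. The sketch below assumes that approximation. The abstract does not state the formula the authors used or any intracluster correlation coefficient, so `ASSUMED_ICC` is a purely hypothetical value.

```python
def design_effect(mean_cluster_size: float, cv: float, icc: float) -> float:
    """Approximate variance inflation for unequal cluster sizes (assumed formula)."""
    return 1.0 + ((cv ** 2 + 1.0) * mean_cluster_size - 1.0) * icc


ASSUMED_ICC = 0.05  # hypothetical; not reported in the abstract

# Stroke trial figures quoted in the abstract: mean of 110 participants, cv 0.56.
de_stroke = design_effect(mean_cluster_size=110, cv=0.56, icc=ASSUMED_ICC)

# Approximate number of independent observations the 106 practices are worth.
effective_n = 106 * 110 / de_stroke
print(round(de_stroke, 2), round(effective_n))
```

With cv set to 0 the expression reduces to the familiar 1 + (m - 1) * ICC for equal cluster sizes, which shows how much of the inflation comes from variation in practice size alone.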

  12. A novel exploratory chemometric approach to environmental monitorring by combining block clustering with Partial Least Square (PLS) analysis

    PubMed Central

    2013-01-01

    Background Given the serious threats posed to terrestrial ecosystems by industrial contamination, environmental monitoring is a standard procedure used for assessing the current status of an environment or trends in environmental parameters. Measurement of metal concentrations at different trophic levels followed by their statistical analysis using exploratory multivariate methods can provide meaningful information on the status of environmental quality. In this context, the present paper proposes a novel chemometric approach that combines block clustering with partial least squares (PLS) analysis to investigate the accumulation patterns of metals in anthropized terrestrial ecosystems. The present study focused on copper, zinc, manganese, iron, cobalt, cadmium, nickel, and lead transfer along a soil-plant-snail food chain, and the hepatopancreas of the Roman snail (Helix pomatia) was used as a biological end-point of metal accumulation. Results Block clustering delineates between the areas exposed to industrial and vehicular contamination. The toxic metals have similar distributions in the nettle leaves and snail hepatopancreas. PLS analysis showed that (1) zinc and copper concentrations at the lower trophic levels are the most important latent factors that contribute to metal accumulation in land snails; (2) cadmium and lead are the main determinants of pollution pattern in areas exposed to industrial contamination; (3) at the sites located near roads, lead is the most threatening metal for terrestrial ecosystems. Conclusion Applying block clustering with PLS to the obtained data had three major benefits. First, it helped in grouping sites depending on the type of contamination. Second, it was valuable for identifying the latent factors that contribute the most to metal accumulation in land snails. 
Finally, it optimized the number and type of data that are best for monitoring the status of metallic contamination in terrestrial ecosystems exposed to different kinds of anthropic pollution. PMID:23987502

  13. Global aphasia without hemiparesis: language profiles and lesion distribution

    PubMed Central

    Hanlon, R.; Lux, W.; Dromerick, A.

    1999-01-01

    OBJECTIVES—Global aphasia without hemiparesis (GAWH) is an uncommon stroke syndrome involving receptive and expressive language impairment, without the hemiparesis typically manifested by patients with global aphasia after large left perisylvian lesions. A few cases of GAWH have been reported with conflicting conclusions regarding pathogenesis, lesion localisation, and recovery. The current study was conducted to attempt to clarify these issues.
METHODS—Ten cases of GAWH were prospectively studied with language profiles and lesion analysis; five patients had multiple lesions, four patients had a single lesion, and one had a subarachnoid haemorrhage. Eight patients met criteria for cardioembolic ischaemic stroke.
RESULTS—Cluster analysis based on acute language profiles disclosed three subtypes of patients with GAWH; these clusters persisted on follow up language assessment. Each cluster evolved into a different aphasia subtype: persistent GAWH, Wernicke's aphasia, or transcortical motor aphasia (TCM). Composite lesion analysis showed that persistent GAWH was related to lesioning of the left superior temporal gyrus. Patients with acute GAWH who evolved into TCM type aphasia had common lesioning of the left inferior frontal gyrus and adjacent subcortical white matter. Patients with acute GAWH who evolved into Wernicke's type aphasia were characterised by lesioning of the left precentral and postcentral gyri. Recovery of language was poor in all but one patient.
CONCLUSIONS—Although patients with acute GAWH are similar on neurological examination, they are heterogeneous with respect to early aphasia profile, language recovery, and lesion profile.

    PMID:10084536

  14. Application of the Linux cluster for exhaustive window haplotype analysis using the FBAT and Unphased programs

    PubMed Central

    Mishima, Hiroyuki; Lidral, Andrew C; Ni, Jun

    2008-01-01

    Background Genetic association studies have been used to map disease-causing genes. A newly introduced statistical method, called exhaustive haplotype association study, analyzes genetic information consisting of different numbers and combinations of DNA sequence variations along a chromosome. Such studies involve a large number of statistical calculations and subsequently high computing power. It is possible to develop parallel algorithms and codes to perform the calculations on a high performance computing (HPC) system. However, most existing commonly-used statistic packages for genetic studies are non-parallel versions. Alternatively, one may use the cutting-edge technology of grid computing and its packages to conduct non-parallel genetic statistical packages on a centralized HPC system or distributed computing systems. In this paper, we report the utilization of a queuing scheduler built on the Grid Engine and run on a Rocks Linux cluster for our genetic statistical studies. Results Analysis of both consecutive and combinational window haplotypes was conducted by the FBAT (Laird et al., 2000) and Unphased (Dudbridge, 2003) programs. The dataset consisted of 26 loci from 277 extended families (1484 persons). Using the Rocks Linux cluster with 22 compute-nodes, FBAT jobs performed about 14.4–15.9 times faster, while Unphased jobs performed 1.1–18.6 times faster compared to the accumulated computation duration. Conclusion Execution of exhaustive haplotype analysis using non-parallel software packages on a Linux-based system is an effective and efficient approach in terms of cost and performance. PMID:18541045
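The speedups quoted in the abstract above can be related to the size of the cluster with a simple parallel-efficiency calculation. This is a rough sketch: dividing by the number of compute nodes is an assumption, since the abstract reports neither cores per node nor any efficiency figure.

```python
def parallel_efficiency(speedup: float, workers: int) -> float:
    """Speedup per worker; 1.0 would mean perfectly linear scaling."""
    return speedup / workers


NODES = 22  # compute nodes reported in the abstract

# FBAT speedup range quoted in the abstract (14.4 to 15.9 times faster).
fbat_low = parallel_efficiency(14.4, NODES)
fbat_high = parallel_efficiency(15.9, NODES)
print(f"{fbat_low:.3f} to {fbat_high:.3f}")
```

Read this way, the FBAT jobs used roughly two thirds of the ideal 22-fold speedup, with the remainder plausibly lost to scheduling overhead and uneven job lengths.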

  15. Strategic groups, performance, and strategic response in the nursing home industry.

    PubMed Central

    Zinn, J S; Aaronson, W E; Rosko, M D

    1994-01-01

    OBJECTIVE. This study examines the effect of strategic group membership on nursing home performance and strategic behavior. DATA SOURCES AND STUDY SETTING. Data from the 1987 Medicare and Medicaid Automated Certification Survey were combined with data from the 1987 and 1989 Pennsylvania Long Term Care Facility Questionnaire. The sample consisted of 383 Pennsylvania nursing homes. STUDY DESIGN. Cluster analysis was used to place the 383 nursing homes into strategic groups on the basis of variables measuring scope and resource deployment. Performance was measured by indicators of the quality of nursing home care (rates of pressure ulcers, catheterization, and restraint usage) and efficiency in services provision. Changes in Medicare participation after passage of the 1988 Medicare Catastrophic Coverage Act (MCCA) measured strategic behavior. MANOVA and Tukey HSD post hoc means tests determined if significant differences were associated with strategic group membership. FINDINGS. Cluster analysis produced an optimal seven-group solution. Differences in group means were significant for the clustering, performance, and conduct variables (p < .0001). Strategic groups characterized by facilities providing a continuum of care services had the best patient care outcomes. The most efficient groups were characterized by facilities with high Medicare census. While all strategic groups increased Medicare census following passage of the MCCA, those dominated by for-profits had the greatest increases. CONCLUSIONS. Our analysis demonstrates that strategic orientation influences nursing home response to regulatory initiatives, a factor that should be recognized in policy formation directed at nursing home reform. PMID:8005789

  16. Discrimination and chemical phylogenetic study of seven species of Dendrobium using infrared spectroscopy combined with cluster analysis

    NASA Astrophysics Data System (ADS)

    Luo, Congpei; He, Tao; Chun, Ze

    2013-04-01

    Dendrobium is a commonly used and precious herb in Traditional Chinese Medicine. The high biodiversity of Dendrobium and the therapeutic needs require tools for the correct and fast discrimination of different Dendrobium species. This study investigates Fourier transform infrared spectroscopy followed by cluster analysis for discrimination and chemical phylogenetic study of seven Dendrobium species. Despite the general pattern of the IR spectra, different intensities, shapes, and peak positions were found in the IR spectra of these samples, especially in the range of 1800-800 cm⁻¹. The second derivative transformation and alcoholic extracting procedure clearly enlarged the tiny spectral differences among these samples. The results indicated that each Dendrobium species had a characteristic IR spectral profile, which could be used to discriminate them. The similarity coefficients among the samples were analyzed based on their second derivative IR spectra; they ranged from 0.7632 to 0.9700 among the seven Dendrobium species, and from 0.5163 to 0.9615 among the ethanol extracts. A dendrogram was constructed based on cluster analysis of the IR spectra for studying the chemical phylogenetic relationships among the samples. The results indicated that D. denneanum and D. crepidatum could be alternative resources to substitute for D. chrysotoxum, D. officinale and D. nobile, which are officially recorded in the Chinese Pharmacopoeia. In conclusion, with the advantages of high resolution, speed and convenience, the experimental approach can successfully discriminate the seven Dendrobium species and construct their chemical phylogenetic relationships.

  17. ICAP - An Interactive Cluster Analysis Procedure for analyzing remotely sensed data

    NASA Technical Reports Server (NTRS)

    Wharton, S. W.; Turner, B. J.

    1981-01-01

    An Interactive Cluster Analysis Procedure (ICAP) was developed to derive classifier training statistics from remotely sensed data. ICAP differs from conventional clustering algorithms by allowing the analyst to optimize the cluster configuration by inspection, rather than by manipulating process parameters. Control of the clustering process alternates between the algorithm, which creates new centroids and forms clusters, and the analyst, who can evaluate and elect to modify the cluster structure. Clusters can be deleted, or lumped together pairwise, or new centroids can be added. A summary of the cluster statistics can be requested to facilitate cluster manipulation. The principal advantage of this approach is that it allows prior information (when available) to be used directly in the analysis, since the analyst interacts with ICAP in a straightforward manner, using basic terms with which he is more likely to be familiar. Results from testing ICAP showed that an informed use of ICAP can improve classification, as compared to an existing cluster analysis procedure.

  18. New detections of embedded clusters in the Galactic halo

    NASA Astrophysics Data System (ADS)

    Camargo, D.; Bica, E.; Bonatto, C.

    2016-09-01

    Context. Until recently it was thought that high Galactic latitude clouds were a non-star-forming ensemble. However, in a previous study we reported the discovery of two embedded clusters (ECs) far away from the Galactic plane (~ 5 kpc). In our recent star cluster catalogue we provided additional high and intermediate latitude cluster candidates. Aims: This work aims to clarify whether our previous detection of star clusters far away from the disc represents just an episodic event or whether star cluster formation is currently a systematic phenomenon in the Galactic halo. We analyse the nature of four clusters found in our recent catalogue and report the discovery of three new ECs each with an unusually high latitude and distance from the Galactic disc midplane. Methods: The analysis is based on 2MASS and WISE colour-magnitude diagrams (CMDs), and stellar radial density profiles (RDPs). The CMDs are built by applying a field-star decontamination procedure, which uncovers the cluster's intrinsic CMD morphology. Results: All of these clusters are younger than 5 Myr. The high-latitude ECs C 932, C 934, and C 939 appear to be related to a cloud complex about 5 kpc below the Galactic disc, under the Local arm. The other clusters are above the disc, C 1074 and C 1100 with a vertical distance of ~3 kpc, C 1099 with ~ 2 kpc, and C 1101 with ~1.8 kpc. Conclusions: According to the derived parameters ECs located below and above the disc occur, which gives evidence of widespread star cluster formation throughout the Galactic halo. This study therefore represents a paradigm shift, by demonstrating that a sterile halo must now be understood as a host for ongoing star formation. The origin and fate of these ECs remain open. There are two possibilities for their origin, Galactic fountains or infall. The discovery of ECs far from the disc suggests that the Galactic halo is more actively forming stars than previously thought. 
Furthermore, since most ECs do not survive the infant mortality, stars may be raining from the halo into the disc, and/or the halo may be harbouring generations of stars formed in clusters like those detected in our survey.

  19. [Analysis on HIV-1 subtypes and transmission clusters in newly reported HIV/AIDS cases in Yiwu, Zhejiang Province, 2016].

    PubMed

    Zhang, J F; Yao, J M; Fan, Q; Chen, W J; Pan, X H; Ding, X B; Yang, J Z; Fu, T

    2017-12-10

    Objective: To characterize the distribution of HIV-1 subtypes and transmission clusters in Yiwu, Zhejiang province. Methods: A cross-sectional molecular epidemiology study was carried out on newly reported HIV/AIDS cases in Yiwu. RNA was extracted from 168 plasma samples, followed by RT-PCR and nested PCR for pol gene amplification, sequencing, and phylogenetic tree construction to analyze subtypes and transmission clusters. Drug resistance mutations were analyzed with the CPR 6.0 online tool. Results: Subjects were mainly male (86.3%, 145/168), with an average age of (39.1±13.4) years, and most were migrants (66.7%, 112/168). The major routes of transmission were homosexual (51.2%, 86/168) and heterosexual (48.8%, 82/168) contact. The success rate for sequence acquisition was 89.9% (151/168). The dominant subtypes were CRF01_AE (74, 49.0%) and CRF07_BC (64, 42.4%), followed by CRF08_BC (5, 3.3%), CRF55_01B (3, 2.0%), and one case each of subtype B, CRF45_cpx, CRF59_01B, CRF85_BC and URF (B/C). CRF45_cpx and CRF85_BC were found for the first time in Zhejiang province. Twenty-six transmission clusters involving 65 cases were found, for a total clustered rate of 43.0% (65/151); the clustered rate for CRF01_AE was 54.1% (40/74), higher than that for CRF07_BC (21/64, 32.8%). The average cluster size was 2.5 cases/cluster, and was largest among CRF01_AE patients infected through heterosexual transmission (3.5 cases/cluster). The prevalence of transmitted drug resistance was 4.6% (7/151). Seven cases with surveillance drug resistance mutations (SDRM) were found, including 5 cases of M46L (3.3%), and one case each of F77L and Y181C. Conclusion: HIV genetic diversity and a variety of transmission clusters were observed in the study area (Yiwu). Programs monitoring subtypes and transmission clusters should be continued and strengthened.
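
    The clustered rates and the mean cluster size quoted in this abstract follow directly from the reported counts. The short check below reproduces that arithmetic; the function name is ours, introduced only for illustration.

```python
def clustered_rate(in_clusters: int, sequenced: int) -> float:
    """Percentage of sequenced cases that fall inside a transmission cluster."""
    return 100.0 * in_clusters / sequenced

overall = clustered_rate(65, 151)   # all subtypes
crf01_ae = clustered_rate(40, 74)   # CRF01_AE only
crf07_bc = clustered_rate(21, 64)   # CRF07_BC only
mean_cluster_size = 65 / 26         # clustered cases per cluster

print(f"{overall:.1f}% {crf01_ae:.1f}% {crf07_bc:.1f}% {mean_cluster_size:.1f}")
```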

  20. Identification of segregated regions in the functional brain connectome of autistic patients by a combination of fuzzy spectral clustering and entropy analysis

    PubMed Central

    Sato, João Ricardo; Balardin, Joana; Vidal, Maciel Calebe; Fujita, André

    2016-01-01

    Background Several neuroimaging studies support the model of abnormal development of brain connectivity in patients with autism-spectrum disorders (ASD). In this study, we aimed to test the hypothesis of reduced functional network segregation in autistic patients compared with controls. Methods Functional MRI data from children acquired under a resting-state protocol (Autism Brain Imaging Data Exchange [ABIDE]) were submitted to both fuzzy spectral clustering (FSC) with entropy analysis and graph modularity analysis. Results We included data from 814 children in our analysis. We identified 5 regions of interest comprising the motor, temporal and occipito-temporal cortices with increased entropy (p < 0.05) in the clustering structure (i.e., more segregation in the controls). Moreover, we noticed a statistically reduced modularity (p < 0.001) in the autistic patients compared with the controls. Significantly reduced eigenvector centrality values (p < 0.05) in the patients were observed in the same regions that were identified in the FSC analysis. Limitations There is considerable heterogeneity in the fMRI acquisition protocols among the sites that contributed to the ABIDE data set (e.g., scanner type, pulse sequence, duration of scan and resting-state protocol). Moreover, the sites differed in many variables related to sample characterization (e.g., age, IQ and ASD diagnostic criteria). Therefore, we cannot rule out the possibility that additional differences in functional network organization would be found in a more homogeneous data sample of individuals with ASD. Conclusion Our results suggest that the organization of the whole-brain functional network in patients with ASD is different from that observed in controls, which implies a reduced modularity of the brain functional networks involved in sensorimotor, social, affective and cognitive processing. PMID:26505141

  1. Missing continuous outcomes under covariate dependent missingness in cluster randomised trials

    PubMed Central

    Diaz-Ordaz, Karla; Bartlett, Jonathan W

    2016-01-01

    Attrition is a common occurrence in cluster randomised trials which leads to missing outcome data. Two approaches for analysing such trials are cluster-level analysis and individual-level analysis. This paper compares the performance of unadjusted cluster-level analysis, baseline covariate adjusted cluster-level analysis and linear mixed model analysis, under baseline covariate dependent missingness in continuous outcomes, in terms of bias, average estimated standard error and coverage probability. The methods of complete records analysis and multiple imputation are used to handle the missing outcome data. We considered four scenarios, with the missingness mechanism and baseline covariate effect on outcome either the same or different between intervention groups. We show that both unadjusted cluster-level analysis and baseline covariate adjusted cluster-level analysis give unbiased estimates of the intervention effect only if both intervention groups have the same missingness mechanisms and there is no interaction between baseline covariate and intervention group. Linear mixed model and multiple imputation give unbiased estimates under all four considered scenarios, provided that an interaction of intervention and baseline covariate is included in the model when appropriate. Cluster mean imputation has been proposed as a valid approach for handling missing outcomes in cluster randomised trials. We show that cluster mean imputation only gives unbiased estimates when missingness mechanism is the same between the intervention groups and there is no interaction between baseline covariate and intervention group. Multiple imputation shows overcoverage for small number of clusters in each intervention group. PMID:27177885
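
    As a rough illustration of what an unadjusted cluster-level analysis on complete records computes, the sketch below averages the observed outcomes within each cluster and then compares the arm-level means of those cluster means. The data are invented and missing outcomes are marked with None; this is not the authors' simulation code. As the abstract notes, this estimate is unbiased only when both arms share the same missingness mechanism and there is no interaction between baseline covariate and intervention group.

```python
from statistics import mean

def cluster_means(clusters):
    """Mean of the observed (non-missing) outcomes in each cluster."""
    means = []
    for outcomes in clusters:
        observed = [y for y in outcomes if y is not None]
        if observed:  # a cluster with no observed outcome contributes nothing
            means.append(mean(observed))
    return means

def unadjusted_cluster_level_effect(intervention, control):
    """Difference between arms in the mean of the cluster means."""
    return mean(cluster_means(intervention)) - mean(cluster_means(control))

# Hypothetical trial: two clusters per arm, None marks a missing outcome.
intervention = [[5.0, 7.0, None], [6.0, None, 8.0]]
control = [[4.0, 6.0, 5.0], [3.0, None, 5.0]]

effect = unadjusted_cluster_level_effect(intervention, control)
print(effect)
```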

  2. Missing continuous outcomes under covariate dependent missingness in cluster randomised trials.

    PubMed

    Hossain, Anower; Diaz-Ordaz, Karla; Bartlett, Jonathan W

    2017-06-01

    Attrition is a common occurrence in cluster randomised trials which leads to missing outcome data. Two approaches for analysing such trials are cluster-level analysis and individual-level analysis. This paper compares the performance of unadjusted cluster-level analysis, baseline covariate adjusted cluster-level analysis and linear mixed model analysis, under baseline covariate dependent missingness in continuous outcomes, in terms of bias, average estimated standard error and coverage probability. The methods of complete records analysis and multiple imputation are used to handle the missing outcome data. We considered four scenarios, with the missingness mechanism and baseline covariate effect on outcome either the same or different between intervention groups. We show that both unadjusted cluster-level analysis and baseline covariate adjusted cluster-level analysis give unbiased estimates of the intervention effect only if both intervention groups have the same missingness mechanisms and there is no interaction between baseline covariate and intervention group. Linear mixed model and multiple imputation give unbiased estimates under all four considered scenarios, provided that an interaction of intervention and baseline covariate is included in the model when appropriate. Cluster mean imputation has been proposed as a valid approach for handling missing outcomes in cluster randomised trials. We show that cluster mean imputation only gives unbiased estimates when missingness mechanism is the same between the intervention groups and there is no interaction between baseline covariate and intervention group. Multiple imputation shows overcoverage for small number of clusters in each intervention group.

  3. Spatial and Temporal Distribution of Tuberculosis in the State of Mexico, Mexico

    PubMed Central

    Zaragoza Bastida, Adrian; Hernández Tellez, Marivel; Bustamante Montes, Lilia P.; Medina Torres, Imelda; Jaramillo Paniagua, Jaime Nicolás; Mendoza Martínez, Germán David; Ramírez Durán, Ninfa

    2012-01-01

    Tuberculosis (TB) is one of the oldest human diseases that still affects large population groups. According to the World Health Organization (WHO), there were approximately 9.4 million new cases worldwide in the year 2010. In Mexico, there were 18,848 new cases of TB of all clinical variants in 2010. The identification of clusters in space-time is of great interest in epidemiological studies. The objective of this research was to identify the spatial and temporal distribution of TB during the period 2006–2010 in the State of Mexico, using geographic information system (GIS) and SCAN statistics program. Nine significant clusters (P < 0.05) were identified using spatial and space-time analysis. The conclusion is that TB in the State of Mexico is not randomly distributed but is concentrated in areas close to Mexico City. PMID:22919337

  4. Cluster and principal component analysis based on SSR markers of Amomum tsao-ko in Jinping County of Yunnan Province

    NASA Astrophysics Data System (ADS)

    Ma, Mengli; Lei, En; Meng, Hengling; Wang, Tiantao; Xie, Linyan; Shen, Dong; Xianwang, Zhou; Lu, Bingyue

    2017-08-01

    Amomum tsao-ko is a commercial plant that is used for various purposes in the medicinal and food industries. For the present investigation, 44 germplasm samples were collected from Jinping County of Yunnan Province. Cluster analysis and 2-dimensional principal component analysis (PCA) were used to represent the genetic relations among Amomum tsao-ko samples by using simple sequence repeat (SSR) markers. Cluster analysis clearly distinguished the sample groups. Two major clusters were formed: the first (Cluster I) consisted of 34 individuals and the second (Cluster II) consisted of 10 individuals; Cluster I, as the main group, contained multiple sub-clusters. PCA also showed 2 groups: PCA Group 1 included 29 individuals and PCA Group 2 included 12 individuals, consistent with the results of cluster analysis. The purpose of the present investigation was to provide information on the genetic relationships of Amomum tsao-ko germplasm resources in the main producing areas, and also to provide a theoretical basis for the protection and utilization of Amomum tsao-ko resources.

  5. Development and optimization of SPECT gated blood pool cluster analysis for the prediction of CRT outcome.

    PubMed

    Lalonde, Michel; Wells, R Glenn; Birnie, David; Ruddy, Terrence D; Wassenaar, Richard

    2014-07-01

    Phase analysis of single photon emission computed tomography (SPECT) radionuclide angiography (RNA) has been investigated for its potential to predict the outcome of cardiac resynchronization therapy (CRT). However, phase analysis may be limited in its potential at predicting CRT outcome as valuable information may be lost by assuming that time-activity curves (TAC) follow a simple sinusoidal shape. A new method, cluster analysis, is proposed which directly evaluates the TACs and may lead to a better understanding of dyssynchrony patterns and CRT outcome. Cluster analysis algorithms were developed and optimized to maximize their ability to predict CRT response. About 49 patients (N = 27 ischemic etiology) received a SPECT RNA scan as well as positron emission tomography (PET) perfusion and viability scans prior to undergoing CRT. A semiautomated algorithm sampled the left ventricle wall to produce 568 TACs from SPECT RNA data. The TACs were then subjected to two different cluster analysis techniques, K-means, and normal average, where several input metrics were also varied to determine the optimal settings for the prediction of CRT outcome. Each TAC was assigned to a cluster group based on the comparison criteria and global and segmental cluster size and scores were used as measures of dyssynchrony and used to predict response to CRT. A repeated random twofold cross-validation technique was used to train and validate the cluster algorithm. Receiver operating characteristic (ROC) analysis was used to calculate the area under the curve (AUC) and compare results to those obtained for SPECT RNA phase analysis and PET scar size analysis methods. Using the normal average cluster analysis approach, the septal wall produced statistically significant results for predicting CRT results in the ischemic population (ROC AUC = 0.73;p < 0.05 vs. equal chance ROC AUC = 0.50) with an optimal operating point of 71% sensitivity and 60% specificity. 
Cluster analysis results were similar to SPECT RNA phase analysis (ROC AUC = 0.78, p = 0.73 vs cluster AUC; sensitivity/specificity = 59%/89%) and PET scar size analysis (ROC AUC = 0.73, p = 1.0 vs cluster AUC; sensitivity/specificity = 76%/67%). A SPECT RNA cluster analysis algorithm was developed for the prediction of CRT outcome. Cluster analysis results produced results equivalent to those obtained from Fourier and scar analysis.
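
    To make the idea of clustering time-activity curves (TACs) concrete, here is a minimal K-means sketch in plain Python. It covers only the K-means variant named in the abstract, not the "normal average" technique, and the four short curves, the starting centroids and the use of squared Euclidean distance are all assumptions made for illustration.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two curves of equal length."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(curves, centroids, iterations=10):
    """Plain Lloyd's K-means; `centroids` holds the starting centres."""
    labels = []
    for _ in range(iterations):
        # Assignment step: each curve goes to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: sq_dist(c, centroids[k]))
            for c in curves
        ]
        # Update step: each centroid becomes the mean curve of its members.
        updated = []
        for k, old in enumerate(centroids):
            members = [c for c, lab in zip(curves, labels) if lab == k]
            if members:
                updated.append([sum(col) / len(members) for col in zip(*members)])
            else:
                updated.append(old)  # keep an empty cluster's centroid unchanged
        centroids = updated
    return labels, centroids

# Hypothetical TACs: two peak early, two peak late.
tacs = [
    [1.0, 3.0, 2.0, 1.0],
    [1.0, 3.2, 2.1, 1.0],
    [1.0, 2.0, 3.0, 1.0],
    [1.0, 2.1, 3.2, 1.0],
]
labels, centroids = kmeans(tacs, centroids=[tacs[0], tacs[2]])
cluster_sizes = [labels.count(k) for k in range(len(centroids))]
print(labels, cluster_sizes)
```

    The cluster sizes computed at the end correspond loosely to the "cluster size" measures the abstract mentions; how the study turned them into dyssynchrony scores is not described here.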

  6. Patterns of Childhood Abuse and Neglect in a Representative German Population Sample

    PubMed Central

    Schilling, Christoph; Weidner, Kerstin; Brähler, Elmar; Glaesmer, Heide; Häuser, Winfried; Pöhlmann, Karin

    2016-01-01

    Background Different types of childhood maltreatment, like emotional abuse, emotional neglect, physical abuse, physical neglect and sexual abuse are interrelated because of their co-occurrence. Different patterns of childhood abuse and neglect are associated with the degree of severity of mental disorders in adulthood. The purpose of this study was (a) to identify different patterns of childhood maltreatment in a representative German community sample, (b) to replicate the patterns of childhood neglect and abuse recently found in a clinical German sample, (c) to examine whether participants reporting exposure to specific patterns of child maltreatment would report different levels of psychological distress, and (d) to compare the results of the typological approach and the results of a cumulative risk model based on our data set. Methods In a cross-sectional survey conducted in 2010, a representative random sample of 2504 German participants aged between 14 and 92 years completed the Childhood Trauma Questionnaire (CTQ). General anxiety and depression were assessed by standardized questionnaires (GAD-2, PHQ-2). Cluster analysis was conducted with the CTQ-subscales to identify different patterns of childhood maltreatment. Results Three different patterns of childhood abuse and neglect could be identified by cluster analysis. Cluster one showed low values on all CTQ-scales. Cluster two showed high values in emotional and physical neglect. Only cluster three showed high values in physical and sexual abuse. The three patterns of childhood maltreatment showed different degrees of depression (PHQ-2) and anxiety (GAD-2). Cluster one showed lowest levels of psychological distress, cluster three showed highest levels of mental distress. 
Conclusion The results show that different types of childhood maltreatment are interrelated and can be grouped into specific patterns of childhood abuse and neglect, which are associated with differing severity of psychological distress in adulthood. The results correspond to those recently found in a German clinical sample and support a typological approach in the research of maltreatment. While cumulative risk models focus on the number of maltreatment types, the typological approach takes the number as well as the severity of the maltreatment types into account. Thus, specific patterns of maltreatment can be examined with regard to specific long-term psychological consequences. PMID:27442446

  7. A snapshot of the predominant single nucleotide polymorphism cluster groups of Mycobacterium tuberculosis clinical isolates in Delhi, India.

    PubMed

    Varma-Basil, Mandira; Narang, Anshika; Chakravorty, Soumitesh; Garima, Kushal; Gupta, Shraddha; Kumar Sharma, Naresh; Giri, Astha; Zozio, Thierry; Couvin, David; Hanif, Mahmud; Bhatnagar, Anuj; Menon, Balakrishnan; Niemann, Stefan; Rastogi, Nalin; Alland, David; Bose, Mridula

    2016-09-01

    Several attempts have been made to associate phylogenetic differences among Mycobacterium tuberculosis strains to variations in the clinical outcome of the disease and to drug resistance. We genotyped 139 clinical isolates of M. tuberculosis obtained from patients of pulmonary tuberculosis in North Delhi region. The isolates were analyzed using nine Single nucleotide polymorphism (SNP) markers, spoligotyping and MIRU-VNTRs; and the results were correlated with their drug susceptibility profile. Results of SNP cluster group (SCG) analysis (available for 138 isolates) showed that the most predominant cluster was SCG 3a, observed in 58.7% (81/138) of the isolates with 44.4% (36/81) of these being drug susceptible, while 16% (13/81) were multidrug resistant (MDR). Of the ancestral cluster SCG 1 observed in 19.5% (27/138) of the isolates, 14.8% (4/27) were MDR while 44.4% (12/27) were drug susceptible. SCG 2 formed 5.79% (8/138) of the isolates and 50% (4/8) of these were multidrug resistant (MDR). Spoligotyping subdivided the strains into 45 shared types (n = 125) and 14 orphan strains. The orphan strains were mostly associated with SCG 3a or SCG 1, reflecting the principal SCGs found in the Indian population. SCG 1 and SCG 2 genotypes were concordant with the East African Indian (EAI) and Beijing families respectively. Central Asian (CAS) clade and its sublineages were predominantly associated with SCG 3a. No consistent association was seen between the SCGs and Harlem, T or X clades. The 15 loci MIRU-VNTR typing revealed 123/136 isolates to be unclustered, while 13 isolates were present in 6 clusters of 2-3 isolates each. However, correlating the cluster analysis with patient details did not suggest any evidence of recent transmission. In conclusion, though our study revealed the preponderance of SCG 1 and 3a in the M. 
tuberculosis population circulating in the region, the diversity of strains highlights the changes occurring within lineages and reemphasizes the importance of cluster investigations in extended studies. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. Near real-time space-time cluster analysis for detection of enteric disease outbreaks in a community setting.

    PubMed

    Glatman-Freedman, Aharona; Kaufman, Zalman; Kopel, Eran; Bassal, Ravit; Taran, Diana; Valinsky, Lea; Agmon, Vered; Shpriz, Manor; Cohen, Daniel; Anis, Emilia; Shohat, Tamy

    2016-08-01

    To enhance timely surveillance of bacterial enteric pathogens, space-time cluster analysis was introduced in Israel in May 2013. Stool isolation data of Salmonella, Shigella, and Campylobacter from patients of a large Health Maintenance Organization were analyzed weekly by ArcGIS and SaTScan, and cluster results were sent promptly to local departments of health (LDOHs). During eighteen months, we identified 52 Shigella sonnei clusters, two Salmonella clusters, and no Campylobacter clusters. S. sonnei clusters lasted from one to 33 days and included three to 30 individuals. Thirty-one (60%) of the S. sonnei clusters were known to LDOHs prior to cluster analysis. Clusters not previously known by the LDOHs prompted epidemiologic investigations. In 31 of the 37 (84%) confirmed clusters, educational institutes (nursery schools, kindergartens, and a primary school) were involved. Cluster analysis demonstrated capability to complement enteric disease surveillance. Scaling up the system can further enhance timely detection and control of outbreaks. Copyright © 2016 The British Infection Association. Published by Elsevier Ltd. All rights reserved.

  9. An effective fuzzy kernel clustering analysis approach for gene expression data.

    PubMed

    Sun, Lin; Xu, Jiucheng; Yin, Jiaojiao

    2015-01-01

    Fuzzy clustering is an important tool for analyzing microarray data. A major problem in applying fuzzy clustering method to microarray gene expression data is the choice of parameters with cluster number and centers. This paper proposes a new approach to fuzzy kernel clustering analysis (FKCA) that identifies desired cluster number and obtains more steady results for gene expression data. First of all, to optimize characteristic differences and estimate optimal cluster number, Gaussian kernel function is introduced to improve spectrum analysis method (SAM). By combining subtractive clustering with max-min distance mean, maximum distance method (MDM) is proposed to determine cluster centers. Then, the corresponding steps of improved SAM (ISAM) and MDM are given respectively, whose superiority and stability are illustrated through performing experimental comparisons on gene expression data. Finally, by introducing ISAM and MDM into FKCA, an effective improved FKCA algorithm is proposed. Experimental results from public gene expression data and UCI database show that the proposed algorithms are feasible for cluster analysis, and the clustering accuracy is higher than the other related clustering algorithms.

  10. Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering.

    PubMed

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor; Essex, M

    2015-05-01

    To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice.
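
    The sliding window analysis mentioned in this abstract can be pictured with a small helper that lists window coordinates along a sequence. The abstract gives the window widths (1,000 bp and 2,000 bp) but not the step size or the exact alignment length, so the toy call below uses small made-up numbers and makes no attempt to reproduce the 99 and 45 windows of the study.

```python
def sliding_windows(length, width, step):
    """Half-open (start, end) coordinates of every full-width window."""
    return [(start, start + width) for start in range(0, length - width + 1, step)]

# Toy example: a 10-position sequence, 4-position windows, step of 2.
print(sliding_windows(10, 4, 2))
```

    Each window would then be extracted from the alignment and analysed separately, which is what allows clustering to be compared across subgenomic regions.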

  11. Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering

    PubMed Central

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor

    2015-01-01

    To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice. PMID:25560745

  12. Broca’s area network in language function: a pooling-data connectivity study

    PubMed Central

    Bernal, Byron; Ardila, Alfredo; Rosselli, Monica

    2015-01-01

    Background and Objective: Modern neuroimaging developments have demonstrated that cognitive functions correlate with brain networks rather than specific areas. The purpose of this paper was to analyze the connectivity of Broca’s area based on language tasks. Methods: A connectivity modeling study was performed by pooling data of Broca’s activation in language tasks. Fifty-seven papers that included 883 subjects in 84 experiments were analyzed. Analysis of Likelihood Estimates of pooled data was utilized to generate the map; thresholds at p < 0.01 were corrected for multiple comparisons and false discovery rate. Resulting images were co-registered into MNI standard space. Results: A network consisting of 16 clusters of activation was obtained. Main clusters were located in the frontal operculum, left posterior temporal region, supplementary motor area, and the parietal lobe. Less common clusters were seen in the sub-cortical structures including the left thalamus, left putamen, secondary visual areas, and the right cerebellum. Conclusion: Broca’s area-44-related networks involved in language processing were demonstrated utilizing a pooling-data connectivity study. Significance, interpretation, and limitations of the results are discussed. PMID:26074842

  13. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ding, Jun; Ma, Evan; Asta, Mark

    Using molecular dynamics simulations, we have studied the atomic correlations characterizing the second peak in the radial distribution function (RDF) of metallic glasses and liquids. The analysis was conducted from the perspective of different connection schemes of atomic packing motifs, based on the number of shared atoms between two linked coordination polyhedra. The results demonstrate that the cluster connections by face-sharing, specifically with three common atoms, are most favored when transitioning from the liquid to glassy state, and exhibit the stiffest elastic response during shear deformation. These properties of the connections and the resultant atomic correlations are generally the same for different types of packing motifs in different alloys. Splitting of the second RDF peak was observed for the inherent structure of the equilibrium liquid, originating solely from cluster connections; this trait can then be inherited in the metallic glass formed via subsequent quenching of the parent liquid through the glass transition, in the absence of any additional type of local structural order. In conclusion, increasing ordering and cluster connection during cooling, however, may tune the position and intensity of the split peaks.

  14. Community involvement in dengue vector control: cluster randomised trial

    PubMed Central

    Toledo, M E; Rodríguez, M; Gomez, D; Baly, A; Benitez, J R; Van der Stuyft, P

    2009-01-01

    Objective To assess the effectiveness of an integrated community based environmental management strategy to control Aedes aegypti, the vector of dengue, compared with a routine strategy. Design Cluster randomised trial. Setting Guantanamo, Cuba. Participants 32 circumscriptions (around 2000 inhabitants each). Interventions The circumscriptions were randomly allocated to control clusters (n=16) comprising routine Aedes control programme (entomological surveillance, source reduction, selective adulticiding, and health education) and to intervention clusters (n=16) comprising the routine Aedes control programme combined with a community based environmental management approach. Main outcome measures The primary outcome was levels of Aedes infestation: house index (number of houses positive for at least one container with immature stages of Ae aegypti per 100 inspected houses), Breteau index (number of containers positive for immature stages of Ae aegypti per 100 inspected houses), and the pupae per inhabitant statistic (number of Ae aegypti pupae per inhabitant). Results All clusters were subjected to the intended intervention; all completed the study protocol up to February 2006 and all were included in the analysis. At baseline the Aedes infestation levels were comparable between intervention and control clusters: house index 0.25% v 0.20%, pupae per inhabitant 0.44×10⁻³ v 0.29×10⁻³. At the end of the intervention these indices were significantly lower in the intervention clusters: rate ratio for house indices 0.49 (95% confidence interval 0.27 to 0.88) and rate ratio for pupae per inhabitant 0.27 (0.09 to 0.76). Conclusion A community based environmental management embedded in a routine control programme was effective at reducing levels of Aedes infestation. Trial registration Current Controlled Trials ISRCTN88405796. PMID:19509031

  15. Health in police officers: Role of risk factor clusters and police divisions

    PubMed Central

    Habersaat, Stephanie A.; Geiger, Ashley M.; Abdellaoui, Sid; Wolf, Jutta M.

    2015-01-01

    Objective Law enforcement is a stressful occupation associated with significant health problems. To date, most studies have focused on one specific factor or one domain of risk factors (e.g., organizational, personal). However, it is more likely that specific combinations of risk factors are differentially health relevant and further, depend on the area of police work. Methods A self-selected group of officers from the criminal, community, and emergency division (N = 84) of a Swiss state police department answered questionnaires assessing personal and organizational risk factors as well as mental and physical health indicators. Results In general, few differences were observed across divisions in terms of risk factors or health indicators. Cluster analysis of all risk factors established a high-risk and a low-risk cluster with significant links to all mental health outcomes. Risk cluster-by-division interactions revealed that, in the high-risk cluster, emergency officers reported fewer physical symptoms, while community officers reported more posttraumatic stress symptoms. Criminal officers in the high-risk cluster tended to perceive more stress. Finally, perceived stress did not mediate the relationship between risk clusters and posttraumatic stress symptoms. Conclusion In summary, our results support the notion that police officers are a heterogeneous population in terms of processes linking risk factors and health indicators. This heterogeneity thereby appeared to be more dependent on personal factors and individuals' perception of their own work conditions than division-specific work environments. Our findings further suggest that stress-reduction interventions that do not target job-relevant sources of stress may only show limited effectiveness in reducing health risks associated with police work. PMID:26364008

  16. Networking between community health programs: a case study outlining the effectiveness, barriers and enablers.

    PubMed

    Grills, Nathan J; Robinson, Priscilla; Phillip, Maneesh

    2012-07-19

    In India, since the 1990s, there has been a burgeoning of NGOs involved in providing primary health care. This has resulted in a complex NGO-Government interface which is difficult for lone NGOs to navigate. The Uttarakhand Cluster, India, links such small community health programs together to build NGO capacity, increase visibility and better link to the government schemes and the formal healthcare system. This research, undertaken between 1998 and 2011, aims to examine barriers and facilitators to such linking, or clustering, and the effectiveness of this clustering approach. Interviews, indicator surveys and participant observation were used to document the process and explore the enablers, the barriers and the effectiveness of networks improving community health. The analysis revealed that when activating, framing, mobilising and synthesizing the Uttarakhand Cluster, key brokers and network players were important in bridging between organisations. The ties (or relationships) that held the cluster together included homophily around common faith, common friendships and geographical location and common mission. Self interest whereby members sought funds, visibility, credibility, increased capacity and access to trainings was also a commonly identified motivating factor for networking. Barriers to network synthesizing included lack of funding, poor communication, limited time and lack of human resources. Risk aversion and mistrust remained significant barriers to overcome for such a network. In conclusion, specific enabling factors allowed the clustering approach to be effective at increasing access to resources, creating collaborative opportunities and increasing visibility, credibility and confidence of the cluster members. These findings add to knowledge regarding social network formation and collaboration, and such knowledge will assist in the conceptualisation, formation and success of potential health networks in India and other developing world countries.

  17. Identifying Clusters of Active Transportation Using Spatial Scan Statistics

    PubMed Central

    Huang, Lan; Stinchcomb, David G.; Pickle, Linda W.; Dill, Jennifer; Berrigan, David

    2009-01-01

    Background There is an intense interest in the possibility that neighborhood characteristics influence active transportation such as walking or biking. The purpose of this paper is to illustrate how a spatial cluster identification method can evaluate the geographic variation of active transportation and identify neighborhoods with unusually high/low levels of active transportation. Methods Self-reported walking/biking prevalence, demographic characteristics, street connectivity variables, and neighborhood socioeconomic data were collected from respondents to the 2001 California Health Interview Survey (CHIS; N=10,688) in Los Angeles County (LAC) and San Diego County (SDC). Spatial scan statistics were used to identify clusters of high or low prevalence (with and without age-adjustment) and the quantity of time spent walking and biking. The data, a subset from the 2001 CHIS, were analyzed in 2007–2008. Results Geographic clusters of significantly high or low prevalence of walking and biking were detected in LAC and SDC. Structural variables such as street connectivity and shorter block lengths are consistently associated with higher levels of active transportation, but associations between active transportation and socioeconomic variables at the individual and neighborhood levels are mixed. Only one cluster with less time spent walking and biking among walkers/bikers was detected in LAC, and this was of borderline significance. Age-adjustment affects the clustering pattern of walking/biking prevalence in LAC, but not in SDC. Conclusions The use of spatial scan statistics to identify significant clustering of health behaviors such as active transportation adds to the more traditional regression analysis that examines associations between behavior and environmental factors by identifying specific geographic areas with unusual levels of the behavior independent of predefined administrative units. PMID:19589451

  18. Mycobacterium tuberculosis Transmission in a Country with Low Tuberculosis Incidence: Role of Immigration and HIV Infection

    PubMed Central

    Gagneux, Sebastien; Helbling, Peter; Battegay, Manuel; Rieder, Hans L.; Pfyffer, Gaby E.; Zwahlen, Marcel; Furrer, Hansjakob; Siegrist, Hans H.; Fehr, Jan; Dolina, Marisa; Calmy, Alexandra; Stucki, David; Jaton, Katia; Janssens, Jean-Paul; Stalder, Jesica Mazza; Bodmer, Thomas; Ninet, Beatrice; Böttger, Erik C.; Egger, Matthias; Barth, J.; Battegay, M.; Bernasconi, E.; Böni, J.; Bucher, H. C.; Burton-Jeangros, A. Calmy; Cavassini, M.; Cellerai, C.; Egger, M.; Elzi, L.; Fehr, J.; Fellay, J.; Flepp, M.; Francioli, P.; Furrer, H.; Fux, C. A.; Gorgievski, M.; Günthard, H.; Haerry, D.; Hasse, B.; Hirschel, B.; Hirsch, H. H.; Hirschel, B.; Hoffmann, M.; Hösli, I.; Kahlert, C.; Kaiser, L.; Kaiser, O.; Kind, C.; Klimkait, T.; Kovari, H.; Ledergerber, B.; Lugano, A. P.; Martinetti, G.; Martinez de Tejada, B.; Metzner, K.; Müller, N.; Nadal, D.; Pantaleo, G.; Rauch, A.; Regenass, S.; Rickenbach, M.; Rudin, C.; Schmid, P.; Schultze, D.; Schöni-Affolter, F.; Schüpbach, J.; Speck, R.; Taffé, P.; Tarr, P.; Telenti, A.; Trkola, A.; Vernazza, P.; Weber, R.; Yerly, S.

    2012-01-01

    Immigrants from high-burden countries and HIV-coinfected individuals are risk groups for tuberculosis (TB) in countries with low TB incidence. Therefore, we studied their role in transmission of Mycobacterium tuberculosis in Switzerland. We included all TB patients from the Swiss HIV Cohort and a sample of patients from the national TB registry. We identified molecular clusters by spoligotyping and mycobacterial interspersed repetitive-unit–variable-number tandem-repeat (MIRU-VNTR) analysis and used weighted logistic regression adjusted for age and sex to identify risk factors for clustering, taking sampling proportions into account. In total, we analyzed 520 TB cases diagnosed between 2000 and 2008; 401 were foreign born, and 113 were HIV coinfected. The Euro-American M. tuberculosis lineage dominated throughout the study period (378 strains; 72.7%), with no evidence for another lineage, such as the Beijing genotype, emerging. We identified 35 molecular clusters with 90 patients, indicating recent transmission; 31 clusters involved foreign-born patients, and 15 involved HIV-infected patients. Birth origin was not associated with clustering (adjusted odds ratio [aOR], 1.58; 95% confidence interval [CI], 0.73 to 3.43; P = 0.25, comparing Swiss-born with foreign-born patients), but clustering was reduced in HIV-infected patients (aOR, 0.49; 95% CI, 0.26 to 0.93; P = 0.030). Cavitary disease, male sex, and younger age were all associated with molecular clustering. In conclusion, most TB patients in Switzerland were foreign born, but transmission of M. tuberculosis was not more common among immigrants and was reduced in HIV-infected patients followed up in the national HIV cohort study. Continued access to health services and clinical follow-up will be essential to control TB in this population. PMID:22116153

  19. Three candidate double clusters in the LMC: truth or dare?

    NASA Astrophysics Data System (ADS)

    Dalessandro, Emanuele; Zocchi, Alice; Varri, Anna Lisa; Mucciarelli, Alessio; Bellazzini, Michele; Ferraro, Francesco R.; Lanzoni, Barbara; Lapenna, Emilio; Origlia, Livia

    2018-02-01

    The Large Magellanic Cloud (LMC) hosts a large number of candidate stellar cluster pairs. Binary stellar clusters provide important clues about cluster formation processes and the evolutionary history of the host galaxy. However, to properly extract and interpret this information, it is crucial to fully constrain the fraction of real binary systems and their physical properties. Here we present a detailed photometric analysis based on ESO-FORS2 images of three candidate cluster multiplets in the LMC, namely SL349-SL353, SL385-SL387-NGC 1922 and NGC 1836-BRHT4b-NGC 1839. For each cluster, we derived ages, structural parameters and morphological properties. We have also estimated the degree of filling of their Roche lobe, as an approximate tool to measure the strength of the tidal perturbations induced by the LMC. We find that the members of the possible pairs SL349-SL353 and BRHT4b-NGC 1839 have a similar age (t = 1.00 ± 0.12 Gyr and t = 140 ± 15 Myr, respectively), thus possibly hinting at a common origin of their member systems. We also find that all candidate pairs in our sample show evidence of intracluster overdensities that can be a possible indication of real binarity. Particularly interesting is the case of SL349-SL353. In fact, SL353 is relatively close to the condition of critical filling, thus suggesting that these systems might actually constitute an energetically bound pair. It is therefore key to pursue a detailed kinematic screening of such clusters, without which, at present, we do not dare making a conclusive statement about the true nature of this putative pair.

  20. Comprehensive annotation of secondary metabolite biosynthetic genes and gene clusters of Aspergillus nidulans, A. fumigatus, A. niger and A. oryzae

    PubMed Central

    2013-01-01

    Background Secondary metabolite production, a hallmark of filamentous fungi, is an expanding area of research for the Aspergilli. These compounds are potent chemicals, ranging from deadly toxins to therapeutic antibiotics to potential anti-cancer drugs. The genome sequences for multiple Aspergilli have been determined, and provide a wealth of predictive information about secondary metabolite production. Sequence analysis and gene overexpression strategies have enabled the discovery of novel secondary metabolites and the genes involved in their biosynthesis. The Aspergillus Genome Database (AspGD) provides a central repository for gene annotation and protein information for Aspergillus species. These annotations include Gene Ontology (GO) terms, phenotype data, gene names and descriptions and they are crucial for interpreting both small- and large-scale data and for aiding in the design of new experiments that further Aspergillus research. Results We have manually curated Biological Process GO annotations for all genes in AspGD with recorded functions in secondary metabolite production, adding new GO terms that specifically describe each secondary metabolite. We then leveraged these new annotations to predict roles in secondary metabolism for genes lacking experimental characterization. As a starting point for manually annotating Aspergillus secondary metabolite gene clusters, we used antiSMASH (antibiotics and Secondary Metabolite Analysis SHell) and SMURF (Secondary Metabolite Unknown Regions Finder) algorithms to identify potential clusters in A. nidulans, A. fumigatus, A. niger and A. oryzae, which we subsequently refined through manual curation. Conclusions This set of 266 manually curated secondary metabolite gene clusters will facilitate the investigation of novel Aspergillus secondary metabolites. PMID:23617571

  1. Security practices and regulatory compliance in the healthcare industry

    PubMed Central

    Kwon, Juhee; Johnson, M Eric

    2013-01-01

    Objective Securing protected health information is a critical responsibility of every healthcare organization. We explore information security practices and identify practice patterns that are associated with improved regulatory compliance. Design We employed Ward's cluster analysis using minimum variance based on the adoption of security practices. Variance between organizations was measured using dichotomous data indicating the presence or absence of each security practice. Using t tests, we identified the relationships between the clusters of security practices and their regulatory compliance. Measurement We utilized the results from the Kroll/Healthcare Information and Management Systems Society telephone-based survey of 250 US healthcare organizations including adoption status of security practices, breach incidents, and perceived compliance levels on Health Information Technology for Economic and Clinical Health, Health Insurance Portability and Accountability Act, Red Flags rules, Centers for Medicare and Medicaid Services, and state laws governing patient information security. Results Our analysis identified three clusters (which we call leaders, followers, and laggers) based on the variance of security practice patterns. The clusters have significant differences among non-technical practices rather than technical practices, and the highest level of compliance was associated with hospitals that employed a balanced approach between technical and non-technical practices (or between one-off and cultural practices). Conclusions Hospitals in the highest level of compliance were significantly managing third parties’ breaches and training. Audit practices were important to those who scored in the middle of the pack on compliance. Our results provide security practice benchmarks for healthcare administrators and can help policy makers in developing strategic and practical guidelines for practice adoption. PMID:22955497

  2. Symptom clusters at midlife: A four-country comparison of checklist and qualitative responses

    PubMed Central

    Sievert, Lynnette Leidy; Obermeyer, Carla Makhlouf

    2011-01-01

    Objectives The purpose of this study was to examine the frequency and clustering of somatic symptoms as reported by women aged 45-55 years in four countries, to compare women's responses to open-ended questions with those derived from structured checklists, and to assess the extent to which bodily symptoms grouped with emotional complaints. Methods The Decisions at Menopause Study (DAMES) recruited 1,193 women from the general population in Beirut, Lebanon; Rabat, Morocco; Madrid, Spain; and central Massachusetts. Women participated in semi-structured interviews about health, menopause, and bodily changes at midlife. Women's responses to symptom checklists and their statements in response to open-ended questions were analyzed through factor analysis and textual analysis. Results There was considerable consistency between the frequencies of quantitative and qualitative responses, and the analyses of qualitative data illustrate the extent to which women associate somatic and emotional complaints. In open-ended responses, women in Massachusetts and Spain did not often cluster somatic symptoms together with emotional symptoms. In Morocco, dizziness, fatigue, and headaches were clustered with emotional symptoms. Women in Lebanon explicitly associated shortness of breath, chest pain, palpitations, dizziness, fatigue, gastro-intestinal complaints, headaches, and, to a lesser extent, joint pain and numbness with emotional symptoms. Conclusions The number of volunteered symptom responses was small because respondents were relatively healthy; however, the extent and pattern of association between somatic and emotional symptoms varied across sites. Certain somatic symptoms may be more likely to communicate psychosocial distress in particular cultures. These results have implications for patterns of health care utilization. PMID:22042326

  3. Using spatial analysis to demonstrate the heterogeneity of the cardiovascular drug-prescribing pattern in Taiwan

    PubMed Central

    2011-01-01

    Background Geographic Information Systems (GIS) combined with spatial analytical methods could be helpful in examining patterns of drug use. Little attention has been paid to geographic variation of cardiovascular prescription use in Taiwan. The main objective was to use local spatial association statistics to test whether or not the cardiovascular medication-prescribing pattern is homogenous across 352 townships in Taiwan. Methods The statistical methods used were the global measures of Moran's I and Local Indicators of Spatial Association (LISA). While Moran's I provides information on the overall spatial distribution of the data, LISA provides information on types of spatial association at the local level. LISA statistics can also be used to identify influential locations in spatial association analysis. The major classes of prescription cardiovascular drugs were taken from Taiwan's National Health Insurance Research Database (NHIRD), which has a coverage rate of over 97%. The dosage of each prescription was converted into defined daily doses to measure the consumption of each class of drugs. Data were analyzed with ArcGIS and GeoDa at the township level. Results The LISA statistics showed an unusual use of cardiovascular medications in the southern townships with high local variation. Patterns of drug use also showed more low-low spatial clusters (cold spots) than high-high spatial clusters (hot spots), and those low-low associations were clustered in the rural areas. Conclusions The cardiovascular drug prescribing patterns were heterogeneous across Taiwan. In particular, a clear pattern of north-south disparity exists. Such spatial clustering helps prioritize the target areas that require better education concerning drug use. PMID:21609462
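
    The two statistics named in the Methods above can be illustrated with a minimal sketch. It is not the authors' ArcGIS/GeoDa workflow: the function names, the binary (non-row-standardized) adjacency weights, and the four-township toy data are all assumptions made for illustration. Global Moran's I summarizes overall spatial autocorrelation, while the local values (one common LISA form) show how much each location contributes to it.

```python
def morans_i(values, weights):
    """Global Moran's I for values x_i and a square spatial weight matrix w_ij."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    w_sum = sum(sum(row) for row in weights)
    # Cross-products of deviations, weighted by spatial proximity.
    num = sum(weights[i][j] * dev[i] * dev[j]
              for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / w_sum) * num / den


def local_morans(values, weights):
    """Local Moran statistic per location (hypothetical helper, unstandardized weights)."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    m2 = sum(d * d for d in dev) / n  # second moment of the deviations
    return [dev[i] * sum(weights[i][j] * dev[j] for j in range(n)) / m2
            for i in range(n)]


# Toy example: four townships in a row, each adjacent to its neighbours.
chain = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
gradient = [1.0, 2.0, 3.0, 4.0]     # similar values sit next to each other
alternating = [1.0, 4.0, 1.0, 4.0]  # neighbours are always dissimilar

print(morans_i(gradient, chain))     # positive: spatial clustering
print(morans_i(alternating, chain))  # negative: spatial dispersion
print(local_morans(gradient, chain))
```

    With these unstandardized weights the local values sum to the global statistic multiplied by the total weight, which is one way to read the claim that LISA decomposes the overall pattern into local contributions.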

  4. Effects of Group Size and Lack of Sphericity on the Recovery of Clusters in K-Means Cluster Analysis

    ERIC Educational Resources Information Center

    de Craen, Saskia; Commandeur, Jacques J. F.; Frank, Laurence E.; Heiser, Willem J.

    2006-01-01

    K-means cluster analysis is known for its tendency to produce spherical and equally sized clusters. To assess the magnitude of these effects, a simulation study was conducted, in which populations were created with varying departures from sphericity and group sizes. An analysis of the recovery of clusters in the samples taken from these…

  5. Clustering of health behaviours in adult survivors of childhood cancer and the general population

    PubMed Central

    Rebholz, C E; Rueegg, C S; Michel, G; Ammann, R A; von der Weid, N X; Kuehni, C E; Spycher, B D

    2012-01-01

    Background: Little is known about engagement in multiple health behaviours in childhood cancer survivors. Methods: Using latent class analysis, we identified health behaviour patterns in 835 adult survivors of childhood cancer (age 20–35 years) and 1670 age- and sex-matched controls from the general population. Behaviour groups were determined from replies to questions on smoking, drinking, cannabis use, sporting activities, diet, sun protection and skin examination. Results: The model identified four health behaviour patterns: ‘risk-avoidance', with a generally healthy behaviour; ‘moderate drinking', with higher levels of sporting activities, but moderate alcohol-consumption; ‘risk-taking', engaging in several risk behaviours; and ‘smoking', smoking but not drinking. Similar proportions of survivors and controls fell into the ‘risk-avoiding' (42% vs 44%) and the ‘risk-taking' cluster (14% vs 12%), but more survivors were in the ‘moderate drinking' (39% vs 28%) and fewer in the ‘smoking' cluster (5% vs 16%). Determinants of health behaviour clusters were gender, migration background, income and therapy. Conclusion: A comparable proportion of childhood cancer survivors as in the general population engage in multiple health-compromising behaviours. Because of increased vulnerability of survivors, multiple risk behaviours should be addressed in targeted health interventions. PMID:22722311

  6. Development of a model of the tobacco industry's interference with tobacco control programmes

    PubMed Central

    Trochim, W; Stillman, F; Clark, P; Schmitt, C

    2003-01-01

    Objective: To construct a conceptual model of tobacco industry tactics to undermine tobacco control programmes for the purposes of: (1) developing measures to evaluate industry tactics, (2) improving tobacco control planning, and (3) supplementing current or future frameworks used to classify and analyse tobacco industry documents. Design: Web based concept mapping was conducted, including expert brainstorming, sorting, and rating of statements describing industry tactics. Statistical analyses used multidimensional scaling and cluster analysis. Interpretation of the resulting maps was accomplished by an expert panel during a face-to-face meeting. Subjects: 34 experts, selected because of their previous encounters with industry resistance or because of their research into industry tactics, took part in some or all phases of the project. Results: Maps with eight non-overlapping clusters in two dimensional space were developed, with importance ratings of the statements and clusters. Cluster and quadrant labels were agreed upon by the experts. Conclusions: The conceptual maps summarise the tactics used by the industry and their relationships to each other, and suggest a possible hierarchy for measures that can be used in statistical modelling of industry tactics and for review of industry documents. Finally, the maps enable hypothesis of a likely progression of industry reactions as public health programmes become more successful, and therefore more threatening to industry profits. PMID:12773723

  7. Attention Dysfunction Subtypes of Developmental Dyslexia

    PubMed Central

    Lewandowska, Monika; Milner, Rafał; Ganc, Małgorzata; Włodarczyk, Elżbieta; Skarżyński, Henryk

    2014-01-01

    Background Previous studies indicate that many different aspects of attention are impaired in children diagnosed with developmental dyslexia (DD). The objective of the present study was to identify cognitive profiles of DD on the basis of attentional test performance. Material/Methods 78 children with DD (30 girls, 48 boys, mean age of 12 years ±8 months) and 32 age- and sex-matched non-dyslexic children (14 girls, 18 boys) were examined using a battery of standardized tests of reading, phonological and attentional processes (alertness, covert shift of attention, divided attention, inhibition, flexibility, vigilance, and visual search). Cluster analysis was used to identify subtypes of DD. Results Dyslexic children showed deficits in alertness, covert shift of attention, divided attention, flexibility, and visual search. Three different subtypes of DD were identified, each characterized by poorer performance on the reading, phonological awareness, and visual search tasks. Additionally, children in cluster no. 1 displayed deficits in flexibility and divided attention. In contrast to non-dyslexic children, cluster no. 2 performed poorer in tasks involving alertness, covert shift of attention, divided attention, and vigilance. Cluster no. 3 showed impaired covert shift of attention. Conclusions These results indicate different patterns of attentional impairments in dyslexic children. Remediation programs should address the individual child’s deficit profile. PMID:25387479

  8. Inter-individual variability and pattern recognition of surface electromyography in front crawl swimming.

    PubMed

    Martens, Jonas; Daly, Daniel; Deschamps, Kevin; Staes, Filip; Fernandes, Ricardo J

    2016-12-01

    Variability of electromyographic (EMG) recordings is a complex phenomenon rarely examined in swimming. Our purposes were to investigate inter-individual variability in muscle activation patterns during front crawl swimming and assess if there were clusters of sub patterns present. Bilateral muscle activity of rectus abdominis (RA) and deltoideus medialis (DM) was recorded using wireless surface EMG in 15 adult male competitive swimmers. The amplitude of the median EMG trial of six upper arm movement cycles was used for the inter-individual variability assessment, quantified with the coefficient of variation, coefficient of quartile variation, the variance ratio and mean deviation. Key features were selected based on qualitative and quantitative classification strategies to enter in a k-means cluster analysis to examine the presence of strong sub patterns. Such strong sub patterns were found when clustering in two, three and four clusters. Inter-individual variability in a group of highly skilled swimmers was higher compared to other cyclic movements which is in contrast to what has been reported in the previous 50 years of EMG research in swimming. This leads to the conclusion that coaches should be careful in using overall reference EMG information to enhance the individual swimming technique of their athletes. Copyright © 2016 Elsevier Ltd. All rights reserved.
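
    Two of the variability measures listed above can be sketched as follows. This is an illustration only, not the authors' processing pipeline: the function names are hypothetical, the amplitudes are made-up numbers, the sample standard deviation is assumed for the coefficient of variation, and quartiles use the default "exclusive" method of Python's statistics.quantiles, which may differ from the quartile convention used in the study.

```python
import statistics


def coefficient_of_variation(samples):
    """Sample standard deviation divided by the mean (assumes a non-zero mean)."""
    return statistics.stdev(samples) / statistics.mean(samples)


def coefficient_of_quartile_variation(samples):
    """(Q3 - Q1) / (Q3 + Q1), a rank-based and more outlier-resistant analogue."""
    q1, _median, q3 = statistics.quantiles(samples, n=4)
    return (q3 - q1) / (q3 + q1)


# Made-up normalized EMG amplitudes for one time point across five swimmers.
amplitudes = [1.0, 2.0, 3.0, 4.0, 5.0]

print(coefficient_of_variation(amplitudes))
print(coefficient_of_quartile_variation(amplitudes))
```

    The quartile-based measure depends only on the middle half of the data, so a single swimmer with an unusually high amplitude affects it less than it affects the coefficient of variation.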

  9. Characterizing Heterogeneity within Head and Neck Lesions Using Cluster Analysis of Multi-Parametric MRI Data.

    PubMed

    Borri, Marco; Schmidt, Maria A; Powell, Ceri; Koh, Dow-Mu; Riddell, Angela M; Partridge, Mike; Bhide, Shreerang A; Nutting, Christopher M; Harrington, Kevin J; Newbold, Katie L; Leach, Martin O

    2015-01-01

    To describe a methodology, based on cluster analysis, to partition multi-parametric functional imaging data into groups (or clusters) of similar functional characteristics, with the aim of characterizing functional heterogeneity within head and neck tumour volumes. To evaluate the performance of the proposed approach on a set of longitudinal MRI data, analysing the evolution of the obtained sub-sets with treatment. The cluster analysis workflow was applied to a combination of dynamic contrast-enhanced and diffusion-weighted imaging MRI data from a cohort of squamous cell carcinoma of the head and neck patients. Cumulative distributions of voxels, containing pre and post-treatment data and including both primary tumours and lymph nodes, were partitioned into k clusters (k = 2, 3 or 4). Principal component analysis and cluster validation were employed to investigate data composition and to independently determine the optimal number of clusters. The evolution of the resulting sub-regions with induction chemotherapy treatment was assessed relative to the number of clusters. The clustering algorithm was able to separate clusters which significantly reduced in voxel number following induction chemotherapy from clusters with a non-significant reduction. Partitioning with the optimal number of clusters (k = 4), determined with cluster validation, produced the best separation between reducing and non-reducing clusters. The proposed methodology was able to identify tumour sub-regions with distinct functional properties, independently separating clusters which were affected differently by treatment. This work demonstrates that unsupervised cluster analysis, with no prior knowledge of the data, can be employed to provide a multi-parametric characterization of functional heterogeneity within tumour volumes.

  10. Predictability of Sleep in Patients with Insomnia

    PubMed Central

    Vallières, Annie; Ivers, Hans; Beaulieu-Bonneau, Simon; Morin, Charles M.

    2011-01-01

    Study Objectives: To evaluate whether the night-to-night variability in insomnia follows specific predictable patterns and to characterize sleep patterns using objective sleep and clinical variables. Design: Prospective observational study. Setting: University-affiliated sleep disorders center. Participants: 146 participants suffering from chronic and primary insomnia. Measurements and Results: Daily sleep diaries were completed for an average of 48 days and self-reported questionnaires once. Three nights were spent in the sleep laboratory for polysomnographic (PSG) assessment. Sleep efficiency, sleep onset latency, wake after sleep onset, and total sleep time were derived from sleep diaries and PSG. Time-series diary data were used to compute conditional probabilities of having an insomnia night after 1, 2, or 3 consecutive insomnia night(s). Conditional probabilities were submitted to a k-means cluster analysis. A 3-cluster solution was retained. One cluster included 38 participants exhibiting an unpredictable insomnia pattern. Another included 30 participants with a low and decreasing probability to have an insomnia night. The last cluster included 49 participants exhibiting a high probability to have insomnia every night. Clusters differed on age, insomnia severity, and mental fatigue, and on subjective sleep variables, but not on PSG sleep variables. Conclusion: These findings replicate our previous study and provide additional evidence that unpredictability is a less prevalent feature of insomnia than suggested previously in the literature. The presence of the 3 clusters is discussed in terms of sleep perception and sleep homeostasis dysregulation. Citation: Vallières A; Ivers H; Beaulieu-Bonneau S; Morin CM. Predictability of sleep in patients with insomnia. SLEEP 2011;34(5):609-617. PMID:21532954
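
    The conditional probabilities described in the Measurements above could be derived from a diary series roughly as sketched below. This is a hedged illustration rather than the study's procedure: the function name and the diary sequence are invented, nights are assumed to be already coded 1 (insomnia night) or 0 (good night), and "after k consecutive insomnia nights" is interpreted as "the k immediately preceding nights were all insomnia nights" (longer runs are counted as well).

```python
def conditional_insomnia_probability(nights, run_length):
    """P(insomnia tonight | the previous `run_length` nights were all insomnia).

    Returns None when the diary contains no qualifying run.
    """
    hits = 0
    opportunities = 0
    for t in range(run_length, len(nights)):
        if all(nights[t - run_length:t]):
            opportunities += 1
            hits += nights[t]
    return hits / opportunities if opportunities else None


# Invented ten-night diary for one participant: 1 = insomnia night, 0 = good night.
diary = [1, 1, 0, 1, 1, 1, 0, 0, 1, 0]

# One feature vector per participant; vectors like this would then be clustered.
features = [conditional_insomnia_probability(diary, k) for k in (1, 2, 3)]
print(features)
```

    In this toy diary the probability falls as the preceding run lengthens, the kind of profile the abstract describes as a low and decreasing probability; a participant whose three values all stay high would instead resemble the cluster with insomnia nearly every night.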

  11. Plant polycistronic precursors containing non-homologous microRNAs target transcripts encoding functionally related proteins

    PubMed Central

    2009-01-01

    Background MicroRNAs (miRNAs) are endogenous single-stranded small RNAs that regulate the expression of specific mRNAs involved in diverse biological processes. In plants, miRNAs are generally encoded as a single species in independent transcriptional units, referred to as MIRNA genes, in contrast to animal miRNAs, which are frequently clustered. Results We performed a comparative genomic analysis in three model plants (rice, poplar and Arabidopsis) and characterized miRNA clusters containing two to eight miRNA species. These clusters usually encode miRNAs of the same family and some share a common evolutionary origin across monocot and dicot lineages. In addition, we identified miRNA clusters harboring miRNAs with unrelated sequences that are usually not evolutionarily conserved. Strikingly, non-homologous miRNAs from the same cluster were predicted to target transcripts encoding related proteins. At least four Arabidopsis non-homologous clusters were expressed as single transcriptional units. Overexpression of one of these polycistronic precursors, producing Ath-miR859 and Ath-miR774, led to the DCL1-dependent accumulation of both miRNAs and down-regulation of their different mRNA targets encoding F-box proteins. Conclusions In addition to polycistronic precursors carrying related miRNAs, plants also contain precursors allowing coordinated expression of non-homologous miRNAs to co-regulate functionally related target transcripts. This mechanism paves the way for using polycistronic MIRNA precursors as a new molecular tool for plant biologists to simultaneously control the expression of different genes. PMID:19951405

  12. The Influence of Social Network Characteristics on Peer Clustering in Smoking: A Two-Wave Panel Study of 19- and 23-Year-Old Swedes

    PubMed Central

    Rostila, Mikael; Edling, Christofer; Rydgren, Jens

    2016-01-01

    Objectives The present study examines how the composition of social networks and perceived relationship content influence peer clustering in smoking, and how the association changes during the transition from late adolescence to early adulthood. Methods The analysis was based on a Swedish two-wave survey sample comprising ego-centric network data. Respondents were 19 years old in the initial wave, and 23 when the follow-up sample was conducted. 17,227 ego-alter dyads were included in the analyses, which corresponds to an average response rate of 48.7 percent. Random effects logistic regression models were performed to calculate gender-specific average marginal effects of social network characteristics on smoking. Results The association of egos’ and alters’ smoking behavior was confirmed and found to be stronger when correlated in the female sample. For females, the associations decreased between age 19 and 23. Interactions between network characteristics and peer clustering in smoking showed that intense social interactions with smokers increase egos’ smoking probability. The influence of network structures on peer clustering in smoking decreased during the transition from late adolescence to early adulthood. Conclusions The study confirmed peer clustering in smoking and revealed that females’ smoking behavior in particular is determined by social interactions. Female smokers’ propensity to interact with other smokers was found to be associated with the quality of peer relationships, frequent social interactions, and network density. The influence of social networks on peer clustering in smoking decreased during the transition from late adolescence to early adulthood. PMID:27727314

  13. Spatial heterogeneity of type I error for local cluster detection tests

    PubMed Central

    2014-01-01

    Background Like power, the type I error of cluster detection tests (CDTs) should be spatially assessed. Indeed, CDTs’ type I error and power both have a spatial component, as CDTs both detect and locate clusters. In the case of type I error, the spatial distribution of wrongly detected clusters (WDCs) can be particularly affected by edge effect. This simulation study aims to describe the spatial distribution of WDCs and to confirm and quantify the presence of edge effect. Methods A simulation of 40 000 datasets has been performed under the null hypothesis of risk homogeneity. The simulation design used realistic parameters from survey data on birth defects, and in particular, two baseline risks. The simulated datasets were analyzed using Kulldorff’s spatial scan, a commonly used test whose behavior is otherwise well known. To describe the spatial distribution of type I error, we defined the participation rate for each spatial unit of the region. We used this indicator in a new statistical test proposed to confirm, as well as quantify, the edge effect. Results The predefined type I error of 5% was respected for both baseline risks. Results showed strong edge effect in participation rates, with a descending gradient from center to edge, and WDCs more often centrally situated. Conclusions In routine analysis of real data, clusters on the edge of the region should be carefully considered as they rarely occur when there is no cluster. Further work is needed to combine results from power studies with this work in order to optimize CDT performance. PMID:24885343

  14. The effect of billboard design specifications on driving: A pilot study.

    PubMed

    Marciano, Hadas; Setter, Pe'erly

    2017-07-01

    Decades of research on the effects of advertising billboards on road accident rates, driver performance, and driver visual scanning behavior, has produced no conclusive findings. We suggest that road safety researchers should shift their focus and attempt to identify the billboard characteristics that are most distracting to drivers. This line of research may produce concrete guidelines for permissible billboards that would be likely to reduce the influence of the billboards on road safety. The current study is a first step towards this end. A pool of 161 photos of real advertising billboards was used as stimuli within a triple task paradigm designed to simulate certain components of driving. Each trial consisted of one ongoing tracking task accompanied by two additional concurrent tasks: (1) billboard observation task; and (2) circle color change identification task. Five clusters of billboards, identified by conducting a cluster analysis of their graphic content, were used as a within variable in one-way ANOVAs conducted on performance level data collected from the multiple tasks. Cluster 5, labeled Loaded Billboards, yielded significantly deteriorated performance on the tracking task. Cluster 4, labeled Graphical Billboards, yielded deteriorated performance primarily on the color change identification task. Cluster 3, labeled Minimal Billboards, had no effect on any of these tasks. We strongly recommend that these clusters be systematically explored in experiments involving additional real driving settings, such as driving simulators and field studies. This will enable validation of the current results and help incorporate them into real driving situations. Copyright © 2017. Published by Elsevier Ltd.

  15. Typologies of Social Support and Associations with Mental Health Outcomes Among LGBT Youth

    PubMed Central

    Birkett, Michelle A.; Mustanski, Brian

    2015-01-01

    Abstract Purpose: Lesbian, gay, bisexual, and transgender (LGBT) youth show increased risk for a number of negative mental health outcomes, which research has linked to minority stressors such as victimization. Further, social support promotes positive mental health outcomes for LGBT youth, and different sources of social support show differential relationships with mental health outcomes. However, little is known about how combinations of different sources of support impact mental health. Methods: In the present study, we identify clusters of family, peer, and significant other social support and then examine demographic and mental health differences by cluster in an analytic sample of 232 LGBT youth between the ages of 16 and 20 years. Results: Using k-means cluster analysis, three social support cluster types were identified: high support (44.0% of participants), low support (21.6%), and non-family support (34.5%). A series of chi-square tests were used to examine demographic differences between these clusters, which were found for socio-economic status (SES). Regression analyses indicated that, while controlling for victimization, individuals within the three clusters showed different relationships with multiple mental health outcomes: loneliness, hopelessness, depression, anxiety, somatization, general symptom severity, and symptoms of major depressive disorder (MDD). Conclusion: Findings suggest the combinations of sources of support LGBT youth receive are related to their mental health. Higher SES youth are more likely to receive support from family, peers, and significant others. For most mental health outcomes, family support appears to be an especially relevant and important source of support to target for LGBT youth. PMID:26790019
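
    As a rough illustration of the k-means step mentioned in this abstract, the sketch below runs plain Lloyd's algorithm on made-up support scores. The scores, the (family, peer, significant other) ordering and the choice of starting centroids are assumptions for demonstration only, not the study's data or software.

```python
import math


def kmeans(points, centroids, iterations=20):
    """Plain Lloyd's algorithm: assign each point to its nearest centroid, then re-average."""
    labels = []
    for _ in range(iterations):
        labels = [
            min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        new_centroids = []
        for c in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(tuple(sum(v) / len(members) for v in zip(*members)))
            else:
                new_centroids.append(centroids[c])  # leave an empty cluster where it was
        if new_centroids == centroids:
            break  # assignments have stabilised
        centroids = new_centroids
    return labels, centroids


# Hypothetical (family, peer, significant-other) support scores.
scores = [(6, 6, 6), (7, 6, 7), (2, 2, 1), (1, 2, 2), (2, 6, 6), (1, 7, 6)]

# Starting centroids are picked by hand here; real analyses usually use several random starts.
labels, centroids = kmeans(scores, [scores[0], scores[2], scores[4]])
```

    With these toy scores the three groups resemble a high-support, a low-support and a non-family-support pattern, but the outcome of k-means in general depends on the starting centroids and on how the variables are scaled.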

  16. Evaluating Mixture Modeling for Clustering: Recommendations and Cautions

    ERIC Educational Resources Information Center

    Steinley, Douglas; Brusco, Michael J.

    2011-01-01

    This article provides a large-scale investigation into several of the properties of mixture-model clustering techniques (also referred to as latent class cluster analysis, latent profile analysis, model-based clustering, probabilistic clustering, Bayesian classification, unsupervised learning, and finite mixture models; see Vermunt & Magidson,…

  17. Investigating Subtypes of Child Development: A Comparison of Cluster Analysis and Latent Class Cluster Analysis in Typology Creation

    ERIC Educational Resources Information Center

    DiStefano, Christine; Kamphaus, R. W.

    2006-01-01

    Two classification methods, latent class cluster analysis and cluster analysis, are used to identify groups of child behavioral adjustment underlying a sample of elementary school children aged 6 to 11 years. Behavioral rating information across 14 subscales was obtained from classroom teachers and used as input for analyses. Both the procedures…

  18. Improving Fraud and Abuse Detection in General Physician Claims: A Data Mining Study

    PubMed Central

    Joudaki, Hossein; Rashidian, Arash; Minaei-Bidgoli, Behrouz; Mahmoodi, Mahmood; Geraili, Bijan; Nasiri, Mahdi; Arab, Mohammad

    2016-01-01

    Background: We aimed to identify the indicators of healthcare fraud and abuse in general physicians’ drug prescription claims, and to identify a subset of general physicians that were more likely to have committed fraud and abuse. Methods: We applied a data mining approach to a major health insurance organization dataset of private sector general physicians’ prescription claims. It involved 5 steps: clarifying the nature of the problem and objectives, data preparation, indicator identification and selection, cluster analysis to identify suspect physicians, and discriminant analysis to assess the validity of the clustering approach. Results: Thirteen indicators were developed in total. Over half of the general physicians (54%) were ‘suspects’ of conducting abusive behavior. The results also identified 2% of physicians as suspects of fraud. Discriminant analysis suggested that the indicators demonstrated adequate performance in the detection of physicians who were suspect of perpetrating fraud (98%) and abuse (85%) in a new sample of data. Conclusion: Our data mining approach will help health insurance organizations in low- and middle-income countries (LMICs) in streamlining auditing approaches towards the suspect groups rather than routine auditing of all physicians. PMID:26927587

  19. Essential Oil from Piper aduncum: Chemical Analysis, Antimicrobial Assessment, and Literature Review

    PubMed Central

    Monzote, Lianet; Scull, Ramón; Cos, Paul; Setzer, William N.

    2017-01-01

    Background: The challenge in antimicrobial chemotherapy is to find safe and selective agents with potency that will not be compromised by previously developed resistance. Terrestrial plants could provide new leads to antibacterial, antifungal, or antiprotozoal activity. Methods: The essential oil (EO) of Piper aduncum L. (Piperaceae) from Cuba was analyzed by gas chromatography-mass spectrometry (GC-MS). A cluster analysis of P. aduncum EO compositions reported in the literature was carried out. The EO was screened against a panel of microorganisms (bacteria, fungi, parasitic protozoa) as well as for cytotoxicity against human cells. In addition, a review of scientific literature and a bibliometric study were also conducted. Results: A total of 90 compounds were identified in the EO, of which camphor (17.1%), viridiflorol (14.5%), and piperitone (23.7%) were the main components. The cluster analysis revealed at least nine different chemotypes. The EO did not show notable activity against bacteria or fungi, but was active against parasitic protozoa. Conclusions: The results from this study indicate P. aduncum from Cuba is a unique chemotype, support the importance of P. aduncum EOs as medicines, and demonstrate the promise of Cuban P. aduncum EO as a chemotherapeutic agent against parasitic protozoal infections. PMID:28930264

  20. Essential Oil from Piper aduncum: Chemical Analysis, Antimicrobial Assessment, and Literature Review.

    PubMed

    Monzote, Lianet; Scull, Ramón; Cos, Paul; Setzer, William N

    2017-07-02

    Background: The challenge in antimicrobial chemotherapy is to find safe and selective agents with potency that will not be compromised by previously developed resistance. Terrestrial plants could provide new leads to antibacterial, antifungal, or antiprotozoal activity. Methods: The essential oil (EO) of Piper aduncum L. (Piperaceae) from Cuba was analyzed by gas chromatography-mass spectrometry (GC-MS). A cluster analysis of P. aduncum EO compositions reported in the literature was carried out. The EO was screened against a panel of microorganisms (bacteria, fungi, parasitic protozoa) as well as for cytotoxicity against human cells. In addition, a review of scientific literature and a bibliometric study were also conducted. Results: A total of 90 compounds were identified in the EO, of which camphor (17.1%), viridiflorol (14.5%), and piperitone (23.7%) were the main components. The cluster analysis revealed at least nine different chemotypes. The EO did not show notable activity against bacteria or fungi, but was active against parasitic protozoa. Conclusions: The results from this study indicate P. aduncum from Cuba is a unique chemotype, support the importance of P. aduncum EOs as medicines, and demonstrate the promise of Cuban P. aduncum EO as a chemotherapeutic agent against parasitic protozoal infections.

  1. Molecular clustering of patients with diabetes and pulmonary tuberculosis: A systematic review and meta-analysis.

    PubMed

    Blanco-Guillot, Francles; Delgado-Sánchez, Guadalupe; Mongua-Rodríguez, Norma; Cruz-Hervert, Pablo; Ferreyra-Reyes, Leticia; Ferreira-Guerrero, Elizabeth; Yanes-Lane, Mercedes; Montero-Campos, Rogelio; Bobadilla-Del-Valle, Miriam; Torres-González, Pedro; Ponce-de-León, Alfredo; Sifuentes-Osornio, José; Garcia-Garcia, Lourdes

    2017-01-01

    Many studies have explored the relationship between diabetes mellitus (DM) and tuberculosis (TB) demonstrating increased risk of TB among patients with DM and poor prognosis of patients suffering from the association of DM/TB. Owing to a paucity of studies addressing this question, it remains unclear whether patients with DM and TB are more likely than TB patients without DM to be grouped into molecular clusters defined according to the genotype of the infecting Mycobacterium tuberculosis bacillus. That is, whether there is convincing molecular epidemiological evidence for TB transmission among DM patients. Objective: We performed a systematic review and meta-analysis to quantitatively evaluate the propensity for patients with DM and pulmonary TB (PTB) to cluster according to the genotype of the infecting M. tuberculosis bacillus. We conducted a systematic search in MEDLINE and LILACS from 1990 to June 2016 with the following combinations of key words "tuberculosis AND transmission" OR "tuberculosis diabetes mellitus" OR "Mycobacterium tuberculosis molecular epidemiology" OR "RFLP-IS6110" OR "Spoligotyping" OR "MIRU-VNTR". Studies were included if they met the following criteria: (i) studies based on populations from defined geographical areas; (ii) use of genotyping by IS6110-restriction fragment length polymorphism (RFLP) analysis and spoligotyping or mycobacterial interspersed repetitive unit-variable number of tandem repeats (MIRU-VNTR) or other amplification methods to identify molecular clustering; (iii) genotyping and analysis of 50 or more cases of PTB; (iv) study duration of 11 months or more; (v) identification of quantitative risk factors for molecular clustering including DM; (vi) > 60% coverage of the study population; and (vii) patients with PTB confirmed bacteriologically.
The exclusion criteria were: (i) extrapulmonary TB; (ii) TB caused by nontuberculous mycobacteria; (iii) patients with PTB and HIV; (iv) pediatric PTB patients; (v) TB in closed environments (e.g. prisons, elderly homes, etc.); (vi) diabetes insipidus; and (vii) outbreak reports. The Hartung-Knapp-Sidik-Jonkman method was used to estimate the odds ratio (OR) of the association between DM and molecular clustering of cases with TB. To evaluate the degree of heterogeneity, a Q test was performed. Publication bias was examined with the Begg and Egger tests. Review Manager 5.3.5, CMA v.3 (Biostat), and the R software package were used. Selection criteria were met by six articles, which included 4076 patients with PTB, of whom 13% had DM. Twenty-seven percent of the cases were clustered. The majority of cases (48%) were reported in a study in China with 31% clustering. The highest incidence of TB occurred in two studies from China. The global OR for molecular clustering was 0.84 (95% CI 0.40-1.72). The heterogeneity between studies was moderate (I² = 55%, p = 0.05), although there was no publication bias (Begg's test p = 0.353 and Egger's test p = 0.429). There were very few studies meeting our selection criteria. The wide confidence interval indicates that there is not enough evidence to draw conclusions about the association. Clustering of patients with DM in TB transmission chains should be investigated in areas where both diseases are prevalent, with a focus on specific contexts.
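
    The heterogeneity statistics quoted in this abstract (the Q test and I²) can be sketched as follows. The log odds ratios and variances are invented for illustration, and the sketch uses simple inverse-variance weights; it does not implement the Hartung-Knapp-Sidik-Jonkman adjustment the authors applied to the pooled estimate.

```python
def heterogeneity(log_odds_ratios, variances):
    """Return the inverse-variance pooled log OR, Cochran's Q, and I-squared in percent."""
    weights = [1.0 / v for v in variances]
    pooled = sum(w * y for w, y in zip(weights, log_odds_ratios)) / sum(weights)
    # Q is the weighted sum of squared deviations from the pooled estimate.
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, log_odds_ratios))
    df = len(log_odds_ratios) - 1
    # I-squared is the share of Q in excess of its degrees of freedom, floored at zero.
    i_squared = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    return pooled, q, i_squared


# Hypothetical study-level log odds ratios and their variances (not the review's data).
pooled, q, i_squared = heterogeneity([0.0, 0.4, -0.4], [0.04, 0.04, 0.04])
```

    Under the null hypothesis of homogeneity, Q is usually compared against a chi-squared distribution with (number of studies - 1) degrees of freedom; that comparison is omitted here to keep the sketch dependency-free.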

  2. Cluster analysis in phenotyping a Portuguese population.

    PubMed

    Loureiro, C C; Sa-Couto, P; Todo-Bom, A; Bousquet, J

    2015-09-03

    Unbiased cluster analysis using clinical parameters has identified asthma phenotypes. Adding inflammatory biomarkers to this analysis provided a better insight into the disease mechanisms. This approach has not yet been applied to asthmatic Portuguese patients. To identify phenotypes of asthma using cluster analysis in a Portuguese asthmatic population treated in secondary medical care. Consecutive patients with asthma were recruited from the outpatient clinic. Patients were optimally treated according to GINA guidelines and enrolled in the study. Procedures were performed according to a standard evaluation of asthma. Phenotypes were identified by cluster analysis using Ward's clustering method. Of the 72 patients enrolled, 57 had full data and were included for cluster analysis. Patients were distributed across 5 clusters, described as follows: cluster (C) 1, early onset mild allergic asthma; C2, moderate allergic asthma, with long evolution, female prevalence and mixed inflammation; C3, allergic brittle asthma in young females with early disease onset and no evidence of inflammation; C4, severe asthma in obese females with late disease onset, highly symptomatic despite low Th2 inflammation; C5, severe asthma with chronic airflow obstruction, late disease onset and eosinophilic inflammation. In our study population, the identified clusters were mainly coincident with those of other larger-scale cluster analyses. Variables such as age at disease onset, obesity, lung function, FeNO (Th2 biomarker) and disease severity were important for cluster distinction. Copyright © 2015. Published by Elsevier España, S.L.U.

  3. Phenotypes Determined by Cluster Analysis in Moderate to Severe Bronchial Asthma.

    PubMed

    Youroukova, Vania M; Dimitrova, Denitsa G; Valerieva, Anna D; Lesichkova, Spaska S; Velikova, Tsvetelina V; Ivanova-Todorova, Ekaterina I; Tumangelova-Yuzeir, Kalina D

    2017-06-01

    Bronchial asthma is a heterogeneous disease that includes various subtypes. They may share similar clinical characteristics, but probably have different pathological mechanisms. To identify phenotypes using cluster analysis in moderate to severe bronchial asthma and to compare differences in clinical, physiological, immunological and inflammatory data between the clusters. Forty adult patients with moderate to severe bronchial asthma out of exacerbation were included. All underwent clinical assessment, anthropometric measurements, skin prick testing, standard spirometry and measurement of the fraction of exhaled nitric oxide. Blood eosinophil count, serum total IgE and periostin levels were determined. A two-step cluster approach, a hierarchical clustering method and k-means analysis were used for identification of the clusters. We have identified four clusters. Cluster 1 (n=14) - late-onset, non-atopic asthma with impaired lung function, Cluster 2 (n=13) - late-onset, atopic asthma, Cluster 3 (n=6) - late-onset, aspirin sensitivity, eosinophilic asthma, and Cluster 4 (n=7) - early-onset, atopic asthma. Our study is the first in Bulgaria in which cluster analysis is applied to asthmatic patients. We identified four clusters. The variables with greatest force for differentiation in our study were: age of asthma onset, duration of disease, atopy, smoking, blood eosinophils, nonsteroidal anti-inflammatory drugs hypersensitivity, baseline FEV1/FVC and symptoms severity. Our results support the concept of heterogeneity of bronchial asthma and demonstrate that cluster analysis can be a useful tool for phenotyping of disease and personalized approach to the treatment of patients.

  4. Comparison of tests for spatial heterogeneity on data with global clustering patterns and outliers

    PubMed Central

    Jackson, Monica C; Huang, Lan; Luo, Jun; Hachey, Mark; Feuer, Eric

    2009-01-01

    Background The ability to evaluate geographic heterogeneity of cancer incidence and mortality is important in cancer surveillance. Many statistical methods for evaluating global clustering and local cluster patterns have been developed and examined by many simulation studies. However, the performance of these methods on two extreme cases (global clustering evaluation and local anomaly (outlier) detection) has not been thoroughly investigated. Methods We compare methods for global clustering evaluation including Tango's Index, Moran's I, and Oden's I*pop; and cluster detection methods such as local Moran's I and SaTScan elliptic version on simulated count data that mimic global clustering patterns and outliers for cancer cases in the continental United States. We examine the power and precision of the selected methods in the purely spatial analysis. We illustrate Tango's MEET and SaTScan elliptic version on 1987-2004 HIV and 1950-1969 lung cancer mortality data in the United States. Results For simulated data with outlier patterns, Tango's MEET, Moran's I and I*pop had powers less than 0.2, and SaTScan had powers around 0.97. For simulated data with global clustering patterns, Tango's MEET and I*pop (with 50% of total population as the maximum search window) had powers close to 1. SaTScan had powers around 0.7-0.8 and Moran's I had powers around 0.2-0.3. In the real data example, Tango's MEET indicated the existence of global clustering patterns in both the HIV and lung cancer mortality data. SaTScan found a large cluster for HIV mortality rates, which is consistent with the finding from Tango's MEET. SaTScan also found clusters and outliers in the lung cancer mortality data. Conclusion SaTScan elliptic version is more efficient for outlier detection compared with the other methods evaluated in this article. Tango's MEET and Oden's I*pop perform best in global clustering scenarios among the selected methods.
SaTScan should be used with caution on data with global clustering patterns, since it may reveal an incorrect spatial pattern even though it has enough power to reject a null hypothesis of homogeneous relative risk. Tango's method should be used for global clustering evaluation instead of SaTScan. PMID:19822013
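
    Of the global clustering statistics compared in this abstract, Moran's I is the simplest to write down. The sketch below computes it for a toy chain of four regions with binary adjacency weights; the values and the weight matrix are hypothetical, and neither the population adjustment of Oden's I*pop nor any significance testing is shown.

```python
def morans_i(values, weights):
    """Global Moran's I for a list of values and a square spatial weight matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    s0 = sum(sum(row) for row in weights)  # total of all weights
    cross = sum(weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n))
    return (n / s0) * cross / sum(d * d for d in dev)


# Hypothetical layout: four regions in a row, each adjacent only to its neighbours.
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]

smooth_gradient = morans_i([1, 2, 3, 4], adjacency)  # similar neighbours
checkerboard = morans_i([1, 4, 1, 4], adjacency)     # dissimilar neighbours
```

    A positive value indicates that neighbouring regions tend to have similar values and a negative value that they tend to differ; the function assumes the values are not all identical, since the denominator would then be zero.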

  5. Cross-scale analysis of cluster correspondence using different operational neighborhoods

    NASA Astrophysics Data System (ADS)

    Lu, Yongmei; Thill, Jean-Claude

    2008-09-01

    Cluster correspondence analysis examines the spatial autocorrelation of multi-location events at the local scale. This paper argues that patterns of cluster correspondence are highly sensitive to the definition of operational neighborhoods that form the spatial units of analysis. A subset of multi-location events is examined for cluster correspondence if they are associated with the same operational neighborhood. This paper discusses the construction of operational neighborhoods for cluster correspondence analysis based on the spatial properties of the underlying zoning system and the scales at which the zones are aggregated into neighborhoods. Impacts of this construction on the degree of cluster correspondence are also analyzed. Empirical analyses of cluster correspondence between paired vehicle theft and recovery locations are conducted on different zoning methods and across a series of geographic scales and the dynamics of cluster correspondence patterns are discussed.

  6. First Description of a Cluster of Acute/Subacute Paracoccidioidomycosis Cases and Its Association with a Climatic Anomaly

    PubMed Central

    Silva, Maria Elisa Siqueira; Bagagli, Eduardo; Marques, Silvio Alencar; Mendes, Rinaldo Poncio

    2010-01-01

    Background Identifying clusters of acute paracoccidioidomycosis cases could potentially help in identifying the environmental factors that influence the incidence of this mycosis. However, unlike other endemic mycoses, there are no published reports of clusters of paracoccidioidomycosis. Methodology/Principal Findings A retrospective cluster detection test was applied to verify if an excess of acute form (AF) paracoccidioidomycosis cases in time and/or space occurred in Botucatu, an endemic area in São Paulo State. The scan-test SaTScan v7.0.3 was set to find clusters for the maximum temporal period of 1 year. The temporal test indicated a significant cluster in 1985 (P<0.005). This cluster comprised 10 cases, although 2.19 were expected for this year in this area. Age and clinical presentation of these cases were typical of AF paracoccidioidomycosis. The space-time test confirmed the temporal cluster in 1985 and showed the localities where the risk was higher in that year. The cluster suggests that some particularities took place in the antecedent years in those localities. Analysis of climate variables showed that soil water storage was atypically high in 1982/83 (∼2.11/2.5 SD above mean), and the absolute air humidity in 1984, the year preceding the cluster, was much higher than normal (∼1.6 SD above mean), conditions that may have favored, respectively, antecedent fungal growth in the soil and conidia liberation in 1984, the probable year of exposure. These climatic anomalies in this area were due to the 1982/83 El Niño event, the strongest in the last 50 years. Conclusions/Significance We describe the first cluster of AF paracoccidioidomycosis, which was potentially linked to a climatic anomaly caused by the 1982/83 El Niño Southern Oscillation.
This finding is important because it may help to clarify the conditions that favor Paracoccidioides brasiliensis survival and growth in the environment and that enhance human exposure, thus allowing the development of preventive measures. PMID:20361032
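
    To give a feel for how unusual 10 observed cases are when 2.19 are expected, the sketch below computes a plain Poisson upper-tail probability. This is only a back-of-the-envelope check under the assumption that counts are Poisson distributed; it is not the scan statistic's Monte Carlo P value, which also accounts for searching over many candidate time windows.

```python
import math


def poisson_upper_tail(observed, expected):
    """P(X >= observed) for X ~ Poisson(expected), via the complement of the lower tail."""
    below = sum(
        math.exp(-expected) * expected ** k / math.factorial(k)
        for k in range(observed)
    )
    return 1.0 - below


# Figures quoted in the abstract: 10 cases observed, 2.19 expected.
tail = poisson_upper_tail(10, 2.19)
```

    Because the scan test adjusts for multiple candidate windows, its reported P value should not be expected to match this unadjusted tail probability.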

  7. Characterization of Patients Who Present With Insomnia: Is There Room for a Symptom Cluster-Based Approach?

    PubMed Central

    Crawford, Megan R.; Chirinos, Diana A.; Iurcotta, Toni; Edinger, Jack D.; Wyatt, James K.; Manber, Rachel; Ong, Jason C.

    2017-01-01

    Study Objectives: This study examined empirically derived symptom cluster profiles among patients who present with insomnia using clinical data and polysomnography. Methods: Latent profile analysis was used to identify symptom cluster profiles of 175 individuals (63% female) with insomnia disorder based on total scores on validated self-report instruments of daytime and nighttime symptoms (Insomnia Severity Index, Glasgow Sleep Effort Scale, Fatigue Severity Scale, Beliefs and Attitudes about Sleep, Epworth Sleepiness Scale, Pre-Sleep Arousal Scale), mean values from a 7-day sleep diary (sleep onset latency, wake after sleep onset, and sleep efficiency), and total sleep time derived from an in-laboratory PSG. Results: The best-fitting model had three symptom cluster profiles: “High Subjective Wakefulness” (HSW), “Mild Insomnia” (MI) and “Insomnia-Related Distress” (IRD). The HSW symptom cluster profile (26.3% of the sample) reported high wake after sleep onset, high sleep onset latency, and low sleep efficiency. Despite relatively comparable PSG-derived total sleep time, they reported greater levels of daytime sleepiness. The MI symptom cluster profile (45.1%) reported the least disturbance in the sleep diary and questionnaires and had the highest sleep efficiency. The IRD symptom cluster profile (28.6%) reported the highest mean scores on the insomnia-related distress measures (eg, sleep effort and arousal) and waking correlates (fatigue). Covariates associated with symptom cluster membership were older age for the HSW profile, greater obstructive sleep apnea severity for the MI profile, and, when adjusting for obstructive sleep apnea severity, being overweight/obese for the IRD profile. Conclusions: The heterogeneous nature of insomnia disorder is captured by this data-driven approach to identify symptom cluster profiles. 
The adaptation of a symptom cluster-based approach could guide tailored patient-centered management of patients presenting with insomnia, and enhance patient care. Citation: Crawford MR, Chirinos DA, Iurcotta T, Edinger JD, Wyatt JK, Manber R, Ong JC. Characterization of patients who present with insomnia: is there room for a symptom cluster-based approach? J Clin Sleep Med. 2017;13(7):911–921. PMID:28633722

  8. Modest validity and fair reproducibility of dietary patterns derived by cluster analysis.

    PubMed

    Funtikova, Anna N; Benítez-Arciniega, Alejandra A; Fitó, Montserrat; Schröder, Helmut

    2015-03-01

    Cluster analysis is widely used to analyze dietary patterns. We aimed to analyze the validity and reproducibility of the dietary patterns defined by cluster analysis derived from a food frequency questionnaire (FFQ). We hypothesized that the dietary patterns derived by cluster analysis have fair to modest reproducibility and validity. Dietary data were collected from 107 individuals from a population-based survey, by an FFQ at baseline (FFQ1) and after 1 year (FFQ2), and by twelve 24-hour dietary recalls (24-HDR). Repeatability and validity were measured by comparing clusters obtained by the FFQ1 and FFQ2 and by the FFQ2 and 24-HDR (reference method), respectively. Cluster analysis identified a "fruits & vegetables" and a "meat" pattern in each dietary data source. Cluster membership was concordant for 66.7% of participants in FFQ1 and FFQ2 (reproducibility), and for 67.0% in FFQ2 and 24-HDR (validity). Spearman correlation analysis showed reasonable reproducibility, especially in the "fruits & vegetables" pattern, and lower validity also especially in the "fruits & vegetables" pattern. κ statistic revealed a fair validity and reproducibility of clusters. Our findings indicate a reasonable reproducibility and fair to modest validity of dietary patterns derived by cluster analysis. Copyright © 2015 Elsevier Inc. All rights reserved.
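
    The two agreement measures named in this abstract, percent concordance of cluster membership and the κ statistic, can be sketched as below. The six participants and their cluster labels are made up; the sketch assumes unweighted Cohen's κ, which the abstract does not specify.

```python
from collections import Counter


def agreement_and_kappa(labels_a, labels_b):
    """Return the observed agreement and unweighted Cohen's kappa for two labelings."""
    n = len(labels_a)
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    # Agreement expected by chance, from the marginal label frequencies.
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical cluster memberships for six participants at two time points.
ffq1 = ["fruit_veg", "fruit_veg", "fruit_veg", "meat", "meat", "meat"]
ffq2 = ["fruit_veg", "fruit_veg", "meat", "meat", "meat", "fruit_veg"]

observed, kappa = agreement_and_kappa(ffq1, ffq2)
```

    The example shows why raw concordance overstates agreement: with two equally sized clusters, half of the matches would be expected by chance alone, and κ discounts them. The function is undefined when chance agreement equals 1 (every participant in the same cluster in both labelings).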

  9. Cluster Analysis to Identify Possible Subgroups in Tinnitus Patients.

    PubMed

    van den Berge, Minke J C; Free, Rolien H; Arnold, Rosemarie; de Kleine, Emile; Hofman, Rutger; van Dijk, J Marc C; van Dijk, Pim

    2017-01-01

    In tinnitus treatment, there is a tendency to shift from a "one size fits all" to a more individual, patient-tailored approach. Insight into the heterogeneity of the tinnitus spectrum might improve the management of tinnitus patients in terms of choice of treatment and identification of patients with severe mental distress. The goal of this study was to identify subgroups in a large group of tinnitus patients. Data were collected from patients with severe tinnitus complaints visiting our tertiary referral tinnitus care group at the University Medical Center Groningen. Patient-reported and physician-reported variables were collected during their visit to our clinic. Cluster analyses were used to characterize subgroups. For the selection of the right variables to enter in the cluster analysis, two approaches were used: (1) variable reduction with principal component analysis and (2) variable selection based on expert opinion. Various variables of 1,783 tinnitus patients were included in the analyses. Cluster analysis (1) included 976 patients and resulted in a four-cluster solution. The effect of external influences was the most discriminative between the groups, or clusters, of patients. The "silhouette measure" of the cluster outcome was low (0.2), indicating a "no substantial" cluster structure. Cluster analysis (2) included 761 patients and resulted in a three-cluster solution, comparable to the first analysis. Again, a "no substantial" cluster structure was found (0.2). Two cluster analyses on a large database of tinnitus patients revealed that clusters of patients are mostly formed by a different response to external influences on their disease. However, both cluster outcomes based on this dataset showed poor stability, suggesting that our tinnitus population comprises a continuum rather than a number of clearly defined subgroups.
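
    The "silhouette measure" cited in this abstract can be sketched as follows. This is a minimal illustration that assumes Euclidean distance and numeric feature vectors; the study's own software and distance measure are not stated here, and the function name and toy points are hypothetical.

```python
import math


def silhouette_mean(points, labels):
    """Mean silhouette coefficient of a clustering, using Euclidean distance."""
    clusters = {}
    for idx, lab in enumerate(labels):
        clusters.setdefault(lab, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("the silhouette needs at least two clusters")
    scores = []
    for i, lab in enumerate(labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster.
        a = sum(math.dist(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b: mean distance to the members of the nearest other cluster.
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != lab
        )
        denom = max(a, b)
        scores.append((b - a) / denom if denom > 0 else 0.0)
    return sum(scores) / len(scores)


# Hypothetical two-dimensional points (illustration only).
toy_points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
well_separated = silhouette_mean(toy_points, [0, 0, 1, 1])
badly_assigned = silhouette_mean(toy_points, [0, 1, 0, 1])
```

    Values near 1 indicate compact, well-separated clusters, while values near 0 or below indicate overlapping or misassigned groups, which is the situation the authors describe for a silhouette of 0.2.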

  10. Ecological tolerances of Miocene larger benthic foraminifera from Indonesia

    NASA Astrophysics Data System (ADS)

    Novak, Vibor; Renema, Willem

    2018-01-01

    To provide a comprehensive palaeoenvironmental reconstruction based on larger benthic foraminifera (LBF), a quantitative analysis of their assemblage composition is needed. Besides microfacies analysis which includes environmental preferences of foraminiferal taxa, statistical analyses should also be employed. Therefore, detrended correspondence analysis and cluster analysis were performed on relative abundance data of identified LBF assemblages deposited in mixed carbonate-siliciclastic (MCS) systems and blue-water (BW) settings. Studied MCS system localities include ten sections from the central part of the Kutai Basin in East Kalimantan, ranging from late Burdigalian to Serravallian age. The BW samples were collected from eleven sections of the Bulu Formation on Central Java, dated as Serravallian. Results from detrended correspondence analysis reveal significant differences between these two environmental settings. Cluster analysis produced five clusters of samples; clusters 1 and 2 comprise dominantly MCS samples, clusters 3 and 4 with dominance of BW samples, and cluster 5 showing a mixed composition with both MCS and BW samples. The results of cluster analysis were afterwards subjected to indicator species analysis resulting in the interpretation that generated three groups among LBF taxa: typical assemblage indicators, regularly occurring taxa and rare taxa. By interpreting the results of detrended correspondence analysis, cluster analysis and indicator species analysis, along with environmental preferences of identified LBF taxa, a palaeoenvironmental model is proposed for the distribution of LBF in Miocene MCS systems and adjacent BW settings of Indonesia.

  11. Identification of five chronic obstructive pulmonary disease subgroups with different prognoses in the ECLIPSE cohort using cluster analysis.

    PubMed

    Rennard, Stephen I; Locantore, Nicholas; Delafont, Bruno; Tal-Singer, Ruth; Silverman, Edwin K; Vestbo, Jørgen; Miller, Bruce E; Bakke, Per; Celli, Bartolomé; Calverley, Peter M A; Coxson, Harvey; Crim, Courtney; Edwards, Lisa D; Lomas, David A; MacNee, William; Wouters, Emiel F M; Yates, Julie C; Coca, Ignacio; Agustí, Alvar

    2015-03-01

    Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease that likely includes clinically relevant subgroups. To identify subgroups of COPD in ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints) subjects using cluster analysis and to assess clinically meaningful outcomes of the clusters during 3 years of longitudinal follow-up. Factor analysis was used to reduce 41 variables determined at recruitment in 2,164 patients with COPD to 13 main factors, and the variables with the highest loading were used for cluster analysis. Clusters were evaluated for their relationship with clinically meaningful outcomes during 3 years of follow-up. The relationships among clinical parameters were evaluated within clusters. Five subgroups were distinguished using cross-sectional clinical features. These groups differed regarding outcomes. Cluster A included patients with milder disease and had fewer deaths and hospitalizations. Cluster B had less systemic inflammation at baseline but had notable changes in health status and emphysema extent. Cluster C had many comorbidities, evidence of systemic inflammation, and the highest mortality. Cluster D had low FEV1, severe emphysema, and the highest exacerbation and COPD hospitalization rate. Cluster E was intermediate for most variables and may represent a mixed group that includes further clusters. The relationships among clinical variables within clusters differed from that in the entire COPD population. Cluster analysis using baseline data in ECLIPSE identified five COPD subgroups that differ in outcomes and inflammatory biomarkers and show different relationships between clinical parameters, suggesting the clusters represent clinically and biologically different subtypes of COPD.

  12. Infalling groups and galaxy transformations in the cluster A2142

    NASA Astrophysics Data System (ADS)

    Einasto, Maret; Deshev, Boris; Lietzen, Heidi; Kipper, Rain; Tempel, Elmo; Park, Changbom; Gramann, Mirt; Heinämäki, Pekka; Saar, Enn; Einasto, Jaan

    2018-03-01

    Context. Superclusters of galaxies provide dynamical environments for the study of the formation and evolution of structures in the cosmic web from galaxies, to the richest galaxy clusters, and superclusters themselves. Aims: We study galaxy populations and search for possible merging substructures in the rich galaxy cluster A2142 in the collapsing core of the supercluster SCl A2142, which may give rise to radio and X-ray structures in the cluster, and affect galaxy properties of this cluster. Methods: We used normal mixture modelling to select substructure of the cluster A2142. We compared alignments of the cluster, its brightest galaxies (hereafter BCGs), subclusters, and supercluster axes. The projected phase space (PPS) diagram and clustercentric distributions are used to analyse the dynamics of the cluster and study the distribution of various galaxy populations in the cluster and subclusters. Results: We find several infalling galaxy groups and subclusters. The cluster, supercluster, BCGs, and one infalling subcluster are all aligned. Their orientation is correlated with the alignment of the radio and X-ray haloes of the cluster. Galaxy populations in the main cluster and in the outskirts subclusters are different. Galaxies in the centre of the main cluster at the clustercentric distances 0.5 h⁻¹ Mpc (Dc/Rvir < 0.5, Rvir = 0.9 h⁻¹ Mpc) have older stellar populations (with the median age of 10-11 Gyr) than galaxies at larger clustercentric distances. Star-forming and recently quenched galaxies are located mostly at the clustercentric distances Dc ≈ 1.8 h⁻¹ Mpc, where subclusters fall into the cluster and the properties of galaxies change rapidly. In this region the median age of stellar populations of galaxies is about 2 Gyr. Galaxies in A2142 on average have higher stellar masses, lower star formation rates, and redder colours than galaxies in rich groups.
The total mass in infalling groups and subclusters is M ≈ 6 × 10¹⁴ h⁻¹ M⊙, that is approximately half of the mass of the cluster. This mass is sufficient for the mass growth of the cluster from redshift z = 0.5 (half-mass epoch) to the present. Conclusions: Our analysis suggests that the cluster A2142 has formed as a result of past and present mergers and infallen groups, predominantly along the supercluster axis. Mergers cause complex radio and X-ray structure of the cluster and affect the properties of galaxies in the cluster, especially at the boundaries of the cluster in the infall region. Explaining the differences between galaxy populations, mass, and richness of A2142, and other groups and clusters may lead to better insight about the formation and evolution of rich galaxy clusters.

  13. Interactive visual exploration and refinement of cluster assignments.

    PubMed

    Kern, Michael; Lex, Alexander; Gehlenborg, Nils; Johnson, Chris R

    2017-09-12

    With ever-increasing amounts of data produced in biology research, scientists are in need of efficient data analysis methods. Cluster analysis, combined with visualization of the results, is one such method that can be used to make sense of large data volumes. At the same time, cluster analysis is known to be imperfect and depends on the choice of algorithms, parameters, and distance measures. Most clustering algorithms don't properly account for ambiguity in the source data, as records are often assigned to discrete clusters, even if an assignment is unclear. While there are metrics and visualization techniques that allow analysts to compare clusterings or to judge cluster quality, there is no comprehensive method that allows analysts to evaluate, compare, and refine cluster assignments based on the source data, derived scores, and contextual data. In this paper, we introduce a method that explicitly visualizes the quality of cluster assignments, allows comparisons of clustering results and enables analysts to manually curate and refine cluster assignments. Our methods are applicable to matrix data clustered with partitional, hierarchical, and fuzzy clustering algorithms. Furthermore, we enable analysts to explore clustering results in context of other data, for example, to observe whether a clustering of genomic data results in a meaningful differentiation in phenotypes. Our methods are integrated into Caleydo StratomeX, a popular, web-based, disease subtype analysis tool. We show in a usage scenario that our approach can reveal ambiguities in cluster assignments and produce improved clusterings that better differentiate genotypes and phenotypes.

  14. Somatotyping using 3D anthropometry: a cluster analysis.

    PubMed

    Olds, Tim; Daniell, Nathan; Petkov, John; David Stewart, Arthur

    2013-01-01

    Somatotyping is the quantification of human body shape, independent of body size. Hitherto, somatotyping (including the most popular method, the Heath-Carter system) has been based on subjective visual ratings, sometimes supported by surface anthropometry. This study used data derived from three-dimensional (3D) whole-body scans as inputs for cluster analysis to objectively derive clusters of similar body shapes. Twenty-nine dimensions normalised for body size were measured on a purposive sample of 301 adults aged 17-56 years who had been scanned using a Vitus Smart laser scanner. K-means Cluster Analysis with v-fold cross-validation was used to determine shape clusters. Three male and three female clusters emerged, and were visualised using those scans closest to the cluster centroid and a caricature defined by doubling the difference between the average scan and the cluster centroid. The male clusters were decidedly endomorphic (high fatness), ectomorphic (high linearity), and endo-mesomorphic (a mixture of fatness and muscularity). The female clusters were clearly endomorphic, ectomorphic, and the ecto-mesomorphic (a mixture of linearity and muscularity). An objective shape quantification procedure combining 3D scanning and cluster analysis yielded shape clusters strikingly similar to traditional somatotyping.
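
    The caricature construction mentioned in this abstract can be read as an extrapolation away from the average shape. The sketch below assumes that "doubling the difference" means average + 2 × (centroid − average), applied dimension by dimension; that reading, the function name, and the toy values are assumptions rather than details confirmed by the abstract.

```python
def caricature(average_shape, centroid, factor=2.0):
    """Exaggerate a cluster centroid by scaling its offset from the average shape."""
    if len(average_shape) != len(centroid):
        raise ValueError("both shapes must have the same number of dimensions")
    # factor=2.0 doubles the centroid's deviation from the average.
    return [avg + factor * (c - avg) for avg, c in zip(average_shape, centroid)]


# Hypothetical size-normalised dimensions (illustration only).
toy_caricature = caricature([1.0, 2.0], [1.5, 1.0])
```

    With a factor of 1.0 the function returns the centroid itself, so the factor controls how strongly the distinguishing features of a cluster are exaggerated.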

  15. Chromatographic and computational assessment of lipophilicity using sum of ranking differences and generalized pair-correlation.

    PubMed

    Andrić, Filip; Héberger, Károly

    2015-02-06

    Lipophilicity (logP) represents one of the most studied and most frequently used fundamental physicochemical properties. At present there are several possibilities for its quantitative expression and many of them stem from chromatographic experiments. Numerous attempts have been made to compare different computational methods, chromatographic methods vs. computational approaches, as well as chromatographic methods and direct shake-flask procedure, without definite results or with findings that are not generally accepted. In the present work numerous chromatographically derived lipophilicity measures in combination with diverse computational methods were ranked and clustered using the novel variable discrimination and ranking approaches based on the sum of ranking differences and the generalized pair correlation method. Available literature logP data measured on HILIC, and classical reversed-phase combining different classes of compounds have been compared with most frequently used multivariate data analysis techniques (principal component and hierarchical cluster analysis) as well as with the conclusions in the original sources. Chromatographic lipophilicity measures obtained under typical reversed-phase conditions outperform the majority of computationally estimated logPs. Conversely, in the case of HILIC none of the many proposed chromatographic indices outperforms any of the computationally assessed logPs. Only two of them (logkmin and kmin) may be selected as recommended chromatographic lipophilicity measures. Both ranking approaches, sum of ranking differences and generalized pair correlation method, although based on different backgrounds, provide highly similar variable ordering and grouping leading to the same conclusions. Copyright © 2015. Published by Elsevier B.V.

  16. Clusters of Occupations Based on Systematically Derived Work Dimensions: An Exploratory Study.

    ERIC Educational Resources Information Center

    Cunningham, J. W.; And Others

    The study explored the feasibility of deriving an educationally relevant occupational cluster structure based on Occupational Analysis Inventory (OAI) work dimensions. A hierarchical cluster analysis was applied to the factor score profiles of 814 occupations on 22 higher-order OAI work dimensions. From that analysis, 73 occupational clusters were…

  17. Using cluster analysis to identify phenotypes and validation of mortality in men with COPD.

    PubMed

    Chen, Chiung-Zuei; Wang, Liang-Yi; Ou, Chih-Ying; Lee, Cheng-Hung; Lin, Chien-Chung; Hsiue, Tzuen-Ren

    2014-12-01

    Cluster analysis has been proposed to examine phenotypic heterogeneity in chronic obstructive pulmonary disease (COPD). The aim of this study was to use cluster analysis to define COPD phenotypes and validate them by assessing their relationship with mortality. Male subjects with COPD were recruited to identify and validate COPD phenotypes. Seven variables were assessed for their relevance to COPD: age, FEV(1) % predicted, BMI, history of severe exacerbations, mMRC, SpO(2), and Charlson index. COPD groups were identified by cluster analysis and validated prospectively against mortality during a 4-year follow-up. Analysis of 332 COPD subjects identified five clusters from cluster A to cluster E. Assessment of the predictive validity of these clusters of COPD showed that cluster E patients had higher all cause mortality (HR 18.3, p < 0.0001), and respiratory cause mortality (HR 21.5, p < 0.0001) than those in the other four groups. Cluster E patients also had higher all cause mortality (HR 14.3, p = 0.0002) and respiratory cause mortality (HR 10.1, p = 0.0013) than patients in cluster D alone. COPD patients with severe airflow limitation, many symptoms, and a history of frequent severe exacerbations were a novel and distinct clinical phenotype predicting mortality in men with COPD.

  18. Salience Network and Depressive Severities in Parkinson's Disease with Mild Cognitive Impairment: A Structural Covariance Network Analysis.

    PubMed

    Chang, Ya-Ting; Lu, Cheng-Hsien; Wu, Ming-Kung; Hsu, Shih-Wei; Huang, Chi-Wei; Chang, Wen-Neng; Lien, Chia-Yi; Lee, Jun-Jun; Chang, Chiung-Chih

    2017-01-01

    Purpose: In Parkinson's disease with mild cognitive impairment (PD-MCI), we investigated the clinical significance of salience network (SN) in depression and cognitive performance. Methods: Seventy seven PD-MCI patients that fulfilled multi-domain and non-amnestic subtype were included. Gray matter structural covariance networks were constructed by 3D T1-magnetic resonance imaging and seed based analysis. The patients were divided into two groups by psychiatric interviews and screening of Geriatric Depression Scale (GDS): PD-MCI with depression (PD-MCI-D) or without depression (PD-MCI-ND). The seed or peak cluster volume, or the significant differences in the regression slopes in each seed-peak cluster correlation, were used to evaluate the significance with the neurobehavioral scores. Results: This study is the first to demonstrate that the PD-MCI-ND group presented a larger number of voxels of structural covariance in SN than the PD-MCI-D group. The right fronto-insular seed volumes and the peak cluster of left lingual gyrus showed significant inverse correlation with the Geriatric Depression Scale (GDS; r = -0.231, P = 0.046). Conclusions: This study is the first to validate the clinical significance of the SN in PD-MCI-D. The right insular seed value and the SN correlated with the severity of depression in PD-MCI.

  19. Salience Network and Depressive Severities in Parkinson’s Disease with Mild Cognitive Impairment: A Structural Covariance Network Analysis

    PubMed Central

    Chang, Ya-Ting; Lu, Cheng-Hsien; Wu, Ming-Kung; Hsu, Shih-Wei; Huang, Chi-Wei; Chang, Wen-Neng; Lien, Chia-Yi; Lee, Jun-Jun; Chang, Chiung-Chih

    2018-01-01

    Purpose: In Parkinson’s disease with mild cognitive impairment (PD-MCI), we investigated the clinical significance of salience network (SN) in depression and cognitive performance. Methods: Seventy seven PD-MCI patients that fulfilled multi-domain and non-amnestic subtype were included. Gray matter structural covariance networks were constructed by 3D T1-magnetic resonance imaging and seed based analysis. The patients were divided into two groups by psychiatric interviews and screening of Geriatric Depression Scale (GDS): PD-MCI with depression (PD-MCI-D) or without depression (PD-MCI-ND). The seed or peak cluster volume, or the significant differences in the regression slopes in each seed-peak cluster correlation, were used to evaluate the significance with the neurobehavioral scores. Results: This study is the first to demonstrate that the PD-MCI-ND group presented a larger number of voxels of structural covariance in SN than the PD-MCI-D group. The right fronto-insular seed volumes and the peak cluster of left lingual gyrus showed significant inverse correlation with the Geriatric Depression Scale (GDS; r = -0.231, P = 0.046). Conclusions: This study is the first to validate the clinical significance of the SN in PD-MCI-D. The right insular seed value and the SN correlated with the severity of depression in PD-MCI. PMID:29375361

  20. Resuscitation Outcomes Consortium (ROC) PRIMED Cardiac Arrest Trial Methods Part 2: Rationale and Methodology for “Analyze Later” Protocol

    PubMed Central

    Stiell, Ian G.; Callaway, Clif; Davis, Dan; Terndrup, Tom; Powell, Judy; Cook, Andrea; Kudenchuk, Peter J.; Daya, Mohamud; Kerber, Richard; Idris, Ahamed; Morrison, Laurie J.; Aufderheide, Tom

    2008-01-01

    Objective The primary objective of the trial is to compare survival to hospital discharge with Modified Rankin Score (MRS) ≤3 between a strategy that prioritizes a specified period of CPR before rhythm analysis (Analyze Later) versus a strategy of minimal CPR followed by early rhythm analysis (Analyze Early) in patients with out-of-hospital cardiac arrest. Methods   Design Cluster randomized trial with cluster units defined by geographic region, or monitor/defibrillator machine. Population Adults treated by Emergency Medical Service (EMS) providers for non-traumatic out-of-hospital cardiac arrest not witnessed by EMS. Setting EMS systems participating in the Resuscitation Outcomes Consortium and agreeing to cluster randomization to the Analyze Later versus Analyze Early intervention in a crossover fashion. Sample Size Based on a two-sided significance level of 0.05, a maximum of 13,239 evaluable patients will allow statistical power of 0.996 to detect a hypothesized improvement in the probability of survival to discharge with MRS ≤ 3 rate from 5.41% after Analyze Early to 7.45% after Analyze Later (2.04% absolute increase in primary outcome). Conclusion If this trial demonstrates a significant improvement in survival with a strategy of Analyze Later, it is estimated that 4,000 premature deaths from cardiac arrest would be averted annually in North America alone. PMID:18487004
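
    The sample-size statement in this abstract can be explored with a simplified power calculation. The sketch below uses the normal approximation for a two-sided comparison of two independent proportions with equal allocation. It ignores the cluster randomization, the crossover design, and any interim monitoring, so it is not expected to reproduce the protocol's stated power exactly. The function name is hypothetical.

```python
import math
from statistics import NormalDist


def power_two_proportions(p1, p2, n_total, alpha=0.05):
    """Approximate power of a two-sided z-test comparing two proportions."""
    n_per_arm = n_total / 2  # assumes equal allocation to the two strategies
    se = math.sqrt(p1 * (1 - p1) / n_per_arm + p2 * (1 - p2) / n_per_arm)
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    z_effect = abs(p2 - p1) / se
    # Probability of exceeding the critical value in either tail.
    return NormalDist().cdf(z_effect - z_crit) + NormalDist().cdf(-z_effect - z_crit)


# Survival rates and sample size quoted in the abstract.
approx_power = power_two_proportions(0.0541, 0.0745, 13239)
```

    Because clustering inflates the variance of the estimated difference, a design-adjusted calculation would give lower power than this unadjusted approximation for the same number of patients.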

  1. On the clustering of multidimensional pictorial data

    NASA Technical Reports Server (NTRS)

    Bryant, J. D. (Principal Investigator)

    1979-01-01

    Obvious approaches to reducing the cost (in computer resources) of applying current clustering techniques to the problem of remote sensing are discussed. The use of spatial information in finding fields and in classifying mixture pixels is examined, and the AMOEBA clustering program is described. Internally, a pattern recognition program, from without, AMOEBA appears to be an unsupervised clustering program. It is fast and automatic. No choices (such as arbitrary thresholds to set split/combine sequences) need be made. The problem of finding the number of clusters is solved automatically. At the conclusion of the program, all points in the scene are classified; however, a provision is included for a reject classification of some points which, within the theoretical framework, cannot rationally be assigned to any cluster.

  2. Hydration of a Large Anionic Charge Distribution - Naphthalene-Water Cluster Anions

    NASA Astrophysics Data System (ADS)

    Weber, J. Mathias; Adams, Christopher L.

    2010-06-01

    We report the infrared spectra of anionic clusters of naphthalene with up to three water molecules. Comparison of the experimental infrared spectra with theoretically predicted spectra from quantum chemistry calculations allows conclusions regarding the structures of the clusters under study. The first water molecule forms two hydrogen bonds with the π electron system of the naphthalene moiety. Subsequent water ligands interact with both the naphthalene and the other water ligands to form hydrogen bonded networks, similar to other hydrated anion clusters. Naphthalene-water anion clusters illustrate how water interacts with negative charge delocalized over a large π electron system. The clusters are interesting model systems that are discussed in the context of wetting of graphene surfaces and polyaromatic hydrocarbons.

  3. Modularization of biochemical networks based on classification of Petri net t-invariants

    PubMed Central

    Grafahrend-Belau, Eva; Schreiber, Falk; Heiner, Monika; Sackmann, Andrea; Junker, Björn H; Grunwald, Stefanie; Speer, Astrid; Winder, Katja; Koch, Ina

    2008-01-01

    Background Structural analysis of biochemical networks is a growing field in bioinformatics and systems biology. The availability of an increasing amount of biological data from molecular biological networks promises a deeper understanding but confronts researchers with the problem of combinatorial explosion. The amount of qualitative network data is growing much faster than the amount of quantitative data, such as enzyme kinetics. In many cases it is even impossible to measure quantitative data because of limitations of experimental methods, or for ethical reasons. Thus, a huge amount of qualitative data, such as interaction data, is available, but it was not sufficiently used for modeling purposes, until now. New approaches have been developed, but the complexity of data often limits the application of many of the methods. Biochemical Petri nets make it possible to explore static and dynamic qualitative system properties. One Petri net approach is model validation based on the computation of the system's invariant properties, focusing on t-invariants. T-invariants correspond to subnetworks, which describe the basic system behavior. With increasing system complexity, the basic behavior can only be expressed by a huge number of t-invariants. According to our validation criteria for biochemical Petri nets, the necessary verification of the biological meaning, by interpreting each subnetwork (t-invariant) manually, is not possible anymore. Thus, an automated, biologically meaningful classification would be helpful in analyzing t-invariants, and supporting the understanding of the basic behavior of the considered biological system. Methods Here, we introduce a new approach to automatically classify t-invariants to cope with network complexity. 
We apply clustering techniques such as UPGMA, Complete Linkage, Single Linkage, and Neighbor Joining in combination with different distance measures to get biologically meaningful clusters (t-clusters), which can be interpreted as modules. To find the optimal number of t-clusters to consider for interpretation, the cluster validity measure, Silhouette Width, is applied. Results We considered two different case studies as examples: a small signal transduction pathway (pheromone response pathway in Saccharomyces cerevisiae) and a medium-sized gene regulatory network (gene regulation of Duchenne muscular dystrophy). We automatically classified the t-invariants into functionally distinct t-clusters, which could be interpreted biologically as functional modules in the network. We found differences in the suitability of the various distance measures as well as the clustering methods. In terms of a biologically meaningful classification of t-invariants, the best results are obtained using the Tanimoto distance measure. Considering clustering methods, the obtained results suggest that UPGMA and Complete Linkage are suitable for clustering t-invariants with respect to the biological interpretability. Conclusion We propose a new approach for the biological classification of Petri net t-invariants based on cluster analysis. Due to the biologically meaningful data reduction and structuring of network processes, large sets of t-invariants can be evaluated, allowing for model validation of qualitative biochemical Petri nets. This approach can also be applied to elementary mode analysis. PMID:18257938
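
    The combination of the Tanimoto distance with UPGMA described in this abstract can be sketched in a few lines. The sketch assumes each t-invariant is represented by its support, that is, the set of transitions it contains, and it stops merging at a requested number of clusters instead of choosing that number by Silhouette Width. The function names and transition labels are hypothetical.

```python
from itertools import combinations


def tanimoto_distance(a, b):
    """1 - |A ∩ B| / |A ∪ B| for two sets of transitions."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 0.0  # two empty supports are treated as identical
    return 1.0 - len(a & b) / len(union)


def upgma_clusters(items, k, distance=tanimoto_distance):
    """Average-linkage (UPGMA) agglomeration until k clusters of item indices remain."""
    if not 1 <= k <= len(items):
        raise ValueError("k must be between 1 and the number of items")
    clusters = [[i] for i in range(len(items))]

    def average_distance(c1, c2):
        # UPGMA: unweighted mean of all pairwise distances between two clusters.
        total = sum(distance(items[i], items[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > k:
        x, y = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: average_distance(clusters[pair[0]], clusters[pair[1]]),
        )
        merged = clusters[x] + clusters[y]
        clusters = [c for idx, c in enumerate(clusters) if idx not in (x, y)] + [merged]
    return clusters


# Hypothetical t-invariant supports (illustration only).
toy_invariants = [
    {"t1", "t2", "t3"},
    {"t1", "t2", "t4"},
    {"t7", "t8"},
    {"t7", "t8", "t9"},
]
toy_t_clusters = upgma_clusters(toy_invariants, k=2)
```

    Recomputing the average distance from the original items at every step keeps the sketch short. A production implementation would update a distance matrix incrementally instead.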

  4. Diversity in Older Adults’ Use of the Internet: Identifying Subgroups Through Latent Class Analysis

    PubMed Central

    van Boekel, Leonieke C; Peek, Sebastiaan TM; Luijkx, Katrien G

    2017-01-01

    Background As for all individuals, the Internet is important in the everyday life of older adults. Research on older adults’ use of the Internet has merely focused on users versus nonusers and consequences of Internet use and nonuse. Older adults are a heterogeneous group, which may imply that their use of the Internet is diverse as well. Older adults can use the Internet for different activities, and this usage can influence the benefits the Internet can have for them. Objective The aim of this paper was to describe the diversity or heterogeneity in the activities for which older adults use the Internet and determine whether diversity is related to social or health-related variables. Methods We used data of a nationally representative Internet panel in the Netherlands. Panel members aged 65 years and older and who have access to and use the Internet were selected (N=1418). We conducted a latent class analysis based on the Internet activities that panel members reported to spend time on. Second, we described the identified clusters with descriptive statistics and compared the clusters using analysis of variance (ANOVA) and chi-square tests. Results Four clusters were distinguished. Cluster 1 was labeled as the “practical users” (36.88%, n=523). These respondents mainly used the Internet for practical and financial purposes such as searching for information, comparing products, and banking. Respondents in Cluster 2, the “minimizers” (32.23%, n=457), reported lowest frequency on most Internet activities, were older (mean age 73 years), and spent the least time on the Internet. Cluster 3 was labeled as the “maximizers” (17.77%, n=252); these respondents used the Internet for various activities, spent most time on the Internet, and were relatively younger (mean age below 70 years). Respondents in Cluster 4, the “social users,” mainly used the Internet for social and leisure-related activities such as gaming and social network sites.
The identified clusters significantly differed in age (P<.001, ω²=0.07), time spent on the Internet (P<.001, ω²=0.12), and frequency of downloading apps (P<.001, ω²=0.14), with medium to large effect sizes. Social and health-related variables were significantly different between the clusters, except social and emotional loneliness. However, effect sizes were small. The minimizers scored significantly lower on psychological well-being, instrumental activities of daily living (iADL), and experienced health compared with the practical users and maximizers. Conclusions Older adults are a diverse group in terms of their activities on the Internet. This underlines the importance to look beyond use versus nonuse when studying older adults’ Internet use. The clusters we have identified in this study can help tailor the development and deployment of eHealth intervention to specific segments of the older population. PMID:28539302
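
    The omega-squared effect sizes reported for the ANOVA comparisons in this abstract can be computed from the usual sums of squares. The sketch below assumes a one-way design with one group of observations per cluster. The function name and the toy values are hypothetical and are not the study's data.

```python
from statistics import mean


def omega_squared(groups):
    """Omega-squared effect size for a one-way ANOVA over lists of observations."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    if k < 2 or n <= k:
        raise ValueError("need at least two groups and more observations than groups")
    grand_mean = sum(sum(g) for g in groups) / n
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    ms_within = ss_within / (n - k)
    ss_total = ss_between + ss_within
    # omega^2 = (SS_between - (k - 1) * MS_within) / (SS_total + MS_within)
    return (ss_between - (k - 1) * ms_within) / (ss_total + ms_within)


# Hypothetical scores for three clusters (illustration only).
toy_omega = omega_squared([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]])
```

    Unlike eta-squared, omega-squared corrects for the upward bias that arises in small samples, which makes it a more conservative summary of how much variance cluster membership explains.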

  5. clusterProfiler: an R package for comparing biological themes among gene clusters.

    PubMed

    Yu, Guangchuang; Wang, Li-Gen; Han, Yanyan; He, Qing-Yu

    2012-05-01

    Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently, clusterProfiler supports three species, including humans, mice, and yeast. Methods provided in this package can be easily extended to other species and ontologies. The clusterProfiler package is released under Artistic-2.0 License within Bioconductor project. The source code and vignette are freely available at http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html.

  6. Combination of automated high throughput platforms, flow cytometry, and hierarchical clustering to detect cell state.

    PubMed

    Kitsos, Christine M; Bhamidipati, Phani; Melnikova, Irena; Cash, Ethan P; McNulty, Chris; Furman, Julia; Cima, Michael J; Levinson, Douglas

    2007-01-01

    This study examined whether hierarchical clustering could be used to detect cell states induced by treatment combinations that were generated through automation and high-throughput (HT) technology. Data-mining techniques were used to analyze the large experimental data sets to determine whether nonlinear, non-obvious responses could be extracted from the data. Unary, binary, and ternary combinations of pharmacological factors (examples of stimuli) were used to induce differentiation of HL-60 cells using a HT automated approach. Cell profiles were analyzed by incorporating hierarchical clustering methods on data collected by flow cytometry. Data-mining techniques were used to explore the combinatorial space for nonlinear, unexpected events. Additional small-scale, follow-up experiments were performed on cellular profiles of interest. Multiple, distinct cellular profiles were detected using hierarchical clustering of expressed cell-surface antigens. Data-mining of this large, complex data set retrieved cases of both factor dominance and cooperativity, as well as atypical cellular profiles. Follow-up experiments found that treatment combinations producing "atypical cell types" made those cells more susceptible to apoptosis. CONCLUSIONS Hierarchical clustering and other data-mining techniques were applied to analyze large data sets from HT flow cytometry. From each sample, the data set was filtered and used to define discrete, usable states that were then related back to their original formulations. Analysis of resultant cell populations induced by a multitude of treatments identified unexpected phenotypes and nonlinear response profiles.

  7. Mapping the spatial distribution of star formation in cluster galaxies at z ~ 0.5 with the Grism Lens-Amplified Survey from Space (GLASS)

    NASA Astrophysics Data System (ADS)

    Vulcani, Benedetta

    We present the first study of the spatial distribution of star formation in z ~ 0.5 cluster galaxies. The analysis is based on data taken with the Wide Field Camera 3 as part of the Grism Lens-Amplified Survey from Space (GLASS). We illustrate the methodology by focusing on two clusters (MACS0717.5+3745 and MACS1423.8+2404) with different morphologies (one relaxed and one merging) and use foreground and background galaxies as a field control sample. The cluster+field sample consists of 42 galaxies with stellar masses in the range 10^8-10^11 M⊙, and star formation rates in the range 1-20 M⊙ yr^-1. In both environments, Hα is more extended than the rest-frame UV continuum in 60% of the cases, consistent with diffuse star formation and inside-out growth. The Hα emission appears more extended in cluster galaxies than in the field, pointing perhaps to ionized gas being stripped and/or star formation being enhanced at large radii. The peak of the Hα emission and that of the continuum are offset by less than 1 kpc. We investigate trends with the hot gas density as traced by the X-ray emission, and with the surface mass density as inferred from gravitational lens models, and find no conclusive results. The diversity of morphologies and sizes observed in Hα illustrates the complexity of the environmental processes that regulate star formation.

  8. Skewed Riskscapes and Gentrified Inequities: Environmental Exposure Disparities in Seattle, Washington

    PubMed Central

    White, Jonah

    2011-01-01

    Objectives. Few studies have considered the sociohistorical intersection of environmental injustice and gentrification; a gap addressed by this case study of Seattle, Washington. This study explored the advantages of integrating air toxic risk screening with gentrification research to enhance proximity and health equity analysis methodologies. It was hypothesized that Seattle's industrial air toxic exposure risk was unevenly dispersed, that gentrification stratified the city's neighborhoods, and that the inequities of both converged. Methods. Spatial characterizations of air toxic pollution risk exposures from 1990 to 2007 were combined with longitudinal cluster analysis of census block groups in Seattle, Washington, from 1990 to 2000. Results. A cluster of air toxic exposure inequality and socioeconomic inequity converged in 1 area of south central Seattle. Minority and working class residents were more concentrated in the same neighborhoods near Seattle's worst industrial pollution risks. Conclusions. Not all pollution was distributed equally in a dynamic urban landscape. Using techniques to examine skewed riskscapes and socioeconomic urban geographies provided a foundation for future research on the connections among environmental health hazard sources, socially vulnerable neighborhoods, and health inequity. PMID:21836115

  9. Clinical Characteristics of Exacerbation-Prone Adult Asthmatics Identified by Cluster Analysis.

    PubMed

    Kim, Mi Ae; Shin, Seung Woo; Park, Jong Sook; Uh, Soo Taek; Chang, Hun Soo; Bae, Da Jeong; Cho, You Sook; Park, Hae Sim; Yoon, Ho Joo; Choi, Byoung Whui; Kim, Yong Hoon; Park, Choon Sik

    2017-11-01

    Asthma is a heterogeneous disease characterized by various types of airway inflammation and obstruction. Therefore, it is classified into several subphenotypes, such as early-onset atopic, obese non-eosinophilic, benign, and eosinophilic asthma, using cluster analysis. A number of asthmatics frequently experience exacerbation over a long-term follow-up period, but the exacerbation-prone subphenotype has rarely been evaluated by cluster analysis. This prompted us to identify clusters reflecting asthma exacerbation. A uniform cluster analysis method was applied to 259 adult asthmatics who were regularly followed-up for over 1 year using 12 variables, selected on the basis of their contribution to asthma phenotypes. After clustering, clinical profiles and exacerbation rates during follow-up were compared among the clusters. Four subphenotypes were identified: cluster 1 was comprised of patients with early-onset atopic asthma with preserved lung function, cluster 2 late-onset non-atopic asthma with impaired lung function, cluster 3 early-onset atopic asthma with severely impaired lung function, and cluster 4 late-onset non-atopic asthma with well-preserved lung function. The patients in clusters 2 and 3 were identified as exacerbation-prone asthmatics, showing a higher risk of asthma exacerbation. Two different phenotypes of exacerbation-prone asthma were identified among Korean asthmatics using cluster analysis; both were characterized by impaired lung function, but the age at asthma onset and atopic status were different between the two. Copyright © 2017 The Korean Academy of Asthma, Allergy and Clinical Immunology · The Korean Academy of Pediatric Allergy and Respiratory Disease

  10. Annotation of gene function in citrus using gene expression information and co-expression networks

    PubMed Central

    2014-01-01

    Background The genus Citrus encompasses major cultivated plants such as sweet orange, mandarin, lemon and grapefruit, among the world’s most economically important fruit crops. With increasing volumes of transcriptomics data available for these species, Gene Co-expression Network (GCN) analysis is a viable option for predicting gene function at a genome-wide scale. GCN analysis is based on a “guilt-by-association” principle whereby genes encoding proteins involved in similar and/or related biological processes may exhibit similar expression patterns across diverse sets of experimental conditions. While bioinformatics resources such as GCN analysis are widely available for efficient gene function prediction in model plant species including Arabidopsis, soybean and rice, in citrus these tools are not yet developed. Results We have constructed a comprehensive GCN for citrus inferred from 297 publicly available Affymetrix Genechip Citrus Genome microarray datasets, providing gene co-expression relationships at a genome-wide scale (33,000 transcripts). The comprehensive citrus GCN consists of a global GCN (condition-independent) and four condition-dependent GCNs that survey the sweet orange species only, all citrus fruit tissues, all citrus leaf tissues, or stress-exposed plants. All of these GCNs are clustered using genome-wide, gene-centric (guide) and graph clustering algorithms for flexibility of gene function prediction. For each putative cluster, gene ontology (GO) enrichment and gene expression specificity analyses were performed to enhance gene function, expression and regulation pattern prediction. The guide-gene approach was used to infer novel roles of genes involved in disease susceptibility and vitamin C metabolism, and graph-clustering approaches were used to investigate isoprenoid/phenylpropanoid metabolism in citrus peel, and citric acid catabolism via the GABA shunt in citrus fruit. 
    Conclusions Integration of citrus gene co-expression networks, functional enrichment analysis and gene expression information provides opportunities to infer gene function in citrus. We present a publicly accessible tool, Network Inference for Citrus Co-Expression (NICCE, http://citrus.adelaide.edu.au/nicce/home.aspx), for gene co-expression analysis in citrus. PMID:25023870

  11. Temporal and spatial analysis of psittacosis in association with poultry farming in the Netherlands, 2000-2015.

    PubMed

    Hogerwerf, Lenny; Holstege, Manon M C; Benincà, Elisa; Dijkstra, Frederika; van der Hoek, Wim

    2017-07-26

    Human psittacosis is a highly underdiagnosed zoonotic disease, commonly linked to psittacine birds. Psittacosis in birds, also known as avian chlamydiosis, is endemic in poultry, but the risk for people living close to poultry farms is unknown. Therefore, our study aimed to explore the temporal and spatial patterns of human psittacosis infections and identify possible associations with poultry farming in the Netherlands. We analysed data on 700 human cases of psittacosis notified between 01-01-2000 and 01-09-2015. First, we studied the temporal behaviour of psittacosis notifications by applying wavelet analysis. Then, to identify possible spatial patterns, we applied spatial cluster analysis. Finally, we investigated the possible spatial association between psittacosis notifications and data on the Dutch poultry sector at municipality level using a multivariable model. We found a large spatial cluster that covered a highly poultry-dense area, but additional clusters were found in areas that had a low poultry density. There were marked geographical differences in the awareness of psittacosis and in the amount and type of laboratory diagnostics used for psittacosis, making it difficult to draw conclusions about the correlation between the large cluster and poultry density. The multivariable model showed that the presence of chicken processing plants and slaughter duck farms in a municipality was associated with a higher rate of human psittacosis notifications. The significance of the associations was influenced by the inclusion or exclusion of farm density in the model. Our temporal and spatial analyses showed weak associations between poultry-related variables and psittacosis notifications. Because of the low number of psittacosis notifications available for analysis, the power of our analysis was relatively low.
    Because of the exploratory nature of this research, the associations found cannot be interpreted as evidence for airborne transmission of psittacosis from poultry to the general population. Further research is needed to determine the prevalence of C. psittaci in Dutch poultry. Also, efforts to promote PCR-based testing for C. psittaci and genotyping for source tracing are important to reduce the diagnostic deficit, and to provide better estimates of the human psittacosis burden, and the possible role of poultry.

  12. Towards accurate modeling of noncovalent interactions for protein rigidity analysis

    PubMed Central

    2013-01-01

    Background Protein rigidity analysis is an efficient computational method for extracting flexibility information from static, X-ray crystallography protein data. Atoms and bonds are modeled as a mechanical structure and analyzed with a fast graph-based algorithm, producing a decomposition of the flexible molecule into interconnected rigid clusters. The result depends critically on noncovalent atomic interactions, primarily on how hydrogen bonds and hydrophobic interactions are computed and modeled. Ongoing research points to the stringent need for benchmarking rigidity analysis software systems, towards the goal of increasing their accuracy and validating their results, both against each other and against biologically relevant (functional) parameters. We propose two new methods for modeling hydrogen bonds and hydrophobic interactions that more accurately reflect a mechanical model, without being computationally more intensive. We evaluate them using a novel scoring method, based on the B-cubed score from the information retrieval literature, which measures how well two cluster decompositions match. Results To evaluate the modeling accuracy of KINARI, our pebble-game rigidity analysis system, we use a benchmark data set of 20 proteins, each with multiple distinct conformations deposited in the Protein Data Bank. Cluster decompositions for them were previously determined with the RigidFinder method from Gerstein's lab and validated against experimental data. When KINARI's default tuning parameters are used, an improvement of the B-cubed score over a crude baseline is observed in 30% of this data. With our new modeling options, improvements were observed in over 70% of the proteins in this data set. We investigate the sensitivity of the cluster decomposition score with case studies on pyruvate phosphate dikinase and calmodulin.
    Conclusion To substantially improve the accuracy of protein rigidity analysis systems, thorough benchmarking must be performed on all current systems and future extensions. We have measured the gain in performance by comparing different modeling methods for noncovalent interactions. We showed that new criteria for modeling hydrogen bonds and hydrophobic interactions can significantly improve the results. The two new methods proposed here have been implemented and made publicly available in the current version of KINARI (v1.3), together with the benchmarking tools, which can be downloaded from our software's website, http://kinari.cs.umass.edu. PMID:24564209
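
    The abstract above scores how well two cluster decompositions match using a method based on B-cubed. As a rough illustration, the standard B-cubed precision, recall and F-score can be computed as below. This is the textbook definition only, not the authors' adapted scoring method or any KINARI code, and the function name and dictionary-based input format are assumptions made for the example.

```python
from typing import Dict, Hashable, Tuple


def b_cubed(pred: Dict[Hashable, Hashable],
            gold: Dict[Hashable, Hashable]) -> Tuple[float, float, float]:
    """Standard B-cubed precision, recall and F1.

    `pred` and `gold` map each item (e.g. an atom id) to a cluster label.
    Both mappings are assumed to cover exactly the same items.
    """
    items = list(pred)
    precision_sum = 0.0
    recall_sum = 0.0
    for e in items:
        same_pred = {x for x in items if pred[x] == pred[e]}
        same_gold = {x for x in items if gold[x] == gold[e]}
        overlap = len(same_pred & same_gold)  # always >= 1: e itself
        precision_sum += overlap / len(same_pred)
        recall_sum += overlap / len(same_gold)
    n = len(items)
    p, r = precision_sum / n, recall_sum / n
    f1 = 2 * p * r / (p + r)
    return p, r, f1


if __name__ == "__main__":
    gold = {"a": "x", "b": "x", "c": "y", "d": "y"}
    merged = {"a": 0, "b": 0, "c": 0, "d": 0}  # everything in one cluster
    print(b_cubed(merged, gold))
```

    Merging everything into one cluster keeps recall at 1 but halves precision in this toy case, which is the kind of trade-off the per-item averaging is meant to expose.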

  13. Cluster analysis of autoantibodies in 852 patients with systemic lupus erythematosus from a single center.

    PubMed

    Artim-Esen, Bahar; Çene, Erhan; Şahinkaya, Yasemin; Ertan, Semra; Pehlivan, Özlem; Kamali, Sevil; Gül, Ahmet; Öcal, Lale; Aral, Orhan; Inanç, Murat

    2014-07-01

    Associations between autoantibodies and clinical features have been described in systemic lupus erythematosus (SLE). Herein, we aimed to define autoantibody clusters and their clinical correlations in a large cohort of patients with SLE. We analyzed 852 patients with SLE who attended our clinic. Seven autoantibodies were selected for cluster analysis: anti-DNA, anti-Sm, anti-RNP, anticardiolipin (aCL) immunoglobulin (Ig)G or IgM, lupus anticoagulant (LAC), anti-Ro, and anti-La. Two-step clustering and Kaplan-Meier survival analyses were used. Five clusters were identified. A cluster consisted of patients with only anti-dsDNA antibodies, a cluster of anti-Sm and anti-RNP, a cluster of aCL IgG/M and LAC, and a cluster of anti-Ro and anti-La antibodies. Analysis revealed 1 more cluster that consisted of patients who did not belong to any of the clusters formed by antibodies chosen for cluster analysis. Sm/RNP cluster had significantly higher incidence of pulmonary hypertension and Raynaud phenomenon. DsDNA cluster had the highest incidence of renal involvement. In the aCL/LAC cluster, there were significantly more patients with neuropsychiatric involvement, antiphospholipid syndrome, autoimmune hemolytic anemia, and thrombocytopenia. According to the Systemic Lupus International Collaborating Clinics damage index, the highest frequency of damage was in the aCL/LAC cluster. Comparison of 10 and 20 years survival showed reduced survival in the aCL/LAC cluster. This study supports the existence of autoantibody clusters with distinct clinical features in SLE and shows that forming clinical subsets according to autoantibody clusters may be useful in predicting the outcome of the disease. Autoantibody clusters in SLE may exhibit differences according to the clinical setting or population.

  14. A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluation method

    PubMed Central

    Yoo, Illhoi; Hu, Xiaohua; Song, Il-Yeol

    2007-01-01

    Background A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free text, document clustering and text summarization together are used as a solution for text information overload problem. In this paper, we introduce a coherent graph-based semantic clustering and summarization approach for biomedical literature. Results Our extensive experimental results show the approach shows 45% cluster quality improvement and 72% clustering reliability improvement, in terms of misclassification index, over Bisecting K-means as a leading document clustering approach. In addition, our approach provides concise but rich text summary in key concepts and sentences. Conclusion Our coherent biomedical literature clustering and summarization approach that takes advantage of ontology-enriched graphical representations significantly improves the quality of document clusters and understandability of documents through summaries. PMID:18047705

  15. The mystery of the Hawaii liver disease cluster in summer 2013: A pragmatic and clinical approach to solve the problem.

    PubMed

    Teschke, Rolf; Schwarzenboeck, Alexander; Frenzel, Christian; Schulze, Johannes; Eickhoff, Axel; Wolff, Albrecht

    2016-01-01

    In the fall of 2013, the US Centers for Disease Control and Prevention (CDC) published a preliminary report on a cluster of liver disease cases that emerged in Hawaii in the summer 2013. This report claimed a temporal association as sufficient evidence that OxyELITE Pro (OEP), a dietary supplement (DS) mainly for weight loss, was the cause of this mysterious cluster. However, the presented data were inconsistent and required a thorough reanalysis. To further investigate the cause(s) of this cluster, we critically evaluated redacted raw clinical data of the cluster patients, as the CDC report received tremendous publicity in local and nationwide newspapers and television. This attention put regulators and physicians from the medical center in Honolulu that reported the cluster, under enormous pressure to succeed, risking biased evaluations and hasty conclusions. We noted pervasive bias in the documentation, conclusions, and public statements, also poor quality of case management. Among the cases we reviewed, many causes unrelated to any DS were evident, including decompensated liver cirrhosis, acute liver failure by acetaminophen overdose, acute cholecystitis with gallstones, resolving acute hepatitis B, acute HSV and VZV hepatitis, hepatitis E suspected after consumption of wild hog meat, and hepatotoxicity by acetaminophen or ibuprofen. Causality assessments based on the updated CIOMS scale confirmed the lack of evidence for any DS including OEP as culprit for the cluster. Thus, the Hawaii liver disease cluster is now best explained by various liver diseases rather than any DS, including OEP.

  16. Is It Feasible to Identify Natural Clusters of TSC-Associated Neuropsychiatric Disorders (TAND)?

    PubMed

    Leclezio, Loren; Gardner-Lubbe, Sugnet; de Vries, Petrus J

    2018-04-01

    Tuberous sclerosis complex (TSC) is a genetic disorder with multisystem involvement. The lifetime prevalence of TSC-Associated Neuropsychiatric Disorders (TAND) is in the region of 90% in an apparently unique, individual pattern. This "uniqueness" poses significant challenges for diagnosis, psycho-education, and intervention planning. To date, no studies have explored whether there may be natural clusters of TAND. The purpose of this feasibility study was (1) to investigate the practicability of identifying natural TAND clusters, and (2) to identify appropriate multivariate data analysis techniques for larger-scale studies. TAND Checklist data were collected from 56 individuals with a clinical diagnosis of TSC (n = 20 from South Africa; n = 36 from Australia). Using R, the open-source statistical platform, mean squared contingency coefficients were calculated to produce a correlation matrix, and various cluster analyses and exploratory factor analysis were examined. Ward's method rendered six TAND clusters with good face validity and significant convergence with a six-factor exploratory factor analysis solution. The "bottom-up" data-driven strategies identified a "scholastic" cluster of TAND manifestations, an "autism spectrum disorder-like" cluster, a "dysregulated behavior" cluster, a "neuropsychological" cluster, a "hyperactive/impulsive" cluster, and a "mixed/mood" cluster. These feasibility results suggest that a combination of cluster analysis and exploratory factor analysis methods may be able to identify clinically meaningful natural TAND clusters. Findings require replication and expansion in larger datasets, and could include quantification of cluster or factor scores at an individual level. Copyright © 2018 Elsevier Inc. All rights reserved.
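
    The correlation matrix described above can be illustrated with the phi (mean square contingency) coefficient for pairs of yes/no checklist items. The study itself used R; the Python sketch below is only an assumed reconstruction of that one step, with hypothetical function names and toy data, and it stops before the Ward clustering stage.

```python
from math import sqrt
from typing import List, Sequence


def phi(x: Sequence[int], y: Sequence[int]) -> float:
    """Phi coefficient between two binary (0/1) variables of equal length."""
    n11 = sum(1 for a, b in zip(x, y) if a and b)
    n10 = sum(1 for a, b in zip(x, y) if a and not b)
    n01 = sum(1 for a, b in zip(x, y) if not a and b)
    n00 = sum(1 for a, b in zip(x, y) if not a and not b)
    denom = sqrt((n11 + n10) * (n01 + n00) * (n11 + n01) * (n10 + n00))
    # A constant variable has no defined correlation; return 0.0 by convention here.
    return (n11 * n00 - n10 * n01) / denom if denom else 0.0


def phi_matrix(columns: List[Sequence[int]]) -> List[List[float]]:
    """Pairwise phi coefficients for a list of binary item columns."""
    return [[phi(a, b) for b in columns] for a in columns]


if __name__ == "__main__":
    # Toy responses for three hypothetical checklist items across four people.
    items = [[1, 1, 0, 0], [1, 1, 0, 0], [1, 0, 1, 0]]
    for row in phi_matrix(items):
        print(row)
```

    A matrix like this would typically be converted to a dissimilarity before hierarchical clustering; how the authors did that conversion is not stated in the abstract.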

  17. Psychosocial Costs of Racism to Whites: Exploring Patterns through Cluster Analysis

    ERIC Educational Resources Information Center

    Spanierman, Lisa B.; Poteat, V. Paul; Beer, Amanda M.; Armstrong, Patrick Ian

    2006-01-01

    Participants (230 White college students) completed the Psychosocial Costs of Racism to Whites (PCRW) Scale. Using cluster analysis, we identified 5 distinct cluster groups on the basis of PCRW subscale scores: the unempathic and unaware cluster contained the lowest empathy scores; the insensitive and afraid cluster consisted of low empathy and…

  18. Allergen Sensitization Pattern by Sex: A Cluster Analysis in Korea.

    PubMed

    Ohn, Jungyoon; Paik, Seung Hwan; Doh, Eun Jin; Park, Hyun-Sun; Yoon, Hyun-Sun; Cho, Soyun

    2017-12-01

    Allergens tend to sensitize simultaneously. The etiology of this phenomenon has been suggested to be allergen cross-reactivity or concurrent exposure. However, little is known about specific allergen sensitization patterns. This study aimed to investigate the allergen sensitization characteristics according to gender. The multiple allergen simultaneous test (MAST) is widely used as a screening tool for detecting allergen sensitization in dermatologic clinics. We retrospectively reviewed the medical records of patients with MAST results between 2008 and 2014 in our Department of Dermatology. A cluster analysis was performed to elucidate the allergen-specific immunoglobulin (Ig)E cluster pattern. The results of MAST (39 allergen-specific IgEs) from 4,360 cases were analyzed. By cluster analysis, the 39 items were grouped into 8 clusters. Each cluster had characteristic features. Compared with the female group, the male group tended to be sensitized more frequently to all tested allergens, except for the fungus allergen cluster. The cluster and comparative analysis results demonstrate that allergen sensitization is clustered, manifesting allergen similarity or co-exposure. Only the fungus cluster allergens tended to sensitize the female group more frequently than the male group.

  19. Cluster analysis reveals seasonal variation of sperm subpopulations in extended boar semen

    PubMed Central

    IBĂNESCU, Iulian; LEIDING, Claus; BOLLWEIN, Heinrich

    2017-01-01

    This study aimed to identify motile sperm subpopulations in extended boar semen and to observe the presumptive seasonal variation in their distribution. Data from 4837 boar ejaculates collected over a two-year period were analyzed in terms of kinematic parameters by Computer Assisted Sperm Analysis (CASA). Individual sperm data were used to determine subgroups of motile sperm within the ejaculates using cluster analysis. Four motile sperm subpopulations (SP) were identified, with distinct movement patterns: SP1 sperm with high velocity and high linearity; SP2 sperm with high velocity but low linearity; SP3 sperm with low velocity but high linearity; and SP4 sperm with low velocity and low linearity. SP1 constituted the least overall proportion within the ejaculates (P < 0.05). Season of semen collection significantly influenced the different proportions of sperm subpopulations. Spring was characterized by similar proportions of SP1 and SP4 (NS) and higher proportions of SP3. Summer brought a decrease in both subgroups containing fast sperm (SP1 and SP2) (P < 0.05). During autumn, increases in SP2 and SP4 were recorded. Winter substantially affected the proportions of all sperm subpopulations (P < 0.05) and SP2 became the most represented subgroup, while SP1 (fast and linear) reached its highest proportion compared to other seasons. In conclusion, extended boar semen is structured in distinct motile sperm subpopulations whose proportions vary according to the season of collection. Summer and autumn seem to have a negative impact on the fast and linear subpopulation. Cluster analysis can be useful in revealing differences in semen quality that are not normally detected by classical evaluation based on mean values. PMID:29081440

  20. The relationship between leadership, teamworking, structure, burnout and attitude to patients on acute psychiatric wards

    PubMed Central

    Nijman, Henk; Simpson, Alan; Jones, Julia

    2010-01-01

    Background Conflict (aggression, substance use, absconding, etc.) and containment (coerced medication, manual restraint, etc.) threaten the safety of patients and staff on psychiatric wards. Previous work has suggested that staff variables may be significant in explaining differences between wards in their rates of these behaviours, and that structure (ward organisation, rules and daily routines) might be the most critical of these. This paper describes the exploration of a large dataset to assess the relationship between structure and other staff variables. Methods A multivariate cross-sectional design was utilised. Data were collected from staff on 136 acute psychiatric wards in 26 NHS Trusts in England, measuring leadership, teamwork, structure, burnout and attitudes towards difficult patients. Relationships between these variables were explored through principal components analysis (PCA), structural equation modelling and cluster analysis. Results Principal components analysis resulted in the identification of each questionnaire as a separate factor, indicating that the selected instruments assessed a number of non-overlapping items relevant for ward functioning. Structural equation modelling suggested a linear model in which leadership influenced teamwork, teamwork structure; structure burnout; and burnout feelings about difficult patients. Finally, cluster analysis identified two significantly distinct groups of wards: the larger of which had particularly good leadership, teamwork, structure, attitudes towards patients and low burnout; and the second smaller proportion which was poor on all variables and high on burnout. The better functioning cluster of wards had significantly lower rates of containment events. Conclusion The overall performance of staff teams is associated with differing rates of containment on wards. 
    Interventions to reduce rates of containment on wards may need to address staff issues at every level, from leadership through to staff attitudes. PMID:20082064

  1. Inequalities in neighbourhood socioeconomic characteristics: potential evidence-base for neighbourhood health planning

    PubMed Central

    Odoi, Agricola; Wray, Ron; Emo, Marion; Birch, Stephen; Hutchison, Brian; Eyles, John; Abernathy, Tom

    2005-01-01

    Background Population health planning aims to improve the health of the entire population and to reduce health inequities among population groups. Socioeconomic factors are increasingly being recognized as major determinants of many aspects of health and causes of health inequities. Knowledge of socioeconomic characteristics of neighbourhoods is necessary to identify their unique health needs and enhance identification of socioeconomically disadvantaged populations. Careful integration of this knowledge into health planning activities is necessary to ensure that health planning and service provision are tailored to unique neighbourhood population health needs. In this study, we identify unique neighbourhood socioeconomic characteristics and classify the neighbourhoods based on these characteristics. Principal components analysis (PCA) of 18 socioeconomic variables was used to identify the principal components explaining most of the variation in socioeconomic characteristics across the neighbourhoods. Cluster analysis was used to classify neighbourhoods based on their socioeconomic characteristics. Results Results of the PCA and cluster analysis were similar but the latter were more objective and easier to interpret. Five neighbourhood types with distinguishing socioeconomic and demographic characteristics were identified. The methodology provides a more complete picture of the neighbourhood socioeconomic characteristics than when a single variable (e.g. income) is used to classify neighbourhoods. Conclusion Cluster analysis is useful for generating neighbourhood population socioeconomic and demographic characteristics that can be useful in guiding neighbourhood health planning and service provision. 
    This study is the first of a series of studies designed to investigate health inequalities at the neighbourhood level with a view to providing evidence-base for health planners, service providers and policy makers to help address health inequity issues at the neighbourhood level. Subsequent studies will investigate inequalities in health outcomes both within and across the neighbourhood types identified in the current study. PMID:16092969

  2. A Phylogenetic Analysis of the Genus Fragaria (Strawberry) Using Intron-Containing Sequence from the ADH-1 Gene

    PubMed Central

    DiMeglio, Laura M.; Yu, Hongrun; Davis, Thomas M.

    2014-01-01

    The genus Fragaria encompasses species at ploidy levels ranging from diploid to decaploid. The cultivated strawberry, Fragaria×ananassa, and its two immediate progenitors, F. chiloensis and F. virginiana, are octoploids. To elucidate the ancestries of these octoploid species, we performed a phylogenetic analysis using intron-containing sequences of the nuclear ADH-1 gene from 39 germplasm accessions representing nineteen Fragaria species and one outgroup species, Dasiphora fruticosa. All trees from Maximum Parsimony and Maximum Likelihood analyses showed two major clades, Clade A and Clade B. Each of the sampled octoploids contributed alleles to both major clades. All octoploid-derived alleles in Clade A clustered with alleles of diploid F. vesca, with the exception of one octoploid allele that clustered with the alleles of diploid F. mandshurica. All octoploid-derived alleles in clade B clustered with the alleles of only one diploid species, F. iinumae. When gaps encoded as binary characters were included in the Maximum Parsimony analysis, tree resolution was improved with the addition of six nodes, and the bootstrap support was generally higher, rising above the 50% threshold for an additional nine branches. These results, coupled with the congruence of the sequence data and the coded gap data, validate and encourage the employment of sequence sets containing gaps for phylogenetic analysis. Our phylogenetic conclusions, based upon sequence data from the ADH-1 gene located on F. vesca linkage group II, complement and generally agree with those obtained from analyses of protein-encoding genes GBSSI-2 and DHAR located on F. vesca linkage groups V and VII, respectively, but differ from a previous study that utilized rDNA sequences and did not detect the ancestral role of F. iinumae. PMID:25078607

  3. Microarray characterization of gene expression changes in blood during acute ethanol exposure

    PubMed Central

    2013-01-01

    Background As part of the civil aviation safety program to define the adverse effects of ethanol on flying performance, we performed a DNA microarray analysis of human whole blood samples from a five-time point study of subjects administered ethanol orally, followed by breathalyzer analysis, to monitor blood alcohol concentration (BAC) to discover significant gene expression changes in response to the ethanol exposure. Methods Subjects were administered either orange juice or orange juice with ethanol. Blood samples were taken based on BAC and total RNA was isolated from PaxGene™ blood tubes. The amplified cDNA was used in microarray and quantitative real-time polymerase chain reaction (RT-qPCR) analyses to evaluate differential gene expression. Microarray data was analyzed in a pipeline fashion to summarize and normalize and the results evaluated for relative expression across time points with multiple methods. Candidate genes showing distinctive expression patterns in response to ethanol were clustered by pattern and further analyzed for related function, pathway membership and common transcription factor binding within and across clusters. RT-qPCR was used with representative genes to confirm relative transcript levels across time to those detected in microarrays. Results Microarray analysis of samples representing 0%, 0.04%, 0.08%, return to 0.04%, and 0.02% wt/vol BAC showed that changes in gene expression could be detected across the time course. The expression changes were verified by qRT-PCR. The candidate genes of interest (GOI) identified from the microarray analysis and clustered by expression pattern across the five BAC points showed seven coordinately expressed groups. Analysis showed function-based networks, shared transcription factor binding sites and signaling pathways for members of the clusters. 
These include hematological functions, innate immunity and inflammation functions, metabolic functions expected of ethanol metabolism, and pancreatic and hepatic function. Five of the seven clusters showed links to the p38 MAPK pathway. Conclusions The results of this study provide a first look at changing gene expression patterns in human blood during an acute rise in blood ethanol concentration and its depletion because of metabolism and excretion, and demonstrate that it is possible to detect changes in gene expression using total RNA isolated from whole blood. The analysis approach for this study serves as a workflow to investigate the biology linked to expression changes across a time course and from these changes, to identify target genes that could serve as biomarkers linked to pilot performance. PMID:23883607

  4. Orbit Clustering Based on Transfer Cost

    NASA Technical Reports Server (NTRS)

    Gustafson, Eric D.; Arrieta-Camacho, Juan J.; Petropoulos, Anastassios E.

    2013-01-01

    We propose using cluster analysis to perform quick screening for combinatorial global optimization problems. The key missing component currently preventing cluster analysis from use in this context is the lack of a useable metric function that defines the cost to transfer between two orbits. We study several proposed metrics and clustering algorithms, including k-means and the expectation maximization algorithm. We also show that proven heuristic methods such as the Q-law can be modified to work with cluster analysis.

  5. The formation and evolution of M33 as revealed by its star clusters

    NASA Astrophysics Data System (ADS)

    San Roman, Izaskun

    2012-03-01

    Numerical simulations based on the Lambda-Cold Dark Matter (Λ-CDM) model predict a scenario consistent with observational evidence in terms of the build-up of Milky Way-like halos. Under this scenario, large disk galaxies derive from the merger and accretion of many smaller subsystems. However, it is less clear how low-mass spiral galaxies fit into this picture. The best way to answer this question is to study the nearest example of a dwarf spiral galaxy, M33. We will use star clusters to understand the structure, kinematics and stellar populations of this galaxy. Star clusters provide a unique and powerful tool for studying the star formation histories of galaxies. In particular, the ages and metallicities of star clusters bear the imprint of the galaxy formation process. We have made use of the star clusters to uncover the formation and evolution of M33. In this dissertation, we have carried out a comprehensive study of the M33 star cluster system, including deep photometry as well as high signal-to-noise spectroscopy. In order to mitigate the significant incompleteness present in previous catalogs, we have conducted ground-based and space-based photometric surveys of M33 star clusters. Using archival images, we have analyzed 12 fields using the Advanced Camera for Surveys Wide Field Channel onboard the Hubble Space Telescope (ACS/HST) along the major axis of the galaxy. We present integrated photometry and color-magnitude diagrams for 161 star clusters in M33, of which 115 were previously uncataloged. This survey extends the depth of the existing M33 cluster catalogs by ~1 mag. We have expanded our search through a photometric survey in a 1° x 1° area centered on M33 using the MegaCam camera on the 3.6m Canada-France-Hawaii Telescope (CFHT). In this work we discuss the photometric properties of the sample, including color-color diagrams of 599 new candidate stellar clusters, and 204 confirmed clusters. 
Comparisons with models of simple stellar populations suggest a large range of ages, some as old as ~10 Gyr. In addition, we find in the color-color diagrams a significant population of very young clusters (< 10 Myr) possessing nebular emission. Analysis of the radial density distribution suggests that the cluster system of M33 has suffered from significant depletion, possibly due to interactions with M31. To further understand the properties of M33 star clusters, we have carried out a morphological study of 161 star clusters in M33 using ACS/HST images. We have obtained, for the first time, ellipticities, position angles, and surface brightness profiles of a statistically significant number of clusters. Ellipticities show that, on average, M33 clusters are more flattened than those of the Milky Way and M31, and more similar to clusters in the Small Magellanic Cloud. The ellipticities do not show any correlation with age or mass, suggesting that rotation is not the main cause of elongation in the M33 clusters. The position angles of the clusters show a bimodality with a strong peak perpendicular to the position angle of the galaxy. These results support the notion that tidal forces are the reason for the cluster flattening. We have fit analytical models to the surface brightness profiles, and derived structural parameters. The overall analysis shows several differences between the structural properties of the M33 cluster system and cluster systems in nearby galaxies. Finally, we have performed a spectroscopic study of star clusters in the above-mentioned catalog. We present high-precision velocity measures of 45 star clusters, based on observations from the 10.4m Gran Telescopio Canarias (GTC) using OSIRIS and 4.2m William Herschel Telescope (WHT) using WYFFOS. All the clusters have been previously confirmed using HST imaging, and ages and integrated photometry are known. 
The velocity of the clusters with respect to local disk motion increases with age for young and intermediate clusters. The mean dispersion velocity for the intermediate age clusters in our sample is significantly larger than in previous studies. Analysis of these velocities along the major axis of the galaxy shows no net rotation of the intermediate age subsample. The small number of old clusters in our sample does not allow for any conclusive evidence in that age division.

  6. Hierarchical cluster analysis of labour market regulations and population health: a taxonomy of low- and middle-income countries

    PubMed Central

    2012-01-01

    Background An important contribution of the social determinants of health perspective has been to inquire about non-medical determinants of population health. Among these, labour market regulations are of vital significance. In this study, we investigate the labour market regulations among low- and middle-income countries (LMICs) and propose a labour market taxonomy to further understand population health in a global context. Methods Using Gross National Product per capita, we classify 113 countries into either low-income (n = 71) or middle-income (n = 42) strata. Principal component analysis of three standardized indicators of labour market inequality and poverty is used to construct 2 factor scores. Factor score reliability is evaluated with Cronbach's alpha. Using these scores, we conduct a hierarchical cluster analysis to produce a labour market taxonomy, conduct zero-order correlations, and create box plots to test their associations with adult mortality, healthy life expectancy, infant mortality, maternal mortality, neonatal mortality, under-5 mortality, and years of life lost to communicable and non-communicable diseases. Labour market and health data are retrieved from the International Labour Organization's Key Indicators of Labour Markets and World Health Organization's Statistical Information System. Results Six labour market clusters emerged: Residual (n = 16), Emerging (n = 16), Informal (n = 10), Post-Communist (n = 18), Less Successful Informal (n = 22), and Insecure (n = 31). 
Primary findings indicate: (i) labour market poverty and population health are correlated in both low- and middle-income countries; (ii) the association between labour market inequality and health indicators is significant only in low-income countries; (iii) Emerging (e.g., East Asian and Eastern European countries) and Insecure (e.g., sub-Saharan African nations) clusters are the most advantaged and disadvantaged, respectively, with the remaining clusters experiencing levels of population health consistent with their labour market characteristics. Conclusions The labour market regulations of LMICs appear to be an important social determinant of population health. This study demonstrates the heuristic value of understanding the labour markets of LMICs and their health effects using exploratory taxonomy approaches. PMID:22512892

  7. Alcohol outlets and clusters of violence

    PubMed Central

    2011-01-01

    Background Alcohol related violence continues to be a major public health problem in the United States. In particular, there is substantial evidence of an association between alcohol outlets and assault. However, because the specific geographic relationships between alcohol outlets and the distribution of violence remains obscured, it is important to identify the spatial linkages that may exist, enhancing public health efforts to curb both violence and morbidity. Methods The present study utilizes police-recorded data on simple and aggravated assaults in Cincinnati, Ohio. Addresses of alcohol outlets for Cincinnati, including all bars, alcohol-serving restaurants, and off-premise liquor and convenience stores were obtained from the Ohio Division of Liquor Control and geocoded for analysis. A combination of proximity analysis, spatial cluster detection approaches and a geographic information system were used to identify clusters of alcohol outlets and the distribution of violence around them. Results A brief review of the empirical work relating to alcohol outlet density and violence is provided, noting that the majority of this literature is cross-sectional and ecological in nature, yielding a somewhat haphazard and aggregate view of how outlet type(s) and neighborhood characteristics like social organization and land use are related to assaultive violence. The results of the statistical analysis for Cincinnati suggest that while alcohol outlets are not problematic per se, assaultive violence has a propensity to cluster around agglomerations of alcohol outlets. This spatial relationship varies by distance and is also related to the characteristics of the alcohol outlet agglomeration. Specifically, spatially dense distributions of outlets appear to be more prone to clusters of assaultive violence when compared to agglomerations with a lower density of outlets. 
Conclusion With a more thorough understanding of the spatial relationships between alcohol outlets and the distribution of assaults, policymakers in urban areas can make more informed regulatory decisions regarding alcohol licenses. Further, this research suggests that public health officials and epidemiologists need to develop a better understanding of what actually occurs in and around alcohol outlets, determining what factors (whether outlet, neighborhood, or spatially related) help fuel their relationship with violence and other alcohol-related harm. PMID:21542932

  8. Brain Activity and Human Unilateral Chewing

    PubMed Central

    Quintero, A.; Ichesco, E.; Myers, C.; Schutt, R.; Gerstner, G.E.

    2012-01-01

    Brain mechanisms underlying mastication have been studied in non-human mammals but less so in humans. We used functional magnetic resonance imaging (fMRI) to evaluate brain activity in humans during gum chewing. Chewing was associated with activations in the cerebellum, motor cortex and caudate, cingulate, and brainstem. We also divided the 25-second chew-blocks into 5 segments of equal 5-second durations and evaluated activations within and between each of the 5 segments. This analysis revealed activation clusters unique to the initial segment, which may indicate brain regions involved with initiating chewing. Several clusters were uniquely activated during the last segment as well, which may represent brain regions involved with anticipatory or motor events associated with the end of the chew-block. In conclusion, this study provided evidence for specific brain areas associated with chewing in humans and demonstrated that brain activation patterns may dynamically change over the course of chewing sequences. PMID:23103631

  9. Interactive K-Means Clustering Method Based on User Behavior for Different Analysis Target in Medicine.

    PubMed

    Lei, Yang; Yu, Dai; Bin, Zhang; Yang, Yang

    2017-01-01

    Clustering algorithms are widely used as a basis for data analysis in analysis systems. However, with high-dimensional data, a clustering algorithm may overlook the business relations between dimensions, especially in the medical field. As a result, the clustering result often fails to meet the users' business goals. If the clustering process can incorporate the users' knowledge, that is, the doctor's knowledge or the analysis intent, the result can be more satisfactory. In this paper, we propose an interactive K-means clustering method to improve the user's satisfaction with the result. The core of this method is to obtain the user's feedback on the clustering result and use it to optimize that result. A particle swarm optimization algorithm is then used to optimize the parameters, especially the weight settings in the clustering algorithm, so that they reflect the user's business preference as closely as possible. Based on this parameter optimization and adjustment, the clustering result can be brought closer to the user's requirements. Finally, we use a breast cancer example to test our method. The experiments show improved performance for our algorithm.
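
    The role of per-dimension weights in such a method can be illustrated with a minimal sketch. It assumes a plain K-means with a feature-weighted squared Euclidean distance; the function names, the toy points, and the hand-set weights are hypothetical, and the particle swarm step that the abstract uses to tune the weights from user feedback is not reproduced here.

```python
import random


def weighted_sq_dist(a, b, weights):
    """Squared Euclidean distance with one non-negative weight per dimension."""
    return sum(w * (x - y) ** 2 for x, y, w in zip(a, b, weights))


def weighted_kmeans(points, k, weights, n_iter=50, seed=0):
    """Plain K-means (Lloyd iterations) using the weighted distance above.

    The weights stand in for user preference: a larger weight makes that
    dimension matter more when points are assigned to centroids.
    """
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # hypothetical choice: random initial centroids
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: nearest centroid under the weighted distance.
        labels = [
            min(range(k), key=lambda c: weighted_sq_dist(p, centroids[c], weights))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return labels, centroids


# Toy data: four corners of a square. The weights decide which axis splits them.
points = [(0, 0), (0, 10), (10, 0), (10, 10)]
labels_x, _ = weighted_kmeans(points, k=2, weights=(1.0, 0.0))  # only x matters
labels_y, _ = weighted_kmeans(points, k=2, weights=(0.0, 1.0))  # only y matters
```

    Changing only the weights changes which points end up together, which is the lever an interactive scheme could adjust after each round of feedback.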

  10. Spatial and temporal structure of typhoid outbreaks in Washington, D.C., 1906–1909: evaluating local clustering with the Gi* statistic

    PubMed Central

    Hinman, Sarah E; Blackburn, Jason K; Curtis, Andrew

    2006-01-01

    Background To better understand the distribution of typhoid outbreaks in Washington, D.C., the U.S. Public Health Service (PHS) conducted four investigations of typhoid fever. These studies included maps of cases reported between 1 May and 31 October, 1906–1909. These data were entered into a GIS database and analyzed using Ripley's K-function followed by the Gi* statistic in yearly intervals to evaluate spatial clustering, the scale of clustering, and the temporal stability of these clusters. Results The Ripley's K-function indicated no global spatial autocorrelation. The Gi* statistic indicated clustering of typhoid at multiple scales across the four-year time period, refuting the conclusions drawn in all four PHS reports concerning the distribution of cases. While the PHS reports suggested an even distribution of the disease, this study quantified both areas of localized disease clustering and mobile larger regions of clustering, thus indicating both highly localized and periodic generalized sources of infection within the city. Conclusion The methodology applied in this study was useful for evaluating the spatial distribution and annual-level temporal patterns of typhoid outbreaks in Washington, D.C. from 1906 to 1909. While advanced spatial analyses of historical data sets must be interpreted with caution, this study does suggest that there is utility in these types of analyses and that they provide new insights into the urban patterns of typhoid outbreaks during the early part of the twentieth century. PMID:16566830
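
    For readers unfamiliar with the Gi* statistic, the following is a minimal sketch of the standard Getis-Ord formula. It assumes binary distance-band weights (1 within a radius, including the location itself, 0 otherwise); the function name, the radius, and the toy grid of counts are hypothetical and are not the study's data or GIS workflow.

```python
import math


def getis_ord_gi_star(values, coords, radius):
    """Return one Gi* z-score per location.

    Gi*_i = (sum_j w_ij x_j - mean * W_i)
            / (S * sqrt((n * sum_j w_ij^2 - W_i^2) / (n - 1)))
    with W_i = sum_j w_ij and S the population standard deviation of x.
    """
    n = len(values)
    mean = sum(values) / n
    s = math.sqrt(sum(v * v for v in values) / n - mean ** 2)
    scores = []
    for xi, yi in coords:
        # Binary weights: neighbours within `radius`, the location itself included.
        w = [1.0 if math.hypot(xi - xj, yi - yj) <= radius else 0.0 for xj, yj in coords]
        w_sum = sum(w)
        w_sq_sum = sum(wj * wj for wj in w)
        numerator = sum(wj * v for wj, v in zip(w, values)) - mean * w_sum
        denominator = s * math.sqrt((n * w_sq_sum - w_sum ** 2) / (n - 1))
        scores.append(numerator / denominator if denominator > 0 else 0.0)
    return scores


# Toy 5 x 5 grid of case counts with a block of high counts in one corner.
coords = [(x, y) for x in range(5) for y in range(5)]
values = [10 if (x < 2 and y < 2) else 0 for x, y in coords]
scores = getis_ord_gi_star(values, coords, radius=1.0)
hot_corner = scores[coords.index((0, 0))]
far_corner = scores[coords.index((4, 4))]
```

    Large positive scores mark locations surrounded by high values and negative scores mark locations surrounded by low values. Repeating the calculation with several radii is one way to probe clustering at multiple scales.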

  11. Color gradients in cooling flow cluster central galaxies and the ionization of cluster emission line systems

    NASA Technical Reports Server (NTRS)

    Romanishin, W.

    1988-01-01

    Preliminary results are given for a program to measure color gradients in the central galaxies in clusters with a variety of cooling flow rates. The objectives are to search for extended blue continuum regions indicative of star formation, to study the spatial distribution of star formation, and to make a quantitative measure of the amount of light from young stars, which can lead to a measure of the star formation rate (for an assumed initial mass function). Four clusters with large masses and large cluster H-alpha emission fluxes are found to have an excess of blue light concentrated to the centers of the cluster central galaxy. Assumption of a disk IMF leads to the conclusion that the starlight might play a major role in ionizing the emission line gas in these clusters.

  12. Clustering of attitudes towards obesity: a mixed methods study of Australian parents and children

    PubMed Central

    2013-01-01

    Background Current population-based anti-obesity campaigns often target individuals based on either weight or socio-demographic characteristics, and give a ‘mass’ message about personal responsibility. There is a recognition that attempts to influence attitudes and opinions may be more effective if they resonate with the beliefs that different groups have about the causes of, and solutions for, obesity. Limited research has explored how attitudinal factors may inform the development of both upstream and downstream social marketing initiatives. Methods Computer-assisted face-to-face interviews were conducted with 159 parents and 184 of their children (aged 9–18 years old) in two Australian states. A mixed methods approach was used to assess attitudes towards obesity, and elucidate why different groups held various attitudes towards obesity. Participants were quantitatively assessed on eight dimensions relating to the severity and extent, causes and responsibility, possible remedies, and messaging strategies. Cluster analysis was used to determine attitudinal clusters. Participants were also able to qualify each answer. Qualitative responses were analysed both within and across attitudinal clusters using a constant comparative method. Results Three clusters were identified. Concerned Internalisers (27% of the sample) judged that obesity was a serious health problem, that Australia had among the highest levels of obesity in the world and that prevalence was rapidly increasing. They situated the causes and remedies for the obesity crisis in individual choices. Concerned Externalisers (38% of the sample) held similar views about the severity and extent of the obesity crisis. However, they saw responsibility and remedies as a societal rather than an individual issue. 
The final cluster, the Moderates, which contained significantly more children and males, believed that obesity was not such an important public health issue, and judged the extent of obesity to be less extreme than the other clusters. Conclusion Attitudinal clusters provide new information and insights which may be useful in tailoring anti-obesity social marketing initiatives. PMID:24119724

  13. Group sequential designs for stepped-wedge cluster randomised trials

    PubMed Central

    Grayling, Michael J; Wason, James MS; Mander, Adrian P

    2017-01-01

    Background/Aims: The stepped-wedge cluster randomised trial design has received substantial attention in recent years. Although various extensions to the original design have been proposed, no guidance is available on the design of stepped-wedge cluster randomised trials with interim analyses. In an individually randomised trial setting, group sequential methods can provide notable efficiency gains and ethical benefits. We address this by discussing how established group sequential methodology can be adapted for stepped-wedge designs. Methods: Utilising the error spending approach to group sequential trial design, we detail the assumptions required for the determination of stepped-wedge cluster randomised trials with interim analyses. We consider early stopping for efficacy, futility, or efficacy and futility. We describe first how this can be done for any specified linear mixed model for data analysis. We then focus on one particular commonly utilised model and, using a recently completed stepped-wedge cluster randomised trial, compare the performance of several designs with interim analyses to the classical stepped-wedge design. Finally, the performance of a quantile substitution procedure for dealing with the case of unknown variance is explored. Results: We demonstrate that the incorporation of early stopping in stepped-wedge cluster randomised trial designs could reduce the expected sample size under the null and alternative hypotheses by up to 31% and 22%, respectively, with no cost to the trial’s type-I and type-II error rates. The use of restricted error maximum likelihood estimation was found to be more important than quantile substitution for controlling the type-I error rate. Conclusion: The addition of interim analyses into stepped-wedge cluster randomised trials could help guard against time-consuming trials conducted on poor performing treatments and also help expedite the implementation of efficacious treatments. 
In future, trialists should consider incorporating early stopping of some kind into stepped-wedge cluster randomised trials according to the needs of the particular trial. PMID:28653550
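
    The error spending idea mentioned in the Methods can be sketched as follows. The two spending functions shown are the widely used Lan-DeMets O'Brien-Fleming-type and Pocock-type forms; choosing them here is an assumption, since the abstract does not say which functions the authors used, and the function names and the four equally spaced looks are hypothetical.

```python
import math
from statistics import NormalDist


def obrien_fleming_spending(t, alpha=0.05):
    """O'Brien-Fleming-type spending: alpha(t) = 2 - 2 * Phi(z_{alpha/2} / sqrt(t))."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return 2 * (1 - NormalDist().cdf(z / math.sqrt(t)))


def pocock_spending(t, alpha=0.05):
    """Pocock-type spending: alpha(t) = alpha * ln(1 + (e - 1) * t)."""
    return alpha * math.log(1 + (math.e - 1) * t)


# Cumulative type-I error spent at four equally spaced interim looks,
# where t is the fraction of the total information accrued so far.
looks = [0.25, 0.5, 0.75, 1.0]
obf_cumulative = [obrien_fleming_spending(t) for t in looks]
pocock_cumulative = [pocock_spending(t) for t in looks]
```

    Both functions spend the full alpha by the final analysis (t = 1). The O'Brien-Fleming-type form spends very little early on, so early stopping for efficacy requires stronger evidence than under the Pocock-type form.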

  14. The mitochondrial COB region in yeast codes for apocytochrome b and is mosaic.

    PubMed

    Haid, A; Schweyen, R J; Bechmann, H; Kaudewitz, F; Solioz, M; Schatz, G

    1979-03-01

    Mitochondrial mutants of Saccharomyces cerevisiae defective in cytochrome b were analyzed genetically and biochemically in order to elucidate the role of the mitochondrial genetic system in the biosynthesis of this cytochrome. The mutants mapped between OLI1 and OLI2 on mitochondrial DNA in a region called COB. A fine structure map of the COB region was constructed by rho- deletion mapping and recombination analysis. The combined genetic and biochemical data indicate that the COB region is mosaic and contains at least five distinct clusters of mutants, A-E, with A being closest to OLI2 and E being closest to OLI1. Clusters A, C and E are probably coding regions for apocytochrome b, whereas clusters B and D seem to be involved in as yet unknown functions. These conclusions rest on the following evidence. 1. Most mutants in clusters A, C and E have specifically lost cytochrome b. Many of them accumulate smaller mitochondrial translation products; some of these were identified as fragments of apocytochrome b by proteolytic fingerprinting. The molecular weight of these fragments depends on the map position of the mutant, increasing in the direction OLI2 → OLI1. The mutant closest to OLI1 accumulates an apocytochrome b which is slightly larger than that of wild type. 2. A mutant in cluster C exhibits a spectral absorption band of cytochrome b that is shifted 1.5 nm to the red. 3. Mutants in clusters B and D are pleiotropic. A majority of them are conditional and lack the absorption bands of both cytochrome b and cytochrome aa3; these mutants also fail to accumulate apocytochrome b and subunit I of cytochrome c oxidase and instead form a large number of abnormal translation products whose nature is unknown. 4. Zygotic complementation tests reveal at least two complementation groups: The first group includes all mutants in cluster B and the second group includes mutants in clusters (A + C + D + E).

  15. [Typologies of Madrid's citizens (Spain) at the end-of-life: cluster analysis].

    PubMed

    Ortiz-Gonçalves, Belén; Perea-Pérez, Bernardo; Labajo González, Elena; Albarrán Juan, Elena; Santiago-Sáez, Andrés

    2018-03-06

    To establish typologies within Madrid's citizens (Spain) with regard to end-of-life by cluster analysis. The SPAD 8 programme was implemented in a sample from a health care centre in the autonomous region of Madrid (Spain). A multiple correspondence analysis technique was used, followed by a cluster analysis to create a dendrogram. A cross-sectional study was made beforehand with the results of the questionnaire. Five clusters stand out. Cluster 1: a group who preferred not to answer numerous questions (5%). Cluster 2: in favour of receiving palliative care and euthanasia (40%). Cluster 3: would oppose assisted suicide and would not ask for spiritual assistance (15%). Cluster 4: would like to receive palliative care and assisted suicide (16%). Cluster 5: would oppose assisted suicide and would ask for spiritual assistance (24%). The following four clusters stood out. Clusters 2 and 4 would like to receive palliative care, euthanasia (2) and assisted suicide (4). Clusters 4 and 5 regularly practiced their faith and their family members did not receive palliative care. Clusters 3 and 5 would be opposed to euthanasia and assisted suicide in particular. Clusters 2, 4 and 5 had not completed an advance directive document (2, 4 and 5). Clusters 2 and 3 seldom practiced their faith. This study could be taken into consideration to improve the quality of end-of-life care choices. Copyright © 2017 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.

  16. Two worlds collide: Image analysis methods for quantifying structural variation in cluster molecular dynamics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Steenbergen, K. G., E-mail: kgsteen@gmail.com; Gaston, N.

    2014-02-14

    Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement for a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.

  17. Two worlds collide: image analysis methods for quantifying structural variation in cluster molecular dynamics.

    PubMed

    Steenbergen, K G; Gaston, N

    2014-02-14

    Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement for a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.

  18. Clinical Phenotype of Diabetic Peripheral Neuropathy and Relation to Symptom Patterns: Cluster and Factor Analysis in Patients with Type 2 Diabetes in Korea.

    PubMed

    Won, Jong Chul; Im, Yong-Jin; Lee, Ji-Hyun; Kim, Chong Hwa; Kwon, Hyuk Sang; Cha, Bong-Yun; Park, Tae Sun

    2017-01-01

    Diabetic peripheral neuropathy (DPN) is the most common complication among patients with diabetes. However, patients usually suffer not only from diverse sensory deficits but also from neuropathy-related discomforts. The aim of this study is to identify distinct groups of patients with DPN with respect to its clinical impacts on symptom patterns and comorbidities. A hierarchical cluster analysis and factor analysis were performed to identify relevant subgroups of patients with DPN (n = 1338) and symptom patterns. Patients with DPN were divided into three clusters: asymptomatic (cluster 1, n = 448, 33.5%), moderate symptoms with disturbed sleep (cluster 2, n = 562, 42.0%), and severe symptoms with decreased quality of life (cluster 3, n = 328, 24.5%). Patients in cluster 3, compared with clusters 1 and 2, were characterized by higher levels of HbA1c and more severe pain and physical impairments. Patients in cluster 2 had moderate pain levels but disturbed sleep patterns comparable to those in cluster 3. The frequency of symptoms on each item of MNSI by "painful" symptom pattern showed a similar distribution pattern with increasing intensities along the three clusters. Cluster and factor analysis endorsed the use of comprehensive and symptomatic subgrouping to individualize the evaluation of patients with DPN.

  19. Kinematic foot types in youth with equinovarus secondary to hemiplegia

    PubMed Central

    Krzak, Joseph J.; Corcos, Daniel M.; Damiano, Diane L.; Graf, Adam; Hedeker, Donald; Smith, Peter A.; Harris, Gerald F.

    2015-01-01

    Background Elevated kinematic variability of the foot and ankle segments exists during gait among individuals with equinovarus secondary to hemiplegic cerebral palsy (CP). Clinicians have previously addressed such variability by developing classification schemes to identify subgroups of individuals based on their kinematics. Objective To identify kinematic subgroups among youth with equinovarus secondary to CP using 3-dimensional multi-segment foot and ankle kinematics during locomotion as inputs for principal component analysis (PCA) and K-means cluster analysis. Methods In a single assessment session, multi-segment foot and ankle kinematics using the Milwaukee Foot Model (MFM) were collected in 24 children/adolescents with equinovarus and 20 typically developing children/adolescents. Results PCA was used as a data reduction technique on 40 variables. K-means cluster analysis was performed on the first six principal components (PCs), which accounted for 92% of the variance of the dataset. The PCs described the location and plane of involvement in the foot and ankle. Five distinct kinematic subgroups were identified using K-means clustering. Participants with equinovarus presented with variable involvement ranging from primary hindfoot or forefoot deviations to deformity that included both segments in multiple planes. Conclusion This study provides further evidence of the variability in foot characteristics associated with equinovarus secondary to hemiplegic CP. These findings would not have been detected using a single segment foot model. The identification of multiple kinematic subgroups with unique foot and ankle characteristics has the potential to improve treatment since similar patients within a subgroup are likely to benefit from the same intervention(s). PMID:25467429

  20. ETE: a python Environment for Tree Exploration

    PubMed Central

    2010-01-01

    Background Many bioinformatics analyses, ranging from gene clustering to phylogenetics, produce hierarchical trees as their main result. These are used to represent the relationships among different biological entities, thus facilitating their analysis and interpretation. A number of standalone programs are available that focus on tree visualization or that perform specific analyses on them. However, such applications are rarely suitable for large-scale surveys, in which a higher level of automation is required. Currently, many genome-wide analyses rely on tree-like data representation and hence there is a growing need for scalable tools to handle tree structures at large scale. Results Here we present the Environment for Tree Exploration (ETE), a python programming toolkit that assists in the automated manipulation, analysis and visualization of hierarchical trees. ETE libraries provide a broad set of tree handling options as well as specific methods to analyze phylogenetic and clustering trees. Among other features, ETE allows for the independent analysis of tree partitions, has support for the extended newick format, provides an integrated node annotation system and permits to link trees to external data such as multiple sequence alignments or numerical arrays. In addition, ETE implements a number of built-in analytical tools, including phylogeny-based orthology prediction and cluster validation techniques. Finally, ETE's programmable tree drawing engine can be used to automate the graphical rendering of trees with customized node-specific visualizations. Conclusions ETE provides a complete set of methods to manipulate tree data structures that extends current functionality in other bioinformatic toolkits of a more general purpose. ETE is free software and can be downloaded from http://ete.cgenomics.org. PMID:20070885
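
    To make the kind of tree handling described here concrete, the following is a minimal standard-library sketch that parses a simple Newick string and lists the leaves under a node. It is not ETE's API: the function names and the nested-dictionary representation are hypothetical, and branch lengths, quoted labels, and the extended Newick format are deliberately not handled.

```python
def parse_newick(text):
    """Parse a simple Newick string (node names only) into nested dicts.

    Each node is {"name": str, "children": [nodes]}; leaves have no children.
    """
    pos = 0

    def parse_node():
        nonlocal pos
        children = []
        if text[pos] == "(":
            pos += 1  # consume "("
            while True:
                children.append(parse_node())
                if text[pos] == ",":
                    pos += 1  # another sibling follows
                elif text[pos] == ")":
                    pos += 1  # end of this child list
                    break
                else:
                    raise ValueError(f"unexpected character at position {pos}")
        # Read the (possibly empty) node name up to the next delimiter.
        start = pos
        while pos < len(text) and text[pos] not in "(),;":
            pos += 1
        return {"name": text[start:pos], "children": children}

    return parse_node()


def leaf_names(node):
    """Collect leaf names below a node, left to right."""
    if not node["children"]:
        return [node["name"]]
    return [name for child in node["children"] for name in leaf_names(child)]


tree = parse_newick("((A,B)AB,(C,D)CD)root;")
all_leaves = leaf_names(tree)
first_partition = leaf_names(tree["children"][0])  # leaves under internal node "AB"
```

    Analysing one partition independently, as the abstract describes, then amounts to running any per-tree routine on a child node rather than on the root.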

  1. Knowledge transfer for the management of dementia: a cluster-randomised trial of blended learning in general practice

    PubMed Central

    2010-01-01

    Background The implementation of new medical knowledge into general practice is a complex process. Blended learning may offer an effective and efficient educational intervention to reduce the knowledge-to-practice gap. The aim of this study was to compare knowledge acquisition about dementia management between a blended learning approach using online modules in addition to quality circles (QCs) and QCs alone. Methods In this cluster-randomised trial with QCs as clusters and general practitioners (GPs) as participants, 389 GPs from 26 QCs in the western part of Germany were invited to participate. Data on the GPs' knowledge were obtained at three points in time by means of a questionnaire survey. Primary outcome was the knowledge gain before and after the interventions. A subgroup analysis of the users of the online modules was performed. Results 166 GPs were available for analysis and filled out a knowledge test at least two times. A significant increase of knowledge was found in both groups that indicated positive learning effects of both approaches. However, there was no significant difference between the groups. A subgroup analysis of the GPs who self-reported that they had actually used the online modules showed that they had a significant increase in their knowledge scores. Conclusion A blended learning approach was not superior to a QCs approach for improving knowledge about dementia management. However, a subgroup of GPs who were motivated to actually use the online modules had a gain in knowledge. Trial registration Current Controlled Trials ISRCTN36550981. PMID:20047652

  2. The myeloproliferative neoplasms, unclassifiable: clinical and pathological considerations.

    PubMed

    Gianelli, Umberto; Cattaneo, Daniele; Bossi, Anna; Cortinovis, Ivan; Boiocchi, Leonardo; Liu, Yen-Chun; Augello, Claudia; Bonometti, Arturo; Fiori, Stefano; Orofino, Nicola; Guidotti, Francesca; Orazi, Attilio; Iurlo, Alessandra

    2017-02-01

    In this study, we investigate in detail the morphological, clinical and molecular features of 71 consecutive patients with a diagnosis of myeloproliferative neoplasms, unclassifiable. We performed a meticulous morphological analysis and found that most of the cases displayed a hypercellular bone marrow (70%) with normal erythropoiesis without left-shifting (59%), increased granulopoiesis with left-shifting (73%) and increased megakaryocytes with loose clustering (96%). Megakaryocytes displayed frequent giant forms with hyperlobulated or bulbous nuclei and/or other maturation defects. Interestingly, more than half of the cases displayed severe bone marrow fibrosis (59%). Median values of hemoglobin level and white blood cells count were all within the normal range; in contrast, median platelets count and lactate dehydrogenase were increased. Little less than half of the patients (44%) showed splenomegaly. JAK2V617F mutation was detected in 72% of all patients. Among the JAK2-negative cases, MPLW515L mutation was found in 17% and CALR mutations in 67% of the investigated cases, respectively. Finally, by multiple correspondence analysis of the morphological profiles, we found that all but four of the cases could be grouped in three morphological clusters with some features similar to those of the classic BCR-ABL1-negative myeloproliferative neoplasms. Analysis of the clinical parameters in these three clusters revealed discrepancies with the morphological profile in about 55% of the patients. In conclusion, we found that the category of myeloproliferative neoplasm, unclassifiable is heterogeneous but identification of different subgroups is possible and should be recommended for a better management of these patients.

  3. A hybrid monkey search algorithm for clustering analysis.

    PubMed

    Chen, Xin; Zhou, Yongquan; Luo, Qifang

    2014-01-01

    Clustering is a popular data analysis and data mining technique. The k-means clustering algorithm is one of the most commonly used methods. However, it depends heavily on the initial solution and easily falls into a local optimum. In view of these disadvantages of the k-means method, this paper proposes a hybrid monkey algorithm, based on the search operator of the artificial bee colony algorithm, for clustering analysis. Experiments on synthetic and real-life datasets show that the algorithm performs better than the basic monkey algorithm for clustering analysis.
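
    The dependence of k-means on its starting point can be seen in a small sketch. The code below is plain Lloyd-style k-means on one-dimensional toy data, not the hybrid monkey algorithm from this record; the function name, the data and both sets of starting centroids are made up for illustration.

```python
def kmeans_1d(points, centroids, max_iter=100):
    """Lloyd-style k-means on 1-D data from the given starting centroids.
    Returns the final centroids and the within-cluster sum of squared errors."""
    centroids = list(centroids)
    for _ in range(max_iter):
        clusters = [[] for _ in centroids]
        for x in points:
            # Assign each point to its nearest centroid (first one wins ties).
            nearest = min(range(len(centroids)),
                          key=lambda j: (x - centroids[j]) ** 2)
            clusters[nearest].append(x)
        # Recompute centroids; an empty cluster keeps its old centroid.
        updated = [sum(c) / len(c) if c else centroids[j]
                   for j, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    sse = sum((x - centroids[j]) ** 2
              for j, c in enumerate(clusters) for x in c)
    return centroids, sse


data = [0, 1, 2, 10, 11, 12, 20, 21, 22]
_, sse_good = kmeans_1d(data, [1, 11, 21])  # one start per visible group
_, sse_bad = kmeans_1d(data, [0, 1, 2])     # all starts inside one group
```

    With these toy inputs the second run converges to a solution that merges two of the three groups, which is the kind of local optimum that global search heuristics are meant to escape.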

  4. The use of oxygen in cluster headache treatment worldwide - a survey of the International Headache Society (IHS).

    PubMed

    Evers, Stefan; Rapoport, Alan

    2017-04-01

    Background Oxygen is recommended for the treatment of acute cluster headache attacks. However, it is not available worldwide. Methods The International Headache Society performed a survey among its national member societies on the availability and the restrictions for oxygen in the treatment of cluster headache. Results Oxygen is reimbursed in 50% of all countries responding (n = 22). There are additional restrictions in the reimbursement of the facial mask and with respect to age. Conclusion Oxygen for the treatment of cluster headache attack is not reimbursed worldwide. Headache societies should pressure national/public health authorities to reimburse oxygen for cluster headache in all countries.

  5. Method for exploratory cluster analysis and visualisation of single-trial ERP ensembles.

    PubMed

    Williams, N J; Nasuto, S J; Saddy, J D

    2015-07-30

    The validity of ensemble averaging on event-related potential (ERP) data has been questioned, due to its assumption that the ERP is identical across trials. Thus, there is a need for preliminary testing for cluster structure in the data. We propose a complete pipeline for the cluster analysis of ERP data. To increase the signal-to-noise ratio (SNR) of the raw single-trials, we used a denoising method based on Empirical Mode Decomposition (EMD). Next, we used a bootstrap-based method to determine the number of clusters, through a measure called the Stability Index (SI). We then used a clustering algorithm based on a Genetic Algorithm (GA) to define initial cluster centroids for subsequent k-means clustering. Finally, we visualised the clustering results through a scheme based on Principal Component Analysis (PCA). After validating the pipeline on simulated data, we tested it on data from two experiments - a P300 speller paradigm on a single subject and a language processing study on 25 subjects. Results revealed evidence for the existence of 6 clusters in one experimental condition from the language processing study. Further, a two-way chi-square test revealed an influence of subject on cluster membership. Our analysis operates on denoised single-trials, the number of clusters is determined in a principled manner, and the results are presented through an intuitive visualisation. Given the cluster structure in some experimental conditions, we suggest application of cluster analysis as a preliminary step before ensemble averaging. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Prevalence of the Catatonic Syndrome in an Acute Inpatient Sample

    PubMed Central

    Stuivenga, Mirella; Morrens, Manuel

    2014-01-01

    Objective: In this exploratory open label study, we investigated the prevalence of catatonia in an acute psychiatric inpatient population. In addition, differences in symptom presentation of catatonia depending on the underlying psychiatric illness were investigated. Methods: One hundred thirty patients were assessed with the Bush–Francis Catatonia Rating Scale (BFCRS), the Positive and Negative Syndrome Scale, the Young Mania Rating Scale, and the Simpson–Angus Scale. A factor analysis was conducted in order to generate six catatonic symptom clusters. Composite scores based on this principal component analysis were calculated. Results: When focusing on the first 14 items of the BFCRS, 101 patients (77.7%) had at least 1 symptom scoring 1 or higher, whereas, 66 patients (50.8%) had at least 2 symptoms. Interestingly, when focusing on the DSM-5 criteria of catatonia, 22 patients (16.9%) could be considered for this diagnosis. Furthermore, different symptom profiles were found, depending on the underlying psychopathology. Psychotic symptomatology correlated strongly with excitement symptomatology (r = 0.528, p < 0.001) and to a lesser degree with the stereotypy/mannerisms symptom cluster (r = 0.289; p = 0.001) and the echo/perseveration symptom cluster (r = 0.185; p = 0.035). Similarly, manic symptomatology correlated strongly with the excitement symptom cluster (r = 0.596; p < 0.001) and to a lesser extent with the stereotypy/mannerisms symptom cluster (r = 0.277; p = 0.001). Conclusion: There was a high prevalence of catatonic symptomatology. Depending on the criteria being used, we noticed an important difference in exact prevalence, which makes it clear that we need clear-cut criteria. Another important finding is the fact that the catatonic presentation may vary depending on the underlying pathology, although an unambiguous delineation between these catatonic presentations cannot be made. 
Future research is needed to determine clinically relevant diagnostic criteria for catatonia. PMID:25520674

  7. Emergence of sporadic non-clustered cases of hospital-associated listeriosis among immunocompromised adults in southern Taiwan from 1992 to 2013: effect of precipitating immunosuppressive agents

    PubMed Central

    2014-01-01

    Background Sporadic non-clustered hospital-associated listeriosis is an emerging infectious disease in immunocompromised hosts. The current study was designed to determine the impact of long-term and precipitating immunosuppressive agents and underlying diseases on triggering the expression of the disease, and to compare the clinical features and outcome of hospital-associated and community-associated listeriosis. Methods We reviewed the medical records of all patients with Listeria monocytogenes isolated from sterile body sites at a large medical center in southern Taiwan during 1992–2013. Non-clustered cases were defined as those unrelated to any other in time or place. Multivariable regression analysis was used to determine factors associated with prognosis. Results Thirty-five non-clustered cases of listeriosis were identified. Twelve (34.2%) were hospital-associated, and 23 (65.7%) were community-associated. The 60-day mortality was significantly greater in hospital-associated than in community-associated cases (66.7% vs. 17.4%, p = 0.007). Significantly more hospital-associated than community-associated cases were treated with a precipitating immunosuppressive agent within 4 weeks prior to onset of listeriosis (91.7% vs. 4.3%, respectively p < 0.001). The median period from the start of precipitating immunosuppressive treatment to the onset of listeriosis-related symptoms was 12 days (range, 4–27 days) in 11 of the 12 hospital-associated cases. In the multivariable analysis, APACHE II score >21 (p = 0.04) and receipt of precipitating immunosuppressive therapy (p = 0.02) were independent risk factors for 60-day mortality. Conclusions Sporadic non-clustered hospital-associated listeriosis needs to be considered in the differential diagnosis of sepsis in immunocompromised patients, particularly in those treated with new or increased doses of immunosuppressive agents. PMID:24641498

  8. Self-similarity of temperature profiles in distant galaxy clusters: the quest for a universal law

    NASA Astrophysics Data System (ADS)

    Baldi, A.; Ettori, S.; Molendi, S.; Gastaldello, F.

    2012-09-01

    Context. We present the XMM-Newton temperature profiles of 12 bright (L_X > 4 × 10^44 erg s^-1) clusters of galaxies at 0.4 < z < 0.9, having an average temperature in the range 5 ≲ kT ≲ 11 keV. Aims: The main goal of this paper is to study for the first time the temperature profiles of a sample of high-redshift clusters, to investigate their properties, and to define a universal law to describe the temperature radial profiles in galaxy clusters as a function of both cosmic time and their state of relaxation. Methods: We performed a spatially resolved spectral analysis, using Cash statistics, to measure the temperature in the intracluster medium at different radii. Results: We extracted temperature profiles for the clusters in our sample, finding that all profiles are declining toward larger radii. The normalized temperature profiles (normalized by the mean temperature T_500) are found to be generally self-similar. The sample was subdivided into five cool-core (CC) and seven non cool-core (NCC) clusters by introducing a pseudo-entropy ratio σ = (T_IN/T_OUT) × (EM_IN/EM_OUT)^(-1/3) and defining the objects with σ < 0.6 as CC clusters and those with σ ≥ 0.6 as NCC clusters. The profiles of CC and NCC clusters differ mainly in the central regions, with the latter exhibiting a slightly flatter central profile. A significant dependence of the temperature profiles on the pseudo-entropy ratio σ is detected by fitting a function of r and σ, showing an indication that the outer part of the profiles becomes steeper for higher values of σ (i.e. transitioning toward the NCC clusters). No significant evidence of redshift evolution could be found within the redshift range sampled by our clusters (0.4 < z < 0.9). A comparison of our high-z sample with intermediate clusters at 0.1 < z < 0.3 showed how the CC and NCC cluster temperature profiles have experienced some sort of evolution. 
This can happen because higher z clusters are at a less advanced stage of their formation and did not have enough time to create a relaxed structure, which is characterized by a central temperature dip in CC clusters and by flatter profiles in NCC clusters. Conclusions: This is the first time that a systematic study of the temperature profiles of galaxy clusters at z > 0.4 has been attempted. We were able to define the closest possible relation to a universal law for the temperature profiles of galaxy clusters at 0.1 < z < 0.9, showing a dependence on both the relaxation state of the clusters and the redshift. Appendix A is only available in electronic form at http://www.aanda.org
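
    The cool-core split described in this record reduces to a one-line formula and a threshold, sketched below. It assumes the pseudo-entropy ratio is the inner-to-outer temperature ratio multiplied by the inner-to-outer emission-measure ratio raised to the power -1/3, with 0.6 as the dividing value, as stated in the abstract. The function names and the input numbers are hypothetical and are not measurements from the paper.

```python
def pseudo_entropy_ratio(t_in, t_out, em_in, em_out):
    """sigma = (T_IN / T_OUT) * (EM_IN / EM_OUT) ** (-1/3)."""
    return (t_in / t_out) * (em_in / em_out) ** (-1.0 / 3.0)


def classify_cluster(sigma, threshold=0.6):
    """Label a cluster cool-core (CC) if sigma is below the threshold,
    otherwise non cool-core (NCC)."""
    return "CC" if sigma < threshold else "NCC"


# Made-up inputs: a cool, dense centre versus a flat profile.
sigma_cool = pseudo_entropy_ratio(t_in=4.0, t_out=8.0, em_in=8.0, em_out=1.0)
sigma_flat = pseudo_entropy_ratio(t_in=8.0, t_out=8.0, em_in=1.0, em_out=1.0)
labels = (classify_cluster(sigma_cool), classify_cluster(sigma_flat))
```

    A lower central temperature and a higher central emission measure both push the ratio down, so relaxed systems with a dense, cool centre fall on the CC side of the threshold.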

  9. Elucidation of the Pattern of the Onset of Male Lower Urinary Tract Symptoms Using Cluster Analysis: Efficacy of Tamsulosin in Each Symptom Group.

    PubMed

    Aikawa, Ken; Kataoka, Masao; Ogawa, Soichiro; Akaihata, Hidenori; Sato, Yuichi; Yabe, Michihiro; Hata, Junya; Koguchi, Tomoyuki; Kojima, Yoshiyuki; Shiragasawa, Chihaya; Kobayashi, Toshimitsu; Yamaguchi, Osamu

    2015-08-01

    To present a new grouping of male patients with lower urinary tract symptoms (LUTS) based on symptom patterns and clarify whether the therapeutic effect of α1-blocker differs among the groups. We performed secondary analysis of anonymous data from 4815 patients enrolled in a postmarketing surveillance study of tamsulosin in Japan. Data on 7 International Prostate Symptom Score (IPSS) items at the initial visit were used in the cluster analysis. IPSS and quality of life (QOL) scores before and after tamsulosin treatment for 12 weeks were assessed in each cluster. Partial correlation coefficients were also obtained for IPSS and QOL scores based on changes before and after treatment. Five symptom groups were identified by cluster analysis of IPSS. On their symptom profile, each cluster was labeled as minimal type (cluster 1), multiple severe type (cluster 2), weak stream type (cluster 3), storage type (cluster 4), and voiding type (cluster 5). Prevalence and the mean symptom score were significantly improved in almost all symptoms in all clusters by tamsulosin treatment. Nocturia and weak stream had the strongest effect on QOL in clusters 1, 2, and 4 and clusters 3 and 5, respectively. The study clarified that 5 characteristic symptom patterns exist by cluster analysis of IPSS in male patients with LUTS. Tamsulosin improved various symptoms and QOL in each symptom group. The study reports many male patients with LUTS being satisfied with monotherapy using tamsulosin and suggests the usefulness of α1-blockers as a drug of first choice. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Multiscale visual quality assessment for cluster analysis with self-organizing maps

    NASA Astrophysics Data System (ADS)

    Bernard, Jürgen; von Landesberger, Tatiana; Bremm, Sebastian; Schreck, Tobias

    2011-01-01

    Cluster analysis is an important data mining technique for analyzing large amounts of data, reducing many objects to a limited number of clusters. Cluster visualization techniques aim at supporting the user in better understanding the characteristics and relationships among the found clusters. While promising approaches to visual cluster analysis already exist, these usually fall short of incorporating the quality of the obtained clustering results. However, due to the nature of the clustering process, quality plays an important aspect, as for most practical data sets, typically many different clusterings are possible. Being aware of clustering quality is important to judge the expressiveness of a given cluster visualization, or to adjust the clustering process with refined parameters, among others. In this work, we present an encompassing suite of visual tools for quality assessment of an important visual cluster algorithm, namely, the Self-Organizing Map (SOM) technique. We define, measure, and visualize the notion of SOM cluster quality along a hierarchy of cluster abstractions. The quality abstractions range from simple scalar-valued quality scores up to the structural comparison of a given SOM clustering with output of additional supportive clustering methods. The suite of methods allows the user to assess the SOM quality on the appropriate abstraction level, and arrive at improved clustering results. We implement our tools in an integrated system, apply it on experimental data sets, and show its applicability.
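
    As an example of a simple scalar-valued quality score for a SOM, the sketch below computes the quantization error: the mean distance from each sample to its best-matching codebook vector. The abstract does not say which scores the authors use, so treating quantization error as one of them is an assumption; the function name and the toy data are hypothetical.

```python
import math


def quantization_error(samples, codebook):
    """Mean Euclidean distance from each sample to its nearest codebook
    vector (its best-matching unit). Lower values mean a tighter fit."""
    total = 0.0
    for sample in samples:
        total += min(math.dist(sample, unit) for unit in codebook)
    return total / len(samples)


# Toy 2-D data and a two-unit codebook, for illustration only.
toy_samples = [(0.0, 0.0), (3.0, 4.0)]
toy_codebook = [(0.0, 0.0), (3.0, 0.0)]
qe = quantization_error(toy_samples, toy_codebook)
```

    A single number like this hides where on the map the fit is poor, which is one reason to pair scalar scores with per-cluster and structural views.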

  11. Internal dynamics of the radio-halo cluster A2219: A multi-wavelength analysis

    NASA Astrophysics Data System (ADS)

    Boschin, W.; Girardi, M.; Barrena, R.; Biviano, A.; Feretti, L.; Ramella, M.

    2004-03-01

    We present the results of the dynamical analysis of the rich, hot, and X-ray very luminous galaxy cluster A2219, containing a powerful diffuse radio-halo. Our analysis is based on new redshift data for 27 galaxies in the cluster region, measured from spectra obtained at the TNG, with the addition of other 105 galaxies recovered from reduction of CFHT archive data in a cluster region of ~5 arcmin radius (~0.8 h^-1 Mpc at the cluster distance) centered on the cD galaxy. The investigation of the dynamical status is also performed using X-ray data stored in the Chandra archive. Further, valuable information comes from other bands - optical photometric, infrared, and radio data - which are analyzed and/or discussed, too. We find that A2219 appears as a peak in the velocity space at z = 0.225, and select 113 cluster members. We compute a high value for the line-of-sight velocity dispersion, σ_v = 1438 (+109/-86) km s^-1, consistent with the high average X-ray temperature of 10.3 keV. If dynamical equilibrium is assumed, the virial theorem leads to M ~ 2.8 × 10^15 M⊙ for the global mass within the virial region. However, further investigation based on both optical and X-ray data shows significant signs of a young dynamical status. In fact, we find strong evidence for the elongation of the cluster in the SE-NW direction coupled with a significant velocity gradient, as well as for the presence of substructure both in optical data and X-ray data. Moreover, we point out the presence of several active galaxies. We discuss the results of our multi-wavelength investigation suggesting a complex merging scenario where the main, original structure is subject to an ongoing merger with a few clumps aligned in a filament in the foreground oriented in an oblique direction with respect to the line-of-sight. Our conclusion supports the view of the connection between extended radio emission and merging phenomena in galaxy clusters. 
Based on observations made on the island of La Palma with the Italian Telescopio Nazionale Galileo (TNG) operated by the Centro Galileo Galilei of the INAF (Istituto Nazionale di Astrofisica) and with the 1.0 m Jacobus Kapteyn Telescope (JKT) operated by the Isaac Newton Group at the Spanish Observatorio de Roque de los Muchachos of the Instituto de Astrofisica de Canarias. Table 1 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/416/839

  12. RNA-seq analysis identifies an intricate regulatory network controlling cluster root development in white lupin

    PubMed Central

    2014-01-01

    Background Highly adapted plant species are able to alter their root architecture to improve nutrient uptake and thrive in environments with limited nutrient supply. Cluster roots (CRs) are specialised structures of dense lateral roots formed by several plant species for the effective mining of nutrient rich soil patches through a combination of increased surface area and exudation of carboxylates. White lupin is becoming a model-species allowing for the discovery of gene networks involved in CR development. A greater understanding of the underlying molecular mechanisms driving these developmental processes is important for the generation of smarter plants for a world with diminishing resources to improve food security. Results RNA-seq analyses for three developmental stages of the CR formed under phosphorus-limited conditions and two of non-cluster roots have been performed for white lupin. In total 133,045,174 high-quality paired-end reads were used for a de novo assembly of the root transcriptome and merged with LAGI01 (Lupinus albus gene index) to generate an improved LAGI02 with 65,097 functionally annotated contigs. This was followed by comparative gene expression analysis. We show marked differences in the transcriptional response across the various cluster root stages to adjust to phosphate limitation by increasing uptake capacity and adjusting metabolic pathways. Several transcription factors such as PLT, SCR, PHB, PHV or AUX/IAA with a known role in the control of meristem activity and developmental processes show an increased expression in the tip of the CR. Genes involved in hormonal responses (PIN, LAX, YUC) and cell cycle control (CYCA/B, CDK) are also differentially expressed. In addition, we identify primary transcripts of miRNAs with established function in the root meristem. Conclusions Our gene expression analysis shows an intricate network of transcription factors and plant hormones controlling CR initiation and formation. 
In addition, functional differences between the different CR developmental stages in the acclimation to phosphorus starvation have been identified. PMID:24666749

  13. Sequence Similarity of Clostridium difficile Strains by Analysis of Conserved Genes and Genome Content Is Reflected by Their Ribotype Affiliation

    PubMed Central

    Kurka, Hedwig; Ehrenreich, Armin; Ludwig, Wolfgang; Monot, Marc; Rupnik, Maja; Barbut, Frederic; Indra, Alexander; Dupuy, Bruno; Liebl, Wolfgang

    2014-01-01

    PCR-ribotyping is a broadly used method for the classification of isolates of Clostridium difficile, an emerging intestinal pathogen, causing infections with increased disease severity and incidence in several European and North American countries. We have now carried out clustering analysis with selected genes of numerous C. difficile strains as well as gene content comparisons of their genomes in order to broaden our view of the relatedness of strains assigned to different ribotypes. We analyzed the genomic content of 48 C. difficile strains representing 21 different ribotypes. The calculation of distance matrix-based dendrograms using the neighbor joining method for 14 conserved genes (standard phylogenetic marker genes) from the genomes of the C. difficile strains demonstrated that the genes from strains with the same ribotype generally clustered together. Further, certain ribotypes always clustered together and formed ribotype groups, i.e. ribotypes 078, 033 and 126, as well as ribotypes 002 and 017, indicating their relatedness. Comparisons of the gene contents of the genomes of ribotypes that clustered according to the conserved gene analysis revealed that the number of common genes of the ribotypes belonging to each of these three ribotype groups were very similar for the 078/033/126 group (at most 69 specific genes between the different strains with the same ribotype) but less similar for the 002/017 group (86 genes difference). It appears that the ribotype is indicative not only of a specific pattern of the amplified 16S–23S rRNA intergenic spacer but also reflects specific differences in the nucleotide sequences of the conserved genes studied here. It can be anticipated that the sequence deviations of more genes of C. difficile strains are correlated with their PCR-ribotype. In conclusion, the results of this study corroborate and extend the concept of clonal C. difficile lineages, which correlate with ribotypes affiliation. PMID:24482682

  14. Liver Ischemic Preconditioning (IPC) Improves Intestinal Microbiota Following Liver Transplantation in Rats through 16s rDNA-Based Analysis of Microbial Structure Shift

    PubMed Central

    Lu, Haifeng; Chen, Xinhua; Jiang, Jianwen; Liu, Hui; He, Yong; Ding, Songming; Hu, Zhenhua; Wang, Weilin; Zheng, Shusen

    2013-01-01

    Background Ischemia-reperfusion (I/R) injury is associated with intestinal microbial dysbiosis. The “gut-liver axis” closely links gut function and liver function in health and disease. Ischemic preconditioning (IPC) has been proven to reduce I/R injury in the surgery. This study aims to explore the effect of IPC on intestinal microbiota and to analyze characteristics of microbial structure shift following liver transplantation (LT). Methods The LT animal models of liver and gut IPC were established. Hepatic graft function was assessed by histology and serum ALT/AST. Intestinal barrier function was evaluated by mucosal ultrastructure, serum endotoxin, bacterial translocation, fecal sIgA content and serum TNF-α. Intestinal bacterial populations were determined by quantitative PCR. Microbial composition was characterized by DGGE and specific bacterial species were determined by sequence analysis. Principal Findings Liver IPC improved hepatic graft function expressed as ameliorated graft structure and reduced ALT/AST levels. After administration of liver IPC, intestinal mucosal ultrastructure improved, serum endotoxin and bacterial translocation mildly decreased, fecal sIgA content increased, and serum TNF-α decreased. Moreover, liver IPC promoted microbial restorations mainly through restoring Bifidobacterium spp., Clostridium clusters XI and Clostridium cluster XIVab on bacterial genus level. DGGE profiles indicated that liver IPC increased microbial diversity and species richness, and cluster analysis demonstrated that microbial structures were similar and clustered together between the NC group and Liver-IPC group. Furthermore, the phylogenetic tree of band sequences showed key bacteria corresponding to 10 key band classes of microbial structure shift induced by liver IPC, most of which were assigned to Bacteroidetes phylum. 
Conclusion Liver IPC cannot only improve hepatic graft function and intestinal barrier function, but also promote restorations of intestinal microbiota following LT, which may further benefit hepatic graft by positive feedback of the “gut-liver axis”. PMID:24098410

  15. Multiple-locus variable-number tandem repeat analysis for molecular typing of Aspergillus fumigatus

    PubMed Central

    2010-01-01

    Background Multiple-locus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related microbial isolates to provide information for establishing genetic patterns among isolates and to investigate disease outbreaks. The usefulness of MLVA was recently demonstrated for the avian major pathogen Chlamydophila psittaci. In the present study, we developed a similar method for another pathogen of birds: the filamentous fungus Aspergillus fumigatus. Results We selected 10 VNTR markers located on 4 different chromosomes (1, 5, 6 and 8) of A. fumigatus. These markers were tested with 57 unrelated isolates from different hosts or their environment (53 isolates from avian species in France, China or Morocco, 3 isolates from humans collected at CHU Henri Mondor hospital in France and the reference strain CBS 144.89). The Simpson index for individual markers ranged from 0.5771 to 0.8530. A combined loci index calculated with all the markers yielded an index of 0.9994. In a second step, the panel of 10 markers was used in different epidemiological situations and tested on 277 isolates, including 62 isolates from birds in Guangxi province in China, 95 isolates collected in two duck farms in France and 120 environmental isolates from a turkey hatchery in France. A database was created with the results of the present study http://minisatellites.u-psud.fr/MLVAnet/. Three major clusters of isolates were defined by using the graphing algorithm termed Minimum Spanning Tree (MST). The first cluster comprised most of the avian isolates collected in the two duck farms in France, the second cluster comprised most of the avian isolates collected in poultry farms in China and the third one comprised most of the isolates collected in the turkey hatchery in France. Conclusions MLVA displayed excellent discriminatory power. The method showed a good reproducibility. 
MST analysis revealed an interesting clustering with a clear separation between isolates according to their geographic origin rather than their respective hosts. PMID:21143842
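
    The discriminatory power quoted in this record can be illustrated with a short calculation. The sketch assumes the Simpson index is used in the form common in typing studies, D = 1 - sum(n_j (n_j - 1)) / (N (N - 1)), where n_j is the number of isolates sharing genotype j and N is the total number of isolates. The abstract does not state the exact formula, and the function name and toy genotypes are hypothetical.

```python
from collections import Counter


def simpson_diversity(genotypes):
    """Probability that two isolates drawn without replacement
    have different genotypes."""
    n = len(genotypes)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(genotypes).values()
    return 1.0 - sum(c * (c - 1) for c in counts) / (n * (n - 1))


# Toy genotype labels: two isolates share a profile, two are unique.
toy_index = simpson_diversity(["a", "a", "b", "c"])
```

    Under this form, a value close to 1 means that almost every pair of isolates is told apart by the marker panel, and a value of 0 means that all isolates share one profile.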

  16. Finding gene clusters for a replicated time course study

    PubMed Central

    2014-01-01

    Background Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies. Findings In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast. Conclusions The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism. PMID:24460656

  17. TCW: Transcriptome Computational Workbench

    PubMed Central

    Soderlund, Carol; Nelson, William; Willer, Mark; Gang, David R.

    2013-01-01

    Background The analysis of transcriptome data involves many steps and various programs, along with organization of large amounts of data and results. Without a methodical approach for storage, analysis and query, the resulting ad hoc analysis can lead to human error, loss of data and results, inefficient use of time, and lack of verifiability, repeatability, and extensibility. Methodology The Transcriptome Computational Workbench (TCW) provides Java graphical interfaces for methodical analysis for both single and comparative transcriptome data without the use of a reference genome (e.g. for non-model organisms). The singleTCW interface steps the user through importing transcript sequences (e.g. Illumina) or assembling long sequences (e.g. Sanger, 454, transcripts), annotating the sequences, and performing differential expression analysis using published statistical programs in R. The data, metadata, and results are stored in a MySQL database. The multiTCW interface builds a comparison database by importing sequence and annotation from one or more single TCW databases, executes the ESTscan program to translate the sequences into proteins, and then incorporates one or more clusterings, where the clustering options are to execute the orthoMCL program, compute transitive closure, or import clusters. Both singleTCW and multiTCW allow extensive query and display of the results, where singleTCW displays the alignment of annotation hits to transcript sequences, and multiTCW displays multiple transcript alignments with MUSCLE or pairwise alignments. The query programs can be executed on the desktop for fastest analysis, or from the web for sharing the results. Conclusion It is now affordable to buy a multi-processor machine, and easy to install Java and MySQL. By simply downloading the TCW, the user can interactively analyze, query and view their data. The TCW allows in-depth data mining of the results, which can lead to a better understanding of the transcriptome. 
TCW is freely available from www.agcol.arizona.edu/software/tcw. PMID:23874959

  18. A novel strategy of integrated microarray analysis identifies CENPA, CDK1 and CDC20 as a cluster of diagnostic biomarkers in lung adenocarcinoma.

    PubMed

    Liu, Wan-Ting; Wang, Yang; Zhang, Jing; Ye, Fei; Huang, Xiao-Hui; Li, Bin; He, Qing-Yu

    2018-07-01

    Lung adenocarcinoma (LAC) is the most lethal cancer and the leading cause of cancer-related death worldwide. The identification of meaningful clusters of co-expressed genes or representative biomarkers may help improve the accuracy of LAC diagnoses. Public databases, such as the Gene Expression Omnibus (GEO), provide rich resources of valuable information for clinics; however, the integration of multiple microarray datasets from various platforms and institutes remains a challenge. To determine potential indicators of LAC, we performed genome-wide relative significance (GWRS), genome-wide global significance (GWGS) and support vector machine (SVM) analyses progressively to identify robust gene biomarker signatures from 5 different microarray datasets that included 330 samples. The top 200 genes with robust signatures were selected for integrative analysis according to "guilt-by-association" methods, including protein-protein interaction (PPI) analysis and gene co-expression analysis. Of these 200 genes, only 10 showed both an intensive PPI network and a high gene co-expression correlation (r > 0.8). IPA analysis of this regulatory network suggested that the cell cycle process is a crucial determinant of LAC. CENPA, as well as two linked hub genes, CDK1 and CDC20, were determined to be potential indicators of LAC. Immunohistochemical staining showed that CENPA, CDK1 and CDC20 were highly expressed in LAC cancer tissue with co-expression patterns. A Cox regression model indicated that LAC patients with CENPA+/CDK1+ and CENPA+/CDC20+ were high-risk groups in terms of overall survival. In conclusion, our integrated microarray analysis demonstrated that CENPA, CDK1 and CDC20 might serve as a novel cluster of prognostic biomarkers for LAC, and the cooperative unit of three genes provides a technically simple approach for identification of LAC patients. Copyright © 2018 Elsevier B.V. All rights reserved.

  19. Carbon-dependent control of electron transfer and central carbon pathway genes for methane biosynthesis in the Archaean, Methanosarcina acetivorans strain C2A

    PubMed Central

    2010-01-01

    Background The archaeon Methanosarcina acetivorans strain C2A forms methane, a potent greenhouse gas, from a variety of one-carbon substrates and acetate. Whereas the biochemical pathways leading to methane formation are well understood, little is known about the expression of many of the genes that encode proteins needed for carbon flow, electron transfer and/or energy conservation. Quantitative transcript analysis was performed on twenty gene clusters encompassing over one hundred genes in M. acetivorans that encode enzymes/proteins with known or potential roles in substrate conversion to methane. Results The expression of many seemingly "redundant" genes/gene clusters establishes substrate-dependent control of approximately seventy genes for methane production by the pathways for methanol and acetate utilization. These include genes for soluble-type and membrane-type heterodisulfide reductases (hdr), hydrogenases including genes for a vht-type F420 non-reducing hydrogenase, molybdenum-type (fmd) as well as tungsten-type (fwd) formylmethanofuran dehydrogenases, genes for rnf and mrp-type electron transfer complexes, for acetate uptake, plus multiple genes for aha- and atp-type ATP synthesis complexes. Analysis of promoters for seven gene clusters reveals UTR leaders of 51-137 nucleotides in length, raising the possibility of both transcriptional and translational levels of control. Conclusions The above findings establish the differential and coordinated expression of two major gene families in M. acetivorans in response to carbon/energy supply. Furthermore, the quantitative mRNA measurements demonstrate the dynamic range for modulating transcript abundance. Since many of these gene clusters in M. acetivorans are also present in other Methanosarcina species, including M. mazei and M. barkeri, these findings provide a basis for predicting related control in these environmentally significant methanogens. PMID:20178638

  20. The extracellular Leucine-Rich Repeat superfamily; a comparative survey and analysis of evolutionary relationships and expression patterns

    PubMed Central

    Dolan, Jackie; Walshe, Karen; Alsbury, Samantha; Hokamp, Karsten; O'Keeffe, Sean; Okafuji, Tatsuya; Miller, Suzanne FC; Tear, Guy; Mitchell, Kevin J

    2007-01-01

    Background Leucine-rich repeats (LRRs) are highly versatile and evolvable protein-ligand interaction motifs found in a large number of proteins with diverse functions, including innate immunity and nervous system development. Here we catalogue all of the extracellular LRR (eLRR) proteins in worms, flies, mice and humans. We use convergent evidence from several transmembrane-prediction and motif-detection programs, including a customised algorithm, LRRscan, to identify eLRR proteins, and a hierarchical clustering method based on TribeMCL to establish their evolutionary relationships. Results This yields a total of 369 proteins (29 in worm, 66 in fly, 135 in mouse and 139 in human), many of them of unknown function. We group eLRR proteins into several classes: those with only LRRs, those that cluster with Toll-like receptors (Tlrs), those with immunoglobulin or fibronectin-type 3 (FN3) domains and those with some other domain. These groups show differential patterns of expansion and diversification across species. Our analyses reveal several clusters of novel genes, including two Elfn genes, encoding transmembrane proteins with eLRRs and an FN3 domain, and six genes encoding transmembrane proteins with eLRRs only (the Elron cluster). Many of these are expressed in discrete patterns in the developing mouse brain, notably in the thalamus and cortex. We have also identified a number of novel fly eLRR proteins with discrete expression in the embryonic nervous system. Conclusion This study provides the necessary foundation for a systematic analysis of the functions of this class of genes, which are likely to include prominently innate immunity, inflammation and neural development, especially the specification of neuronal connectivity. PMID:17868438

  1. Collective Emotions Online and Their Influence on Community Life

    PubMed Central

    Chmiel, Anna; Sienkiewicz, Julian; Thelwall, Mike; Paltoglou, Georgios; Buckley, Kevan; Kappas, Arvid; Hołyst, Janusz A.

    2011-01-01

    Background E-communities, social groups interacting online, have recently become an object of interdisciplinary research. As with face-to-face meetings, Internet exchanges may not only include factual information but also emotional information – how participants feel about the subject discussed or other group members. Emotions in turn are known to be important in affecting interaction partners in offline communication in many ways. Could emotions in Internet exchanges affect others and systematically influence quantitative and qualitative aspects of the trajectory of e-communities? The development of automatic sentiment analysis has made large scale emotion detection and analysis possible using text messages collected from the web. However, it is not clear if emotions in e-communities primarily derive from individual group members' personalities or if they result from intra-group interactions, and whether they influence group activities. Methodology/Principal Findings Here, for the first time, we show the collective character of affective phenomena on a large scale as observed in four million posts downloaded from Blogs, Digg and BBC forums. To test whether the emotions of a community member may influence the emotions of others, posts were grouped into clusters of messages with similar emotional valences. The frequency of long clusters was much higher than it would be if emotions occurred at random. Distributions for cluster lengths can be explained by preferential processes because conditional probabilities for consecutive messages grow as a power law with cluster length. For BBC forum threads, average discussion lengths were higher for larger values of absolute average emotional valence in the first ten comments and the average amount of emotion in messages fell during discussions. 
Conclusions/Significance Overall, our results prove that collective emotional states can be created and modulated via Internet communication and that emotional expressiveness is the fuel that sustains some e-communities. PMID:21818302
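
    The grouping of posts into clusters of consecutive messages with the same valence can be sketched as follows. This is only an illustration under stated assumptions: valences are coded as -1, 0 and 1, the function names are hypothetical, and the estimator below is a simple empirical ratio, not necessarily the one the authors used.

```python
from collections import Counter
from itertools import groupby

def cluster_lengths(valences):
    """Lengths of runs of consecutive messages sharing the same valence."""
    return [len(list(run)) for _, run in groupby(valences)]

def continuation_probability(lengths):
    """Estimate P(a run reaches length n + 1 | it reached length n) for each n."""
    hist = Counter(lengths)
    max_len = max(hist)
    at_least = {}  # at_least[n] = number of runs of length >= n
    running = 0
    for n in range(max_len, 0, -1):
        running += hist.get(n, 0)
        at_least[n] = running
    return {n: at_least.get(n + 1, 0) / at_least[n]
            for n in range(1, max_len + 1)}

# Hypothetical thread: -1 negative, 0 neutral, 1 positive.
thread = [-1, -1, -1, 0, 1, 1, -1, -1, -1, -1]
lengths = cluster_lengths(thread)
print(lengths)
print(continuation_probability(lengths))
```

    If emotions occurred at random, the continuation probability would not depend on the current run length; a probability that grows with run length is the kind of signal the abstract describes as a preferential process.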

  2. The influence of temperament and character profiles on specialty choice and well-being in medical residents

    PubMed Central

    Sievert, Martin; Zwir, Igor; Cloninger, Kevin M.; Lester, Nigel; Rozsa, Sandor

    2016-01-01

    Background Multiple factors influence the decision to enter a career in medicine and choose a specialty. Previous studies have looked at personality differences in medicine but often were unable to describe the heterogeneity that exists within each specialty. Our study used a person-centered approach to characterize the complex relations between the personality profiles of resident physicians and their choice of specialty. Methods 169 resident physicians at a large Midwestern US training hospital completed the Temperament and Character Inventory (TCI) and the Satisfaction with Life Scale (SWLS). Clusters of personality profiles were identified without regard to medical specialty, and then the personality clusters were tested for association with their choice of specialty by co-clustering analysis. Life satisfaction was tested for association with personality traits and medical specialty by linear regression and analysis of variance. Results We identified five clusters of people with distinct personality profiles, and found that these were associated with particular medical specialties. Physicians with an “investigative” personality profile often chose pathology or internal medicine, those with a “commanding” personality often chose general surgery, “rescuers” often chose emergency medicine, the “dependable” often chose pediatrics, and the “compassionate” often chose psychiatry. Life satisfaction scores were not enhanced by personality-specialty congruence, but were related strongly to self-directedness regardless of specialty. Conclusions The personality profiles of physicians were strongly associated with their medical specialty choices. Nevertheless, the relationships were complex: physicians with each personality profile went into a variety of medical specialties, and physicians in each medical specialty had variable personality profiles.
The plasticity and resilience of physicians were more important for their life satisfaction than was matching personality to the prototype of a particular specialty. PMID:27651982

  3. The characteristics of depressive symptoms in medical students during medical education and training: a cross-sectional study

    PubMed Central

    Baldassin, Sergio; Alves, Tânia Correa de Toledo Ferraz; de Andrade, Arthur Guerra; Nogueira Martins, Luiz Antonio

    2008-01-01

    Background Medical education and training can contribute to the development of depressive symptoms that might lead to possible academic and professional consequences. We aimed to investigate the characteristics of depressive symptoms among 481 medical students (79.8% of the total who matriculated). Methods The Beck Depression Inventory (BDI) and cluster analyses were used in order to better describe the characteristics of depressive symptoms. Medical education and training in Brazil is divided into basic (1st and 2nd years), intermediate (3rd and 4th years), and internship (5th and 6th years) periods. The study organized each item from the BDI into the following three clusters: affective, cognitive, and somatic. Statistical analyses were performed using analysis of variance (ANOVA) with post-hoc Tukey tests corrected for multiple comparisons. Results There were 184 (38.2%) students with depressive symptoms (BDI > 9). The internship period resulted in the highest BDI scores in comparison to both the basic (p < .001) and intermediate (p < .001) periods. Affective, cognitive, and somatic cluster scores were significantly higher in the internship period. An exploratory analysis of possible risk factors showed that being female (p = .020), not having a parent who practiced medicine (p = .016), and the internship period (p = .001) were factors for the development of depressive symptoms. Conclusion There is a high prevalence of depressive symptoms among medical students, particularly among females, those at the internship level, and those not having a parent who practiced medicine, mainly involving the somatic and affective clusters. The active assessment of these students in evaluating their depressive symptoms is important in order to prevent the development of co-morbidities and suicide risk. PMID:19077227

  4. An efficient and scalable graph modeling approach for capturing information at different levels in next generation sequencing reads

    PubMed Central

    2013-01-01

    Background Next generation sequencing technologies have greatly advanced many research areas of the biomedical sciences through their capability to generate massive amounts of genetic information at unprecedented rates. The advent of next generation sequencing has led to the development of numerous computational tools to analyze and assemble the millions to billions of short sequencing reads produced by these technologies. While these tools filled an important gap, current approaches for storing, processing, and analyzing short read datasets generally have remained simple and lack the complexity needed to efficiently model the produced reads and assemble them correctly. Results Previously, we presented an overlap graph coarsening scheme for modeling read overlap relationships on multiple levels. Most current read assembly and analysis approaches use a single graph or set of clusters to represent the relationships among a read dataset. Instead, we use a series of graphs to represent the reads and their overlap relationships across a spectrum of information granularity. At each information level our algorithm is capable of generating clusters of reads from the reduced graph, forming an integrated graph modeling and clustering approach for read analysis and assembly. Previously we applied our algorithm to simulated and real 454 datasets to assess its ability to efficiently model and cluster next generation sequencing data. In this paper we extend our algorithm to large simulated and real Illumina datasets to demonstrate that our algorithm is practical for both sequencing technologies. Conclusions Our overlap graph theoretic algorithm is able to model next generation sequencing reads at various levels of granularity through the process of graph coarsening. Additionally, our model allows for efficient representation of the read overlap relationships, is scalable for large datasets, and is practical for both Illumina and 454 sequencing technologies. PMID:24564333

  5. Transcriptome database resource and gene expression atlas for the rose

    PubMed Central

    2012-01-01

    Background For centuries roses have been selected based on a number of traits. Little information exists on the genetic and molecular basis that contributes to these traits, mainly because information on expressed genes for this economically important ornamental plant is scarce. Results Here, we used a combination of Illumina and 454 sequencing technologies to generate information on Rosa sp. transcripts using RNA from various tissues and in response to biotic and abiotic stresses. A total of 80714 transcript clusters were identified and 76611 peptides were predicted, of which 20997 were clustered into 13900 protein families. BLASTp hits in closely related Rosaceae species revealed that about half of the predicted peptides in the strawberry and peach genomes have orthologs in the Rosa dataset. Digital expression was obtained using RNA samples from organs at different development stages and under different stress conditions. qPCR validated the digital expression data for a selection of 23 genes with high or low expression levels. Comparative gene expression analyses between the different tissues and organs allowed the identification of clusters that are highly enriched in given tissues or under particular conditions, demonstrating the usefulness of the digital gene expression analysis. A web interface, ROSAseq, was created that allows data interrogation by BLAST, subsequent analysis of DNA clusters and access to thorough transcript annotation including best BLAST matches on Fragaria vesca, Prunus persica and Arabidopsis. The rose peptides dataset was used to create the ROSAcyc resource pathway database that allows access to the putative genes and enzymatic pathways. Conclusions The study provides useful information on Rosa expressed genes, with thorough annotation and an overview of expression patterns for transcripts with good accuracy. PMID:23164410

  6. [Molecular epidemiologic study on Mycobacterium tuberculosis from drug resistance monitoring sites of Guangdong Province, 2015].

    PubMed

    Huang, X C; Guo, H X; Wu, Z H; Guo, C X; Wei, W J; Li, H C; Sun, Q; Zhang, C C; Li, Z Y; Chen, T; Zhong, Q; Zhou, L

    2017-05-12

    Objective: To understand the epidemiological characteristics and distribution of Mycobacterium tuberculosis (MTB) in Guangdong Province, and to explore the risk factors associated with drug resistance. Methods: A total of 225 clinical strains of MTB collected from 5 drug resistance monitoring sites of Guangdong Province in 2015 were tested by the Regions of Difference 105 (RD105) deletion test, and 15 mycobacterial interspersed repetitive unit (MIRU) loci were used for genotyping. Gene clustering was analyzed using BioNumerics 7.6. Drug susceptibility was tested by the proportion method. Statistical analysis used the chi-square test and multivariate logistic regression. Results: Of the 225 strains, 158 (70.2%) belonged to the Beijing family. The Hunter-Gaston index varied among the MIRU loci. The MTB strains from Guangdong Province were categorized into 2 gene clusters by clustering analysis, in which the clustering rate of complex Ⅰ was significantly higher than that of complex Ⅱ (χ² = 9.331, P = 0.020). Multivariate logistic regression showed that QUB11b was associated with resistance to rifampicin and isoniazid (P = 0.013 and 0.012, respectively), ETR F with resistance to isoniazid, streptomycin, ethambutol and ofloxacin (P = 0.039, 0.040, 0.023 and 0.003, respectively), Mtub21 with resistance to capreomycin (P = 0.040), and QUB26 with resistance to ethionamide (P = 0.047). Conclusions: The MTB strains from Guangdong Province were genetically polymorphic and the distribution of strains was stable. QUB11b, ETR F, Mtub21 and QUB26 could be biomarkers for predicting drug resistance.

  7. From Points to Patterns - Functional Relations between Groundwater Connectivity and Catchment-scale Streamflow Response

    NASA Astrophysics Data System (ADS)

    Rinderer, M.; McGlynn, B. L.; van Meerveld, I. H. J.

    2016-12-01

    Groundwater measurements can help us to improve our understanding of runoff generation at the catchment-scale but typically only provide point-scale data. These measurements, therefore, need to be interpolated or upscaled in order to obtain information about catchment scale groundwater dynamics. Our approach used data from 51 spatially distributed groundwater monitoring sites in a Swiss pre-alpine catchment and time series clustering to define six groundwater response clusters. Each of the clusters was characterized by distinctly different site characteristics (i.e., Topographic Wetness Index and curvature), which allowed us to assign all unmonitored locations to one of these clusters. Time series modeling and the definition of response thresholds (i.e., the depth of more transmissive soil layers) allowed us to derive maps of the spatial distribution of active (i.e., responding) locations across the catchment at 15 min time intervals. Connectivity between all active locations and the stream network was determined using a graph theory approach. The extent of the active and connected areas differed during events and suggests that not all active locations directly contributed to streamflow. Gate keeper sites prevented connectivity of upslope locations to the channel network. Streamflow dynamics at the catchment outlet were correlated to catchment average connectivity dynamics. In a sensitivity analysis we tested six different groundwater levels for a site to be considered "active", which showed that the definition of the threshold did not significantly influence the conclusions drawn from our analysis. This study is the first one to derive patterns of groundwater dynamics based on empirical data (rather than interpolation) and provides insight into the spatio-temporal evolution of the active and connected runoff source areas at the catchment-scale that is critical to understanding the dynamics of water quantity and quality in streams.
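
    The graph-theory step, deciding which active locations are actually connected to the stream, can be sketched with a breadth-first search. This is a minimal illustration: the site names, the neighbour map and the function name are hypothetical, and the study's own connectivity rules (for example flow direction) may differ.

```python
from collections import deque

def connected_active_sites(neighbors, active, stream_sites):
    """Return the active sites linked to the stream through a chain of active sites."""
    queue = deque(s for s in stream_sites if s in active)
    connected = set(queue)
    while queue:
        site = queue.popleft()
        for nxt in neighbors.get(site, ()):
            if nxt in active and nxt not in connected:
                connected.add(nxt)
                queue.append(nxt)
    return connected

# Hypothetical hillslope transect: A borders the stream, D is furthest upslope.
neighbors = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C"]}

# B is inactive, so it acts as a gate keeper: C and D are active but not connected.
print(sorted(connected_active_sites(neighbors, {"A", "C", "D"}, ["A"])))
# Once B responds, the whole transect is connected to the stream.
print(sorted(connected_active_sites(neighbors, {"A", "B", "C", "D"}, ["A"])))
```

    Running the search at every time step on the map of active locations gives a time series of connected area that can be compared with streamflow at the outlet.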

  8. Simultaneous clustering of gene expression data with clinical chemistry and pathological evaluations reveals phenotypic prototypes

    PubMed Central

    Bushel, Pierre R; Wolfinger, Russell D; Gibson, Greg

    2007-01-01

    Background Commonly employed clustering methods for analysis of gene expression data do not directly incorporate phenotypic data about the samples. Furthermore, clustering of samples with known phenotypes is typically performed in an informal fashion. The inability of clustering algorithms to incorporate biological data in the grouping process can limit proper interpretation of the data and its underlying biology. Results We present a more formal approach, the modk-prototypes algorithm, for clustering biological samples based on simultaneously considering microarray gene expression data and classes of known phenotypic variables such as clinical chemistry evaluations and histopathologic observations. The strategy involves constructing an objective function with the sum of the squared Euclidean distances for numeric microarray and clinical chemistry data and simple matching for histopathology categorical values in order to measure dissimilarity of the samples. Separate weighting terms are used for microarray, clinical chemistry and histopathology measurements to control the influence of each data domain on the clustering of the samples. The dynamic validity index for numeric data was modified with a category utility measure for determining the number of clusters in the data sets. A cluster's prototype, formed from the mean of the values for numeric features and the mode of the categorical values of all the samples in the group, is representative of the phenotype of the cluster members. The approach is shown to work well with a simulated mixed data set and two real data examples containing numeric and categorical data types: one from a heart disease study and another from acetaminophen (an analgesic) exposure in rat liver that causes centrilobular necrosis.
Conclusion The modk-prototypes algorithm partitioned the simulated data into clusters with samples in their respective class group and the heart disease samples into two groups (sick and buff denoting samples having pain type representative of angina and non-angina respectively) with an accuracy of 79%. This is on par with, or better than, the assignment accuracy of the heart disease samples by several well-known and successful clustering algorithms. Following modk-prototypes clustering of the acetaminophen-exposed samples, informative genes from the cluster prototypes were identified that are descriptive of, and phenotypically anchored to, levels of necrosis of the centrilobular region of the rat liver. The biological processes cell growth and/or maintenance, amine metabolism, and stress response were shown to discern between no and moderate levels of acetaminophen-induced centrilobular necrosis. The use of well-known and traditional measurements directly in the clustering provides some guarantee that the resulting clusters will be meaningfully interpretable. PMID:17408499
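
    The dissimilarity and prototype described in the Results can be sketched as below. This is a minimal sketch of a weighted mixed-type measure, not the published modk-prototypes implementation; the dictionary layout, field names, weights and sample values are all hypothetical.

```python
from statistics import mean, mode

def squared_euclidean(a, b):
    """Sum of squared differences between two numeric vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mixed_dissimilarity(sample, proto, w_expr=1.0, w_chem=1.0, w_histo=1.0):
    """Weighted squared Euclidean distance on the numeric domains plus
    simple matching (count of mismatches) on the categorical domain."""
    mismatches = sum(1 for x, y in zip(sample["histo"], proto["histo"]) if x != y)
    return (w_expr * squared_euclidean(sample["expr"], proto["expr"])
            + w_chem * squared_euclidean(sample["chem"], proto["chem"])
            + w_histo * mismatches)

def prototype(samples):
    """Cluster prototype: mean of each numeric feature, mode of each categorical one."""
    return {
        "expr": [mean(col) for col in zip(*(s["expr"] for s in samples))],
        "chem": [mean(col) for col in zip(*(s["chem"] for s in samples))],
        "histo": [mode(col) for col in zip(*(s["histo"] for s in samples))],
    }

# Hypothetical samples: two expression values, one chemistry value, two pathology calls.
s1 = {"expr": [1.0, 2.0], "chem": [10.0], "histo": ["necrosis", "none"]}
s2 = {"expr": [3.0, 2.0], "chem": [14.0], "histo": ["necrosis", "mild"]}
s3 = {"expr": [2.0, 5.0], "chem": [12.0], "histo": ["none", "mild"]}

proto = prototype([s1, s2, s3])
print(proto)
print(mixed_dissimilarity(s1, proto, w_expr=1.0, w_chem=0.5, w_histo=2.0))
```

    Raising one weight relative to the others lets that data domain dominate the assignment of samples to prototypes, which is the role the abstract gives to the separate weighting terms.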

  9. Genetic structure of Cantharellus formosus populations in a second-growth temperate rain forest of the Pacific Northwest

    USGS Publications Warehouse

    Redman, Regina S.; Ranson, Judith; Rodriguez, Rusty J.

    2006-01-01

    Cantharellus formosus growing on the Olympic Peninsula of the Pacific Northwest was sampled from September – November 1995 for genetic analysis. A total of ninety-six basidiomes from five clusters separated from one another by 3 - 25 meters were genetically characterized by PCR analysis of 13 arbitrary loci and rDNA sequences. The number of basidiomes in each cluster varied from 15 to 25 and genetic analysis delineated 15 genets among the clusters. Analysis of variance utilizing thirteen apPCR generated genetic molecular markers and PCR amplification of the ribosomal ITS regions indicated that 81.41% of the genetic variation occurred between clusters and 18.59% within clusters. Proximity of the basidiomes within a cluster was not an indicator of genotypic similarity. The molecular profiles of each cluster were distinct and defined as unique populations containing 2 - 6 genets. The monitoring and analysis of this species through non-lethal sampling and future applications is discussed.

  10. Is the non-isothermal double β-model incompatible with no time evolution of galaxy cluster gas mass fraction?

    NASA Astrophysics Data System (ADS)

    Holanda, R. F. L.

    2018-05-01

    In this paper, we propose a new method to obtain the depletion factor γ(z), the ratio by which the measured baryon fraction in galaxy clusters is depleted with respect to the universal mean. We use exclusively galaxy cluster data, namely, X-ray gas mass fraction (fgas) and angular diameter distance measurements from Sunyaev-Zel'dovich effect plus X-ray observations. The galaxy clusters are the same in both data sets, and the non-isothermal spherical double β-model was used to describe their electron density and temperature profiles. In order to compare our results with those from recent cosmological hydrodynamical simulations, we suppose a possible time evolution for γ(z), such as γ(z) = γ0(1 + γ1 z). As main conclusions, we found that the γ0 value is in full agreement with the simulations. On the other hand, although the γ1 value found in our analysis is compatible with γ1 = 0 within 2σ c.l., our results show a non-negligible time evolution for the depletion factor, unlike the results of the simulations. However, we also put constraints on γ(z) by using the fgas measurements and angular diameter distances obtained from the flat ΛCDM model (Planck results) and from a sample of galaxy clusters described by an elliptical profile. For these cases no significant time evolution for γ(z) was found. Then, if a constant depletion factor is an inherent characteristic of these structures, our results show that the spherical double β-model used to describe the galaxy clusters considered does not affect the quality of their fgas measurements.
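
    For reference, the depletion factor and its assumed evolution can be written out explicitly. The symbols f_b (cluster baryon fraction) and Ω_b/Ω_M (universal mean baryon fraction) are notation assumed here for illustration; the abstract gives only the verbal definition and the linear form.

```latex
\gamma(z) \equiv \frac{f_b(z)}{\Omega_b/\Omega_M},
\qquad
\gamma(z) = \gamma_0 \left(1 + \gamma_1 z\right)
```

    In this parameterization γ1 = 0 corresponds to a depletion factor that does not evolve with redshift, and γ0 is its value at z = 0.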

  11. [Achene morphology cluster analysis of Taraxacum F. H. Wigg. from northeast China and molecule systematics evidence determined by SRAP].

    PubMed

    Li, Hai-juan; Zhao, Xin; Jia, Qing-fei; Li, Tian-lai; Ning, Wei

    2012-08-01

    The morphological and micro-morphological characteristics of the achenes of six Taraxacum species from northeastern China were examined, together with SRAP cluster analysis, to provide evidence for their classification. The achenes were observed by microscope and EPMA. Cluster analysis was based on the size, shape, cone proportion, color and surface sculpture of the achenes. Achene morphology differed clearly between Taraxacum species, particularly in spinule distribution and size, achene color and achene size. According to the achene morphology cluster analysis, T. antungense Kitag. and T. urbanum Kitag. should be merged into a single species. The results of the achene morphology cluster analysis and of the SRAP-based molecular systematics clustering were consistent with the group divisions given in the "Chinese flora". Achene morphology in Taraxacum is stable and conservative, so interspecific delimitation and kinship analysis can be based on differences in combinations of achene characters; both the achene morphology cluster analysis and the SRAP-based molecular systematics support the dandelion classification of the "Chinese flora".

  12. Exploratory Item Classification Via Spectral Graph Clustering

    PubMed Central

    Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen; Xu, Gongjun; Ying, Zhiliang

    2017-01-01

    Large-scale assessments are supported by a large item pool. An important task in test development is to assign items into scales that measure different characteristics of individuals, and a popular approach is cluster analysis of items. Classical methods in cluster analysis, such as the hierarchical clustering, K-means method, and latent-class analysis, often induce a high computational overhead and have difficulty handling missing data, especially in the presence of high-dimensional responses. In this article, the authors propose a spectral clustering algorithm for exploratory item cluster analysis. The method is computationally efficient, effective for data with missing or incomplete responses, easy to implement, and often outperforms traditional clustering algorithms in the context of high dimensionality. The spectral clustering algorithm is based on graph theory, a branch of mathematics that studies the properties of graphs. The algorithm first constructs a graph of items, characterizing the similarity structure among items. It then extracts item clusters based on the graphical structure, grouping similar items together. The proposed method is evaluated through simulations and an application to the revised Eysenck Personality Questionnaire. PMID:29033476
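
    The two steps named above, building a similarity graph of items and then extracting clusters from its structure, can be sketched with a generic spectral bipartition. This is not the authors' algorithm (which also handles missing responses); it is a plain-Python illustration that splits items by the sign of the Fiedler vector of the graph Laplacian, found by power iteration. The similarity matrix and function name are hypothetical.

```python
def fiedler_partition(w, iters=500):
    """Split a weighted graph in two using the sign of the Fiedler vector.

    w is a symmetric similarity matrix. With L = D - W, power iteration on
    (c*I - L) while projecting out the constant vector converges to the
    eigenvector of the second-smallest eigenvalue of L."""
    n = len(w)
    deg = [sum(row) for row in w]
    c = 2.0 * max(deg)  # upper bound on the eigenvalues of L (Gershgorin)
    v = [i - (n - 1) / 2.0 for i in range(n)]  # deterministic, zero-mean start
    for _ in range(iters):
        nxt = [(c - deg[i]) * v[i] + sum(w[i][j] * v[j] for j in range(n))
               for i in range(n)]
        m = sum(nxt) / n
        nxt = [x - m for x in nxt]  # remove the constant component
        norm = sum(x * x for x in nxt) ** 0.5
        v = [x / norm for x in nxt]
    return [0 if x < 0 else 1 for x in v]

# Hypothetical item similarities: items 0-2 and items 3-5 form two scales.
strong, weak = 1.0, 0.1
w = [[0.0 if i == j else (strong if (i < 3) == (j < 3) else weak)
      for j in range(6)] for i in range(6)]
print(fiedler_partition(w))
```

    For more than two scales, a common extension is to take several leading eigenvectors and run k-means on the resulting coordinates; a numerical library would normally replace the hand-written power iteration.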

  13. Sulfur in Cometary Dust

    NASA Technical Reports Server (NTRS)

    Fomenkova, M. N.

    1997-01-01

    The computer-intensive project consisted of the analysis and synthesis of existing data on composition of comet Halley dust particles. The main objective was to obtain a complete inventory of sulfur containing compounds in the comet Halley dust by building upon the existing classification of organic and inorganic compounds and applying a variety of statistical techniques for cluster and cross-correlational analyses. A student hired for this project wrote and tested the software to perform cluster analysis. The following tasks were carried out: (1) selecting the data from existing database for the proposed project; (2) finding access to a standard library of statistical routines for cluster analysis; (3) reformatting the data as necessary for input into the library routines; (4) performing cluster analysis and constructing hierarchical cluster trees using three methods to define the proximity of clusters; (5) presenting the output results in different formats to facilitate the interpretation of the obtained cluster trees; (6) selecting groups of data points common for all three trees as stable clusters. We have also considered the chemistry of sulfur in inorganic compounds.
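
    Tasks (4) and (6) above can be sketched as follows. The abstract does not name the three proximity definitions, so this sketch assumes single, complete and average linkage; the toy points, the cut at k clusters, and the helper names `agglomerate` and `stable_clusters` are likewise hypothetical.

```python
from itertools import combinations


def euclidean(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def agglomerate(points, k, linkage):
    """Merge clusters bottom-up until k remain; return a set of frozensets of indices."""
    clusters = [[i] for i in range(len(points))]

    def proximity(c1, c2):
        dists = [euclidean(points[i], points[j]) for i in c1 for j in c2]
        if linkage == "single":
            return min(dists)
        if linkage == "complete":
            return max(dists)
        return sum(dists) / len(dists)  # any other value is treated as average linkage

    while len(clusters) > k:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: proximity(clusters[pair[0]], clusters[pair[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]  # a < b, so deleting b is safe
        del clusters[b]
    return {frozenset(c) for c in clusters}


def stable_clusters(points, k, linkages=("single", "complete", "average")):
    """Keep only the groups that appear in the partition from every linkage."""
    partitions = [agglomerate(points, k, method) for method in linkages]
    return set.intersection(*partitions)


points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (5.0, 5.0), (5.1, 4.9), (9.0, 0.0)]
stable = stable_clusters(points, k=3)
```

    Groups that survive the intersection are those on which all three proximity definitions agree, which is one simple reading of "stable clusters".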

  14. Performance analysis of clustering techniques over microarray data: A case study

    NASA Astrophysics Data System (ADS)

    Dash, Rasmita; Misra, Bijan Bihari

    2018-03-01

    Handling big data is one of the major issues in statistical data analysis, and cluster analysis plays a vital role in dealing with large-scale data. Many clustering techniques exist, each with a different approach, but it is difficult to predict which approach suits a particular dataset. To deal with this problem, a grading approach is applied across many clustering techniques to identify a stable technique. Because the grading depends on the characteristics of the dataset as well as on the validity indices, a two-stage grading approach is implemented. In this study the grading approach is applied to five clustering techniques: hybrid swarm based clustering (HSC), k-means, partitioning around medoids (PAM), vector quantization (VQ) and agglomerative nesting (AGNES). The experiments are conducted over five microarray datasets with seven validity indices. The finding of the grading approach that a clustering technique is significant is also confirmed by the Nemenyi post-hoc hypothesis test.

  15. A tale of two cities: The role of neighborhood socioeconomic status in spatial clustering of bystander CPR in Austin and Houston

    PubMed Central

    Root, Elisabeth Dowling; Gonzales, Louis; Persse, David E.; Hinchey, Paul R.; McNally, Bryan; Sasson, Comilla

    2013-01-01

    Background Despite evidence to suggest significant spatial variation in out-of-hospital cardiac arrest (OHCA) and bystander cardiopulmonary resuscitation (BCPR) rates, geographic information systems (GIS) and spatial analysis have not been widely used to understand the reasons behind this variation. This study employs spatial statistics to identify the location and extent of clusters of bystander CPR in Houston and Travis County, TX. Methods Data were extracted from the Cardiac Arrest Registry to Enhance Survival for two U.S. sites –Austin-Travis County EMS and the Houston Fire Department – between October 1, 2006 and December 31, 2009. Hierarchical logistic regression models were used to assess the relationship between income and racial/ethnic composition of a neighborhood and BCPR for OHCA and to adjust expected counts of BCPR for spatial cluster analysis. The spatial scan statistic was used to find the geographic extent of clusters of high and low BCPR. Results Results indicate spatial clusters of lower than expected BCPR rates in Houston. Compared to BCPR rates in the rest of the community, there was a circular area of 4.2 km radius where BCPR rates were lower than expected (RR = 0.62; p < 0.0001 and RR = 0.55; p = 0.037) which persist when adjusted for individual-level patient characteristics (RR = 0.34; p = 0.027) and neighborhood-level race (RR = 0.34; p = 0.034) and household income (RR = 0.34; p = 0.046). We also find a spatial cluster of higher than expected BCPR in Austin. Compared to the rest of the community, there was a 23.8 km radius area where BCPR rates were higher than expected (RR = 1.75; p = 0.07) which disappears after controlling for individual-level characteristics. Conclusions A geographically targeted CPR training strategy which is tailored to individual and neighborhood population characteristics may be effective in reducing existing disparities in the provision of bystander CPR for out-of-hospital cardiac arrest. PMID:23318916

  16. Clusters of Insomnia Disorder: An Exploratory Cluster Analysis of Objective Sleep Parameters Reveals Differences in Neurocognitive Functioning, Quantitative EEG, and Heart Rate Variability.

    PubMed

    Miller, Christopher B; Bartlett, Delwyn J; Mullins, Anna E; Dodds, Kirsty L; Gordon, Christopher J; Kyle, Simon D; Kim, Jong Won; D'Rozario, Angela L; Lee, Rico S C; Comas, Maria; Marshall, Nathaniel S; Yee, Brendon J; Espie, Colin A; Grunstein, Ronald R

    2016-11-01

    To empirically derive and evaluate potential clusters of Insomnia Disorder through cluster analysis from polysomnography (PSG). We hypothesized that clusters would differ on neurocognitive performance, sleep-onset measures of quantitative (q)-EEG and heart rate variability (HRV). Research volunteers with Insomnia Disorder (DSM-5) completed a neurocognitive assessment and overnight PSG measures of total sleep time (TST), wake time after sleep onset (WASO), and sleep onset latency (SOL) were used to determine clusters. From 96 volunteers with Insomnia Disorder, cluster analysis derived at least two clusters from objective sleep parameters: Insomnia with normal objective sleep duration (I-NSD: n = 53) and Insomnia with short sleep duration (I-SSD: n = 43). At sleep onset, differences in HRV between I-NSD and I-SSD clusters suggest attenuated parasympathetic activity in I-SSD (P < 0.05). Preliminary work suggested three clusters by retaining the I-NSD and splitting the I-SSD cluster into two: I-SSD A (n = 29): defined by high WASO and I-SSD B (n = 14): a second I-SSD cluster with high SOL and medium WASO. The I-SSD B cluster performed worse than I-SSD A and I-NSD for sustained attention (P ≤ 0.05). In an exploratory analysis, q-EEG revealed reduced spectral power also in I-SSD B before (Delta, Alpha, Beta-1) and after sleep-onset (Beta-2) compared to I-SSD A and I-NSD (P ≤ 0.05). Two insomnia clusters derived from cluster analysis differ in sleep onset HRV. Preliminary data suggest evidence for three clusters in insomnia with differences for sustained attention and sleep-onset q-EEG. Insomnia 100 sleep study: Australia New Zealand Clinical Trials Registry (ANZCTR) identification number 12612000049875. URL: https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=347742. © 2016 Associated Professional Sleep Societies, LLC.

  17. CLUSFAVOR 5.0: hierarchical cluster and principal-component analysis of microarray-based transcriptional profiles

    PubMed Central

    Peterson, Leif E

    2002-01-01

    CLUSFAVOR (CLUSter and Factor Analysis with Varimax Orthogonal Rotation) 5.0 is a Windows-based computer program for hierarchical cluster and principal-component analysis of microarray-based transcriptional profiles. CLUSFAVOR 5.0 standardizes input data; sorts data according to gene-specific coefficient of variation, standard deviation, average and total expression, and Shannon entropy; performs hierarchical cluster analysis using nearest-neighbor, unweighted pair-group method using arithmetic averages (UPGMA), or furthest-neighbor joining methods, and Euclidean, correlation, or jack-knife distances; and performs principal-component analysis. PMID:12184816
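
    Two of the gene-specific sorting statistics named above, the coefficient of variation and Shannon entropy, can be sketched in a few lines. The abstract does not give CLUSFAVOR's exact formulas, so this sketch assumes the sample standard deviation divided by the mean for the coefficient of variation, and the entropy (in bits) of a non-negative expression profile normalised to sum to one. The gene names and values are hypothetical.

```python
import math
import statistics


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean."""
    return statistics.stdev(values) / statistics.mean(values)


def shannon_entropy(values):
    """Entropy in bits of a non-negative profile normalised to proportions."""
    total = sum(values)
    probs = [v / total for v in values if v > 0]
    return -sum(p * math.log2(p) for p in probs)


# Hypothetical expression profiles across four arrays.
profiles = {
    "geneA": [10.0, 10.0, 10.0, 10.0],  # flat profile
    "geneB": [1.0, 1.0, 1.0, 37.0],     # expression concentrated in one array
    "geneC": [5.0, 15.0, 5.0, 15.0],    # moderate variation
}
ranked_by_cv = sorted(
    profiles,
    key=lambda gene: coefficient_of_variation(profiles[gene]),
    reverse=True,
)
entropy = {gene: shannon_entropy(values) for gene, values in profiles.items()}
```

    Under these definitions a flat profile has zero coefficient of variation and maximal entropy, while a profile dominated by one array has high variation and low entropy.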

  18. DICON: interactive visual analysis of multidimensional clusters.

    PubMed

    Cao, Nan; Gotz, David; Sun, Jimeng; Qu, Huamin

    2011-12-01

    Clustering as a fundamental data analysis technique has been widely used in many analytic applications. However, it is often difficult for users to understand and evaluate multidimensional clustering results, especially the quality of clusters and their semantics. For large and complex data, high-level statistical information about the clusters is often needed for users to evaluate cluster quality while a detailed display of multidimensional attributes of the data is necessary to understand the meaning of clusters. In this paper, we introduce DICON, an icon-based cluster visualization that embeds statistical information into a multi-attribute display to facilitate cluster interpretation, evaluation, and comparison. We design a treemap-like icon to represent a multidimensional cluster, and the quality of the cluster can be conveniently evaluated with the embedded statistical information. We further develop a novel layout algorithm which can generate similar icons for similar clusters, making comparisons of clusters easier. User interaction and clutter reduction are integrated into the system to help users more effectively analyze and refine clustering results for large datasets. We demonstrate the power of DICON through a user study and a case study in the healthcare domain. Our evaluation shows the benefits of the technique, especially in support of complex multidimensional cluster analysis. © 2011 IEEE

  19. Cluster Correspondence Analysis.

    PubMed

    van de Velden, M; D'Enza, A Iodice; Palumbo, F

    2017-03-01

    A method is proposed that combines dimension reduction and cluster analysis for categorical data by simultaneously assigning individuals to clusters and optimal scaling values to categories in such a way that a single between variance maximization objective is achieved. In a unified framework, a brief review of alternative methods is provided and we show that the proposed method is equivalent to GROUPALS applied to categorical data. Performance of the methods is appraised by means of a simulation study. The results of the joint dimension reduction and clustering methods are compared with the so-called tandem approach, a sequential analysis of dimension reduction followed by cluster analysis. The tandem approach is conjectured to perform worse when variables are added that are unrelated to the cluster structure. Our simulation study confirms this conjecture. Moreover, the results of the simulation study indicate that the proposed method also consistently outperforms alternative joint dimension reduction and clustering methods.

  20. Towards Effective Clustering Techniques for the Analysis of Electric Power Grids

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hogan, Emilie A.; Cotilla Sanchez, Jose E.; Halappanavar, Mahantesh

    2013-11-30

    Clustering is an important data analysis technique with numerous applications in the analysis of electric power grids. Standard clustering techniques are oblivious to the rich structural and dynamic information available for power grids. Therefore, by exploiting the inherent topological and electrical structure in the power grid data, we propose new methods for clustering with applications to model reduction, locational marginal pricing, phasor measurement unit (PMU or synchrophasor) placement, and power system protection. We focus our attention on model reduction for analysis based on time-series information from synchrophasor measurement devices, and spectral techniques for clustering. By comparing different clustering techniques on two instances of realistic power grids we show that the solutions are related and therefore one could leverage that relationship for a computational advantage. Thus, by contrasting different clustering techniques we make a case for exploiting structure inherent in the data with implications for several domains including power systems.

  1. Are clusters of dietary patterns and cluster membership stable over time? Results of a longitudinal cluster analysis study.

    PubMed

    Walthouwer, Michel Jean Louis; Oenema, Anke; Soetens, Katja; Lechner, Lilian; de Vries, Hein

    2014-11-01

    Developing nutrition education interventions based on clusters of dietary patterns can only be done adequately when it is clear if distinctive clusters of dietary patterns can be derived and reproduced over time, if cluster membership is stable, and if it is predictable which type of people belong to a certain cluster. Hence, this study aimed to: (1) identify clusters of dietary patterns among Dutch adults, (2) test the reproducibility of these clusters and stability of cluster membership over time, and (3) identify sociodemographic predictors of cluster membership and cluster transition. This study had a longitudinal design with online measurements at baseline (N=483) and 6 months follow-up (N=379). Dietary intake was assessed with a validated food frequency questionnaire. A hierarchical cluster analysis was performed, followed by a K-means cluster analysis. Multinomial logistic regression analyses were conducted to identify the sociodemographic predictors of cluster membership and cluster transition. At baseline and follow-up, a comparable three-cluster solution was derived, distinguishing a healthy, moderately healthy, and unhealthy dietary pattern. Male and lower educated participants were significantly more likely to have a less healthy dietary pattern. Further, 251 (66.2%) participants remained in the same cluster, 45 (11.9%) participants changed to an unhealthier cluster, and 83 (21.9%) participants shifted to a healthier cluster. Men and people living alone were significantly more likely to shift toward a less healthy dietary pattern. Distinctive clusters of dietary patterns can be derived. Yet, cluster membership is unstable and only few sociodemographic factors were associated with cluster membership and cluster transition. These findings imply that clusters based on dietary intake may not be suitable as a basis for nutrition education interventions. Copyright © 2014 Elsevier Ltd. All rights reserved.
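
    The cluster-transition percentages reported above follow directly from the counts and the follow-up sample size. A short check, using only the numbers in the abstract (the variable names are hypothetical):

```python
follow_up_n = 379  # participants measured at 6 months follow-up

# Counts reported in the abstract for movement between baseline and follow-up.
transitions = {
    "same cluster": 251,
    "unhealthier cluster": 45,
    "healthier cluster": 83,
}

# The three groups should account for every follow-up participant.
assert sum(transitions.values()) == follow_up_n

shares = {
    label: round(100 * count / follow_up_n, 1)
    for label, count in transitions.items()
}
```

    The three shares reproduce the reported 66.2%, 11.9% and 21.9%, so roughly one participant in three changed cluster within six months.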

  2. X-ray and optical substructures of the DAFT/FADA survey clusters

    NASA Astrophysics Data System (ADS)

    Guennou, L.; Durret, F.; Adami, C.; Lima Neto, G. B.

    2013-04-01

    We have undertaken the DAFT/FADA survey with the double aim of setting constraints on dark energy based on weak lensing tomography and of obtaining homogeneous and high quality data for a sample of 91 massive clusters in the redshift range 0.4-0.9 for which there were HST archive data. We have analysed the XMM-Newton data available for 42 of these clusters to derive their X-ray temperatures and luminosities and search for substructures. Out of these, a spatial analysis was possible for 30 clusters, but only 23 had deep enough X-ray data for a really robust analysis. This study was coupled with a dynamical analysis for the 26 clusters having at least 30 spectroscopic galaxy redshifts in the cluster range. Altogether, the X-ray sample of 23 clusters and the optical sample of 26 clusters have 14 clusters in common. We present preliminary results on the coupled X-ray and dynamical analyses of these 14 clusters.

  3. Identifying novel phenotypes of acute heart failure using cluster analysis of clinical variables.

    PubMed

    Horiuchi, Yu; Tanimoto, Shuzou; Latif, A H M Mahbub; Urayama, Kevin Y; Aoki, Jiro; Yahagi, Kazuyuki; Okuno, Taishi; Sato, Yu; Tanaka, Tetsu; Koseki, Keita; Komiyama, Kota; Nakajima, Hiroyoshi; Hara, Kazuhiro; Tanabe, Kengo

    2018-07-01

    Acute heart failure (AHF) is a heterogeneous disease caused by various cardiovascular (CV) pathophysiology and multiple non-CV comorbidities. We aimed to identify clinically important subgroups to improve our understanding of the pathophysiology of AHF and inform clinical decision-making. We evaluated detailed clinical data of 345 consecutive AHF patients using non-hierarchical cluster analysis of 77 variables, including age, sex, HF etiology, comorbidities, physical findings, laboratory data, electrocardiogram, echocardiogram and treatment during hospitalization. Cox proportional hazards regression analysis was performed to estimate the association between the clusters and clinical outcomes. Three clusters were identified. Cluster 1 (n=108) represented "vascular failure". This cluster had the highest average systolic blood pressure at admission and lung congestion with type 2 respiratory failure. Cluster 2 (n=89) represented "cardiac and renal failure". They had the lowest ejection fraction (EF) and worst renal function. Cluster 3 (n=148) comprised mostly older patients and had the highest prevalence of atrial fibrillation and preserved EF. Death or HF hospitalization within 12-month occurred in 23% of Cluster 1, 36% of Cluster 2 and 36% of Cluster 3 (p=0.034). Compared with Cluster 1, risk of death or HF hospitalization was 1.74 (95% CI, 1.03-2.95, p=0.037) for Cluster 2 and 1.82 (95% CI, 1.13-2.93, p=0.014) for Cluster 3. Cluster analysis may be effective in producing clinically relevant categories of AHF, and may suggest underlying pathophysiology and potential utility in predicting clinical outcomes. Copyright © 2018 Elsevier B.V. All rights reserved.

  4. Mixture modelling for cluster analysis.

    PubMed

    McLachlan, G J; Chang, S U

    2004-10-01

    Cluster analysis via a finite mixture model approach is considered. With this approach to clustering, the data can be partitioned into a specified number of clusters g by first fitting a mixture model with g components. An outright clustering of the data is then obtained by assigning an observation to the component to which it has the highest estimated posterior probability of belonging; that is, the ith cluster consists of those observations assigned to the ith component (i = 1,..., g). The focus is on the use of mixtures of normal components for the cluster analysis of data that can be regarded as being continuous. But attention is also given to the case of mixed data, where the observations consist of both continuous and discrete variables.
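
    The assignment rule described above (place each observation in the component with the highest estimated posterior probability) can be sketched for a univariate normal mixture. The sketch assumes the mixture has already been fitted; the mixing proportions, means, standard deviations and data below are hypothetical, and the fitting step itself (for example by EM) is not shown.

```python
import math


def normal_pdf(x, mean, sd):
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2 * math.pi))


def posteriors(x, weights, means, sds):
    """Posterior probability that x belongs to each of the g components."""
    joint = [w * normal_pdf(x, m, s) for w, m, s in zip(weights, means, sds)]
    total = sum(joint)
    return [j / total for j in joint]


def assign(x, weights, means, sds):
    """Index of the component with the highest posterior probability."""
    tau = posteriors(x, weights, means, sds)
    return max(range(len(tau)), key=tau.__getitem__)


# Hypothetical fitted mixture with g = 2 components.
weights, means, sds = [0.5, 0.5], [0.0, 4.0], [1.0, 1.0]
data = [-0.3, 0.8, 1.9, 2.1, 3.7, 5.2]
labels = [assign(x, weights, means, sds) for x in data]
```

    With equal weights and equal variances the decision boundary lies midway between the two means, so observations below 2.0 go to the first component and those above go to the second.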

  5. Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis

    PubMed Central

    Lin, Nan; Jiang, Junhai; Guo, Shicheng; Xiong, Momiao

    2015-01-01

    Due to the advancement in sensor technology, the growing large medical image data have the ability to visualize the anatomical changes in biological tissues. As a consequence, the medical images have the potential to enhance the diagnosis of disease, the prediction of clinical outcomes and the characterization of disease progression. But in the meantime, the growing data dimensions pose great methodological and computational challenges for the representation and selection of features in image cluster analysis. To address these challenges, we first extend the functional principal component analysis (FPCA) from one dimension to two dimensions to fully capture the space variation of the image signals. The image signals contain a large number of redundant features which provide no additional information for clustering analysis. The widely used methods for removing the irrelevant features are sparse clustering algorithms using a lasso-type penalty to select the features. However, the accuracy of clustering using a lasso-type penalty depends on the selection of the penalty parameters and the threshold value. In practice, they are difficult to determine. Recently, randomized algorithms have received a great deal of attention in big data analysis. This paper presents a randomized algorithm for accurate feature selection in image clustering analysis. The proposed method is applied to both the liver and kidney cancer histology image data from the TCGA database. The results demonstrate that the randomized feature selection method coupled with functional principal component analysis substantially outperforms the current sparse clustering algorithms in image cluster analysis. PMID:26196383

  6. Clustering for Binary Data Sets by Using Genetic Algorithm-Incremental K-means

    NASA Astrophysics Data System (ADS)

    Saharan, S.; Baragona, R.; Nor, M. E.; Salleh, R. M.; Asrah, N. M.

    2018-04-01

    This research was initially driven by the lack of clustering algorithms that specifically focus on binary data. To overcome this gap in knowledge, a promising technique for analysing this type of data became the main subject of this research, namely Genetic Algorithms (GA). For the purpose of this research, GA was combined with the Incremental K-means (IKM) algorithm to cluster binary data streams. In GAIKM, the objective function was based on a few sufficient statistics that may be easily and quickly calculated on binary numbers. The implementation of IKM gives an advantage in terms of fast convergence. The results show that GAIKM is an efficient and effective new clustering algorithm compared to other clustering algorithms and to IKM itself. In conclusion, GAIKM outperformed other clustering algorithms such as GCUK, IKM, Scalable K-means (SKM) and K-means clustering and paves the way for future research involving missing data and outliers.

  7. Symptom Clusters Change over Time in Women Receiving Adjuvant Chemotherapy for Breast Cancer

    PubMed Central

    Albusoul, Randa M.; Berger, Ann M.; Gay, Caryl L.; Janson, Susan L.; Lee, Kathryn A.

    2017-01-01

    Context Patients with breast cancer receiving chemotherapy (CTX) experience multiple concurrent symptoms, but little is known about how symptoms change during and after treatment. Knowledge of the identity and trajectory of symptom clusters (SCs) would enhance measurement and management. Objectives We aimed to identify SCs and their change over time from baseline to completion of breast cancer CTX. Methods SCs were identified and assessed for change in 219 women from Nebraska at four times: baseline, during cycles #3 and #4 of CTX, and one-month after finishing CTX. Ten symptoms were measured: two using the Hospital Anxiety and Depression Scale and eight using the Symptom Experience Scale. Exploratory factor analysis was conducted at each time point, then changes in SCs were evaluated at different times. Results Two SCs were identified before and after initiating CTX: Gastrointestinal (GI) and Treatment-related (Tr). The number and type of symptoms in each cluster differed over time. Clusters were dynamic during CTX with changes in the number and type of symptoms. Only one Tr SC, which consisted of fatigue, pain, and sleep disturbance, was identified after CTX completion. Conclusion SCs during CTX appear to be dynamic, changing over time from before until after CTX completion. Repeated assessments of SCs reveal symptoms that are present and when patients are most burdened and in need of additional support. PMID:28062343

  8. [Surveillance data on typhoid fever and paratyphoid fever in 2015, China].

    PubMed

    Liu, F F; Zhao, S L; Chen, Q; Chang, Z R; Zhang, J; Zheng, Y M; Luo, L; Ran, L; Liao, Q H

    2017-06-10

    Objective: To analyze the 2015 surveillance data on typhoid fever and paratyphoid fever in order to understand the related epidemiological features and the most likely clustering areas of high incidence. Methods: Individual data were collected from the passive surveillance program and analyzed by descriptive statistical methods. Seasonal, regional and other distribution characteristics of the diseases were described. Spatial-temporal clustering characteristics were estimated using the retrospective space-time method. Results: A total of 8 850 typhoid fever cases were reported from the surveillance system, with an incidence rate of 0.65/100 000. The number of paratyphoid fever cases was 2 794, with an incidence rate of 0.21/100 000. Cases of both typhoid fever and paratyphoid fever occurred all year round, with a high epidemic season from May to October. Most cases involved farmers (39.68%), children (15.89%) and students (12.01%). Children under 5 years showed the highest incidence rate. Retrospective space-time analysis covered the provinces with high incidence rates, including Yunnan, Guangxi, Guizhou, Hunan and Guangdong, and indicated that the first and second class clusters were mainly distributed in adjacent districts and counties near the provincial borders. Conclusion: In 2015, the prevalence rates of typhoid fever and paratyphoid fever were low, although some regions had high prevalence. Cross-regional transmission existed among provinces with high incidence rates, which might be responsible for the clusters appearing in these areas.

  9. A hierarchical cluster analysis of normal-tension glaucoma using spectral-domain optical coherence tomography parameters.

    PubMed

    Bae, Hyoung Won; Ji, Yongwoo; Lee, Hye Sun; Lee, Naeun; Hong, Samin; Seong, Gong Je; Sung, Kyung Rim; Kim, Chan Yun

    2015-01-01

    Normal-tension glaucoma (NTG) is a heterogenous disease, and there is still controversy about subclassifications of this disorder. On the basis of spectral-domain optical coherence tomography (SD-OCT), we subdivided NTG with hierarchical cluster analysis using optic nerve head (ONH) parameters and retinal nerve fiber layer (RNFL) thicknesses. A total of 200 eyes of 200 NTG patients between March 2011 and June 2012 underwent SD-OCT scans to measure ONH parameters and RNFL thicknesses. We classified NTG into homogenous subgroups based on these variables using a hierarchical cluster analysis, and compared clusters to evaluate diverse NTG characteristics. Three clusters were found after hierarchical cluster analysis. Cluster 1 (62 eyes) had the thickest RNFL and widest rim area, and showed early glaucoma features. Cluster 2 (60 eyes) was characterized by the largest cup/disc ratio and cup volume, and showed advanced glaucomatous damage. Cluster 3 (78 eyes) had small disc areas in SD-OCT and were comprised of patients with significantly younger age, longer axial length, and greater myopia than the other 2 groups. A hierarchical cluster analysis of SD-OCT scans divided NTG patients into 3 groups based upon ONH parameters and RNFL thicknesses. It is anticipated that the small disc area group comprised of younger and more myopic patients may show unique features unlike the other 2 groups.

  10. Cluster analysis of spontaneous preterm birth phenotypes identifies potential associations among preterm birth mechanisms.

    PubMed

    Esplin, M Sean; Manuck, Tracy A; Varner, Michael W; Christensen, Bryce; Biggio, Joseph; Bukowski, Radek; Parry, Samuel; Zhang, Heping; Huang, Hao; Andrews, William; Saade, George; Sadovsky, Yoel; Reddy, Uma M; Ilekis, John

    2015-09-01

    We sought to use an innovative tool that is based on common biologic pathways to identify specific phenotypes among women with spontaneous preterm birth (SPTB) to enhance investigators' ability to identify and to highlight common mechanisms and underlying genetic factors that are responsible for SPTB. We performed a secondary analysis of a prospective case-control multicenter study of SPTB. All cases delivered a preterm singleton at SPTB ≤34.0 weeks' gestation. Each woman was assessed for the presence of underlying SPTB causes. A hierarchic cluster analysis was used to identify groups of women with homogeneous phenotypic profiles. One of the phenotypic clusters was selected for candidate gene association analysis with the use of VEGAS software. One thousand twenty-eight women with SPTB were assigned phenotypes. Hierarchic clustering of the phenotypes revealed 5 major clusters. Cluster 1 (n = 445) was characterized by maternal stress; cluster 2 (n = 294) was characterized by premature membrane rupture; cluster 3 (n = 120) was characterized by familial factors, and cluster 4 (n = 63) was characterized by maternal comorbidities. Cluster 5 (n = 106) was multifactorial and characterized by infection (INF), decidual hemorrhage (DH), and placental dysfunction (PD). These 3 phenotypes were correlated highly by χ² analysis (PD and DH, P < 2.2e-6; PD and INF, P = 6.2e-10; INF and DH, P = .0036). Gene-based testing identified the INS (insulin) gene as significantly associated with cluster 3 of SPTB. We identified 5 major clusters of SPTB based on a phenotype tool and hierarchic clustering. There was significant correlation between several of the phenotypes. The INS gene was associated with familial factors that were underlying SPTB. Copyright © 2015 Elsevier Inc. All rights reserved.

  11. Cluster analysis of the hot subdwarfs in the PG survey

    NASA Technical Reports Server (NTRS)

    Thejll, Peter; Charache, Darryl; Shipman, Harry L.

    1989-01-01

    Application of cluster analysis to the hot subdwarfs in the Palomar Green (PG) survey of faint blue high-Galactic-latitude objects is assessed, with emphasis on data noise and the number of clusters to subdivide the data into. The data used in the study are presented, and cluster analysis, using the CLUSTAN program, is applied to it. Distances are calculated using the Euclidean formula, and clustering is done by Ward's method. The results are discussed, and five groups representing natural divisions of the subdwarfs in the PG survey are presented.
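
    The combination named above, Euclidean distances with Ward's method, can be sketched as an agglomerative loop that always merges the pair of clusters whose union gives the smallest increase in within-cluster sum of squares. This is a generic illustration, not the CLUSTAN implementation; the two-dimensional points and function names are hypothetical.

```python
def centroid(cluster):
    n = len(cluster)
    return [sum(p[d] for p in cluster) / n for d in range(len(cluster[0]))]


def ward_cost(a, b):
    """Increase in within-cluster sum of squares if clusters a and b are merged."""
    ca, cb = centroid(a), centroid(b)
    squared_distance = sum((x - y) ** 2 for x, y in zip(ca, cb))
    return len(a) * len(b) / (len(a) + len(b)) * squared_distance


def ward_clustering(points, k):
    """Greedy agglomeration: merge the cheapest pair until k clusters remain."""
    clusters = [[p] for p in points]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                cost = ward_cost(clusters[i], clusters[j])
                if best is None or cost < best[0]:
                    best = (cost, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


# Hypothetical two-parameter measurements for five objects.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0), (20.0, 0.0)]
groups = ward_clustering(points, k=3)
```

    The factor `len(a) * len(b) / (len(a) + len(b))` is what distinguishes Ward's criterion from plain centroid linkage: it penalises merging two large clusters more than absorbing a single point.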

  12. Using Machine Learning Techniques in the Analysis of Oceanographic Data

    NASA Astrophysics Data System (ADS)

    Falcinelli, K. E.; Abuomar, S.

    2017-12-01

    Acoustic Doppler Current Profilers (ADCPs) are oceanographic tools capable of collecting large amounts of current profile data. Using unsupervised machine learning techniques such as principal component analysis, fuzzy c-means clustering, and self-organizing maps, patterns and trends in an ADCP dataset are found. Cluster validity algorithms such as visual assessment of cluster tendency and clustering index are used to determine the optimal number of clusters in the ADCP dataset. These techniques prove to be useful in analysis of ADCP data and demonstrate potential for future use in other oceanographic applications.

  13. Impact of Sampling Density on the Extent of HIV Clustering

    PubMed Central

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor

    2014-01-01

    Abstract Identifying and monitoring HIV clusters could be useful in tracking the leading edge of HIV transmission in epidemics. Currently, greater specificity in the definition of HIV clusters is needed to reduce confusion in the interpretation of HIV clustering results. We address sampling density as one of the key aspects of HIV cluster analysis. The proportion of viral sequences in clusters was estimated at sampling densities from 1.0% to 70%. A set of 1,248 HIV-1C env gp120 V1C5 sequences from a single community in Botswana was utilized in simulation studies. Matching numbers of HIV-1C V1C5 sequences from the LANL HIV Database were used as comparators. HIV clusters were identified by phylogenetic inference under bootstrapped maximum likelihood and pairwise distance cut-offs. Sampling density below 10% was associated with stochastic HIV clustering with broad confidence intervals. HIV clustering increased linearly at sampling density >10%, and was accompanied by narrowing confidence intervals. Patterns of HIV clustering were similar at bootstrap thresholds 0.7 to 1.0, but the extent of HIV clustering decreased with higher bootstrap thresholds. The origin of sampling (local concentrated vs. scattered global) had a substantial impact on HIV clustering at sampling densities ≥10%. Pairwise distances at 10% were estimated as a threshold for cluster analysis of HIV-1 V1C5 sequences. The node bootstrap support distribution provided additional evidence for 10% sampling density as the threshold for HIV cluster analysis. The detectability of HIV clusters is substantially affected by sampling density. A minimal genotyping density of 10% and sampling density of 50–70% are suggested for HIV-1 V1C5 cluster analysis. PMID:25275430
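
    The dependence of clustering on sampling density can be illustrated with a toy simulation. This is a strong simplification of the study design: the study identified clusters by phylogenetic inference with bootstrap support, whereas the sketch below uses only a pairwise distance cut-off, and the "sequences" are hypothetical numbers whose distance is their absolute difference.

```python
import random


def fraction_in_clusters(items, distance, cutoff):
    """Share of sampled items that have at least one neighbour within the cutoff."""
    if not items:
        return 0.0
    linked = sum(
        1
        for i, a in enumerate(items)
        if any(distance(a, b) <= cutoff for j, b in enumerate(items) if i != j)
    )
    return linked / len(items)


def mean_clustering(population, density, distance, cutoff, replicates, rng):
    """Average clustered fraction over random subsamples at a given density."""
    size = max(1, round(density * len(population)))
    runs = [
        fraction_in_clusters(rng.sample(population, size), distance, cutoff)
        for _ in range(replicates)
    ]
    return sum(runs) / len(runs)


def gap(a, b):
    return abs(a - b)


# Hypothetical population: ten tight pairs, with pairs far apart from each other.
population = [base + offset for base in range(0, 100, 10) for offset in (0.0, 0.5)]
rng = random.Random(0)  # fixed seed so the sketch is repeatable
full = mean_clustering(population, 1.0, gap, 1.0, replicates=1, rng=rng)
sparse = mean_clustering(population, 0.1, gap, 1.0, replicates=200, rng=rng)
```

    At full sampling every item is found with its partner, while at 10% density a pair is only detected when both members happen to be drawn, so the observed clustered fraction drops sharply.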

  14. Metabolic network visualization eliminating node redundance and preserving metabolic pathways

    PubMed Central

    Bourqui, Romain; Cottret, Ludovic; Lacroix, Vincent; Auber, David; Mary, Patrick; Sagot, Marie-France; Jourdan, Fabien

    2007-01-01

    Background The tools that are available to draw and to manipulate the representations of metabolism are usually restricted to metabolic pathways. This limitation becomes problematic when studying processes that span several pathways. The various attempts that have been made to draw genome-scale metabolic networks are confronted with two shortcomings: (1) they do not use contextual information, which leads to dense, hard-to-interpret drawings, and (2) they must conform to very constrained standards, which implies, in particular, duplicating nodes, making topological analysis considerably more difficult. Results We propose a method, called MetaViz, which draws a genome-scale metabolic network while taking into account its structuring into pathways. This method consists of two steps: a clustering step, which addresses the pathway overlapping problem, and a drawing step, which consists of drawing the clustered graph and each cluster. Conclusion The method we propose is original and addresses new drawing issues arising from the no-duplication constraint. We do not propose a single drawing but rather several alternative ways of presenting metabolism depending on the pathway on which one wishes to focus. We believe that this provides a valuable tool to explore the pathway structure of metabolism. PMID:17608928

  15. Assisting community management of groundwater: Irrigator attitudes in two watersheds in Rajasthan and Gujarat, India

    NASA Astrophysics Data System (ADS)

    Varua, M. E.; Ward, J.; Maheshwari, B.; Oza, S.; Purohit, R.; Hakimuddin; Chinnasamy, P.

    2016-06-01

    The absence of either state regulations or markets to coordinate the operation of individual wells has focussed attention on community level institutions as the primary loci for sustainable groundwater management in Rajasthan and Gujarat, India. The reported research relied on theoretical propositions that livelihood strategies, groundwater management and the propensity to cooperate are associated with the attitudinal orientations of well owners in the Meghraj and Dharta watersheds, located in Gujarat and Rajasthan respectively. The research tested the hypothesis that attitudes to groundwater management and farming practices, household income and trust levels of assisting agencies were not consistent across the watersheds, implying that a targeted approach, in contrast to default uniform programs, would help communities craft rules to manage groundwater across multiple hydro-geological settings. Hierarchical cluster analysis of attitudes held by survey respondents revealed four statistically significant discrete clusters, supporting acceptance of the hypothesis. Further analyses revealed significant differences in farming practices, household wealth and willingness to adapt across the four groundwater management clusters. In conclusion, the need to account for attitudinal diversity is highlighted, and a framework is described to guide the specific design of processes that help communities craft coordinating instruments to sustainably manage local aquifers.

  16. Quality Evaluation of Potentilla fruticosa L. by High Performance Liquid Chromatography Fingerprinting Associated with Chemometric Methods.

    PubMed

    Liu, Wei; Wang, Dongmei; Liu, Jianjun; Li, Dengwu; Yin, Dongxue

    2016-01-01

    The present study was performed to assess the quality of Potentilla fruticosa L. sampled from distinct regions of China using high performance liquid chromatography (HPLC) fingerprinting coupled with a suite of chemometric methods. For this quantitative analysis, the main active phytochemical compositions and the antioxidant activity in P. fruticosa were also investigated. Considering the high percentages and antioxidant activities of phytochemicals, P. fruticosa samples from Kangding, Sichuan were selected as the most valuable raw materials. Similarity analysis (SA) of HPLC fingerprints, hierarchical cluster analysis (HCA), principal component analysis (PCA), and discriminant analysis (DA) were further employed to provide accurate classification and quality estimates of P. fruticosa. Two principal components (PCs) were collected by PCA. PC1 separated samples from Kangding, Sichuan, capturing 57.64% of the variance, whereas PC2 contributed to further separation, capturing 18.97% of the variance. Two kinds of discriminant functions with a 100% discrimination ratio were constructed. The results strongly supported the conclusion that the eight samples from different regions were clustered into three major groups, corresponding with their morphological classification, for which HPLC analysis confirmed the considerable variation in phytochemical compositions and that P. fruticosa samples from Kangding, Sichuan were of high quality. The results of SA, HCA, PCA, and DA were in agreement and performed well for the quality assessment of P. fruticosa. Consequently, HPLC fingerprinting coupled with chemometric techniques provides a highly flexible and reliable method for the quality evaluation of traditional Chinese medicines.

  17. Quality Evaluation of Potentilla fruticosa L. by High Performance Liquid Chromatography Fingerprinting Associated with Chemometric Methods

    PubMed Central

    Liu, Wei; Wang, Dongmei; Liu, Jianjun; Li, Dengwu; Yin, Dongxue

    2016-01-01

    The present study was performed to assess the quality of Potentilla fruticosa L. sampled from distinct regions of China using high performance liquid chromatography (HPLC) fingerprinting coupled with a suite of chemometric methods. For this quantitative analysis, the main active phytochemical compositions and the antioxidant activity in P. fruticosa were also investigated. Considering the high percentages and antioxidant activities of phytochemicals, P. fruticosa samples from Kangding, Sichuan were selected as the most valuable raw materials. Similarity analysis (SA) of HPLC fingerprints, hierarchical cluster analysis (HCA), principal component analysis (PCA), and discriminant analysis (DA) were further employed to provide accurate classification and quality estimates of P. fruticosa. Two principal components (PCs) were collected by PCA. PC1 separated samples from Kangding, Sichuan, capturing 57.64% of the variance, whereas PC2 contributed to further separation, capturing 18.97% of the variance. Two kinds of discriminant functions with a 100% discrimination ratio were constructed. The results strongly supported the conclusion that the eight samples from different regions were clustered into three major groups, corresponding with their morphological classification, for which HPLC analysis confirmed the considerable variation in phytochemical compositions and that P. fruticosa samples from Kangding, Sichuan were of high quality. The results of SA, HCA, PCA, and DA were in agreement and performed well for the quality assessment of P. fruticosa. Consequently, HPLC fingerprinting coupled with chemometric techniques provides a highly flexible and reliable method for the quality evaluation of traditional Chinese medicines. PMID:26890416

  18. Hierarchical cluster analysis of progression patterns in open-angle glaucoma patients with medical treatment.

    PubMed

    Bae, Hyoung Won; Rho, Seungsoo; Lee, Hye Sun; Lee, Naeun; Hong, Samin; Seong, Gong Je; Sung, Kyung Rim; Kim, Chan Yun

    2014-04-29

    To classify medically treated open-angle glaucoma (OAG) by the pattern of progression using hierarchical cluster analysis, and to determine OAG progression characteristics by comparing clusters. The study included ninety-five eyes of 95 OAG patients who received medical treatment and had undergone visual field (VF) testing at least once per year for 5 or more years. OAG was classified into subgroups using hierarchical cluster analysis based on the following five variables: baseline mean deviation (MD), baseline visual field index (VFI), MD slope, VFI slope, and Glaucoma Progression Analysis (GPA) printout. After that, other parameters were compared between clusters. Hierarchical cluster analysis produced two clusters. Cluster 1 showed -4.06 ± 2.43 dB baseline MD, 92.58% ± 6.27% baseline VFI, -0.28 ± 0.38 dB per year MD slope, -0.52% ± 0.81% per year VFI slope, and all "no progression" cases in GPA printout, whereas cluster 2 showed -8.68 ± 3.81 baseline MD, 77.54 ± 12.98 baseline VFI, -0.72 ± 0.55 MD slope, -2.22 ± 1.89 VFI slope, and seven "possible" and four "likely" progression cases in GPA printout. There were no significant differences in age, sex, mean IOP, central corneal thickness, and axial length between clusters. However, cluster 2 included more high-tension glaucoma patients and used significantly more antiglaucoma eye drops than cluster 1. Hierarchical cluster analysis of progression patterns divided OAG into slow and fast progression groups, evidenced by assessing the parameters of glaucomatous progression in VF testing. In the fast progression group, the prevalence of high-tension glaucoma was greater and the number of antiglaucoma medications administered was higher than in the slow progression group. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
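
    As a rough illustration of how hierarchical clustering can split eyes into slow and fast progression groups, the sketch below standardizes two of the variables named in the abstract (baseline MD and MD slope) and merges clusters bottom-up. The abstract does not state the distance measure or linkage rule, so Euclidean distance and average linkage are assumptions here, and the six data rows are invented toy values, not study data.

```python
from math import dist
from statistics import mean, pstdev


def standardize(rows):
    """Z-score each column so variables on different scales are comparable."""
    cols = list(zip(*rows))
    mus = [mean(c) for c in cols]
    sds = [pstdev(c) or 1.0 for c in cols]  # guard against zero spread
    return [tuple((v - m) / s for v, m, s in zip(r, mus, sds)) for r in rows]


def agglomerate(points, n_clusters):
    """Average-linkage agglomerative clustering; returns lists of row indices."""
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Mean pairwise Euclidean distance between two clusters.
        return mean(dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > n_clusters:
        i, j = min(
            ((i, j) for i in range(len(clusters))
             for j in range(i + 1, len(clusters))),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        clusters[i] += clusters[j]  # merge the closest pair
        del clusters[j]
    return clusters


# Hypothetical eyes: (baseline MD in dB, MD slope in dB per year).
eyes = [
    (-4.0, -0.30), (-3.5, -0.20), (-4.5, -0.25),  # slower progression
    (-8.5, -0.70), (-9.0, -0.80), (-8.0, -0.75),  # faster progression
]
groups = agglomerate(standardize(eyes), n_clusters=2)
```

    Standardizing first matters because MD and MD slope differ in scale by roughly an order of magnitude; without it the baseline MD would dominate the distances.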

  19. Multilocus sequence analysis and rpoB sequencing of Mycobacterium abscessus (sensu lato) strains.

    PubMed

    Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate

    2011-02-01

    Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536(T), M. massiliense CIP 108297(T), and M. bolletii CIP 108541(T)) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters were all below 3%, and those between the M. abscessus and M. massiliense clusters ranged from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering of strains. We found 10/120 (8.3%) isolates for which the concatenated MLSA gene sequence and rpoB sequence were discordant (e.g., M. massiliense MLSA sequence and M. abscessus rpoB sequence), suggesting the intergroup lateral transfers of rpoB. In conclusion, our study strongly supports the recent proposal that M. abscessus, M. massiliense, and M. bolletii should constitute a single species. Our findings also indicate that there has been a horizontal transfer of rpoB sequences between these subgroups, precluding the use of rpoB sequencing alone for the accurate identification of the two proposed M. abscessus subspecies.

  20. Multilocus Sequence Analysis and rpoB Sequencing of Mycobacterium abscessus (Sensu Lato) Strains

    PubMed Central

    Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate

    2011-01-01

    Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536T, M. massiliense CIP 108297T, and M. bolletii CIP 108541T) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters were all below 3%, and those between the M. abscessus and M. massiliense clusters ranged from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering of strains. We found 10/120 (8.3%) isolates for which the concatenated MLSA gene sequence and rpoB sequence were discordant (e.g., M. massiliense MLSA sequence and M. abscessus rpoB sequence), suggesting the intergroup lateral transfers of rpoB. In conclusion, our study strongly supports the recent proposal that M. abscessus, M. massiliense, and M. bolletii should constitute a single species. Our findings also indicate that there has been a horizontal transfer of rpoB sequences between these subgroups, precluding the use of rpoB sequencing alone for the accurate identification of the two proposed M. abscessus subspecies. PMID:21106786
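
    The divergence percentages quoted in this abstract compare aligned sequences pairwise. The abstract does not say which distance model was used, so the sketch below assumes the simplest one, the uncorrected proportion of differing sites (p-distance), and skips gapped positions. The function name and the short example sequences are hypothetical.

```python
def percent_divergence(seq_a, seq_b):
    """Uncorrected pairwise divergence (%) between two aligned sequences.

    Positions where either sequence has a gap ('-') are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for x, y in zip(seq_a.upper(), seq_b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x != y:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared


# One mismatch over ten aligned sites gives 10% divergence.
example = percent_divergence("ACGTACGTAC", "ACGTACGTAA")

# The discordance rate reported in the abstract: 10 of 120 isolates.
discordant_pct = round(100 * 10 / 120, 1)
```

    Under this assumed model, a 3% divergence over the 752 bp rpoB fragment corresponds to about 23 differing sites, which gives a sense of how few substitutions separate the three clusters.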
