Dong, Skye T; Costa, Daniel S J; Butow, Phyllis N; Lovell, Melanie R; Agar, Meera; Velikova, Galina; Teckle, Paulos; Tong, Allison; Tebbutt, Niall C; Clarke, Stephen J; van der Hoek, Kim; King, Madeleine T; Fayers, Peter M
2016-01-01
Symptom clusters in advanced cancer can influence patient outcomes. There is large heterogeneity in the methods used to identify symptom clusters. To investigate the consistency of symptom cluster composition in advanced cancer patients using different statistical methodologies for all patients across five primary cancer sites, and to examine which clusters predict functional status, a global assessment of health and global quality of life. Principal component analysis and exploratory factor analysis (with different rotation and factor selection methods) and hierarchical cluster analysis (with different linkage and similarity measures) were used on a data set of 1562 advanced cancer patients who completed the European Organization for the Research and Treatment of Cancer Quality of Life Questionnaire-Core 30. Four clusters consistently formed for many of the methods and cancer sites: tense-worry-irritable-depressed (emotional cluster), fatigue-pain, nausea-vomiting, and concentration-memory (cognitive cluster). The emotional cluster was a stronger predictor of overall quality of life than the other clusters. Fatigue-pain was a stronger predictor of overall health than the other clusters. The cognitive cluster and fatigue-pain predicted physical functioning, role functioning, and social functioning. The four identified symptom clusters were consistent across statistical methods and cancer types, although there were some noteworthy differences. Statistical derivation of symptom clusters is in need of greater methodological guidance. A psychosocial pathway in the management of symptom clusters may improve quality of life. Biological mechanisms underpinning symptom clusters need to be delineated by future research. A framework for evidence-based screening, assessment, treatment, and follow-up of symptom clusters in advanced cancer is essential. Copyright © 2016 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Symptom clusters and quality of life among patients with advanced heart failure
Yu, Doris SF; Chan, Helen YL; Leung, Doris YP; Hui, Elsie; Sit, Janet WH
2016-01-01
Objectives To identify symptom clusters among patients with advanced heart failure (HF) and the independent relationships with their quality of life (QoL). Methods This is the secondary data analysis of a cross-sectional study which interviewed 119 patients with advanced HF in the geriatric unit of a regional hospital in Hong Kong. The symptom profile and QoL were assessed by using the Edmonton Symptom Assessment Scale (ESAS) and the McGill QoL Questionnaire. Exploratory factor analysis was used to identify the symptom clusters. Hierarchical regression analysis was used to examine the independent relationships with their QoL, after adjusting the effects of age, gender, and comorbidities. Results The patients were at an advanced age (82.9 ± 6.5 years). Three distinct symptom clusters were identified: they were the distress cluster (including shortness of breath, anxiety, and depression), the decondition cluster (fatigue, drowsiness, nausea, and reduced appetite), and the discomfort cluster (pain, and sense of generalized discomfort). These three symptom clusters accounted for 63.25% of variance of the patients' symptom experience. The small to moderate correlations between these symptom clusters indicated that they were rather independent of one another. After adjusting the age, gender and comorbidities, the distress (β = −0.635, P < 0.001), the decondition (β = −0.148, P = 0.01), and the discomfort (β = −0.258, P < 0.001) symptom clusters independently predicted their QoL. Conclusions This study identified the distinctive symptom clusters among patients with advanced HF. The results shed light on the need to develop palliative care interventions for optimizing the symptom control for this life-limiting disease. PMID:27403150
Advanced Solar Observatory (ASO) accommodations requirements study
NASA Technical Reports Server (NTRS)
1989-01-01
Results of an accommodations analysis for the Advanced Solar Observatory on Space Station Freedom are reported. Concepts for the High Resolution Telescope Cluster, Pinhole/Occulter Facility, and High Energy Cluster were developed which can be accommodated on Space Station Freedom. It is shown that workable accommodations concepts are possible. Areas of emphasis for the next stage of engineering development are identified.
Ahmad, Tariq; Desai, Nihar; Wilson, Francis; Schulte, Phillip; Dunning, Allison; Jacoby, Daniel; Allen, Larry; Fiuzat, Mona; Rogers, Joseph; Felker, G Michael; O'Connor, Christopher; Patel, Chetan B
2016-01-01
Classification of acute decompensated heart failure (ADHF) is based on subjective criteria that crudely capture disease heterogeneity. Improved phenotyping of the syndrome may help improve therapeutic strategies. To derive cluster analysis-based groupings for patients hospitalized with ADHF, and compare their prognostic performance to hemodynamic classifications derived at the bedside. We performed a cluster analysis on baseline clinical variables and PAC measurements of 172 ADHF patients from the ESCAPE trial. Employing regression techniques, we examined associations between clusters and clinically determined hemodynamic profiles (warm/cold/wet/dry). We assessed association with clinical outcomes using Cox proportional hazards models. Likelihood ratio tests were used to compare the prognostic value of cluster data to that of hemodynamic data. We identified four advanced HF clusters: 1) male Caucasians with ischemic cardiomyopathy, multiple comorbidities, lowest B-type natriuretic peptide (BNP) levels; 2) females with non-ischemic cardiomyopathy, few comorbidities, most favorable hemodynamics; 3) young African American males with non-ischemic cardiomyopathy, most adverse hemodynamics, advanced disease; and 4) older Caucasians with ischemic cardiomyopathy, concomitant renal insufficiency, highest BNP levels. There was no association between clusters and bedside-derived hemodynamic profiles (p = 0.70). For all adverse clinical outcomes, Cluster 4 had the highest risk, and Cluster 2, the lowest. Compared to Cluster 4, Clusters 1-3 had 45-70% lower risk of all-cause mortality. Clusters were significantly associated with clinical outcomes, whereas hemodynamic profiles were not. By clustering patients with similar objective variables, we identified four clinically relevant phenotypes of ADHF patients, with no discernable relationship to hemodynamic profiles, but distinct associations with adverse outcomes. Our analysis suggests that ADHF classification using simultaneous considerations of etiology, comorbid conditions, and biomarker levels, may be superior to bedside classifications.
Choi, S; Ryu, E
2018-01-01
People with advanced lung cancer experience later symptoms after treatment that is related to poorer psychosocial and quality of life (QOL) outcomes. The purpose of this study was to identify the effect of symptom clusters and depression on the QOL of patients with advanced lung cancer. A sample of 178 patients with advanced lung cancer at the National Cancer Center in Korea completed a demographic questionnaire, the M.D. Anderson Symptom Inventory-Lung Cancer, the Center for Epidemiological Studies Depression Scale, and the Functional Assessment of Cancer Therapy-General scale. The most frequently experienced symptom was fatigue, anguish was the most severe symptom-associated distress, and 28.9% of participants were clinically depressed. Factor analysis was used to identify symptom clusters based on the severity of patients' symptom experiences. Three symptom clusters were identified: treatment-associated, lung cancer and psychological symptom clusters. The regression model found a significant negative impact on QOL for depression and lung cancer symptom cluster. Age as the control variable was found to be significant impact on QOL. Therefore, psychological screening and appropriate intervention is an essential part of advanced cancer care. Both pharmacological and non-pharmacological approaches for alleviating depression may help to improve the QOL of lung cancer patients. © 2016 John Wiley & Sons Ltd.
Yennurajalingam, Sriram; Williams, Janet L; Chisholm, Gary; Bruera, Eduardo
2016-03-01
Advanced cancer patients frequently experience debilitating symptoms that occur in clusters, but few pharmacological studies have targeted symptom clusters. Our objective was to examine the effects of dexamethasone on symptom clusters in patients with advanced cancer. We reviewed the data from a previous randomized clinical trial to determine the effects of dexamethasone on cancer symptoms. Symptom clusters were identified according to baseline symptoms by using principal component analysis. Correlations and change in the severity of symptom clusters were analyzed after study treatment. A total of 114 participants were included in this study. Three clusters were identified: fatigue/anorexia-cachexia/depression (FAD), sleep/anxiety/drowsiness (SAD), and pain/dyspnea (PD). Changes in severity of FAD and PD significantly correlated over time (at baseline, day 8, and day 15). The FAD cluster was associated with significant improvement in severity at day 8 and day 15, whereas no significant change was observed with the SAD cluster or PD cluster after dexamethasone treatment. The results of this preliminary study suggest significant correlation over time and improvement in the FAD cluster at day 8 and day 15 after treatment with dexamethasone. These findings suggest that fatigue, anorexia-cachexia, and depression may share a common pathophysiologic basis. Further studies are needed to investigate this cluster and target anti-inflammatory therapies. ©AlphaMed Press.
ERIC Educational Resources Information Center
Huang, Francis L.; Cornell, Dewey G.
2016-01-01
Advances in multilevel modeling techniques now make it possible to investigate the psychometric properties of instruments using clustered data. Factor models that overlook the clustering effect can lead to underestimated standard errors, incorrect parameter estimates, and model fit indices. In addition, factor structures may differ depending on…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Abramovici, E.; Northwood, D.O.; Shehata, M.T.
1999-01-01
The contents include Analysis of In-Service Failures (tutorials, transportation industry, corrosion and materials degradation, electronic and advanced materials); 1998 Sorby Award Lecture by Kay Geels, Struers A/S (Metallographic Preparation from Sorby to the Present); Advances in Microstructural Characterization (characterization techniques using high resolution and focused ion beam, characterization of microstructural clustering and correlation with performance); Advanced Applications (advanced alloys and intermetallic compounds, plasma spray coatings and other surface coatings, corrosion, and materials degradation).
Symptom Clusters and Quality of Life in Hospice Patients with Cancer
Omran, Suha; Khader, Yousef; McMillan, Susan
2017-01-01
Background: Symptom control is an important part of palliative care and important to achieve optimal quality of life (QOL). Studies have shown that patients with advanced cancer suffer from diverse and often severe physical and psychological symptoms. The aim is to explore the influence of symptom clusters on QOL among patients with advanced cancer. Materials and Methods: 709 patients with advanced cancer were recruited to participate in a clinical trial focusing on symptom management and QOL. Patients were adults newly admitted to hospice home care in one of two hospices in southwest Florida, who could pass mental status screening. The instruments used for data collection were the Demographic Data Form, Memorial Symptom Assessment Scale (MSAS), and the Hospice Quality of Life Index-14. Results: Exploratory factor analysis and multiple regression were used to identify symptom clusters and their influence on QOL. The results revealed that the participants experienced multiple concurrent symptoms. There were four symptom clusters found among these cancer patients. Individual symptom distress scores that were the strongest predictors of QOL were: feeling pain; dry mouth; feeling drowsy; nausea; difficulty swallowing; worrying and feeling nervous. Conclusions: Patients with advanced cancer reported various concurrent symptoms, and these form symptom clusters of four main categories. The four symptoms clusters have a negative influence on patients’ QOL and required specific care from different members of the hospice healthcare team. The results of this study should be used to guide health care providers’ symptom management. Proper attention to symptom clusters should be the basis for accurate planning of effective interventions to manage the symptom clusters experienced by advanced cancer patients. The health care provider needs to plan ahead for these symptoms and manage any concurrent symptoms for successful promotion of their patient’s QOL. PMID:28950683
Symptom Clusters and Quality of Life in Hospice Patients with Cancer
Omran, Suha; Khader, Yousef; McMillan, Susan
2017-09-27
Background: Symptom control is an important part of palliative care and important to achieve optimal quality of life (QOL). Studies have shown that patients with advanced cancer suffer from diverse and often severe physical and psychological symptoms. The aim is to explore the influence of symptom clusters on QOL among patients with advanced cancer. Materials and Methods: 709 patients with advanced cancer were recruited to participate in a clinical trial focusing on symptom management and QOL. Patients were adults newly admitted to hospice home care in one of two hospices in southwest Florida, who could pass mental status screening. The instruments used for data collection were the Demographic Data Form, Memorial Symptom Assessment Scale (MSAS), and the Hospice Quality of Life Index-14. Results: Exploratory factor analysis and multiple regression were used to identify symptom clusters and their influence on QOL. The results revealed that the participants experienced multiple concurrent symptoms. There were four symptom clusters found among these cancer patients. Individual symptom distress scores that were the strongest predictors of QOL were: feeling pain; dry mouth; feeling drowsy; nausea; difficulty swallowing; worrying and feeling nervous. Conclusions: Patients with advanced cancer reported various concurrent symptoms, and these form symptom clusters of four main categories. The four symptoms clusters have a negative influence on patients’ QOL and required specific care from different members of the hospice healthcare team. The results of this study should be used to guide health care providers’ symptom management. Proper attention to symptom clusters should be the basis for accurate planning of effective interventions to manage the symptom clusters experienced by advanced cancer patients. The health care provider needs to plan ahead for these symptoms and manage any concurrent symptoms for successful promotion of their patient’s QOL. Creative Commons Attribution License
Bai, Mei; Dixon, Jane; Williams, Anna-Leila; Jeon, Sangchoon; Lazenby, Mark; McCorkle, Ruth
2016-11-01
Research shows that spiritual well-being correlates positively with quality of life (QOL) for people with cancer, whereas contradictory findings are frequently reported with respect to the differentiated associations between dimensions of spiritual well-being, namely peace, meaning and faith, and QOL. This study aimed to examine individual patterns of spiritual well-being among patients newly diagnosed with advanced cancer. Cluster analysis was based on the twelve items of the 12-item Functional Assessment of Chronic Illness Therapy-Spiritual Well-Being Scale at Time 1. A combination of hierarchical and k-means (non-hierarchical) clustering methods was employed to jointly determine the number of clusters. Self-rated health, depressive symptoms, peace, meaning and faith, and overall QOL were compared at Time 1 and Time 2. Hierarchical and k-means clustering methods both suggested four clusters. Comparison of the four clusters supported statistically significant and clinically meaningful differences in QOL outcomes among clusters while revealing contrasting relations of faith with QOL. Cluster 1, Cluster 3, and Cluster 4 represented high, medium, and low levels of overall QOL, respectively, with correspondingly high, medium, and low levels of peace, meaning, and faith. Cluster 2 was distinguished from other clusters by its medium levels of overall QOL, peace, and meaning and low level of faith. This study provides empirical support for individual difference in response to a newly diagnosed cancer and brings into focus conceptual and methodological challenges associated with the measure of spiritual well-being, which may partly contribute to the attenuated relation between faith and QOL.
ERIC Educational Resources Information Center
Lee, Alwyn Vwen Yen; Tan, Seng Chee
2017-01-01
Understanding ideas in a discourse is challenging, especially in textual discourse analysis. We propose using temporal analytics with unsupervised machine learning techniques to investigate promising ideas for the collective advancement of communal knowledge in an online knowledge building discourse. A discourse unit network was constructed and…
Fast gene ontology based clustering for microarray experiments.
Ovaska, Kristian; Laakso, Marko; Hautaniemi, Sampsa
2008-11-21
Analysis of a microarray experiment often results in a list of hundreds of disease-associated genes. In order to suggest common biological processes and functions for these genes, Gene Ontology annotations with statistical testing are widely used. However, these analyses can produce a very large number of significantly altered biological processes. Thus, it is often challenging to interpret GO results and identify novel testable biological hypotheses. We present fast software for advanced gene annotation using semantic similarity for Gene Ontology terms combined with clustering and heat map visualisation. The methodology allows rapid identification of genes sharing the same Gene Ontology cluster. Our R based semantic similarity open-source package has a speed advantage of over 2000-fold compared to existing implementations. From the resulting hierarchical clustering dendrogram genes sharing a GO term can be identified, and their differences in the gene expression patterns can be seen from the heat map. These methods facilitate advanced annotation of genes resulting from data analysis.
Pichia stipitis genomics, transcriptomics, and gene clusters
Thomas W. Jeffries; Jennifer R. Headman Van Vleet
2009-01-01
Genome sequencing and subsequent global gene expression studies have advanced our understanding of the lignocellulose-fermenting yeast Pichia stipitis. These studies have provided an insight into its central carbon metabolism, and analysis of its genome has revealed numerous functional gene clusters and tandem repeats. Specialized physiological traits are often the...
Lee, Alexandra J; Chang, Ivan; Burel, Julie G; Lindestam Arlehamn, Cecilia S; Mandava, Aishwarya; Weiskopf, Daniela; Peters, Bjoern; Sette, Alessandro; Scheuermann, Richard H; Qian, Yu
2018-04-17
Computational methods for identification of cell populations from polychromatic flow cytometry data are changing the paradigm of cytometry bioinformatics. Data clustering is the most common computational approach to unsupervised identification of cell populations from multidimensional cytometry data. However, interpretation of the identified data clusters is labor-intensive. Certain types of user-defined cell populations are also difficult to identify by fully automated data clustering analysis. Both are roadblocks before a cytometry lab can adopt the data clustering approach for cell population identification in routine use. We found that combining recursive data filtering and clustering with constraints converted from the user manual gating strategy can effectively address these two issues. We named this new approach DAFi: Directed Automated Filtering and Identification of cell populations. Design of DAFi preserves the data-driven characteristics of unsupervised clustering for identifying novel cell subsets, but also makes the results interpretable to experimental scientists through mapping and merging the multidimensional data clusters into the user-defined two-dimensional gating hierarchy. The recursive data filtering process in DAFi helped identify small data clusters which are otherwise difficult to resolve by a single run of the data clustering method due to the statistical interference of the irrelevant major clusters. Our experiment results showed that the proportions of the cell populations identified by DAFi, while being consistent with those by expert centralized manual gating, have smaller technical variances across samples than those from individual manual gating analysis and the nonrecursive data clustering analysis. Compared with manual gating segregation, DAFi-identified cell populations avoided the abrupt cut-offs on the boundaries. DAFi has been implemented to be used with multiple data clustering methods including K-means, FLOCK, FlowSOM, and the ClusterR package. For cell population identification, DAFi supports multiple options including clustering, bisecting, slope-based gating, and reversed filtering to meet various autogating needs from different scientific use cases. © 2018 International Society for Advancement of Cytometry. © 2018 International Society for Advancement of Cytometry.
[Typologies of Madrid's citizens (Spain) at the end-of-life: cluster analysis].
Ortiz-Gonçalves, Belén; Perea-Pérez, Bernardo; Labajo González, Elena; Albarrán Juan, Elena; Santiago-Sáez, Andrés
2018-03-06
To establish typologies within Madrid's citizens (Spain) with regard to end-of-life by cluster analysis. The SPAD 8 programme was implemented in a sample from a health care centre in the autonomous region of Madrid (Spain). A multiple correspondence analysis technique was used, followed by a cluster analysis to create a dendrogram. A cross-sectional study was made beforehand with the results of the questionnaire. Five clusters stand out. Cluster 1: a group who preferred not to answer numerous questions (5%). Cluster 2: in favour of receiving palliative care and euthanasia (40%). Cluster 3: would oppose assisted suicide and would not ask for spiritual assistance (15%). Cluster 4: would like to receive palliative care and assisted suicide (16%). Cluster 5: would oppose assisted suicide and would ask for spiritual assistance (24%). The following four clusters stood out. Clusters 2 and 4 would like to receive palliative care, euthanasia (2) and assisted suicide (4). Clusters 4 and 5 regularly practiced their faith and their family members did not receive palliative care. Clusters 3 and 5 would be opposed to euthanasia and assisted suicide in particular. Clusters 2, 4 and 5 had not completed an advance directive document (2, 4 and 5). Clusters 2 and 3 seldom practiced their faith. This study could be taken into consideration to improve the quality of end-of-life care choices. Copyright © 2017 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.
Bae, Hyoung Won; Ji, Yongwoo; Lee, Hye Sun; Lee, Naeun; Hong, Samin; Seong, Gong Je; Sung, Kyung Rim; Kim, Chan Yun
2015-01-01
Normal-tension glaucoma (NTG) is a heterogenous disease, and there is still controversy about subclassifications of this disorder. On the basis of spectral-domain optical coherence tomography (SD-OCT), we subdivided NTG with hierarchical cluster analysis using optic nerve head (ONH) parameters and retinal nerve fiber layer (RNFL) thicknesses. A total of 200 eyes of 200 NTG patients between March 2011 and June 2012 underwent SD-OCT scans to measure ONH parameters and RNFL thicknesses. We classified NTG into homogenous subgroups based on these variables using a hierarchical cluster analysis, and compared clusters to evaluate diverse NTG characteristics. Three clusters were found after hierarchical cluster analysis. Cluster 1 (62 eyes) had the thickest RNFL and widest rim area, and showed early glaucoma features. Cluster 2 (60 eyes) was characterized by the largest cup/disc ratio and cup volume, and showed advanced glaucomatous damage. Cluster 3 (78 eyes) had small disc areas in SD-OCT and were comprised of patients with significantly younger age, longer axial length, and greater myopia than the other 2 groups. A hierarchical cluster analysis of SD-OCT scans divided NTG patients into 3 groups based upon ONH parameters and RNFL thicknesses. It is anticipated that the small disc area group comprised of younger and more myopic patients may show unique features unlike the other 2 groups.
Lin, Nan; Jiang, Junhai; Guo, Shicheng; Xiong, Momiao
2015-01-01
Due to the advancement in sensor technology, the growing large medical image data have the ability to visualize the anatomical changes in biological tissues. As a consequence, the medical images have the potential to enhance the diagnosis of disease, the prediction of clinical outcomes and the characterization of disease progression. But in the meantime, the growing data dimensions pose great methodological and computational challenges for the representation and selection of features in image cluster analysis. To address these challenges, we first extend the functional principal component analysis (FPCA) from one dimension to two dimensions to fully capture the space variation of image the signals. The image signals contain a large number of redundant features which provide no additional information for clustering analysis. The widely used methods for removing the irrelevant features are sparse clustering algorithms using a lasso-type penalty to select the features. However, the accuracy of clustering using a lasso-type penalty depends on the selection of the penalty parameters and the threshold value. In practice, they are difficult to determine. Recently, randomized algorithms have received a great deal of attentions in big data analysis. This paper presents a randomized algorithm for accurate feature selection in image clustering analysis. The proposed method is applied to both the liver and kidney cancer histology image data from the TCGA database. The results demonstrate that the randomized feature selection method coupled with functional principal component analysis substantially outperforms the current sparse clustering algorithms in image cluster analysis. PMID:26196383
NASA Technical Reports Server (NTRS)
Zhu, Dongming; Chen, Yuan L.; Miller, Robert A.
2003-01-01
Advanced oxide thermal barrier coatings have been developed by incorporating multi-component rare earth oxide dopants into zirconia-yttria to effectively promote the creation of the thermodynamically stable, immobile oxide defect clusters and/or nano-scale phases within the coating systems. The presence of these nano-sized defect clusters has found to significantly reduce the coating intrinsic thermal conductivity, improve sintering resistance, and maintain long-term high temperature stability. In this paper, the defect clusters and nano-structured phases, which were created by the addition of multi-component rare earth dopants to the plasma-sprayed and electron-beam physical vapor deposited thermal barrier coatings, were characterized by high-resolution transmission electron microscopy (TEM). The defect cluster size, distribution, crystallographic and compositional information were investigated using high-resolution TEM lattice imaging, selected area diffraction (SAD), electron energy-loss spectroscopy (EELS) and energy dispersive spectroscopy (EDS) analysis techniques. The results showed that substantial defect clusters were formed in the advanced multi-component rare earth oxide doped zirconia- yttria systems. The size of the oxide defect clusters and the cluster dopant segregation was typically ranging from 5 to 50 nm. These multi-component dopant induced defect clusters are an important factor for the coating long-term high temperature stability and excellent performance.
NASA Technical Reports Server (NTRS)
Zhu, Dongming; Chen, Yuan L.; Miller, Robert A.
1990-01-01
Advanced oxide thermal barrier coatings have been developed by incorporating multi- component rare earth oxide dopants into zirconia-yttria to effectively promote the creation of the thermodynamically stable, immobile oxide defect clusters and/or nano-scale phases within the coating systems. The presence of these nano-sized defect clusters has found to significantly reduce the coating intrinsic thermal conductivity, improve sintering resistance, and maintain long-term high temperature stability. In this paper, the defect clusters and nano-structured phases, which were created by the addition of multi-component rare earth dopants to the plasma- sprayed and electron-beam physical vapor deposited thermal barrier coatings, were characterized by high-resolution transmission electron microscopy (TEM). The defect cluster size, distribution, crystallographic and compositional information were investigated using high-resolution TEM lattice imaging, selected area diffraction (SAD), and energy dispersive spectroscopy (EDS) analysis techniques. The results showed that substantial defect clusters were formed in the advanced multi-component rare earth oxide doped zirconia-yttria systems. The size of the oxide defect clusters and the cluster dopant segregation was typically ranging fiom 5 to 50 nm. These multi-component dopant induced defect clusters are an important factor for the coating long-term high temperature stability and excellent performance.
NASA Astrophysics Data System (ADS)
Zhang, Ying; Moges, Semu; Block, Paul
2018-01-01
Prediction of seasonal precipitation can provide actionable information to guide management of various sectoral activities. For instance, it is often translated into hydrological forecasts for better water resources management. However, many studies assume homogeneity in precipitation across an entire study region, which may prove ineffective for operational and local-level decisions, particularly for locations with high spatial variability. This study proposes advancing local-level seasonal precipitation predictions by first conditioning on regional-level predictions, as defined through objective cluster analysis, for western Ethiopia. To our knowledge, this is the first study predicting seasonal precipitation at high resolution in this region, where lives and livelihoods are vulnerable to precipitation variability given the high reliance on rain-fed agriculture and limited water resources infrastructure. The combination of objective cluster analysis, spatially high-resolution prediction of seasonal precipitation, and a modeling structure spanning statistical and dynamical approaches makes clear advances in prediction skill and resolution, as compared with previous studies. The statistical model improves versus the non-clustered case or dynamical models for a number of specific clusters in northwestern Ethiopia, with clusters having regional average correlation and ranked probability skill score (RPSS) values of up to 0.5 and 33 %, respectively. The general skill (after bias correction) of the two best-performing dynamical models over the entire study region is superior to that of the statistical models, although the dynamical models issue predictions at a lower resolution and the raw predictions require bias correction to guarantee comparable skills.
Going beyond Clustering in MD Trajectory Analysis: An Application to Villin Headpiece Folding
Rajan, Aruna; Freddolino, Peter L.; Schulten, Klaus
2010-01-01
Recent advances in computing technology have enabled microsecond long all-atom molecular dynamics (MD) simulations of biological systems. Methods that can distill the salient features of such large trajectories are now urgently needed. Conventional clustering methods used to analyze MD trajectories suffer from various setbacks, namely (i) they are not data driven, (ii) they are unstable to noise and changes in cut-off parameters such as cluster radius and cluster number, and (iii) they do not reduce the dimensionality of the trajectories, and hence are unsuitable for finding collective coordinates. We advocate the application of principal component analysis (PCA) and a non-metric multidimensional scaling (nMDS) method to reduce MD trajectories and overcome the drawbacks of clustering. To illustrate the superiority of nMDS over other methods in reducing data and reproducing salient features, we analyze three complete villin headpiece folding trajectories. Our analysis suggests that the folding process of the villin headpiece is structurally heterogeneous. PMID:20419160
Going beyond clustering in MD trajectory analysis: an application to villin headpiece folding.
Rajan, Aruna; Freddolino, Peter L; Schulten, Klaus
2010-04-15
Recent advances in computing technology have enabled microsecond long all-atom molecular dynamics (MD) simulations of biological systems. Methods that can distill the salient features of such large trajectories are now urgently needed. Conventional clustering methods used to analyze MD trajectories suffer from various setbacks, namely (i) they are not data driven, (ii) they are unstable to noise and changes in cut-off parameters such as cluster radius and cluster number, and (iii) they do not reduce the dimensionality of the trajectories, and hence are unsuitable for finding collective coordinates. We advocate the application of principal component analysis (PCA) and a non-metric multidimensional scaling (nMDS) method to reduce MD trajectories and overcome the drawbacks of clustering. To illustrate the superiority of nMDS over other methods in reducing data and reproducing salient features, we analyze three complete villin headpiece folding trajectories. Our analysis suggests that the folding process of the villin headpiece is structurally heterogeneous.
Carious Dentine Provides a Habitat for a Complex Array of Novel Prevotella-Like Bacteria
Nadkarni, Mangala A.; Caldon, C. Elizabeth; Chhour, Kim-Ly; Fisher, Ilana P.; Martin, F. Elizabeth; Jacques, Nicholas A.; Hunter, Neil
2004-01-01
Previous analysis of the microbiology of advanced caries by culture and real-time PCR emphasized the high incidence and abundance of gram-negative anaerobic species, particularly Prevotella-like bacteria. The diversity of Prevotella-like bacteria was further explored by analyzing pooled bacterial DNA from lesions of carious dentine. This was achieved by amplification of a region of the 16S ribosomal DNA with a Prevotella genus-specific forward primer and a universal bacterial reverse primer, followed by cloning and sequencing. Cultured Prevotella species commonly associated with oral tissues constituted only 12% of the Prevotella clones isolated from advanced carious lesions. The remaining 88% consisted of a diverse range of phylotypes. These included five clusters of previously recognized but uncultured oral Prevotella spp. and a major cluster containing Prevotella-like bacteria most closely related to uncharacterized rumen bacteria. Cluster-specific primers were designed, and the numbers of bacteria within clusters were quantified by real-time PCR, confirming the abundance of these organisms. The data indicated that advanced dental caries provides a unique environment for a complex array of novel and uncultured Prevotella and Prevotella-like bacteria which, in some cases, may dominate the diverse polymicrobial community associated with the disease. PMID:15528720
Rudolf, Jeffrey D.; Yan, Xiaohui; Shen, Ben
2015-01-01
The enediynes are one of the most fascinating families of bacterial natural products given their unprecedented molecular architecture and extraordinary cytotoxicity. Enediynes are rare with only 11 structurally characterized members and four additional members isolated in their cycloaromatized form. Recent advances in DNA sequencing have resulted in an explosion of microbial genomes. A virtual survey of the GenBank and JGI genome databases revealed 87 enediyne biosynthetic gene clusters from 78 bacteria strains, implying enediynes are more common than previously thought. Here we report the construction and analysis of an enediyne genome neighborhood network (GNN) as a high-throughput approach to analyze secondary metabolite gene clusters. Analysis of the enediyne GNN facilitated rapid gene cluster annotation, revealed genetic trends in enediyne biosynthetic gene clusters resulting in a simple prediction scheme to determine 9- vs 10-membered enediyne gene clusters, and supported a genomic-based strain prioritization method for enediyne discovery. PMID:26318027
Characteristic and factors of competitive maritime industry clusters in Indonesia
NASA Astrophysics Data System (ADS)
Marlyana, N.; Tontowi, A. E.; Yuniarto, H. A.
2017-12-01
Indonesia is situated in the strategic position between two oceans therefore is identified as a maritime state. The fact opens big opportunity to build a competitive maritime industry. However, potential factors to boost the competitive maritime industry still need to be explored. The objective of this paper is then to determine the main characteristics and potential factors of competitive maritime industry cluster. Qualitative analysis based on literature review has been carried out in two aspects. First, benchmarking analysis conducted to distinguish the most relevant factors of maritime clusters in several countries in Europe (Norway, Spain, South West of England) and Asia (China, South Korea, Malaysia). Seven key dimensions are used for this benchmarking. Secondly, the competitiveness of maritime clusters in Indonesia was diagnosed through a reconceptualization of Porter’s Diamond model. There were four interlinked of advanced factors in and between companies within clusters, which can be influenced in a proactive way by government.
Hadjithomas, Michalis; Chen, I-Min A.; Chu, Ken; ...
2016-11-29
Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publically available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic genemore » clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hadjithomas, Michalis; Chen, I-Min A.; Chu, Ken
Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publically available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic genemore » clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery.« less
Romay-Tallon, Raquel; Rivera-Baltanas, Tania; Allen, Josh; Olivares, Jose M; Kalynchuk, Lisa E; Caruncho, Hector J
2017-01-01
The pattern of serotonin transporter clustering on the plasma membrane of lymphocytes extracted from human whole blood samples has been identified as a putative biomarker of therapeutic efficacy in major depression. Here we evaluated the possibility of performing a similar analysis using blood smears obtained from rats, and from control human subjects and depression patients. We hypothesized that we could optimize a protocol to make the analysis of serotonin protein clustering in blood smears comparable to the analysis of serotonin protein clustering using isolated lymphocytes. Our data indicate that blood smears require a longer fixation time and longer times of incubation with primary and secondary antibodies. In addition, one needs to optimize the image analysis settings for the analysis of smears. When these steps are followed, the quantitative analysis of both the number and size of serotonin transporter clusters on the plasma membrane of lymphocytes is similar using both blood smears and isolated lymphocytes. The development of this novel protocol will greatly facilitate the collection of appropriate samples by eliminating the necessity and cost of specialized personnel for drawing blood samples, and by being a less invasive procedure. Therefore, this protocol will help us advance the validation of membrane protein clustering in lymphocytes as a biomarker of therapeutic efficacy in major depression, and bring it closer to its clinical application.
Genetic Network Inference: From Co-Expression Clustering to Reverse Engineering
NASA Technical Reports Server (NTRS)
Dhaeseleer, Patrik; Liang, Shoudan; Somogyi, Roland
2000-01-01
Advances in molecular biological, analytical, and computational technologies are enabling us to systematically investigate the complex molecular processes underlying biological systems. In particular, using high-throughput gene expression assays, we are able to measure the output of the gene regulatory network. We aim here to review datamining and modeling approaches for conceptualizing and unraveling the functional relationships implicit in these datasets. Clustering of co-expression profiles allows us to infer shared regulatory inputs and functional pathways. We discuss various aspects of clustering, ranging from distance measures to clustering algorithms and multiple-duster memberships. More advanced analysis aims to infer causal connections between genes directly, i.e., who is regulating whom and how. We discuss several approaches to the problem of reverse engineering of genetic networks, from discrete Boolean networks, to continuous linear and non-linear models. We conclude that the combination of predictive modeling with systematic experimental verification will be required to gain a deeper insight into living organisms, therapeutic targeting, and bioengineering.
Waiting-time distributions of magnetic discontinuities: clustering or Poisson process?
Greco, A; Matthaeus, W H; Servidio, S; Dmitruk, P
2009-10-01
Using solar wind data from the Advanced Composition Explorer spacecraft, with the support of Hall magnetohydrodynamic simulations, the waiting-time distributions of magnetic discontinuities have been analyzed. A possible phenomenon of clusterization of these discontinuities is studied in detail. We perform a local Poisson's analysis in order to establish if these intermittent events are randomly distributed or not. Possible implications about the nature of solar wind discontinuities are discussed.
Waiting-time distributions of magnetic discontinuities: Clustering or Poisson process?
DOE Office of Scientific and Technical Information (OSTI.GOV)
Greco, A.; Matthaeus, W. H.; Servidio, S.
2009-10-15
Using solar wind data from the Advanced Composition Explorer spacecraft, with the support of Hall magnetohydrodynamic simulations, the waiting-time distributions of magnetic discontinuities have been analyzed. A possible phenomenon of clusterization of these discontinuities is studied in detail. We perform a local Poisson's analysis in order to establish if these intermittent events are randomly distributed or not. Possible implications about the nature of solar wind discontinuities are discussed.
Cohen, Mitchell J; Grossman, Adam D; Morabito, Diane; Knudson, M Margaret; Butte, Atul J; Manley, Geoffrey T
2010-01-01
Advances in technology have made extensive monitoring of patient physiology the standard of care in intensive care units (ICUs). While many systems exist to compile these data, there has been no systematic multivariate analysis and categorization across patient physiological data. The sheer volume and complexity of these data make pattern recognition or identification of patient state difficult. Hierarchical cluster analysis allows visualization of high dimensional data and enables pattern recognition and identification of physiologic patient states. We hypothesized that processing of multivariate data using hierarchical clustering techniques would allow identification of otherwise hidden patient physiologic patterns that would be predictive of outcome. Multivariate physiologic and ventilator data were collected continuously using a multimodal bioinformatics system in the surgical ICU at San Francisco General Hospital. These data were incorporated with non-continuous data and stored on a server in the ICU. A hierarchical clustering algorithm grouped each minute of data into 1 of 10 clusters. Clusters were correlated with outcome measures including incidence of infection, multiple organ failure (MOF), and mortality. We identified 10 clusters, which we defined as distinct patient states. While patients transitioned between states, they spent significant amounts of time in each. Clusters were enriched for our outcome measures: 2 of the 10 states were enriched for infection, 6 of 10 were enriched for MOF, and 3 of 10 were enriched for death. Further analysis of correlations between pairs of variables within each cluster reveals significant differences in physiology between clusters. Here we show for the first time the feasibility of clustering physiological measurements to identify clinically relevant patient states after trauma. These results demonstrate that hierarchical clustering techniques can be useful for visualizing complex multivariate data and may provide new insights for the care of critically injured patients.
Kim, Jiyeon; Dick, Jeffrey E; Bard, Allen J
2016-11-15
Metal clusters are very important as building blocks for nanoparticles (NPs) for electrocatalysis and electroanalysis in both fundamental and applied electrochemistry. Attention has been given to understanding of traditional nucleation and growth of metal clusters and to their catalytic activities for various electrochemical applications in energy harvesting as well as analytical sensing. Importantly, understanding the properties of these clusters, primarily the relationship between catalysis and morphology, is required to optimize catalytic function. This has been difficult due to the heterogeneities in the size, shape, and surface properties. Thus, methods that address these issues are necessary to begin understanding the reactivity of individual catalytic centers as opposed to ensemble measurements, where the effect of size and morphology on the catalysis is averaged out in the measurement. This Account introduces our advanced electrochemical approaches to focus on each isolated metal cluster, where we electrochemically fabricated clusters or NPs atom by atom to nanometer by nanometer and explored their electrochemistry for their kinetic and catalytic behavior. Such approaches expand the dimensions of analysis, to include the electrochemistry of (1) a discrete atomic cluster, (2) solely a single NP, or (3) individual NPs in the ensemble sample. Specifically, we studied the electrocatalysis of atomic metal clusters as a nascent electrocatalyst via direct electrodeposition on carbon ultramicroelectrode (C UME) in a femtomolar metal ion precursor. In addition, we developed tunneling ultramicroelectrodes (TUMEs) to study electron transfer (ET) kinetics of a redox probe at a single metal NP electrodeposited on this TUME. Owing to the small dimension of a NP as an active area of a TUME, extremely high mass transfer conditions yielded a remarkably high standard ET rate constant, k 0 , of 36 cm/s for outer-sphere ET reaction. Most recently, we advanced nanoscale scanning electrochemical microscopy (SECM) imaging to resolve the electrocatalytic activity of individual electrodeposited NPs within an ensemble sample yielding consistent high k 0 values of ≥2 cm/s for the hydrogen oxidation reaction (HOR) at different NPs. We envision that our advanced electrochemical approaches will enable us to systematically address structure effects on the catalytic activity, thus providing a quantitative guideline for electrocatalysts in energy-related applications.
Advanced Cyber Attack Modeling Analysis and Visualization
2010-03-01
Graph Analysis Network Web Logs Netflow Data TCP Dump Data System Logs Detect Protect Security Management What-If Figure 8. TVA attack graphs for...Clustered Graphs,” in Proceedings of the Symposium on Graph Drawing, September 1996. [25] K. Lakkaraju, W. Yurcik, A. Lee, “NVisionIP: NetFlow
Topic modeling for cluster analysis of large biological and medical datasets
2014-01-01
Background The big data moniker is nowhere better deserved than to describe the ever-increasing prodigiousness and complexity of biological and medical datasets. New methods are needed to generate and test hypotheses, foster biological interpretation, and build validated predictors. Although multivariate techniques such as cluster analysis may allow researchers to identify groups, or clusters, of related variables, the accuracies and effectiveness of traditional clustering methods diminish for large and hyper dimensional datasets. Topic modeling is an active research field in machine learning and has been mainly used as an analytical tool to structure large textual corpora for data mining. Its ability to reduce high dimensionality to a small number of latent variables makes it suitable as a means for clustering or overcoming clustering difficulties in large biological and medical datasets. Results In this study, three topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, are proposed and tested on the cluster analysis of three large datasets: Salmonella pulsed-field gel electrophoresis (PFGE) dataset, lung cancer dataset, and breast cancer dataset, which represent various types of large biological or medical datasets. All three various methods are shown to improve the efficacy/effectiveness of clustering results on the three datasets in comparison to traditional methods. A preferable cluster analysis method emerged for each of the three datasets on the basis of replicating known biological truths. Conclusion Topic modeling could be advantageously applied to the large datasets of biological or medical research. The three proposed topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, yield clustering improvements for the three different data types. Clusters more efficaciously represent truthful groupings and subgroupings in the data than traditional methods, suggesting that topic model-based methods could provide an analytic advancement in the analysis of large biological or medical datasets. PMID:25350106
Topic modeling for cluster analysis of large biological and medical datasets.
Zhao, Weizhong; Zou, Wen; Chen, James J
2014-01-01
The big data moniker is nowhere better deserved than to describe the ever-increasing prodigiousness and complexity of biological and medical datasets. New methods are needed to generate and test hypotheses, foster biological interpretation, and build validated predictors. Although multivariate techniques such as cluster analysis may allow researchers to identify groups, or clusters, of related variables, the accuracies and effectiveness of traditional clustering methods diminish for large and hyper dimensional datasets. Topic modeling is an active research field in machine learning and has been mainly used as an analytical tool to structure large textual corpora for data mining. Its ability to reduce high dimensionality to a small number of latent variables makes it suitable as a means for clustering or overcoming clustering difficulties in large biological and medical datasets. In this study, three topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, are proposed and tested on the cluster analysis of three large datasets: Salmonella pulsed-field gel electrophoresis (PFGE) dataset, lung cancer dataset, and breast cancer dataset, which represent various types of large biological or medical datasets. All three various methods are shown to improve the efficacy/effectiveness of clustering results on the three datasets in comparison to traditional methods. A preferable cluster analysis method emerged for each of the three datasets on the basis of replicating known biological truths. Topic modeling could be advantageously applied to the large datasets of biological or medical research. The three proposed topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, yield clustering improvements for the three different data types. Clusters more efficaciously represent truthful groupings and subgroupings in the data than traditional methods, suggesting that topic model-based methods could provide an analytic advancement in the analysis of large biological or medical datasets.
Sidebottom, D L; Tran, Tri D
2010-11-01
Dynamic light scattering performed on aqueous solutions of three sugars (glucose, maltose and sucrose) reveal a common pattern of sugar cluster formation with a narrow cluster size distribution. In each case, equilibrium clusters form whose size increases with increasing sugar content in an identical power law manner in advance of a common, critical-like, percolation threshold near 83 wt % sugar. The critical exponent of the power law divergence of the cluster size varies with temperature, increasing with decreasing temperature, due to changes in the strength of the intermolecular hydrogen bond and appears to vanish for temperatures in excess of 90 °C. Detailed analysis of the cluster growth process suggests a two-stage process: an initial cluster phase formed at low volume fractions, ϕ, consisting of noninteracting, monodisperse sugar clusters whose size increases ϕ(1/3) followed by an aggregation stage, active at concentrations above about ϕ=40%, where cluster-cluster contact first occurs.
NASA Astrophysics Data System (ADS)
Atherton, Daniel
Early detection of disease and insect infestation within crops and precise application of pesticides can help reduce potential production losses, reduce environmental risk, and reduce the cost of farming. The goal of this study was the advanced detection of early blight (Alternaria solani) in potato (Solanum tuberosum) plants using hyperspectral remote sensing data captured with a handheld spectroradiometer. Hyperspectral reflectance spectra were captured 10 times over five weeks from plants grown to the vegetative and tuber bulking growth stages. The spectra were analyzed using principal component analysis (PCA), spectral change (ratio) analysis, partial least squares (PLS), cluster analysis, and vegetative indices. PCA successfully distinguished more heavily diseased plants from healthy and minimally diseased plants using two principal components. Spectral change (ratio) analysis provided wavelengths (490-510, 640, 665-670, 690, 740-750, and 935 nm) most sensitive to early blight infection followed by ANOVA results indicating a highly significant difference (p < 0.0001) between disease rating group means. In the majority of the experiments, comparisons of diseased plants with healthy plants using Fisher's LSD revealed more heavily diseased plants were significantly different from healthy plants. PLS analysis demonstrated the feasibility of detecting early blight infected plants, finding four optimal factors for raw spectra with the predictor variation explained ranging from 93.4% to 94.6% and the response variation explained ranging from 42.7% to 64.7%. Cluster analysis successfully distinguished healthy plants from all diseased plants except for the most mildly diseased plants, showing clustering analysis was an effective method for detection of early blight. Analysis of the reflectance spectra using the simple ratio (SR) and the normalized difference vegetative index (NDVI) was effective at differentiating all diseased plants from healthy plants, except for the most mildly diseased plants. Of the analysis methods attempted, cluster analysis and vegetative indices were the most promising. The results show the potential of hyperspectral remote sensing for the detection of early blight in potato plants.
Hadjithomas, Michalis; Chen, I-Min A; Chu, Ken; Huang, Jinghua; Ratner, Anna; Palaniappan, Krishna; Andersen, Evan; Markowitz, Victor; Kyrpides, Nikos C; Ivanova, Natalia N
2017-01-04
Secondary metabolites produced by microbes have diverse biological functions, which makes them a great potential source of biotechnologically relevant compounds with antimicrobial, anti-cancer and other activities. The proteins needed to synthesize these natural products are often encoded by clusters of co-located genes called biosynthetic gene clusters (BCs). In order to advance the exploration of microbial secondary metabolism, we developed the largest publically available database of experimentally verified and predicted BCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc/). Here, we describe an update of IMG-ABC, which includes ClusterScout, a tool for targeted identification of custom biosynthetic gene clusters across 40 000 isolate microbial genomes, and a new search capability to query more than 700 000 BCs from isolate genomes for clusters with similar Pfam composition. Additional features enable fast exploration and analysis of BCs through two new interactive visualization features, a BC function heatmap and a BC similarity network graph. These new tools and features add to the value of IMG-ABC's vast body of BC data, facilitating their in-depth analysis and accelerating secondary metabolite discovery. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
ERIC Educational Resources Information Center
Ullrich-French, Sarah; Cox, Anne E.; Cooper, Brittany Rhoades
2016-01-01
Previous research has used cluster analysis to examine how social physique anxiety (SPA) combines with motivation in physical education. This study utilized a more advanced analytic approach, latent profile analysis (LPA), to identify profiles of SPA and motivation regulations. Students in grades 9-12 (N = 298) completed questionnaires at two time…
Cluster stability in the analysis of mass cytometry data.
Melchiotti, Rossella; Gracio, Filipe; Kordasti, Shahram; Todd, Alan K; de Rinaldis, Emanuele
2017-01-01
Manual gating has been traditionally applied to cytometry data sets to identify cells based on protein expression. The advent of mass cytometry allows for a higher number of proteins to be simultaneously measured on cells, therefore providing a means to define cell clusters in a high dimensional expression space. This enhancement, whilst opening unprecedented opportunities for single cell-level analyses, makes the incremental replacement of manual gating with automated clustering a compelling need. To this aim many methods have been implemented and their successful applications demonstrated in different settings. However, the reproducibility of automatically generated clusters is proving challenging and an analytical framework to distinguish spurious clusters from more stable entities, and presumably more biologically relevant ones, is still missing. One way to estimate cell clusters' stability is the evaluation of their consistent re-occurrence within- and between-algorithms, a metric that is commonly used to evaluate results from gene expression. Herein we report the usage and importance of cluster stability evaluations, when applied to results generated from three popular clustering algorithms - SPADE, FLOCK and PhenoGraph - run on four different data sets. These algorithms were shown to generate clusters with various degrees of statistical stability, many of them being unstable. By comparing the results of automated clustering with manually gated populations, we illustrate how information on cluster stability can assist towards a more rigorous and informed interpretation of clustering results. We also explore the relationships between statistical stability and other properties such as clusters' compactness and isolation, demonstrating that whilst cluster stability is linked to other properties it cannot be reliably predicted by any of them. Our study proposes the introduction of cluster stability as a necessary checkpoint for cluster interpretation and contributes to the construction of a more systematic and standardized analytical framework for the assessment of cytometry clustering results. © 2016 International Society for Advancement of Cytometry. © 2016 International Society for Advancement of Cytometry.
Data processing 1: Advancements in machine analysis of multispectral data
NASA Technical Reports Server (NTRS)
Swain, P. H.
1972-01-01
Multispectral data processing procedures are outlined beginning with the data display process used to accomplish data editing and proceeding through clustering, feature selection criterion for error probability estimation, and sample clustering and sample classification. The effective utilization of large quantities of remote sensing data by formulating a three stage sampling model for evaluation of crop acreage estimates represents an improvement in determining the cost benefit relationship associated with remote sensing technology.
Chia -Hsun Chuang; Pellejero-Ibanez, Marco; Rodriguez-Torres, Sergio; ...
2016-06-26
We analyze the broad-range shape of the monopole and quadrupole correlation functions of the BOSS Data Release 12 (DR12) CMASS and LOWZ galaxy sample to obtain constraints on the Hubble expansion rate H(z), the angular-diameter distance DA(z), the normalised growth rate f(z)σ 8(z), and the physical matter density Ω mh 2. In addition, we adopt wide and flat priors on all model parameters in order to ensure the results are those of a `single-probe' galaxy clustering analysis. We also marginalize over three nuisance terms that account for potential observational systematics affecting the measured monopole. However, such Monte Carlo Markov Chainmore » analysis is computationally expensive for advanced theoretical models, thus we develop a new methodology to speed up our analysis.« less
Symptom clusters in advanced cancer.
Jiménez, Ana; Madero, Rosario; Alonso, Alberto; Martínez-Marín, Virginia; Vilches, Yolanda; Martínez, Beatriz; Feliu, Marta; Díaz, Leyre; Espinosa, Enrique; Feliu, Jaime
2011-07-01
Patients with advanced cancer often experience multiple concurrent symptoms. Few studies have explored symptom clusters (SCs) in this population. The aim of the present study was to explore SCs in advanced cancer, evaluate the characteristics associated with various clusters, and determine their relationship to survival. This study included patients in the palliative care program of the Hospital Universitario La Paz from 2003 to 2005. The Edmonton Symptom Assessment System and a supplement including 13 other symptoms were used to detect symptoms. Principal component analysis was performed to determine symptom relationships and compare SCs with associated parameters. In total, 406 patients were included, 61% men and 39% women. The median age was 66.4 (range 18-95). The most common primaries were gastrointestinal (35%), lung (25%), genitourinary (8%), breast (5%), and head and neck (5%) carcinomas. The following clusters were identified: confusion (cognitive impairment, agitation, urinary incontinence), neuropsychological (anxiety, depression, and insomnia), anorexia-cachexia (anorexia, weight loss, and tiredness), and gastrointestinal (nausea and vomiting). The presence of these SCs was influenced by primary cancer site, gender, age, and performance status. Survival was related to the number of SCs present in a given patient: zero SC, 52 days; one SC, 38 days; two SCs, 23 days; and three to four SCs, 19 days; P < 0.001. Different SCs can be identified in patients with advanced cancer. These SCs are influenced by primary cancer site, gender, age, and Eastern Cooperative Oncology Group performance status, and they can have prognostic value. Copyright © 2011 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
Subgroups of advanced cancer patients clustered by their symptom profiles: quality-of-life outcomes.
Husain, Amna; Myers, Jeff; Selby, Debbie; Thomson, Barbara; Chow, Edward
2011-11-01
Symptom cluster analysis is a new frontier of research in symptom management. This study clustered patients by their symptom profiles to identify subgroups that may be at higher risk for poor quality of life (QOL) and that may, therefore, benefit most from targeted interventions. Longitudinal study of metastatic cancer patients using the Edmonton Symptom Assessment Scale (ESAS). We generated two-, three-, and four-cluster subgroups and examined the relationship of cluster membership with patient outcomes. To address the problem of missing longitudinal data, we developed a novel outcome variable (QualTime) that measures both QOL and time in study. Two hundred and twenty-one patients with a mean Palliative Performance Scale (PPS) of 59.1 were enrolled. The three-cluster model was chosen for further analysis. The low-burden subgroup had all low severity symptom scores. The intermediate subgroup separates from the low-burden group on the "debility" profile of fatigue, drowsiness, appetite, and well-being. The high-burden group separates from the intermediate-burden group on pain, depression, and anxiety. At baseline, PPS (p=0.0003) and cluster membership (p<0.0001) contributed significantly to global QOL. In univariate analysis, cluster membership was related to the longitudinal outcome, QualTime. In a multivariate model, the relationship of PPS to QualTime was still significant (p=0.0002), but subgroup membership was no longer significant (p=0.1009). PPS is a stronger predictor of the longitudinal variable than cluster subgroups; however, cluster subgroups provide a target for clinical interventions that may improve QOL.
2012-03-02
Programs and services to help you start, grow and succeed www.sba.gov U.S. Small Business Administration Your Small Business Resource 1Approved for...Public Release SBA Innovation Clusters RADM Steven G. Smith USN (Ret) Advanced Defense Technology Cluster Coordinator U.S. Small Business ...UNIT NUMBER 7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) U.S. Small Business Administration ,Advanced Defense Technology Cluster,409 3rd St, SW
MASPECTRAS: a platform for management and analysis of proteomics LC-MS/MS data
Hartler, Jürgen; Thallinger, Gerhard G; Stocker, Gernot; Sturn, Alexander; Burkard, Thomas R; Körner, Erik; Rader, Robert; Schmidt, Andreas; Mechtler, Karl; Trajanoski, Zlatko
2007-01-01
Background The advancements of proteomics technologies have led to a rapid increase in the number, size and rate at which datasets are generated. Managing and extracting valuable information from such datasets requires the use of data management platforms and computational approaches. Results We have developed the MAss SPECTRometry Analysis System (MASPECTRAS), a platform for management and analysis of proteomics LC-MS/MS data. MASPECTRAS is based on the Proteome Experimental Data Repository (PEDRo) relational database schema and follows the guidelines of the Proteomics Standards Initiative (PSI). Analysis modules include: 1) import and parsing of the results from the search engines SEQUEST, Mascot, Spectrum Mill, X! Tandem, and OMSSA; 2) peptide validation, 3) clustering of proteins based on Markov Clustering and multiple alignments; and 4) quantification using the Automated Statistical Analysis of Protein Abundance Ratios algorithm (ASAPRatio). The system provides customizable data retrieval and visualization tools, as well as export to PRoteomics IDEntifications public repository (PRIDE). MASPECTRAS is freely available at Conclusion Given the unique features and the flexibility due to the use of standard software technology, our platform represents significant advance and could be of great interest to the proteomics community. PMID:17567892
Incipient fault detection study for advanced spacecraft systems
NASA Technical Reports Server (NTRS)
Milner, G. Martin; Black, Michael C.; Hovenga, J. Mike; Mcclure, Paul F.
1986-01-01
A feasibility study to investigate the application of vibration monitoring to the rotating machinery of planned NASA advanced spacecraft components is described. Factors investigated include: (1) special problems associated with small, high RPM machines; (2) application across multiple component types; (3) microgravity; (4) multiple fault types; (5) eight different analysis techniques including signature analysis, high frequency demodulation, cepstrum, clustering, amplitude analysis, and pattern recognition are compared; and (6) small sample statistical analysis is used to compare performance by computation of probability of detection and false alarm for an ensemble of repeated baseline and faulted tests. Both detection and classification performance are quantified. Vibration monitoring is shown to be an effective means of detecting the most important problem types for small, high RPM fans and pumps typical of those planned for the advanced spacecraft. A preliminary monitoring system design and implementation plan is presented.
Lindsey, Cary R.; Neupane, Ghanashym; Spycher, Nicolas; ...
2018-01-03
Although many Known Geothermal Resource Areas in Oregon and Idaho were identified during the 1970s and 1980s, few were subsequently developed commercially. Because of advances in power plant design and energy conversion efficiency since the 1980s, some previously identified KGRAs may now be economically viable prospects. Unfortunately, available characterization data vary widely in accuracy, precision, and granularity, making assessments problematic. In this paper, we suggest a procedure for comparing test areas against proven resources using Principal Component Analysis and cluster identification. The result is a low-cost tool for evaluating potential exploration targets using uncertain or incomplete data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lindsey, Cary R.; Neupane, Ghanashym; Spycher, Nicolas
Although many Known Geothermal Resource Areas in Oregon and Idaho were identified during the 1970s and 1980s, few were subsequently developed commercially. Because of advances in power plant design and energy conversion efficiency since the 1980s, some previously identified KGRAs may now be economically viable prospects. Unfortunately, available characterization data vary widely in accuracy, precision, and granularity, making assessments problematic. In this paper, we suggest a procedure for comparing test areas against proven resources using Principal Component Analysis and cluster identification. The result is a low-cost tool for evaluating potential exploration targets using uncertain or incomplete data.
Invasive advance of an advantageous mutation: nucleation theory.
O'Malley, Lauren; Basham, James; Yasi, Joseph A; Korniss, G; Allstadt, Andrew; Caraco, Thomas
2006-12-01
For sedentary organisms with localized reproduction, spatially clustered growth drives the invasive advance of a favorable mutation. We model competition between two alleles where recurrent mutation introduces a genotype with a rate of local propagation exceeding the resident's rate. We capture ecologically important properties of the rare invader's stochastic dynamics by assuming discrete individuals and local neighborhood interactions. To understand how individual-level processes may govern population patterns, we invoke the physical theory for nucleation of spatial systems. Nucleation theory discriminates between single-cluster and multi-cluster dynamics. A sufficiently low mutation rate, or a sufficiently small environment, generates single-cluster dynamics, an inherently stochastic process; a favorable mutation advances only if the invader cluster reaches a critical radius. For this mode of invasion, we identify the probability distribution of waiting times until the favored allele advances to competitive dominance, and we ask how the critical cluster size varies as propagation or mortality rates vary. Increasing the mutation rate or system size generates multi-cluster invasion, where spatial averaging produces nearly deterministic global dynamics. For this process, an analytical approximation from nucleation theory, called Avrami's Law, describes the time-dependent behavior of the genotype densities with remarkable accuracy.
Wavelet-based clustering of resting state MRI data in the rat.
Medda, Alessio; Hoffmann, Lukas; Magnuson, Matthew; Thompson, Garth; Pan, Wen-Ju; Keilholz, Shella
2016-01-01
While functional connectivity has typically been calculated over the entire length of the scan (5-10min), interest has been growing in dynamic analysis methods that can detect changes in connectivity on the order of cognitive processes (seconds). Previous work with sliding window correlation has shown that changes in functional connectivity can be observed on these time scales in the awake human and in anesthetized animals. This exciting advance creates a need for improved approaches to characterize dynamic functional networks in the brain. Previous studies were performed using sliding window analysis on regions of interest defined based on anatomy or obtained from traditional steady-state analysis methods. The parcellation of the brain may therefore be suboptimal, and the characteristics of the time-varying connectivity between regions are dependent upon the length of the sliding window chosen. This manuscript describes an algorithm based on wavelet decomposition that allows data-driven clustering of voxels into functional regions based on temporal and spectral properties. Previous work has shown that different networks have characteristic frequency fingerprints, and the use of wavelets ensures that both the frequency and the timing of the BOLD fluctuations are considered during the clustering process. The method was applied to resting state data acquired from anesthetized rats, and the resulting clusters agreed well with known anatomical areas. Clusters were highly reproducible across subjects. Wavelet cross-correlation values between clusters from a single scan were significantly higher than the values from randomly matched clusters that shared no temporal information, indicating that wavelet-based analysis is sensitive to the relationship between areas. Copyright © 2015 Elsevier Inc. All rights reserved.
GLOBULAR CLUSTERS AS CRADLES OF LIFE AND ADVANCED CIVILIZATIONS
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stefano, R. Di; Ray, A., E-mail: rdistefano@cfa.harvard.edu, E-mail: akr@tifr.res.in
2016-08-10
Globular clusters are ancient stellar populations in compact dense ellipsoids. There is no star formation and there are no core-collapse supernovae, but several lines of evidence suggest that globular clusters are rich in planets. If so, and if advanced civilizations can develop there, then the distances between these civilizations and other stars would be far smaller than typical distances between stars in the Galactic disk, facilitating interstellar communication and travel. The potent combination of long-term stability and high stellar densities provides a globular cluster opportunity. Yet the very proximity that promotes interstellar travel also brings danger, as stellar interactions canmore » destroy planetary systems. We find, however, that large portions of many globular clusters are “sweet spots,” where habitable-zone planetary orbits are stable for long times. Globular clusters in our own and other galaxies are, therefore, among the best targets for searches for extraterrestrial intelligence (SETI). We use the Drake equation to compare the likelihood of advanced civilizations in globular clusters to that in the Galactic disk. We also consider free-floating planets, since wide-orbit planets can be ejected to travel through the cluster. Civilizations spawned in globular clusters may be able to establish self-sustaining outposts, reducing the probability that a single catastrophic event will destroy the civilization. Although individual civilizations may follow different evolutionary paths, or even be destroyed, the cluster may continue to host advanced civilizations once a small number have jumped across interstellar space. Civilizations residing in globular clusters could therefore, in a sense, be immortal.« less
Globular Clusters as Cradles of Life and Advanced Civilizations
NASA Astrophysics Data System (ADS)
Di Stefano, R.; Ray, A.
2016-08-01
Globular clusters are ancient stellar populations in compact dense ellipsoids. There is no star formation and there are no core-collapse supernovae, but several lines of evidence suggest that globular clusters are rich in planets. If so, and if advanced civilizations can develop there, then the distances between these civilizations and other stars would be far smaller than typical distances between stars in the Galactic disk, facilitating interstellar communication and travel. The potent combination of long-term stability and high stellar densities provides a globular cluster opportunity. Yet the very proximity that promotes interstellar travel also brings danger, as stellar interactions can destroy planetary systems. We find, however, that large portions of many globular clusters are “sweet spots,” where habitable-zone planetary orbits are stable for long times. Globular clusters in our own and other galaxies are, therefore, among the best targets for searches for extraterrestrial intelligence (SETI). We use the Drake equation to compare the likelihood of advanced civilizations in globular clusters to that in the Galactic disk. We also consider free-floating planets, since wide-orbit planets can be ejected to travel through the cluster. Civilizations spawned in globular clusters may be able to establish self-sustaining outposts, reducing the probability that a single catastrophic event will destroy the civilization. Although individual civilizations may follow different evolutionary paths, or even be destroyed, the cluster may continue to host advanced civilizations once a small number have jumped across interstellar space. Civilizations residing in globular clusters could therefore, in a sense, be immortal.
Classification of different types of beer according to their colour characteristics
NASA Astrophysics Data System (ADS)
Nikolova, Kr T.; Gabrova, R.; Boyadzhiev, D.; Pisanova, E. S.; Ruseva, J.; Yanakiev, D.
2017-01-01
Twenty-two samples from different beers have been investigated in two colour systems - XYZ and SIELab - and have been characterised according to their colour parameters. The goals of the current study were to conduct correlation and discriminant analysis and to find the inner relation between the studied indices. K-means cluster has been used to compare and group the tested types of beer based on their similarity. To apply the K-Cluster analysis it is required that the number of clusters be determined in advance. The variant K = 4 was worked out. The first cluster unified all bright beers, the second one contained samples with fruits, the third one contained samples with addition of lemon, the fourth unified the samples of dark beers. By applying the discriminant analysis it is possible to help selections in the establishment of the type of beer. The proposed model correctly describes the types of beer on the Bulgarian market and it can be used for determining the affiliation of the beer which is not used in obtained model. One sample has been chosen from each cluster and the digital image has been obtained. It confirms the color parameters in the color system XYZ and SIELab. These facts can be used for elaboration for express estimation of beer by color.
Dong, Skye T; Butow, Phyllis N; Agar, Meera; Lovell, Melanie R; Boyle, Frances; Stockler, Martin; Forster, Benjamin C; Tong, Allison
2016-04-01
Managing symptom clusters or multiple concurrent symptoms in patients with advanced cancer remains a clinical challenge. The optimal processes constituting effective management of symptom clusters remain uncertain. To describe the attitudes and strategies of clinicians in managing multiple co-occurring symptoms in patients with advanced cancer. Semistructured interviews were conducted with 48 clinicians (palliative care physicians [n = 10], oncologists [n = 6], general practitioners [n = 6], nurses [n = 12], and allied health providers [n = 14]), purposively recruited from two acute hospitals, two palliative care centers, and four community general practices in Sydney, Australia. Transcripts were analyzed using thematic analysis and adapted grounded theory. Six themes were identified: uncertainty in decision making (inadequacy of scientific evidence, relying on experiential knowledge, and pressure to optimize care); attunement to patient and family (sensitivity to multiple cues, prioritizing individual preferences, addressing psychosocial and physical interactions, and opening Pandora's box); deciphering cause to guide intervention (disaggregating symptoms and interactions, flexibility in assessment, and curtailing investigative intrusiveness); balancing complexities in medical management (trading off side effects, minimizing mismatched goals, and urgency in resolving severe symptoms); fostering hope and empowerment (allaying fear of the unknown, encouraging meaning making, championing patient empowerment, and truth telling); and depending on multidisciplinary expertise (maximizing knowledge exchange, sharing management responsibility, contending with hierarchical tensions, and isolation and discontinuity of care). Management of symptom clusters, as both an art and a science, is currently fraught with uncertainty in decision making. Strengthening multidisciplinary collaboration, continuity of care, more pragmatic planning of clinical trials to address more than one symptom, and training in symptom cluster management are required. Copyright © 2016 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Use of EST-SSR loci flanking regions for phylogenetic analysis of genus Arachis
USDA-ARS?s Scientific Manuscript database
All wild peanut collections in the genus Arachis were assigned to nine taxonomy sections on the bases of cross-compatibility and morphologic character clustering. These nine sections consist of 80 species from the most ancient to the most advanced, providing a diverse genetic resource for phylogenet...
[Visual field progression in glaucoma: cluster analysis].
Bresson-Dumont, H; Hatton, J; Foucher, J; Fonteneau, M
2012-11-01
Visual field progression analysis is one of the key points in glaucoma monitoring, but distinction between true progression and random fluctuation is sometimes difficult. There are several different algorithms but no real consensus for detecting visual field progression. The trend analysis of global indices (MD, sLV) may miss localized deficits or be affected by media opacities. Conversely, point-by-point analysis makes progression difficult to differentiate from physiological variability, particularly when the sensitivity of a point is already low. The goal of our study was to analyse visual field progression with the EyeSuite™ Octopus Perimetry Clusters algorithm in patients with no significant changes in global indices or worsening of the analysis of pointwise linear regression. We analyzed the visual fields of 162 eyes (100 patients - 58 women, 42 men, average age 66.8 ± 10.91) with ocular hypertension or glaucoma. For inclusion, at least six reliable visual fields per eye were required, and the trend analysis (EyeSuite™ Perimetry) of visual field global indices (MD and SLV), could show no significant progression. The analysis of changes in cluster mode was then performed. In a second step, eyes with statistically significant worsening of at least one of their clusters were analyzed point-by-point with the Octopus Field Analysis (OFA). Fifty four eyes (33.33%) had a significant worsening in some clusters, while their global indices remained stable over time. In this group of patients, more advanced glaucoma was present than in stable group (MD 6.41 dB vs. 2.87); 64.82% (35/54) of those eyes in which the clusters progressed, however, had no statistically significant change in the trend analysis by pointwise linear regression. Most software algorithms for analyzing visual field progression are essentially trend analyses of global indices, or point-by-point linear regression. This study shows the potential role of analysis by clusters trend. However, for best results, it is preferable to compare the analyses of several tests in combination with morphologic exam. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Characterizing Suicide in Toronto: An Observational Study and Cluster Analysis
Sinyor, Mark; Schaffer, Ayal; Streiner, David L
2014-01-01
Objective: To determine whether people who have died from suicide in a large epidemiologic sample form clusters based on demographic, clinical, and psychosocial factors. Method: We conducted a coroner’s chart review for 2886 people who died in Toronto, Ontario, from 1998 to 2010, and whose death was ruled as suicide by the Office of the Chief Coroner of Ontario. A cluster analysis using known suicide risk factors was performed to determine whether suicide deaths separate into distinct groups. Clusters were compared according to person- and suicide-specific factors. Results: Five clusters emerged. Cluster 1 had the highest proportion of females and nonviolent methods, and all had depression and a past suicide attempt. Cluster 2 had the highest proportion of people with a recent stressor and violent suicide methods, and all were married. Cluster 3 had mostly males between the ages of 20 and 64, and all had either experienced recent stressors, suffered from mental illness, or had a history of substance abuse. Cluster 4 had the youngest people and the highest proportion of deaths by jumping from height, few were married, and nearly one-half had bipolar disorder or schizophrenia. Cluster 5 had all unmarried people with no prior suicide attempts, and were the least likely to have an identified mental illness and most likely to leave a suicide note. Conclusions: People who die from suicide assort into different patterns of demographic, clinical, and death-specific characteristics. Identifying and studying subgroups of suicides may advance our understanding of the heterogeneous nature of suicide and help to inform development of more targeted suicide prevention strategies. PMID:24444321
NASA Astrophysics Data System (ADS)
Sams, Michael; Silye, Rene; Göhring, Janett; Muresan, Leila; Schilcher, Kurt; Jacak, Jaroslaw
2014-01-01
We present a cluster spatial analysis method using nanoscopic dSTORM images to determine changes in protein cluster distributions within brain tissue. Such methods are suitable to investigate human brain tissue and will help to achieve a deeper understanding of brain disease along with aiding drug development. Human brain tissue samples are usually treated postmortem via standard fixation protocols, which are established in clinical laboratories. Therefore, our localization microscopy-based method was adapted to characterize protein density and protein cluster localization in samples fixed using different protocols followed by common fluorescent immunohistochemistry techniques. The localization microscopy allows nanoscopic mapping of serotonin 5-HT1A receptor groups within a two-dimensional image of a brain tissue slice. These nanoscopically mapped proteins can be confined to clusters by applying the proposed statistical spatial analysis. Selected features of such clusters were subsequently used to characterize and classify the tissue. Samples were obtained from different types of patients, fixed with different preparation methods, and finally stored in a human tissue bank. To verify the proposed method, samples of a cryopreserved healthy brain have been compared with epitope-retrieved and paraffin-fixed tissues. Furthermore, samples of healthy brain tissues were compared with data obtained from patients suffering from mental illnesses (e.g., major depressive disorder). Our work demonstrates the applicability of localization microscopy and image analysis methods for comparison and classification of human brain tissues at a nanoscopic level. Furthermore, the presented workflow marks a unique technological advance in the characterization of protein distributions in brain tissue sections.
Kwekkeboom, Kristine L; Tostrud, Lauren; Costanzo, Erin; Coe, Christopher L; Serlin, Ronald C; Ward, Sandra E; Zhang, Yingzi
2018-05-01
Symptom researchers have proposed a model of inflammatory cytokine activity and dysregulation in cancer to explain co-occurring symptoms including pain, fatigue, and sleep disturbance. We tested the hypothesis that psychological stress accentuates inflammation and that stress and inflammation contribute to one's experience of the pain, fatigue, and sleep disturbance symptom cluster (symptom cluster severity, symptom cluster distress) and its impact (symptom cluster interference with daily life, quality of life). We used baseline data from a symptom cluster management trial. Adult participants (N = 158) receiving chemotherapy for advanced cancer reported pain, fatigue, and sleep disturbance on enrollment. Before intervention, participants completed measures of demographics, perceived stress, symptom cluster severity, symptom cluster distress, symptom cluster interference with daily life, and quality of life and provided a blood sample for four inflammatory biomarkers (interleukin-1β, interleukin-6, tumor necrosis factor-α, and C-reactive protein). Stress was not directly related to any inflammatory biomarker. Stress and tumor necrosis factor-α were positively related to symptom cluster distress, although not symptom cluster severity. Tumor necrosis factor-α was indirectly related to symptom cluster interference with daily life, through its effect on symptom cluster distress. Stress was positively associated with symptom cluster interference with daily life and inversely with quality of life. Stress also had indirect effects on symptom cluster interference with daily life, through its effect on symptom cluster distress. The proposed inflammatory model of symptoms was partially supported. Investigators should test interventions that target stress as a contributing factor in co-occurring pain, fatigue, and sleep disturbance and explore other factors that may influence inflammatory biomarker levels within the context of an advanced cancer diagnosis and treatment. Copyright © 2018 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Schrabback, T.; Applegate, D.; Dietrich, J. P.; Hoekstra, H.; Bocquet, S.; Gonzalez, A. H.; von der Linden, A.; McDonald, M.; Morrison, C. B.; Raihan, S. F.; Allen, S. W.; Bayliss, M.; Benson, B. A.; Bleem, L. E.; Chiu, I.; Desai, S.; Foley, R. J.; de Haan, T.; High, F. W.; Hilbert, S.; Mantz, A. B.; Massey, R.; Mohr, J.; Reichardt, C. L.; Saro, A.; Simon, P.; Stern, C.; Stubbs, C. W.; Zenteno, A.
2018-02-01
We present an HST/Advanced Camera for Surveys (ACS) weak gravitational lensing analysis of 13 massive high-redshift (zmedian = 0.88) galaxy clusters discovered in the South Pole Telescope (SPT) Sunyaev-Zel'dovich Survey. This study is part of a larger campaign that aims to robustly calibrate mass-observable scaling relations over a wide range in redshift to enable improved cosmological constraints from the SPT cluster sample. We introduce new strategies to ensure that systematics in the lensing analysis do not degrade constraints on cluster scaling relations significantly. First, we efficiently remove cluster members from the source sample by selecting very blue galaxies in V - I colour. Our estimate of the source redshift distribution is based on Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS) data, where we carefully mimic the source selection criteria of the cluster fields. We apply a statistical correction for systematic photometric redshift errors as derived from Hubble Ultra Deep Field data and verified through spatial cross-correlations. We account for the impact of lensing magnification on the source redshift distribution, finding that this is particularly relevant for shallower surveys. Finally, we account for biases in the mass modelling caused by miscentring and uncertainties in the concentration-mass relation using simulations. In combination with temperature estimates from Chandra we constrain the normalization of the mass-temperature scaling relation ln (E(z)M500c/1014 M⊙) = A + 1.5ln (kT/7.2 keV) to A=1.81^{+0.24}_{-0.14}(stat.) {± } 0.09(sys.), consistent with self-similar redshift evolution when compared to lower redshift samples. Additionally, the lensing data constrain the average concentration of the clusters to c_200c=5.6^{+3.7}_{-1.8}.
Scholarly Research Program in Fuel Analysis and Combustion Research
1993-02-01
Public reporting burden for this collection of information is es•tmated to average I hour per response, ilnduding the time fo," reviwing ...Thermal Oxidative Flask Test 45 9. Advanced Fuel System Configuration Descent Condition 57 10. TGPGC for n-Alkane Mixture 63 11. Hierarchical Cluster ...will include all analytical data, data analysis conclusions, recommendations and rationale. 16 a& k : 05 Titl: Development of Test Cell Assemblies for
Ankle plantarflexion strength in rearfoot and forefoot runners: a novel clusteranalytic approach.
Liebl, Dominik; Willwacher, Steffen; Hamill, Joseph; Brüggemann, Gert-Peter
2014-06-01
The purpose of the present study was to test for differences in ankle plantarflexion strengths of habitually rearfoot and forefoot runners. In order to approach this issue, we revisit the problem of classifying different footfall patterns in human runners. A dataset of 119 subjects running shod and barefoot (speed 3.5m/s) was analyzed. The footfall patterns were clustered by a novel statistical approach, which is motivated by advances in the statistical literature on functional data analysis. We explain the novel statistical approach in detail and compare it to the classically used strike index of Cavanagh and Lafortune (1980). The two groups found by the new cluster approach are well interpretable as a forefoot and a rearfoot footfall groups. The subsequent comparison study of the clustered subjects reveals that runners with a forefoot footfall pattern are capable of producing significantly higher joint moments in a maximum voluntary contraction (MVC) of their ankle plantarflexor muscles tendon units; difference in means: 0.28Nm/kg. This effect remains significant after controlling for an additional gender effect and for differences in training levels. Our analysis confirms the hypothesis that forefoot runners have a higher mean MVC plantarflexion strength than rearfoot runners. Furthermore, we demonstrate that our proposed stochastic cluster analysis provides a robust and useful framework for clustering foot strikes. Copyright © 2014 Elsevier B.V. All rights reserved.
Large clusters of co-expressed genes in the Drosophila genome.
Boutanaev, Alexander M; Kalmykova, Alla I; Shevelyov, Yuri Y; Nurminsky, Dmitry I
2002-12-12
Clustering of co-expressed, non-homologous genes on chromosomes implies their co-regulation. In lower eukaryotes, co-expressed genes are often found in pairs. Clustering of genes that share aspects of transcriptional regulation has also been reported in higher eukaryotes. To advance our understanding of the mode of coordinated gene regulation in multicellular organisms, we performed a genome-wide analysis of the chromosomal distribution of co-expressed genes in Drosophila. We identified a total of 1,661 testes-specific genes, one-third of which are clustered on chromosomes. The number of clusters of three or more genes is much higher than expected by chance. We observed a similar trend for genes upregulated in the embryo and in the adult head, although the expression pattern of individual genes cannot be predicted on the basis of chromosomal position alone. Our data suggest that the prevalent mechanism of transcriptional co-regulation in higher eukaryotes operates with extensive chromatin domains that comprise multiple genes.
Advances in Statistical Methods for Substance Abuse Prevention Research
MacKinnon, David P.; Lockwood, Chondra M.
2010-01-01
The paper describes advances in statistical methods for prevention research with a particular focus on substance abuse prevention. Standard analysis methods are extended to the typical research designs and characteristics of the data collected in prevention research. Prevention research often includes longitudinal measurement, clustering of data in units such as schools or clinics, missing data, and categorical as well as continuous outcome variables. Statistical methods to handle these features of prevention data are outlined. Developments in mediation, moderation, and implementation analysis allow for the extraction of more detailed information from a prevention study. Advancements in the interpretation of prevention research results include more widespread calculation of effect size and statistical power, the use of confidence intervals as well as hypothesis testing, detailed causal analysis of research findings, and meta-analysis. The increased availability of statistical software has contributed greatly to the use of new methods in prevention research. It is likely that the Internet will continue to stimulate the development and application of new methods. PMID:12940467
Classic fungal natural products in the genomic age: the molecular legacy of Harold Raistrick.
Schor, Raissa; Cox, Russell
2018-03-01
Covering: 1893 to 2017Harold Raistrick was involved in the discovery of many of the most important classes of fungal metabolites during the 20th century. This review focusses on how these discoveries led to developments in isotopic labelling, biomimetic chemistry and the discovery, analysis and exploitation of biosynthetic gene clusters for major classes of fungal metabolites including: alternariol; geodin and metabolites of the emodin pathway; maleidrides; citrinin and the azaphilones; dehydrocurvularin; mycophenolic acid; and the tropolones. Key recent advances in the molecular understanding of these important pathways, including the discovery of biosynthetic gene clusters, the investigation of the molecular and chemical aspects of key biosynthetic steps, and the reengineering of key components of the pathways are reviewed and compared. Finally, discussion of key relationships between metabolites and pathways and the most important recent advances and opportunities for future research directions are given.
Advanced Approach of Multiagent Based Buoy Communication
Gricius, Gediminas; Drungilas, Darius; Dzemydiene, Dale
2015-01-01
Usually, a hydrometeorological information system is faced with great data flows, but the data levels are often excessive, depending on the observed region of the water. The paper presents advanced buoy communication technologies based on multiagent interaction and data exchange between several monitoring system nodes. The proposed management of buoy communication is based on a clustering algorithm, which enables the performance of the hydrometeorological information system to be enhanced. The experiment is based on the design and analysis of the inexpensive but reliable Baltic Sea autonomous monitoring network (buoys), which would be able to continuously monitor and collect temperature, waviness, and other required data. The proposed approach of multiagent based buoy communication enables all the data from the costal-based station to be monitored with limited transition speed by setting different tasks for the agent-based buoy system according to the clustering information. PMID:26345197
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilson, Andrew; Haass, Michael; Rintoul, Mark Daniel
GazeAppraise advances the state of the art of gaze pattern analysis using methods that simultaneously analyze spatial and temporal characteristics of gaze patterns. GazeAppraise enables novel research in visual perception and cognition; for example, using shape features as distinguishing elements to assess individual differences in visual search strategy. Given a set of point-to-point gaze sequences, hereafter referred to as scanpaths, the method constructs multiple descriptive features for each scanpath. Once the scanpath features have been calculated, they are used to form a multidimensional vector representing each scanpath and cluster analysis is performed on the set of vectors from all scanpaths.more » An additional benefit of this method is the identification of causal or correlated characteristics of the stimuli, subjects, and visual task through statistical analysis of descriptive metadata distributions within and across clusters.« less
Computational methods for evaluation of cell-based data assessment--Bioconductor.
Le Meur, Nolwenn
2013-02-01
Recent advances in miniaturization and automation of technologies have enabled cell-based assay high-throughput screening, bringing along new challenges in data analysis. Automation, standardization, reproducibility have become requirements for qualitative research. The Bioconductor community has worked in that direction proposing several R packages to handle high-throughput data including flow cytometry (FCM) experiment. Altogether, these packages cover the main steps of a FCM analysis workflow, that is, data management, quality assessment, normalization, outlier detection, automated gating, cluster labeling, and feature extraction. Additionally, the open-source philosophy of R and Bioconductor, which offers room for new development, continuously drives research and improvement of theses analysis methods, especially in the field of clustering and data mining. This review presents the principal FCM packages currently available in R and Bioconductor, their advantages and their limits. Copyright © 2012 Elsevier Ltd. All rights reserved.
Towards a comprehensive knowledge of the open cluster Haffner 9
NASA Astrophysics Data System (ADS)
Piatti, Andrés E.
2017-03-01
We turn our attention to Haffner 9, a Milky Way open cluster whose previous fundamental parameter estimates are far from being in agreement. In order to provide with accurate estimates, we present high-quality Washington CT1 and Johnson BVI photometry of the cluster field. We put particular care in statistically cleaning the colour-magnitude diagrams (CMDs) from field star contamination, which was found a common source in previous works for the discordant fundamental parameter estimates. The resulting cluster CMD fiducial features were confirmed from a proper motion membership analysis. Haffner 9 is a moderately young object (age ∼350 Myr), placed in the Perseus arm - at a heliocentric distance of ∼3.2 kpc - , with a lower limit for its present mass of ∼160 M⊙ and of nearly metal solar content. The combination of the cluster structural and fundamental parameters suggest that it is in an advanced stage of internal dynamical evolution, possibly in the phase typical of those with mass segregation in their core regions. However, the cluster still keeps its mass function close to that of the Salpeter's law.
Py, Béatrice; Barras, Frédéric
2015-06-01
Since their discovery in the 50's, Fe-S cluster proteins have attracted much attention from chemists, biophysicists and biochemists. However, in the 80's they were joined by geneticists who helped to realize that in vivo maturation of Fe-S cluster bound proteins required assistance of a large number of factors defining complex multi-step pathways. The question of how clusters are formed and distributed in vivo has since been the focus of much effort. Here we review how genetics in discovering genes and investigating processes as they unfold in vivo has provoked seminal advances toward our understanding of Fe-S cluster biogenesis. The power and limitations of genetic approaches are discussed. As a final comment, we argue how the marriage of classic strategies and new high-throughput technologies should allow genetics of Fe-S cluster biology to be even more insightful in the future. This article is part of a Special Issue entitled: Fe/S proteins: Analysis, structure, function, biogenesis and diseases. Copyright © 2015 Elsevier B.V. All rights reserved.
Concept design and cluster control of advanced space connectable intelligent microsatellite
NASA Astrophysics Data System (ADS)
Wang, Xiaohui; Li, Shuang; She, Yuchen
2017-12-01
In this note, a new type of advanced space connectable intelligent microsatellite is presented to extend the range of potential application of microsatellite and improve the efficiency of cooperation. First, the overall concept of the micro satellite cluster is described, which is characterized by autonomously connecting with each other and being able to realize relative rotation through the external interfaces. Second, the multi-satellite autonomous assembly algorithm and control algorithm of the cluster motion are developed to make the cluster system combine into a variety of configurations in order to achieve different types of functionality. Finally, the design of the satellite cluster system is proposed, and the possible applications are discussed.
Word-initial rhotic clusters in typically developing children: European Portuguese.
Ramalho, Ana Margarida; Freitas, M João
2018-01-01
Rhotic clusters are complex structures segmentally and prosodically and are frequently one of the last structures acquired by Portuguese-speaking children. This paper describes cross-sectional data for word-initial (WI) rhotic tap clusters in typically developing 3-4- and 5-year-olds in Portugal. Additional information is provided on WI /l/ as a singleton and in clusters. A native speaker audio-recorded and transcribed single words in a story-telling task. Results for WI rhotic clusters show an age effect consistent with previous research on European Portuguese. Singleton /l/ was in advance of /l/-clusters as expected, but the tap clusters were in advance of the /l/-clusters, possibly reflecting the velarized characteristics of the lateral. The prosodic variables word stress and word length were relevant for the WI rhotic clusters: shorter words and stressed syllables showed higher accuracy. Finally, mismatches ('errors') mainly reflected negative structural constraints (deletion of C2 and epenthesis) rather than segmental constraints (substitutions).
SECIMTools: a suite of metabolomics data analysis tools.
Kirpich, Alexander S; Ibarra, Miguel; Moskalenko, Oleksandr; Fear, Justin M; Gerken, Joseph; Mi, Xinlei; Ashrafi, Ali; Morse, Alison M; McIntyre, Lauren M
2018-04-20
Metabolomics has the promise to transform the area of personalized medicine with the rapid development of high throughput technology for untargeted analysis of metabolites. Open access, easy to use, analytic tools that are broadly accessible to the biological community need to be developed. While technology used in metabolomics varies, most metabolomics studies have a set of features identified. Galaxy is an open access platform that enables scientists at all levels to interact with big data. Galaxy promotes reproducibility by saving histories and enabling the sharing workflows among scientists. SECIMTools (SouthEast Center for Integrated Metabolomics) is a set of Python applications that are available both as standalone tools and wrapped for use in Galaxy. The suite includes a comprehensive set of quality control metrics (retention time window evaluation and various peak evaluation tools), visualization techniques (hierarchical cluster heatmap, principal component analysis, modular modularity clustering), basic statistical analysis methods (partial least squares - discriminant analysis, analysis of variance, t-test, Kruskal-Wallis non-parametric test), advanced classification methods (random forest, support vector machines), and advanced variable selection tools (least absolute shrinkage and selection operator LASSO and Elastic Net). SECIMTools leverages the Galaxy platform and enables integrated workflows for metabolomics data analysis made from building blocks designed for easy use and interpretability. Standard data formats and a set of utilities allow arbitrary linkages between tools to encourage novel workflow designs. The Galaxy framework enables future data integration for metabolomics studies with other omics data.
Pre-crash scenarios at road junctions: A clustering method for car crash data.
Nitsche, Philippe; Thomas, Pete; Stuetz, Rainer; Welsh, Ruth
2017-10-01
Given the recent advancements in autonomous driving functions, one of the main challenges is safe and efficient operation in complex traffic situations such as road junctions. There is a need for comprehensive testing, either in virtual simulation environments or on real-world test tracks. This paper presents a novel data analysis method including the preparation, analysis and visualization of car crash data, to identify the critical pre-crash scenarios at T- and four-legged junctions as a basis for testing the safety of automated driving systems. The presented method employs k-medoids to cluster historical junction crash data into distinct partitions and then applies the association rules algorithm to each cluster to specify the driving scenarios in more detail. The dataset used consists of 1056 junction crashes in the UK, which were exported from the in-depth "On-the-Spot" database. The study resulted in thirteen crash clusters for T-junctions, and six crash clusters for crossroads. Association rules revealed common crash characteristics, which were the basis for the scenario descriptions. The results support existing findings on road junction accidents and provide benchmark situations for safety performance tests in order to reduce the possible number parameter combinations. Copyright © 2017 Elsevier Ltd. All rights reserved.
Supra-galactic colour patterns in globular cluster systems
NASA Astrophysics Data System (ADS)
Forte, Juan C.
2017-07-01
An analysis of globular cluster systems associated with galaxies included in the Virgo and Fornax Hubble Space Telescope-Advanced Camera Surveys reveals distinct (g - z) colour modulation patterns. These features appear on composite samples of globular clusters and, most evidently, in galaxies with absolute magnitudes Mg in the range from -20.2 to -19.2. These colour modulations are also detectable on some samples of globular clusters in the central galaxies NGC 1399 and NGC 4486 (and confirmed on data sets obtained with different instruments and photometric systems), as well as in other bright galaxies in these clusters. After discarding field contamination, photometric errors and statistical effects, we conclude that these supra-galactic colour patterns are real and reflect some previously unknown characteristic. These features suggest that the globular cluster formation process was not entirely stochastic but included a fraction of clusters that formed in a rather synchronized fashion over large spatial scales, and in a tentative time lapse of about 1.5 Gy at redshifts z between 2 and 4. We speculate that the putative mechanism leading to that synchronism may be associated with large scale feedback effects connected with violent star-forming events and/or with supermassive black holes.
NASA Astrophysics Data System (ADS)
Lilly, T.; Fritze-v. Alvensleben, U.; de Grijs, R.
2005-05-01
We present mathematically advanced tools for the determination of age, metallicity, and mass of old Globular Clusters (CGs) using both broad-band colors and spectral indices, and we present their application to the Globular Cluster Systems (GCSs) of elliptical galaxies. Since one of the most intriguing questions of today's astronomy aims at the evolutionary connection between (young) violently interacting galaxies at high-redshift and the (old) elliptical galaxies we observe nearby, it is necessary to reveal the possibly violent star-formation history of these old galaxies. By means of evolutionary synthesis models, we can show that, using the integrated light of a galaxy's (composite) stellar content alone, it is impossible to date (and, actually, to identify) even very strong starbursts if these events took place more than two or three Gyr ago. However, since large and violent starbursts are associated with the formation of GCs, GCSs are very good tracers of the most violent starburst events in the history of their host galaxies. Using our well-established Göttingen SED (Spectral Energy Distribution) analysis tool, we can reveal the age, metallicity, mass (and possibly extinction) of GCs by comparing the observations with an extensive grid of SSP model colors. This is done in a statistically advanced and reasonable way, including their 1 σ uncertainties. However, since for all colors the evolution slows down considerably at ages older than about 8 Gyr, even with several passbands and a long wavelength base line, the results are severely uncertain for old clusters. Therefore, we incorporated empirical calibrations for Lick indices in our models and developed a Lick indices analysis tool that works in the same way as the SED analysis tool described above. We compare the theoretical possibilities and limitations of both methods as well as their results for the example of the cD galaxy NGC 1399, for which both multi-color observations and, for a subsample of clusters, spectral indices are available, and address implications for the nature and origin of the observed bimodal color distribution.
ERIC Educational Resources Information Center
Smeltz, LeRoy C.
To identify, rate, and cluster groups of competencies and occupational titles at entry and advance levels for occupations in five food products commodity areas, data were collected by interviews with personnel managers in 25 Pennsylvania food processing plants. Some findings were: (1) There were meaningful competency factor and occupational title…
Chang, Ni-Bin; Wimberly, Brent; Xuan, Zhemin
2012-03-01
This study presents an integrated k-means clustering and gravity model (IKCGM) for investigating the spatiotemporal patterns of nutrient and associated dissolved oxygen levels in Tampa Bay, Florida. By using a k-means clustering analysis to first partition the nutrient data into a user-specified number of subsets, it is possible to discover the spatiotemporal patterns of nutrient distribution in the bay and capture the inherent linkages of hydrodynamic and biogeochemical features. Such patterns may then be combined with a gravity model to link the nutrient source contribution from each coastal watershed to the generated clusters in the bay to aid in the source proportion analysis for environmental management. The clustering analysis was carried out based on 1 year (2008) water quality data composed of 55 sample stations throughout Tampa Bay collected by the Environmental Protection Commission of Hillsborough County. In addition, hydrological and river water quality data of the same year were acquired from the United States Geological Survey's National Water Information System to support the gravity modeling analysis. The results show that the k-means model with 8 clusters is the optimal choice, in which cluster 2 at Lower Tampa Bay had the minimum values of total nitrogen (TN) concentrations, chlorophyll a (Chl-a) concentrations, and ocean color values in every season as well as the minimum concentration of total phosphorus (TP) in three consecutive seasons in 2008. The datasets indicate that Lower Tampa Bay is an area with limited nutrient input throughout the year. Cluster 5, located in Middle Tampa Bay, displayed elevated TN concentrations, ocean color values, and Chl-a concentrations, suggesting that high values of colored dissolved organic matter are linked with some nutrient sources. The data presented by the gravity modeling analysis indicate that the Alafia River Basin is the major contributor of nutrients in terms of both TP and TN values in all seasons. With this new integration, improvements for environmental monitoring and assessment were achieved to advance our understanding of sea-land interactions and nutrient cycling in a critical coastal bay, the Gulf of Mexico. This journal is © The Royal Society of Chemistry 2012
Cluster ensemble based on Random Forests for genetic data.
Alhusain, Luluah; Hafez, Alaaeldin M
2017-01-01
Clustering plays a crucial role in several application domains, such as bioinformatics. In bioinformatics, clustering has been extensively used as an approach for detecting interesting patterns in genetic data. One application is population structure analysis, which aims to group individuals into subpopulations based on shared genetic variations, such as single nucleotide polymorphisms. Advances in DNA sequencing technology have facilitated the obtainment of genetic datasets with exceptional sizes. Genetic data usually contain hundreds of thousands of genetic markers genotyped for thousands of individuals, making an efficient means for handling such data desirable. Random Forests (RFs) has emerged as an efficient algorithm capable of handling high-dimensional data. RFs provides a proximity measure that can capture different levels of co-occurring relationships between variables. RFs has been widely considered a supervised learning method, although it can be converted into an unsupervised learning method. Therefore, RF-derived proximity measure combined with a clustering technique may be well suited for determining the underlying structure of unlabeled data. This paper proposes, RFcluE, a cluster ensemble approach for determining the underlying structure of genetic data based on RFs. The approach comprises a cluster ensemble framework to combine multiple runs of RF clustering. Experiments were conducted on high-dimensional, real genetic dataset to evaluate the proposed approach. The experiments included an examination of the impact of parameter changes, comparing RFcluE performance against other clustering methods, and an assessment of the relationship between the diversity and quality of the ensemble and its effect on RFcluE performance. This paper proposes, RFcluE, a cluster ensemble approach based on RF clustering to address the problem of population structure analysis and demonstrate the effectiveness of the approach. The paper also illustrates that applying a cluster ensemble approach, combining multiple RF clusterings, produces more robust and higher-quality results as a consequence of feeding the ensemble with diverse views of high-dimensional genetic data obtained through bagging and random subspace, the two key features of the RF algorithm.
Advanced analysis of forest fire clustering
NASA Astrophysics Data System (ADS)
Kanevski, Mikhail; Pereira, Mario; Golay, Jean
2017-04-01
Analysis of point pattern clustering is an important topic in spatial statistics and for many applications: biodiversity, epidemiology, natural hazards, geomarketing, etc. There are several fundamental approaches used to quantify spatial data clustering using topological, statistical and fractal measures. In the present research, the recently introduced multi-point Morisita index (mMI) is applied to study the spatial clustering of forest fires in Portugal. The data set consists of more than 30000 fire events covering the time period from 1975 to 2013. The distribution of forest fires is very complex and highly variable in space. mMI is a multi-point extension of the classical two-point Morisita index. In essence, mMI is estimated by covering the region under study by a grid and by computing how many times more likely it is that m points selected at random will be from the same grid cell than it would be in the case of a complete random Poisson process. By changing the number of grid cells (size of the grid cells), mMI characterizes the scaling properties of spatial clustering. From mMI, the data intrinsic dimension (fractal dimension) of the point distribution can be estimated as well. In this study, the mMI of forest fires is compared with the mMI of random patterns (RPs) generated within the validity domain defined as the forest area of Portugal. It turns out that the forest fires are highly clustered inside the validity domain in comparison with the RPs. Moreover, they demonstrate different scaling properties at different spatial scales. The results obtained from the mMI analysis are also compared with those of fractal measures of clustering - box counting and sand box counting approaches. REFERENCES Golay J., Kanevski M., Vega Orozco C., Leuenberger M., 2014: The multipoint Morisita index for the analysis of spatial patterns. Physica A, 406, 191-202. Golay J., Kanevski M. 2015: A new estimator of intrinsic dimension based on the multipoint Morisita index. Pattern Recognition, 48, 4070-4081.
Ramón, M; Martínez-Pastor, F
2018-04-23
Computer-aided sperm analysis (CASA) produces a wealth of data that is frequently ignored. The use of multiparametric statistical methods can help explore these datasets, unveiling the subpopulation structure of sperm samples. In this review we analyse the significance of the internal heterogeneity of sperm samples and its relevance. We also provide a brief description of the statistical tools used for extracting sperm subpopulations from the datasets, namely unsupervised clustering (with non-hierarchical, hierarchical and two-step methods) and the most advanced supervised methods, based on machine learning. The former method has allowed exploration of subpopulation patterns in many species, whereas the latter offering further possibilities, especially considering functional studies and the practical use of subpopulation analysis. We also consider novel approaches, such as the use of geometric morphometrics or imaging flow cytometry. Finally, although the data provided by CASA systems provides valuable information on sperm samples by applying clustering analyses, there are several caveats. Protocols for capturing and analysing motility or morphometry should be standardised and adapted to each experiment, and the algorithms should be open in order to allow comparison of results between laboratories. Moreover, we must be aware of new technology that could change the paradigm for studying sperm motility and morphology.
The bacterial species definition in the genomic era
Konstantinidis, Konstantinos T; Ramette, Alban; Tiedje, James M
2006-01-01
The bacterial species definition, despite its eminent practical significance for identification, diagnosis, quarantine and diversity surveys, remains a very difficult issue to advance. Genomics now offers novel insights into intra-species diversity and the potential for emergence of a more soundly based system. Although we share the excitement, we argue that it is premature for a universal change to the definition because current knowledge is based on too few phylogenetic groups and too few samples of natural populations. Our analysis of five important bacterial groups suggests, however, that more stringent standards for species may be justifiable when a solid understanding of gene content and ecological distinctiveness becomes available. Our analysis also reveals what is actually encompassed in a species according to the current standards, in terms of whole-genome sequence and gene-content diversity, and shows that this does not correspond to coherent clusters for the environmental Burkholderia and Shewanella genera examined. In contrast, the obligatory pathogens, which have a very restricted ecological niche, do exhibit clusters. Therefore, the idea of biologically meaningful clusters of diversity that applies to most eukaryotes may not be universally applicable in the microbial world, or if such clusters exist, they may be found at different levels of distinction. PMID:17062412
Jones, Edgar; Hodgins-Vermaas, Robert; McCartney, Helen; Everitt, Brian; Beech, Charlotte; Poynter, Denise; Palmer, Ian; Hyams, Kenneth; Wessely, Simon
2002-02-09
To discover whether post-combat syndromes have existed after modern wars and what relation they bear to each other. Review of medical and military records of servicemen and cluster analysis of symptoms. Records for 1856 veterans randomly selected from war pension files awarded from 1872 and from the Medical Assessment Programme for Gulf war veterans. Characteristic patterns of symptom clusters and their relation to dependent variables including war, diagnosis, predisposing physical illness, and exposure to combat; and servicemen's changing attributions for post-combat disorders. Three varieties of post-combat disorder were identified-a debility syndrome (associated with the 19th and early 20th centuries), somatic syndrome (related primarily to the first world war), and a neuropsychiatric syndrome (associated with the second world war and the Gulf conflict). The era in which the war occurred was overwhelmingly the best predictor of cluster membership. All modern wars have been associated with a syndrome characterised by unexplained medical symptoms. The form that these assume, the terms used to describe them, and the explanations offered by servicemen and doctors seem to be influenced by advances in medical science, changes in the nature of warfare, and underlying cultural forces.
Miaskowski, Christine; Barsevick, Andrea; Berger, Ann; Casagrande, Rocco; Grady, Patricia A.; Jacobsen, Paul; Kutner, Jean; Patrick, Donald; Zimmerman, Lani; Xiao, Canhua; Matocha, Martha; Marden, Sue
2017-01-01
An overview of proceedings, findings, and recommendations from the workshop on “Advancing Symptom Science Through Symptom Cluster Research” sponsored by the National Institute of Nursing Research (NINR) and the Office of Rare Diseases Research, National Center for Advancing Translational Sciences, is presented. This workshop engaged an expert panel in an evidenced-based discussion regarding the state of the science of symptom clusters in chronic conditions including cancer and other rare diseases. An interdisciplinary working group from the extramural research community representing nursing, medicine, oncology, psychology, and bioinformatics was convened at the National Institutes of Health. Based on expertise, members were divided into teams to address key areas: defining characteristics of symptom clusters, priority symptom clusters and underlying mechanisms, measurement issues, targeted interventions, and new analytic strategies. For each area, the evidence was synthesized, limitations and gaps identified, and recommendations for future research delineated. The majority of findings in each area were from studies of oncology patients. However, increasing evidence suggests that symptom clusters occur in patients with other chronic conditions (eg, pulmonary, cardiac, and end-stage renal disease). Nonetheless, symptom cluster research is extremely limited and scientists are just beginning to understand how to investigate symptom clusters by developing frameworks and new methods and approaches. With a focus on personalized care, an understanding of individual susceptibility to symptoms and whether a “driving” symptom exists that triggers other symptoms in the cluster is needed. Also, research aimed at identifying the mechanisms that underlie symptom clusters is essential to developing targeted interventions. PMID:28119347
Jones, Edgar; Hodgins-Vermaas, Robert; McCartney, Helen; Everitt, Brian; Beech, Charlotte; Poynter, Denise; Palmer, Ian; Hyams, Kenneth; Wessely, Simon
2002-01-01
Objectives To discover whether post-combat syndromes have existed after modern wars and what relation they bear to each other. Design Review of medical and military records of servicemen and cluster analysis of symptoms. Data sources Records for 1856 veterans randomly selected from war pension files awarded from 1872 and from the Medical Assessment Programme for Gulf war veterans. Main outcome measures Characteristic patterns of symptom clusters and their relation to dependent variables including war, diagnosis, predisposing physical illness, and exposure to combat; and servicemen's changing attributions for post-combat disorders. Results Three varieties of post-combat disorder were identified—a debility syndrome (associated with the 19th and early 20th centuries), somatic syndrome (related primarily to the first world war), and a neuropsychiatric syndrome (associated with the second world war and the Gulf conflict). The era in which the war occurred was overwhelmingly the best predictor of cluster membership. Conclusions All modern wars have been associated with a syndrome characterised by unexplained medical symptoms. The form that these assume, the terms used to describe them, and the explanations offered by servicemen and doctors seem to be influenced by advances in medical science, changes in the nature of warfare, and underlying cultural forces. What is already known on this topicService in the Gulf war is associated with an increased rate of reported symptoms and worsening subjective healthPost-combat syndromes have been described after most modern conflicts from the US civil war onwardsWhat this study addsThere seems to be no single post-combat syndrome but a number of variations on a themeThe ever changing form of post-combat syndromes seems to be related to advances in medical understanding, the developing nature of warfare, and cultural undercurrentsBecause reported symptoms are subject to bias and changing emphasis related to advances in medical science or the discovery of new diseases, the characterisation of individual syndromes has to be treated with cautionAttributions by servicemen are generally consistent with symptom characteristics, though there seems to be a growing reluctance to consider the stress of military service as a cause PMID:11834557
Seizure clusters: characteristics and treatment.
Haut, Sheryl R
2015-04-01
Many patients with epilepsy experience 'clusters' or flurries of seizures, also termed acute repetitive seizures (ARS). Seizure clustering has a significant impact on health and quality of life. This review summarizes recent advances in the definition and neurophysiologic understanding of clustering, the epidemiology and risk factors for clustering and both inpatient and outpatient clinical implications. New treatments for seizure clustering/ARS are perhaps the area of greatest recent progress. Efforts have focused on creating a uniform definition of a seizure cluster. In neurophysiologic studies of refractory epilepsy, seizures within a cluster appear to be self-triggering. Clinical progress has been achieved towards a more precise prevalence of clustering, and consensus guidelines for epilepsy monitoring unit safety. The greatest recent advances are in the study of nonintravenous route of benzodiazepines as rescue medications for seizure clusters/ARS. Rectal benzodiazepines have been very effective but barriers to use exist. New data on buccal, intramuscular and intranasal preparations are anticipated to lead to a greater number of approved treatments. Progesterone may be effective for women who experience catamenial clusters. Seizure clustering is common, particularly in the setting of medically refractory epilepsy. Clustering worsens health and quality of life, and the field requires greater focus on clarifying of definition and clinical implications. Progress towards the development of nonintravenous routes of benzodiazepines has the potential to improve care in this area.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schrabback, T.; Applegate, D.; Dietrich, J. P.
Here we present an HST/Advanced Camera for Surveys (ACS) weak gravitational lensing analysis of 13 massive high-redshift (z median = 0.88) galaxy clusters discovered in the South Pole Telescope (SPT) Sunyaev–Zel'dovich Survey. This study is part of a larger campaign that aims to robustly calibrate mass–observable scaling relations over a wide range in redshift to enable improved cosmological constraints from the SPT cluster sample. We introduce new strategies to ensure that systematics in the lensing analysis do not degrade constraints on cluster scaling relations significantly. First, we efficiently remove cluster members from the source sample by selecting very blue galaxies in V-I colour. Our estimate of the source redshift distribution is based on Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS) data, where we carefully mimic the source selection criteria of the cluster fields. We apply a statistical correction for systematic photometric redshift errors as derived from Hubble Ultra Deep Field data and verified through spatial cross-correlations. We account for the impact of lensing magnification on the source redshift distribution, finding that this is particularly relevant for shallower surveys. Finally, we account for biases in the mass modelling caused by miscentring and uncertainties in the concentration–mass relation using simulations. In combination with temperature estimates from Chandra we constrain the normalization of the mass–temperature scaling relation ln (E(z)M 500c/10 14 M ⊙) = A + 1.5ln (kT/7.2 keV) to A=1.81more » $$+0.24\\atop{-0.14}$$(stat.)±0.09(sys.), consistent with self-similar redshift evolution when compared to lower redshift samples. Additionally, the lensing data constrain the average concentration of the clusters to c 200c=5.6$$+3.7\\atop{-1.8}$$.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schrabback, T.; Applegate, D.; Dietrich, J. P.
We present an HST/Advanced Camera for Surveys (ACS) weak gravitational lensing analysis of 13 massive high-redshift (z(median) = 0.88) galaxy clusters discovered in the South Pole Telescope (SPT) Sunyaev-Zel'dovich Survey. This study is part of a larger campaign that aims to robustly calibrate mass-observable scaling relations over a wide range in redshift to enable improved cosmological constraints from the SPT cluster sample. We introduce new strategies to ensure that systematics in the lensing analysis do not degrade constraints on cluster scaling relations significantly. First, we efficiently remove cluster members from the source sample by selecting very blue galaxies in Vmore » - I colour. Our estimate of the source redshift distribution is based on Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS) data, where we carefully mimic the source selection criteria of the cluster fields. We apply a statistical correction for systematic photometric redshift errors as derived from Hubble Ultra Deep Field data and verified through spatial cross-correlations. We account for the impact of lensing magnification on the source redshift distribution, finding that this is particularly relevant for shallower surveys. Finally, we account for biases in the mass modelling caused by miscentring and uncertainties in the concentration-mass relation using simulations. In combination with temperature estimates from Chandra we constrain the normalization of the mass-temperature scaling relation ln (E(z) M-500c/10(14)M(circle dot)) = A + 1.5ln (kT/7.2 keV) to A = 1.81(-0.14)(+0.24)(stat.)+/- 0.09(sys.), consistent with self-similar redshift evolution when compared to lower redshift samples. Additionally, the lensing data constrain the average concentration of the clusters to c(200c) = 5.6(-1.8)(+3.7).« less
Schrabback, T.; Applegate, D.; Dietrich, J. P.; ...
2017-10-14
Here we present an HST/Advanced Camera for Surveys (ACS) weak gravitational lensing analysis of 13 massive high-redshift (z median = 0.88) galaxy clusters discovered in the South Pole Telescope (SPT) Sunyaev–Zel'dovich Survey. This study is part of a larger campaign that aims to robustly calibrate mass–observable scaling relations over a wide range in redshift to enable improved cosmological constraints from the SPT cluster sample. We introduce new strategies to ensure that systematics in the lensing analysis do not degrade constraints on cluster scaling relations significantly. First, we efficiently remove cluster members from the source sample by selecting very blue galaxies in V-I colour. Our estimate of the source redshift distribution is based on Cosmic Assembly Near-infrared Deep Extragalactic Legacy Survey (CANDELS) data, where we carefully mimic the source selection criteria of the cluster fields. We apply a statistical correction for systematic photometric redshift errors as derived from Hubble Ultra Deep Field data and verified through spatial cross-correlations. We account for the impact of lensing magnification on the source redshift distribution, finding that this is particularly relevant for shallower surveys. Finally, we account for biases in the mass modelling caused by miscentring and uncertainties in the concentration–mass relation using simulations. In combination with temperature estimates from Chandra we constrain the normalization of the mass–temperature scaling relation ln (E(z)M 500c/10 14 M ⊙) = A + 1.5ln (kT/7.2 keV) to A=1.81more » $$+0.24\\atop{-0.14}$$(stat.)±0.09(sys.), consistent with self-similar redshift evolution when compared to lower redshift samples. Additionally, the lensing data constrain the average concentration of the clusters to c 200c=5.6$$+3.7\\atop{-1.8}$$.« less
PCA based clustering for brain tumor segmentation of T1w MRI images.
Kaya, Irem Ersöz; Pehlivanlı, Ayça Çakmak; Sekizkardeş, Emine Gezmez; Ibrikci, Turgay
2017-03-01
Medical images are huge collections of information that are difficult to store and process consuming extensive computing time. Therefore, the reduction techniques are commonly used as a data pre-processing step to make the image data less complex so that a high-dimensional data can be identified by an appropriate low-dimensional representation. PCA is one of the most popular multivariate methods for data reduction. This paper is focused on T1-weighted MRI images clustering for brain tumor segmentation with dimension reduction by different common Principle Component Analysis (PCA) algorithms. Our primary aim is to present a comparison between different variations of PCA algorithms on MRIs for two cluster methods. Five most common PCA algorithms; namely the conventional PCA, Probabilistic Principal Component Analysis (PPCA), Expectation Maximization Based Principal Component Analysis (EM-PCA), Generalize Hebbian Algorithm (GHA), and Adaptive Principal Component Extraction (APEX) were applied to reduce dimensionality in advance of two clustering algorithms, K-Means and Fuzzy C-Means. In the study, the T1-weighted MRI images of the human brain with brain tumor were used for clustering. In addition to the original size of 512 lines and 512 pixels per line, three more different sizes, 256 × 256, 128 × 128 and 64 × 64, were included in the study to examine their effect on the methods. The obtained results were compared in terms of both the reconstruction errors and the Euclidean distance errors among the clustered images containing the same number of principle components. According to the findings, the PPCA obtained the best results among all others. Furthermore, the EM-PCA and the PPCA assisted K-Means algorithm to accomplish the best clustering performance in the majority as well as achieving significant results with both clustering algorithms for all size of T1w MRI images. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Mastrangelo, F; Sberna, M T; Tettamanti, L; Cantatore, G; Tagliabue, A; Gherlone, E
2016-01-01
Vascular Endothelia Growth Factor (VEGF) and Nitric Oxide Synthase (NOS) expression, were evaluated in human tooth germs at two different stages of embryogenesis, to clarify the role of angiogenesis during tooth tissue differentiation and growth. Seventy-two third molar germ specimens were selected during oral surgery. Thirty-six were in the early stage and 36 in the later stage of tooth development. The samples were evaluated with Semi-quantitative Reverse Transcription-Polymerase chain Reaction analyses (RT-PcR), Western blot analysis (WB) and immunohistochemical analysis. Western blot and immunohistochemical analysis showed a VEGF and NOS 1-2-3 positive reaction in all samples analysed. VEGF high positive decrease reaction was observed in stellate reticulum cells, ameloblast and odontoblast clusters in early stage compared to later stage of tooth germ development. Comparable VEGF expression was observed in endothelial cells of early and advanced stage growth. NOS1 and NOS3 expressions showed a high increased value in stellate reticulum cells, and ameloblast and odontoblast clusters in advanced stage compared to early stage of development. The absence or only moderate positive reaction of NOS2 was detected in all the different tissues. Positive NOS2 expression showed in advanced stage of tissue development compared to early stage. The action of VEGF and NOS molecules are important mediators of angiogenesis during dental tissue development. VEGF high positive expression in stellate reticulum cells in the early stage of tooth development compared to the later stage and the other cell types, suggests a critical role of the stellate reticulum during dental embryo-morphogenesis.
Microforms in gravel bed rivers: Formation, disintegration, and effects on bedload transport
Strom, K.; Papanicolaou, A.N.; Evangelopoulos, N.; Odeh, M.
2004-01-01
This research aims to advance current knowledge on cluster formation and evolution by tackling some of the aspects associated with cluster microtopography and the effects of clusters on bedload transport. The specific objectives of the study are (1) to identify the bed shear stress range in which clusters form and disintegrate, (2) to quantitatively describe the spacing characteristics and orientation of clusters with respect to flow characteristics, (3) to quantify the effects clusters have on the mean bedload rate, and (4) to assess the effects of clusters on the pulsating nature of bedload. In order to meet the objectives of this study, two main experimental scenarios, namely, Test Series A and B (20 experiments overall) are considered in a laboratory flume under well-controlled conditions. Series A tests are performed to address objectives (1) and (2) while Series B is designed to meet objectives (3) and (4). Results show that cluster microforms develop in uniform sediment at 1.25 to 2 times the Shields parameter of an individual particle and start disintegrating at about 2.25 times the Shields parameter. It is found that during an unsteady flow event, effects of clusters on bedload transport rate can be classified in three different phases: a sink phase where clusters absorb incoming sediment, a neutral phase where clusters do not affect bedload, and a source phase where clusters release particles. Clusters also increase the magnitude of the fluctuations in bedload transport rate, showing that clusters amplify the unsteady nature of bedload transport. A fourth-order autoregressive, autoregressive integrated moving average model is employed to describe the time series of bedload and provide a predictive formula for predicting bedload at different periods. Finally, a change-point analysis enhanced with a binary segmentation procedure is performed to identify the abrupt changes in the bedload statistic characteristics due to the effects of clusters and detect the different phases in bedload time series using probability theory. The analysis verifies the experimental findings that three phases are detected in the bedload rate time series structure, namely, sink, neutral, and source. ?? ASCE / JUNE 2004.
Nilsson, Daniel; Lindman, Magdalena; Victor, Trent; Dozza, Marco
2018-04-01
Single-vehicle run-off-road crashes are a major traffic safety concern, as they are associated with a high proportion of fatal outcomes. In addressing run-off-road crashes, the development and evaluation of advanced driver assistance systems requires test scenarios that are representative of the variability found in real-world crashes. We apply hierarchical agglomerative cluster analysis to define similarities in a set of crash data variables, these clusters can then be used as the basis in test scenario development. Out of 13 clusters, nine test scenarios are derived, corresponding to crashes characterised by: drivers drifting off the road in daytime and night-time, high speed departures, high-angle departures on narrow roads, highways, snowy roads, loss-of-control on wet roadways, sharp curves, and high speeds on roads with severe road surface conditions. In addition, each cluster was analysed with respect to crash variables related to the crash cause and reason for the unintended lane departure. The study shows that cluster analysis of representative data provides a statistically based method to identify relevant properties for run-off-road test scenarios. This was done to support development of vehicle-based run-off-road countermeasures and driver behaviour models used in virtual testing. Future studies should use driver behaviour from naturalistic driving data to further define how test-scenarios and behavioural causation mechanisms should be included. Copyright © 2018 Elsevier Ltd. All rights reserved.
Auxiliary iron-sulfur cofactors in radical SAM enzymes.
Lanz, Nicholas D; Booker, Squire J
2015-06-01
A vast number of enzymes are now known to belong to a superfamily known as radical SAM, which all contain a [4Fe-4S] cluster ligated by three cysteine residues. The remaining, unligated, iron ion of the cluster binds in contact with the α-amino and α-carboxylate groups of S-adenosyl-l-methionine (SAM). This binding mode facilitates inner-sphere electron transfer from the reduced form of the cluster into the sulfur atom of SAM, resulting in a reductive cleavage of SAM to methionine and a 5'-deoxyadenosyl radical. The 5'-deoxyadenosyl radical then abstracts a target substrate hydrogen atom, initiating a wide variety of radical-based transformations. A subset of radical SAM enzymes contains one or more additional iron-sulfur clusters that are required for the reactions they catalyze. However, outside of a subset of sulfur insertion reactions, very little is known about the roles of these additional clusters. This review will highlight the most recent advances in the identification and characterization of radical SAM enzymes that harbor auxiliary iron-sulfur clusters. This article is part of a Special Issue entitled: Fe/S proteins: Analysis, structure, function, biogenesis and diseases. Copyright © 2015 Elsevier B.V. All rights reserved.
Adamczak, Rafal; Meller, Jarek
2016-12-28
Advances in computing have enabled current protein and RNA structure prediction and molecular simulation methods to dramatically increase their sampling of conformational spaces. The quickly growing number of experimentally resolved structures, and databases such as the Protein Data Bank, also implies large scale structural similarity analyses to retrieve and classify macromolecular data. Consequently, the computational cost of structure comparison and clustering for large sets of macromolecular structures has become a bottleneck that necessitates further algorithmic improvements and development of efficient software solutions. uQlust is a versatile and easy-to-use tool for ultrafast ranking and clustering of macromolecular structures. uQlust makes use of structural profiles of proteins and nucleic acids, while combining a linear-time algorithm for implicit comparison of all pairs of models with profile hashing to enable efficient clustering of large data sets with a low memory footprint. In addition to ranking and clustering of large sets of models of the same protein or RNA molecule, uQlust can also be used in conjunction with fragment-based profiles in order to cluster structures of arbitrary length. For example, hierarchical clustering of the entire PDB using profile hashing can be performed on a typical laptop, thus opening an avenue for structural explorations previously limited to dedicated resources. The uQlust package is freely available under the GNU General Public License at https://github.com/uQlust . uQlust represents a drastic reduction in the computational complexity and memory requirements with respect to existing clustering and model quality assessment methods for macromolecular structure analysis, while yielding results on par with traditional approaches for both proteins and RNAs.
Improved Test Planning and Analysis Through the Use of Advanced Statistical Methods
NASA Technical Reports Server (NTRS)
Green, Lawrence L.; Maxwell, Katherine A.; Glass, David E.; Vaughn, Wallace L.; Barger, Weston; Cook, Mylan
2016-01-01
The goal of this work is, through computational simulations, to provide statistically-based evidence to convince the testing community that a distributed testing approach is superior to a clustered testing approach for most situations. For clustered testing, numerous, repeated test points are acquired at a limited number of test conditions. For distributed testing, only one or a few test points are requested at many different conditions. The statistical techniques of Analysis of Variance (ANOVA), Design of Experiments (DOE) and Response Surface Methods (RSM) are applied to enable distributed test planning, data analysis and test augmentation. The D-Optimal class of DOE is used to plan an optimally efficient single- and multi-factor test. The resulting simulated test data are analyzed via ANOVA and a parametric model is constructed using RSM. Finally, ANOVA can be used to plan a second round of testing to augment the existing data set with new data points. The use of these techniques is demonstrated through several illustrative examples. To date, many thousands of comparisons have been performed and the results strongly support the conclusion that the distributed testing approach outperforms the clustered testing approach.
The three-dimensional Multi-Block Advanced Grid Generation System (3DMAGGS)
NASA Technical Reports Server (NTRS)
Alter, Stephen J.; Weilmuenster, Kenneth J.
1993-01-01
As the size and complexity of three dimensional volume grids increases, there is a growing need for fast and efficient 3D volumetric elliptic grid solvers. Present day solvers are limited by computational speed and do not have all the capabilities such as interior volume grid clustering control, viscous grid clustering at the wall of a configuration, truncation error limiters, and convergence optimization residing in one code. A new volume grid generator, 3DMAGGS (Three-Dimensional Multi-Block Advanced Grid Generation System), which is based on the 3DGRAPE code, has evolved to meet these needs. This is a manual for the usage of 3DMAGGS and contains five sections, including the motivations and usage, a GRIDGEN interface, a grid quality analysis tool, a sample case for verifying correct operation of the code, and a comparison to both 3DGRAPE and GRIDGEN3D. Since it was derived from 3DGRAPE, this technical memorandum should be used in conjunction with the 3DGRAPE manual (NASA TM-102224).
High-resolution Self-Organizing Maps for advanced visualization and dimension reduction.
Saraswati, Ayu; Nguyen, Van Tuc; Hagenbuchner, Markus; Tsoi, Ah Chung
2018-05-04
Kohonen's Self Organizing feature Map (SOM) provides an effective way to project high dimensional input features onto a low dimensional display space while preserving the topological relationships among the input features. Recent advances in algorithms that take advantages of modern computing hardware introduced the concept of high resolution SOMs (HRSOMs). This paper investigates the capabilities and applicability of the HRSOM as a visualization tool for cluster analysis and its suitabilities to serve as a pre-processor in ensemble learning models. The evaluation is conducted on a number of established benchmarks and real-world learning problems, namely, the policeman benchmark, two web spam detection problems, a network intrusion detection problem, and a malware detection problem. It is found that the visualization resulted from an HRSOM provides new insights concerning these learning problems. It is furthermore shown empirically that broad benefits from the use of HRSOMs in both clustering and classification problems can be expected. Copyright © 2018 Elsevier Ltd. All rights reserved.
Parallel implementation of D-Phylo algorithm for maximum likelihood clusters.
Malik, Shamita; Sharma, Dolly; Khatri, Sunil Kumar
2017-03-01
This study explains a newly developed parallel algorithm for phylogenetic analysis of DNA sequences. The newly designed D-Phylo is a more advanced algorithm for phylogenetic analysis using maximum likelihood approach. The D-Phylo while misusing the seeking capacity of k -means keeps away from its real constraint of getting stuck at privately conserved motifs. The authors have tested the behaviour of D-Phylo on Amazon Linux Amazon Machine Image(Hardware Virtual Machine)i2.4xlarge, six central processing unit, 122 GiB memory, 8 × 800 Solid-state drive Elastic Block Store volume, high network performance up to 15 processors for several real-life datasets. Distributing the clusters evenly on all the processors provides us the capacity to accomplish a near direct speed if there should arise an occurrence of huge number of processors.
Unsupervised pattern recognition methods in ciders profiling based on GCE voltammetric signals.
Jakubowska, Małgorzata; Sordoń, Wanda; Ciepiela, Filip
2016-07-15
This work presents a complete methodology of distinguishing between different brands of cider and ageing degrees, based on voltammetric signals, utilizing dedicated data preprocessing procedures and unsupervised multivariate analysis. It was demonstrated that voltammograms recorded on glassy carbon electrode in Britton-Robinson buffer at pH 2 are reproducible for each brand. By application of clustering algorithms and principal component analysis visible homogenous clusters were obtained. Advanced signal processing strategy which included automatic baseline correction, interval scaling and continuous wavelet transform with dedicated mother wavelet, was a key step in the correct recognition of the objects. The results show that voltammetry combined with optimized univariate and multivariate data processing is a sufficient tool to distinguish between ciders from various brands and to evaluate their freshness. Copyright © 2016 Elsevier Ltd. All rights reserved.
Winter Precipitation Forecast in the European and Mediterranean Regions Using Cluster Analysis
NASA Astrophysics Data System (ADS)
Totz, Sonja; Tziperman, Eli; Coumou, Dim; Pfeiffer, Karl; Cohen, Judah
2017-12-01
The European climate is changing under global warming, and especially the Mediterranean region has been identified as a hot spot for climate change with climate models projecting a reduction in winter rainfall and a very pronounced increase in summertime heat waves. These trends are already detectable over the historic period. Hence, it is beneficial to forecast seasonal droughts well in advance so that water managers and stakeholders can prepare to mitigate deleterious impacts. We developed a new cluster-based empirical forecast method to predict precipitation anomalies in winter. This algorithm considers not only the strength but also the pattern of the precursors. We compare our algorithm with dynamic forecast models and a canonical correlation analysis-based prediction method demonstrating that our prediction method performs better in terms of time and pattern correlation in the Mediterranean and European regions.
Deep Learning Nuclei Detection in Digitized Histology Images by Superpixels.
Sornapudi, Sudhir; Stanley, Ronald Joe; Stoecker, William V; Almubarak, Haidar; Long, Rodney; Antani, Sameer; Thoma, George; Zuna, Rosemary; Frazier, Shelliane R
2018-01-01
Advances in image analysis and computational techniques have facilitated automatic detection of critical features in histopathology images. Detection of nuclei is critical for squamous epithelium cervical intraepithelial neoplasia (CIN) classification into normal, CIN1, CIN2, and CIN3 grades. In this study, a deep learning (DL)-based nuclei segmentation approach is investigated based on gathering localized information through the generation of superpixels using a simple linear iterative clustering algorithm and training with a convolutional neural network. The proposed approach was evaluated on a dataset of 133 digitized histology images and achieved an overall nuclei detection (object-based) accuracy of 95.97%, with demonstrated improvement over imaging-based and clustering-based benchmark techniques. The proposed DL-based nuclei segmentation Method with superpixel analysis has shown improved segmentation results in comparison to state-of-the-art methods.
Cluster analysis of obesity and asthma phenotypes.
Sutherland, E Rand; Goleva, Elena; King, Tonya S; Lehman, Erik; Stevens, Allen D; Jackson, Leisa P; Stream, Amanda R; Fahy, John V; Leung, Donald Y M
2012-01-01
Asthma is a heterogeneous disease with variability among patients in characteristics such as lung function, symptoms and control, body weight, markers of inflammation, and responsiveness to glucocorticoids (GC). Cluster analysis of well-characterized cohorts can advance understanding of disease subgroups in asthma and point to unsuspected disease mechanisms. We utilized an hypothesis-free cluster analytical approach to define the contribution of obesity and related variables to asthma phenotype. In a cohort of clinical trial participants (n = 250), minimum-variance hierarchical clustering was used to identify clinical and inflammatory biomarkers important in determining disease cluster membership in mild and moderate persistent asthmatics. In a subset of participants, GC sensitivity was assessed via expression of GC receptor alpha (GCRα) and induction of MAP kinase phosphatase-1 (MKP-1) expression by dexamethasone. Four asthma clusters were identified, with body mass index (BMI, kg/m(2)) and severity of asthma symptoms (AEQ score) the most significant determinants of cluster membership (F = 57.1, p<0.0001 and F = 44.8, p<0.0001, respectively). Two clusters were composed of predominantly obese individuals; these two obese asthma clusters differed from one another with regard to age of asthma onset, measures of asthma symptoms (AEQ) and control (ACQ), exhaled nitric oxide concentration (F(E)NO) and airway hyperresponsiveness (methacholine PC(20)) but were similar with regard to measures of lung function (FEV(1) (%) and FEV(1)/FVC), airway eosinophilia, IgE, leptin, adiponectin and C-reactive protein (hsCRP). Members of obese clusters demonstrated evidence of reduced expression of GCRα, a finding which was correlated with a reduced induction of MKP-1 expression by dexamethasone Obesity is an important determinant of asthma phenotype in adults. There is heterogeneity in expression of clinical and inflammatory biomarkers of asthma across obese individuals. Reduced expression of the dominant functional isoform of the GCR may mediate GC insensitivity in obese asthmatics.
Binary Black Hole Mergers from Globular Clusters: Implications for Advanced LIGO.
Rodriguez, Carl L; Morscher, Meagan; Pattabiraman, Bharath; Chatterjee, Sourav; Haster, Carl-Johan; Rasio, Frederic A
2015-07-31
The predicted rate of binary black hole mergers from galactic fields can vary over several orders of magnitude and is extremely sensitive to the assumptions of stellar evolution. But in dense stellar environments such as globular clusters, binary black holes form by well-understood gravitational interactions. In this Letter, we study the formation of black hole binaries in an extensive collection of realistic globular cluster models. By comparing these models to observed Milky Way and extragalactic globular clusters, we find that the mergers of dynamically formed binaries could be detected at a rate of ∼100 per year, potentially dominating the binary black hole merger rate. We also find that a majority of cluster-formed binaries are more massive than their field-formed counterparts, suggesting that Advanced LIGO could identify certain binaries as originating from dense stellar environments.
Automatic pole-like object modeling via 3D part-based analysis of point cloud
NASA Astrophysics Data System (ADS)
He, Liu; Yang, Haoxiang; Huang, Yuchun
2016-10-01
Pole-like objects, including trees, lampposts and traffic signs, are indispensable part of urban infrastructure. With the advance of vehicle-based laser scanning (VLS), massive point cloud of roadside urban areas becomes applied in 3D digital city modeling. Based on the property that different pole-like objects have various canopy parts and similar trunk parts, this paper proposed the 3D part-based shape analysis to robustly extract, identify and model the pole-like objects. The proposed method includes: 3D clustering and recognition of trunks, voxel growing and part-based 3D modeling. After preprocessing, the trunk center is identified as the point that has local density peak and the largest minimum inter-cluster distance. Starting from the trunk centers, the remaining points are iteratively clustered to the same centers of their nearest point with higher density. To eliminate the noisy points, cluster border is refined by trimming boundary outliers. Then, candidate trunks are extracted based on the clustering results in three orthogonal planes by shape analysis. Voxel growing obtains the completed pole-like objects regardless of overlaying. Finally, entire trunk, branch and crown part are analyzed to obtain seven feature parameters. These parameters are utilized to model three parts respectively and get signal part-assembled 3D model. The proposed method is tested using the VLS-based point cloud of Wuhan University, China. The point cloud includes many kinds of trees, lampposts and other pole-like posters under different occlusions and overlaying. Experimental results show that the proposed method can extract the exact attributes and model the roadside pole-like objects efficiently.
Topology and Control of the Cell-Cycle-Regulated Transcriptional Circuitry
Haase, Steven B.; Wittenberg, Curt
2014-01-01
Nearly 20% of the budding yeast genome is transcribed periodically during the cell division cycle. The precise temporal execution of this large transcriptional program is controlled by a large interacting network of transcriptional regulators, kinases, and ubiquitin ligases. Historically, this network has been viewed as a collection of four coregulated gene clusters that are associated with each phase of the cell cycle. Although the broad outlines of these gene clusters were described nearly 20 years ago, new technologies have enabled major advances in our understanding of the genes comprising those clusters, their regulation, and the complex regulatory interplay between clusters. More recently, advances are being made in understanding the roles of chromatin in the control of the transcriptional program. We are also beginning to discover important regulatory interactions between the cell-cycle transcriptional program and other cell-cycle regulatory mechanisms such as checkpoints and metabolic networks. Here we review recent advances and contemporary models of the transcriptional network and consider these models in the context of eukaryotic cell-cycle controls. PMID:24395825
Analysis of ground-motion simulation big data
NASA Astrophysics Data System (ADS)
Maeda, T.; Fujiwara, H.
2016-12-01
We developed a parallel distributed processing system which applies a big data analysis to the large-scale ground motion simulation data. The system uses ground-motion index values and earthquake scenario parameters as input. We used peak ground velocity value and velocity response spectra as the ground-motion index. The ground-motion index values are calculated from our simulation data. We used simulated long-period ground motion waveforms at about 80,000 meshes calculated by a three dimensional finite difference method based on 369 earthquake scenarios of a great earthquake in the Nankai Trough. These scenarios were constructed by considering the uncertainty of source model parameters such as source area, rupture starting point, asperity location, rupture velocity, fmax and slip function. We used these parameters as the earthquake scenario parameter. The system firstly carries out the clustering of the earthquake scenario in each mesh by the k-means method. The number of clusters is determined in advance using a hierarchical clustering by the Ward's method. The scenario clustering results are converted to the 1-D feature vector. The dimension of the feature vector is the number of scenario combination. If two scenarios belong to the same cluster the component of the feature vector is 1, and otherwise the component is 0. The feature vector shows a `response' of mesh to the assumed earthquake scenario group. Next, the system performs the clustering of the mesh by k-means method using the feature vector of each mesh previously obtained. Here the number of clusters is arbitrarily given. The clustering of scenarios and meshes are performed by parallel distributed processing with Hadoop and Spark, respectively. In this study, we divided the meshes into 20 clusters. The meshes in each cluster are geometrically concentrated. Thus this system can extract regions, in which the meshes have similar `response', as clusters. For each cluster, it is possible to determine particular scenario parameters which characterize the cluster. In other word, by utilizing this system, we can obtain critical scenario parameters of the ground-motion simulation for each evaluation point objectively. This research was supported by CREST, JST.
Heavy-Duty Vehicle Port Drayage Drive Cycle Characterization and Development
DOE Office of Scientific and Technical Information (OSTI.GOV)
Prohaska, Robert; Konan, Arnaud; Kelly, Kenneth
In an effort to better understand the operational requirements of port drayage vehicles and their potential for adoption of advanced technologies, National Renewable Energy Laboratory (NREL) researchers collected over 36,000 miles of in-use duty cycle data from 30 Class 8 drayage trucks operating at the Port of Long Beach and Port of Los Angeles in Southern California. These data include 1-Hz global positioning system location and SAE J1939 high-speed controller area network information. Researchers processed the data through NREL's Drive-Cycle Rapid Investigation, Visualization, and Evaluation tool to examine vehicle kinematic and dynamic patterns across the spectrum of operations. Using themore » k-medoids clustering method, a repeatable and quantitative process for multi-mode drive cycle segmentation, the analysis led to the creation of multiple drive cycles representing four distinct modes of operation that can be used independently or in combination. These drive cycles are statistically representative of real-world operation of port drayage vehicles. When combined with modeling and simulation tools, these representative test cycles allow advanced vehicle or systems developers to efficiently and accurately evaluate vehicle technology performance requirements to reduce cost and development time while ultimately leading to the commercialization of advanced technologies that meet the performance requirements of the port drayage vocation. The drive cycles, which are suitable for chassis dynamometer testing, were compared to several existing test cycles. This paper presents the clustering methodology, accompanying results of the port drayage duty cycle analysis and custom drive cycle creation.« less
Heavy-Duty Vehicle Port Drayage Drive Cycle Characterization and Development: Preprint
DOE Office of Scientific and Technical Information (OSTI.GOV)
Prohaska, Robert; Konan, Arnaud; Kelly, Kenneth
In an effort to better understand the operational requirements of port drayage vehicles and their potential for adoption of advanced technologies, National Renewable Energy Laboratory (NREL) researchers collected over 36,000 miles of in-use duty cycle data from 30 Class 8 drayage trucks operating at the Port of Long Beach and Port of Los Angeles in Southern California. These data include 1-Hz global positioning system location and SAE J1939 high-speed controller area network information. Researchers processed the data through NREL's Drive-Cycle Rapid Investigation, Visualization, and Evaluation tool to examine vehicle kinematic and dynamic patterns across the spectrum of operations. Using themore » k-medoids clustering method, a repeatable and quantitative process for multi-mode drive cycle segmentation, the analysis led to the creation of multiple drive cycles representing four distinct modes of operation that can be used independently or in combination. These drive cycles are statistically representative of real-world operation of port drayage vehicles. When combined with modeling and simulation tools, these representative test cycles allow advanced vehicle or systems developers to efficiently and accurately evaluate vehicle technology performance requirements to reduce cost and development time while ultimately leading to the commercialization of advanced technologies that meet the performance requirements of the port drayage vocation. The drive cycles, which are suitable for chassis dynamometer testing, were compared to several existing test cycles. This paper presents the clustering methodology, accompanying results of the port drayage duty cycle analysis and custom drive cycle creation.« less
Heavy-Duty Vehicle Port Drayage Drive Cycle Characterization and Development
DOE Office of Scientific and Technical Information (OSTI.GOV)
Prohaska, Robert; Konan, Arnaud; Kelly, Kenneth
2016-05-02
In an effort to better understand the operational requirements of port drayage vehicles and their potential for adoption of advanced technologies, National Renewable Energy Laboratory (NREL) researchers collected over 36,000 miles of in-use duty cycle data from 30 Class 8 drayage trucks operating at the Port of Long Beach and Port of Los Angeles in Southern California. These data include 1-Hz global positioning system location and SAE J1939 high-speed controller area network information. Researchers processed the data through NREL's Drive-Cycle Rapid Investigation, Visualization, and Evaluation tool to examine vehicle kinematic and dynamic patterns across the spectrum of operations. Using themore » k-medoids clustering method, a repeatable and quantitative process for multi-mode drive cycle segmentation, the analysis led to the creation of multiple drive cycles representing four distinct modes of operation that can be used independently or in combination. These drive cycles are statistically representative of real-world operation of port drayage vehicles. When combined with modeling and simulation tools, these representative test cycles allow advanced vehicle or systems developers to efficiently and accurately evaluate vehicle technology performance requirements to reduce cost and development time while ultimately leading to the commercialization of advanced technologies that meet the performance requirements of the port drayage vocation. The drive cycles, which are suitable for chassis dynamometer testing, were compared to several existing test cycles. This paper presents the clustering methodology, accompanying results of the port drayage duty cycle analysis and custom drive cycle creation.« less
Dietary patterns in patients with advanced cancer: implications for anorexia-cachexia therapy.
Hutton, Joanne L; Martin, Lisa; Field, Catherine J; Wismer, Wendy V; Bruera, Eduardo D; Watanabe, Sharon M; Baracos, Vickie E
2006-11-01
Severe malnutrition and wasting are considered hallmarks of advanced malignant disease, and clinical research into anorexia-cachexia therapy and nutritional support for cancer patients is ongoing. However, information on typical dietary intakes and food choices for this population is notably lacking; proposed therapies for anorexia and wasting are not framed within the context of current intake. The objective of the study was to characterize the food intake patterns of patients with advanced cancer. Patients with advanced cancer (n = 151) recruited from a regional cancer center and palliative-care program completed a 3-d dietary record a mean (+/-SD) 8 +/- 7 mo before death. Food items were categorized according to macronutrient content and dietary use and subsequently entered into cluster analysis. Wide variations in intakes of energy (range: 4-53 kcal . kg body wt(-1) . d(-1); x +/- SD: 25.1 +/- 10.0 kcal . kg body wt(-1) . d(-1)) and protein (range: 0.2-2.7 g . kg body wt(-1) . d(-1); x +/- SD: 1.0 +/- 0.4 g . kg body wt(-1) . d(-1)) were observed. Even the subjects with the highest intakes had a recent history of weight loss, which suggests that the diets of those persons were consistently inadequate for weight maintenance. Cluster analysis found 3 dietary patterns that differed in food choice and caloric intake. Low intakes and a high risk of weight loss were associated with decreased frequency of eating and dietary profiles with little variety and unusually high proportions of liquids. These data provide a glimpse into dietary habits toward the end of life. Unique dietary patterns were found in this nutritionally vulnerable patient population.
Frongillo, Edward A; Nguyen, Phuong H; Saha, Kuntal K; Sanghvi, Tina; Afsana, Kaosar; Haque, Raisul; Baker, Jean; Ruel, Marie T; Rawat, Rahul; Menon, Purnima
2017-02-01
Promoting adequate nutrition through interventions to improve infant and young child feeding (IYCF) has the potential to contribute to child development. We examined whether an intensive intervention package that was aimed at improving IYCF at scale through the Alive & Thrive initiative in Bangladesh also advanced language and gross motor development, and whether advancements in language and gross motor development were explained through improved complementary feeding. A cluster-randomized design compared 2 intervention packages: intensive interpersonal counseling on IYCF, mass media campaign, and community mobilization (intensive) compared with usual nutrition counseling and mass media campaign (nonintensive). Twenty subdistricts were randomly assigned to receive either the intensive or the nonintensive intervention. Household surveys were conducted at baseline (2010) and at endline (2014) in the same communities (n = ∼4000 children aged 0-47.9 mo for each round). Child development was measured by asking mothers if their child had reached each of multiple milestones, with some observed. Linear regression accounting for clustering was used to derive difference-in-differences (DID) impact estimates, and path analysis was used to examine developmental advancement through indicators of improved IYCF and other factors. The DID in language development between intensive and nonintensive groups was 1.05 milestones (P = 0.001) among children aged 6-23.9 mo and 0.76 milestones (P = 0.038) among children aged 24-47.9 mo. For gross motor development, the DID was 0.85 milestones (P = 0.035) among children aged 6-23.9 mo. The differences observed corresponded to age- and sex-adjusted effect sizes of 0.35 for language and 0.23 for gross motor development. Developmental advancement at 6-23.9 mo was partially explained through improved minimum dietary diversity and the consumption of iron-rich food. Intensive IYCF intervention differentially advanced language and gross motor development, which was partially explained through improved complementary feeding. Measuring a diverse set of child outcomes, including functional outcomes such as child development, is important when evaluating integrated nutrition programs. This trial was registered at clinicaltrials.gov as NCT01678716. © 2017 American Society for Nutrition.
Naegle, Kristen M; Welsch, Roy E; Yaffe, Michael B; White, Forest M; Lauffenburger, Douglas A
2011-07-01
Advances in proteomic technologies continue to substantially accelerate capability for generating experimental data on protein levels, states, and activities in biological samples. For example, studies on receptor tyrosine kinase signaling networks can now capture the phosphorylation state of hundreds to thousands of proteins across multiple conditions. However, little is known about the function of many of these protein modifications, or the enzymes responsible for modifying them. To address this challenge, we have developed an approach that enhances the power of clustering techniques to infer functional and regulatory meaning of protein states in cell signaling networks. We have created a new computational framework for applying clustering to biological data in order to overcome the typical dependence on specific a priori assumptions and expert knowledge concerning the technical aspects of clustering. Multiple clustering analysis methodology ('MCAM') employs an array of diverse data transformations, distance metrics, set sizes, and clustering algorithms, in a combinatorial fashion, to create a suite of clustering sets. These sets are then evaluated based on their ability to produce biological insights through statistical enrichment of metadata relating to knowledge concerning protein functions, kinase substrates, and sequence motifs. We applied MCAM to a set of dynamic phosphorylation measurements of the ERRB network to explore the relationships between algorithmic parameters and the biological meaning that could be inferred and report on interesting biological predictions. Further, we applied MCAM to multiple phosphoproteomic datasets for the ERBB network, which allowed us to compare independent and incomplete overlapping measurements of phosphorylation sites in the network. We report specific and global differences of the ERBB network stimulated with different ligands and with changes in HER2 expression. Overall, we offer MCAM as a broadly-applicable approach for analysis of proteomic data which may help increase the current understanding of molecular networks in a variety of biological problems. © 2011 Naegle et al.
Devine, Patricia G; Forscher, Patrick S; Cox, William T L; Kaatz, Anna; Sheridan, Jennifer; Carnes, Molly
2017-11-01
Addressing the underrepresentation of women in science is a top priority for many institutions, but the majority of efforts to increase representation of women are neither evidence-based nor rigorously assessed. One exception is the gender bias habit-breaking intervention (Carnes et al., 2015), which, in a cluster-randomized trial involving all but two departmental clusters ( N = 92) in the 6 STEMM focused schools/colleges at the University of Wisconsin - Madison, led to increases in gender bias awareness and self-efficacy to promote gender equity in academic science departments. Following this initial success, the present study compares, in a preregistered analysis, hiring rates of new female faculty pre- and post-manipulation. Whereas the proportion of women hired by control departments remained stable over time, the proportion of women hired by intervention departments increased by an estimated 18 percentage points ( OR = 2.23, d OR = 0.34). Though the preregistered analysis did not achieve conventional levels of statistical significance ( p < 0.07), our study has a hard upper limit on statistical power, as the cluster-randomized trial has a maximum sample size of 92 departmental clusters. These patterns have undeniable practical significance for the advancement of women in science, and provide promising evidence that psychological interventions can facilitate gender equity and diversity.
Logo image clustering based on advanced statistics
NASA Astrophysics Data System (ADS)
Wei, Yi; Kamel, Mohamed; He, Yiwei
2007-11-01
In recent years, there has been a growing interest in the research of image content description techniques. Among those, image clustering is one of the most frequently discussed topics. Similar to image recognition, image clustering is also a high-level representation technique. However it focuses on the coarse categorization rather than the accurate recognition. Based on wavelet transform (WT) and advanced statistics, the authors propose a novel approach that divides various shaped logo images into groups according to the external boundary of each logo image. Experimental results show that the presented method is accurate, fast and insensitive to defects.
Zhang, Dake; Stecker, Pamela; Huckabee, Sloan; Miller, Rhonda
2016-09-01
Research has suggested that different strategies used when solving fraction problems are highly correlated with students' problem-solving accuracy. This study (a) utilized latent profile modeling to classify students into three different strategic developmental levels in solving fraction comparison problems and (b) accordingly provided differentiated strategic training for students starting from two different strategic developmental levels. In Study 1 we assessed 49 middle school students' performance on fraction comparison problems and categorized students into three clusters of strategic developmental clusters: a cross-multiplication cluster with the highest accuracy, a representation strategy cluster with medium accuracy, and a whole-number strategy cluster with the lowest accuracy. Based on the strategic developmental levels identified in Study 1, in Study 2 we selected three students from the whole-number strategy cluster and another three students from the representation strategy cluster and implemented a differentiated strategic training intervention within a multiple-baseline design. Results showed that both groups of students transitioned from less advanced to more advanced strategies and improved their problem-solving accuracy during the posttest, the maintenance test, and the generalization test. © Hammill Institute on Disabilities 2014.
Constraining AGN triggering mechanisms through the clustering analysis of active black holes
NASA Astrophysics Data System (ADS)
Gatti, M.; Shankar, F.; Bouillot, V.; Menci, N.; Lamastra, A.; Hirschmann, M.; Fiore, F.
2016-02-01
The triggering mechanisms for active galactic nuclei (AGN) are still debated. Some of the most popular ones include galaxy interactions (IT) and disc instabilities (DIs). Using an advanced semi-analytic model (SAM) of galaxy formation, coupled to accurate halo occupation distribution modelling, we investigate the imprint left by each separate triggering process on the clustering strength of AGN at small and large scales. Our main results are as follows: (I) DIs, irrespective of their exact implementation in the SAM, tend to fall short in triggering AGN activity in galaxies at the centre of haloes with Mh > 1013.5 h-1 M⊙. On the contrary, the IT scenario predicts abundance of active central galaxies that generally agrees well with observations at every halo mass. (II) The relative number of satellite AGN in DIs at intermediate-to-low luminosities is always significantly higher than in IT models, especially in groups and clusters. The low AGN satellite fraction predicted for the IT scenario might suggest that different feeding modes could simultaneously contribute to the triggering of satellite AGN. (III) Both scenarios are quite degenerate in matching large-scale clustering measurements, suggesting that the sole average bias might not be an effective observational constraint. (IV) Our analysis suggests the presence of both a mild luminosity and a more consistent redshift dependence in the AGN clustering, with AGN inhabiting progressively less massive dark matter haloes as the redshift increases. We also discuss the impact of different observational selection cuts in measuring AGN clustering, including possible discrepancies between optical and X-ray surveys.
Multi-Optimisation Consensus Clustering
NASA Astrophysics Data System (ADS)
Li, Jian; Swift, Stephen; Liu, Xiaohui
Ensemble Clustering has been developed to provide an alternative way of obtaining more stable and accurate clustering results. It aims to avoid the biases of individual clustering algorithms. However, it is still a challenge to develop an efficient and robust method for Ensemble Clustering. Based on an existing ensemble clustering method, Consensus Clustering (CC), this paper introduces an advanced Consensus Clustering algorithm called Multi-Optimisation Consensus Clustering (MOCC), which utilises an optimised Agreement Separation criterion and a Multi-Optimisation framework to improve the performance of CC. Fifteen different data sets are used for evaluating the performance of MOCC. The results reveal that MOCC can generate more accurate clustering results than the original CC algorithm.
Geotemporal Analysis of Neisseria meningitidis Clones in the United States: 2000–2005
Wiringa, Ann E.; Shutt, Kathleen A.; Marsh, Jane W.; Cohn, Amanda C.; Messonnier, Nancy E.; Zansky, Shelley M.; Petit, Susan; Farley, Monica M.; Gershman, Ken; Lynfield, Ruth; Reingold, Arthur; Schaffner, William; Thompson, Jamie; Brown, Shawn T.; Lee, Bruce Y.; Harrison, Lee H.
2013-01-01
Background The detection of meningococcal outbreaks relies on serogrouping and epidemiologic definitions. Advances in molecular epidemiology have improved the ability to distinguish unique Neisseria meningitidis strains, enabling the classification of isolates into clones. Around 98% of meningococcal cases in the United States are believed to be sporadic. Methods Meningococcal isolates from 9 Active Bacterial Core surveillance sites throughout the United States from 2000 through 2005 were classified according to serogroup, multilocus sequence typing, and outer membrane protein (porA, porB, and fetA) genotyping. Clones were defined as isolates that were indistinguishable according to this characterization. Case data were aggregated to the census tract level and all non-singleton clones were assessed for non-random spatial and temporal clustering using retrospective space-time analyses with a discrete Poisson probability model. Results Among 1,062 geocoded cases with available isolates, 438 unique clones were identified, 78 of which had ≥2 isolates. 702 cases were attributable to non-singleton clones, accounting for 66.0% of all geocoded cases. 32 statistically significant clusters comprised of 107 cases (10.1% of all geocoded cases) were identified. Clusters had the following attributes: included 2 to 11 cases; 1 day to 33 months duration; radius of 0 to 61.7 km; and attack rate of 0.7 to 57.8 cases per 100,000 population. Serogroups represented among the clusters were: B (n = 12 clusters, 45 cases), C (n = 11 clusters, 27 cases), and Y (n = 9 clusters, 35 cases); 20 clusters (62.5%) were caused by serogroups represented in meningococcal vaccines that are commercially available in the United States. Conclusions Around 10% of meningococcal disease cases in the U.S. could be assigned to a geotemporal cluster. Molecular characterization of isolates, combined with geotemporal analysis, is a useful tool for understanding the spread of virulent meningococcal clones and patterns of transmission in populations. PMID:24349182
Functional Reconstitution of a Fungal Natural Product Gene Cluster by Advanced Genome Editing.
Weber, Jakob; Valiante, Vito; Nødvig, Christina S; Mattern, Derek J; Slotkowski, Rebecca A; Mortensen, Uffe H; Brakhage, Axel A
2017-01-20
Filamentous fungi produce varieties of natural products even in a strain dependent manner. However, the genetic basis of chemical speciation between strains is still widely unknown. One example is trypacidin, a natural product of the opportunistic human pathogen Aspergillus fumigatus, which is not produced among different isolates. Combining computational analysis with targeted gene editing, we could link a single nucleotide insertion in the polyketide synthase of the trypacidin biosynthetic pathway and reconstitute its production in a nonproducing strain. Thus, we present a CRISPR/Cas9-based tool for advanced molecular genetic studies in filamentous fungi, exploiting selectable markers separated from the edited locus.
Chiu, Chia-Yi; Köhn, Hans-Friedrich
2016-09-01
The asymptotic classification theory of cognitive diagnosis (ACTCD) provided the theoretical foundation for using clustering methods that do not rely on a parametric statistical model for assigning examinees to proficiency classes. Like general diagnostic classification models, clustering methods can be useful in situations where the true diagnostic classification model (DCM) underlying the data is unknown and possibly misspecified, or the items of a test conform to a mix of multiple DCMs. Clustering methods can also be an option when fitting advanced and complex DCMs encounters computational difficulties. These can range from the use of excessive CPU times to plain computational infeasibility. However, the propositions of the ACTCD have only been proven for the Deterministic Input Noisy Output "AND" gate (DINA) model and the Deterministic Input Noisy Output "OR" gate (DINO) model. For other DCMs, there does not exist a theoretical justification to use clustering for assigning examinees to proficiency classes. But if clustering is to be used legitimately, then the ACTCD must cover a larger number of DCMs than just the DINA model and the DINO model. Thus, the purpose of this article is to prove the theoretical propositions of the ACTCD for two other important DCMs, the Reduced Reparameterized Unified Model and the General Diagnostic Model.
Recent advances in the Suf Fe-S cluster biogenesis pathway: Beyond the Proteobacteria.
Outten, F Wayne
2015-06-01
Fe-S clusters play critical roles in cellular function throughout all three kingdoms of life. Consequently, Fe-S cluster biogenesis systems are present in most organisms. The Suf (sulfur formation) system is the most ancient of the three characterized Fe-S cluster biogenesis pathways, which also include the Isc and Nif systems. Much of the first work on the Suf system took place in Gram-negative Proteobacteria used as model organisms. These early studies led to a wealth of biochemical, genetic, and physiological information on Suf function. From those studies we have learned that SufB functions as an Fe-S scaffold in conjunction with SufC (and in some cases SufD). SufS and SufE together mobilize sulfur for cluster assembly and SufA traffics the complete Fe-S cluster from SufB to target apo-proteins. However, recent progress on the Suf system in other organisms has opened up new avenues of research and new hypotheses about Suf function. This review focuses primarily on the most recent discoveries about the Suf pathway and where those new models may lead the field. This article is part of a Special Issue entitled: Fe/S proteins: Analysis, structure, function, biogenesis and diseases. Copyright © 2014 Elsevier B.V. All rights reserved.
Yang, Jun; Hou, Ziming; Wang, Changjiang; Wang, Hao; Zhang, Hongbing
2018-04-23
Adamantinomatous craniopharyngioma (ACP) is an aggressive brain tumor that occurs predominantly in the pediatric population. Conventional diagnosis method and standard therapy cannot treat ACPs effectively. In this paper, we aimed to identify key genes for ACP early diagnosis and treatment. Datasets GSE94349 and GSE68015 were obtained from Gene Expression Omnibus database. Consensus clustering was applied to discover the gene clusters in the expression data of GSE94349 and functional enrichment analysis was performed on gene set in each cluster. The protein-protein interaction (PPI) network was built by the Search Tool for the Retrieval of Interacting Genes, and hubs were selected. Support vector machine (SVM) model was built based on the signature genes identified from enrichment analysis and PPI network. Dataset GSE94349 was used for training and testing, and GSE68015 was used for validation. Besides, RT-qPCR analysis was performed to analyze the expression of signature genes in ACP samples compared with normal controls. Seven gene clusters were discovered in the differentially expressed genes identified from GSE94349 dataset. Enrichment analysis of each cluster identified 25 pathways that highly associated with ACP. PPI network was built and 46 hubs were determined. Twenty-five pathway-related genes that overlapped with the hubs in PPI network were used as signatures to establish the SVM diagnosis model for ACP. The prediction accuracy of SVM model for training, testing, and validation data were 94, 85, and 74%, respectively. The expression of CDH1, CCL2, ITGA2, COL8A1, COL6A2, and COL6A3 were significantly upregulated in ACP tumor samples, while CAMK2A, RIMS1, NEFL, SYT1, and STX1A were significantly downregulated, which were consistent with the differentially expressed gene analysis. SVM model is a promising classification tool for screening and early diagnosis of ACP. The ACP-related pathways and signature genes will advance our knowledge of ACP pathogenesis and benefit the therapy improvement.
Efficient and Flexible Climate Analysis with Python in a Cloud-Based Distributed Computing Framework
NASA Astrophysics Data System (ADS)
Gannon, C.
2017-12-01
As climate models become progressively more advanced, and spatial resolution further improved through various downscaling projects, climate projections at a local level are increasingly insightful and valuable. However, the raw size of climate datasets presents numerous hurdles for analysts wishing to develop customized climate risk metrics or perform site-specific statistical analysis. Four Twenty Seven, a climate risk consultancy, has implemented a Python-based distributed framework to analyze large climate datasets in the cloud. With the freedom afforded by efficiently processing these datasets, we are able to customize and continually develop new climate risk metrics using the most up-to-date data. Here we outline our process for using Python packages such as XArray and Dask to evaluate netCDF files in a distributed framework, StarCluster to operate in a cluster-computing environment, cloud computing services to access publicly hosted datasets, and how this setup is particularly valuable for generating climate change indicators and performing localized statistical analysis.
Torheim, Turid; Groendahl, Aurora R; Andersen, Erlend K F; Lyng, Heidi; Malinen, Eirik; Kvaal, Knut; Futsaether, Cecilia M
2016-11-01
Solid tumors are known to be spatially heterogeneous. Detection of treatment-resistant tumor regions can improve clinical outcome, by enabling implementation of strategies targeting such regions. In this study, K-means clustering was used to group voxels in dynamic contrast enhanced magnetic resonance images (DCE-MRI) of cervical cancers. The aim was to identify clusters reflecting treatment resistance that could be used for targeted radiotherapy with a dose-painting approach. Eighty-one patients with locally advanced cervical cancer underwent DCE-MRI prior to chemoradiotherapy. The resulting image time series were fitted to two pharmacokinetic models, the Tofts model (yielding parameters K trans and ν e ) and the Brix model (A Brix , k ep and k el ). K-means clustering was used to group similar voxels based on either the pharmacokinetic parameter maps or the relative signal increase (RSI) time series. The associations between voxel clusters and treatment outcome (measured as locoregional control) were evaluated using the volume fraction or the spatial distribution of each cluster. One voxel cluster based on the RSI time series was significantly related to locoregional control (adjusted p-value 0.048). This cluster consisted of low-enhancing voxels. We found that tumors with poor prognosis had this RSI-based cluster gathered into few patches, making this cluster a potential candidate for targeted radiotherapy. None of the voxels clusters based on Tofts or Brix parameter maps were significantly related to treatment outcome. We identified one group of tumor voxels significantly associated with locoregional relapse that could potentially be used for dose painting. This tumor voxel cluster was identified using the raw MRI time series rather than the pharmacokinetic maps.
Deep Learning Nuclei Detection in Digitized Histology Images by Superpixels
Sornapudi, Sudhir; Stanley, Ronald Joe; Stoecker, William V.; Almubarak, Haidar; Long, Rodney; Antani, Sameer; Thoma, George; Zuna, Rosemary; Frazier, Shelliane R.
2018-01-01
Background: Advances in image analysis and computational techniques have facilitated automatic detection of critical features in histopathology images. Detection of nuclei is critical for squamous epithelium cervical intraepithelial neoplasia (CIN) classification into normal, CIN1, CIN2, and CIN3 grades. Methods: In this study, a deep learning (DL)-based nuclei segmentation approach is investigated based on gathering localized information through the generation of superpixels using a simple linear iterative clustering algorithm and training with a convolutional neural network. Results: The proposed approach was evaluated on a dataset of 133 digitized histology images and achieved an overall nuclei detection (object-based) accuracy of 95.97%, with demonstrated improvement over imaging-based and clustering-based benchmark techniques. Conclusions: The proposed DL-based nuclei segmentation Method with superpixel analysis has shown improved segmentation results in comparison to state-of-the-art methods. PMID:29619277
Cortical atrophy patterns in early Parkinson's disease patients using hierarchical cluster analysis.
Uribe, Carme; Segura, Barbara; Baggio, Hugo Cesar; Abos, Alexandra; Garcia-Diaz, Anna Isabel; Campabadal, Anna; Marti, Maria Jose; Valldeoriola, Francesc; Compta, Yaroslau; Tolosa, Eduard; Junque, Carme
2018-05-01
Cortical brain atrophy detectable with MRI in non-demented advanced Parkinson's disease (PD) is well characterized, but its presence in early disease stages is still under debate. We aimed to investigate cortical atrophy patterns in a large sample of early untreated PD patients using a hypothesis-free data-driven approach. Seventy-seven de novo PD patients and 50 controls from the Parkinson's Progression Marker Initiative database with T1-weighted images in a 3-tesla Siemens scanner were included in this study. Mean cortical thickness was extracted from 360 cortical areas defined by the Human Connectome Project Multi-Modal Parcellation version 1.0, and a hierarchical cluster analysis was performed using Ward's linkage method. A general linear model with cortical thickness data was then used to compare clustering groups using FreeSurfer software. We identified two patterns of cortical atrophy. Compared with controls, patients grouped in pattern 1 (n = 33) were characterized by cortical thinning in bilateral orbitofrontal, anterior cingulate, and lateral and medial anterior temporal gyri. Patients in pattern 2 (n = 44) showed cortical thinning in bilateral occipital gyrus, cuneus, superior parietal gyrus, and left postcentral gyrus, and they showed neuropsychological impairment in memory and other cognitive domains. Even in the early stages of PD, there is evidence of cortical brain atrophy. Neuroimaging clustering analysis is able to detect two subgroups of cortical thinning, one with mainly anterior atrophy, and the other with posterior predominance and worse cognitive performance. Copyright © 2018 Elsevier Ltd. All rights reserved.
Montero, Rosa M; Herath, Athula; Qureshi, Ashfaq; Esfandiari, Ehsanollah; Pusey, Charles D; Frankel, Andrew H; Tam, Frederick W K
2018-01-08
The global increase in Diabetes Mellitus (DM) has led to an increase in DM-Chronic Kidney Disease (DM-CKD). In this cross-sectional observational study we aimed to define phenotypes for patients with DM-CKD that in future may be used to individualise treatment We report 4 DM-CKD phenotypes in 220 patients recruited from Imperial College NHS Trust clinics from 2004-2012. A robust principal component analysis (PCA) was used to statistically determine clusters with phenotypically different patients. 163 patients with complete data sets were analysed: 77 with CKD and 86 with DM-CKD. Four different clusters were identified. Phenotypes 1 and 2 are entirely composed of patients with DM-CKD and phenotypes 3 and 4 are predominantly CKD (non-DM-CKD). Phenotype 1 depicts a cardiovascular phenotype; phenotype 2: microvascular complications with advanced DM-CKD; phenotype 3: advanced CKD with less anaemia, lower weight and HbA1c; phenotype 4: hypercholesteraemic, younger, less severe CKD. We are the first group to describe different phenotypes in DM-CKD using a PCA approach. Identification of phenotypic groups illustrates the differences and similarities that occur under the umbrella term of DM-CKD providing an opportunity to study phenotypes within these groups thereby facilitating development of precision/personalised targeted medicine.
Millstone: software for multiplex microbial genome analysis and engineering
DOE Office of Scientific and Technical Information (OSTI.GOV)
Goodman, Daniel B.; Kuznetsov, Gleb; Lajoie, Marc J.
Inexpensive DNA sequencing and advances in genome editing have made computational analysis a major rate-limiting step in adaptive laboratory evolution and microbial genome engineering. Here, we describe Millstone, a web-based platform that automates genotype comparison and visualization for projects with up to hundreds of genomic samples. To enable iterative genome engineering, Millstone allows users to design oligonucleotide libraries and create successive versions of reference genomes. Millstone is open source and easily deployable to a cloud platform, local cluster, or desktop, making it a scalable solution for any lab.
Millstone: software for multiplex microbial genome analysis and engineering.
Goodman, Daniel B; Kuznetsov, Gleb; Lajoie, Marc J; Ahern, Brian W; Napolitano, Michael G; Chen, Kevin Y; Chen, Changping; Church, George M
2017-05-25
Inexpensive DNA sequencing and advances in genome editing have made computational analysis a major rate-limiting step in adaptive laboratory evolution and microbial genome engineering. We describe Millstone, a web-based platform that automates genotype comparison and visualization for projects with up to hundreds of genomic samples. To enable iterative genome engineering, Millstone allows users to design oligonucleotide libraries and create successive versions of reference genomes. Millstone is open source and easily deployable to a cloud platform, local cluster, or desktop, making it a scalable solution for any lab.
Millstone: software for multiplex microbial genome analysis and engineering
Goodman, Daniel B.; Kuznetsov, Gleb; Lajoie, Marc J.; ...
2017-05-25
Inexpensive DNA sequencing and advances in genome editing have made computational analysis a major rate-limiting step in adaptive laboratory evolution and microbial genome engineering. Here, we describe Millstone, a web-based platform that automates genotype comparison and visualization for projects with up to hundreds of genomic samples. To enable iterative genome engineering, Millstone allows users to design oligonucleotide libraries and create successive versions of reference genomes. Millstone is open source and easily deployable to a cloud platform, local cluster, or desktop, making it a scalable solution for any lab.
Cluster Computing for Embedded/Real-Time Systems
NASA Technical Reports Server (NTRS)
Katz, D.; Kepner, J.
1999-01-01
Embedded and real-time systems, like other computing systems, seek to maximize computing power for a given price, and thus can significantly benefit from the advancing capabilities of cluster computing.
Peptide protected gold clusters: chemical synthesis and biomedical applications
NASA Astrophysics Data System (ADS)
Yuan, Qing; Wang, Yaling; Zhao, Lina; Liu, Ru; Gao, Fuping; Gao, Liang; Gao, Xueyun
2016-06-01
Bridging the gap between atoms and nanoparticles, noble metal clusters with atomic precision continue to attract considerable attention due to their important applications in catalysis, energy transformation, biosensing and biomedicine. Greatly different to common chemical synthesis, a one-step biomimetic synthesis of peptide-conjugated metal clusters has been developed to meet the demand of emerging bioapplications. Under mild conditions, multifunctional peptides containing metal capturing, reactive and targeting groups are rationally designed and elaborately synthesized to fabricate atomically precise peptide protected metal clusters. Among them, peptide-protected Au Cs (peptide-Au Cs) possess a great deal of exceptional advantages such as nanometer dimensions, high photostability, good biocompatibility, accurate chemical formula and specific protein targeting capacity. In this review article, we focus on the recent advances in potential theranostic fields by introducing the rising progress of peptide-Au Cs for biological imaging, biological analysis and therapeutic applications. The interactions between Au Cs and biological systems as well as potential mechanisms are also our concerned theme. We expect that the rapidly growing interest in Au Cs-based theranostic applications will attract broader concerns across various disciplines.
Quantum Materials at the Nanoscale - Final Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cooper, Stephen Lance
The central aim of the Quantum Materials at the Nanoscale (QMN) cluster was to understand and control collective behavior involving the interplay of spins, orbitals, and charges, which governs many scientifically interesting and technologically important phenomena in numerous complex materials. Because these phenomena involve various competing interactions, and influence properties on many different length and energy scales in complex materials, tackling this important area of study motivated a collaborative effort that combined the diverse capabilities of QMN cluster experimentalists, the essential theoretical analysis provided by QMN cluster theorists, and the outstanding facilities and staff of the FSMRL. During the fundingmore » period 2007-2014, the DOE cluster grant for the Quantum Materials at the Nanoscale (QMN) cluster supported, at various times, 15 different faculty members (14 in Physics and 1 in Materials Science and Engineering), 7 postdoctoral research associates, and 57 physics and materials science PhD students. 41 of these PhD students have since graduated and have gone on to a variety of advanced technical positions at universities, industries, and national labs: 25 obtained postdoctoral positions at universities (14), industrial labs (2 at IBM), DOE national facilities (3 at Argonne National Laboratory, 1 at Brookhaven National Lab, 1 at Lawrence Berkeley National Lab, and 1 at Sandia National Lab), and other federal facilities (2 at NIST); 13 took various industrial positions, including positions at Intel (5), Quantum Design (1), Lasque Industries (1), Amazon (1), Bloomberg (1), and J.P. Morgan (1). Thus, the QMN grant provided the essential support for training a large number of technically advanced personnel who have now entered key national facilities, industries, and institutions. Additionally, during the period 2007-2015, the QMN cluster produced 159 publications (see pages 14-23), including 23 papers published in Physical Review Letters; 16 papers in Nature, Nature Physics, Nature Materials, or Nature Communications; 4 papers in Science, and 8 papers in Applied Physics Letters. In this report, we provide some key highlights of the collaborative projects in which the QMN cluster members have been involved since 2007.« less
Query Results Clustering by Extending SPARQL with CLUSTER BY
NASA Astrophysics Data System (ADS)
Ławrynowicz, Agnieszka
The task of dynamic clustering of the search results proved to be useful in the Web context, where the user often does not know the granularity of the search results in advance. The goal of this paper is to provide a declarative way for invoking dynamic clustering of the results of queries submitted over Semantic Web data. To achieve this goal the paper proposes an approach that extends SPARQL by clustering abilities. The approach introduces a new statement, CLUSTER BY, into the SPARQL grammar and proposes semantics for such extension.
Furnace Cyclic Oxidation Behavior of Multicomponent Low Conductivity Thermal Barrier Coatings
NASA Astrophysics Data System (ADS)
Zhu, Dongming; Nesbitt, James A.; Barrett, Charles A.; McCue, Terry R.; Miller, Robert A.
2004-03-01
Ceramic thermal barrier coatings (TBCs) will play an increasingly important role in advanced gas turbine engines due to their ability to further increase engine operating temperatures and reduce cooling, thus helping achieve future engine low emission, high efficiency, and improved reliability goals. Advanced multicomponent zirconia (ZrO2)-based TBCs are being developed using an oxide defect clustering design approach to achieve the required coating low thermal conductivity and high-temperature stability. Although the new composition coatings were not yet optimized for cyclic durability, an initial durability screening of the candidate coating materials was conducted using conventional furnace cyclic oxidation tests. In this paper, furnace cyclic oxidation behavior of plasma-sprayed ZrO2-based defect cluster TBCs was investigated at 1163°C using 45 min hot-time cycles. The ceramic coating failure mechanisms were studied using scanning electron microscopy (SEM) combined with x-ray diffraction (XRD) phase analysis after the furnace tests. The coating cyclic lifetime is also discussed in relation to coating processing, phase structures, dopant concentration, and other thermo-physical properties.
Furnace Cyclic Oxidation Behavior of Multi-Component Low Conductivity Thermal Barrier Coatings
NASA Technical Reports Server (NTRS)
Zhu, Dong-Ming; Nesbitt, James A.; Barrett, Charles A.; McCue, Terry R.; Miller, Robert A.
2004-01-01
Ceramic thermal barrier coatings will play an increasingly important role in advanced gas turbine engines because of their ability to further increase engine operating temperatures and reduce cooling, thus helping achieve future engine low emission, high efficiency and improved reliability goals. Advanced multi-component zirconia-based thermal barrier coatings are being developed using an oxide defect clustering design approach to achieve the required coating low thermal conductivity and high temperature stability. Although the new composition coatings were not yet optimized for cyclic durability, an initial durability screening of the candidate coating materials was conducted using conventional furnace cyclic oxidation tests. In this paper, furnace cyclic oxidation behavior of plasma-sprayed zirconia-based defect cluster thermal barrier coatings was investigated at 1163 C using 45 min hot cycles. The ceramic coating failure mechanisms were studied using scanning electron microscopy (SEM) combined with X-ray diffraction (XRD) phase analysis after the furnace tests. The coating cyclic lifetime is also discussed in relation to coating processing, phase structures, dopant concentration, and other thermo-physical properties.
Ba-Sang, Dan-Zeng; Long, Zi-Wen; Teng, Hao; Zhao, Xu-Peng; Qiu, Jian; Li, Ming-Shan
2016-01-01
Objective A network meta-analysis was conducted comparing the short-term efficacies of 16 targeted drugs in combination with chemotherapy for treatment of advanced/metastatic colorectal cancer (CRC). Results Twenty-seven RCTs were ultimately incorporated into this network meta-analysis. Compared with chemotherapy alone, bevacizumab + chemotherapy, panitumumab + chemotherapy and conatumumab + chemotherapy had higher PR rate. Bevacizumab + chemotherapy, cetuximab + chemotherapy, panitumumab + chemotherapy, trebananib + chemotherapy and conatumumab + chemotherapy had higher ORR rate in comparison to chemotherapy alone. Furthermore, bevacizumab + chemotherapy had higher DCR rate than chemotherapy alone. The results of our cluster analysis showed that chemotherapy combined with bevacizumab, cetuximab, panitumumab, conatumumab, ganitumab, or brivanib + cetuximab had better efficacies for the treatment of advanced/metastatic CRC in comparison to chemotherapy alone. Materials and Methods Electronic databases were comprehensively searched for potential and related randomized controlled trials (RCTs). Direct and indirect evidence were incorporated for evaluation of stable disease (SD), progressive disease (PD), complete response (CR), partial response (PR), disease control rate (DCR) and overall response ratio (ORR) by calculating odds ratio (OR) and 95% confidence intervals (CI), and using the surface under the cumulative ranking curve (SUCRA). Conclusions These results indicated that bevacizumab + chemotherapy, panitumumab + chemotherapy, conatumumab + chemotherapy and brivanib + cetuximab + chemotherapy may have better efficacies for the treatment of advanced/metastatic CRC. PMID:27806321
Cluster-based analysis of multi-model climate ensembles
NASA Astrophysics Data System (ADS)
Hyde, Richard; Hossaini, Ryan; Leeson, Amber A.
2018-06-01
Clustering - the automated grouping of similar data - can provide powerful and unique insight into large and complex data sets, in a fast and computationally efficient manner. While clustering has been used in a variety of fields (from medical image processing to economics), its application within atmospheric science has been fairly limited to date, and the potential benefits of the application of advanced clustering techniques to climate data (both model output and observations) has yet to be fully realised. In this paper, we explore the specific application of clustering to a multi-model climate ensemble. We hypothesise that clustering techniques can provide (a) a flexible, data-driven method of testing model-observation agreement and (b) a mechanism with which to identify model development priorities. We focus our analysis on chemistry-climate model (CCM) output of tropospheric ozone - an important greenhouse gas - from the recent Atmospheric Chemistry and Climate Model Intercomparison Project (ACCMIP). Tropospheric column ozone from the ACCMIP ensemble was clustered using the Data Density based Clustering (DDC) algorithm. We find that a multi-model mean (MMM) calculated using members of the most-populous cluster identified at each location offers a reduction of up to ˜ 20 % in the global absolute mean bias between the MMM and an observed satellite-based tropospheric ozone climatology, with respect to a simple, all-model MMM. On a spatial basis, the bias is reduced at ˜ 62 % of all locations, with the largest bias reductions occurring in the Northern Hemisphere - where ozone concentrations are relatively large. However, the bias is unchanged at 9 % of all locations and increases at 29 %, particularly in the Southern Hemisphere. The latter demonstrates that although cluster-based subsampling acts to remove outlier model data, such data may in fact be closer to observed values in some locations. We further demonstrate that clustering can provide a viable and useful framework in which to assess and visualise model spread, offering insight into geographical areas of agreement among models and a measure of diversity across an ensemble. Finally, we discuss caveats of the clustering techniques and note that while we have focused on tropospheric ozone, the principles underlying the cluster-based MMMs are applicable to other prognostic variables from climate models.
Salfati, Elias L; Herrington, David M; Assimes, Themistocles L
2016-01-01
The correlation between the extent of fatty streaks, more advanced atherosclerotic lesions, and community rates of coronary artery disease (CAD) is substantially higher for the coronary artery compared to the aorta. We sought to determine whether a genetic basis contributes to these differences. We conducted a cluster analysis of 6 subclinical atherosclerosis phenotypes documented in 564 white participants of the Pathobiological Determinants of Atherosclerosis in Youth study including the extent of fatty streaks and raised lesions in the coronary artery (CF and CR), thoracic aorta (TF and TR), and abdominal aorta (AF and AR) followed by a genetic association analysis of the same phenotypes. Our cluster analysis grouped all raised lesions and fatty streaks in the coronary into one cluster (CF, CR, TR, and AR) and the fatty streaks in the aorta into a second cluster (TF and AF). We found a genetic risk score of high-risk alleles at 57 susceptibility loci for CAD to be variably associated with the phenotypes in the first cluster (OR: 1.30 p = 0.009 for being in top quartile of degree of involvement of CF, 1.34 p = 0.005 for CR, 1.25: p = 0.11 for TR, and 1.19 p = 0.08 for AR) but not at all with the phenotypes in the second cluster (OR: 1.01, p = 0.95 for TF and 0.98, p = 0.82 for AF). The genetic determinants of fatty streaks in the aorta do not appear to overlap substantially with the genetic determinants of fatty streaks in the coronary as well as raised lesions in both the coronary and the aorta. These findings may explain why a larger fraction of fatty streaks in the aorta are less likely to progress to raised lesions compared to the coronary artery.
Herrington, David M.
2016-01-01
Objective The correlation between the extent of fatty streaks, more advanced atherosclerotic lesions, and community rates of coronary artery disease (CAD) is substantially higher for the coronary artery compared to the aorta. We sought to determine whether a genetic basis contributes to these differences. Approach and Results We conducted a cluster analysis of 6 subclinical atherosclerosis phenotypes documented in 564 white participants of the Pathobiological Determinants of Atherosclerosis in Youth study including the extent of fatty streaks and raised lesions in the coronary artery (CF and CR), thoracic aorta (TF and TR), and abdominal aorta (AF and AR) followed by a genetic association analysis of the same phenotypes. Our cluster analysis grouped all raised lesions and fatty streaks in the coronary into one cluster (CF, CR, TR, and AR) and the fatty streaks in the aorta into a second cluster (TF and AF). We found a genetic risk score of high-risk alleles at 57 susceptibility loci for CAD to be variably associated with the phenotypes in the first cluster (OR: 1.30 p = 0.009 for being in top quartile of degree of involvement of CF, 1.34 p = 0.005 for CR, 1.25: p = 0.11 for TR, and 1.19 p = 0.08 for AR) but not at all with the phenotypes in the second cluster (OR: 1.01, p = 0.95 for TF and 0.98, p = 0.82 for AF). Conclusions The genetic determinants of fatty streaks in the aorta do not appear to overlap substantially with the genetic determinants of fatty streaks in the coronary as well as raised lesions in both the coronary and the aorta. These findings may explain why a larger fraction of fatty streaks in the aorta are less likely to progress to raised lesions compared to the coronary artery. PMID:27861582
Mincholé, Ana; Martínez, Juan Pablo; Laguna, Pablo; Rodriguez, Blanca
2018-01-01
Widely developed for clinical screening, electrocardiogram (ECG) recordings capture the cardiac electrical activity from the body surface. ECG analysis can therefore be a crucial first step to help diagnose, understand and predict cardiovascular disorders responsible for 30% of deaths worldwide. Computational techniques, and more specifically machine learning techniques and computational modelling are powerful tools for classification, clustering and simulation, and they have recently been applied to address the analysis of medical data, especially ECG data. This review describes the computational methods in use for ECG analysis, with a focus on machine learning and 3D computer simulations, as well as their accuracy, clinical implications and contributions to medical advances. The first section focuses on heartbeat classification and the techniques developed to extract and classify abnormal from regular beats. The second section focuses on patient diagnosis from whole recordings, applied to different diseases. The third section presents real-time diagnosis and applications to wearable devices. The fourth section highlights the recent field of personalized ECG computer simulations and their interpretation. Finally, the discussion section outlines the challenges of ECG analysis and provides a critical assessment of the methods presented. The computational methods reported in this review are a strong asset for medical discoveries and their translation to the clinical world may lead to promising advances. PMID:29321268
The impact of therapeutic reference pricing on innovation in cardiovascular medicine.
Sheridan, Desmond; Attridge, Jim
2006-12-01
Therapeutic reference pricing (TRP) places medicines to treat the same medical condition into groups or 'clusters' with a single common reimbursed price. Underpinning this economic measure is an implicit assumption that the products included in the cluster have an equivalent effect on a typical patient with this disease. 'Truly innovative' products can be exempt from inclusion in the cluster. This increasingly common approach to cost containment allocates products into one of two categories - truly innovative or therapeutically equivalent. This study examines the implications of TRP against the step-wise evolution of drugs for cardiovascular conditions over the past 50 years. It illustrates the complex interactions between advances in understanding of cellular and molecular disease mechanisms, diagnostic techniques, treatment concepts, and the synthesis, testing and commercialisation of products. It confirms the highly unpredictable and incremental nature of the innovation process. Medical progress in terms of improvement in patient outcomes over the long-term depends on the cumulative effect of year after year of painstaking incremental advances. It shows that the parallel processes of advances in scientific knowledge and the industrial 'investment-innovative cycle' involve highly developed sets of complementary capabilities and resources. A framework is developed to assess the impact of TRP upon research and development investment decisions and the development of therapeutic classes. We conclude that a simple categorisation of products as either 'truly innovative' or 'therapeutically equivalent' is inconsistent with the incremental processes of innovation and the resulting differentiated product streams revealed by our analysis. Widespread introduction of TRP would probably have prematurely curtailed development of many incremental innovations that became the preferred 'product of choice' by physicians for some indications and patients in managing the incidence of cardiovascular disease.
Cloud Computing for Pharmacometrics: Using AWS, NONMEM, PsN, Grid Engine, and Sonic
Sanduja, S; Jewell, P; Aron, E; Pharai, N
2015-01-01
Cloud computing allows pharmacometricians to access advanced hardware, network, and security resources available to expedite analysis and reporting. Cloud-based computing environments are available at a fraction of the time and effort when compared to traditional local datacenter-based solutions. This tutorial explains how to get started with building your own personal cloud computer cluster using Amazon Web Services (AWS), NONMEM, PsN, Grid Engine, and Sonic. PMID:26451333
Cloud Computing for Pharmacometrics: Using AWS, NONMEM, PsN, Grid Engine, and Sonic.
Sanduja, S; Jewell, P; Aron, E; Pharai, N
2015-09-01
Cloud computing allows pharmacometricians to access advanced hardware, network, and security resources available to expedite analysis and reporting. Cloud-based computing environments are available at a fraction of the time and effort when compared to traditional local datacenter-based solutions. This tutorial explains how to get started with building your own personal cloud computer cluster using Amazon Web Services (AWS), NONMEM, PsN, Grid Engine, and Sonic.
TECHNOLOGICAL INNOVATION IN NEUROSURGERY: A QUANTITATIVE STUDY
Marcus, Hani J; Hughes-Hallett, Archie; Kwasnicki, Richard M; Darzi, Ara; Yang, Guang-Zhong; Nandi, Dipankar
2015-01-01
Object Technological innovation within healthcare may be defined as the introduction of a new technology that initiates a change in clinical practice. Neurosurgery is a particularly technologically intensive surgical discipline, and new technologies have preceded many of the major advances in operative neurosurgical technique. The aim of the present study was to quantitatively evaluate technological innovation in neurosurgery using patents and peer-reviewed publications as metrics of technology development and clinical translation respectively. Methods A patent database was searched between 1960 and 2010 using the search terms “neurosurgeon” OR “neurosurgical” OR “neurosurgery”. The top 50 performing patent codes were then grouped into technology clusters. Patent and publication growth curves were then generated for these technology clusters. A top performing technology cluster was then selected as an exemplar for more detailed analysis of individual patents. Results In all, 11,672 patents and 208,203 publications relating to neurosurgery were identified. The top performing technology clusters over the 50 years were: image guidance devices, clinical neurophysiology devices, neuromodulation devices, operating microscopes and endoscopes. Image guidance and neuromodulation devices demonstrated a highly correlated rapid rise in patents and publications, suggesting they are areas of technology expansion. In-depth analysis of neuromodulation patents revealed that the majority of high performing patents were related to Deep Brain Stimulation (DBS). Conclusions Patent and publication data may be used to quantitatively evaluate technological innovation in neurosurgery. PMID:25699414
Time fluctuation analysis of forest fire sequences
NASA Astrophysics Data System (ADS)
Vega Orozco, Carmen D.; Kanevski, Mikhaïl; Tonini, Marj; Golay, Jean; Pereira, Mário J. G.
2013-04-01
Forest fires are complex events involving both space and time fluctuations. Understanding of their dynamics and pattern distribution is of great importance in order to improve the resource allocation and support fire management actions at local and global levels. This study aims at characterizing the temporal fluctuations of forest fire sequences observed in Portugal, which is the country that holds the largest wildfire land dataset in Europe. This research applies several exploratory data analysis measures to 302,000 forest fires occurred from 1980 to 2007. The applied clustering measures are: Morisita clustering index, fractal and multifractal dimensions (box-counting), Ripley's K-function, Allan Factor, and variography. These algorithms enable a global time structural analysis describing the degree of clustering of a point pattern and defining whether the observed events occur randomly, in clusters or in a regular pattern. The considered methods are of general importance and can be used for other spatio-temporal events (i.e. crime, epidemiology, biodiversity, geomarketing, etc.). An important contribution of this research deals with the analysis and estimation of local measures of clustering that helps understanding their temporal structure. Each measure is described and executed for the raw data (forest fires geo-database) and results are compared to reference patterns generated under the null hypothesis of randomness (Poisson processes) embedded in the same time period of the raw data. This comparison enables estimating the degree of the deviation of the real data from a Poisson process. Generalizations to functional measures of these clustering methods, taking into account the phenomena, were also applied and adapted to detect time dependences in a measured variable (i.e. burned area). The time clustering of the raw data is compared several times with the Poisson processes at different thresholds of the measured function. Then, the clustering measure value depends on the threshold which helps to understand the time pattern of the studied events. Our findings detected the presence of overdensity of events in particular time periods and showed that the forest fire sequences in Portugal can be considered as a multifractal process with a degree of time-clustering of the events. Key words: time sequences, Morisita index, fractals, multifractals, box-counting, Ripley's K-function, Allan Factor, variography, forest fires, point process. Acknowledgements This work was partly supported by the SNFS Project No. 200021-140658, "Analysis and Modelling of Space-Time Patterns in Complex Regions". References - Kanevski M. (Editor). 2008. Advanced Mapping of Environmental Data: Geostatistics, Machine Learning and Bayesian Maximum Entropy. London / Hoboken: iSTE / Wiley. - Telesca L. and Pereira M.G. 2010. Time-clustering investigation of fire temporal fluctuations in Portugal, Nat. Hazards Earth Syst. Sci., vol. 10(4): 661-666. - Vega Orozco C., Tonini M., Conedera M., Kanevski M. (2012) Cluster recognition in spatial-temporal sequences: the case of forest fires, Geoinformatica, vol. 16(4): 653-673.
Barrera, L; Montes-Servín, E; Barrera, A; Ramírez-Tirado, L A; Salinas-Parra, F; Bañales-Méndez, J L; Sandoval-Ríos, M; Arrieta, Ó
2015-02-01
Immunoregulatory cytokines may play a fundamental role in tumor growth and metastases. Their effects are mediated through complex regulatory networks. Human cytokine profiles could define patient subgroups and represent new potential biomarkers. The aim of this study was to associate a cytokine profile obtained through data mining with the clinical characteristics of patients with advanced non-small-cell lung cancer (NSCLC). We conducted a prospective study of the plasma levels of 14 immunoregulatory cytokines by ELISA and a cytometric bead array assay in 110 NSCLC patients before chemotherapy and 25 control subjects. Cytokine levels and data-mining profiles were associated with clinical, quality of life and pathological outcomes. NSCLC patients had higher levels of interleukin (IL)-6, IL-8, IL-12p70, IL-17a and interferon (IFN)-γ, and lower levels of IL-33 and IL-29 compared with controls. The pro-inflammatory cytokines IL-1b, IL-6 and IL-8 were associated with lower hemoglobin levels, worse functional performance status (Eastern Cooperative Oncology Group, ECOG), fatigue and hyporexia. The anti-inflammatory cytokines IL-4, IL-10 and IL-33 were associated with anorexia and lower body mass index. We identified three clusters of patients according to data-mining analysis with different overall survival (OS; 25.4, 16.8 and 5.09 months, respectively, P = 0.0012). Multivariate analysis showed that ECOG performance status and data-mining clusters were significantly associated with OS (RR 3.59, [95% CI 1.9-6.7], P < 0.001 and 2.2, [1.2-3.8], P = 0.005). Our results provide evidence that complex cytokine networks may be used to identify patient subgroups with different prognoses in advanced NSCLC. These cytokines may represent potential biomarkers, particularly in the immunotherapy era in cancer research. © The Author 2014. Published by Oxford University Press on behalf of the European Society for Medical Oncology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Uehara, Yuriko; Oda, Katsutoshi; Ikeda, Yuji; Koso, Takahiro; Tsuji, Shingo; Yamamoto, Shogo; Asada, Kayo; Sone, Kenbun; Kurikawa, Reiko; Makii, Chinami; Hagiwara, Otoe; Tanikawa, Michihiro; Maeda, Daichi; Hasegawa, Kosei; Nakagawa, Shunsuke; Wada-Hiraike, Osamu; Kawana, Kei; Fukayama, Masashi; Fujiwara, Keiichi; Yano, Tetsu; Osuga, Yutaka; Fujii, Tomoyuki; Aburatani, Hiroyuki
2015-01-01
Ovarian clear cell carcinoma (CCC) is generally associated with chemoresistance and poor clinical outcome, even with early diagnosis; whereas high-grade serous carcinomas (SCs) and endometrioid carcinomas (ECs) are commonly chemosensitive at advanced stages. Although an integrated genomic analysis of SC has been performed, conclusive views on copy number and expression profiles for CCC are still limited. In this study, we performed single nucleotide polymorphism analysis with 57 epithelial ovarian cancers (31 CCCs, 14 SCs, and 12 ECs) and microarray expression analysis with 55 cancers (25 CCCs, 16 SCs, and 14 ECs). We then evaluated PIK3CA mutations and ARID1A expression in CCCs. SNP array analysis classified 13% of CCCs into a cluster with high frequency and focal range of copy number alterations (CNAs), significantly lower than for SCs (93%, P < 0.01) and ECs (50%, P = 0.017). The ratio of whole-arm to all CNAs was higher in CCCs (46.9%) than SCs (21.7%; P < 0.0001). SCs with loss of heterozygosity (LOH) of BRCA1 (85%) also had LOH of NF1 and TP53, and LOH of BRCA2 (62%) coexisted with LOH of RB1 and TP53. Microarray analysis classified CCCs into three clusters. One cluster (CCC-2, n = 10) showed more favorable prognosis than the CCC-1 and CCC-3 clusters (P = 0.041). Coexistent alterations of PIK3CA and ARID1A were more common in CCC-1 and CCC-3 (7/11, 64%) than in CCC-2 (0/10, 0%; P < 0.01). Being in cluster CCC-2 was an independent favorable prognostic factor in CCC. In conclusion, CCC was characterized by a high ratio of whole-arm CNAs; whereas CNAs in SC were mainly focal, but preferentially caused LOH of well-known tumor suppressor genes. As such, expression profiles might be useful for sub-classification of CCC, and might provide useful information on prognosis. PMID:26043110
Module Cluster: RTE-001.00 (GSC) Advanced Teaching of Reading.
ERIC Educational Resources Information Center
Brown, Estelle
Several module clusters were developed at Glassboro State College as the result of involvement in the Camden Teacher Corps project. The clusters are the primary mode of instruction in this competency-based teacher education program. Many of these modules are based on a list of teacher competencies developed by members of the Elementary Education…
Long-term surface EMG monitoring using K-means clustering and compressive sensing
NASA Astrophysics Data System (ADS)
Balouchestani, Mohammadreza; Krishnan, Sridhar
2015-05-01
In this work, we present an advanced K-means clustering algorithm based on Compressed Sensing theory (CS) in combination with the K-Singular Value Decomposition (K-SVD) method for Clustering of long-term recording of surface Electromyography (sEMG) signals. The long-term monitoring of sEMG signals aims at recording of the electrical activity produced by muscles which are very useful procedure for treatment and diagnostic purposes as well as for detection of various pathologies. The proposed algorithm is examined for three scenarios of sEMG signals including healthy person (sEMG-Healthy), a patient with myopathy (sEMG-Myopathy), and a patient with neuropathy (sEMG-Neuropathr), respectively. The proposed algorithm can easily scan large sEMG datasets of long-term sEMG recording. We test the proposed algorithm with Principal Component Analysis (PCA) and Linear Correlation Coefficient (LCC) dimensionality reduction methods. Then, the output of the proposed algorithm is fed to K-Nearest Neighbours (K-NN) and Probabilistic Neural Network (PNN) classifiers in order to calclute the clustering performance. The proposed algorithm achieves a classification accuracy of 99.22%. This ability allows reducing 17% of Average Classification Error (ACE), 9% of Training Error (TE), and 18% of Root Mean Square Error (RMSE). The proposed algorithm also reduces 14% clustering energy consumption compared to the existing K-Means clustering algorithm.
VizieR Online Data Catalog: Strong lensing mass modeling of 4 HFF clusters (Kawamata+, 2016)
NASA Astrophysics Data System (ADS)
Kawamata, R.; Oguri, M.; Ishigaki, M.; Shimasaku, K.; Ouchi, M.
2018-02-01
We use the public HFF data (http://www.stsci.edu/hst/campaigns/frontier-fields/) for our analysis. The HFF targets six massive clusters, Abell 2744 (z=0.308), MACS J0416.1-2403 (z=0.397), MACS J0717.5+3745 (z=0.545), MACS J1149.6+2223 (z=0.541), Abell S1063 (z=0.348), and Abell 370 (z=0.375), which have been chosen according to their lensing strength and also their accessibility from major ground-based telescopes. The cluster core and parallel field region of each cluster are observed deeply with the IR channel of Wide Field Camera 3 (WFC3/IR) and the Advanced Camera for Surveys (ACS). As of 2015 October, HST observations for the first four clusters, Abell 2744, MACS J0416.1-2403, MACS J0717.5+3745, and MACS J1149.6+2223, are completed. In this study, we use the Version 1.0 data products of drizzled images with a pixel scale of 0.03"/pixel provided by the Space Telescope Science Institute (STScI). The images for each cluster consist of F435W (B435), F606W (V606), and F814W (i814) images from ACS, and F105W (Y105), F125W (J125), F140W (JH140), and F160W (H160) images from WFC3/IR. (7 data files).
Yokoyama, Eiji; Uchimura, Masako
2007-11-01
Ninety-five enterohemorrhagic Escherichia coli serovar O157 strains, including 30 strains isolated from 13 intrafamily outbreaks and 14 strains isolated from 3 mass outbreaks, were studied by pulsed-field gel electrophoresis (PFGE) and variable number of tandem repeats (VNTR) typing, and the resulting data were subjected to cluster analysis. Cluster analysis of the VNTR typing data revealed that 57 (60.0%) of 95 strains, including all epidemiologically linked strains, formed clusters with at least 95% similarity. Cluster analysis of the PFGE patterns revealed that 67 (70.5%) of 95 strains, including all but 1 of the epidemiologically linked strains, formed clusters with 90% similarity. The number of epidemiologically unlinked strains forming clusters was significantly less by VNTR cluster analysis than by PFGE cluster analysis. The congruence value between PFGE and VNTR cluster analysis was low and did not show an obvious correlation. With two-step cluster analysis, the number of clustered epidemiologically unlinked strains by PFGE cluster analysis that were divided by subsequent VNTR cluster analysis was significantly higher than the number by VNTR cluster analysis that were divided by subsequent PFGE cluster analysis. These results indicate that VNTR cluster analysis is more efficient than PFGE cluster analysis as an epidemiological tool to trace the transmission of enterohemorrhagic E. coli O157.
NASA Astrophysics Data System (ADS)
Alexandroni, Guy; Zimmerman Moreno, Gali; Sochen, Nir; Greenspan, Hayit
2016-03-01
Recent advances in Diffusion Weighted Magnetic Resonance Imaging (DW-MRI) of white matter in conjunction with improved tractography produce impressive reconstructions of White Matter (WM) pathways. These pathways (fiber sets) often contain hundreds of thousands of fibers, or more. In order to make fiber based analysis more practical, the fiber set needs to be preprocessed to eliminate redundancies and to keep only essential representative fibers. In this paper we demonstrate and compare two distinctive frameworks for selecting this reduced set of fibers. The first framework entails pre-clustering the fibers using k-means, followed by Hierarchical Clustering and replacing each cluster with one representative. For the second clustering stage seven distance metrics were evaluated. The second framework is based on an efficient geometric approximation paradigm named coresets. Coresets present a new approach to optimization and have huge success especially in tasks requiring large computation time and/or memory. We propose a modified version of the coresets algorithm, Density Coreset. It is used for extracting the main fibers from dense datasets, leaving a small set that represents the main structures and connectivity of the brain. A novel approach, based on a 3D indicator structure, is used for comparing the frameworks. This comparison was applied to High Angular Resolution Diffusion Imaging (HARDI) scans of 4 healthy individuals. We show that among the clustering based methods, that cosine distance gives the best performance. In comparing the clustering schemes with coresets, Density Coreset method achieves the best performance.
Advanced multivariate analysis to assess remediation of hydrocarbons in soils.
Lin, Deborah S; Taylor, Peter; Tibbett, Mark
2014-10-01
Accurate monitoring of degradation levels in soils is essential in order to understand and achieve complete degradation of petroleum hydrocarbons in contaminated soils. We aimed to develop the use of multivariate methods for the monitoring of biodegradation of diesel in soils and to determine if diesel contaminated soils could be remediated to a chemical composition similar to that of an uncontaminated soil. An incubation experiment was set up with three contrasting soil types. Each soil was exposed to diesel at varying stages of degradation and then analysed for key hydrocarbons throughout 161 days of incubation. Hydrocarbon distributions were analysed by Principal Coordinate Analysis and similar samples grouped by cluster analysis. Variation and differences between samples were determined using permutational multivariate analysis of variance. It was found that all soils followed trajectories approaching the chemical composition of the unpolluted soil. Some contaminated soils were no longer significantly different to that of uncontaminated soil after 161 days of incubation. The use of cluster analysis allows the assignment of a percentage chemical similarity of a diesel contaminated soil to an uncontaminated soil sample. This will aid in the monitoring of hydrocarbon contaminated sites and the establishment of potential endpoints for successful remediation.
A comparison of methods for the analysis of binomial clustered outcomes in behavioral research.
Ferrari, Alberto; Comelli, Mario
2016-12-01
In behavioral research, data consisting of a per-subject proportion of "successes" and "failures" over a finite number of trials often arise. This clustered binary data are usually non-normally distributed, which can distort inference if the usual general linear model is applied and sample size is small. A number of more advanced methods is available, but they are often technically challenging and a comparative assessment of their performances in behavioral setups has not been performed. We studied the performances of some methods applicable to the analysis of proportions; namely linear regression, Poisson regression, beta-binomial regression and Generalized Linear Mixed Models (GLMMs). We report on a simulation study evaluating power and Type I error rate of these models in hypothetical scenarios met by behavioral researchers; plus, we describe results from the application of these methods on data from real experiments. Our results show that, while GLMMs are powerful instruments for the analysis of clustered binary outcomes, beta-binomial regression can outperform them in a range of scenarios. Linear regression gave results consistent with the nominal level of significance, but was overall less powerful. Poisson regression, instead, mostly led to anticonservative inference. GLMMs and beta-binomial regression are generally more powerful than linear regression; yet linear regression is robust to model misspecification in some conditions, whereas Poisson regression suffers heavily from violations of the assumptions when used to model proportion data. We conclude providing directions to behavioral scientists dealing with clustered binary data and small sample sizes. Copyright © 2016 Elsevier B.V. All rights reserved.
Analysis of Big Data in Gait Biomechanics: Current Trends and Future Directions.
Phinyomark, Angkoon; Petri, Giovanni; Ibáñez-Marcelo, Esther; Osis, Sean T; Ferber, Reed
2018-01-01
The increasing amount of data in biomechanics research has greatly increased the importance of developing advanced multivariate analysis and machine learning techniques, which are better able to handle "big data". Consequently, advances in data science methods will expand the knowledge for testing new hypotheses about biomechanical risk factors associated with walking and running gait-related musculoskeletal injury. This paper begins with a brief introduction to an automated three-dimensional (3D) biomechanical gait data collection system: 3D GAIT, followed by how the studies in the field of gait biomechanics fit the quantities in the 5 V's definition of big data: volume, velocity, variety, veracity, and value. Next, we provide a review of recent research and development in multivariate and machine learning methods-based gait analysis that can be applied to big data analytics. These modern biomechanical gait analysis methods include several main modules such as initial input features, dimensionality reduction (feature selection and extraction), and learning algorithms (classification and clustering). Finally, a promising big data exploration tool called "topological data analysis" and directions for future research are outlined and discussed.
Nissen, Kathrine G; Trevino, Kelly; Lange, Theis; Prigerson, Holly G
2016-12-01
Caring for a family member with advanced cancer strains family caregivers. Classification of family types has been shown to identify patients at risk of poor psychosocial function. However, little is known about how family relationships affect caregiver psychosocial function. To investigate family types identified by a cluster analysis and to examine the reproducibility of cluster analyses. We also sought to examine the relationship between family types and caregivers' psychosocial function. Data from 622 caregivers of advanced cancer patients (part of the Coping with Cancer Study) were analyzed using Gaussian Mixture Modeling as the primary method to identify family types based on the Family Relationship Index questionnaire. We then examined the relationship between family type and caregiver quality of life (Medical Outcome Survey Short Form), social support (Interpersonal Support Evaluation List), and perceived caregiver burden (Caregiving Burden Scale). Three family types emerged: low-expressive, detached, and supportive. Analyses of variance with post hoc comparisons showed that caregivers of detached and low-expressive family types experienced lower levels of quality of life and perceived social support in comparison to supportive family types. The study identified supportive, low-expressive, and detached family types among caregivers of advanced cancer patients. The supportive family type was associated with the best outcomes and detached with the worst. These findings indicate that family function is related to psychosocial function of caregivers of advanced cancer patients. Therefore, paying attention to family support and family members' ability to share feelings and manage conflicts may serve as an important tool to improve psychosocial function in families affected by cancer. Copyright © 2016 American Academy of Hospice and Palliative Medicine. All rights reserved.
2015-01-01
explain the accuracy and speed increase. Exploring the underlying connections of the energy evolution of these methods and the energy landscape for the...unwanted trivial global minimizers from the energy landscape . Note that the second eigenvector of the Laplacian already provides a solution to a cut...von Brecht. Convergence and energy landscape for Cheeger cut clustering. Advances in Neural Information Processing Systems, 25:1394– 1402, 2012. [13] X
Musekamp, Gunda; Gerlich, Christian; Ehlebracht-König, Inge; Faller, Hermann; Reusch, Andrea
2016-02-03
Fibromyalgia syndrome (FMS) is a complex chronic condition that makes high demands on patients' self-management skills. Thus, patient education is considered an important component of multimodal therapy, although evidence regarding its effectiveness is scarce. The main objective of this study is to assess the effectiveness of an advanced self-management patient education program for patients with FMS as compared to usual care in the context of inpatient rehabilitation. We conducted a multicenter cluster randomized controlled trial in 3 rehabilitation clinics. Clusters are groups of patients with FMS consecutively recruited within one week after admission. Patients of the intervention group receive the advanced multidisciplinary self-management patient education program (considering new knowledge on FMS, with a focus on transfer into everyday life), whereas patients in the control group receive standard patient education programs including information on FMS and coping with pain. A total of 566 patients are assessed at admission, at discharge and after 6 and 12 months, using patient reported questionnaires. Primary outcomes are patients' disease- and treatment-specific knowledge at discharge and self-management skills after 6 months. Secondary outcomes include satisfaction, attitudes and coping competences, health-promoting behavior, psychological distress, health impairment and participation. Treatment effects between groups are evaluated using multilevel regression analysis adjusting for baseline values. The study evaluates the effectiveness of a self-management patient education program for patients with FMS in the context of inpatient rehabilitation in a cluster randomized trial. Study results will show whether self-management patient education is beneficial for this group of patients. German Clinical Trials Register, DRKS00008782 , Registered 8 July 2015.
Automated extraction and analysis of rock discontinuity characteristics from 3D point clouds
NASA Astrophysics Data System (ADS)
Bianchetti, Matteo; Villa, Alberto; Agliardi, Federico; Crosta, Giovanni B.
2016-04-01
A reliable characterization of fractured rock masses requires an exhaustive geometrical description of discontinuities, including orientation, spacing, and size. These are required to describe discontinuum rock mass structure, perform Discrete Fracture Network and DEM modelling, or provide input for rock mass classification or equivalent continuum estimate of rock mass properties. Although several advanced methodologies have been developed in the last decades, a complete characterization of discontinuity geometry in practice is still challenging, due to scale-dependent variability of fracture patterns and difficult accessibility to large outcrops. Recent advances in remote survey techniques, such as terrestrial laser scanning and digital photogrammetry, allow a fast and accurate acquisition of dense 3D point clouds, which promoted the development of several semi-automatic approaches to extract discontinuity features. Nevertheless, these often need user supervision on algorithm parameters which can be difficult to assess. To overcome this problem, we developed an original Matlab tool, allowing fast, fully automatic extraction and analysis of discontinuity features with no requirements on point cloud accuracy, density and homogeneity. The tool consists of a set of algorithms which: (i) process raw 3D point clouds, (ii) automatically characterize discontinuity sets, (iii) identify individual discontinuity surfaces, and (iv) analyse their spacing and persistence. The tool operates in either a supervised or unsupervised mode, starting from an automatic preliminary exploration data analysis. The identification and geometrical characterization of discontinuity features is divided in steps. First, coplanar surfaces are identified in the whole point cloud using K-Nearest Neighbor and Principal Component Analysis algorithms optimized on point cloud accuracy and specified typical facet size. Then, discontinuity set orientation is calculated using Kernel Density Estimation and principal vector similarity criteria. Poles to points are assigned to individual discontinuity objects using easy custom vector clustering and Jaccard distance approaches, and each object is segmented into planar clusters using an improved version of the DBSCAN algorithm. Modal set orientations are then recomputed by cluster-based orientation statistics to avoid the effects of biases related to cluster size and density heterogeneity of the point cloud. Finally, spacing values are measured between individual discontinuity clusters along scanlines parallel to modal pole vectors, whereas individual feature size (persistence) is measured using 3D convex hull bounding boxes. Spacing and size are provided both as raw population data and as summary statistics. The tool is optimized for parallel computing on 64bit systems, and a Graphic User Interface (GUI) has been developed to manage data processing, provide several outputs, including reclassified point clouds, tables, plots, derived fracture intensity parameters, and export to modelling software tools. We present test applications performed both on synthetic 3D data (simple 3D solids) and real case studies, validating the results with existing geomechanical datasets.
Intermediate and advanced topics in multilevel logistic regression analysis.
Austin, Peter C; Merlo, Juan
2017-09-10
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
NASA Technical Reports Server (NTRS)
Kempler, Steve; Mathews, Tiffany
2016-01-01
The continuum of ever-evolving data management systems affords great opportunities to the enhancement of knowledge and facilitation of science research. To take advantage of these opportunities, it is essential to understand and develop methods that enable data relationships to be examined and the information to be manipulated. This presentation describes the efforts of the Earth Science Information Partners (ESIP) Federation Earth Science Data Analytics (ESDA) Cluster to understand, define, and facilitate the implementation of ESDA to advance science research. As a result of the void of Earth science data analytics publication material, the cluster has defined ESDA along with 10 goals to set the framework for a common understanding of tools and techniques that are available and still needed to support ESDA.
How the shape of an H-bonded network controls proton-coupled water activation in HONO formation.
Relph, Rachael A; Guasco, Timothy L; Elliott, Ben M; Kamrath, Michael Z; McCoy, Anne B; Steele, Ryan P; Schofield, Daniel P; Jordan, Kenneth D; Viggiano, Albert A; Ferguson, Eldon E; Johnson, Mark A
2010-01-15
Many chemical reactions in atmospheric aerosols and bulk aqueous environments are influenced by the surrounding solvation shell, but the precise molecular interactions underlying such effects have rarely been elucidated. We exploited recent advances in isomer-specific cluster vibrational spectroscopy to explore the fundamental relation between the hydrogen (H)-bonding arrangement of a set of ion-solvating water molecules and the chemical activity of this ensemble. We find that the extent to which the nitrosonium ion (NO+)and water form nitrous acid (HONO) and a hydrated proton cluster in the critical trihydrate depends sensitively on the geometrical arrangement of the water molecules in the network. Theoretical analysis of these data details the role of the water network in promoting charge delocalization.
DOT National Transportation Integrated Search
2016-12-15
Across regional economic development leaders and policy makers, the concept of clustering has grown in importance as a framework for structuring economic growth and resurgence (Muro & Katz, 2010). Cluster identification is most often treated as a com...
ERIC Educational Resources Information Center
Williams, Carrick C.; Pollatsek, Alexander; Cave, Kyle R.; Stroud, Michael J.
2009-01-01
In 2 experiments, eye movements were examined during searches in which elements were grouped into four 9-item clusters. The target (a red or blue "T") was known in advance, and each cluster contained different numbers of target-color elements. Rather than color composition of a cluster invariantly guiding the order of search though…
Synthesis and Electrochemistry of Cyclopentadienylcarbonyliron Tetramer: An Advanced Experiment.
ERIC Educational Resources Information Center
White, A. J.; Cunningham, Alice J.
1980-01-01
Describes an advanced level experiment in which a transition metal cluster compound, cyclopentadienylcarbonyliron tetramer, is synthesized and characterized spectroscopically. Its redox properties are then explored through cyclic voltammetry. (CS)
Yang, Yongxin; Bu, Dengpan; Zhao, Xiaowei; Sun, Peng; Wang, Jiaqi; Zhou, Lingyun
2013-04-05
To aid in unraveling diverse genetic and biological unknowns, a proteomic approach was used to analyze the whey proteome in cow, yak, buffalo, goat, and camel milk based on the isobaric tag for relative and absolute quantification (iTRAQ) techniques. This analysis is the first to produce proteomic data for the milk from the above-mentioned animal species: 211 proteins have been identified and 113 proteins have been categorized according to molecular function, cellular components, and biological processes based on gene ontology annotation. The results of principal component analysis showed significant differences in proteomic patterns among goat, camel, cow, buffalo, and yak milk. Furthermore, 177 differentially expressed proteins were submitted to advanced hierarchical clustering. The resulting clustering pattern included three major sample clusters: (1) cow, buffalo, and yak milk; (2) goat, cow, buffalo, and yak milk; and (3) camel milk. Certain proteins were chosen as characterization traits for a given species: whey acidic protein and quinone oxidoreductase for camel milk, biglycan for goat milk, uncharacterized protein (Accession Number: F1MK50 ) for yak milk, clusterin for buffalo milk, and primary amine oxidase for cow milk. These results help reveal the quantitative milk whey proteome pattern for analyzed species. This provides information for evaluating adulteration of specific specie milk and may provide potential directions for application of specific milk protein production based on physiological differences among animal species.
Text-mining analysis of mHealth research.
Ozaydin, Bunyamin; Zengul, Ferhat; Oner, Nurettin; Delen, Dursun
2017-01-01
In recent years, because of the advancements in communication and networking technologies, mobile technologies have been developing at an unprecedented rate. mHealth, the use of mobile technologies in medicine, and the related research has also surged parallel to these technological advancements. Although there have been several attempts to review mHealth research through manual processes such as systematic reviews, the sheer magnitude of the number of studies published in recent years makes this task very challenging. The most recent developments in machine learning and text mining offer some potential solutions to address this challenge by allowing analyses of large volumes of texts through semi-automated processes. The objective of this study is to analyze the evolution of mHealth research by utilizing text-mining and natural language processing (NLP) analyses. The study sample included abstracts of 5,644 mHealth research articles, which were gathered from five academic search engines by using search terms such as mobile health, and mHealth. The analysis used the Text Explorer module of JMP Pro 13 and an iterative semi-automated process involving tokenizing, phrasing, and terming. After developing the document term matrix (DTM) analyses such as single value decomposition (SVD), topic, and hierarchical document clustering were performed, along with the topic-informed document clustering approach. The results were presented in the form of word-clouds and trend analyses. There were several major findings regarding research clusters and trends. First, our results confirmed time-dependent nature of terminology use in mHealth research. For example, in earlier versus recent years the use of terminology changed from "mobile phone" to "smartphone" and from "applications" to "apps". Second, ten clusters for mHealth research were identified including (I) Clinical Research on Lifestyle Management, (II) Community Health, (III) Literature Review, (IV) Medical Interventions, (V) Research Design, (VI) Infrastructure, (VII) Applications, (VIII) Research and Innovation in Health Technologies, (IX) Sensor-based Devices and Measurement Algorithms, (X) Survey-based Research. Third, the trend analyses indicated the infrastructure cluster as the highest percentage researched area until 2014. The Research and Innovation in Health Technologies cluster experienced the largest increase in numbers of publications in recent years, especially after 2014. This study is unique because it is the only known study utilizing text-mining analyses to reveal the streams and trends for mHealth research. The fast growth in mobile technologies is expected to lead to higher numbers of studies focusing on mHealth and its implications for various healthcare outcomes. Findings of this study can be utilized by researchers in identifying areas for future studies.
Text-mining analysis of mHealth research
Zengul, Ferhat; Oner, Nurettin; Delen, Dursun
2017-01-01
In recent years, because of the advancements in communication and networking technologies, mobile technologies have been developing at an unprecedented rate. mHealth, the use of mobile technologies in medicine, and the related research has also surged parallel to these technological advancements. Although there have been several attempts to review mHealth research through manual processes such as systematic reviews, the sheer magnitude of the number of studies published in recent years makes this task very challenging. The most recent developments in machine learning and text mining offer some potential solutions to address this challenge by allowing analyses of large volumes of texts through semi-automated processes. The objective of this study is to analyze the evolution of mHealth research by utilizing text-mining and natural language processing (NLP) analyses. The study sample included abstracts of 5,644 mHealth research articles, which were gathered from five academic search engines by using search terms such as mobile health, and mHealth. The analysis used the Text Explorer module of JMP Pro 13 and an iterative semi-automated process involving tokenizing, phrasing, and terming. After developing the document term matrix (DTM) analyses such as single value decomposition (SVD), topic, and hierarchical document clustering were performed, along with the topic-informed document clustering approach. The results were presented in the form of word-clouds and trend analyses. There were several major findings regarding research clusters and trends. First, our results confirmed time-dependent nature of terminology use in mHealth research. For example, in earlier versus recent years the use of terminology changed from “mobile phone” to “smartphone” and from “applications” to “apps”. Second, ten clusters for mHealth research were identified including (I) Clinical Research on Lifestyle Management, (II) Community Health, (III) Literature Review, (IV) Medical Interventions, (V) Research Design, (VI) Infrastructure, (VII) Applications, (VIII) Research and Innovation in Health Technologies, (IX) Sensor-based Devices and Measurement Algorithms, (X) Survey-based Research. Third, the trend analyses indicated the infrastructure cluster as the highest percentage researched area until 2014. The Research and Innovation in Health Technologies cluster experienced the largest increase in numbers of publications in recent years, especially after 2014. This study is unique because it is the only known study utilizing text-mining analyses to reveal the streams and trends for mHealth research. The fast growth in mobile technologies is expected to lead to higher numbers of studies focusing on mHealth and its implications for various healthcare outcomes. Findings of this study can be utilized by researchers in identifying areas for future studies. PMID:29430456
Igloo-Plot: a tool for visualization of multidimensional datasets.
Kuntal, Bhusan K; Ghosh, Tarini Shankar; Mande, Sharmila S
2014-01-01
Advances in science and technology have resulted in an exponential growth of multivariate (or multi-dimensional) datasets which are being generated from various research areas especially in the domain of biological sciences. Visualization and analysis of such data (with the objective of uncovering the hidden patterns therein) is an important and challenging task. We present a tool, called Igloo-Plot, for efficient visualization of multidimensional datasets. The tool addresses some of the key limitations of contemporary multivariate visualization and analysis tools. The visualization layout, not only facilitates an easy identification of clusters of data-points having similar feature compositions, but also the 'marker features' specific to each of these clusters. The applicability of the various functionalities implemented herein is demonstrated using several well studied multi-dimensional datasets. Igloo-Plot is expected to be a valuable resource for researchers working in multivariate data mining studies. Igloo-Plot is available for download from: http://metagenomics.atc.tcs.com/IglooPlot/. Copyright © 2014 Elsevier Inc. All rights reserved.
Cluster analysis reveals subclinical subgroups with shared autistic and schizotypal traits.
Ford, Talitha C; Apputhurai, Pragalathan; Meyer, Denny; Crewther, David P
2018-07-01
Autism and schizophrenia spectrum research is typically based on coarse diagnostic classification, which overlooks individual variation within clinical groups. This method limits the identification of underlying cognitive, genetic and neural correlates of specific symptom dimensions. This study, therefore, aimed to identify homogenous subclinical subgroups of specific autistic and schizotypal traits dimensions, that may be utilised to establish more effective diagnostic and treatment practices. Latent profile analysis of subscale scores derived from an autism-schizotypy questionnaire, completed by 1678 subclinical adults aged 18-40 years (1250 females), identified a local optimum of eight population clusters: High, Moderate and Low Psychosocial Difficulties; High, Moderate and Low Autism-Schizotypy; High Psychosis-Proneness; and Moderate Schizotypy. These subgroups represent the convergent and discriminant dimensions of autism and schizotypy in the subclinical population, and highlight the importance of examining subgroups of specific symptom characteristics across these spectra in order to identify the underlying genetic and neural correlates that can be utilised to advance diagnostic and treatment practices. Copyright © 2018 Elsevier B.V. All rights reserved.
Keomanivong, F E; Camacho, L E; Lemley, C O; Kuemper, E A; Yunusova, R D; Borowicz, P P; Kirsch, J D; Vonnahme, K A; Caton, J S; Swanson, K C
2017-06-01
This study examined effects of stage of gestation and nutrient restriction with subsequent realimentation on maternal and foetal bovine pancreatic function. Dietary treatments were assigned on day 30 of pregnancy and included: control (CON; 100% requirements; n = 18) and restricted (R; 60% requirements; n = 30). On day 85, cows were slaughtered (CON, n = 6; R, n = 6), remained on control (CC; n = 12) and restricted (RR; n = 12), or realimented to control (RC; n = 11). On day 140, cows were slaughtered (CC, n = 6; RR, n = 6; RC, n = 5), remained on control (CCC, n = 6; RCC, n = 5) or realimented to control (RRC, n = 6). On day 254, the remaining cows were slaughtered and serum samples were collected from the maternal jugular vein and umbilical cord to determine insulin and glucose concentrations. Pancreases from cows and foetuses were removed, weighed, and subsampled for enzyme and histological analysis. As gestation progressed, maternal pancreatic α-amylase activity decreased and serum insulin concentrations increased (p ≤ 0.03). Foetal pancreatic trypsin activity increased (p < 0.001) with advancing gestation. Foetal pancreases subjected to realimentation (CCC vs. RCC and RRC) had increased protein and α-amylase activity at day 254 (p ≤ 0.02), while trypsin (U/g protein; p = 0.02) demonstrated the opposite effect. No treatment effects were observed for maternal or foetal pancreatic insulin-containing cell clusters. Foetal serum insulin and glucose levels were reduced with advancing gestation (p ≤ 0.03). The largest maternal insulin-containing cell cluster was not influenced by advancing gestation, while foetal clusters grew throughout (p = 0.01). These effects indicate that maternal digestive enzymes are influenced by nutrient restriction and there is a potential for programming of increased foetal digestive enzyme production resulting from previous maternal nutrient restriction. Journal of Animal Physiology and Animal Nutrition © 2016 Blackwell Verlag GmbH.
Seizure clusters: A common, understudied and undertreated phenomenon in refractory epilepsy.
Komaragiri, Arpitha; Detyniecki, Kamil; Hirsch, Lawrence J
2016-06-01
Epilepsy is widely prevalent globally and has emerged as a well-studied neurological condition in the recent past. Seizure clusters, a type of seizures, and several aspects pertaining to the etiopathogenesis and management of clusters are yet to be elucidated. This review is an attempt to recapitulate the current understanding of seizure clusters based on the research that has been performed on seizure clusters. This article will provide a comprehensive review of various aspects of clusters, and discusses definitions, prevalence, risk factors, impact on quality of life, approved treatment modalities, and recent advances in management. Copyright © 2016 Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Zhu, Dongming; Chen, Yuan L.; Miller, Robert A.
2004-01-01
Advanced thermal barrier coatings (TBCs) have been developed by incorporating multicomponent rare earth oxide dopants into zirconia-based thermal barrier coatings to promote the creation of the thermodynamically stable, immobile oxide defect clusters and/or nanophases within the coating systems. In this paper, the defect clusters, induced by Nd, Gd, and Yb rare earth dopants in the zirconia-yttria thermal barrier coatings, were characterized by high-resolution transmission electron microscopy (TEM). The TEM lattice imaging, selected area diffraction (SAD), and electron energy-loss spectroscopy (EELS) analyses demonstrated that the extensive nanoscale rare earth dopant segregation exists in the plasma-sprayed and electron-physical-vapor-deposited (EB PVD) thermal barrier coatings. The nanoscale concentration heterogeneity and the resulting large lattice distortion promoted the formation of parallel and rotational defective lattice clusters in the coating systems. The presence of the 5-to 100-nm-sized defect clusters and nanophases is believed to be responsible for the significant reduction of thermal conductivity, improved sintering resistance, and long-term high temperature stability of the advanced thermal barrier coating systems.
Large scale analysis of the mutational landscape in HT-SELEX improves aptamer discovery
Hoinka, Jan; Berezhnoy, Alexey; Dao, Phuong; Sauna, Zuben E.; Gilboa, Eli; Przytycka, Teresa M.
2015-01-01
High-Throughput (HT) SELEX combines SELEX (Systematic Evolution of Ligands by EXponential Enrichment), a method for aptamer discovery, with massively parallel sequencing technologies. This emerging technology provides data for a global analysis of the selection process and for simultaneous discovery of a large number of candidates but currently lacks dedicated computational approaches for their analysis. To close this gap, we developed novel in-silico methods to analyze HT-SELEX data and utilized them to study the emergence of polymerase errors during HT-SELEX. Rather than considering these errors as a nuisance, we demonstrated their utility for guiding aptamer discovery. Our approach builds on two main advancements in aptamer analysis: AptaMut—a novel technique allowing for the identification of polymerase errors conferring an improved binding affinity relative to the ‘parent’ sequence and AptaCluster—an aptamer clustering algorithm which is to our best knowledge, the only currently available tool capable of efficiently clustering entire aptamer pools. We applied these methods to an HT-SELEX experiment developing aptamers against Interleukin 10 receptor alpha chain (IL-10RA) and experimentally confirmed our predictions thus validating our computational methods. PMID:25870409
Technological innovation in neurosurgery: a quantitative study.
Marcus, Hani J; Hughes-Hallett, Archie; Kwasnicki, Richard M; Darzi, Ara; Yang, Guang-Zhong; Nandi, Dipankar
2015-07-01
Technological innovation within health care may be defined as the introduction of a new technology that initiates a change in clinical practice. Neurosurgery is a particularly technology-intensive surgical discipline, and new technologies have preceded many of the major advances in operative neurosurgical techniques. The aim of the present study was to quantitatively evaluate technological innovation in neurosurgery using patents and peer-reviewed publications as metrics of technology development and clinical translation, respectively. The authors searched a patent database for articles published between 1960 and 2010 using the Boolean search term "neurosurgeon OR neurosurgical OR neurosurgery." The top 50 performing patent codes were then grouped into technology clusters. Patent and publication growth curves were then generated for these technology clusters. A top-performing technology cluster was then selected as an exemplar for a more detailed analysis of individual patents. In all, 11,672 patents and 208,203 publications related to neurosurgery were identified. The top-performing technology clusters during these 50 years were image-guidance devices, clinical neurophysiology devices, neuromodulation devices, operating microscopes, and endoscopes. In relation to image-guidance and neuromodulation devices, the authors found a highly correlated rapid rise in the numbers of patents and publications, which suggests that these are areas of technology expansion. An in-depth analysis of neuromodulation-device patents revealed that the majority of well-performing patents were related to deep brain stimulation. Patent and publication data may be used to quantitatively evaluate technological innovation in neurosurgery.
New computing systems and their impact on structural analysis and design
NASA Technical Reports Server (NTRS)
Noor, Ahmed K.
1989-01-01
A review is given of the recent advances in computer technology that are likely to impact structural analysis and design. The computational needs for future structures technology are described. The characteristics of new and projected computing systems are summarized. Advances in programming environments, numerical algorithms, and computational strategies for new computing systems are reviewed, and a novel partitioning strategy is outlined for maximizing the degree of parallelism. The strategy is designed for computers with a shared memory and a small number of powerful processors (or a small number of clusters of medium-range processors). It is based on approximating the response of the structure by a combination of symmetric and antisymmetric response vectors, each obtained using a fraction of the degrees of freedom of the original finite element model. The strategy was implemented on the CRAY X-MP/4 and the Alliant FX/8 computers. For nonlinear dynamic problems on the CRAY X-MP with four CPUs, it resulted in an order of magnitude reduction in total analysis time, compared with the direct analysis on a single-CPU CRAY X-MP machine.
Hu, Yilin; Ribbe, Markus W.
2013-01-01
Nitrogenase contains two unique metalloclusters: the P-cluster and the M-cluster. The assembly processes of P- and M-clusters are arguably the most complicated processes in bioinorganic chemistry. There is considerable interest in decoding the biosynthetic mechanisms of the P- and M-clusters, because these clusters are not only biologically important, but also chemically unprecedented. Understanding the assembly mechanisms of these unique metalloclusters is crucial for understanding the structure-function relationship of nitrogenase. Here, we review the recent advances in this research area, with an emphasis on our work that provide important insights into the biosynthetic pathways of these high-nuclearity metal centers. PMID:23232096
Advances in single-molecule magnet surface patterning through microcontact printing.
Mannini, Matteo; Bonacchi, Daniele; Zobbi, Laura; Piras, Federica M; Speets, Emiel A; Caneschi, Andrea; Cornia, Andrea; Magnani, Agnese; Ravoo, Bart Jan; Reinhoudt, David N; Sessoli, Roberta; Gatteschi, Dante
2005-07-01
We present an implementation of strategies to deposit single-molecule magnets (SMMs) using microcontact printing microCP). We describe different approaches of microCP to print stripes of a sulfur-functionalized dodecamanganese (III, IV) cluster on gold surfaces. Comparison by atomic force microscopy profile analysis of the patterned structures confirms the formation of a chemically stable single layer of SMMs. Images based on chemical contrast, obtained by time-of-flight secondary ion mass spectrometry, confirm the patterned structure.
1997-06-01
composites. The topics ranged from molecular clusters, nanophase materials, growth, processing, and synthesis. Commercial composite materials have been on...example, an analysis of the emission from a GaAs target shows mainly (99.4%) neutral Ga and As atoms. [63] However, the fraction of molecular species...sputtered from ionic crystals can be considerably higher. [64] There is evidence that a large fraction of the molecular species originate from
NASA Astrophysics Data System (ADS)
Saragih, I. J. A.; Meygatama, A. G.; Sugihartati, F. M.; Sidauruk, M.; Mulsandi, A.
2018-03-01
During 2016, there are frequent heavy rains in the Bojonegoro region, one of which is rain on 9 February 2016. The occurrence of heavy rainfall can cause the floods that inundate the settlements, rice fields, roads, and public facilities. This makes it important to analyze the atmospheric conditions during the heavy rainfall events in Bojonegoro. One of the analytical methods that can be used is using WRF-Advanced Research WRF (WRF-ARW) model. This study was conducted by comparing the rain analysis from WRF-ARW model with the Himawari-8 satellite imagery. The data used are Final Analysis (FNL) data for the WRF-ARW model and infrared (IR) channel for Himawari-8 satellite imagery. The data are processed into the time-series images and then analyzed descriptively. The meteorological parameters selected to be analyzed are relative humidity, vortices, divergences, air stability index, and precipitation. These parameters are expected to indicate the existence of a convective activity in Bojonegoro during the heavy rainfall event. The Himawari-8 satellite imagery shows that there is a cluster of convective clouds in Bojonegoro during the heavy rainfall event. The lowest value of the cloud top temperature indicates that the cluster of convective clouds is a cluster of Cumulonimbus cloud (CB).
Molecular subtyping of bladder cancer using Kohonen self-organizing maps
Borkowska, Edyta M; Kruk, Andrzej; Jedrzejczyk, Adam; Rozniecki, Marek; Jablonowski, Zbigniew; Traczyk, Magdalena; Constantinou, Maria; Banaszkiewicz, Monika; Pietrusinski, Michal; Sosnowski, Marek; Hamdy, Freddie C; Peter, Stefan; Catto, James WF; Kaluzewski, Bogdan
2014-01-01
Kohonen self-organizing maps (SOMs) are unsupervised Artificial Neural Networks (ANNs) that are good for low-density data visualization. They easily deal with complex and nonlinear relationships between variables. We evaluated molecular events that characterize high- and low-grade BC pathways in the tumors from 104 patients. We compared the ability of statistical clustering with a SOM to stratify tumors according to the risk of progression to more advanced disease. In univariable analysis, tumor stage (log rank P = 0.006) and grade (P < 0.001), HPV DNA (P < 0.004), Chromosome 9 loss (P = 0.04) and the A148T polymorphism (rs 3731249) in CDKN2A (P = 0.02) were associated with progression. Multivariable analysis of these parameters identified that tumor grade (Cox regression, P = 0.001, OR.2.9 (95% CI 1.6–5.2)) and the presence of HPV DNA (P = 0.017, OR 3.8 (95% CI 1.3–11.4)) were the only independent predictors of progression. Unsupervised hierarchical clustering grouped the tumors into discreet branches but did not stratify according to progression free survival (log rank P = 0.39). These genetic variables were presented to SOM input neurons. SOMs are suitable for complex data integration, allow easy visualization of outcomes, and may stratify BC progression more robustly than hierarchical clustering. PMID:25142434
Merging history of three bimodal clusters
NASA Astrophysics Data System (ADS)
Maurogordato, S.; Sauvageot, J. L.; Bourdin, H.; Cappi, A.; Benoist, C.; Ferrari, C.; Mars, G.; Houairi, K.
2011-01-01
We present a combined X-ray and optical analysis of three bimodal galaxy clusters selected as merging candidates at z ~ 0.1. These targets are part of MUSIC (MUlti-Wavelength Sample of Interacting Clusters), which is a general project designed to study the physics of merging clusters by means of multi-wavelength observations. Observations include spectro-imaging with XMM-Newton EPIC camera, multi-object spectroscopy (260 new redshifts), and wide-field imaging at the ESO 3.6 m and 2.2 m telescopes. We build a global picture of these clusters using X-ray luminosity and temperature maps together with galaxy density and velocity distributions. Idealized numerical simulations were used to constrain the merging scenario for each system. We show that A2933 is very likely an equal-mass advanced pre-merger ~200 Myr before the core collapse, while A2440 and A2384 are post-merger systems (~450 Myr and ~1.5 Gyr after core collapse, respectively). In the case of A2384, we detect a spectacular filament of galaxies and gas spreading over more than 1 h-1 Mpc, which we infer to have been stripped during the previous collision. The analysis of the MUSIC sample allows us to outline some general properties of merging clusters: a strong luminosity segregation of galaxies in recent post-mergers; the existence of preferential axes - corresponding to the merging directions - along which the BCGs and structures on various scales are aligned; the concomitance, in most major merger cases, of secondary merging or accretion events, with groups infalling onto the main cluster, and in some cases the evidence of previous merging episodes in one of the main components. These results are in good agreement with the hierarchical scenario of structure formation, in which clusters are expected to form by successive merging events, and matter is accreted along large-scale filaments. Based on data obtained with the European Southern Observatory, Chile (programs 072.A-0595, 075.A-0264, and 079.A-0425).Tables 5-7 are only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/525/A79
OGLE II Eclipsing Binaries In The LMC: Analysis With Class
NASA Astrophysics Data System (ADS)
Devinney, Edward J.; Prsa, A.; Guinan, E. F.; DeGeorge, M.
2011-01-01
The Eclipsing Binaries (EBs) via Artificial Intelligence (EBAI) Project is applying machine learning techniques to elucidate the nature of EBs. Previously, Prsa, et al. applied artificial neural networks (ANNs) trained on physically-realistic Wilson-Devinney models to solve the light curves of the 1882 detached EBs in the LMC discovered by the OGLE II Project (Wyrzykowski, et al.) fully automatically, bypassing the need for manually-derived starting solutions. A curious result is the non-monotonic distribution of the temperature ratio parameter T2/T1, featuring a subsidiary peak noted previously by Mazeh, et al. in an independent analysis using the EBOP EB solution code (Tamuz, et al.). To explore this and to gain a fuller understanding of the multivariate EBAI LMC observational plus solutions data, we have employed automatic clustering and advanced visualization (CAV) techniques. Clustering the OGLE II data aggregates objects that are similar with respect to many parameter dimensions. Measures of similarity for example, could include the multidimensional Euclidean Distance between data objects, although other measures may be appropriate. Applying clustering, we find good evidence that the T2/T1 subsidiary peak is due to evolved binaries, in support of Mazeh et al.'s speculation. Further, clustering suggests that the LMC detached EBs occupying the main sequence region belong to two distinct classes. Also identified as a separate cluster in the multivariate data are stars having a Period-I band relation. Derekas et al. had previously found a Period-K band relation for LMC EBs discovered by the MACHO Project (Alcock, et al.). We suggest such CAV techniques will prove increasingly useful for understanding the large, multivariate datasets increasingly being produced in astronomy. We are grateful for the support of this research from NSF/RUI Grant AST-05-75042 f.
Dynamic gene expression analysis in a H1N1 influenza virus mouse pneumonia model.
Bao, Yanyan; Gao, Yingjie; Shi, Yujing; Cui, Xiaolan
2017-06-01
H1N1, a major pathogenic subtype of influenza A virus, causes a respiratory infection in humans and livestock that can range from a mild infection to more severe pneumonia associated with acute respiratory distress syndrome. Understanding the dynamic changes in the genome and the related functional changes induced by H1N1 influenza virus infection is essential to elucidating the pathogenesis of this virus and thereby determining strategies to prevent future outbreaks. In this study, we filtered the significantly expressed genes in mouse pneumonia using mRNA microarray analysis. Using STC analysis, seven significant gene clusters were revealed, and using STC-GO analysis, we explored the significant functions of these seven gene clusters. The results revealed GOs related to H1N1 virus-induced inflammatory and immune functions, including innate immune response, inflammatory response, specific immune response, and cellular response to interferon-beta. Furthermore, the dynamic regulation relationships of the key genes in mouse pneumonia were revealed by dynamic gene network analysis, and the most important genes were filtered, including Dhx58, Cxcl10, Cxcl11, Zbp1, Ifit1, Ifih1, Trim25, Mx2, Oas2, Cd274, Irgm1, and Irf7. These results suggested that during mouse pneumonia, changes in the expression of gene clusters and the complex interactions among genes lead to significant changes in function. Dynamic gene expression analysis revealed key genes that performed important functions. These results are a prelude to advancements in mouse H1N1 influenza virus infection biology, as well as the use of mice as a model organism for human H1N1 influenza virus infection studies.
Hubble Sees an Ancient Globular Cluster
2017-12-08
This image captures the stunning NGC 6535, a globular cluster 22,000 light-years away in the constellation of Serpens (The Serpent) that measures one light-year across. Globular clusters are tightly bound groups of stars which orbit galaxies. The large mass in the rich stellar centre of the globular cluster pulls the stars inward to form a ball of stars. The word globulus, from which these clusters take their name, is Latin for small sphere. Globular clusters are generally very ancient objects formed around the same time as their host galaxy. To date, no new star formation has been observed within a globular cluster, which explains the abundance of aging yellow stars in this image, most of them containing very few heavy elements. NGC 6535 was first discovered in 1852 by English astronomer John Russell Hind. The cluster would have appeared to Hind as a small, faint smudge through his telescope. Now, over 160 years later, instruments like the Advanced Camera for Surveys (ACS) and Wide Field Camera 3 (WFC3) on the NASA/ European Space Agency (ESA) Hubble Space Telescope allow us to marvel at the cluster and its contents in greater detail. Credit: ESA/Hubble & NASA, Acknowledgement: Gilles Chapdelaine NASA image use policy. NASA Goddard Space Flight Center enables NASA’s mission through four scientific endeavors: Earth Science, Heliophysics, Solar System Exploration, and Astrophysics. Goddard plays a leading role in NASA’s accomplishments by contributing compelling scientific knowledge to advance the Agency’s mission. Follow us on Twitter Like us on Facebook Find us on Instagram
ERIC Educational Resources Information Center
Brulles, Dina; Winebrenner, Susan
2012-01-01
Schools need to address the needs of their students with high ability. Not only does this raise achievement levels schoolwide, it also attracts students from surrounding districts and recaptures advanced learners who left the school because their needs weren't being met. One practical intervention--cluster grouping--provides an inclusive…
Balouchestani, Mohammadreza; Krishnan, Sridhar
2014-01-01
Long-term recording of Electrocardiogram (ECG) signals plays an important role in health care systems for diagnostic and treatment purposes of heart diseases. Clustering and classification of collecting data are essential parts for detecting concealed information of P-QRS-T waves in the long-term ECG recording. Currently used algorithms do have their share of drawbacks: 1) clustering and classification cannot be done in real time; 2) they suffer from huge energy consumption and load of sampling. These drawbacks motivated us in developing novel optimized clustering algorithm which could easily scan large ECG datasets for establishing low power long-term ECG recording. In this paper, we present an advanced K-means clustering algorithm based on Compressed Sensing (CS) theory as a random sampling procedure. Then, two dimensionality reduction methods: Principal Component Analysis (PCA) and Linear Correlation Coefficient (LCC) followed by sorting the data using the K-Nearest Neighbours (K-NN) and Probabilistic Neural Network (PNN) classifiers are applied to the proposed algorithm. We show our algorithm based on PCA features in combination with K-NN classifier shows better performance than other methods. The proposed algorithm outperforms existing algorithms by increasing 11% classification accuracy. In addition, the proposed algorithm illustrates classification accuracy for K-NN and PNN classifiers, and a Receiver Operating Characteristics (ROC) area of 99.98%, 99.83%, and 99.75% respectively.
Simulating Asymmetric Top Impurities in Superfluid Clusters: A para-Water Dopant in para-Hydrogen.
Zeng, Tao; Li, Hui; Roy, Pierre-Nicholas
2013-01-03
We present the first simulation study of bosonic clusters doped with an asymmetric top molecule. The path-integral Monte Carlo method with the latest methodological advance in treating rigid-body rotation [Noya, E. G.; Vega, C.; McBride, C. J. Chem. Phys.2011, 134, 054117] is employed to study a para-water impurity in para-hydrogen clusters with up to 20 para-hydrogen molecules. The growth pattern of the doped clusters is similar in nature to that of pure clusters. The para-water molecule appears to rotate freely in the cluster. The presence of para-water substantially quenches the superfluid response of para-hydrogen with respect to the space-fixed frame.
Monte-Carlo simulation of defect-cluster nucleation in metals during irradiation
NASA Astrophysics Data System (ADS)
Nakasuji, Toshiki; Morishita, Kazunori; Ruan, Xiaoyong
2017-02-01
A multiscale modeling approach was applied to investigate the nucleation process of CRPs (copper rich precipitates, i.e., copper-vacancy clusters) in α-Fe containing 1 at.% Cu during irradiation. Monte-Carlo simulations were performed to investigate the nucleation process, with the rate theory equation analysis to evaluate the concentration of displacement defects, along with the molecular dynamics technique to know CRP thermal stabilities in advance. Our MC simulations showed that there is long incubation period at first, followed by a rapid growth of CRPs. The incubation period depends on irradiation conditions such as the damage rate and temperature. CRP's composition during nucleation varies with time. The copper content of CRPs shows relatively rich at first, and then becomes poorer as the precipitate size increases. A widely-accepted model of CRP nucleation process is finally proposed.
Hadjisolomou, Ekaterini; Stefanidis, Konstantinos; Papatheodorou, George; Papastergiadou, Evanthia
2018-03-19
During the last decades, Mediterranean freshwater ecosystems, especially lakes, have been under severe pressure due to increasing eutrophication and water quality deterioration. In this article, we compared the effectiveness of different data analysis methods by assessing the contribution of environmental parameters to eutrophication processes. For this purpose, principal components analysis (PCA), cluster analysis, and a self-organizing map (SOM) were applied, using water quality data from two transboundary lakes of North Greece. SOM is considered as an advanced and powerful data analysis tool because of its ability to represent complex and nonlinear relationships among multivariate data sets. The results of PCA and cluster analysis agreed with the SOM results, although the latter provided more information because of the visualization abilities regarding the parameters' relationships. Besides nutrients that were found to be a key factor for controlling chlorophyll-a (Chl - a), water temperature was related positively with algal production, while the Secchi disk depth parameter was found to be highly important and negatively related toeutrophic conditions. In general, the SOM results were more specific and allowed direct associations between the water quality variables. Our work showed that SOMs can be used effectively in limnological studies to produce robust and interpretable results, aiding scientists and managers to cope with environmental problems such as eutrophication.
Accelerating epistasis analysis in human genetics with consumer graphics hardware.
Sinnott-Armstrong, Nicholas A; Greene, Casey S; Cancare, Fabio; Moore, Jason H
2009-07-24
Human geneticists are now capable of measuring more than one million DNA sequence variations from across the human genome. The new challenge is to develop computationally feasible methods capable of analyzing these data for associations with common human disease, particularly in the context of epistasis. Epistasis describes the situation where multiple genes interact in a complex non-linear manner to determine an individual's disease risk and is thought to be ubiquitous for common diseases. Multifactor Dimensionality Reduction (MDR) is an algorithm capable of detecting epistasis. An exhaustive analysis with MDR is often computationally expensive, particularly for high order interactions. This challenge has previously been met with parallel computation and expensive hardware. The option we examine here exploits commodity hardware designed for computer graphics. In modern computers Graphics Processing Units (GPUs) have more memory bandwidth and computational capability than Central Processing Units (CPUs) and are well suited to this problem. Advances in the video game industry have led to an economy of scale creating a situation where these powerful components are readily available at very low cost. Here we implement and evaluate the performance of the MDR algorithm on GPUs. Of primary interest are the time required for an epistasis analysis and the price to performance ratio of available solutions. We found that using MDR on GPUs consistently increased performance per machine over both a feature rich Java software package and a C++ cluster implementation. The performance of a GPU workstation running a GPU implementation reduces computation time by a factor of 160 compared to an 8-core workstation running the Java implementation on CPUs. This GPU workstation performs similarly to 150 cores running an optimized C++ implementation on a Beowulf cluster. Furthermore this GPU system provides extremely cost effective performance while leaving the CPU available for other tasks. The GPU workstation containing three GPUs costs $2000 while obtaining similar performance on a Beowulf cluster requires 150 CPU cores which, including the added infrastructure and support cost of the cluster system, cost approximately $82,500. Graphics hardware based computing provides a cost effective means to perform genetic analysis of epistasis using MDR on large datasets without the infrastructure of a computing cluster.
NASA Astrophysics Data System (ADS)
Cuca, Branka; Brumana, Raffaella; Oreni, Daniela; Iannaccone, Giuliana; Sesana, Marta Maria
2014-03-01
Steady technological progress has led to a noticeable advancement in disciplines associated with Earth observation. This has enabled information transition regarding changing scenarios, both natural and urban, to occur in (almost) real time. In particular, the need for integration on a local scale with the wider territorial framework has occurred in analysis and monitoring of built environments over the last few decades. The progress of Geographic Information (GI) science has provided significant advancements when it comes to spatial analysis, while the almost free availability of the internet has ensured a fast and constant exchange of geo-information, even for everyday users' requirements. Due to its descriptive and semantic nature, geo-spatial information is capable of providing a complete overview of a certain phenomenon and of predicting the implications within the natural, social and economic context. However, in order to integrate geospatial data into decision making processes, it is necessary to provide a specific context, which is well supported by verified data. This paper investigates the potentials of geo-portals as planning instruments developed to share multi-temporal/multi-scale spatial data, responding to specific end-users' demands in the case of Energy efficiency in Buildings (EeB) across European countries. The case study regards the GeoCluster geo-portal and mapping tool (Project GE2O, FP7), built upon a GeoClustering methodology for mapping of indicators relevant for energy efficiency technologies in the construction sector.
Summarizing metocean operating conditions as a climatology of marine hazards
NASA Astrophysics Data System (ADS)
Reid, Heather; Finnis, Joel
2018-03-01
Marine occupations are plagued by some of the highest accident and mortality rates of any occupation, due in part to the variety and severity of environmental hazards presented by the ocean environment. In order to better study and communicate the potential impacts of these hazards on occupational health and safety, a semi-objective, hazard-focused climatology of a particularly dangerous marine environment (Northwestern Atlantic) has been developed. Specifically, climate has been summarized as the frequency with which responsible government agencies are expected to issue relevant warnings or watches, couching results in language relevant to marine stakeholders. Applying cluster analysis to warning/watch frequencies identified seven distinct `hazard climatologies', ranging from near-Arctic conditions to areas dominated by calm seas and warm waters. Spatial and temporal variability in these clusters reflects relevant annual cycles, such as the advance/retreat of sea ice and shifts in the Atlantic storm track; the clusters also highlight regions and seasons with comparable operational risks. Our approach is proposed as an effective means to summarize and communicate marine risk with stakeholders, and a potential framework for describing climate change impacts.
SIMS of organics—Advances in 2D and 3D imaging and future outlook
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gilmore, Ian S.
Secondary ion mass spectrometry (SIMS) has become a powerful technique for the label-free analysis of organics from cells to electronic devices. The development of cluster ion sources has revolutionized the field, increasing the sensitivity for organics by two or three orders of magnitude and for large clusters, such as C{sub 60} and argon clusters, allowing depth profiling of organics. The latter has provided the capability to generate stunning three dimensional images with depth resolutions of around 5 nm, simply unavailable by other techniques. Current state-of-the-art allows molecular images with a spatial resolution of around 500 nm to be achieved andmore » future developments are likely to progress into the sub-100 nm regime. This review is intended to bring those with some familiarity with SIMS up-to-date with the latest developments for organics, the fundamental principles that underpin this and define the future progress. State-of-the-art examples are showcased and signposts to more in-depth reviews about specific topics given for the specialist.« less
NASA Astrophysics Data System (ADS)
Koga, Yoshihiro; Kadono, Takeshi; Shigematsu, Satoshi; Hirose, Ryo; Onaka-Masada, Ayumi; Okuyama, Ryousuke; Okuda, Hidehiko; Kurita, Kazunari
2018-06-01
We propose a fabrication process for silicon wafers by combining carbon-cluster ion implantation and room-temperature bonding for advanced CMOS image sensors. These carbon-cluster ions are made of carbon and hydrogen, which can passivate process-induced defects. We demonstrated that this combination process can be used to form an epitaxial layer on a carbon-cluster ion-implanted Czochralski (CZ)-grown silicon substrate with a high dose of 1 × 1016 atoms/cm2. This implantation condition transforms the top-surface region of the CZ-grown silicon substrate into a thin amorphous layer. Thus, an epitaxial layer cannot be grown on this implanted CZ-grown silicon substrate. However, this combination process can be used to form an epitaxial layer on the amorphous layer of this implanted CZ-grown silicon substrate surface. This bonding wafer has strong gettering capability in both the wafer-bonding region and the carbon-cluster ion-implanted projection range. Furthermore, this wafer inhibits oxygen out-diffusion to the epitaxial layer from the CZ-grown silicon substrate after device fabrication. Therefore, we believe that this bonding wafer is effective in decreasing the dark current and white-spot defect density for advanced CMOS image sensors.
Preparation and Luminescence Thermochromism of Tetranuclear Copper(I)-Pyridine-Iodide Clusters
ERIC Educational Resources Information Center
Parmeggiani, Fabio; Sacchetti, Alessandro
2012-01-01
A simple and straightforward synthesis of a tetranuclear copper(I)-pyridine-iodide cluster is described as a laboratory experiment for advanced inorganic chemistry undergraduate students. The product is used to demonstrate the fascinating and visually impressive phenomenon of luminescence thermochromism: exposed to long-wave UV light, the…
Clustering for the Development of Engineering Students' Use of Enterprise 3.0
ERIC Educational Resources Information Center
Bassus, Olaf; Ahrens, Andreas; Zascerinska, Jelena
2011-01-01
Enterprise 3.0 which penetrates our society more thoroughly with the availability of broadband services has already been widely integrated into the contemporary processes and work environments. The synergy between Enterprise 3.0 and clustering advances innovation-stimulating environments in engineering education. The present research proposes…
75 FR 40857 - Webinar About Advanced Defense Technologies RFP
Federal Register 2010, 2011, 2012, 2013, 2014
2010-07-14
... Defense Technologies RFP. Please visit http://www.sba.gov/clusters/index.html for more information. The RFP may be found on http://www.fedbizopps.gov . LOGISTICAL INFORMATION: The webinar will be held on Monday, July 19, 2010. For details, please visit http://www.sba.gov/clusters/index.html . SUPPLEMENTARY...
NASA Astrophysics Data System (ADS)
Elangasinghe, M. A.; Singhal, N.; Dirks, K. N.; Salmond, J. A.; Samarasinghe, S.
2014-09-01
This paper uses artificial neural networks (ANN), combined with k-means clustering, to understand the complex time series of PM10 and PM2.5 concentrations at a coastal location of New Zealand based on data from a single site. Out of available meteorological parameters from the network (wind speed, wind direction, solar radiation, temperature, relative humidity), key factors governing the pattern of the time series concentrations were identified through input sensitivity analysis performed on the trained neural network model. The transport pathways of particulate matter under these key meteorological parameters were further analysed through bivariate concentration polar plots and k-means clustering techniques. The analysis shows that the external sources such as marine aerosols and local sources such as traffic and biomass burning contribute equally to the particulate matter concentrations at the study site. These results are in agreement with the results of receptor modelling by the Auckland Council based on Positive Matrix Factorization (PMF). Our findings also show that contrasting concentration-wind speed relationships exist between marine aerosols and local traffic sources resulting in very noisy and seemingly large random PM10 concentrations. The inclusion of cluster rankings as an input parameter to the ANN model showed a statistically significant (p < 0.005) improvement in the performance of the ANN time series model and also showed better performance in picking up high concentrations. For the presented case study, the correlation coefficient between observed and predicted concentrations improved from 0.77 to 0.79 for PM2.5 and from 0.63 to 0.69 for PM10 and reduced the root mean squared error (RMSE) from 5.00 to 4.74 for PM2.5 and from 6.77 to 6.34 for PM10. The techniques presented here enable the user to obtain an understanding of potential sources and their transport characteristics prior to the implementation of costly chemical analysis techniques or advanced air dispersion models.
Science Highlights from the First Year of Advanced Camera for Surveys
NASA Technical Reports Server (NTRS)
Clampin, M.; Ford, H. C.; Illingworth, G. D.; Hartig, G.; Ardila, D. R.; Blakeslee, J. P.; Bouwens, R. J.; Cross, N. J. G.; Feldman, P. D.; Golimowski, D. A.
2003-01-01
The Advanced Camera for Surveys (ACS) is a deep imaging camera installed on the Hubble Space Telescope during the fourth HST servicing mission. ACS recently entered its second year of science operations and continues to perform beyond pre-launch expectations. We present science highlights from the ACS Science Team's GTO program. These highlights include the evolution of Z approx. 6 galaxies from deep imaging observations; deep imaging of strongly lensed clusters which have been used to determine cluster mass, and independently constraint the geometry of the Universe; and coronagraphic observations of debris disks.
Lopez, Xavier Moles; Debeir, Olivier; Maris, Calliope; Rorive, Sandrine; Roland, Isabelle; Saerens, Marco; Salmon, Isabelle; Decaestecker, Christine
2012-09-01
Whole-slide scanners allow the digitization of an entire histological slide at very high resolution. This new acquisition technique opens a wide range of possibilities for addressing challenging image analysis problems, including the identification of tissue-based biomarkers. In this study, we use whole-slide scanner technology for imaging the proliferating activity patterns in tumor slides based on Ki67 immunohistochemistry. Faced with large images, pathologists require tools that can help them identify tumor regions that exhibit high proliferating activity, called "hot-spots" (HSs). Pathologists need tools that can quantitatively characterize these HS patterns. To respond to this clinical need, the present study investigates various clustering methods with the aim of identifying Ki67 HSs in whole tumor slide images. This task requires a method capable of identifying an unknown number of clusters, which may be highly variable in terms of shape, size, and density. We developed a hybrid clustering method, referred to as Seedlink. Compared to manual HS selections by three pathologists, we show that Seedlink provides an efficient way of detecting Ki67 HSs and improves the agreement among pathologists when identifying HSs. Copyright © 2012 International Society for Advancement of Cytometry.
Jolani, Shahab
2018-03-01
In health and medical sciences, multiple imputation (MI) is now becoming popular to obtain valid inferences in the presence of missing data. However, MI of clustered data such as multicenter studies and individual participant data meta-analysis requires advanced imputation routines that preserve the hierarchical structure of data. In clustered data, a specific challenge is the presence of systematically missing data, when a variable is completely missing in some clusters, and sporadically missing data, when it is partly missing in some clusters. Unfortunately, little is known about how to perform MI when both types of missing data occur simultaneously. We develop a new class of hierarchical imputation approach based on chained equations methodology that simultaneously imputes systematically and sporadically missing data while allowing for arbitrary patterns of missingness among them. Here, we use a random effect imputation model and adopt a simplification over fully Bayesian techniques such as Gibbs sampler to directly obtain draws of parameters within each step of the chained equations. We justify through theoretical arguments and extensive simulation studies that the proposed imputation methodology has good statistical properties in terms of bias and coverage rates of parameter estimates. An illustration is given in a case study with eight individual participant datasets. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Reactions and properties of clusters
NASA Astrophysics Data System (ADS)
Castleman, A. W., Jr.
1992-09-01
The elucidation from a molecular point of view of the differences and similarities in the properties and reactivity of matter in the gaseous compared to the condensed state is a subject of considerable current interest. One of the promising approaches to this problem is to utilize mass spectrometry in conjunction with laser spectroscopy and fast-flow reaction devices to investigate the changing properties, structure and reactivity of clusters as a function of the degree of solvation under well-controlled conditions. In this regard, an investigation of molecular cluster ions has provided considerable new insight into the basic mechanisms of ion reactions within a cluster, and this paper reviews some of the recent advances in cluster production, the origin of magic numbers and relationship to cluster ion stabilities, and solvation effects on reactions. There have been some notable advances in the production of large cluster ions under thermal reaction conditions, enabling a systematic study of the influence of solvation on reactions to be carried out. These and other new studies of magic numbers have traced their origin to the thermochemical stability of cluster ions. There are several classes of reaction where solvation has a notable influence on reactivity. A particularly interesting example comes from recent studies of the reactions of the hydroxyl anion with CO2 and SO2, studied as a function of the degree of hydration of OH-. Both reactions are highly exothermic, yet the differences in reactivity are dramatic. In the case of SO2, the reaction occurs at near the collision rate. By contrast, CO2 reactivity plummets dramatically for clusters having more than four water molecules. The slow rate is in accord with observations in the liquid phase.
Using Cluster Analysis to Examine Husband-Wife Decision Making
ERIC Educational Resources Information Center
Bonds-Raacke, Jennifer M.
2006-01-01
Cluster analysis has a rich history in many disciplines and although cluster analysis has been used in clinical psychology to identify types of disorders, its use in other areas of psychology has been less popular. The purpose of the current experiments was to use cluster analysis to investigate husband-wife decision making. Cluster analysis was…
NASA Astrophysics Data System (ADS)
Chen, Siyue; Leung, Henry; Dondo, Maxwell
2014-05-01
As computer network security threats increase, many organizations implement multiple Network Intrusion Detection Systems (NIDS) to maximize the likelihood of intrusion detection and provide a comprehensive understanding of intrusion activities. However, NIDS trigger a massive number of alerts on a daily basis. This can be overwhelming for computer network security analysts since it is a slow and tedious process to manually analyse each alert produced. Thus, automated and intelligent clustering of alerts is important to reveal the structural correlation of events by grouping alerts with common features. As the nature of computer network attacks, and therefore alerts, is not known in advance, unsupervised alert clustering is a promising approach to achieve this goal. We propose a joint optimization technique for feature selection and clustering to aggregate similar alerts and to reduce the number of alerts that analysts have to handle individually. More precisely, each identified feature is assigned a binary value, which reflects the feature's saliency. This value is treated as a hidden variable and incorporated into a likelihood function for clustering. Since computing the optimal solution of the likelihood function directly is analytically intractable, we use the Expectation-Maximisation (EM) algorithm to iteratively update the hidden variable and use it to maximize the expected likelihood. Our empirical results, using a labelled Defense Advanced Research Projects Agency (DARPA) 2000 reference dataset, show that the proposed method gives better results than the EM clustering without feature selection in terms of the clustering accuracy.
Intermediate and advanced topics in multilevel logistic regression analysis
Merlo, Juan
2017-01-01
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher‐level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within‐cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population‐average effect of covariates measured at the subject and cluster level, in contrast to the within‐cluster or cluster‐specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster‐level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28543517
Wagner, Sara E; Bauer, Sarah E; Bayakly, A Rana; Vena, John E
2013-01-01
Limited research has been conducted to describe the geographical clustering and distribution of prostate cancer (PrCA) incidence in Georgia (GA). This study describes and compares the temporal and geographic trends of PrCA incidence in GA with a specific focus on racial disparities. GA Comprehensive Cancer Registry PrCA incidence data were obtained for 1998-2008. Directly standardized age-adjusted PrCA incidence rates per 100,000 were analyzed by race, stage, grade, and county. County-level hotspots of PrCA incidence were analyzed with the Getis-Ord Gi* statistic in a geographic information system; a census tract-level cluster analysis was performed with a Discrete Poisson model and implemented in SaTScan(®) software. Significant (p < 0.05) hotspots of PrCA incidence were observed in nine southwestern counties and six centrally located counties among men of both races. Six significant (p < 0.1) clusters of PrCA incidence rates were detected for men of both races in north and northwest central Georgia. When stratified by race, clusters among white and black men were similar, although centroids were slightly shifted. Most notably, a large (122 km radius) cluster in northwest central Georgia was detected only in whites, and two smaller clusters (0-32 km radii) were detected in Southwest Georgia only in black men. Clusters of high-grade and late-stage tumors were identified primarily in the northern portion of the state among men of both races. This study revealed a pattern of higher incidence and more advanced disease in northern and northwest central Georgia, highlighting geographic patterns that need more research and investigation of possible environmental determinants.
Jung, Chanho; Kim, Changick
2014-08-01
Automatic segmentation of cell nuclei clusters is a key building block in systems for quantitative analysis of microscopy cell images. For that reason, it has received a great attention over the last decade, and diverse automatic approaches to segment clustered nuclei with varying levels of performance under different test conditions have been proposed in literature. To the best of our knowledge, however, so far there is no comparative study on the methods. This study is a first attempt to fill this research gap. More precisely, the purpose of this study is to present an objective performance comparison of existing state-of-the-art segmentation methods. Particularly, the impact of their accuracy on classification of thyroid follicular lesions is also investigated "quantitatively" under the same experimental condition, to evaluate the applicability of the methods. Thirteen different segmentation approaches are compared in terms of not only errors in nuclei segmentation and delineation, but also their impact on the performance of system to classify thyroid follicular lesions using different metrics (e.g., diagnostic accuracy, sensitivity, specificity, etc.). Extensive experiments have been conducted on a total of 204 digitized thyroid biopsy specimens. Our study demonstrates that significant diagnostic errors can be avoided using more advanced segmentation approaches. We believe that this comprehensive comparative study serves as a reference point and guide for developers and practitioners in choosing an appropriate automatic segmentation technique adopted for building automated systems for specifically classifying follicular thyroid lesions. © 2014 International Society for Advancement of Cytometry.
One hundred years of work design research: Looking back and looking forward.
Parker, Sharon K; Morgeson, Frederick P; Johns, Gary
2017-03-01
In this article we take a big picture perspective on work design research. In the first section of the paper we identify influential work design articles and use scientific mapping to identify distinct clusters of research. Pulling this material together, we identify five key work design perspectives that map onto distinct historical developments: (a) sociotechnical systems and autonomous work groups, (b) job characteristics model, (c) job demands-control model, (d) job demands-resources model, and (e) role theory. The grounding of these perspectives in the past is understandable, but we suggest that some of the distinction between clusters is convenient rather than substantive. Thus we also identify contemporary integrative perspectives on work design that build connections across the clusters and we argue that there is scope for further integration. In the second section of the paper, we review the role of Journal of Applied Psychology ( JAP ) in shaping work design research. We conclude that JAP has played a vital role in the advancement of this topic over the last 100 years. Nevertheless, we suspect that to continue to play a leading role in advancing the science and practice of work design, the journal might need to publish research that is broader, more contextualized, and team-oriented. In the third section, we address the impact of work design research on: applied psychology and management, disciplines beyond our own, management thinking, work practice, and national policy agendas. Finally, we draw together observations from our analysis and identify key future directions for the field. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Mitochondrial redox and pH signaling occurs in axonal and synaptic organelle clusters.
Breckwoldt, Michael O; Armoundas, Antonis A; Aon, Miguel A; Bendszus, Martin; O'Rourke, Brian; Schwarzländer, Markus; Dick, Tobias P; Kurz, Felix T
2016-03-22
Redox switches are important mediators in neoplastic, cardiovascular and neurological disorders. We recently identified spontaneous redox signals in neurons at the single mitochondrion level where transients of glutathione oxidation go along with shortening and re-elongation of the organelle. We now have developed advanced image and signal-processing methods to re-assess and extend previously obtained data. Here we analyze redox and pH signals of entire mitochondrial populations. In total, we quantified the effects of 628 redox and pH events in 1797 mitochondria from intercostal axons and neuromuscular synapses using optical sensors (mito-Grx1-roGFP2; mito-SypHer). We show that neuronal mitochondria can undergo multiple redox cycles exhibiting markedly different signal characteristics compared to single redox events. Redox and pH events occur more often in mitochondrial clusters (medium cluster size: 34.1 ± 4.8 μm(2)). Local clusters possess higher mitochondrial densities than the rest of the axon, suggesting morphological and functional inter-mitochondrial coupling. We find that cluster formation is redox sensitive and can be blocked by the antioxidant MitoQ. In a nerve crush paradigm, mitochondrial clusters form sequentially adjacent to the lesion site and oxidation spreads between mitochondria. Our methodology combines optical bioenergetics and advanced signal processing and allows quantitative assessment of entire mitochondrial populations.
1982-10-15
the two may interact. L1 *1 -35- REFERENCES [1] Arnold, Stephen J. (1979), "A Test for Clusters," Journal of Marketing Research , November, pp 545-551...of Marketing Research , August, pp 405-412. APPENDIX A RESULTS OF FACTOR ANALYSIS OF LIFE GOALS . . . . . -37- Ft-M AnSIS CF FEq. LIFE GOALS GM"L...Volume 5, Pre-intervention Recruiting Environ- ment, 1981. [9] Wind, Yoram (1978), "Issues and Advances in Segmentation Research," Journal of Marketing
NASA Astrophysics Data System (ADS)
Martinet, L.; Mayor, M.
The basic problems and analysis techniques in examining the morphology, dynamics, and interactions between star systems, galaxies, and galactic clusters are detailed. Attention is devoted to the dynamics of hot stellar systems, with note taken of the derivation and application of the Vlasov equation, Jean's theorem, and the virial equations. Observations of galactic structure and dynamics are reviewed, and consideration is directed toward environmental influences on galactic structure. For individual items see A84-15503 to A84-15505
New data clustering for RBF classifier of agriculture products from x-ray images
NASA Astrophysics Data System (ADS)
Casasent, David P.; Chen, Xuewen
1999-08-01
Classification of real-time x-ray images of randomly oriented touching pistachio nuts is discussed. The ultimate objective is the development of a subsystem for automated non-invasive detection of defective product items on a conveyor belt. We discuss the use of clustering and how it is vital to achieve useful classification. New clustering methods using class identify and new cluster classes are advanced and shown to be of use for this application. Radial basis function neural net classifiers are emphasized. We expect our results to be of use for other classifiers and applications.
Aoki, Shuichiro; Murata, Hiroshi; Fujino, Yuri; Matsuura, Masato; Miki, Atsuya; Tanito, Masaki; Mizoue, Shiro; Mori, Kazuhiko; Suzuki, Katsuyoshi; Yamashita, Takehiro; Kashiwagi, Kenji; Hirasawa, Kazunori; Shoji, Nobuyuki; Asaoka, Ryo
2017-12-01
To investigate the usefulness of the Octopus (Haag-Streit) EyeSuite's cluster trend analysis in glaucoma. Ten visual fields (VFs) with the Humphrey Field Analyzer (Carl Zeiss Meditec), spanning 7.7 years on average were obtained from 728 eyes of 475 primary open angle glaucoma patients. Mean total deviation (mTD) trend analysis and EyeSuite's cluster trend analysis were performed on various series of VFs (from 1st to 10th: VF1-10 to 6th to 10th: VF6-10). The results of the cluster-based trend analysis, based on different lengths of VF series, were compared against mTD trend analysis. Cluster-based trend analysis and mTD trend analysis results were significantly associated in all clusters and with all lengths of VF series. Between 21.2% and 45.9% (depending on VF series length and location) of clusters were deemed to progress when the mTD trend analysis suggested no progression. On the other hand, 4.8% of eyes were observed to progress using the mTD trend analysis when cluster trend analysis suggested no progression in any two (or more) clusters. Whole field trend analysis can miss local VF progression. Cluster trend analysis appears as robust as mTD trend analysis and useful to assess both sectorial and whole field progression. Cluster-based trend analyses, in particular the definition of two or more progressing cluster, may help clinicians to detect glaucomatous progression in a timelier manner than using a whole field trend analysis, without significantly compromising specificity. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Schaub, Jochen; Clemens, Christoph; Kaufmann, Hitto; Schulz, Torsten W
2012-01-01
Development of efficient bioprocesses is essential for cost-effective manufacturing of recombinant therapeutic proteins. To achieve further process improvement and process rationalization comprehensive data analysis of both process data and phenotypic cell-level data is essential. Here, we present a framework for advanced bioprocess data analysis consisting of multivariate data analysis (MVDA), metabolic flux analysis (MFA), and pathway analysis for mapping of large-scale gene expression data sets. This data analysis platform was applied in a process development project with an IgG-producing Chinese hamster ovary (CHO) cell line in which the maximal product titer could be increased from about 5 to 8 g/L.Principal component analysis (PCA), k-means clustering, and partial least-squares (PLS) models were applied to analyze the macroscopic bioprocess data. MFA and gene expression analysis revealed intracellular information on the characteristics of high-performance cell cultivations. By MVDA, for example, correlations between several essential amino acids and the product concentration were observed. Also, a grouping into rather cell specific productivity-driven and process control-driven processes could be unraveled. By MFA, phenotypic characteristics in glycolysis, glutaminolysis, pentose phosphate pathway, citrate cycle, coupling of amino acid metabolism to citrate cycle, and in the energy yield could be identified. By gene expression analysis 247 deregulated metabolic genes were identified which are involved, inter alia, in amino acid metabolism, transport, and protein synthesis.
Telescope Scientist on the Advanced X-ray Astrophysics Observatory
NASA Technical Reports Server (NTRS)
VanSpeybroeck, L.; Smith, Carl M. (Technical Monitor)
2002-01-01
This period included many scientific observations made with the Chandra Observatory. The results, as is well known, are spectacular. Fortunately, the High Resolution Mirror Assembly (HRMA) performance continues to be essentially identical to that predicted from ground calibration data. The Telescope Scientist Team has improved the mirror model to provide a more accurate description to the Chandra observers and enable them to reduce the systematic errors and uncertainties in their data reduction. We also have made considerable progress in improving the scattering model. There also has been progress in the scientific program. At this time 58 distant clusters of galaxies have been observed. We are performing a systematic analysis of this rather large data set for the purpose of determining absolute distances utilizing the Sunyaev Zel'dovich effect. These observations also have been used to study the evolution of the cluster baryon mass function and the cosmological constraints which result from this evolution.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lu, Chenyang; Niu, Liangliang; Chen, Nanjun
A grand challenge in material science is to understand the correlation between intrinsic properties and defect dynamics. Radiation tolerant materials are in great demand for safe operation and advancement of nuclear and aerospace systems. Unlike traditional approaches that rely on microstructural and nanoscale features to mitigate radiation damage, this study demonstrates enhancement of radiation tolerance with the suppression of void formation by two orders magnitude at elevated temperatures in equiatomic single-phase concentrated solid solution alloys, and more importantly, reveals its controlling mechanism through a detailed analysis of the depth distribution of defect clusters and an atomistic computer simulation. The enhancedmore » swelling resistance is attributed to the tailored interstitial defect cluster motion in the alloys from a long-range one-dimensional mode to a short-range three-dimensional mode, which leads to enhanced point defect recombination. Finally, the results suggest design criteria for next generation radiation tolerant structural alloys.« less
Multi-Nozzle Base Flow Model in the 10- by 10-Foot Supersonic Wind Tunnel
1964-02-21
Researchers check the setup of a multi-nozzle base flow model in the 10- by 10-Foot Supersonic Wind Tunnel at the National Aeronautics and Space Administration (NASA) Lewis Research Center. NASA researchers were struggling to understand the complex flow phenomena resulting from the use of multiple rocket engines. Robert Wasko and Theodore Cover of the Advanced Development and Evaluation Division’s analysis and operations sections conducted a set of tests in the 10- by 10 tunnel to further understand the flow issues. The Lewis researchers studied four and five-nozzle configurations in the 10- by 10 at simulated altitudes from 60,000 to 200,000 feet. The nozzles were gimbaled during some of the test runs to simulate steering. The flow field for the four-nozzle clusters was surveyed in the center and the lateral areas between the nozzles, whereas the five-nozzle cluster was surveyed in the lateral area only.
Gao, Wei; Tian, Yong; Xuan, Xiaopeng
2015-07-01
The cation-cation π-π stacking is uncommon but it is essential for the understanding of some supramolecular structures. We explore theoretically the nature of non-covalent interaction occurring in the stacked structure within modeled clusters of 1,3-dimethylimidazolium and halide. The evidences of the energy decomposition analysis (EDA) and reduced density gradient (RDG) approach are different from those of common π-π interaction. Isosurfaces with RDG also illustrate the strength of the titled π-π interaction and their region. Additionally, we find that the occurrence of this interaction is attributed to a few C-H···X interactions, as depicted using atom in molecule (AIM) method. This work presents a clear picture of the typical cation-cation π-π interaction and can serve to advance the understanding of this uncommon interaction. Copyright © 2015 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Lu, Chenyang; Niu, Liangliang; Chen, Nanjun; Jin, Ke; Yang, Taini; Xiu, Pengyuan; Zhang, Yanwen; Gao, Fei; Bei, Hongbin; Shi, Shi; He, Mo-Rigen; Robertson, Ian M.; Weber, William J.; Wang, Lumin
2016-12-01
A grand challenge in material science is to understand the correlation between intrinsic properties and defect dynamics. Radiation tolerant materials are in great demand for safe operation and advancement of nuclear and aerospace systems. Unlike traditional approaches that rely on microstructural and nanoscale features to mitigate radiation damage, this study demonstrates enhancement of radiation tolerance with the suppression of void formation by two orders magnitude at elevated temperatures in equiatomic single-phase concentrated solid solution alloys, and more importantly, reveals its controlling mechanism through a detailed analysis of the depth distribution of defect clusters and an atomistic computer simulation. The enhanced swelling resistance is attributed to the tailored interstitial defect cluster motion in the alloys from a long-range one-dimensional mode to a short-range three-dimensional mode, which leads to enhanced point defect recombination. The results suggest design criteria for next generation radiation tolerant structural alloys.
Rong, Junkang; Feltus, F. Alex; Waghmare, Vijay N.; Pierce, Gary J.; Chee, Peng W.; Draye, Xavier; Saranga, Yehoshua; Wright, Robert J.; Wilkins, Thea A.; May, O. Lloyd; Smith, C. Wayne; Gannaway, John R.; Wendel, Jonathan F.; Paterson, Andrew H.
2007-01-01
QTL mapping experiments yield heterogeneous results due to the use of different genotypes, environments, and sampling variation. Compilation of QTL mapping results yields a more complete picture of the genetic control of a trait and reveals patterns in organization of trait variation. A total of 432 QTL mapped in one diploid and 10 tetraploid interspecific cotton populations were aligned using a reference map and depicted in a CMap resource. Early demonstrations that genes from the non-fiber-producing diploid ancestor contribute to tetraploid lint fiber genetics gain further support from multiple populations and environments and advanced-generation studies detecting QTL of small phenotypic effect. Both tetraploid subgenomes contribute QTL at largely non-homeologous locations, suggesting divergent selection acting on many corresponding genes before and/or after polyploid formation. QTL correspondence across studies was only modest, suggesting that additional QTL for the target traits remain to be discovered. Crosses between closely-related genotypes differing by single-gene mutants yield profoundly different QTL landscapes, suggesting that fiber variation involves a complex network of interacting genes. Members of the lint fiber development network appear clustered, with cluster members showing heterogeneous phenotypic effects. Meta-analysis linked to synteny-based and expression-based information provides clues about specific genes and families involved in QTL networks. PMID:17565937
Rong, Junkang; Feltus, F Alex; Waghmare, Vijay N; Pierce, Gary J; Chee, Peng W; Draye, Xavier; Saranga, Yehoshua; Wright, Robert J; Wilkins, Thea A; May, O Lloyd; Smith, C Wayne; Gannaway, John R; Wendel, Jonathan F; Paterson, Andrew H
2007-08-01
QTL mapping experiments yield heterogeneous results due to the use of different genotypes, environments, and sampling variation. Compilation of QTL mapping results yields a more complete picture of the genetic control of a trait and reveals patterns in organization of trait variation. A total of 432 QTL mapped in one diploid and 10 tetraploid interspecific cotton populations were aligned using a reference map and depicted in a CMap resource. Early demonstrations that genes from the non-fiber-producing diploid ancestor contribute to tetraploid lint fiber genetics gain further support from multiple populations and environments and advanced-generation studies detecting QTL of small phenotypic effect. Both tetraploid subgenomes contribute QTL at largely non-homeologous locations, suggesting divergent selection acting on many corresponding genes before and/or after polyploid formation. QTL correspondence across studies was only modest, suggesting that additional QTL for the target traits remain to be discovered. Crosses between closely-related genotypes differing by single-gene mutants yield profoundly different QTL landscapes, suggesting that fiber variation involves a complex network of interacting genes. Members of the lint fiber development network appear clustered, with cluster members showing heterogeneous phenotypic effects. Meta-analysis linked to synteny-based and expression-based information provides clues about specific genes and families involved in QTL networks.
SparRec: An effective matrix completion framework of missing data imputation for GWAS
NASA Astrophysics Data System (ADS)
Jiang, Bo; Ma, Shiqian; Causey, Jason; Qiao, Linbo; Hardin, Matthew Price; Bitts, Ian; Johnson, Daniel; Zhang, Shuzhong; Huang, Xiuzhen
2016-10-01
Genome-wide association studies present computational challenges for missing data imputation, while the advances of genotype technologies are generating datasets of large sample sizes with sample sets genotyped on multiple SNP chips. We present a new framework SparRec (Sparse Recovery) for imputation, with the following properties: (1) The optimization models of SparRec, based on low-rank and low number of co-clusters of matrices, are different from current statistics methods. While our low-rank matrix completion (LRMC) model is similar to Mendel-Impute, our matrix co-clustering factorization (MCCF) model is completely new. (2) SparRec, as other matrix completion methods, is flexible to be applied to missing data imputation for large meta-analysis with different cohorts genotyped on different sets of SNPs, even when there is no reference panel. This kind of meta-analysis is very challenging for current statistics based methods. (3) SparRec has consistent performance and achieves high recovery accuracy even when the missing data rate is as high as 90%. Compared with Mendel-Impute, our low-rank based method achieves similar accuracy and efficiency, while the co-clustering based method has advantages in running time. The testing results show that SparRec has significant advantages and competitive performance over other state-of-the-art existing statistics methods including Beagle and fastPhase.
Molecular subtyping of bladder cancer using Kohonen self-organizing maps.
Borkowska, Edyta M; Kruk, Andrzej; Jedrzejczyk, Adam; Rozniecki, Marek; Jablonowski, Zbigniew; Traczyk, Magdalena; Constantinou, Maria; Banaszkiewicz, Monika; Pietrusinski, Michal; Sosnowski, Marek; Hamdy, Freddie C; Peter, Stefan; Catto, James W F; Kaluzewski, Bogdan
2014-10-01
Kohonen self-organizing maps (SOMs) are unsupervised Artificial Neural Networks (ANNs) that are good for low-density data visualization. They easily deal with complex and nonlinear relationships between variables. We evaluated molecular events that characterize high- and low-grade BC pathways in the tumors from 104 patients. We compared the ability of statistical clustering with a SOM to stratify tumors according to the risk of progression to more advanced disease. In univariable analysis, tumor stage (log rank P = 0.006) and grade (P < 0.001), HPV DNA (P < 0.004), Chromosome 9 loss (P = 0.04) and the A148T polymorphism (rs 3731249) in CDKN2A (P = 0.02) were associated with progression. Multivariable analysis of these parameters identified that tumor grade (Cox regression, P = 0.001, OR.2.9 (95% CI 1.6-5.2)) and the presence of HPV DNA (P = 0.017, OR 3.8 (95% CI 1.3-11.4)) were the only independent predictors of progression. Unsupervised hierarchical clustering grouped the tumors into discreet branches but did not stratify according to progression free survival (log rank P = 0.39). These genetic variables were presented to SOM input neurons. SOMs are suitable for complex data integration, allow easy visualization of outcomes, and may stratify BC progression more robustly than hierarchical clustering. © 2014 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
Implementing Enrichment Clusters in Elementary Schools: Lessons Learned
ERIC Educational Resources Information Center
Fiddyment, Gail E.
2014-01-01
Enrichment clusters offer a way for schools to encourage a high level of learning as students and adults work together to develop a product, service, or performance by applying advanced knowledge and authentic processes to real-world problems. This study utilized a qualitative research design to examine the perceptions and experiences of two…
Schootman, Mario; Jeffe, Donna B; Lian, Min; Gillanders, William E; Aft, Rebecca
2009-03-01
The authors examined disparities in survival among women aged 66 years or older in association with census-tract-level poverty rate, racial distribution, and individual-level factors, including patient-, treatment-, and tumor-related factors, utilization of medical care, and mammography use. They used linked data from the 1992-1999 Surveillance, Epidemiology, and End Results (SEER) programs, 1991-1999 Medicare claims, and the 1990 US Census. A geographic information system and advanced statistics identified areas of increased or reduced breast cancer survival and possible reasons for geographic variation in survival in 2 of the 5 SEER areas studied. In the Detroit, Michigan, area, one geographic cluster of shorter-than-expected breast cancer survival was identified (hazard ratio (HR) = 1.60). An additional area where survival was longer than expected approached statistical significance (HR = 0.4; P = 0.056). In the Atlanta, Georgia, area, one cluster of shorter- (HR = 1.81) and one cluster of longer-than-expected (HR = 0.72) breast cancer survival were identified. Stage at diagnosis and census-tract poverty (and patient's race in Atlanta) explained the geographic variation in breast cancer survival. No geographic clusters were identified in the 3 other SEER programs. Interventions to reduce late-stage breast cancer, focusing on areas of high poverty and targeting African Americans, may reduce disparities in breast cancer survival in the Detroit and Atlanta areas.
Maglo, Koffi N.; Mersha, Tesfaye B.; Martin, Lisa J.
2016-01-01
The biological status and biomedical significance of the concept of race as applied to humans continue to be contentious issues despite the use of advanced statistical and clustering methods to determine continental ancestry. It is thus imperative for researchers to understand the limitations as well as potential uses of the concept of race in biology and biomedicine. This paper deals with the theoretical assumptions behind cluster analysis in human population genomics. Adopting an interdisciplinary approach, it demonstrates that the hypothesis that attributes the clustering of human populations to “frictional” effects of landform barriers at continental boundaries is empirically incoherent. It then contrasts the scientific status of the “cluster” and “cline” constructs in human population genomics, and shows how cluster may be instrumentally produced. It also shows how statistical values of race vindicate Darwin's argument that race is evolutionarily meaningless. Finally, the paper explains why, due to spatiotemporal parameters, evolutionary forces, and socio-cultural factors influencing population structure, continental ancestry may be pragmatically relevant to global and public health genomics. Overall, this work demonstrates that, from a biological systematic and evolutionary taxonomical perspective, human races/continental groups or clusters have no natural meaning or objective biological reality. In fact, the utility of racial categorizations in research and in clinics can be explained by spatiotemporal parameters, socio-cultural factors, and evolutionary forces affecting disease causation and treatment response. PMID:26925096
Energy Efficient Engine Low Pressure Subsystem Flow Analysis
NASA Technical Reports Server (NTRS)
Hall, Edward J.; Lynn, Sean R.; Heidegger, Nathan J.; Delaney, Robert A.
1998-01-01
The objective of this project is to provide the capability to analyze the aerodynamic performance of the complete low pressure subsystem (LPS) of the Energy Efficient Engine (EEE). The analyses were performed using three-dimensional Navier-Stokes numerical models employing advanced clustered processor computing platforms. The analysis evaluates the impact of steady aerodynamic interaction effects between the components of the LPS at design and off-design operating conditions. Mechanical coupling is provided by adjusting the rotational speed of common shaft-mounted components until a power balance is achieved. The Navier-Stokes modeling of the complete low pressure subsystem provides critical knowledge of component aero/mechanical interactions that previously were unknown to the designer until after hardware testing.
Energy Efficient Engine Low Pressure Subsystem Aerodynamic Analysis
NASA Technical Reports Server (NTRS)
Hall, Edward J.; Delaney, Robert A.; Lynn, Sean R.; Veres, Joseph P.
1998-01-01
The objective of this study was to demonstrate the capability to analyze the aerodynamic performance of the complete low pressure subsystem (LPS) of the Energy Efficient Engine (EEE). Detailed analyses were performed using three- dimensional Navier-Stokes numerical models employing advanced clustered processor computing platforms. The analysis evaluates the impact of steady aerodynamic interaction effects between the components of the LPS at design and off- design operating conditions. Mechanical coupling is provided by adjusting the rotational speed of common shaft-mounted components until a power balance is achieved. The Navier-Stokes modeling of the complete low pressure subsystem provides critical knowledge of component acro/mechanical interactions that previously were unknown to the designer until after hardware testing.
CFD Based Computations of Flexible Helicopter Blades for Stability Analysis
NASA Technical Reports Server (NTRS)
Guruswamy, Guru P.
2011-01-01
As a collaborative effort among government aerospace research laboratories an advanced version of a widely used computational fluid dynamics code, OVERFLOW, was recently released. This latest version includes additions to model flexible rotating multiple blades. In this paper, the OVERFLOW code is applied to improve the accuracy of airload computations from the linear lifting line theory that uses displacements from beam model. Data transfers required at every revolution are managed through a Unix based script that runs jobs on large super-cluster computers. Results are demonstrated for the 4-bladed UH-60A helicopter. Deviations of computed data from flight data are evaluated. Fourier analysis post-processing that is suitable for aeroelastic stability computations are performed.
Adult Learning Disorders: Contemporary Issues
ERIC Educational Resources Information Center
Wolf, Lorraine E., Ed.; Schreiber, Hope E., Ed.; Wasserstein, Jeanette, Ed.
2008-01-01
Recent advances in neuroimaging and genetics technologies have enhanced our understanding of neurodevelopmental disorders in adults. The authors in this volume not only discuss such advances as they apply to adults with learning disorders, but also address their translation into clinical practice. One cluster of chapters addresses developmental…
Stefurak, Tres; Calhoun, Georgia B
2007-01-01
The current study sought to explore subtypes of adolescents within a sample of female juvenile offenders. Using the Millon Adolescent Clinical Inventory with 101 female juvenile offenders, a two-step cluster analysis was performed beginning with a Ward's method hierarchical cluster analysis followed by a K-Means iterative partitioning cluster analysis. The results suggest an optimal three-cluster solution, with cluster profiles leading to the following group labels: Externalizing Problems, Depressed/Interpersonally Ambivalent, and Anxious Prosocial. Analysis along the factors of age, race, offense typology and offense chronicity were conducted to further understand the nature of found clusters. Only the effect for race was significant with the Anxious Prosocial and Depressed Intepersonally Ambivalent clusters appearing disproportionately comprised of African American girls. To establish external validity, clusters were compared across scales of the Behavioral Assessment System for Children - Self Report of Personality, and corroborative distinctions between clusters were found here.
[Cluster analysis in biomedical researches].
Akopov, A S; Moskovtsev, A A; Dolenko, S A; Savina, G D
2013-01-01
Cluster analysis is one of the most popular methods for the analysis of multi-parameter data. The cluster analysis reveals the internal structure of the data, group the separate observations on the degree of their similarity. The review provides a definition of the basic concepts of cluster analysis, and discusses the most popular clustering algorithms: k-means, hierarchical algorithms, Kohonen networks algorithms. Examples are the use of these algorithms in biomedical research.
Algorithms of maximum likelihood data clustering with applications
NASA Astrophysics Data System (ADS)
Giada, Lorenzo; Marsili, Matteo
2002-12-01
We address the problem of data clustering by introducing an unsupervised, parameter-free approach based on maximum likelihood principle. Starting from the observation that data sets belonging to the same cluster share a common information, we construct an expression for the likelihood of any possible cluster structure. The likelihood in turn depends only on the Pearson's coefficient of the data. We discuss clustering algorithms that provide a fast and reliable approximation to maximum likelihood configurations. Compared to standard clustering methods, our approach has the advantages that (i) it is parameter free, (ii) the number of clusters need not be fixed in advance and (iii) the interpretation of the results is transparent. In order to test our approach and compare it with standard clustering algorithms, we analyze two very different data sets: time series of financial market returns and gene expression data. We find that different maximization algorithms produce similar cluster structures whereas the outcome of standard algorithms has a much wider variability.
A Fast Density-Based Clustering Algorithm for Real-Time Internet of Things Stream
Ying Wah, Teh
2014-01-01
Data streams are continuously generated over time from Internet of Things (IoT) devices. The faster all of this data is analyzed, its hidden trends and patterns discovered, and new strategies created, the faster action can be taken, creating greater value for organizations. Density-based method is a prominent class in clustering data streams. It has the ability to detect arbitrary shape clusters, to handle outlier, and it does not need the number of clusters in advance. Therefore, density-based clustering algorithm is a proper choice for clustering IoT streams. Recently, several density-based algorithms have been proposed for clustering data streams. However, density-based clustering in limited time is still a challenging issue. In this paper, we propose a density-based clustering algorithm for IoT streams. The method has fast processing time to be applicable in real-time application of IoT devices. Experimental results show that the proposed approach obtains high quality results with low computation time on real and synthetic datasets. PMID:25110753
A fast density-based clustering algorithm for real-time Internet of Things stream.
Amini, Amineh; Saboohi, Hadi; Wah, Teh Ying; Herawan, Tutut
2014-01-01
Data streams are continuously generated over time from Internet of Things (IoT) devices. The faster all of this data is analyzed, its hidden trends and patterns discovered, and new strategies created, the faster action can be taken, creating greater value for organizations. Density-based method is a prominent class in clustering data streams. It has the ability to detect arbitrary shape clusters, to handle outlier, and it does not need the number of clusters in advance. Therefore, density-based clustering algorithm is a proper choice for clustering IoT streams. Recently, several density-based algorithms have been proposed for clustering data streams. However, density-based clustering in limited time is still a challenging issue. In this paper, we propose a density-based clustering algorithm for IoT streams. The method has fast processing time to be applicable in real-time application of IoT devices. Experimental results show that the proposed approach obtains high quality results with low computation time on real and synthetic datasets.
Mimicking the magnetic properties of rare earth elements using superatoms.
Cheng, Shi-Bo; Berkdemir, Cuneyt; Castleman, A W
2015-04-21
Rare earth elements (REs) consist of a very important group in the periodic table that is vital to many modern technologies. The mining process, however, is extremely damaging to the environment, making them low yield and very expensive. Therefore, mimicking the properties of REs in a superatom framework is especially valuable but at the same time, technically challenging and requiring advanced concepts about manipulating properties of atom/molecular complexes. Herein, by using photoelectron imaging spectroscopy, we provide original idea and direct experimental evidence that chosen boron-doped clusters could mimic the magnetic characteristics of REs. Specifically, the neutral LaB and NdB clusters are found to have similar unpaired electrons and magnetic moments as their isovalent REs (namely Nd and Eu, respectively), opening up the great possibility in accomplishing rare earth mimicry. Extension of the superatom concept into the rare earth group not only further shows the power and advance of this concept but also, will stimulate more efforts to explore new superatomic clusters to mimic the chemistry of these heavy atoms, which will be of great importance in designing novel building blocks in the application of cluster-assembled nanomaterials. Additionally, based on these experimental findings, a novel "magic boron" counting rule is proposed to estimate the numbers of unpaired electrons in diatomic LnB clusters.
Martín, Juan F.; Liras, Paloma
2017-01-01
The clavine alkaloids produced by the fungi of the Aspergillaceae and Arthrodermatacea families differ from the ergot alkaloids produced by Claviceps and Neotyphodium. The clavine alkaloids lack the extensive peptide chain modifications that occur in lysergic acid derived ergot alkaloids. Both clavine and ergot alkaloids arise from the condensation of tryptophan and dimethylallylpyrophosphate by the action of the dimethylallyltryptophan synthase. The first five steps of the biosynthetic pathway that convert tryptophan and dimethylallyl-pyrophosphate (DMA-PP) in chanoclavine-1-aldehyde are common to both clavine and ergot alkaloids. The biosynthesis of ergot alkaloids has been extensively studied and is not considered in this article. We focus this review on recent advances in the gene clusters for clavine alkaloids in the species of Penicillium, Aspergillus (Neosartorya), Arthroderma and Trychophyton and the enzymes encoded by them. The final products of the clavine alkaloids pathways derive from the tetracyclic ergoline ring, which is modified by late enzymes, including a reverse type prenyltransferase, P450 monooxygenases and acetyltransferases. In Aspergillus japonicus, a α-ketoglutarate and Fe2+-dependent dioxygenase is involved in the cyclization of a festuclavine-like unknown type intermediate into cycloclavine. Related dioxygenases occur in the biosynthetic gene clusters of ergot alkaloids in Claviceps purpurea and also in the clavine clusters in Penicillium species. The final products of the clavine alkaloid pathway in these fungi differ from each other depending on the late biosynthetic enzymes involved. An important difference between clavine and ergot alkaloid pathways is that clavine producers lack the enzyme CloA, a P450 monooxygenase, involved in one of the steps of the conversion of chanoclavine-1-aldehyde into lysergic acid. Bioinformatic analysis of the sequenced genomes of the Aspergillaceae and Arthrodermataceae fungi showed the presence of clavine gene clusters in Arthroderma species, Penicillium roqueforti, Penicillium commune, Penicillium camemberti, Penicillium expansum, Penicillium steckii and Penicillium griseofulvum. Analysis of the gene clusters in several clavine alkaloid producers indicates that there are gene gains, gene losses and gene rearrangements. These findings may be explained by a divergent evolution of the gene clusters of ergot and clavine alkaloids from a common ancestral progenitor six genes cluster although horizontal gene transfer of some specific genes may have occurred more recently. PMID:29186777
Gallardo-Gómez, María; Moran, Sebastian; Páez de la Cadena, María; Martínez-Zorzano, Vicenta Soledad; Rodríguez-Berrocal, Francisco Javier; Rodríguez-Girondo, Mar; Esteller, Manel; Cubiella, Joaquín; Bujanda, Luis; Castells, Antoni; Balaguer, Francesc; Jover, Rodrigo; De Chiara, Loretta
2018-01-01
Colorectal cancer is the fourth cause of cancer-related deaths worldwide, though detection at early stages associates with good prognosis. Thus, there is a clear demand for novel non-invasive tests for the early detection of colorectal cancer and premalignant advanced adenomas, to be used in population-wide screening programs. Aberrant DNA methylation detected in liquid biopsies, such as serum circulating cell-free DNA (cfDNA), is a promising source of non-invasive biomarkers. This study aimed to assess the feasibility of using cfDNA pooled samples to identify potential serum methylation biomarkers for the detection of advanced colorectal neoplasia (colorectal cancer or advanced adenomas) using microarray-based technology. cfDNA was extracted from serum samples from 20 individuals with no colorectal findings, 20 patients with advanced adenomas, and 20 patients with colorectal cancer (stages I and II). Two pooled samples were prepared for each pathological group using equal amounts of cfDNA from 10 individuals, sex-, age-, and recruitment hospital-matched. We measured the methylation levels of 866,836 CpG positions across the genome using the MethylationEPIC array. Pooled serum cfDNA methylation data meets the quality requirements. The proportion of detected CpG in all pools (> 99% with detection p value < 0.01) exceeded Illumina Infinium methylation data quality metrics of the number of sites detected. The differential methylation analysis revealed 1384 CpG sites (5% false discovery rate) with at least 10% difference in the methylation level between no colorectal findings controls and advanced neoplasia, the majority of which were hypomethylated. Unsupervised clustering showed that cfDNA methylation patterns can distinguish advanced neoplasia from healthy controls, as well as separate tumor tissue from healthy mucosa in an independent dataset. We also observed that advanced adenomas and stage I/II colorectal cancer methylation profiles, grouped as advanced neoplasia, are largely homogenous and clustered close together. This preliminary study shows the viability of microarray-based methylation biomarker discovery using pooled serum cfDNA samples as an alternative approach to tissue specimens. Our strategy sets an open door for deciphering new non-invasive biomarkers not only for colorectal cancer detection, but also for other types of cancers.
Veeraraghavan, Rengasayee; Gourdie, Robert G
2016-11-07
The spatial association between proteins is crucial to understanding how they function in biological systems. Colocalization analysis of fluorescence microscopy images is widely used to assess this. However, colocalization analysis performed on two-dimensional images with diffraction-limited resolution merely indicates that the proteins are within 200-300 nm of each other in the xy-plane and within 500-700 nm of each other along the z-axis. Here we demonstrate a novel three-dimensional quantitative analysis applicable to single-molecule positional data: stochastic optical reconstruction microscopy-based relative localization analysis (STORM-RLA). This method offers significant advantages: 1) STORM imaging affords 20-nm resolution in the xy-plane and <50 nm along the z-axis; 2) STORM-RLA provides a quantitative assessment of the frequency and degree of overlap between clusters of colabeled proteins; and 3) STORM-RLA also calculates the precise distances between both overlapping and nonoverlapping clusters in three dimensions. Thus STORM-RLA represents a significant advance in the high-throughput quantitative assessment of the spatial organization of proteins. © 2016 Veeraraghavan and Gourdie. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Stefanidis, Konstantinos; Papatheodorou, George
2018-01-01
During the last decades, Mediterranean freshwater ecosystems, especially lakes, have been under severe pressure due to increasing eutrophication and water quality deterioration. In this article, we compared the effectiveness of different data analysis methods by assessing the contribution of environmental parameters to eutrophication processes. For this purpose, principal components analysis (PCA), cluster analysis, and a self-organizing map (SOM) were applied, using water quality data from two transboundary lakes of North Greece. SOM is considered as an advanced and powerful data analysis tool because of its ability to represent complex and nonlinear relationships among multivariate data sets. The results of PCA and cluster analysis agreed with the SOM results, although the latter provided more information because of the visualization abilities regarding the parameters’ relationships. Besides nutrients that were found to be a key factor for controlling chlorophyll-a (Chl-a), water temperature was related positively with algal production, while the Secchi disk depth parameter was found to be highly important and negatively related toeutrophic conditions. In general, the SOM results were more specific and allowed direct associations between the water quality variables. Our work showed that SOMs can be used effectively in limnological studies to produce robust and interpretable results, aiding scientists and managers to cope with environmental problems such as eutrophication. PMID:29562675
NASA Astrophysics Data System (ADS)
Wagstaff, Kiri L.
2012-03-01
On obtaining a new data set, the researcher is immediately faced with the challenge of obtaining a high-level understanding from the observations. What does a typical item look like? What are the dominant trends? How many distinct groups are included in the data set, and how is each one characterized? Which observable values are common, and which rarely occur? Which items stand out as anomalies or outliers from the rest of the data? This challenge is exacerbated by the steady growth in data set size [11] as new instruments push into new frontiers of parameter space, via improvements in temporal, spatial, and spectral resolution, or by the desire to "fuse" observations from different modalities and instruments into a larger-picture understanding of the same underlying phenomenon. Data clustering algorithms provide a variety of solutions for this task. They can generate summaries, locate outliers, compress data, identify dense or sparse regions of feature space, and build data models. It is useful to note up front that "clusters" in this context refer to groups of items within some descriptive feature space, not (necessarily) to "galaxy clusters" which are dense regions in physical space. The goal of this chapter is to survey a variety of data clustering methods, with an eye toward their applicability to astronomical data analysis. In addition to improving the individual researcher’s understanding of a given data set, clustering has led directly to scientific advances, such as the discovery of new subclasses of stars [14] and gamma-ray bursts (GRBs) [38]. All clustering algorithms seek to identify groups within a data set that reflect some observed, quantifiable structure. Clustering is traditionally an unsupervised approach to data analysis, in the sense that it operates without any direct guidance about which items should be assigned to which clusters. There has been a recent trend in the clustering literature toward supporting semisupervised or constrained clustering, in which some partial information about item assignments or other components of the resulting output are already known and must be accommodated by the solution. Some algorithms seek a partition of the data set into distinct clusters, while others build a hierarchy of nested clusters that can capture taxonomic relationships. Some produce a single optimal solution, while others construct a probabilistic model of cluster membership. More formally, clustering algorithms operate on a data set X composed of items represented by one or more features (dimensions). These could include physical location, such as right ascension and declination, as well as other properties such as brightness, color, temporal change, size, texture, and so on. Let D be the number of dimensions used to represent each item, xi ∈ RD. The clustering goal is to produce an organization P of the items in X that optimizes an objective function f : P -> R, which quantifies the quality of solution P. Often f is defined so as to maximize similarity within a cluster and minimize similarity between clusters. To that end, many algorithms make use of a measure d : X x X -> R of the distance between two items. A partitioning algorithm produces a set of clusters P = {c1, . . . , ck} such that the clusters are nonoverlapping (c_i intersected with c_j = empty set, i != j) subsets of the data set (Union_i c_i=X). Hierarchical algorithms produce a series of partitions P = {p1, . . . , pn }. For a complete hierarchy, the number of partitions n’= n, the number of items in the data set; the top partition is a single cluster containing all items, and the bottom partition contains n clusters, each containing a single item. For model-based clustering, each cluster c_j is represented by a model m_j , such as the cluster center or a Gaussian distribution. The wide array of available clustering algorithms may seem bewildering, and covering all of them is beyond the scope of this chapter. Choosing among them for a particular application involves considerations of the kind of data being analyzed, algorithm runtime efficiency, and how much prior knowledge is available about the problem domain, which can dictate the nature of clusters sought. Fundamentally, the clustering method and its representations of clusters carries with it a definition of what a cluster is, and it is important that this be aligned with the analysis goals for the problem at hand. In this chapter, I emphasize this point by identifying for each algorithm the cluster representation as a model, m_j , even for algorithms that are not typically thought of as creating a “model.” This chapter surveys a basic collection of clustering methods useful to any practitioner who is interested in applying clustering to a new data set. The algorithms include k-means (Section 25.2), EM (Section 25.3), agglomerative (Section 25.4), and spectral (Section 25.5) clustering, with side mentions of variants such as kernel k-means and divisive clustering. The chapter also discusses each algorithm’s strengths and limitations and provides pointers to additional in-depth reading for each subject. Section 25.6 discusses methods for incorporating domain knowledge into the clustering process. This chapter concludes with a brief survey of interesting applications of clustering methods to astronomy data (Section 25.7). The chapter begins with k-means because it is both generally accessible and so widely used that understanding it can be considered a necessary prerequisite for further work in the field. EM can be viewed as a more sophisticated version of k-means that uses a generative model for each cluster and probabilistic item assignments. Agglomerative clustering is the most basic form of hierarchical clustering and provides a basis for further exploration of algorithms in that vein. Spectral clustering permits a departure from feature-vector-based clustering and can operate on data sets instead represented as affinity, or similarity matrices—cases in which only pairwise information is known. The list of algorithms covered in this chapter is representative of those most commonly in use, but it is by no means comprehensive. There is an extensive collection of existing books on clustering that provide additional background and depth. Three early books that remain useful today are Anderberg’s Cluster Analysis for Applications [3], Hartigan’s Clustering Algorithms [25], and Gordon’s Classification [22]. The latter covers basics on similarity measures, partitioning and hierarchical algorithms, fuzzy clustering, overlapping clustering, conceptual clustering, validations methods, and visualization or data reduction techniques such as principal components analysis (PCA),multidimensional scaling, and self-organizing maps. More recently, Jain et al. provided a useful and informative survey [27] of a variety of different clustering algorithms, including those mentioned here as well as fuzzy, graph-theoretic, and evolutionary clustering. Everitt’s Cluster Analysis [19] provides a modern overview of algorithms, similarity measures, and evaluation methods.
NASA Astrophysics Data System (ADS)
Simioni, M.; Bedin, L. R.; Aparicio, A.; Piotto, G.; Milone, A. P.; Nardiello, D.; Anderson, J.; Bellini, A.; Brown, T. M.; Cassisi, S.; Cunial, A.; Granata, V.; Ortolani, S.; van der Marel, R. P.; Vesperini, E.
2018-05-01
As part of the Hubble Space Telescope UV Legacy Survey of Galactic globular clusters, 110 parallel fields were observed with the Wide Field Channel of the Advanced Camera for Surveys, in the outskirts of 48 globular clusters, plus the open cluster NGC 6791. Totalling about 0.3 deg2 of observed sky, this is the largest homogeneous Hubble Space Telescope photometric survey of Galalctic globular clusters outskirts to date. In particular, two distinct pointings have been obtained for each target on average, all centred at about 6.5 arcmin from the cluster centre, thus covering a mean area of about 23 arcmin2 for each globular cluster. For each field, at least one exposure in both F475W and F814W filters was collected. In this work, we publicly release the astrometric and photometric catalogues and the astrometrized atlases for each of these fields.
Steps Toward Understanding Mitochondrial Fe/S Cluster Biogenesis.
Melber, Andrew; Winge, Dennis R
2018-01-01
Iron-sulfur clusters (Fe/S clusters) are essential cofactors required throughout the clades of biology for performing a myriad of unique functions including nitrogen fixation, ribosome assembly, DNA repair, mitochondrial respiration, and metabolite catabolism. Although Fe/S clusters can be synthesized in vitro and transferred to a client protein without enzymatic assistance, biology has evolved intricate mechanisms to assemble and transfer Fe/S clusters within the cellular environment. In eukaryotes, the foundation of all cellular clusters starts within the mitochondria. The focus of this review is to detail the mitochondrial Fe/S biogenesis (ISC) pathway along with the Fe/S cluster transfer steps necessary to mature Fe/S proteins. New advances in our understanding of the mitochondrial Fe/S biogenesis machinery will be highlighted. Additionally, we will address various experimental approaches that have been successful in the identification and characterization of components of the ISC pathway. © 2018 Elsevier Inc. All rights reserved.
Evaluating Cellular Polyfunctionality with a Novel Polyfunctionality Index
Larsen, Martin; Sauce, Delphine; Arnaud, Laurent; Fastenackels, Solène; Appay, Victor; Gorochov, Guy
2012-01-01
Functional evaluation of naturally occurring or vaccination-induced T cell responses in mice, men and monkeys has in recent years advanced from single-parameter (e.g. IFN-γ-secretion) to much more complex multidimensional measurements. Co-secretion of multiple functional molecules (such as cytokines and chemokines) at the single-cell level is now measurable due primarily to major advances in multiparametric flow cytometry. The very extensive and complex datasets generated by this technology raise the demand for proper analytical tools that enable the analysis of combinatorial functional properties of T cells, hence polyfunctionality. Presently, multidimensional functional measures are analysed either by evaluating all combinations of parameters individually or by summing frequencies of combinations that include the same number of simultaneous functions. Often these evaluations are visualized as pie charts. Whereas pie charts effectively represent and compare average polyfunctionality profiles of particular T cell subsets or patient groups, they do not document the degree or variation of polyfunctionality within a group nor does it allow more sophisticated statistical analysis. Here we propose a novel polyfunctionality index that numerically evaluates the degree and variation of polyfuntionality, and enable comparative and correlative parametric and non-parametric statistical tests. Moreover, it allows the usage of more advanced statistical approaches, such as cluster analysis. We believe that the polyfunctionality index will render polyfunctionality an appropriate end-point measure in future studies of T cell responsiveness. PMID:22860124
Spatial Temporal Sowing Pattern of Rapeseed-Mustard Crop in India Using Multi-Date IRS Awifs Data
NASA Astrophysics Data System (ADS)
Rajak, D. R.; Patel, H. A.; Chaudhari, K. N.; Patel, N. K.; Panigrahy, S.; Parihar, J. S.
2011-08-01
This paper highlights the results on spatial pattern of sowing of rapeseed/mustard in four major states in India using multidate Advanced Wide Field Sensor (AWiFS) data for 2010-11 crop season. Geo-referenced, calibrated AWiFS data acquired during October 2010 to February 2011 were used to generate the Normalised Difference Vegetation Index (NDVI) image sets. Iterative Self-Organizing Data Analysis Technique (ISODATA) based clustering of the multi date NDVI dataset for mustard crop pixels was performed. The clusters were segregated to spectral emergence classes using a spectral profile matching approach with reference to ground truth data. The sowing dates were derived from the spectral emergence data using a lag period based on field observation. Analysis showed the sowing pattern in the study states is spread over around 60 days from mid October to mid December. Three distinct clusters of sowing pattern were observed. The major one (around 40%) is sown between mid October and first week of November. Around 25% area is sown from last week of November to mid December. The other 35% area is sown in between these two periods. Analysis of temperature, a key weather variable influencing the growth of this crop, showed that the crop sowing in northern Rajasthan and Haryana is delayed by about one month to avoid the frost damage during reproductive phase. In the parts of Gujarat, southern parts of Rajasthan and Madhya Pradesh (MP), an early sowing in the second fortnight of October was observed, mainly to avoid higher mean temperatures during the month of March.
Novel molecular subtypes of serous and endometrioid ovarian cancer linked to clinical outcome.
Tothill, Richard W; Tinker, Anna V; George, Joshy; Brown, Robert; Fox, Stephen B; Lade, Stephen; Johnson, Daryl S; Trivett, Melanie K; Etemadmoghadam, Dariush; Locandro, Bianca; Traficante, Nadia; Fereday, Sian; Hung, Jillian A; Chiew, Yoke-Eng; Haviv, Izhak; Gertig, Dorota; DeFazio, Anna; Bowtell, David D L
2008-08-15
The study aim to identify novel molecular subtypes of ovarian cancer by gene expression profiling with linkage to clinical and pathologic features. Microarray gene expression profiling was done on 285 serous and endometrioid tumors of the ovary, peritoneum, and fallopian tube. K-means clustering was applied to identify robust molecular subtypes. Statistical analysis identified differentially expressed genes, pathways, and gene ontologies. Laser capture microdissection, pathology review, and immunohistochemistry validated the array-based findings. Patient survival within k-means groups was evaluated using Cox proportional hazards models. Class prediction validated k-means groups in an independent dataset. A semisupervised survival analysis of the array data was used to compare against unsupervised clustering results. Optimal clustering of array data identified six molecular subtypes. Two subtypes represented predominantly serous low malignant potential and low-grade endometrioid subtypes, respectively. The remaining four subtypes represented higher grade and advanced stage cancers of serous and endometrioid morphology. A novel subtype of high-grade serous cancers reflected a mesenchymal cell type, characterized by overexpression of N-cadherin and P-cadherin and low expression of differentiation markers, including CA125 and MUC1. A poor prognosis subtype was defined by a reactive stroma gene expression signature, correlating with extensive desmoplasia in such samples. A similar poor prognosis signature could be found using a semisupervised analysis. Each subtype displayed distinct levels and patterns of immune cell infiltration. Class prediction identified similar subtypes in an independent ovarian dataset with similar prognostic trends. Gene expression profiling identified molecular subtypes of ovarian cancer of biological and clinical importance.
Chen, Ran; Zhang, Yuntao; Sahneh, Faryad Darabi; Scoglio, Caterina M; Wohlleben, Wendel; Haase, Andrea; Monteiro-Riviere, Nancy A; Riviere, Jim E
2014-09-23
Quantitative characterization of nanoparticle interactions with their surrounding environment is vital for safe nanotechnological development and standardization. A recent quantitative measure, the biological surface adsorption index (BSAI), has demonstrated promising applications in nanomaterial surface characterization and biological/environmental prediction. This paper further advances the approach beyond the application of five descriptors in the original BSAI to address the concentration dependence of the descriptors, enabling better prediction of the adsorption profile and more accurate categorization of nanomaterials based on their surface properties. Statistical analysis on the obtained adsorption data was performed based on three different models: the original BSAI, a concentration-dependent polynomial model, and an infinite dilution model. These advancements in BSAI modeling showed a promising development in the application of quantitative predictive modeling in biological applications, nanomedicine, and environmental safety assessment of nanomaterials.
ERIC Educational Resources Information Center
Maryland State Department of Education, 2006
2006-01-01
Maryland Career Clusters are a resource for high schools as they reorganize into smaller learning communities. Instruction is organized around career themes, providing more students the opportunity to explore career choices while still in high school and enroll in pathway programs that enable them to successfully transition from high school to…
Cianciolo, R E; Mohr, F C; Aresu, L; Brown, C A; James, C; Jansen, J H; Spangler, W L; van der Lugt, J J; Kass, P H; Brovida, C; Cowgill, L D; Heiene, R; Polzin, D J; Syme, H; Vaden, S L; van Dongen, A M; Lees, G E
2016-01-01
Evaluation of canine renal biopsy tissue has generally relied on light microscopic (LM) evaluation of hematoxylin and eosin-stained sections ranging in thickness from 3 to 5 µm. Advanced modalities, such as transmission electron microscopy (TEM) and immunofluorescence (IF), have been used sporadically or retrospectively. Diagnostic algorithms of glomerular diseases have been extrapolated from the World Health Organization classification scheme for human glomerular disease. With the recent establishment of 2 veterinary nephropathology services that evaluate 3-µm sections with a panel of histochemical stains and routinely perform TEM and IF, a standardized objective species-specific approach for the diagnosis of canine glomerular disease was needed. Eight veterinary pathologists evaluated 114 parameters (lesions) in renal biopsy specimens from 89 dogs. Hierarchical cluster analysis of the data revealed 2 large categories of glomerular disease based on the presence or absence of immune complex deposition: The immune complex-mediated glomerulonephritis (ICGN) category included cases with histologic lesions of membranoproliferative or membranous patterns. The second category included control dogs and dogs with non-ICGN (glomerular amyloidosis or focal segmental glomerulosclerosis). Cluster analysis performed on only the LM parameters led to misdiagnosis of 22 of the 89 cases-that is, ICGN cases moved to the non-ICGN branch of the dendrogram or vice versa, thereby emphasizing the importance of advanced diagnostic modalities in the evaluation of canine glomerular disease. Salient LM, TEM, and IF features for each pattern of disease were identified, and a preliminary investigation of related clinicopathologic data was performed. © The Author(s) 2015.
NASA Astrophysics Data System (ADS)
Lee, S.; Maharani, Y. N.; Ki, S. J.
2015-12-01
The application of Self-Organizing Map (SOM) to analyze social vulnerability to recognize the resilience within sites is a challenging tasks. The aim of this study is to propose a computational method to identify the sites according to their similarity and to determine the most relevant variables to characterize the social vulnerability in each cluster. For this purposes, SOM is considered as an effective platform for analysis of high dimensional data. By considering the cluster structure, the characteristic of social vulnerability of the sites identification can be fully understand. In this study, the social vulnerability variable is constructed from 17 variables, i.e. 12 independent variables which represent the socio-economic concepts and 5 dependent variables which represent the damage and losses due to Merapi eruption in 2010. These variables collectively represent the local situation of the study area, based on conducted fieldwork on September 2013. By using both independent and dependent variables, we can identify if the social vulnerability is reflected onto the actual situation, in this case, Merapi eruption 2010. However, social vulnerability analysis in the local communities consists of a number of variables that represent their socio-economic condition. Some of variables employed in this study might be more or less redundant. Therefore, SOM is used to reduce the redundant variable(s) by selecting the representative variables using the component planes and correlation coefficient between variables in order to find the effective sample size. Then, the selected dataset was effectively clustered according to their similarities. Finally, this approach can produce reliable estimates of clustering, recognize the most significant variables and could be useful for social vulnerability assessment, especially for the stakeholder as decision maker. This research was supported by a grant 'Development of Advanced Volcanic Disaster Response System considering Potential Volcanic Risk around Korea' [MPSS-NH-2015-81] from the Natural Hazard Mitigation Research Group, National Emergency Management Agency of Korea. Keywords: Self-organizing map, Component Planes, Correlation coefficient, Cluster analysis, Sites identification, Social vulnerability, Merapi eruption 2010
EXPLORING FUNCTIONAL CONNECTIVITY IN FMRI VIA CLUSTERING.
Venkataraman, Archana; Van Dijk, Koene R A; Buckner, Randy L; Golland, Polina
2009-04-01
In this paper we investigate the use of data driven clustering methods for functional connectivity analysis in fMRI. In particular, we consider the K-Means and Spectral Clustering algorithms as alternatives to the commonly used Seed-Based Analysis. To enable clustering of the entire brain volume, we use the Nyström Method to approximate the necessary spectral decompositions. We apply K-Means, Spectral Clustering and Seed-Based Analysis to resting-state fMRI data collected from 45 healthy young adults. Without placing any a priori constraints, both clustering methods yield partitions that are associated with brain systems previously identified via Seed-Based Analysis. Our empirical results suggest that clustering provides a valuable tool for functional connectivity analysis.
NASA System-Level Design, Analysis and Simulation Tools Research on NextGen
NASA Technical Reports Server (NTRS)
Bardina, Jorge
2011-01-01
A review of the research accomplished in 2009 in the System-Level Design, Analysis and Simulation Tools (SLDAST) of the NASA's Airspace Systems Program is presented. This research thrust focuses on the integrated system-level assessment of component level innovations, concepts and technologies of the Next Generation Air Traffic System (NextGen) under research in the ASP program to enable the development of revolutionary improvements and modernization of the National Airspace System. The review includes the accomplishments on baseline research and the advancements on design studies and system-level assessment, including the cluster analysis as an annualization standard of the air traffic in the U.S. National Airspace, and the ACES-Air MIDAS integration for human-in-the-loop analyzes within the NAS air traffic simulation.
Titanium oxo-clusters: precursors for a Lego-like construction of nanostructured hybrid materials.
Rozes, Laurence; Sanchez, Clément
2011-02-01
Titanium oxo-clusters, well-defined monodispersed nano-objects, are appropriate nano-building blocks for the preparation of organic-inorganic materials by a bottom up approach. This critical review proposes to present the different structures of titanium oxo-clusters referenced in the literature and the different strategies followed to build up hybrid materials with these versatile building units. In particular, this critical review cites and reports on the most important papers in the literature, concentrating on recent developments in the field of synthesis, characterization, and the use of titanium oxo-clusters for the construction of advanced hybrid materials (137 references).
ClusterViz: A Cytoscape APP for Cluster Analysis of Biological Network.
Wang, Jianxin; Zhong, Jiancheng; Chen, Gang; Li, Min; Wu, Fang-xiang; Pan, Yi
2015-01-01
Cluster analysis of biological networks is one of the most important approaches for identifying functional modules and predicting protein functions. Furthermore, visualization of clustering results is crucial to uncover the structure of biological networks. In this paper, ClusterViz, an APP of Cytoscape 3 for cluster analysis and visualization, has been developed. In order to reduce complexity and enable extendibility for ClusterViz, we designed the architecture of ClusterViz based on the framework of Open Services Gateway Initiative. According to the architecture, the implementation of ClusterViz is partitioned into three modules including interface of ClusterViz, clustering algorithms and visualization and export. ClusterViz fascinates the comparison of the results of different algorithms to do further related analysis. Three commonly used clustering algorithms, FAG-EC, EAGLE and MCODE, are included in the current version. Due to adopting the abstract interface of algorithms in module of the clustering algorithms, more clustering algorithms can be included for the future use. To illustrate usability of ClusterViz, we provided three examples with detailed steps from the important scientific articles, which show that our tool has helped several research teams do their research work on the mechanism of the biological networks.
Star clusters in evolving galaxies
NASA Astrophysics Data System (ADS)
Renaud, Florent
2018-04-01
Their ubiquity and extreme densities make star clusters probes of prime importance of galaxy evolution. Old globular clusters keep imprints of the physical conditions of their assembly in the early Universe, and younger stellar objects, observationally resolved, tell us about the mechanisms at stake in their formation. Yet, we still do not understand the diversity involved: why is star cluster formation limited to 105M⊙ objects in the Milky Way, while some dwarf galaxies like NGC 1705 are able to produce clusters 10 times more massive? Why do dwarfs generally host a higher specific frequency of clusters than larger galaxies? How to connect the present-day, often resolved, stellar systems to the formation of globular clusters at high redshift? And how do these links depend on the galactic and cosmological environments of these clusters? In this review, I present recent advances on star cluster formation and evolution, in galactic and cosmological context. The emphasis is put on the theory, formation scenarios and the effects of the environment on the evolution of the global properties of clusters. A few open questions are identified.
Mediator and RNA polymerase II clusters associate in transcription-dependent condensates.
Cho, Won-Ki; Spille, Jan-Hendrik; Hecht, Micca; Lee, Choongman; Li, Charles; Grube, Valentin; Cisse, Ibrahim I
2018-06-21
Models of gene control have emerged from genetic and biochemical studies, with limited consideration of the spatial organization and dynamics of key components in living cells. Here we used live cell super-resolution and light sheet imaging to study the organization and dynamics of the Mediator coactivator and RNA polymerase II (Pol II) directly. Mediator and Pol II each form small transient and large stable clusters in living embryonic stem cells. Mediator and Pol II are colocalized in the stable clusters, which associate with chromatin, have properties of phase-separated condensates, and are sensitive to transcriptional inhibitors. We suggest that large clusters of Mediator, recruited by transcription factors at large or clustered enhancer elements, interact with large Pol II clusters in transcriptional condensates in vivo. Copyright © 2018, American Association for the Advancement of Science.
Understanding 3D human torso shape via manifold clustering
NASA Astrophysics Data System (ADS)
Li, Sheng; Li, Peng; Fu, Yun
2013-05-01
Discovering the variations in human torso shape plays a key role in many design-oriented applications, such as suit designing. With recent advances in 3D surface imaging technologies, people can obtain 3D human torso data that provide more information than traditional measurements. However, how to find different human shapes from 3D torso data is still an open problem. In this paper, we propose to use spectral clustering approach on torso manifold to address this problem. We first represent high-dimensional torso data in a low-dimensional space using manifold learning algorithm. Then the spectral clustering method is performed to get several disjoint clusters. Experimental results show that the clusters discovered by our approach can describe the discrepancies in both genders and human shapes, and our approach achieves better performance than the compared clustering method.
2013-01-01
Background Next generation sequencing technologies have greatly advanced many research areas of the biomedical sciences through their capability to generate massive amounts of genetic information at unprecedented rates. The advent of next generation sequencing has led to the development of numerous computational tools to analyze and assemble the millions to billions of short sequencing reads produced by these technologies. While these tools filled an important gap, current approaches for storing, processing, and analyzing short read datasets generally have remained simple and lack the complexity needed to efficiently model the produced reads and assemble them correctly. Results Previously, we presented an overlap graph coarsening scheme for modeling read overlap relationships on multiple levels. Most current read assembly and analysis approaches use a single graph or set of clusters to represent the relationships among a read dataset. Instead, we use a series of graphs to represent the reads and their overlap relationships across a spectrum of information granularity. At each information level our algorithm is capable of generating clusters of reads from the reduced graph, forming an integrated graph modeling and clustering approach for read analysis and assembly. Previously we applied our algorithm to simulated and real 454 datasets to assess its ability to efficiently model and cluster next generation sequencing data. In this paper we extend our algorithm to large simulated and real Illumina datasets to demonstrate that our algorithm is practical for both sequencing technologies. Conclusions Our overlap graph theoretic algorithm is able to model next generation sequencing reads at various levels of granularity through the process of graph coarsening. Additionally, our model allows for efficient representation of the read overlap relationships, is scalable for large datasets, and is practical for both Illumina and 454 sequencing technologies. PMID:24564333
Analysis of the shapes of hemocytes of Callista brevisiphonata in vitro (Bivalvia, Veneridae).
Karetin, Yu A; Pushchin, I I
2015-08-01
Fractal formalism in conjunction with linear methods of image analysis is suitable for the comparative analysis of such "irregular" shapes (from the point of view of classical Euclidean geometry) as flattened amoeboid cells of invertebrates in vitro. Cell morphology of in vitro spreading hemocytes from the bivalve mollusc Callista brevisiphonata was analyzed using correlation, factor and cluster analysis. Four significantly different cell types were identified on the basis of 36 linear and nonlinear parameters. The analysis confirmed the adequacy of the selected methodology for numerical description of the shape and the adequacy of classification of nonlinear shapes of spread hemocytes belonging to the same species. Investigation has practical significance for the description of the morphology of cultured cells, since cell shape is a result of summation of a number of extracellular and intracellular factors. © 2015 International Society for Advancement of Cytometry.
NASA Technical Reports Server (NTRS)
Clowe, Douglas; Markevitch, Maxim; Bradac, Marusa; Gonzalez, Anthony H.; Chung, Sun Mi
2012-01-01
Merging clusters of galaxies are unique in their power to directly probe and place limits on the self-interaction cross-section of dark matter. Detailed observations of several merging clusters have shown the intracluster gas to be displaced from the centroids of dark matter and galaxy density by ram pressure, while the latter components are spatially coincident, consistent with collisionless dark matter. This has been used to place upper limits on the dark matter particle self-interaction cross-section of order 1 sq cm/g. The cluster A520 has been seen as a possible exception. We revisit A520 presenting new Hubble Space Telescope Advanced Camera for Surveys mosaic images and a Magellan image set. We perform a detailed weak-lensing analysis and show that the weak-lensing mass measurements and morphologies of the core galaxy-filled structures are mostly in good agreement with previous works. There is, however, one significant difference: We do not detect the previously claimed "dark core" that contains excess mass with no significant galaxy overdensity at the location of the X-ray plasma. This peak has been suggested to be indicative of a large self-interaction cross-section for dark matter (at least approx 5alpha larger than the upper limit of 0.7 sq cm/g determined by observations of the Bullet Cluster). We find no such indication and instead find that the mass distribution of A520, after subtraction of the X-ray plasma mass, is in good agreement with the luminosity distribution of the cluster galaxies.We conclude that A520 shows no evidence to contradict the collisionless dark matter scenario.
A cluster of immunoresolvents links coagulation to innate host defense in human blood.
Norris, Paul C; Libreros, Stephania; Chiang, Nan; Serhan, Charles N
2017-08-01
Blood coagulation is a protective response that prevents excessive bleeding upon blood vessel injury. We investigated the relationship between coagulation and the resolution of inflammation and infection by lipid mediators (LMs) through metabololipidomics-based profiling of human whole blood (WB) during coagulation. We identified temporal clusters of endogenously produced prothrombotic and proinflammatory LMs (eicosanoids), as well as specialized proresolving mediators (SPMs). In addition to eicosanoids, a specific SPM cluster was identified that consisted of resolvin E1 (RvE1), RvD1, RvD5, lipoxin B 4 , and maresin 1, each of which was present at bioactive concentrations (0.1 to 1 nM). Removal of adenosine from the coagulating blood markedly enhanced the amounts of SPMs produced and further increased the biosynthesis of RvD3, RvD4, and RvD6. The cyclooxygenase inhibitors celecoxib and indomethacin, which block the production of thromboxanes and prostanoids, did not block the production of clot-driven SPMs. Unbiased mass cytometry analysis demonstrated that the SPM cluster produced in human blood targeted leukocytes at the single-cell level, directly activating ERK and CREB signaling in neutrophils and CD14 + monocytes. Treatment of human WB with the components of this SPM cluster enhanced both the phagocytosis and killing of Escherichia coli by leukocytes. Together, these data identify a proresolving LM circuit, including endogenous molecular brakes and accelerators, which promoted host defense. These temporal LM-SPM clusters can provide accessible metabolomic profiles for precision and personalized medicine. Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Enabling Diverse Software Stacks on Supercomputers using High Performance Virtual Clusters.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Younge, Andrew J.; Pedretti, Kevin; Grant, Ryan
While large-scale simulations have been the hallmark of the High Performance Computing (HPC) community for decades, Large Scale Data Analytics (LSDA) workloads are gaining attention within the scientific community not only as a processing component to large HPC simulations, but also as standalone scientific tools for knowledge discovery. With the path towards Exascale, new HPC runtime systems are also emerging in a way that differs from classical distributed com- puting models. However, system software for such capabilities on the latest extreme-scale DOE supercomputing needs to be enhanced to more appropriately support these types of emerging soft- ware ecosystems. In thismore » paper, we propose the use of Virtual Clusters on advanced supercomputing resources to enable systems to support not only HPC workloads, but also emerging big data stacks. Specifi- cally, we have deployed the KVM hypervisor within Cray's Compute Node Linux on a XC-series supercomputer testbed. We also use libvirt and QEMU to manage and provision VMs directly on compute nodes, leveraging Ethernet-over-Aries network emulation. To our knowledge, this is the first known use of KVM on a true MPP supercomputer. We investigate the overhead our solution using HPC benchmarks, both evaluating single-node performance as well as weak scaling of a 32-node virtual cluster. Overall, we find single node performance of our solution using KVM on a Cray is very efficient with near-native performance. However overhead increases by up to 20% as virtual cluster size increases, due to limitations of the Ethernet-over-Aries bridged network. Furthermore, we deploy Apache Spark with large data analysis workloads in a Virtual Cluster, ef- fectively demonstrating how diverse software ecosystems can be supported by High Performance Virtual Clusters.« less
Stellar and Binary Evolution in Star Clusters
NASA Technical Reports Server (NTRS)
McMillan, Stephen L. W.
2001-01-01
This paper presents a final report on research activities covered on Stellar and Binary Evolution in Star Clusters. Substantial progress was made in the development and dissemination of the "Starlab" software environment. Significant improvements were made to "kira," an N-body simulation program tailored to the study of dense stellar systems such as star clusters and galactic nuclei. Key advances include (1) the inclusion of stellar and binary evolution in a self-consistent manner, (2) proper treatment of the anisotropic Galactic tidal field, (3) numerous technical enhancements in the treatment of binary dynamics and interactions, and (4) full support for the special-purpose GRAPE-4 hardware, boosting the program's performance by a factor of 10-100 over the accelerated version. The data-reduction and analysis tools in Starlab were also substantially expanded. A Starlab Web site (http://www.sns.ias.edu/-starlab) was created and developed. The site contains detailed information on the structure and function of the various tools that comprise the package, as well as download information, "how to" tips and examples of common operations, demonstration programs, animations, etc. All versions of the software are freely distributed to all interested users, along with detailed installation instructions.
Cluster-search based monitoring of local earthquakes in SeisComP3
NASA Astrophysics Data System (ADS)
Roessler, D.; Becker, J.; Ellguth, E.; Herrnkind, S.; Weber, B.; Henneberger, R.; Blanck, H.
2016-12-01
We present a new cluster-search based SeisComP3 module for locating local and regional earthquakes in real time. Real-time earthquake monitoring systems such as SeisComP3 provide the backbones for earthquake early warning (EEW), tsunami early warning (TEW) and the rapid assessment of natural and induced seismicity. For any earthquake monitoring system fast and accurate event locations are fundamental determining the reliability and the impact of further analysis. SeisComP3 in the OpenSource version includes a two-stage detector for picking P waves and a phase associator for locating earthquakes based on P-wave detections. scanloc is a more advanced earthquake location program developed by gempa GmbH with seamless integration into SeisComP3. scanloc performs advanced cluster search to discriminate earthquakes occurring closely in space and time and makes additional use of S-wave detections. It has proven to provide fast and accurate earthquake locations at local and regional distances where it outperforms the base SeisComP3 tools. We demonstrate the performance of scanloc for monitoring induced seismicity as well as local and regional earthquakes in different tectonic regimes including subduction, spreading and intra-plate regions. In particular we present examples and catalogs from real-time monitoring of earthquake in Northern Chile based on data from the IPOC network by GFZ German Research Centre for Geosciences for the recent years. Depending on epicentral distance and data transmission, earthquake locations are available within a few seconds after origin time when using scanloc. The association of automatic S-wave detections provides a better constraint on focal depth.
BioTextQuest(+): a knowledge integration platform for literature mining and concept discovery.
Papanikolaou, Nikolas; Pavlopoulos, Georgios A; Pafilis, Evangelos; Theodosiou, Theodosios; Schneider, Reinhard; Satagopam, Venkata P; Ouzounis, Christos A; Eliopoulos, Aristides G; Promponas, Vasilis J; Iliopoulos, Ioannis
2014-11-15
The iterative process of finding relevant information in biomedical literature and performing bioinformatics analyses might result in an endless loop for an inexperienced user, considering the exponential growth of scientific corpora and the plethora of tools designed to mine PubMed(®) and related biological databases. Herein, we describe BioTextQuest(+), a web-based interactive knowledge exploration platform with significant advances to its predecessor (BioTextQuest), aiming to bridge processes such as bioentity recognition, functional annotation, document clustering and data integration towards literature mining and concept discovery. BioTextQuest(+) enables PubMed and OMIM querying, retrieval of abstracts related to a targeted request and optimal detection of genes, proteins, molecular functions, pathways and biological processes within the retrieved documents. The front-end interface facilitates the browsing of document clustering per subject, the analysis of term co-occurrence, the generation of tag clouds containing highly represented terms per cluster and at-a-glance popup windows with information about relevant genes and proteins. Moreover, to support experimental research, BioTextQuest(+) addresses integration of its primary functionality with biological repositories and software tools able to deliver further bioinformatics services. The Google-like interface extends beyond simple use by offering a range of advanced parameterization for expert users. We demonstrate the functionality of BioTextQuest(+) through several exemplary research scenarios including author disambiguation, functional term enrichment, knowledge acquisition and concept discovery linking major human diseases, such as obesity and ageing. The service is accessible at http://bioinformatics.med.uoc.gr/biotextquest. g.pavlopoulos@gmail.com or georgios.pavlopoulos@esat.kuleuven.be Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Coll, Francesc; Harrison, Ewan M; Toleman, Michelle S; Reuter, Sandra; Raven, Kathy E; Blane, Beth; Palmer, Beverley; Kappeler, A Ruth M; Brown, Nicholas M; Török, M Estée; Parkhill, Julian; Peacock, Sharon J
2017-10-25
Genome sequencing has provided snapshots of the transmission of methicillin-resistant Staphylococcus aureus (MRSA) during suspected outbreaks in isolated hospital wards. Scale-up to populations is now required to establish the full potential of this technology for surveillance. We prospectively identified all individuals over a 12-month period who had at least one MRSA-positive sample processed by a routine diagnostic microbiology laboratory in the East of England, which received samples from three hospitals and 75 general practitioner (GP) practices. We sequenced at least 1 MRSA isolate from 1465 individuals (2282 MRSA isolates) and recorded epidemiological data. An integrated epidemiological and phylogenetic analysis revealed 173 transmission clusters containing between 2 and 44 cases and involving 598 people (40.8%). Of these, 118 clusters (371 people) involved hospital contacts alone, 27 clusters (72 people) involved community contacts alone, and 28 clusters (157 people) had both types of contact. Community- and hospital-associated MRSA lineages were equally capable of transmission in the community, with instances of spread in households, long-term care facilities, and GP practices. Our study provides a comprehensive picture of MRSA transmission in a sampled population of 1465 people and suggests the need to review existing infection control policy and practice. Copyright © 2017 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Cluster tool solution for fabrication and qualification of advanced photomasks
NASA Astrophysics Data System (ADS)
Schaetz, Thomas; Hartmann, Hans; Peter, Kai; Lalanne, Frederic P.; Maurin, Olivier; Baracchi, Emanuele; Miramond, Corinne; Brueck, Hans-Juergen; Scheuring, Gerd; Engel, Thomas; Eran, Yair; Sommer, Karl
2000-07-01
The reduction of wavelength in optical lithography, phase shift technology and optical proximity correction (OPC), requires a rapid increase in cost effective qualification of photomasks. The knowledge about CD variation, loss of pattern fidelity especially for OPC pattern and mask defects concerning the impact on wafer level is becoming a key issue for mask quality assessment. As part of the European Community supported ESPRIT projection 'Q-CAP', a new cluster concept has been developed, which allows the combination of hardware tools as well as software tools via network communication. It is designed to be open for any tool manufacturer and mask hose. The bi-directional network access allows the exchange of all relevant mask data including grayscale images, measurement results, lithography parameters, defect coordinates, layout data, process data etc. and its storage to a SQL database. The system uses SEMI format descriptions as well as standard network hardware and software components for the client server communication. Each tool is used mainly to perform its specific application without using expensive time to perform optional analysis, but the availability of the database allows each component to share the full data ste gathered by all components. Therefore, the cluster can be considered as one single virtual tool. The paper shows the advantage of the cluster approach, the benefits of the tools linked together already, and a vision of a mask house in the near future.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Petkov, Valeri; Prasai, Binay; Shastri, Sarvjit
Practical applications require the production and usage of metallic nanocrystals (NCs) in large ensembles. Besides, due to their cluster-bulk solid duality, metallic NCs exhibit a large degree of structural diversity. This poses the question as to what atomic-scale basis is to be used when the structure–function relationship for metallic NCs is to be quantified precisely. In this paper, we address the question by studying bi-functional Fe core-Pt skin type NCs optimized for practical applications. In particular, the cluster-like Fe core and skin-like Pt surface of the NCs exhibit superparamagnetic properties and a superb catalytic activity for the oxygen reduction reaction,more » respectively. We determine the atomic-scale structure of the NCs by non-traditional resonant high-energy X-ray diffraction coupled to atomic pair distribution function analysis. Using the experimental structure data we explain the observed magnetic and catalytic behavior of the NCs in a quantitative manner. Lastly, we demonstrate that NC ensemble-averaged 3D positions of atoms obtained by advanced X-ray scattering techniques are a very proper basis for not only establishing but also quantifying the structure–function relationship for the increasingly complex metallic NCs explored for practical applications.« less
Clustering, randomness, and regularity in cloud fields: 2. Cumulus cloud fields
NASA Astrophysics Data System (ADS)
Zhu, T.; Lee, J.; Weger, R. C.; Welch, R. M.
1992-12-01
During the last decade a major controversy has been brewing concerning the proper characterization of cumulus convection. The prevailing view has been that cumulus clouds form in clusters, in which cloud spacing is closer than that found for the overall cloud field and which maintains its identity over many cloud lifetimes. This "mutual protection hypothesis" of Randall and Huffman (1980) has been challenged by the "inhibition hypothesis" of Ramirez et al. (1990) which strongly suggests that the spatial distribution of cumuli must tend toward a regular distribution. A dilemma has resulted because observations have been reported to support both hypotheses. The present work reports a detailed analysis of cumulus cloud field spatial distributions based upon Landsat, Advanced Very High Resolution Radiometer, and Skylab data. Both nearest-neighbor and point-to-cloud cumulative distribution function statistics are investigated. The results show unequivocally that when both large and small clouds are included in the cloud field distribution, the cloud field always has a strong clustering signal. The strength of clustering is largest at cloud diameters of about 200-300 m, diminishing with increasing cloud diameter. In many cases, clusters of small clouds are found which are not closely associated with large clouds. As the small clouds are eliminated from consideration, the cloud field typically tends towards regularity. Thus it would appear that the "inhibition hypothesis" of Ramirez and Bras (1990) has been verified for the large clouds. However, these results are based upon the analysis of point processes. A more exact analysis also is made which takes into account the cloud size distributions. Since distinct clouds are by definition nonoverlapping, cloud size effects place a restriction upon the possible locations of clouds in the cloud field. The net effect of this analysis is that the large clouds appear to be randomly distributed, with only weak tendencies towards regularity. For clouds less than 1 km in diameter, the average nearest-neighbor distance is equal to 3-7 cloud diameters. For larger clouds, the ratio of cloud nearest-neighbor distance to cloud diameter increases sharply with increasing cloud diameter. This demonstrates that large clouds inhibit the growth of other large clouds in their vicinity. Nevertheless, this leads to random distributions of large clouds, not regularity.
Dunn, Heather; Quinn, Laurie; Corbridge, Susan J; Eldeirawi, Kamal; Kapella, Mary; Collins, Eileen G
2017-05-01
The use of cluster analysis in the nursing literature is limited to the creation of classifications of homogeneous groups and the discovery of new relationships. As such, it is important to provide clarity regarding its use and potential. The purpose of this article is to provide an introduction to distance-based, partitioning-based, and model-based cluster analysis methods commonly utilized in the nursing literature, provide a brief historical overview on the use of cluster analysis in nursing literature, and provide suggestions for future research. An electronic search included three bibliographic databases, PubMed, CINAHL and Web of Science. Key terms were cluster analysis and nursing. The use of cluster analysis in the nursing literature is increasing and expanding. The increased use of cluster analysis in the nursing literature is positioning this statistical method to result in insights that have the potential to change clinical practice.
Fast Image Texture Classification Using Decision Trees
NASA Technical Reports Server (NTRS)
Thompson, David R.
2011-01-01
Texture analysis would permit improved autonomous, onboard science data interpretation for adaptive navigation, sampling, and downlink decisions. These analyses would assist with terrain analysis and instrument placement in both macroscopic and microscopic image data products. Unfortunately, most state-of-the-art texture analysis demands computationally expensive convolutions of filters involving many floating-point operations. This makes them infeasible for radiation- hardened computers and spaceflight hardware. A new method approximates traditional texture classification of each image pixel with a fast decision-tree classifier. The classifier uses image features derived from simple filtering operations involving integer arithmetic. The texture analysis method is therefore amenable to implementation on FPGA (field-programmable gate array) hardware. Image features based on the "integral image" transform produce descriptive and efficient texture descriptors. Training the decision tree on a set of training data yields a classification scheme that produces reasonable approximations of optimal "texton" analysis at a fraction of the computational cost. A decision-tree learning algorithm employing the traditional k-means criterion of inter-cluster variance is used to learn tree structure from training data. The result is an efficient and accurate summary of surface morphology in images. This work is an evolutionary advance that unites several previous algorithms (k-means clustering, integral images, decision trees) and applies them to a new problem domain (morphology analysis for autonomous science during remote exploration). Advantages include order-of-magnitude improvements in runtime, feasibility for FPGA hardware, and significant improvements in texture classification accuracy.
Metsalu, Tauno; Vilo, Jaak
2015-01-01
The Principal Component Analysis (PCA) is a widely used method of reducing the dimensionality of high-dimensional data, often followed by visualizing two of the components on the scatterplot. Although widely used, the method is lacking an easy-to-use web interface that scientists with little programming skills could use to make plots of their own data. The same applies to creating heatmaps: it is possible to add conditional formatting for Excel cells to show colored heatmaps, but for more advanced features such as clustering and experimental annotations, more sophisticated analysis tools have to be used. We present a web tool called ClustVis that aims to have an intuitive user interface. Users can upload data from a simple delimited text file that can be created in a spreadsheet program. It is possible to modify data processing methods and the final appearance of the PCA and heatmap plots by using drop-down menus, text boxes, sliders etc. Appropriate defaults are given to reduce the time needed by the user to specify input parameters. As an output, users can download PCA plot and heatmap in one of the preferred file formats. This web server is freely available at http://biit.cs.ut.ee/clustvis/. PMID:25969447
ICAP - An Interactive Cluster Analysis Procedure for analyzing remotely sensed data
NASA Technical Reports Server (NTRS)
Wharton, S. W.; Turner, B. J.
1981-01-01
An Interactive Cluster Analysis Procedure (ICAP) was developed to derive classifier training statistics from remotely sensed data. ICAP differs from conventional clustering algorithms by allowing the analyst to optimize the cluster configuration by inspection, rather than by manipulating process parameters. Control of the clustering process alternates between the algorithm, which creates new centroids and forms clusters, and the analyst, who can evaluate and elect to modify the cluster structure. Clusters can be deleted, or lumped together pairwise, or new centroids can be added. A summary of the cluster statistics can be requested to facilitate cluster manipulation. The principal advantage of this approach is that it allows prior information (when available) to be used directly in the analysis, since the analyst interacts with ICAP in a straightforward manner, using basic terms with which he is more likely to be familiar. Results from testing ICAP showed that an informed use of ICAP can improve classification, as compared to an existing cluster analysis procedure.
Self-similarity of temperature profiles in distant galaxy clusters: the quest for a universal law
NASA Astrophysics Data System (ADS)
Baldi, A.; Ettori, S.; Molendi, S.; Gastaldello, F.
2012-09-01
Context. We present the XMM-Newton temperature profiles of 12 bright (LX > 4 × 1044 erg s-1) clusters of galaxies at 0.4 < z < 0.9, having an average temperature in the range 5 ≲ kT ≲ 11 keV. Aims: The main goal of this paper is to study for the first time the temperature profiles of a sample of high-redshift clusters, to investigate their properties, and to define a universal law to describe the temperature radial profiles in galaxy clusters as a function of both cosmic time and their state of relaxation. Methods: We performed a spatially resolved spectral analysis, using Cash statistics, to measure the temperature in the intracluster medium at different radii. Results: We extracted temperature profiles for the clusters in our sample, finding that all profiles are declining toward larger radii. The normalized temperature profiles (normalized by the mean temperature T500) are found to be generally self-similar. The sample was subdivided into five cool-core (CC) and seven non cool-core (NCC) clusters by introducing a pseudo-entropy ratio σ = (TIN/TOUT) × (EMIN/EMOUT)-1/3 and defining the objects with σ < 0.6 as CC clusters and those with σ ≥ 0.6 as NCC clusters. The profiles of CC and NCC clusters differ mainly in the central regions, with the latter exhibiting a slightly flatter central profile. A significant dependence of the temperature profiles on the pseudo-entropy ratio σ is detected by fitting a function of r and σ, showing an indication that the outer part of the profiles becomes steeper for higher values of σ (i.e. transitioning toward the NCC clusters). No significant evidence of redshift evolution could be found within the redshift range sampled by our clusters (0.4 < z < 0.9). A comparison of our high-z sample with intermediate clusters at 0.1 < z < 0.3 showed how the CC and NCC cluster temperature profiles have experienced some sort of evolution. This can happen because higher z clusters are at a less advanced stage of their formation and did not have enough time to create a relaxed structure, which is characterized by a central temperature dip in CC clusters and by flatter profiles in NCC clusters. Conclusions: This is the first time that a systematic study of the temperature profiles of galaxy clusters at z > 0.4 has been attempted. We were able to define the closest possible relation to a universal law for the temperature profiles of galaxy clusters at 0.1 < z < 0.9, showing a dependence on both the relaxation state of the clusters and the redshift. Appendix A is only available in electronic form at http://www.aanda.org
[Advances in clustered regularly interspaced short palindromic repeats--a review].
Wang, Lili; He, Jin; Wang, Jieping
2011-08-01
The recently discovered Clustered Regularly Interspaced Short Palindromic Repeat (CRISPRs) can protect bacteria and archaea with adaptive and heritable defense systems against the invasion of phage- and plasmid- associated mobile genetic elements. Here, we review the structure, diversity, mechanism of interference and self versus non-self discrimination of CRISPR systems. We also discuss the potential applications of this novel interference system.
Missing continuous outcomes under covariate dependent missingness in cluster randomised trials
Diaz-Ordaz, Karla; Bartlett, Jonathan W
2016-01-01
Attrition is a common occurrence in cluster randomised trials which leads to missing outcome data. Two approaches for analysing such trials are cluster-level analysis and individual-level analysis. This paper compares the performance of unadjusted cluster-level analysis, baseline covariate adjusted cluster-level analysis and linear mixed model analysis, under baseline covariate dependent missingness in continuous outcomes, in terms of bias, average estimated standard error and coverage probability. The methods of complete records analysis and multiple imputation are used to handle the missing outcome data. We considered four scenarios, with the missingness mechanism and baseline covariate effect on outcome either the same or different between intervention groups. We show that both unadjusted cluster-level analysis and baseline covariate adjusted cluster-level analysis give unbiased estimates of the intervention effect only if both intervention groups have the same missingness mechanisms and there is no interaction between baseline covariate and intervention group. Linear mixed model and multiple imputation give unbiased estimates under all four considered scenarios, provided that an interaction of intervention and baseline covariate is included in the model when appropriate. Cluster mean imputation has been proposed as a valid approach for handling missing outcomes in cluster randomised trials. We show that cluster mean imputation only gives unbiased estimates when missingness mechanism is the same between the intervention groups and there is no interaction between baseline covariate and intervention group. Multiple imputation shows overcoverage for small number of clusters in each intervention group. PMID:27177885
Missing continuous outcomes under covariate dependent missingness in cluster randomised trials.
Hossain, Anower; Diaz-Ordaz, Karla; Bartlett, Jonathan W
2017-06-01
Attrition is a common occurrence in cluster randomised trials which leads to missing outcome data. Two approaches for analysing such trials are cluster-level analysis and individual-level analysis. This paper compares the performance of unadjusted cluster-level analysis, baseline covariate adjusted cluster-level analysis and linear mixed model analysis, under baseline covariate dependent missingness in continuous outcomes, in terms of bias, average estimated standard error and coverage probability. The methods of complete records analysis and multiple imputation are used to handle the missing outcome data. We considered four scenarios, with the missingness mechanism and baseline covariate effect on outcome either the same or different between intervention groups. We show that both unadjusted cluster-level analysis and baseline covariate adjusted cluster-level analysis give unbiased estimates of the intervention effect only if both intervention groups have the same missingness mechanisms and there is no interaction between baseline covariate and intervention group. Linear mixed model and multiple imputation give unbiased estimates under all four considered scenarios, provided that an interaction of intervention and baseline covariate is included in the model when appropriate. Cluster mean imputation has been proposed as a valid approach for handling missing outcomes in cluster randomised trials. We show that cluster mean imputation only gives unbiased estimates when missingness mechanism is the same between the intervention groups and there is no interaction between baseline covariate and intervention group. Multiple imputation shows overcoverage for small number of clusters in each intervention group.
Stellar clusters in the Gaia era
NASA Astrophysics Data System (ADS)
Bragaglia, Angela
2018-04-01
Stellar clusters are important for astrophysics in many ways, for instance as optimal tracers of the Galactic populations to which they belong or as one of the best test bench for stellar evolutionary models. Gaia DR1, with TGAS, is just skimming the wealth of exquisite information we are expecting from the more advanced catalogues, but already offers good opportunities and indicates the vast potentialities. Gaia results can be efficiently complemented by ground-based data, in particular by large spectroscopic and photometric surveys. Examples of some scientific results of the Gaia-ESO survey are presented, as a teaser for what will be possible once advanced Gaia releases and ground-based data will be combined.
The formation and evolution of M33 as revealed by its star clusters
NASA Astrophysics Data System (ADS)
San Roman, Izaskun
2012-03-01
Numerical simulations based on the Lambda-Cold Dark Matter (Λ-CDM) model predict a scenario consistent with observational evidence in terms of the build-up of Milky Way-like halos. Under this scenario, large disk galaxies derive from the merger and accretion of many smaller subsystems. However, it is less clear how low-mass spiral galaxies fit into this picture. The best way to answer this question is to study the nearest example of a dwarf spiral galaxy, M33. We will use star clusters to understand the structure, kinematics and stellar populations of this galaxy. Star clusters provide a unique and powerful tool for studying the star formation histories of galaxies. In particular, the ages and metallicities of star clusters bear the imprint of the galaxy formation process. We have made use of the star clusters to uncover the formation and evolution of M33. In this dissertation, we have carried out a comprehensive study of the M33 star cluster system, including deep photometry as well as high signal-to-noise spectroscopy. In order to mitigate the significant incompleteness presents in previous catalogs, we have conducted ground-based and space-based photometric surveys of M33 star clusters. Using archival images, we have analyzed 12 fields using the Advanced Camera for Surveys Wide Field Channel onboard the Hubble Space Telescope (ACS/HST) along the major axis of the galaxy. We present integrated photometry and color-magnitude diagrams for 161 star clusters in M33, of which 115 were previously uncataloged. This survey extends the depth of the existing M33 cluster catalogs by ˜ 1 mag. We have expanded our search through a photometric survey in a 1° x 1° area centered on M33 using the MegaCam camera on the 3.6m Canada-France-Hawaii Telescope (CFHT). In this work we discuss the photometric properties of the sample, including color-color diagrams of 599 new candidate stellar clusters, and 204 confirmed clusters. Comparisons with models of simple stellar populations suggest a large range of ages some as old as ˜ 10 Gyr. In addition, we find in the color-color diagrams a significant population of very young clusters (< 10 Myr) possessing nebular emission. Analysis of the radial density distribution suggests that the cluster system of M33 has suffered from significant depletion, possibly due to interactions with M31. To further understand the properties of M33 star clusters, we have carried out a morphological study 161 star clusters in M33 using ACS/HST images. We have obtained, for the first time, ellipticities, position angles, and surface brightness profiles of a statistically significant number of clusters. Ellipticities show that, on average, M33 clusters are more flattened than those of the Milky Way and M31, and more similar to clusters in the Small Magellanic Cloud. The ellipticities do not show any correlation with age or mass, suggesting that rotation is not the main cause of elongation in the M33 clusters. The position angles of the clusters show a bimodality with a strong peak perpendicular to the position angle of the galaxy. These results support the notion that tidal forces are the reason for the cluster flattening. We have fit analytical models to the surface brightness profiles, and derived structural parameters. The overall analysis shows several differences between the structural properties of the M33 cluster system and cluster systems in nearby galaxies. Finally, we have performed a spectroscopic study of star clusters in the above mentioned catalog. We present high-precision velocity measures of 45 star clusters, based on observations from the 10.4m Gran Telescopio Canarias (GTC) using OSIRIS and 4.2m William Herschel Telescope (WHT) using WYFFOS. All the clusters have been previously confirmed using HST imaging, and ages and integrated photometry are known. The velocity of the clusters with respect to local disk motion increases with age for young and intermediate clusters. The mean dispersion velocity for the intermediate age clusters in our sample is significantly larger than in previous studies. Analysis of these velocities along the major axis of the galaxy show no net rotation of the intermediate age subsample. The small number of old clusters in our sample does not allow for any conclusive evidence in that age division.
NASA Astrophysics Data System (ADS)
Ma, Mengli; Lei, En; Meng, Hengling; Wang, Tiantao; Xie, Linyan; Shen, Dong; Xianwang, Zhou; Lu, Bingyue
2017-08-01
Amomum tsao-ko is a commercial plant that used for various purposes in medicinal and food industries. For the present investigation, 44 germplasm samples were collected from Jinping County of Yunnan Province. Clusters analysis and 2-dimensional principal component analysis (PCA) was used to represent the genetic relations among Amomum tsao-ko by using simple sequence repeat (SSR) markers. Clustering analysis clearly distinguished the samples groups. Two major clusters were formed; first (Cluster I) consisted of 34 individuals, the second (Cluster II) consisted of 10 individuals, Cluster I as the main group contained multiple sub-clusters. PCA also showed 2 groups: PCA Group 1 included 29 individuals, PCA Group 2 included 12 individuals, consistent with the results of cluster analysis. The purpose of the present investigation was to provide information on genetic relationship of Amomum tsao-ko germplasm resources in main producing areas, also provide a theoretical basis for the protection and utilization of Amomum tsao-ko resources.
A cross-omics toxicological evaluation of drinking water treated with different processes.
Shi, Peng; Jia, Shuyu; Zhang, Xu-Xiang; Zhao, Fuzheng; Chen, Yajun; Zhou, Qing; Cheng, Shupei; Li, Ai-Min
2014-04-30
Cross-omics profiling and phenotypic analysis were conducted to comprehensively assess the toxicities of source of drinking water (SDW), effluent of conventional treatment (ECT) and effluent of advanced treatment (EAT) in a water treatment plant. SDW feeding increased body weight, and relative liver and kidney weights of mice. Hepatic histopathological damages and serum biochemical alterations were observed in the mice fed with SDW and ECT, but EAT feeding showed no obvious effects. Transcriptomic analysis demonstrated that exposure to water samples caused differential expression of hundreds of genes in livers. Cluster analysis of the differentially expressed genes which generated by both microarrays and digital gene expression showed similar grouping patterns. Proteomic and metabolomics analyses indicated that drinking SDW, ECT and EAT generated 59, 145 and 41 significantly altered proteins in livers and 8, 2 and 0 altered metabolites in serum, respectively. SDW was found to affect several metabolic pathways including metabolism of xenobiotics by cytochrome P450 and fatty acid metabolism. SDW and ECT might induce molecular toxicities to mice, but the advanced treatment process can reduce the potential health risk by effectively removing toxic chemicals in drinking water. Copyright © 2014 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Petrie, G.M.; Perry, E.M.; Kirkham, R.R.
1997-09-01
This report describes the work performed at the Pacific Northwest National Laboratory (PNNL) for the U.S. Department of Energy`s Office of Nonproliferation and National Security, Office of Research and Development (NN-20). The work supports the NN-20 Broad Area Search and Analysis, a program initiated by NN-20 to improve the detection and classification of undeclared weapons facilities. Ongoing PNNL research activities are described in three main components: image collection, information processing, and change analysis. The Multispectral Airborne Imaging System, which was developed to collect georeferenced imagery in the visible through infrared regions of the spectrum, and flown on a light aircraftmore » platform, will supply current land use conditions. The image information extraction software (dynamic clustering and end-member extraction) uses imagery, like the multispectral data collected by the PNNL multispectral system, to efficiently generate landcover information. The advanced change detection uses a priori (benchmark) information, current landcover conditions, and user-supplied rules to rank suspect areas by probable risk of undeclared facilities or proliferation activities. These components, both separately and combined, provide important tools for improving the detection of undeclared facilities.« less
Lalonde, Michel; Wells, R Glenn; Birnie, David; Ruddy, Terrence D; Wassenaar, Richard
2014-07-01
Phase analysis of single photon emission computed tomography (SPECT) radionuclide angiography (RNA) has been investigated for its potential to predict the outcome of cardiac resynchronization therapy (CRT). However, phase analysis may be limited in its potential at predicting CRT outcome as valuable information may be lost by assuming that time-activity curves (TAC) follow a simple sinusoidal shape. A new method, cluster analysis, is proposed which directly evaluates the TACs and may lead to a better understanding of dyssynchrony patterns and CRT outcome. Cluster analysis algorithms were developed and optimized to maximize their ability to predict CRT response. About 49 patients (N = 27 ischemic etiology) received a SPECT RNA scan as well as positron emission tomography (PET) perfusion and viability scans prior to undergoing CRT. A semiautomated algorithm sampled the left ventricle wall to produce 568 TACs from SPECT RNA data. The TACs were then subjected to two different cluster analysis techniques, K-means, and normal average, where several input metrics were also varied to determine the optimal settings for the prediction of CRT outcome. Each TAC was assigned to a cluster group based on the comparison criteria and global and segmental cluster size and scores were used as measures of dyssynchrony and used to predict response to CRT. A repeated random twofold cross-validation technique was used to train and validate the cluster algorithm. Receiver operating characteristic (ROC) analysis was used to calculate the area under the curve (AUC) and compare results to those obtained for SPECT RNA phase analysis and PET scar size analysis methods. Using the normal average cluster analysis approach, the septal wall produced statistically significant results for predicting CRT results in the ischemic population (ROC AUC = 0.73;p < 0.05 vs. equal chance ROC AUC = 0.50) with an optimal operating point of 71% sensitivity and 60% specificity. Cluster analysis results were similar to SPECT RNA phase analysis (ROC AUC = 0.78, p = 0.73 vs cluster AUC; sensitivity/specificity = 59%/89%) and PET scar size analysis (ROC AUC = 0.73, p = 1.0 vs cluster AUC; sensitivity/specificity = 76%/67%). A SPECT RNA cluster analysis algorithm was developed for the prediction of CRT outcome. Cluster analysis results produced results equivalent to those obtained from Fourier and scar analysis.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lalonde, Michel, E-mail: mlalonde15@rogers.com; Wassenaar, Richard; Wells, R. Glenn
2014-07-15
Purpose: Phase analysis of single photon emission computed tomography (SPECT) radionuclide angiography (RNA) has been investigated for its potential to predict the outcome of cardiac resynchronization therapy (CRT). However, phase analysis may be limited in its potential at predicting CRT outcome as valuable information may be lost by assuming that time-activity curves (TAC) follow a simple sinusoidal shape. A new method, cluster analysis, is proposed which directly evaluates the TACs and may lead to a better understanding of dyssynchrony patterns and CRT outcome. Cluster analysis algorithms were developed and optimized to maximize their ability to predict CRT response. Methods: Aboutmore » 49 patients (N = 27 ischemic etiology) received a SPECT RNA scan as well as positron emission tomography (PET) perfusion and viability scans prior to undergoing CRT. A semiautomated algorithm sampled the left ventricle wall to produce 568 TACs from SPECT RNA data. The TACs were then subjected to two different cluster analysis techniques, K-means, and normal average, where several input metrics were also varied to determine the optimal settings for the prediction of CRT outcome. Each TAC was assigned to a cluster group based on the comparison criteria and global and segmental cluster size and scores were used as measures of dyssynchrony and used to predict response to CRT. A repeated random twofold cross-validation technique was used to train and validate the cluster algorithm. Receiver operating characteristic (ROC) analysis was used to calculate the area under the curve (AUC) and compare results to those obtained for SPECT RNA phase analysis and PET scar size analysis methods. Results: Using the normal average cluster analysis approach, the septal wall produced statistically significant results for predicting CRT results in the ischemic population (ROC AUC = 0.73;p < 0.05 vs. equal chance ROC AUC = 0.50) with an optimal operating point of 71% sensitivity and 60% specificity. Cluster analysis results were similar to SPECT RNA phase analysis (ROC AUC = 0.78, p = 0.73 vs cluster AUC; sensitivity/specificity = 59%/89%) and PET scar size analysis (ROC AUC = 0.73, p = 1.0 vs cluster AUC; sensitivity/specificity = 76%/67%). Conclusions: A SPECT RNA cluster analysis algorithm was developed for the prediction of CRT outcome. Cluster analysis results produced results equivalent to those obtained from Fourier and scar analysis.« less
[Recent advances in research on Malassezia microbiota in humans].
Sugita, Takashi; Zhang, Enshi; Tanaka, Takafumi; Nishikawa, Akemi; Tajima, Mami; Tsuboi, Ryoji
2013-01-01
Malassezia species of lipophilic yeasts account for most fungal microbiota. Although they colonize healthy skin, they are also associated with several skin diseases, including pityriasis versicolor, seborrheic dermatitis, Malassezia folliculitis, and atopic dermatitis. To date, 14 members of the Malassezia genus have been identified. Of these, both M. globosa and M. restricta predominate, regardless of skin-disease type. Comprehensive analysis of fungal microbiota in the skin of patients with atopic dermatitis using an rRNA clone library method revealed that fungal microbiota cluster according to disease severity. The external ear canal and sole of the foot are colonized by specific Malassezia microbiota.
Katwal, Santosh B; Gore, John C; Marois, Rene; Rogers, Baxter P
2013-09-01
We present novel graph-based visualizations of self-organizing maps for unsupervised functional magnetic resonance imaging (fMRI) analysis. A self-organizing map is an artificial neural network model that transforms high-dimensional data into a low-dimensional (often a 2-D) map using unsupervised learning. However, a postprocessing scheme is necessary to correctly interpret similarity between neighboring node prototypes (feature vectors) on the output map and delineate clusters and features of interest in the data. In this paper, we used graph-based visualizations to capture fMRI data features based upon 1) the distribution of data across the receptive fields of the prototypes (density-based connectivity); and 2) temporal similarities (correlations) between the prototypes (correlation-based connectivity). We applied this approach to identify task-related brain areas in an fMRI reaction time experiment involving a visuo-manual response task, and we correlated the time-to-peak of the fMRI responses in these areas with reaction time. Visualization of self-organizing maps outperformed independent component analysis and voxelwise univariate linear regression analysis in identifying and classifying relevant brain regions. We conclude that the graph-based visualizations of self-organizing maps help in advanced visualization of cluster boundaries in fMRI data enabling the separation of regions with small differences in the timings of their brain responses.
Glatman-Freedman, Aharona; Kaufman, Zalman; Kopel, Eran; Bassal, Ravit; Taran, Diana; Valinsky, Lea; Agmon, Vered; Shpriz, Manor; Cohen, Daniel; Anis, Emilia; Shohat, Tamy
2016-08-01
To enhance timely surveillance of bacterial enteric pathogens, space-time cluster analysis was introduced in Israel in May 2013. Stool isolation data of Salmonella, Shigella, and Campylobacter from patients of a large Health Maintenance Organization were analyzed weekly by ArcGIS and SaTScan, and cluster results were sent promptly to local departments of health (LDOHs). During eighteen months, we identified 52 Shigella sonnei clusters, two Salmonella clusters, and no Campylobacter clusters. S. sonnei clusters lasted from one to 33 days and included three to 30 individuals. Thirty-one (60%) of the S. sonnei clusters were known to LDOHs prior to cluster analysis. Clusters not previously known by the LDOHs prompted epidemiologic investigations. In 31 of the 37 (84%) confirmed clusters, educational institutes (nursery schools, kindergartens, and a primary school) were involved. Cluster analysis demonstrated capability to complement enteric disease surveillance. Scaling up the system can further enhance timely detection and control of outbreaks. Copyright © 2016 The British Infection Association. Published by Elsevier Ltd. All rights reserved.
An effective fuzzy kernel clustering analysis approach for gene expression data.
Sun, Lin; Xu, Jiucheng; Yin, Jiaojiao
2015-01-01
Fuzzy clustering is an important tool for analyzing microarray data. A major problem in applying fuzzy clustering method to microarray gene expression data is the choice of parameters with cluster number and centers. This paper proposes a new approach to fuzzy kernel clustering analysis (FKCA) that identifies desired cluster number and obtains more steady results for gene expression data. First of all, to optimize characteristic differences and estimate optimal cluster number, Gaussian kernel function is introduced to improve spectrum analysis method (SAM). By combining subtractive clustering with max-min distance mean, maximum distance method (MDM) is proposed to determine cluster centers. Then, the corresponding steps of improved SAM (ISAM) and MDM are given respectively, whose superiority and stability are illustrated through performing experimental comparisons on gene expression data. Finally, by introducing ISAM and MDM into FKCA, an effective improved FKCA algorithm is proposed. Experimental results from public gene expression data and UCI database show that the proposed algorithms are feasible for cluster analysis, and the clustering accuracy is higher than the other related clustering algorithms.
Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor; Essex, M
2015-05-01
To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice.
Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor
2015-01-01
Abstract To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice. PMID:25560745
Five task clusters that enable efficient and effective digitization of biological collections
Nelson, Gil; Paul, Deborah; Riccardi, Gregory; Mast, Austin R.
2012-01-01
Abstract This paper describes and illustrates five major clusters of related tasks (herein referred to as task clusters) that are common to efficient and effective practices in the digitization of biological specimen data and media. Examples of these clusters come from the observation of diverse digitization processes. The staff of iDigBio (The U.S. National Science Foundation’s National Resource for Advancing Digitization of Biological Collections) visited active biological and paleontological collections digitization programs for the purpose of documenting and assessing current digitization practices and tools. These observations identified five task clusters that comprise the digitization process leading up to data publication: (1) pre-digitization curation and staging, (2) specimen image capture, (3) specimen image processing, (4) electronic data capture, and (5) georeferencing locality descriptions. While not all institutions are completing each of these task clusters for each specimen, these clusters describe a composite picture of digitization of biological and paleontological specimens across the programs that were observed. We describe these clusters, three workflow patterns that dominate the implemention of these clusters, and offer a set of workflow recommendations for digitization programs. PMID:22859876
Prokaryotic Gene Clusters: A Rich Toolbox for Synthetic Biology
Fischbach, Michael; Voigt, Christopher A.
2014-01-01
Bacteria construct elaborate nanostructures, obtain nutrients and energy from diverse sources, synthesize complex molecules, and implement signal processing to react to their environment. These complex phenotypes require the coordinated action of multiple genes, which are often encoded in a contiguous region of the genome, referred to as a gene cluster. Gene clusters sometimes contain all of the genes necessary and sufficient for a particular function. As an evolutionary mechanism, gene clusters facilitate the horizontal transfer of the complete function between species. Here, we review recent work on a number of clusters whose functions are relevant to biotechnology. Engineering these clusters has been hindered by their regulatory complexity, the need to balance the expression of many genes, and a lack of tools to design and manipulate DNA at this scale. Advances in synthetic biology will enable the large-scale bottom-up engineering of the clusters to optimize their functions, wake up cryptic clusters, or to transfer them between organisms. Understanding and manipulating gene clusters will move towards an era of genome engineering, where multiple functions can be “mixed-and-matched” to create a designer organism. PMID:21154668
Bacterial Iron–Sulfur Regulatory Proteins As Biological Sensor-Switches
Crack, Jason C.; Green, Jeffrey; Hutchings, Matthew I.; Thomson, Andrew J.
2012-01-01
Abstract Significance: In recent years, bacterial iron–sulfur cluster proteins that function as regulators of gene transcription have emerged as a major new group. In all cases, the cluster acts as a sensor of the environment and enables the organism to adapt to the prevailing conditions. This can range from mounting a response to oxidative or nitrosative stress to switching between anaerobic and aerobic respiratory pathways. The sensitivity of these ancient cofactors to small molecule reactive oxygen and nitrogen species, in particular, makes them ideally suited to function as sensors. Recent Advances: An important challenge is to obtain mechanistic and structural information about how these regulators function and, in particular, how the chemistry occurring at the cluster drives the subsequent regulatory response. For several regulators, including FNR, SoxR, NsrR, IscR, and Wbl proteins, major advances in understanding have been gained recently and these are reviewed here. Critical Issues: A common theme emerging from these studies is that the sensitivity and specificity of the cluster of each regulatory protein must be exquisitely controlled by the protein environment of the cluster. Future Directions: A major future challenge is to determine, for a range of regulators, the key factors for achieving control of sensitivity/specificity. Such information will lead, eventually, to a system understanding of stress response, which often involves more than one regulator. Antioxid. Redox Signal. 17, 1215–1231. PMID:22239203
NASA Astrophysics Data System (ADS)
Sahi, Qurat-ul-ain; Kim, Yong-Soo
2018-04-01
The understanding of radiation-induced microstructural defects in body-centered cubic (BCC) iron is of major interest to those using advanced steel under extreme conditions in nuclear reactors. In this study, molecular dynamics (MD) simulations were implemented to examine the primary radiation damage in BCC iron with displacement cascades of energy 1, 5, 10, 20, and 30 keV at temperatures ranging from 100 to 1000 K. Statistical analysis of eight MD simulations of collision cascades were carried out along each [110], [112], [111] and a high index [135] direction and the temperature dependence of the surviving number of point defects and the in-cascade clustering of vacancies and interstitials were studied. The peak time and the corresponding number of defects increase with increasing irradiation temperature and primary knock-on atom (PKA) energy. However, the final number of surviving point defects decreases with increasing lattice temperature. This is associated with the increase of thermal spike at high PKA energy and its long timespan at higher temperatures. Defect production efficiency (i.e., surviving MD defects, per Norgett-Robinson-Torrens displacements) also showed a continuous decrease with the increasing irradiation temperature and PKA energy. The number of interstitial clusters increases with both irradiation temperature and PKA energy. However, the increase in the number of vacancy clusters with PKA energy is minimal-to-constant and decreases as the irradiation temperature increases. Similarly, the probability and cluster size distribution for larger interstitials increase with temperature, whereas only smaller size vacancy clusters were observed at higher temperatures.
Effects of Group Size and Lack of Sphericity on the Recovery of Clusters in K-Means Cluster Analysis
ERIC Educational Resources Information Center
de Craen, Saskia; Commandeur, Jacques J. F.; Frank, Laurence E.; Heiser, Willem J.
2006-01-01
K-means cluster analysis is known for its tendency to produce spherical and equally sized clusters. To assess the magnitude of these effects, a simulation study was conducted, in which populations were created with varying departures from sphericity and group sizes. An analysis of the recovery of clusters in the samples taken from these…
2014-01-01
Background There are many methodological challenges in the conduct and analysis of cluster randomised controlled trials, but one that has received little attention is that of post-randomisation changes to cluster composition. To illustrate this, we focus on the issue of cluster merging, considering the impact on the design, analysis and interpretation of trial outcomes. Methods We explored the effects of merging clusters on study power using standard methods of power calculation. We assessed the potential impacts on study findings of both homogeneous cluster merges (involving clusters randomised to the same arm of a trial) and heterogeneous merges (involving clusters randomised to different arms of a trial) by simulation. To determine the impact on bias and precision of treatment effect estimates, we applied standard methods of analysis to different populations under analysis. Results Cluster merging produced a systematic reduction in study power. This effect depended on the number of merges and was most pronounced when variability in cluster size was at its greatest. Simulations demonstrate that the impact on analysis was minimal when cluster merges were homogeneous, with impact on study power being balanced by a change in observed intracluster correlation coefficient (ICC). We found a decrease in study power when cluster merges were heterogeneous, and the estimate of treatment effect was attenuated. Conclusions Examples of cluster merges found in previously published reports of cluster randomised trials were typically homogeneous rather than heterogeneous. Simulations demonstrated that trial findings in such cases would be unbiased. However, simulations also showed that any heterogeneous cluster merges would introduce bias that would be hard to quantify, as well as having negative impacts on the precision of estimates obtained. Further methodological development is warranted to better determine how to analyse such trials appropriately. Interim recommendations include avoidance of cluster merges where possible, discontinuation of clusters following heterogeneous merges, allowance for potential loss of clusters and additional variability in cluster size in the original sample size calculation, and use of appropriate ICC estimates that reflect cluster size. PMID:24884591
A generalized analysis of hydrophobic and loop clusters within globular protein sequences
Eudes, Richard; Le Tuan, Khanh; Delettré, Jean; Mornon, Jean-Paul; Callebaut, Isabelle
2007-01-01
Background Hydrophobic Cluster Analysis (HCA) is an efficient way to compare highly divergent sequences through the implicit secondary structure information directly derived from hydrophobic clusters. However, its efficiency and application are currently limited by the need of user expertise. In order to help the analysis of HCA plots, we report here the structural preferences of hydrophobic cluster species, which are frequently encountered in globular domains of proteins. These species are characterized only by their hydrophobic/non-hydrophobic dichotomy. This analysis has been extended to loop-forming clusters, using an appropriate loop alphabet. Results The structural behavior of hydrophobic cluster species, which are typical of protein globular domains, was investigated within banks of experimental structures, considered at different levels of sequence redundancy. The 294 more frequent hydrophobic cluster species were analyzed with regard to their association with the different secondary structures (frequencies of association with secondary structures and secondary structure propensities). Hydrophobic cluster species are predominantly associated with regular secondary structures, and a large part (60 %) reveals preferences for α-helices or β-strands. Moreover, the analysis of the hydrophobic cluster amino acid composition generally allows for finer prediction of the regular secondary structure associated with the considered cluster within a cluster species. We also investigated the behavior of loop forming clusters, using a "PGDNS" alphabet. These loop clusters do not overlap with hydrophobic clusters and are highly associated with coils. Finally, the structural information contained in the hydrophobic structural words, as deduced from experimental structures, was compared to the PSI-PRED predictions, revealing that β-strands and especially α-helices are generally over-predicted within the limits of typical β and α hydrophobic clusters. Conclusion The dictionary of hydrophobic clusters described here can help the HCA user to interpret and compare the HCA plots of globular protein sequences, as well as provides an original fundamental insight into the structural bricks of protein folds. Moreover, the novel loop cluster analysis brings additional information for secondary structure prediction on the whole sequence through a generalized cluster analysis (GCA), and not only on regular secondary structures. Such information lays the foundations for developing a new and original tool for secondary structure prediction. PMID:17210072
NASA Astrophysics Data System (ADS)
Ryu, Hoon; Jeong, Yosang; Kang, Ji-Hoon; Cho, Kyu Nam
2016-12-01
Modelling of multi-million atomic semiconductor structures is important as it not only predicts properties of physically realizable novel materials, but can accelerate advanced device designs. This work elaborates a new Technology-Computer-Aided-Design (TCAD) tool for nanoelectronics modelling, which uses a sp3d5s∗ tight-binding approach to describe multi-million atomic structures, and simulate electronic structures with high performance computing (HPC), including atomic effects such as alloy and dopant disorders. Being named as Quantum simulation tool for Advanced Nanoscale Devices (Q-AND), the tool shows nice scalability on traditional multi-core HPC clusters implying the strong capability of large-scale electronic structure simulations, particularly with remarkable performance enhancement on latest clusters of Intel Xeon PhiTM coprocessors. A review of the recent modelling study conducted to understand an experimental work of highly phosphorus-doped silicon nanowires, is presented to demonstrate the utility of Q-AND. Having been developed via Intel Parallel Computing Center project, Q-AND will be open to public to establish a sound framework of nanoelectronics modelling with advanced HPC clusters of a many-core base. With details of the development methodology and exemplary study of dopant electronics, this work will present a practical guideline for TCAD development to researchers in the field of computational nanoelectronics.
NASA Technical Reports Server (NTRS)
Ferguson, Henry C.; Binggeli, Bruno
1994-01-01
Dwarf elliptical (dE) galaxies, with blue absolute magnitudes typically fainter than M(sub B) = -16, are the most numerous type of galaxy in the nearby universe. Tremendous advances have been made over the past several years in delineating the properties of both Local Group satellite dE's and the large dE populations of nearby clusters. We review some of these advances, with particular attention to how well currently availiable data can constrain (a) models for the formation of dE's, (b) the physical and evolutionary connections between different types of galaxies that overlap in the same portion of the mass-spectrum of galaxies, (c) the contribution of dE's to the galaxy luminosity functions in clusters and the field, (d) the star-forming histories of dE's and their possible contribution to faint galaxy counts, and (e) the clustering properties of dE's. In addressing these issues, we highlight the extent to which selection effects temper these constraints, and outline areas where new data would be particularly valuable.
Reconstructing galaxy histories from globular clusters.
West, Michael J; Côté, Patrick; Marzke, Ronald O; Jordán, Andrés
2004-01-01
Nearly a century after the true nature of galaxies as distant 'island universes' was established, their origin and evolution remain great unsolved problems of modern astrophysics. One of the most promising ways to investigate galaxy formation is to study the ubiquitous globular star clusters that surround most galaxies. Globular clusters are compact groups of up to a few million stars. They generally formed early in the history of the Universe, but have survived the interactions and mergers that alter substantially their parent galaxies. Recent advances in our understanding of the globular cluster systems of the Milky Way and other galaxies point to a complex picture of galaxy genesis driven by cannibalism, collisions, bursts of star formation and other tumultuous events.
Borri, Marco; Schmidt, Maria A; Powell, Ceri; Koh, Dow-Mu; Riddell, Angela M; Partridge, Mike; Bhide, Shreerang A; Nutting, Christopher M; Harrington, Kevin J; Newbold, Katie L; Leach, Martin O
2015-01-01
To describe a methodology, based on cluster analysis, to partition multi-parametric functional imaging data into groups (or clusters) of similar functional characteristics, with the aim of characterizing functional heterogeneity within head and neck tumour volumes. To evaluate the performance of the proposed approach on a set of longitudinal MRI data, analysing the evolution of the obtained sub-sets with treatment. The cluster analysis workflow was applied to a combination of dynamic contrast-enhanced and diffusion-weighted imaging MRI data from a cohort of squamous cell carcinoma of the head and neck patients. Cumulative distributions of voxels, containing pre and post-treatment data and including both primary tumours and lymph nodes, were partitioned into k clusters (k = 2, 3 or 4). Principal component analysis and cluster validation were employed to investigate data composition and to independently determine the optimal number of clusters. The evolution of the resulting sub-regions with induction chemotherapy treatment was assessed relative to the number of clusters. The clustering algorithm was able to separate clusters which significantly reduced in voxel number following induction chemotherapy from clusters with a non-significant reduction. Partitioning with the optimal number of clusters (k = 4), determined with cluster validation, produced the best separation between reducing and non-reducing clusters. The proposed methodology was able to identify tumour sub-regions with distinct functional properties, independently separating clusters which were affected differently by treatment. This work demonstrates that unsupervised cluster analysis, with no prior knowledge of the data, can be employed to provide a multi-parametric characterization of functional heterogeneity within tumour volumes.
Buijck, Bianca I; Zuidema, Sytse U; Spruit-van Eijk, Monica; Bor, Hans; Gerritsen, Debby L; Koopmans, Raymond T C M
2012-12-04
Geriatric stroke patients are generally frail, have an advanced age and co-morbidity. It is yet unclear whether specific groups of patients might benefit differently from structured multidisciplinary rehabilitation programs. Therefore, the aims of our study are 1) to determine relevant patient characteristics to distinguish groups of patients based on their admission scores in skilled nursing facilities (SNFs), and (2) to study the course of these particular patient-groups in relation to their discharge destination. This is a longitudinal, multicenter, observational study. We collected data on patient characteristics, balance, walking ability, arm function, co-morbidity, activities of daily living (ADL), neuropsychiatric symptoms, and depressive complaints of 127 geriatric stroke patients admitted to skilled nursing facilities with specific units for geriatric rehabilitation after stroke. Cluster analyses revealed two groups: cluster 1 included patients in poor condition upon admission (n = 52), and cluster 2 included patients in fair/good condition upon admission (n = 75). Patients in both groups improved in balance, walking abilities, and arm function. Patients in cluster 1 also improved in ADL. Depressive complaints decreased significantly in patients in cluster 1 who were discharged to an independent- or assisted-living situation. Compared to 80% of the patients in cluster 2, a lower proportion (46%) of the patients in cluster 1 were discharged to an independent- or assisted-living situation. Stroke patients referred for rehabilitation to SNFs could be clustered on the basis of their condition upon admission. Although patients in poor condition on admission were more likely to be referred to a facility for long-term care, this was certainly not the case in all patients. Almost half of them could be discharged to an independent or assisted living situation, which implied that also in patients in poor condition on admission, discharge to an independent or assisted living situation was an attainable goal. It is important to put substantial effort into the rehabilitation of patients in poor condition at admission.
Evaluating Mixture Modeling for Clustering: Recommendations and Cautions
ERIC Educational Resources Information Center
Steinley, Douglas; Brusco, Michael J.
2011-01-01
This article provides a large-scale investigation into several of the properties of mixture-model clustering techniques (also referred to as latent class cluster analysis, latent profile analysis, model-based clustering, probabilistic clustering, Bayesian classification, unsupervised learning, and finite mixture models; see Vermunt & Magdison,…
ERIC Educational Resources Information Center
DiStefano, Christine; Kamphaus, R. W.
2006-01-01
Two classification methods, latent class cluster analysis and cluster analysis, are used to identify groups of child behavioral adjustment underlying a sample of elementary school children aged 6 to 11 years. Behavioral rating information across 14 subscales was obtained from classroom teachers and used as input for analyses. Both the procedures…
Effect of sexual steroids on boar kinematic sperm subpopulations.
Ayala, E M E; Aragón, M A
2017-11-01
Here, we show the effects of sexual steroids, progesterone, testosterone, or estradiol on motility parameters of boar sperm. Sixteen commercial seminal doses, four each of four adult boars, were analyzed using computer assisted sperm analysis (CASA). Mean values of motility parameters were analyzed by bivariate and multivariate statistics. Principal component analysis (PCA), followed by hierarchical clustering, was applied on data of motility parameters, provided automatically as intervals by the CASA system. Effects of sexual steroids were described in the kinematic subpopulations identified from multivariate statistics. Mean values of motility parameters were not significantly changed after addition of sexual steroids. Multivariate graphics showed that sperm subpopulations were not sensitive to the addition of either testosterone or estradiol, but sperm subpopulations responsive to progesterone were found. Distribution of motility parameters were wide in controls but sharpened at distinct concentrations of progesterone. We conclude that kinematic sperm subpopulations responsive to progesterone are present in boar semen, and these subpopulations are masked in evaluations of mean values of motility parameters. © 2017 International Society for Advancement of Cytometry. © 2017 International Society for Advancement of Cytometry.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peng, Bo; Kowalski, Karol
In this paper we derive basic properties of the Green’s function matrix elements stemming from the exponential coupled cluster (CC) parametrization of the ground-state wave function. We demon- strate that all intermediates used to express retarded (or equivalently, ionized) part of the Green’s function in the ω-representation can be expressed through connected diagrams only. Similar proper- ties are also shared by the first order ω-derivatives of the retarded part of the CC Green’s function. This property can be extended to any order ω-derivatives of the Green’s function. Through the Dyson equation of CC Green’s function, the derivatives of corresponding CCmore » self-energy can be evaluated analytically. In analogy to the CC Green’s function, the corresponding CC self-energy is expressed in terms of connected diagrams only. Moreover, the ionized part of the CC Green’s func- tion satisfies the non-homogeneous linear system of ordinary differential equations, whose solution may be represented in the exponential form. Our analysis can be easily generalized to the advanced part of the CC Green’s function.« less
NASA Astrophysics Data System (ADS)
Micheletti, Natan; Tonini, Marj; Lane, Stuart N.
2017-02-01
Acquisition of high density point clouds using terrestrial laser scanners (TLSs) has become commonplace in geomorphic science. The derived point clouds are often interpolated onto regular grids and the grids compared to detect change (i.e. erosion and deposition/advancement movements). This procedure is necessary for some applications (e.g. digital terrain analysis), but it inevitably leads to a certain loss of potentially valuable information contained within the point clouds. In the present study, an alternative methodology for geomorphological analysis and feature detection from point clouds is proposed. It rests on the use of the Density-Based Spatial Clustering of Applications with Noise (DBSCAN), applied to TLS data for a rock glacier front slope in the Swiss Alps. The proposed methods allowed the detection and isolation of movements directly from point clouds which yield to accuracies in the following computation of volumes that depend only on the actual registered distance between points. We demonstrated that these values are more conservative than volumes computed with the traditional DEM comparison. The results are illustrated for the summer of 2015, a season of enhanced geomorphic activity associated with exceptionally high temperatures.
Cluster analysis in phenotyping a Portuguese population.
Loureiro, C C; Sa-Couto, P; Todo-Bom, A; Bousquet, J
2015-09-03
Unbiased cluster analysis using clinical parameters has identified asthma phenotypes. Adding inflammatory biomarkers to this analysis provided a better insight into the disease mechanisms. This approach has not yet been applied to asthmatic Portuguese patients. To identify phenotypes of asthma using cluster analysis in a Portuguese asthmatic population treated in secondary medical care. Consecutive patients with asthma were recruited from the outpatient clinic. Patients were optimally treated according to GINA guidelines and enrolled in the study. Procedures were performed according to a standard evaluation of asthma. Phenotypes were identified by cluster analysis using Ward's clustering method. Of the 72 patients enrolled, 57 had full data and were included for cluster analysis. Distribution was set in 5 clusters described as follows: cluster (C) 1, early onset mild allergic asthma; C2, moderate allergic asthma, with long evolution, female prevalence and mixed inflammation; C3, allergic brittle asthma in young females with early disease onset and no evidence of inflammation; C4, severe asthma in obese females with late disease onset, highly symptomatic despite low Th2 inflammation; C5, severe asthma with chronic airflow obstruction, late disease onset and eosinophilic inflammation. In our study population, the identified clusters were mainly coincident with other larger-scale cluster analysis. Variables such as age at disease onset, obesity, lung function, FeNO (Th2 biomarker) and disease severity were important for cluster distinction. Copyright © 2015. Published by Elsevier España, S.L.U.
M-DAS: System for multispectral data analysis. [in Saginaw Bay, Michigan
NASA Technical Reports Server (NTRS)
Johnson, R. H.
1975-01-01
M-DAS is a ground data processing system designed for analysis of multispectral data. M-DAS operates on multispectral data from LANDSAT, S-192, M2S and other sources in CCT form. Interactive training by operator-investigators using a variable cursor on a color display was used to derive optimum processing coefficients and data on cluster separability. An advanced multivariate normal-maximum likelihood processing algorithm was used to produce output in various formats: color-coded film images, geometrically corrected map overlays, moving displays of scene sections, coverage tabulations and categorized CCTs. The analysis procedure for M-DAS involves three phases: (1) screening and training, (2) analysis of training data to compute performance predictions and processing coefficients, and (3) processing of multichannel input data into categorized results. Typical M-DAS applications involve iteration between each of these phases. A series of photographs of the M-DAS display are used to illustrate M-DAS operation.
Airborne laser-guided imaging spectroscopy to map forest trait diversity and guide conservation.
Asner, G P; Martin, R E; Knapp, D E; Tupayachi, R; Anderson, C B; Sinca, F; Vaughn, N R; Llactayo, W
2017-01-27
Functional biogeography may bridge a gap between field-based biodiversity information and satellite-based Earth system studies, thereby supporting conservation plans to protect more species and their contributions to ecosystem functioning. We used airborne laser-guided imaging spectroscopy with environmental modeling to derive large-scale, multivariate forest canopy functional trait maps of the Peruvian Andes-to-Amazon biodiversity hotspot. Seven mapped canopy traits revealed functional variation in a geospatial pattern explained by geology, topography, hydrology, and climate. Clustering of canopy traits yielded a map of forest beta functional diversity for land-use analysis. Up to 53% of each mapped, functionally distinct forest presents an opportunity for new conservation action. Mapping functional diversity advances our understanding of the biosphere to conserve more biodiversity in the face of land use and climate change. Copyright © 2017, American Association for the Advancement of Science.
Embrace the Dark Side: Advancing the Dark Energy Survey
NASA Astrophysics Data System (ADS)
Suchyta, Eric
The Dark Energy Survey (DES) is an ongoing cosmological survey intended to study the properties of the accelerated expansion of the Universe. In this dissertation, I present work of mine that has advanced the progress of DES. First is an introduction, which explores the physics of the cosmos, as well as how DES intends to probe it. Attention is given to developing the theoretical framework cosmologists use to describe the Universe, and to explaining observational evidence which has furnished our current conception of the cosmos. Emphasis is placed on the dark sector - dark matter and dark energy - the content of the Universe not explained by the Standard Model of particle physics. As its name suggests, the Dark Energy Survey has been specially designed to measure the properties of dark energy. DES will use a combination of galaxy cluster, weak gravitational lensing, angular clustering, and supernovae measurements to derive its state of the art constraints, each of which is discussed in the text. The work described in this dissertation includes science measurements directly related to the first three of these probes. The dissertation presents my contributions to the readout and control system of the Dark Energy Camera (DECam); the name of this software is SISPI. SISPI uses client-server and publish-subscribe communication patterns to coordinate and command actions among the many hardware components of DECam - the survey instrument for DES, a 570 megapixel CCD camera, mounted at prime focus of the Blanco 4-m Telescope. The SISPI work I discuss includes coding applications for DECam's filter changer mechanism and hexapod, as well as developing the Scripts Editor, a GUI application for DECam users to edit and export observing sequence SISPI can load and execute. Next, the dissertation describes the processing of early DES data, which I contributed. This furnished the data products used in the first-completed DES science analysis, and contributed to improving the collaboration-wide treatment of the data. The science measurements themselves are also detailed. We verified DES's capabilities for performing weak lensing analyses by measuring the masses of four galaxy clusters, finding consistency with previous measurements, and utilized DECam's wide field-of-view for a photometric study of filament-like structures in the fields. Finally, my recent work with Balrog is presented. Balrog is a simulation toolkit for embedding fake objects into real survey images in order to correct for systematic biases. We have used Balrog to extend DES galaxy clustering measurements down to fainter limits than previously possible, finding results consistent with higher-resolution space-based data. The methodology used in this analysis generalizes beyond galaxy clustering alone, and promises to be useful in future imaging survey measurements.
Korfage, Ida J; Rietjens, Judith A C; Overbeek, Anouk; Jabbarian, Lea J; Billekens, Pascalle; Hammes, Bernard J; Hansen-van der Meer, Ellen; Polinder, Suzanne; Severijnen, Johan; Swart, Siebe J; Witkamp, Frederika E; van der Heide, Agnes
2015-07-22
Currently, health care and medical decision-making at the end of life for older people are often insufficiently patient-centred. In this trial we study the effects of Advance Care Planning (ACP), a formalised process of timely communication about care preferences at the end of life, for frail older people. We will conduct a cluster randomised controlled trial among older people residing in care homes or receiving home care in the Netherlands. The intervention group will receive the ACP program Respecting Choices® in addition to usual care. The control group will receive usual care only. Participants in both groups will fill out questionnaires at baseline and after 12 months. We hypothesize that ACP will lead to better patient activation in medical decision making and quality of life, while reducing the number of medical interventions and thus health care costs. Multivariate analysis will be used to compare differences between the intervention group and the control group at baseline and to compare differences in changes after 12 months following the inclusion. Our study can contribute to more understanding of the effects of ACP on patient activation and quality of life in frail older people. Further, we will gain insight in the costs and cost-effectiveness of ACP. This study will facilitate ACP policy for older people in the Netherlands. Nederlands Trial Register: NTR4454.
Phenotypes Determined by Cluster Analysis in Moderate to Severe Bronchial Asthma.
Youroukova, Vania M; Dimitrova, Denitsa G; Valerieva, Anna D; Lesichkova, Spaska S; Velikova, Tsvetelina V; Ivanova-Todorova, Ekaterina I; Tumangelova-Yuzeir, Kalina D
2017-06-01
Bronchial asthma is a heterogeneous disease that includes various subtypes. They may share similar clinical characteristics, but probably have different pathological mechanisms. To identify phenotypes using cluster analysis in moderate to severe bronchial asthma and to compare differences in clinical, physiological, immunological and inflammatory data between the clusters. Forty adult patients with moderate to severe bronchial asthma out of exacerbation were included. All underwent clinical assessment, anthropometric measurements, skin prick testing, standard spirometry and measurement fraction of exhaled nitric oxide. Blood eosinophilic count, serum total IgE and periostin levels were determined. Two-step cluster approach, hierarchical clustering method and k-mean analysis were used for identification of the clusters. We have identified four clusters. Cluster 1 (n=14) - late-onset, non-atopic asthma with impaired lung function, Cluster 2 (n=13) - late-onset, atopic asthma, Cluster 3 (n=6) - late-onset, aspirin sensitivity, eosinophilic asthma, and Cluster 4 (n=7) - early-onset, atopic asthma. Our study is the first in Bulgaria in which cluster analysis is applied to asthmatic patients. We identified four clusters. The variables with greatest force for differentiation in our study were: age of asthma onset, duration of diseases, atopy, smoking, blood eosinophils, nonsteroidal anti-inflammatory drugs hypersensitivity, baseline FEV1/FVC and symptoms severity. Our results support the concept of heterogeneity of bronchial asthma and demonstrate that cluster analysis can be an useful tool for phenotyping of disease and personalized approach to the treatment of patients.
Cross-scale analysis of cluster correspondence using different operational neighborhoods
NASA Astrophysics Data System (ADS)
Lu, Yongmei; Thill, Jean-Claude
2008-09-01
Cluster correspondence analysis examines the spatial autocorrelation of multi-location events at the local scale. This paper argues that patterns of cluster correspondence are highly sensitive to the definition of operational neighborhoods that form the spatial units of analysis. A subset of multi-location events is examined for cluster correspondence if they are associated with the same operational neighborhood. This paper discusses the construction of operational neighborhoods for cluster correspondence analysis based on the spatial properties of the underlying zoning system and the scales at which the zones are aggregated into neighborhoods. Impacts of this construction on the degree of cluster correspondence are also analyzed. Empirical analyses of cluster correspondence between paired vehicle theft and recovery locations are conducted on different zoning methods and across a series of geographic scales and the dynamics of cluster correspondence patterns are discussed.
Rodenbach, Rachel A; Brandes, Kim; Fiscella, Kevin; Kravitz, Richard L; Butow, Phyllis N; Walczak, Adam; Duberstein, Paul R; Sullivan, Peter; Hoh, Beth; Xing, Guibo; Plumb, Sandy; Epstein, Ronald M
2017-03-10
Purpose To build on results of a cluster randomized controlled trial (RCT) of a combined patient-oncologist intervention to improve communication in advanced cancer, we conducted a post hoc analysis of the patient intervention component, a previsit patient coaching session that used a question prompt list (QPL). We hypothesized that intervention-group participants would bring up more QPL-related topics, particularly prognosis-related topics, during the subsequent oncologist visit. Patients and Methods This cluster RCT with 170 patients who had advanced nonhematologic cancer (and their caregivers) recruited from practices of 24 participating oncologists in western New York. Intervention-group oncologists (n = 12) received individualized communication training; up to 10 of their patients (n = 84) received a previsit individualized communication coaching session that incorporated a QPL. Control-group oncologists (n = 12) and patients (n = 86) received no interventions. Topics of interest identified by patients during the coaching session were summarized from coaching notes; one office visit after the coaching session was audio recorded, transcribed, and analyzed by using linear regression modeling for group differences. Results Compared with controls, more than twice as many intervention-group participants brought up QPL-related topics during their office visits (70.2% v 32.6%; P < .001). Patients in the intervention group were nearly three times more likely to ask about prognosis (16.7% v 5.8%; P =.03). Of 262 topics of interest identified during coaching, 158 (60.3%) were QPL related; 20 (12.7%) addressed prognosis. Overall, patients in the intervention group brought up 82.4% of topics of interest during the office visit. Conclusion A combined coaching and QPL intervention was effective to help patients with advanced cancer and their caregivers identify and bring up topics of concern, including prognosis, during their subsequent oncologist visits. Considering that most patients are misinformed about prognosis, more intensive steps are needed to better promote such discussions.
2017-01-01
Background Palliative care planning for nursing home residents with advanced dementia is often suboptimal. This study compared effects of facilitated case conferencing (FCC) with usual care (UC) on end-of-life care. Methods A two arm parallel cluster randomised controlled trial was conducted. The sample included people with advanced dementia from 20 Australian nursing homes and their families and professional caregivers. In each intervention nursing home (n = 10), Palliative Care Planning Coordinators (PCPCs) facilitated family case conferences and trained staff in person-centred palliative care for 16 hours per week over 18 months. The primary outcome was family-rated quality of end-of-life care (End-of-Life Dementia [EOLD] Scales). Secondary outcomes included nurse-rated EOLD scales, resident quality of life (Quality of Life in Late-stage Dementia [QUALID]) and quality of care over the last month of life (pharmacological/non-pharmacological palliative strategies, hospitalization or inappropriate interventions). Results Two-hundred-eighty-six people with advanced dementia took part but only 131 died (64 in UC and 67 in FCC which was fewer than anticipated), rendering the primary analysis under-powered with no group effect seen in EOLD scales. Significant differences in pharmacological (P < 0.01) and non-pharmacological (P < 0.05) palliative management in last month of life were seen. Intercurrent illness was associated with lower family-rated EOLD Satisfaction with Care (coefficient 2.97, P < 0.05) and lower staff-rated EOLD Comfort Assessment with Dying (coefficient 4.37, P < 0.01). Per protocol analyses showed positive relationships between EOLD and staff hours to bed ratios, proportion of residents with dementia and staff attitudes. Conclusion FCC facilitates a palliative approach to care. Future trials of case conferencing should consider outcomes and processes regarding decision making and planning for anticipated events and acute illness. Trial registration Australian New Zealand Clinical Trial Registry ACTRN12612001164886 PMID:28786995
Lee, Hyokyeong; Moody-Davis, Asher; Saha, Utsab; Suzuki, Brian M; Asarnow, Daniel; Chen, Steven; Arkin, Michelle; Caffrey, Conor R; Singh, Rahul
2012-01-01
Neglected tropical diseases, especially those caused by helminths, constitute some of the most common infections of the world's poorest people. Development of techniques for automated, high-throughput drug screening against these diseases, especially in whole-organism settings, constitutes one of the great challenges of modern drug discovery. We present a method for enabling high-throughput phenotypic drug screening against diseases caused by helminths with a focus on schistosomiasis. The proposed method allows for a quantitative analysis of the systemic impact of a drug molecule on the pathogen as exhibited by the complex continuum of its phenotypic responses. This method consists of two key parts: first, biological image analysis is employed to automatically monitor and quantify shape-, appearance-, and motion-based phenotypes of the parasites. Next, we represent these phenotypes as time-series and show how to compare, cluster, and quantitatively reason about them using techniques of time-series analysis. We present results on a number of algorithmic issues pertinent to the time-series representation of phenotypes. These include results on appropriate representation of phenotypic time-series, analysis of different time-series similarity measures for comparing phenotypic responses over time, and techniques for clustering such responses by similarity. Finally, we show how these algorithmic techniques can be used for quantifying the complex continuum of phenotypic responses of parasites. An important corollary is the ability of our method to recognize and rigorously group parasites based on the variability of their phenotypic response to different drugs. The methods and results presented in this paper enable automatic and quantitative scoring of high-throughput phenotypic screens focused on helmintic diseases. Furthermore, these methods allow us to analyze and stratify parasites based on their phenotypic response to drugs. Together, these advancements represent a significant breakthrough for the process of drug discovery against schistosomiasis in particular and can be extended to other helmintic diseases which together afflict a large part of humankind.
2012-01-01
Background Neglected tropical diseases, especially those caused by helminths, constitute some of the most common infections of the world's poorest people. Development of techniques for automated, high-throughput drug screening against these diseases, especially in whole-organism settings, constitutes one of the great challenges of modern drug discovery. Method We present a method for enabling high-throughput phenotypic drug screening against diseases caused by helminths with a focus on schistosomiasis. The proposed method allows for a quantitative analysis of the systemic impact of a drug molecule on the pathogen as exhibited by the complex continuum of its phenotypic responses. This method consists of two key parts: first, biological image analysis is employed to automatically monitor and quantify shape-, appearance-, and motion-based phenotypes of the parasites. Next, we represent these phenotypes as time-series and show how to compare, cluster, and quantitatively reason about them using techniques of time-series analysis. Results We present results on a number of algorithmic issues pertinent to the time-series representation of phenotypes. These include results on appropriate representation of phenotypic time-series, analysis of different time-series similarity measures for comparing phenotypic responses over time, and techniques for clustering such responses by similarity. Finally, we show how these algorithmic techniques can be used for quantifying the complex continuum of phenotypic responses of parasites. An important corollary is the ability of our method to recognize and rigorously group parasites based on the variability of their phenotypic response to different drugs. Conclusions The methods and results presented in this paper enable automatic and quantitative scoring of high-throughput phenotypic screens focused on helmintic diseases. Furthermore, these methods allow us to analyze and stratify parasites based on their phenotypic response to drugs. Together, these advancements represent a significant breakthrough for the process of drug discovery against schistosomiasis in particular and can be extended to other helmintic diseases which together afflict a large part of humankind. PMID:22369037
An extended affinity propagation clustering method based on different data density types.
Zhao, XiuLi; Xu, WeiXiang
2015-01-01
Affinity propagation (AP) algorithm, as a novel clustering method, does not require the users to specify the initial cluster centers in advance, which regards all data points as potential exemplars (cluster centers) equally and groups the clusters totally by the similar degree among the data points. But in many cases there exist some different intensive areas within the same data set, which means that the data set does not distribute homogeneously. In such situation the AP algorithm cannot group the data points into ideal clusters. In this paper, we proposed an extended AP clustering algorithm to deal with such a problem. There are two steps in our method: firstly the data set is partitioned into several data density types according to the nearest distances of each data point; and then the AP clustering method is, respectively, used to group the data points into clusters in each data density type. Two experiments are carried out to evaluate the performance of our algorithm: one utilizes an artificial data set and the other uses a real seismic data set. The experiment results show that groups are obtained more accurately by our algorithm than OPTICS and AP clustering algorithm itself.
fast_protein_cluster: parallel and optimized clustering of large-scale protein modeling data.
Hung, Ling-Hong; Samudrala, Ram
2014-06-15
fast_protein_cluster is a fast, parallel and memory efficient package used to cluster 60 000 sets of protein models (with up to 550 000 models per set) generated by the Nutritious Rice for the World project. fast_protein_cluster is an optimized and extensible toolkit that supports Root Mean Square Deviation after optimal superposition (RMSD) and Template Modeling score (TM-score) as metrics. RMSD calculations using a laptop CPU are 60× faster than qcprot and 3× faster than current graphics processing unit (GPU) implementations. New GPU code further increases the speed of RMSD and TM-score calculations. fast_protein_cluster provides novel k-means and hierarchical clustering methods that are up to 250× and 2000× faster, respectively, than Clusco, and identify significantly more accurate models than Spicker and Clusco. fast_protein_cluster is written in C++ using OpenMP for multi-threading support. Custom streaming Single Instruction Multiple Data (SIMD) extensions and advanced vector extension intrinsics code accelerate CPU calculations, and OpenCL kernels support AMD and Nvidia GPUs. fast_protein_cluster is available under the M.I.T. license. (http://software.compbio.washington.edu/fast_protein_cluster) © The Author 2014. Published by Oxford University Press.
Scalable computing for evolutionary genomics.
Prins, Pjotr; Belhachemi, Dominique; Möller, Steffen; Smant, Geert
2012-01-01
Genomic data analysis in evolutionary biology is becoming so computationally intensive that analysis of multiple hypotheses and scenarios takes too long on a single desktop computer. In this chapter, we discuss techniques for scaling computations through parallelization of calculations, after giving a quick overview of advanced programming techniques. Unfortunately, parallel programming is difficult and requires special software design. The alternative, especially attractive for legacy software, is to introduce poor man's parallelization by running whole programs in parallel as separate processes, using job schedulers. Such pipelines are often deployed on bioinformatics computer clusters. Recent advances in PC virtualization have made it possible to run a full computer operating system, with all of its installed software, on top of another operating system, inside a "box," or virtual machine (VM). Such a VM can flexibly be deployed on multiple computers, in a local network, e.g., on existing desktop PCs, and even in the Cloud, to create a "virtual" computer cluster. Many bioinformatics applications in evolutionary biology can be run in parallel, running processes in one or more VMs. Here, we show how a ready-made bioinformatics VM image, named BioNode, effectively creates a computing cluster, and pipeline, in a few steps. This allows researchers to scale-up computations from their desktop, using available hardware, anytime it is required. BioNode is based on Debian Linux and can run on networked PCs and in the Cloud. Over 200 bioinformatics and statistical software packages, of interest to evolutionary biology, are included, such as PAML, Muscle, MAFFT, MrBayes, and BLAST. Most of these software packages are maintained through the Debian Med project. In addition, BioNode contains convenient configuration scripts for parallelizing bioinformatics software. Where Debian Med encourages packaging free and open source bioinformatics software through one central project, BioNode encourages creating free and open source VM images, for multiple targets, through one central project. BioNode can be deployed on Windows, OSX, Linux, and in the Cloud. Next to the downloadable BioNode images, we provide tutorials online, which empower bioinformaticians to install and run BioNode in different environments, as well as information for future initiatives, on creating and building such images.
2010-01-01
Background Cluster analysis, and in particular hierarchical clustering, is widely used to extract information from gene expression data. The aim is to discover new classes, or sub-classes, of either individuals or genes. Performing a cluster analysis commonly involve decisions on how to; handle missing values, standardize the data and select genes. In addition, pre-processing, involving various types of filtration and normalization procedures, can have an effect on the ability to discover biologically relevant classes. Here we consider cluster analysis in a broad sense and perform a comprehensive evaluation that covers several aspects of cluster analyses, including normalization. Result We evaluated 2780 cluster analysis methods on seven publicly available 2-channel microarray data sets with common reference designs. Each cluster analysis method differed in data normalization (5 normalizations were considered), missing value imputation (2), standardization of data (2), gene selection (19) or clustering method (11). The cluster analyses are evaluated using known classes, such as cancer types, and the adjusted Rand index. The performances of the different analyses vary between the data sets and it is difficult to give general recommendations. However, normalization, gene selection and clustering method are all variables that have a significant impact on the performance. In particular, gene selection is important and it is generally necessary to include a relatively large number of genes in order to get good performance. Selecting genes with high standard deviation or using principal component analysis are shown to be the preferred gene selection methods. Hierarchical clustering using Ward's method, k-means clustering and Mclust are the clustering methods considered in this paper that achieves the highest adjusted Rand. Normalization can have a significant positive impact on the ability to cluster individuals, and there are indications that background correction is preferable, in particular if the gene selection is successful. However, this is an area that needs to be studied further in order to draw any general conclusions. Conclusions The choice of cluster analysis, and in particular gene selection, has a large impact on the ability to cluster individuals correctly based on expression profiles. Normalization has a positive effect, but the relative performance of different normalizations is an area that needs more research. In summary, although clustering, gene selection and normalization are considered standard methods in bioinformatics, our comprehensive analysis shows that selecting the right methods, and the right combinations of methods, is far from trivial and that much is still unexplored in what is considered to be the most basic analysis of genomic data. PMID:20937082
Modest validity and fair reproducibility of dietary patterns derived by cluster analysis.
Funtikova, Anna N; Benítez-Arciniega, Alejandra A; Fitó, Montserrat; Schröder, Helmut
2015-03-01
Cluster analysis is widely used to analyze dietary patterns. We aimed to analyze the validity and reproducibility of the dietary patterns defined by cluster analysis derived from a food frequency questionnaire (FFQ). We hypothesized that the dietary patterns derived by cluster analysis have fair to modest reproducibility and validity. Dietary data were collected from 107 individuals from population-based survey, by an FFQ at baseline (FFQ1) and after 1 year (FFQ2), and by twelve 24-hour dietary recalls (24-HDR). Repeatability and validity were measured by comparing clusters obtained by the FFQ1 and FFQ2 and by the FFQ2 and 24-HDR (reference method), respectively. Cluster analysis identified a "fruits & vegetables" and a "meat" pattern in each dietary data source. Cluster membership was concordant for 66.7% of participants in FFQ1 and FFQ2 (reproducibility), and for 67.0% in FFQ2 and 24-HDR (validity). Spearman correlation analysis showed reasonable reproducibility, especially in the "fruits & vegetables" pattern, and lower validity also especially in the "fruits & vegetables" pattern. κ statistic revealed a fair validity and reproducibility of clusters. Our findings indicate a reasonable reproducibility and fair to modest validity of dietary patterns derived by cluster analysis. Copyright © 2015 Elsevier Inc. All rights reserved.
Cluster Analysis to Identify Possible Subgroups in Tinnitus Patients.
van den Berge, Minke J C; Free, Rolien H; Arnold, Rosemarie; de Kleine, Emile; Hofman, Rutger; van Dijk, J Marc C; van Dijk, Pim
2017-01-01
In tinnitus treatment, there is a tendency to shift from a "one size fits all" to a more individual, patient-tailored approach. Insight in the heterogeneity of the tinnitus spectrum might improve the management of tinnitus patients in terms of choice of treatment and identification of patients with severe mental distress. The goal of this study was to identify subgroups in a large group of tinnitus patients. Data were collected from patients with severe tinnitus complaints visiting our tertiary referral tinnitus care group at the University Medical Center Groningen. Patient-reported and physician-reported variables were collected during their visit to our clinic. Cluster analyses were used to characterize subgroups. For the selection of the right variables to enter in the cluster analysis, two approaches were used: (1) variable reduction with principle component analysis and (2) variable selection based on expert opinion. Various variables of 1,783 tinnitus patients were included in the analyses. Cluster analysis (1) included 976 patients and resulted in a four-cluster solution. The effect of external influences was the most discriminative between the groups, or clusters, of patients. The "silhouette measure" of the cluster outcome was low (0.2), indicating a "no substantial" cluster structure. Cluster analysis (2) included 761 patients and resulted in a three-cluster solution, comparable to the first analysis. Again, a "no substantial" cluster structure was found (0.2). Two cluster analyses on a large database of tinnitus patients revealed that clusters of patients are mostly formed by a different response of external influences on their disease. However, both cluster outcomes based on this dataset showed a poor stability, suggesting that our tinnitus population comprises a continuum rather than a number of clearly defined subgroups.
Ecological tolerances of Miocene larger benthic foraminifera from Indonesia
NASA Astrophysics Data System (ADS)
Novak, Vibor; Renema, Willem
2018-01-01
To provide a comprehensive palaeoenvironmental reconstruction based on larger benthic foraminifera (LBF), a quantitative analysis of their assemblage composition is needed. Besides microfacies analysis which includes environmental preferences of foraminiferal taxa, statistical analyses should also be employed. Therefore, detrended correspondence analysis and cluster analysis were performed on relative abundance data of identified LBF assemblages deposited in mixed carbonate-siliciclastic (MCS) systems and blue-water (BW) settings. Studied MCS system localities include ten sections from the central part of the Kutai Basin in East Kalimantan, ranging from late Burdigalian to Serravallian age. The BW samples were collected from eleven sections of the Bulu Formation on Central Java, dated as Serravallian. Results from detrended correspondence analysis reveal significant differences between these two environmental settings. Cluster analysis produced five clusters of samples; clusters 1 and 2 comprise dominantly MCS samples, clusters 3 and 4 with dominance of BW samples, and cluster 5 showing a mixed composition with both MCS and BW samples. The results of cluster analysis were afterwards subjected to indicator species analysis resulting in the interpretation that generated three groups among LBF taxa: typical assemblage indicators, regularly occurring taxa and rare taxa. By interpreting the results of detrended correspondence analysis, cluster analysis and indicator species analysis, along with environmental preferences of identified LBF taxa, a palaeoenvironmental model is proposed for the distribution of LBF in Miocene MCS systems and adjacent BW settings of Indonesia.
Rennard, Stephen I; Locantore, Nicholas; Delafont, Bruno; Tal-Singer, Ruth; Silverman, Edwin K; Vestbo, Jørgen; Miller, Bruce E; Bakke, Per; Celli, Bartolomé; Calverley, Peter M A; Coxson, Harvey; Crim, Courtney; Edwards, Lisa D; Lomas, David A; MacNee, William; Wouters, Emiel F M; Yates, Julie C; Coca, Ignacio; Agustí, Alvar
2015-03-01
Chronic obstructive pulmonary disease (COPD) is a heterogeneous disease that likely includes clinically relevant subgroups. To identify subgroups of COPD in ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints) subjects using cluster analysis and to assess clinically meaningful outcomes of the clusters during 3 years of longitudinal follow-up. Factor analysis was used to reduce 41 variables determined at recruitment in 2,164 patients with COPD to 13 main factors, and the variables with the highest loading were used for cluster analysis. Clusters were evaluated for their relationship with clinically meaningful outcomes during 3 years of follow-up. The relationships among clinical parameters were evaluated within clusters. Five subgroups were distinguished using cross-sectional clinical features. These groups differed regarding outcomes. Cluster A included patients with milder disease and had fewer deaths and hospitalizations. Cluster B had less systemic inflammation at baseline but had notable changes in health status and emphysema extent. Cluster C had many comorbidities, evidence of systemic inflammation, and the highest mortality. Cluster D had low FEV1, severe emphysema, and the highest exacerbation and COPD hospitalization rate. Cluster E was intermediate for most variables and may represent a mixed group that includes further clusters. The relationships among clinical variables within clusters differed from that in the entire COPD population. Cluster analysis using baseline data in ECLIPSE identified five COPD subgroups that differ in outcomes and inflammatory biomarkers and show different relationships between clinical parameters, suggesting the clusters represent clinically and biologically different subtypes of COPD.
Interactive visual exploration and refinement of cluster assignments.
Kern, Michael; Lex, Alexander; Gehlenborg, Nils; Johnson, Chris R
2017-09-12
With ever-increasing amounts of data produced in biology research, scientists are in need of efficient data analysis methods. Cluster analysis, combined with visualization of the results, is one such method that can be used to make sense of large data volumes. At the same time, cluster analysis is known to be imperfect and depends on the choice of algorithms, parameters, and distance measures. Most clustering algorithms don't properly account for ambiguity in the source data, as records are often assigned to discrete clusters, even if an assignment is unclear. While there are metrics and visualization techniques that allow analysts to compare clusterings or to judge cluster quality, there is no comprehensive method that allows analysts to evaluate, compare, and refine cluster assignments based on the source data, derived scores, and contextual data. In this paper, we introduce a method that explicitly visualizes the quality of cluster assignments, allows comparisons of clustering results and enables analysts to manually curate and refine cluster assignments. Our methods are applicable to matrix data clustered with partitional, hierarchical, and fuzzy clustering algorithms. Furthermore, we enable analysts to explore clustering results in context of other data, for example, to observe whether a clustering of genomic data results in a meaningful differentiation in phenotypes. Our methods are integrated into Caleydo StratomeX, a popular, web-based, disease subtype analysis tool. We show in a usage scenario that our approach can reveal ambiguities in cluster assignments and produce improved clusterings that better differentiate genotypes and phenotypes.
Integrative Data Analysis of Multi-Platform Cancer Data with a Multimodal Deep Learning Approach.
Liang, Muxuan; Li, Zhizhong; Chen, Ting; Zeng, Jianyang
2015-01-01
Identification of cancer subtypes plays an important role in revealing useful insights into disease pathogenesis and advancing personalized therapy. The recent development of high-throughput sequencing technologies has enabled the rapid collection of multi-platform genomic data (e.g., gene expression, miRNA expression, and DNA methylation) for the same set of tumor samples. Although numerous integrative clustering approaches have been developed to analyze cancer data, few of them are particularly designed to exploit both deep intrinsic statistical properties of each input modality and complex cross-modality correlations among multi-platform input data. In this paper, we propose a new machine learning model, called multimodal deep belief network (DBN), to cluster cancer patients from multi-platform observation data. In our integrative clustering framework, relationships among inherent features of each single modality are first encoded into multiple layers of hidden variables, and then a joint latent model is employed to fuse common features derived from multiple input modalities. A practical learning algorithm, called contrastive divergence (CD), is applied to infer the parameters of our multimodal DBN model in an unsupervised manner. Tests on two available cancer datasets show that our integrative data analysis approach can effectively extract a unified representation of latent features to capture both intra- and cross-modality correlations, and identify meaningful disease subtypes from multi-platform cancer data. In addition, our approach can identify key genes and miRNAs that may play distinct roles in the pathogenesis of different cancer subtypes. Among those key miRNAs, we found that the expression level of miR-29a is highly correlated with survival time in ovarian cancer patients. These results indicate that our multimodal DBN based data analysis approach may have practical applications in cancer pathogenesis studies and provide useful guidelines for personalized cancer therapy.
Somatotyping using 3D anthropometry: a cluster analysis.
Olds, Tim; Daniell, Nathan; Petkov, John; David Stewart, Arthur
2013-01-01
Somatotyping is the quantification of human body shape, independent of body size. Hitherto, somatotyping (including the most popular method, the Heath-Carter system) has been based on subjective visual ratings, sometimes supported by surface anthropometry. This study used data derived from three-dimensional (3D) whole-body scans as inputs for cluster analysis to objectively derive clusters of similar body shapes. Twenty-nine dimensions normalised for body size were measured on a purposive sample of 301 adults aged 17-56 years who had been scanned using a Vitus Smart laser scanner. K-means Cluster Analysis with v-fold cross-validation was used to determine shape clusters. Three male and three female clusters emerged, and were visualised using those scans closest to the cluster centroid and a caricature defined by doubling the difference between the average scan and the cluster centroid. The male clusters were decidedly endomorphic (high fatness), ectomorphic (high linearity), and endo-mesomorphic (a mixture of fatness and muscularity). The female clusters were clearly endomorphic, ectomorphic, and the ecto-mesomorphic (a mixture of linearity and muscularity). An objective shape quantification procedure combining 3D scanning and cluster analysis yielded shape clusters strikingly similar to traditional somatotyping.
Clusters of Occupations Based on Systematically Derived Work Dimensions: An Exploratory Study.
ERIC Educational Resources Information Center
Cunningham, J. W.; And Others
The study explored the feasibility of deriving an educationally relevant occupational cluster structure based on Occupational Analysis Inventory (OAI) work dimensions. A hierarchical cluster analysis was applied to the factor score profiles of 814 occupations on 22 higher-order OAI work dimensions. From that analysis, 73 occupational clusters were…
Using cluster analysis to identify phenotypes and validation of mortality in men with COPD.
Chen, Chiung-Zuei; Wang, Liang-Yi; Ou, Chih-Ying; Lee, Cheng-Hung; Lin, Chien-Chung; Hsiue, Tzuen-Ren
2014-12-01
Cluster analysis has been proposed to examine phenotypic heterogeneity in chronic obstructive pulmonary disease (COPD). The aim of this study was to use cluster analysis to define COPD phenotypes and validate them by assessing their relationship with mortality. Male subjects with COPD were recruited to identify and validate COPD phenotypes. Seven variables were assessed for their relevance to COPD, age, FEV(1) % predicted, BMI, history of severe exacerbations, mMRC, SpO(2), and Charlson index. COPD groups were identified by cluster analysis and validated prospectively against mortality during a 4-year follow-up. Analysis of 332 COPD subjects identified five clusters from cluster A to cluster E. Assessment of the predictive validity of these clusters of COPD showed that cluster E patients had higher all cause mortality (HR 18.3, p < 0.0001), and respiratory cause mortality (HR 21.5, p < 0.0001) than those in the other four groups. Cluster E patients also had higher all cause mortality (HR 14.3, p = 0.0002) and respiratory cause mortality (HR 10.1, p = 0.0013) than patients in cluster D alone. COPD patient with severe airflow limitation, many symptoms, and a history of frequent severe exacerbations was a novel and distinct clinical phenotype predicting mortality in men with COPD.
Star Formation in Nearby Clusters (SFiNCs)
NASA Astrophysics Data System (ADS)
Getman, Konstantin
Most stars form in clusters that rapidly disperse, yet we have a poor understanding of the processes of cluster formation and early evolution. Do clusters form `top-down', rapidly in a dense molecular cloud core? Or, since clouds are turbulent, do clusters form `bottomup' by merging subclusters produced in small kinematically-distinct molecular structures? Do clusters principally form in elongated molecular structures such as Infrared Dark Clouds and Herschel filaments? One of the central reasons for slow progress in resolving these questions is the lack of homogeneous and reliable census of stellar members (both disk-bearing and disk-free) for a wide range of star forming environments. To address these issues we are now completing our major effort, called MYStIX (Massive Young Star-Forming Complex Study in Infrared and X-ray). It combines the Chandra archive with UKIRT+2MASS near-infrared and Spitzer mid-infrared surveys to identify young stellar objects in a wide range of evolutionary stages, from protostars to disk-free pre-main sequence stars, in 20 star forming regions at distances from 0.4 to 3.6 kpc. Each MYStIX region was chosen to have a rich OB-dominated cluster. Started in 2009 with NASA/ADAP and NSF funding, MYStIX has emerged with 8 technical/catalog and the first 4 of a series of science papers (http://astro.psu.edu/mystix). Early MYStIX results include: demonstration of diverse morphologies of young clusters from simple ellipsoids to elongated, clumpy substructures; demonstration of spatio-age gradients across star formation regions; the discovery of core-halo age gradients within two rich nearby MYStIX clusters; and the discovery of important astrophysically empirical correlations among different subcluster properties such as age, absorption, core radius, central stellar density, and total intrinsic population. The early MYStIX result provide new observational evidence for subcluster merging and cluster expansion following gas dissipation. We propose here to extend the MYStIX effort to an archive study of 19 nearer and smaller star forming regions where the stellar clusters are dominated by a single late-OB star rather than numerous O stars as in the MYStIX fields. We call this project `Star Formation in Nearby Clusters' or SFiNCs (homophonic with `sphinx'). With a homogeneous analysis of the Chandra, 2MASS, Spitzer and Herschel archives, we expect to identify and characterize over 50 SFiNCs subclusters. The inferred empirical correlations among different cluster properties for nearly 200 SFiNCs+MYStIX subclusters with 30-3000 detected stars on scales of 0.1-20 pc will allow, for the first time, direct comparison with the results of theoretical simulations of cluster formation to seek deeper answers to the fundamental questions posed above. It is possible, for example, that smaller molecular clouds have less turbulence and thus produce small clusters in a single event rather than through subcluster mergers. Models based on meteoritic isotopes suggest that our Solar System formed in a complex of SFiNCs/MYStIX-like clusters (Gounelle & Meynet 2012, A&A, 545, 4). This project addresses NASA SMD Strategic Subgoals 3C (Advance scientific knowledge of the origin and history of the solar system) and 3D.3 (Understand how individual stars form and how those processes ultimately affect the formation of planetary systems). It lies in the `Star formation and pre-main sequence stars' Research Area of the Astrophysics Data Analysis program.
Borri, Marco; Schmidt, Maria A.; Powell, Ceri; Koh, Dow-Mu; Riddell, Angela M.; Partridge, Mike; Bhide, Shreerang A.; Nutting, Christopher M.; Harrington, Kevin J.; Newbold, Katie L.; Leach, Martin O.
2015-01-01
Purpose To describe a methodology, based on cluster analysis, to partition multi-parametric functional imaging data into groups (or clusters) of similar functional characteristics, with the aim of characterizing functional heterogeneity within head and neck tumour volumes. To evaluate the performance of the proposed approach on a set of longitudinal MRI data, analysing the evolution of the obtained sub-sets with treatment. Material and Methods The cluster analysis workflow was applied to a combination of dynamic contrast-enhanced and diffusion-weighted imaging MRI data from a cohort of squamous cell carcinoma of the head and neck patients. Cumulative distributions of voxels, containing pre and post-treatment data and including both primary tumours and lymph nodes, were partitioned into k clusters (k = 2, 3 or 4). Principal component analysis and cluster validation were employed to investigate data composition and to independently determine the optimal number of clusters. The evolution of the resulting sub-regions with induction chemotherapy treatment was assessed relative to the number of clusters. Results The clustering algorithm was able to separate clusters which significantly reduced in voxel number following induction chemotherapy from clusters with a non-significant reduction. Partitioning with the optimal number of clusters (k = 4), determined with cluster validation, produced the best separation between reducing and non-reducing clusters. Conclusion The proposed methodology was able to identify tumour sub-regions with distinct functional properties, independently separating clusters which were affected differently by treatment. This work demonstrates that unsupervised cluster analysis, with no prior knowledge of the data, can be employed to provide a multi-parametric characterization of functional heterogeneity within tumour volumes. PMID:26398888
Romano, G
2017-09-01
Certain malignant cells may detach from the primary tumor and enter the vascular system, forming so-called circulating tumor cells (CTCs). Clusters of malignant cells associated with other cell types can also be observed in the peripheral blood of oncological patients. Such cell clusters are termed circulating tumor microemboli (CTM). The isolation and quantification of CTCs and/or CTM from blood samples allow for an accurate prognosis of the clinical course of the disease and to monitor the response to therapy. Current protocols rely on epithelial markers for the isolation of CTCs and/or CTM from hematopoietic cells. However, epithelial markers may be silenced during the progression of the epithelial-mesenchymal transition, which regulates the detachment and migration of malignant cells from the primary tumor. This review summarizes the achievements and challenges of various modalities for the isolation, enrichment, analysis and enumeration of CTCs and/or CTM, in order to assess the advancement of the disease and the response to therapy.
Lu, Chenyang; Niu, Liangliang; Chen, Nanjun; ...
2016-12-15
A grand challenge in material science is to understand the correlation between intrinsic properties and defect dynamics. Radiation tolerant materials are in great demand for safe operation and advancement of nuclear and aerospace systems. Unlike traditional approaches that rely on microstructural and nanoscale features to mitigate radiation damage, this study demonstrates enhancement of radiation tolerance with the suppression of void formation by two orders magnitude at elevated temperatures in equiatomic single-phase concentrated solid solution alloys, and more importantly, reveals its controlling mechanism through a detailed analysis of the depth distribution of defect clusters and an atomistic computer simulation. The enhancedmore » swelling resistance is attributed to the tailored interstitial defect cluster motion in the alloys from a long-range one-dimensional mode to a short-range three-dimensional mode, which leads to enhanced point defect recombination. Finally, the results suggest design criteria for next generation radiation tolerant structural alloys.« less
clusterProfiler: an R package for comparing biological themes among gene clusters.
Yu, Guangchuang; Wang, Li-Gen; Han, Yanyan; He, Qing-Yu
2012-05-01
Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently, clusterProfiler supports three species, including humans, mice, and yeast. Methods provided in this package can be easily extended to other species and ontologies. The clusterProfiler package is released under Artistic-2.0 License within Bioconductor project. The source code and vignette are freely available at http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html.
Generalized fuzzy C-means clustering algorithm with improved fuzzy partitions.
Zhu, Lin; Chung, Fu-Lai; Wang, Shitong
2009-06-01
The fuzziness index m has important influence on the clustering result of fuzzy clustering algorithms, and it should not be forced to fix at the usual value m = 2. In view of its distinctive features in applications and its limitation in having m = 2 only, a recent advance of fuzzy clustering called fuzzy c-means clustering with improved fuzzy partitions (IFP-FCM) is extended in this paper, and a generalized algorithm called GIFP-FCM for more effective clustering is proposed. By introducing a novel membership constraint function, a new objective function is constructed, and furthermore, GIFP-FCM clustering is derived. Meanwhile, from the viewpoints of L(p) norm distance measure and competitive learning, the robustness and convergence of the proposed algorithm are analyzed. Furthermore, the classical fuzzy c-means algorithm (FCM) and IFP-FCM can be taken as two special cases of the proposed algorithm. Several experimental results including its application to noisy image texture segmentation are presented to demonstrate its average advantage over FCM and IFP-FCM in both clustering and robustness capabilities.
Clinical Characteristics of Exacerbation-Prone Adult Asthmatics Identified by Cluster Analysis.
Kim, Mi Ae; Shin, Seung Woo; Park, Jong Sook; Uh, Soo Taek; Chang, Hun Soo; Bae, Da Jeong; Cho, You Sook; Park, Hae Sim; Yoon, Ho Joo; Choi, Byoung Whui; Kim, Yong Hoon; Park, Choon Sik
2017-11-01
Asthma is a heterogeneous disease characterized by various types of airway inflammation and obstruction. Therefore, it is classified into several subphenotypes, such as early-onset atopic, obese non-eosinophilic, benign, and eosinophilic asthma, using cluster analysis. A number of asthmatics frequently experience exacerbation over a long-term follow-up period, but the exacerbation-prone subphenotype has rarely been evaluated by cluster analysis. This prompted us to identify clusters reflecting asthma exacerbation. A uniform cluster analysis method was applied to 259 adult asthmatics who were regularly followed-up for over 1 year using 12 variables, selected on the basis of their contribution to asthma phenotypes. After clustering, clinical profiles and exacerbation rates during follow-up were compared among the clusters. Four subphenotypes were identified: cluster 1 was comprised of patients with early-onset atopic asthma with preserved lung function, cluster 2 late-onset non-atopic asthma with impaired lung function, cluster 3 early-onset atopic asthma with severely impaired lung function, and cluster 4 late-onset non-atopic asthma with well-preserved lung function. The patients in clusters 2 and 3 were identified as exacerbation-prone asthmatics, showing a higher risk of asthma exacerbation. Two different phenotypes of exacerbation-prone asthma were identified among Korean asthmatics using cluster analysis; both were characterized by impaired lung function, but the age at asthma onset and atopic status were different between the two. Copyright © 2017 The Korean Academy of Asthma, Allergy and Clinical Immunology · The Korean Academy of Pediatric Allergy and Respiratory Disease
2013-01-01
Background The body of disease mutations with known phenotypic relevance continues to increase and is expected to do so even faster with the advent of new experimental techniques such as whole-genome sequencing coupled with disease association studies. However, genomic association studies are limited by the molecular complexity of the phenotype being studied and the population size needed to have adequate statistical power. One way to circumvent this problem, which is critical for the study of rare diseases, is to study the molecular patterns emerging from functional studies of existing disease mutations. Current gene-centric analyses to study mutations in coding regions are limited by their inability to account for the functional modularity of the protein. Previous studies of the functional patterns of known human disease mutations have shown a significant tendency to cluster at protein domain positions, namely position-based domain hotspots of disease mutations. However, the limited number of known disease mutations remains the main factor hindering the advancement of mutation studies at a functional level. In this paper, we address this problem by incorporating mutations known to be disruptive of phenotypes in other species. Focusing on two evolutionarily distant organisms, human and yeast, we describe the first inter-species analysis of mutations of phenotypic relevance at the protein domain level. Results The results of this analysis reveal that phenotypic mutations from yeast cluster at specific positions on protein domains, a characteristic previously revealed to be displayed by human disease mutations. We found over one hundred domain hotspots in yeast with approximately 50% in the exact same domain position as known human disease mutations. Conclusions We describe an analysis using protein domains as a framework for transferring functional information by studying domain hotspots in human and yeast and relating phenotypic changes in yeast to diseases in human. This first-of-a-kind study of phenotypically relevant yeast mutations in relation to human disease mutations demonstrates the utility of a multi-species analysis for advancing the understanding of the relationship between genetic mutations and phenotypic changes at the organismal level. PMID:23819456
Multicolor CCD photometry of the open cluster NGC 752
NASA Astrophysics Data System (ADS)
Bartašiūtė, Stanislava; Janusz, Robert; Boyle, Richard P.; Philip, A. G. Davis; Deveikis, Viktoras
2010-01-01
We obtained CCD observations of the open cluster NGC 752 with the 1.8m Vatican Advanced Technology Telescope (Mt. Graham, Arizona) with a 4K CCD camera and eight intermediate-band filters of the Stromvil (Strömgren + Vilnius) system. Four 12‧ × 12‧ fields were observed, covering the central part of the cluster. The good-quality multicolor data made it possible to obtain precise estimates of distance moduli, metallicity and foreground reddening for individual stars down to the limiting magnitude, V = 17.5, enabling photometric identification of faint cluster members. The new observations provide an extension of the lower main sequence to three magnitudes beyond the previous (photographic) limit. A relatively small number of photometric members identified at fainter magnitudes seems to be indicative of actual dissolution of the cluster from the low-mass end.
Artim-Esen, Bahar; Çene, Erhan; Şahinkaya, Yasemin; Ertan, Semra; Pehlivan, Özlem; Kamali, Sevil; Gül, Ahmet; Öcal, Lale; Aral, Orhan; Inanç, Murat
2014-07-01
Associations between autoantibodies and clinical features have been described in systemic lupus erythematosus (SLE). Herein, we aimed to define autoantibody clusters and their clinical correlations in a large cohort of patients with SLE. We analyzed 852 patients with SLE who attended our clinic. Seven autoantibodies were selected for cluster analysis: anti-DNA, anti-Sm, anti-RNP, anticardiolipin (aCL) immunoglobulin (Ig)G or IgM, lupus anticoagulant (LAC), anti-Ro, and anti-La. Two-step clustering and Kaplan-Meier survival analyses were used. Five clusters were identified. A cluster consisted of patients with only anti-dsDNA antibodies, a cluster of anti-Sm and anti-RNP, a cluster of aCL IgG/M and LAC, and a cluster of anti-Ro and anti-La antibodies. Analysis revealed 1 more cluster that consisted of patients who did not belong to any of the clusters formed by antibodies chosen for cluster analysis. Sm/RNP cluster had significantly higher incidence of pulmonary hypertension and Raynaud phenomenon. DsDNA cluster had the highest incidence of renal involvement. In the aCL/LAC cluster, there were significantly more patients with neuropsychiatric involvement, antiphospholipid syndrome, autoimmune hemolytic anemia, and thrombocytopenia. According to the Systemic Lupus International Collaborating Clinics damage index, the highest frequency of damage was in the aCL/LAC cluster. Comparison of 10 and 20 years survival showed reduced survival in the aCL/LAC cluster. This study supports the existence of autoantibody clusters with distinct clinical features in SLE and shows that forming clinical subsets according to autoantibody clusters may be useful in predicting the outcome of the disease. Autoantibody clusters in SLE may exhibit differences according to the clinical setting or population.
ERIC Educational Resources Information Center
Hegedus, Stephen J.; Dalton, Sara; Tapper, John R.
2015-01-01
We report on two large studies conducted in advanced algebra classrooms in the US, which evaluated the effect of replacing traditional algebra 2 curriculum with an integrated suite of dynamic interactive software, wireless networks and technology-enhanced curriculum on student learning. The first study was a cluster randomized trial and the second…
Is It Feasible to Identify Natural Clusters of TSC-Associated Neuropsychiatric Disorders (TAND)?
Leclezio, Loren; Gardner-Lubbe, Sugnet; de Vries, Petrus J
2018-04-01
Tuberous sclerosis complex (TSC) is a genetic disorder with multisystem involvement. The lifetime prevalence of TSC-Associated Neuropsychiatric Disorders (TAND) is in the region of 90% in an apparently unique, individual pattern. This "uniqueness" poses significant challenges for diagnosis, psycho-education, and intervention planning. To date, no studies have explored whether there may be natural clusters of TAND. The purpose of this feasibility study was (1) to investigate the practicability of identifying natural TAND clusters, and (2) to identify appropriate multivariate data analysis techniques for larger-scale studies. TAND Checklist data were collected from 56 individuals with a clinical diagnosis of TSC (n = 20 from South Africa; n = 36 from Australia). Using R, the open-source statistical platform, mean squared contingency coefficients were calculated to produce a correlation matrix, and various cluster analyses and exploratory factor analysis were examined. Ward's method rendered six TAND clusters with good face validity and significant convergence with a six-factor exploratory factor analysis solution. The "bottom-up" data-driven strategies identified a "scholastic" cluster of TAND manifestations, an "autism spectrum disorder-like" cluster, a "dysregulated behavior" cluster, a "neuropsychological" cluster, a "hyperactive/impulsive" cluster, and a "mixed/mood" cluster. These feasibility results suggest that a combination of cluster analysis and exploratory factor analysis methods may be able to identify clinically meaningful natural TAND clusters. Findings require replication and expansion in larger dataset, and could include quantification of cluster or factor scores at an individual level. Copyright © 2018 Elsevier Inc. All rights reserved.
Psychosocial Costs of Racism to Whites: Exploring Patterns through Cluster Analysis
ERIC Educational Resources Information Center
Spanierman, Lisa B.; Poteat, V. Paul; Beer, Amanda M.; Armstrong, Patrick Ian
2006-01-01
Participants (230 White college students) completed the Psychosocial Costs of Racism to Whites (PCRW) Scale. Using cluster analysis, we identified 5 distinct cluster groups on the basis of PCRW subscale scores: the unempathic and unaware cluster contained the lowest empathy scores; the insensitive and afraid cluster consisted of low empathy and…
Allergen Sensitization Pattern by Sex: A Cluster Analysis in Korea.
Ohn, Jungyoon; Paik, Seung Hwan; Doh, Eun Jin; Park, Hyun-Sun; Yoon, Hyun-Sun; Cho, Soyun
2017-12-01
Allergens tend to sensitize simultaneously. Etiology of this phenomenon has been suggested to be allergen cross-reactivity or concurrent exposure. However, little is known about specific allergen sensitization patterns. To investigate the allergen sensitization characteristics according to gender. Multiple allergen simultaneous test (MAST) is widely used as a screening tool for detecting allergen sensitization in dermatologic clinics. We retrospectively reviewed the medical records of patients with MAST results between 2008 and 2014 in our Department of Dermatology. A cluster analysis was performed to elucidate the allergen-specific immunoglobulin (Ig)E cluster pattern. The results of MAST (39 allergen-specific IgEs) from 4,360 cases were analyzed. By cluster analysis, 39items were grouped into 8 clusters. Each cluster had characteristic features. When compared with female, the male group tended to be sensitized more frequently to all tested allergens, except for fungus allergens cluster. The cluster and comparative analysis results demonstrate that the allergen sensitization is clustered, manifesting allergen similarity or co-exposure. Only the fungus cluster allergens tend to sensitize female group more frequently than male group.
Adherens Junctions Revisualized: Organizing Cadherins as Nanoassemblies.
Yap, Alpha S; Gomez, Guillermo A; Parton, Robert G
2015-10-12
This Perspective considers how classical cadherin cell-cell adhesion receptors are organized at the nanoscale to generate lateral clusters. Recent advances in optical microscopy reveal that clustering constitutes a general feature of cadherin organization, but one that takes diverse forms. Here we consider the molecular mechanisms responsible for cadherin clustering and their functional implications. We frame our discussion in light of what is known about how nanoscale organization is conferred upon the plasma membrane, through protein-protein interactions, regulation of the cortical actin cytoskeleton, and the lipid environment of the membrane. Copyright © 2015 Elsevier Inc. All rights reserved.
Orbit Clustering Based on Transfer Cost
NASA Technical Reports Server (NTRS)
Gustafson, Eric D.; Arrieta-Camacho, Juan J.; Petropoulos, Anastassios E.
2013-01-01
We propose using cluster analysis to perform quick screening for combinatorial global optimization problems. The key missing component currently preventing cluster analysis from use in this context is the lack of a useable metric function that defines the cost to transfer between two orbits. We study several proposed metrics and clustering algorithms, including k-means and the expectation maximization algorithm. We also show that proven heuristic methods such as the Q-law can be modified to work with cluster analysis.
A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters
Wang, Zhihao; Yi, Jing
2016-01-01
For the shortcoming of fuzzy c-means algorithm (FCM) needing to know the number of clusters in advance, this paper proposed a new self-adaptive method to determine the optimal number of clusters. Firstly, a density-based algorithm was put forward. The algorithm, according to the characteristics of the dataset, automatically determined the possible maximum number of clusters instead of using the empirical rule n and obtained the optimal initial cluster centroids, improving the limitation of FCM that randomly selected cluster centroids lead the convergence result to the local minimum. Secondly, this paper, by introducing a penalty function, proposed a new fuzzy clustering validity index based on fuzzy compactness and separation, which ensured that when the number of clusters verged on that of objects in the dataset, the value of clustering validity index did not monotonically decrease and was close to zero, so that the optimal number of clusters lost robustness and decision function. Then, based on these studies, a self-adaptive FCM algorithm was put forward to estimate the optimal number of clusters by the iterative trial-and-error process. At last, experiments were done on the UCI, KDD Cup 1999, and synthetic datasets, which showed that the method not only effectively determined the optimal number of clusters, but also reduced the iteration of FCM with the stable clustering result. PMID:28042291
Lei, Yang; Yu, Dai; Bin, Zhang; Yang, Yang
2017-01-01
Clustering algorithm as a basis of data analysis is widely used in analysis systems. However, as for the high dimensions of the data, the clustering algorithm may overlook the business relation between these dimensions especially in the medical fields. As a result, usually the clustering result may not meet the business goals of the users. Then, in the clustering process, if it can combine the knowledge of the users, that is, the doctor's knowledge or the analysis intent, the clustering result can be more satisfied. In this paper, we propose an interactive K -means clustering method to improve the user's satisfactions towards the result. The core of this method is to get the user's feedback of the clustering result, to optimize the clustering result. Then, a particle swarm optimization algorithm is used in the method to optimize the parameters, especially the weight settings in the clustering algorithm to make it reflect the user's business preference as possible. After that, based on the parameter optimization and adjustment, the clustering result can be closer to the user's requirement. Finally, we take an example in the breast cancer, to testify our method. The experiments show the better performance of our algorithm.
The Peculiar Radial Distribution of Multiple Populations in the Massive Globular Cluster M80
NASA Astrophysics Data System (ADS)
Dalessandro, E.; Cadelano, M.; Vesperini, E.; Salaris, M.; Ferraro, F. R.; Lanzoni, B.; Raso, S.; Hong, J.; Webb, J. J.; Zocchi, A.
2018-05-01
We present a detailed analysis of the radial distribution of light-element multiple populations (LE-MPs) in the massive and dense globular cluster M80, based on a combination of UV and optical Hubble Space Telescope data. Surprisingly, we find that first-generation (FG) stars (FG) are significantly more centrally concentrated than extreme second-generation (SG) stars out to ∼2.5r h from the cluster center. To understand the origin of such peculiar behavior, we used a set of N-body simulations following the long-term dynamical evolution of LE-MPs. We find that, given the advanced dynamical state of the cluster, the observed difference does not depend on the primordial relative distributions of FG and SG stars. On the contrary, a difference of ∼0.05–0.10 M ⊙ between the average masses of the two subpopulations is needed to account for the observed radial distributions. We argue that such a mass difference might be the result of the higher He abundance of SG stars (of the order of ΔY ∼ 0.05–0.06) with respect to FG stars. Interestingly, we find that a similar He variation is necessary to reproduce the horizontal branch morphology of M80. These results demonstrate that differences in mass among LE-MPs, due to different He content, should be properly taken into account for a correct interpretation of their radial distribution, at least in dynamically evolved systems.
Advances in the understanding of cluster headache.
Leone, Massimo; Proietti Cecchini, Alberto
2017-02-01
Cluster headache is the worst primary headache form; it occurs in paroxysmal excruciatingly severe unilateral head pain attacks usually grouped in cluster periods. The familial occurrence of the disease indicates a genetic component but a gene abnormality is yet to be disclosed. Activation of trigeminal afferents and cranial parasympathetic efferents, the so-called trigemino-parasympathetic reflex, can explain pain and accompanying oculo-facial autonomic phenomena. In particular, pain in cluster headache is attributed, at least in part, to the increased CGRP plasma levels released by activated trigeminal system. Posterior hypothalamus was hypothesized to be the cluster generator activating the trigemino-parasympathetic reflex. Efficacy of monoclonal antibodies against CRGP is under investigation in randomized clinical trials. Areas covered: This paper will focus on main findings contributing to consider cluster headache as a neurovascular disorder with an origin from within the brain. Expert commentary: Accumulated evidence with hypothalamic stimulation in cluster headache patients indicate that posterior hypothalamus terminates rather than triggers the attacks. More extensive studies on the genetics of cluster headache are necessary to disclose anomalies behind the increased familial risk of the disease. Results from ongoing clinical trials in cluster headache sufferers using monoclonal antibodies against CGRP will open soon a new era.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Steenbergen, K. G., E-mail: kgsteen@gmail.com; Gaston, N.
2014-02-14
Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement formore » a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.« less
Steenbergen, K G; Gaston, N
2014-02-14
Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement for a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.
Won, Jong Chul; Im, Yong-Jin; Lee, Ji-Hyun; Kim, Chong Hwa; Kwon, Hyuk Sang; Cha, Bong-Yun; Park, Tae Sun
2017-01-01
Patients with diabetic peripheral neuropathy (DPN) is the most common complication. However, patients are usually suffering from not only diverse sensory deficit but also neuropathy-related discomforts. The aim of this study is to identify distinct groups of patients with DPN with respect to its clinical impacts on symptom patterns and comorbidities. A hierarchical cluster analysis and factor analysis were performed to identify relevant subgroups of patients with DPN ( n = 1338) and symptom patterns. Patients with DPN were divided into three clusters: asymptomatic (cluster 1, n = 448, 33.5%), moderate symptoms with disturbed sleep (cluster 2, n = 562, 42.0%), and severe symptoms with decreased quality of life (cluster 3, n = 328, 24.5%). Patients in cluster 3, compared with clusters 1 and 2, were characterized by higher levels of HbA1c and more severe pain and physical impairments. Patients in cluster 2 had moderate pain levels but disturbed sleep patterns comparable to those in cluster 3. The frequency of symptoms on each item of MNSI by "painful" symptom pattern showed a similar distribution pattern with increasing intensities along the three clusters. Cluster and factor analysis endorsed the use of comprehensive and symptomatic subgrouping to individualize the evaluation of patients with DPN.
A hybrid monkey search algorithm for clustering analysis.
Chen, Xin; Zhou, Yongquan; Luo, Qifang
2014-01-01
Clustering is a popular data analysis and data mining technique. The k-means clustering algorithm is one of the most commonly used methods. However, it highly depends on the initial solution and is easy to fall into local optimum solution. In view of the disadvantages of the k-means method, this paper proposed a hybrid monkey algorithm based on search operator of artificial bee colony algorithm for clustering analysis and experiment on synthetic and real life datasets to show that the algorithm has a good performance than that of the basic monkey algorithm for clustering analysis.
Multimorbidity patterns of and use of health services by Swedish 85-year-olds: an exploratory study
2013-01-01
Background As life expectancy continues to rise, more elderly are reaching advanced ages (≥80 years). The increasing prevalence of multimorbidity places additional demands on health-care resources for the elderly. Previous studies noted the impact of multimorbidity on the use of health services, but the effects of multimorbidity patterns on health-service use have not been well studied, especially for very old people. This study determines patterns of multimorbidity associated with emergency-room visits and hospitalization in an 85-year-old population. Methods Health and living conditions were reported via postal questionnaire by 496 Linköping residents aged 85 years (189 men and 307 women). Diagnoses of morbidity were reviewed in patients’ case reports, and the local health-care register provided information on the use of health services. Hierarchical cluster analysis was applied to evaluate patterns of multimorbidity with gender stratification. Factors associated with emergency-room visits and hospitalization were analyzed using logistic regression models. Results Cluster analyses revealed five clusters: vascular, cardiopulmonary, cardiac (only for men), somatic–mental (only for men), mental disease (only for women), and three other clusters related to aging (one for men and two for women). Heart failure in men (OR = 2.4, 95% CI = 1–5.7) and women (OR = 3, 95% CI = 1.3–6.9) as a single morbidity explained more variance than morbidity clusters in models of emergency-room visits. Men's cardiac cluster (OR = 1.6; 95% CI = 1–2.7) and women's cardiopulmonary cluster (OR = 1.7, 95% CI = 1.2–2.4) were significantly associated with hospitalization. The combination of the cardiopulmonary cluster with the men’s cardiac cluster (OR = 1.6, 95% CI = 1–2.4) and one of the women’s aging clusters (OR = 0.5, 95% CI = 0.3–0.8) showed interaction effects on hospitalization. Conclusion In this 85-year-old population, patterns of cardiac and pulmonary conditions were better than a single morbidity in explaining hospitalization. Heart failure was superior to multimorbidity patterns in explaining emergency-room visits. A holistic approach to examining the patterns of multimorbidity and their relationships with the use of health services will contribute to both local health care policy and geriatric practice. PMID:24195643
Malkassian, Anthony; Nerini, David; van Dijk, Mark A; Thyssen, Melilotus; Mante, Claude; Gregori, Gerald
2011-04-01
Analytical flow cytometry (FCM) is well suited for the analysis of phytoplankton communities in fresh and sea waters. The measurement of light scatter and autofluorescence properties of particles by FCM provides optical fingerprints, which enables different phytoplankton groups to be separated. A submersible version of the CytoSense flow cytometer (the CytoSub) has been designed for in situ autonomous sampling and analysis, making it possible to monitor phytoplankton at a short temporal scale and obtain accurate information about its dynamics. For data analysis, a manual clustering is usually performed a posteriori: data are displayed on histograms and scatterplots, and group discrimination is made by drawing and combining regions (gating). The purpose of this study is to provide greater objectivity in the data analysis by applying a nonmanual and consistent method to automatically discriminate clusters of particles. In other words, we seek for partitioning methods based on the optical fingerprints of each particle. As the CytoSense is able to record the full pulse shape for each variable, it quickly generates a large and complex dataset to analyze. The shape, length, and area of each curve were chosen as descriptors for the analysis. To test the developed method, numerical experiments were performed on simulated curves. Then, the method was applied and validated on phytoplankton cultures data. Promising results have been obtained with a mixture of various species whose optical fingerprints overlapped considerably and could not be accurately separated using manual gating. Copyright © 2011 International Society for Advancement of Cytometry.
Solid state and aqueous behavior of uranyl peroxide cage clusters
NASA Astrophysics Data System (ADS)
Pellegrini, Kristi Lynn
Uranyl peroxide cage clusters include a large family of more than 50 published clusters of a variety of sizes, which can incorporate various ligands including pyrophosphate and oxalate. Previous studies have reported that uranyl clusters can be used as a method to separate uranium from a solid matrix, with potential applications in reprocessing of irradiated nuclear fuel. Because of the potential applications of these novel structures in an advanced nuclear fuel cycle and their likely presence in areas of contamination, it is important to understand their behavior in both solid state and aqueous systems, including complex environments where other ions are present. In this thesis, I examine the aqueous behavior of U24Pp 12, as well as aqueous cluster systems with added mono-, di-, and trivalent cations. The resulting solutions were analyzed using dynamic light scattering and ultra-small angle X-ray scattering to evaluate the species in solution. Precipitates of these systems were analyzed using powder X-ray diffraction, X-ray fluorescence spectrometry, and Raman spectroscopy. The results of these analyses demonstrate the importance of cation size, charge, and concentration of added cations on the aqueous behavior of uranium macroions. Specifically, aggregates of various sizes and shapes form rapidly upon addition of cations, and in some cases these aggregates appear to precipitate into an X-ray amorphous material that still contains U24Pp12 clusters. In addition, I probe aggregation of U24Pp12 and U60, another uranyl peroxide cage cluster, in mixed solvent water-alcohol systems. The aggregation of uranyl clusters in water-alcohol systems is a result of hydrogen bonding with polar organic molecules and the reduction of the dielectric constant of the system. Studies of aggregation of uranyl clusters also allow for comparison between the newer uranyl polyoxometalate family and century-old transition metal polyoxometalates. To complement the solution studies of uranyl cage clusters, solid state analyses of U24Pp12 are presented, including single crystal X-ray diffraction and preliminary single crystal neutron diffraction. Solid state analyses are used to probe the complicated bonding environments between U24Pp12 and crystallized counterions, giving further insight into the importance of cluster protonation and counterions in uranyl cluster systems. The combination of solid state and solution techniques provides information about the complicated nature of uranyl peroxide nanoclusters, and insight towards future applications of clusters in the advanced nuclear fuel cycle and the environment.
Esplin, M Sean; Manuck, Tracy A.; Varner, Michael W.; Christensen, Bryce; Biggio, Joseph; Bukowski, Radek; Parry, Samuel; Zhang, Heping; Huang, Hao; Andrews, William; Saade, George; Sadovsky, Yoel; Reddy, Uma M.; Ilekis, John
2015-01-01
Objective We sought to employ an innovative tool based on common biological pathways to identify specific phenotypes among women with spontaneous preterm birth (SPTB), in order to enhance investigators' ability to identify to highlight common mechanisms and underlying genetic factors responsible for SPTB. Study Design A secondary analysis of a prospective case-control multicenter study of SPTB. All cases delivered a preterm singleton at SPTB ≤34.0 weeks gestation. Each woman was assessed for the presence of underlying SPTB etiologies. A hierarchical cluster analysis was used to identify groups of women with homogeneous phenotypic profiles. One of the phenotypic clusters was selected for candidate gene association analysis using VEGAS software. Results 1028 women with SPTB were assigned phenotypes. Hierarchical clustering of the phenotypes revealed five major clusters. Cluster 1 (N=445) was characterized by maternal stress, cluster 2 (N=294) by premature membrane rupture, cluster 3 (N=120) by familial factors, and cluster 4 (N=63) by maternal comorbidities. Cluster 5 (N=106) was multifactorial, characterized by infection (INF), decidual hemorrhage (DH) and placental dysfunction (PD). These three phenotypes were highly correlated by Chi-square analysis [PD and DH (p<2.2e-6); PD and INF (p=6.2e-10); INF and DH (p=0.0036)]. Gene-based testing identified the INS (insulin) gene as significantly associated with cluster 3 of SPTB. Conclusion We identified 5 major clusters of SPTB based on a phenotype tool and hierarchal clustering. There was significant correlation between several of the phenotypes. The INS gene was associated with familial factors underlying SPTB. PMID:26070700
Method for exploratory cluster analysis and visualisation of single-trial ERP ensembles.
Williams, N J; Nasuto, S J; Saddy, J D
2015-07-30
The validity of ensemble averaging on event-related potential (ERP) data has been questioned, due to its assumption that the ERP is identical across trials. Thus, there is a need for preliminary testing for cluster structure in the data. We propose a complete pipeline for the cluster analysis of ERP data. To increase the signal-to-noise (SNR) ratio of the raw single-trials, we used a denoising method based on Empirical Mode Decomposition (EMD). Next, we used a bootstrap-based method to determine the number of clusters, through a measure called the Stability Index (SI). We then used a clustering algorithm based on a Genetic Algorithm (GA) to define initial cluster centroids for subsequent k-means clustering. Finally, we visualised the clustering results through a scheme based on Principal Component Analysis (PCA). After validating the pipeline on simulated data, we tested it on data from two experiments - a P300 speller paradigm on a single subject and a language processing study on 25 subjects. Results revealed evidence for the existence of 6 clusters in one experimental condition from the language processing study. Further, a two-way chi-square test revealed an influence of subject on cluster membership. Our analysis operates on denoised single-trials, the number of clusters are determined in a principled manner and the results are presented through an intuitive visualisation. Given the cluster structure in some experimental conditions, we suggest application of cluster analysis as a preliminary step before ensemble averaging. Copyright © 2015 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Petkov, Valeri; Prasai, Binay; Shastri, Sarvjit
2017-09-12
Practical applications require the production and usage of metallic nanocrystals (NCs) in large ensembles. Besides, due to their cluster-bulk solid duality, metallic NCs exhibit a large degree of structural diversity. This poses the question as to what atomic-scale basis is to be used when the structure–function relationship for metallic NCs is to be quantified precisely. In this paper, we address the question by studying bi-functional Fe core-Pt skin type NCs optimized for practical applications. In particular, the cluster-like Fe core and skin-like Pt surface of the NCs exhibit superparamagnetic properties and a superb catalytic activity for the oxygen reduction reaction,more » respectively. We determine the atomic-scale structure of the NCs by non-traditional resonant high-energy X-ray diffraction coupled to atomic pair distribution function analysis. Using the experimental structure data we explain the observed magnetic and catalytic behavior of the NCs in a quantitative manner. Lastly, we demonstrate that NC ensemble-averaged 3D positions of atoms obtained by advanced X-ray scattering techniques are a very proper basis for not only establishing but also quantifying the structure–function relationship for the increasingly complex metallic NCs explored for practical applications.« less
NASA Astrophysics Data System (ADS)
Abe, M.; Prasannaa, V. S.; Das, B. P.
2018-03-01
Heavy polar diatomic molecules are currently among the most promising probes of fundamental physics. Constraining the electric dipole moment of the electron (e EDM ), in order to explore physics beyond the standard model, requires a synergy of molecular experiment and theory. Recent advances in experiment in this field have motivated us to implement a finite-field coupled-cluster (FFCC) approach. This work has distinct advantages over the theoretical methods that we had used earlier in the analysis of e EDM searches. We used relativistic FFCC to calculate molecular properties of interest to e EDM experiments, that is, the effective electric field (Eeff) and the permanent electric dipole moment (PDM). We theoretically determine these quantities for the alkaline-earth monofluorides (AEMs), the mercury monohalides (Hg X ), and PbF. The latter two systems, as well as BaF from the AEMs, are of interest to e EDM searches. We also report the calculation of the properties using a relativistic finite-field coupled-cluster approach with single, double, and partial triples' excitations, which is considered to be the gold standard of electronic structure calculations. We also present a detailed error estimate, including errors that stem from our choice of basis sets, and higher-order correlation effects.
VariantSpark: population scale clustering of genotype information.
O'Brien, Aidan R; Saunders, Neil F W; Guo, Yi; Buske, Fabian A; Scott, Rodney J; Bauer, Denis C
2015-12-10
Genomic information is increasingly used in medical practice giving rise to the need for efficient analysis methodology able to cope with thousands of individuals and millions of variants. The widely used Hadoop MapReduce architecture and associated machine learning library, Mahout, provide the means for tackling computationally challenging tasks. However, many genomic analyses do not fit the Map-Reduce paradigm. We therefore utilise the recently developed SPARK engine, along with its associated machine learning library, MLlib, which offers more flexibility in the parallelisation of population-scale bioinformatics tasks. The resulting tool, VARIANTSPARK provides an interface from MLlib to the standard variant format (VCF), offers seamless genome-wide sampling of variants and provides a pipeline for visualising results. To demonstrate the capabilities of VARIANTSPARK, we clustered more than 3,000 individuals with 80 Million variants each to determine the population structure in the dataset. VARIANTSPARK is 80 % faster than the SPARK-based genome clustering approach, ADAM, the comparable implementation using Hadoop/Mahout, as well as ADMIXTURE, a commonly used tool for determining individual ancestries. It is over 90 % faster than traditional implementations using R and Python. The benefits of speed, resource consumption and scalability enables VARIANTSPARK to open up the usage of advanced, efficient machine learning algorithms to genomic data.
Neumann, Sindy; Hartmann, Holger; Martin-Galiano, Antonio J; Fuchs, Angelika; Frishman, Dmitrij
2012-03-01
Structural bioinformatics of membrane proteins is still in its infancy, and the picture of their fold space is only beginning to emerge. Because only a handful of three-dimensional structures are available, sequence comparison and structure prediction remain the main tools for investigating sequence-structure relationships in membrane protein families. Here we present a comprehensive analysis of the structural families corresponding to α-helical membrane proteins with at least three transmembrane helices. The new version of our CAMPS database (CAMPS 2.0) covers nearly 1300 eukaryotic, prokaryotic, and viral genomes. Using an advanced classification procedure, which is based on high-order hidden Markov models and considers both sequence similarity as well as the number of transmembrane helices and loop lengths, we identified 1353 structurally homogeneous clusters roughly corresponding to membrane protein folds. Only 53 clusters are associated with experimentally determined three-dimensional structures, and for these clusters CAMPS is in reasonable agreement with structure-based classification approaches such as SCOP and CATH. We therefore estimate that ∼1300 structures would need to be determined to provide a sufficient structural coverage of polytopic membrane proteins. CAMPS 2.0 is available at http://webclu.bio.wzw.tum.de/CAMPS2.0/. Copyright © 2011 Wiley Periodicals, Inc.
Aarabi, A; Grebe, R; Berquin, P; Bourel Ponchel, E; Jalin, C; Fohlen, M; Bulteau, C; Delalande, O; Gondry, C; Héberlé, C; Moullart, V; Wallois, F
2012-06-01
This case study aims to demonstrate that spatiotemporal spike discrimination and source analysis are effective to monitor the development of sources of epileptic activity in time and space. Therefore, they can provide clinically useful information allowing a better understanding of the pathophysiology of individual seizures with time- and space-resolved characteristics of successive epileptic states, including interictal, preictal, postictal, and ictal states. High spatial resolution scalp EEGs (HR-EEG) were acquired from a 2-year-old girl with refractory central epilepsy and single-focus seizures as confirmed by intracerebral EEG recordings and ictal single-photon emission computed tomography (SPECT). Evaluation of HR-EEG consists of the following three global steps: (1) creation of the initial head model, (2) automatic spike and seizure detection, and finally (3) source localization. During the source localization phase, epileptic states are determined to allow state-based spike detection and localization of underlying sources for each spike. In a final cluster analysis, localization results are integrated to determine the possible sources of epileptic activity. The results were compared with the cerebral locations identified by intracerebral EEG recordings and SPECT. The results obtained with this approach were concordant with those of MRI, SPECT and distribution of intracerebral potentials. Dipole cluster centres found for spikes in interictal, preictal, ictal and postictal states were situated an average of 6.3mm from the intracerebral contacts with the highest voltage. Both amplitude and shape of spikes change between states. Dispersion of the dipoles was higher in the preictal state than in the postictal state. Two clusters of spikes were identified. The centres of these clusters changed position periodically during the various epileptic states. High-resolution surface EEG evaluated by an advanced algorithmic approach can be used to investigate the spatiotemporal characteristics of sources located in the epileptic focus. The results were validated by standard methods, ensuring good spatial resolution by MRI and SPECT and optimal temporal resolution by intracerebral EEG. Surface EEG can be used to identify different spike clusters and sources of the successive epileptic states. The method that was used in this study will provide physicians with a better understanding of the pathophysiological characteristics of epileptic activities. In particular, this method may be useful for more effective positioning of implantable intracerebral electrodes. Copyright © 2011 Elsevier Masson SAS. All rights reserved.
Aikawa, Ken; Kataoka, Masao; Ogawa, Soichiro; Akaihata, Hidenori; Sato, Yuichi; Yabe, Michihiro; Hata, Junya; Koguchi, Tomoyuki; Kojima, Yoshiyuki; Shiragasawa, Chihaya; Kobayashi, Toshimitsu; Yamaguchi, Osamu
2015-08-01
To present a new grouping of male patients with lower urinary tract symptoms (LUTS) based on symptom patterns and clarify whether the therapeutic effect of α1-blocker differs among the groups. We performed secondary analysis of anonymous data from 4815 patients enrolled in a postmarketing surveillance study of tamsulosin in Japan. Data on 7 International Prostate Symptom Score (IPSS) items at the initial visit were used in the cluster analysis. IPSS and quality of life (QOL) scores before and after tamsulosin treatment for 12 weeks were assessed in each cluster. Partial correlation coefficients were also obtained for IPSS and QOL scores based on changes before and after treatment. Five symptom groups were identified by cluster analysis of IPSS. On their symptom profile, each cluster was labeled as minimal type (cluster 1), multiple severe type (cluster 2), weak stream type (cluster 3), storage type (cluster 4), and voiding type (cluster 5). Prevalence and the mean symptom score were significantly improved in almost all symptoms in all clusters by tamsulosin treatment. Nocturia and weak stream had the strongest effect on QOL in clusters 1, 2, and 4 and clusters 3 and 5, respectively. The study clarified that 5 characteristic symptom patterns exist by cluster analysis of IPSS in male patients with LUTS. Tamsulosin improved various symptoms and QOL in each symptom group. The study reports many male patients with LUTS being satisfied with monotherapy using tamsulosin and suggests the usefulness of α1-blockers as a drug of first choice. Copyright © 2015 Elsevier Inc. All rights reserved.
Panda, Sasmita; Jena, Smrutiti; Sharma, Savitri; Dhawan, Benu; Nath, Gopal; Singh, Durg Vijai
2016-01-01
The aim of this study was to determine sequence types of 34 S. haemolyticus strains isolated from a variety of infections between 2013 and 2016 in India by MLST. The MEGA5.2 software was used to align and compare the nucleotide sequences. The advanced cluster analysis was performed to define the clonal complexes. MLST analysis showed 24 new sequence types (ST) among S. haemolyticus isolates, irrespective of sources and place of isolation. The finding of this study allowed to set up an MLST database on the PubMLST.org website using BIGSdb software and made available at http://pubmlst.org/shaemolyticus/. The data of this study thus suggest that MLST can be used to study population structure and diversity among S. haemolyticus isolates.
Panda, Sasmita; Jena, Smrutiti; Sharma, Savitri; Dhawan, Benu; Nath, Gopal
2016-01-01
The aim of this study was to determine sequence types of 34 S. haemolyticus strains isolated from a variety of infections between 2013 and 2016 in India by MLST. The MEGA5.2 software was used to align and compare the nucleotide sequences. The advanced cluster analysis was performed to define the clonal complexes. MLST analysis showed 24 new sequence types (ST) among S. haemolyticus isolates, irrespective of sources and place of isolation. The finding of this study allowed to set up an MLST database on the PubMLST.org website using BIGSdb software and made available at http://pubmlst.org/shaemolyticus/. The data of this study thus suggest that MLST can be used to study population structure and diversity among S. haemolyticus isolates. PMID:27824930
User identified positive outcome expectancies of electronic cigarette use: A concept mapping study.
Soule, Eric K; Maloney, Sarah F; Guy, Mignonne C; Eissenberg, Thomas; Fagan, Pebbles
2017-05-01
Electronic cigarette (ECIG) use is growing in popularity, but little is known about the perceived positive outcomes of ECIG use. This study used concept mapping (CM) to examine positive ECIG outcome expectancies. Sixty-three past 30-day ECIG users (38.1% female) between the ages of 18 and 64 (M = 37.8, SD = 13.3) completed a CM module. In an online program, participants provided statements that completed a prompt: "A specific positive, enjoyable, or exciting effect (i.e., physical or psychological) that I have experienced WHILE USING or IMMEDIATELY AFTER USING an electronic cigarette/electronic vaping device is. . . ." Participants (n = 35) sorted 123 statements into "piles" of similar content and rated (n = 43) each statement on a 7-point scale (1 = Definitely NOT a positive effect to 7 = Definitely a positive effect). A cluster map was created using data from the sorting task, and analysis indicated a 7 cluster model of positive ECIG use outcome expectancies: Therapeutic/Affect Regulation, High/Euphoria, Sensation Enjoyment, Perceived Health Effects, Benefits of Decreased Cigarette Use, Convenience, and Social Impacts. The Perceived Health Effects cluster was rated highest, although all mean ratings were greater than 4.69. Mean cluster ratings were compared, and females, younger adults, past 30-day cigarette smokers, users of more "advanced" ECIG devices, and nonlifetime (less than 100 lifetime cigarettes) participants rated certain clusters higher than comparison groups (ps < 0.05). ECIG users associate positive outcomes with ECIG use. ECIG outcome expectancies may affect product appeal and tobacco use behaviors and should be examined further to inform regulatory policies. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Multiscale visual quality assessment for cluster analysis with self-organizing maps
NASA Astrophysics Data System (ADS)
Bernard, Jürgen; von Landesberger, Tatiana; Bremm, Sebastian; Schreck, Tobias
2011-01-01
Cluster analysis is an important data mining technique for analyzing large amounts of data, reducing many objects to a limited number of clusters. Cluster visualization techniques aim at supporting the user in better understanding the characteristics and relationships among the found clusters. While promising approaches to visual cluster analysis already exist, these usually fall short of incorporating the quality of the obtained clustering results. However, due to the nature of the clustering process, quality plays an important aspect, as for most practical data sets, typically many different clusterings are possible. Being aware of clustering quality is important to judge the expressiveness of a given cluster visualization, or to adjust the clustering process with refined parameters, among others. In this work, we present an encompassing suite of visual tools for quality assessment of an important visual cluster algorithm, namely, the Self-Organizing Map (SOM) technique. We define, measure, and visualize the notion of SOM cluster quality along a hierarchy of cluster abstractions. The quality abstractions range from simple scalar-valued quality scores up to the structural comparison of a given SOM clustering with output of additional supportive clustering methods. The suite of methods allows the user to assess the SOM quality on the appropriate abstraction level, and arrive at improved clustering results. We implement our tools in an integrated system, apply it on experimental data sets, and show its applicability.
NASA Astrophysics Data System (ADS)
Bhattacharyya, Kaustuve; Ke, Chih-Ming; Huang, Guo-Tsai; Chen, Kai-Hsiung; Smilde, Henk-Jan H.; Fuchs, Andreas; Jak, Martin; van Schijndel, Mark; Bozkurt, Murat; van der Schaar, Maurits; Meyer, Steffen; Un, Miranda; Morgan, Stephen; Wu, Jon; Tsai, Vincent; Liang, Frida; den Boef, Arie; ten Berge, Peter; Kubis, Michael; Wang, Cathy; Fouquet, Christophe; Terng, L. G.; Hwang, David; Cheng, Kevin; Gau, TS; Ku, Y. C.
2013-04-01
Aggressive on-product overlay requirements in advanced nodes are setting a superior challenge for the semiconductor industry. This forces the industry to look beyond the traditional way-of-working and invest in several new technologies. Integrated metrology2, in-chip overlay control, advanced sampling and process correction-mechanism (using the highest order of correction possible with scanner interface today), are a few of such technologies considered in this publication.
Hubble Admires a Youthful Globular Star Cluster
2017-12-08
Hubble sees an unusal global cluster that is enriching the interstellar medium with metals Globular clusters offer some of the most spectacular sights in the night sky. These ornate spheres contain hundreds of thousands of stars, and reside in the outskirts of galaxies. The Milky Way contains over 150 such clusters — and the one shown in this NASA/ESA Hubble Space Telescope image, named NGC 362, is one of the more unusual ones. As stars make their way through life they fuse elements together in their cores, creating heavier and heavier elements — known in astronomy as metals — in the process. When these stars die, they flood their surroundings with the material they have formed during their lifetimes, enriching the interstellar medium with metals. Stars that form later therefore contain higher proportions of metals than their older relatives. By studying the different elements present within individual stars in NGC 362, astronomers discovered that the cluster boasts a surprisingly high metal content, indicating that it is younger than expected. Although most globular clusters are much older than the majority of stars in their host galaxy, NGC 362 bucks the trend, with an age lying between 10 and 11 billion years old. For reference, the age of the Milky Way is estimated to be above 13 billion years. This image, in which you can view NGC 362’s individual stars, was taken by Hubble’s Advanced Camera for Surveys (ACS). Credit: ESA/Hubble& NASA NASA image use policy. NASA Goddard Space Flight Center enables NASA’s mission through four scientific endeavors: Earth Science, Heliophysics, Solar System Exploration, and Astrophysics. Goddard plays a leading role in NASA’s accomplishments by contributing compelling scientific knowledge to advance the Agency’s mission. Follow us on Twitter Like us on Facebook Find us on Instagram
Double blue straggler sequences in globular clusters: The case of NGC 362
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dalessandro, E.; Ferraro, F. R.; Massari, D.
2013-12-01
We used high-quality images acquired with the Wide Field Camera 3 on board the Hubble Space Telescope to probe the blue straggler star (BSS) population of the galactic globular cluster NGC 362. We have found two distinct sequences of BSSs: this is the second case, after M30, where such a feature has been observed. Indeed, the BSS location, their extension in magnitude and color, and their radial distribution within the cluster nicely resemble those observed in M30, thus suggesting that the same interpretative scenario can be applied: the red BSS sub-population is generated by mass-transfer binaries, the blue one bymore » collisions. The discovery of four new W UMa stars, three of which lie along the red BSS sequence, further supports this scenario. We also found that the inner portion of the density profile deviates from a King model and is well reproduced by either a mild power law (α ∼ –0.2) or a double King profile. This feature supports the hypothesis that the cluster is currently undergoing the core-collapse phase. Moreover, the BSS radial distribution shows a central peak and monotonically decreases outward without any evidence of an external rising branch. This evidence is a further indication of the advanced dynamical age of NGC 362; in fact, together with M30, NGC 362 belongs to the family of dynamically old clusters (Family III) in the 'dynamical clock' classification proposed by Ferraro et al. The observational evidence presented here strengthens the possible connection between the existence of a double BSS sequence and a quite advanced dynamical status of the parent cluster.« less
Redman, Regina S.; Ranson, Judith; Rodriguez, Rusty J.
2006-01-01
Cantharellus formosus growing on the Olympic Peninsula of the Pacific Northwest was sampled from September – November 1995 for genetic analysis. A total of ninety-six basidiomes from five clusters separated from one another by 3 - 25 meters were genetically characterized by PCR analysis of 13 arbitrary loci and rDNA sequences. The number of basidiomes in each cluster varied from 15 to 25 and genetic analysis delineated 15 genets among the clusters. Analysis of variance utilizing thirteen apPCR generated genetic molecular markers and PCR amplification of the ribosomal ITS regions indicated that 81.41% of the genetic variation occurred between clusters and 18.59% within clusters. Proximity of the basidiomes within a cluster was not an indicator of genotypic similarity. The molecular profiles of each cluster were distinct and defined as unique populations containing 2 - 6 genets. The monitoring and analysis of this species through non-lethal sampling and future applications is discussed.
Taylor, Angela J; Lappi, Victoria; Wolfgang, William J; Lapierre, Pascal; Palumbo, Michael J; Medus, Carlota; Boxrud, David
2015-10-01
Salmonella enterica serovar Enteritidis is a significant cause of gastrointestinal illness in the United States; however, current molecular subtyping methods lack resolution for this highly clonal serovar. Advances in next-generation sequencing technologies have made it possible to examine whole-genome sequencing (WGS) as a potential molecular subtyping tool for outbreak detection and source trace back. Here, we conducted a retrospective analysis of S. Enteritidis isolates from seven epidemiologically confirmed foodborne outbreaks and sporadic isolates (not epidemiologically linked) to determine the utility of WGS to identify outbreaks. A collection of 55 epidemiologically characterized clinical and environmental S. Enteritidis isolates were sequenced. Single nucleotide polymorphism (SNP)-based cluster analysis of the S. Enteritidis genomes revealed well supported clades, with less than four-SNP pairwise diversity, that were concordant with epidemiologically defined outbreaks. Sporadic isolates were an average of 42.5 SNPs distant from the outbreak clusters. Isolates collected from the same patient over several weeks differed by only two SNPs. Our findings show that WGS provided greater resolution between outbreak, sporadic, and suspect isolates than the current gold standard subtyping method, pulsed-field gel electrophoresis (PFGE). Furthermore, results could be obtained in a time frame suitable for surveillance activities, supporting the use of WGS as an outbreak detection and characterization method for S. Enteritidis. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Li, Hai-juan; Zhao, Xin; Jia, Qing-fei; Li, Tian-lai; Ning, Wei
2012-08-01
The achenes morphological and micro-morphological characteristics of six species of genus Taraxacum from northeastern China as well as SRAP cluster analysis were observed for their classification evidences. The achenes were observed by microscope and EPMA. Cluster analysis was given on the basis of the size, shape, cone proportion, color and surface sculpture of achenes. The Taraxacum inter-species achene shape characteristic difference is obvious, particularly spinulose distribution and size, achene color and achene size; with the Taraxacum plant achene shape the cluster method T. antungense Kitag. and the T. urbanum Kitag. should combine for the identical kind; the achene morphology cluster analysis and the SRAP tagged molecule systematics's cluster result retrieves in the table with "the Chinese flora". The class group to divide the result is consistent. Taraxacum plant achene shape characteristic stable conservative, may carry on the inter-species division and the sibship analysis according to the achene shape characteristic combination difference; the achene morphology cluster analysis as well as the SRAP tagged molecule systematics confirmation support dandelion classification result of "the Chinese flora".
Exploratory Item Classification Via Spectral Graph Clustering
Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen; Xu, Gongjun; Ying, Zhiliang
2017-01-01
Large-scale assessments are supported by a large item pool. An important task in test development is to assign items into scales that measure different characteristics of individuals, and a popular approach is cluster analysis of items. Classical methods in cluster analysis, such as the hierarchical clustering, K-means method, and latent-class analysis, often induce a high computational overhead and have difficulty handling missing data, especially in the presence of high-dimensional responses. In this article, the authors propose a spectral clustering algorithm for exploratory item cluster analysis. The method is computationally efficient, effective for data with missing or incomplete responses, easy to implement, and often outperforms traditional clustering algorithms in the context of high dimensionality. The spectral clustering algorithm is based on graph theory, a branch of mathematics that studies the properties of graphs. The algorithm first constructs a graph of items, characterizing the similarity structure among items. It then extracts item clusters based on the graphical structure, grouping similar items together. The proposed method is evaluated through simulations and an application to the revised Eysenck Personality Questionnaire. PMID:29033476
Cancer Cluster Investigations: Review of the Past and Proposals for the Future
Goodman, Michael; LaKind, Judy S.; Fagliano, Jerald A.; Lash, Timothy L.; Wiemels, Joseph L.; Winn, Deborah M.; Patel, Chirag; Van Eenwyk, Juliet; Kohler, Betsy A.; Schisterman, Enrique F.; Albert, Paul; Mattison, Donald R.
2014-01-01
Residential clusters of non-communicable diseases are a source of enduring public concern, and at times, controversy. Many clusters reported to public health agencies by concerned citizens are accompanied by expectations that investigations will uncover a cause of disease. While goals, methods and conclusions of cluster studies are debated in the scientific literature and popular press, investigations of reported residential clusters rarely provide definitive answers about disease etiology. Further, it is inherently difficult to study a cluster for diseases with complex etiology and long latency (e.g., most cancers). Regardless, cluster investigations remain an important function of local, state and federal public health agencies. Challenges limiting the ability of cluster investigations to uncover causes for disease include the need to consider long latency, low statistical power of most analyses, uncertain definitions of cluster boundaries and population of interest, and in- and out-migration. A multi-disciplinary Workshop was held to discuss innovative and/or under-explored approaches to investigate cancer clusters. Several potentially fruitful paths forward are described, including modern methods of reconstructing residential history, improved approaches to analyzing spatial data, improved utilization of electronic data sources, advances using biomarkers of carcinogenesis, novel concepts for grouping cases, investigations of infectious etiology of cancer, and “omics” approaches. PMID:24477211
Exploring spatial evolution of economic clusters: A case study of Beijing
NASA Astrophysics Data System (ADS)
Yang, Zhenshan; Sliuzas, Richard; Cai, Jianming; Ottens, Henk F. L.
2012-10-01
An identification of economic clusters and analysing their changing spatial patterns is important for understanding urban economic space dynamics. Previous studies, however, suffer from limitations as a consequence of using fixed geographically areas and not combining functional and spatial dynamics. The paper presents an approach, based on local spatial statistics and the case of Beijing to understand the spatial clustering of industries that are functionally interconnected by common or complementary patterns of demand or supply relations. Using register data of business establishments, it identifies economic clusters and analyses their pattern based on postcodes at different time slices during the period 1983-2002. The study shows how the advanced services occupy the urban centre and key sub centres. The Information and Communication Technology (ICT) cluster is mainly concentrated in the north part of the city and circles the urban centre, and the main manufacturing clusters are evolved in the key sub centers. This type of outcomes improves understanding of urban-economic dynamics, which can support spatial and economic planning.
Down-to-earth studies of carbon clusters
NASA Technical Reports Server (NTRS)
Smalley, R. E.
1990-01-01
Recent advances in supersonic beam experiments with laser-vaporization sources of clusters have provided some interesting new insights into the nature of the small clusters of carbon, and the processes through which carbon condenses. One cluster in particular, C(sub 60), appears to play a central role. It is argued that this cluster takes the shape of a soccerball: a hollow sphere composed of a shell of 60 carbon atoms connected by a lattice of hexagonal and pentagonal rings, in a pattern of overall icosahedral symmetry. Although C(sub 60) appears to be uniquely stable due to its perfect symmetry, all other even-numbered carbon clusters in the 32 to 100+ atom size range seem to favor similar closed spheroidal forms. These species are interpreted as relatively unreactive side products in condensation reactions of carbon vapor involving spiraling graphitic sheets. The prevalence of C(sub 60) in laser-vaporized carbon vapors and sooting flames suggests that it may be formed readily whenever carbon condenses. Such ready formation and extraordinary stability may have substantial astrophysical implications.
Down-to-earth studies of carbon clusters
NASA Astrophysics Data System (ADS)
Smalley, R. E.
1990-04-01
Recent advances in supersonic beam experiments with laser-vaporization sources of clusters have provided some interesting new insights into the nature of the small clusters of carbon, and the processes through which carbon condenses. One cluster in particular, C60, appears to play a central role. It is argued that this cluster takes the shape of a soccer ball: a hollow sphere composed of a shell of 60 carbon atoms connected by a lattice of hexagonal and pentagonal rings, in a pattern of overall icosahedral symmetry. Although C60 appears to be uniquely stable due to its perfect symmetry, all other even-numbered carbon clusters in the 32 to 100+ atom size range seem to favor similar closed spheroidal forms. These species are interpreted as relatively unreactive side products in condensation reactions of carbon vapor involving spiraling graphitic sheets. The prevalence of C60 in laser-vaporized carbon vapors and sooting flames suggests that it may be formed readily whenever carbon condenses. Such ready formation and extraordinary stability may have substantial astrophysical implications.
ERIC Educational Resources Information Center
Hegedus, Stephen J.; Tapper, John; Dalton, Sara
2016-01-01
In this study, we examine the relationship between contextual variables related to teachers and student performance in Advanced Algebra classrooms in the USA. The data were gathered from a cluster-randomized study on the effects of SimCalc MathWorlds®, a curricular and technological intervention as a replacement for Algebra 2 curriculum, on…
NASA Technical Reports Server (NTRS)
Fomenkova, M. N.
1997-01-01
The computer-intensive project consisted of the analysis and synthesis of existing data on composition of comet Halley dust particles. The main objective was to obtain a complete inventory of sulfur containing compounds in the comet Halley dust by building upon the existing classification of organic and inorganic compounds and applying a variety of statistical techniques for cluster and cross-correlational analyses. A student hired for this project wrote and tested the software to perform cluster analysis. The following tasks were carried out: (1) selecting the data from existing database for the proposed project; (2) finding access to a standard library of statistical routines for cluster analysis; (3) reformatting the data as necessary for input into the library routines; (4) performing cluster analysis and constructing hierarchical cluster trees using three methods to define the proximity of clusters; (5) presenting the output results in different formats to facilitate the interpretation of the obtained cluster trees; (6) selecting groups of data points common for all three trees as stable clusters. We have also considered the chemistry of sulfur in inorganic compounds.
Performance analysis of clustering techniques over microarray data: A case study
NASA Astrophysics Data System (ADS)
Dash, Rasmita; Misra, Bijan Bihari
2018-03-01
Handling big data is one of the major issues in the field of statistical data analysis. In such investigation cluster analysis plays a vital role to deal with the large scale data. There are many clustering techniques with different cluster analysis approach. But which approach suits a particular dataset is difficult to predict. To deal with this problem a grading approach is introduced over many clustering techniques to identify a stable technique. But the grading approach depends on the characteristic of dataset as well as on the validity indices. So a two stage grading approach is implemented. In this study the grading approach is implemented over five clustering techniques like hybrid swarm based clustering (HSC), k-means, partitioning around medoids (PAM), vector quantization (VQ) and agglomerative nesting (AGNES). The experimentation is conducted over five microarray datasets with seven validity indices. The finding of grading approach that a cluster technique is significant is also established by Nemenyi post-hoc hypothetical test.
Spatial Competition: Roughening of an Experimental Interface.
Allstadt, Andrew J; Newman, Jonathan A; Walter, Jonathan A; Korniss, G; Caraco, Thomas
2016-07-28
Limited dispersal distance generates spatial aggregation. Intraspecific interactions are then concentrated within clusters, and between-species interactions occur near cluster boundaries. Spread of a locally dispersing invader can become motion of an interface between the invading and resident species, and spatial competition will produce variation in the extent of invasive advance along the interface. Kinetic roughening theory offers a framework for quantifying the development of these fluctuations, which may structure the interface as a self-affine fractal, and so induce a series of temporal and spatial scaling relationships. For most clonal plants, advance should become spatially correlated along the interface, and width of the interface (where invader and resident compete directly) should increase as a power function of time. Once roughening equilibrates, interface width and the relative location of the most advanced invader should each scale with interface length. We tested these predictions by letting white clover (Trifolium repens) invade ryegrass (Lolium perenne). The spatial correlation of clover growth developed as anticipated by kinetic roughening theory, and both interface width and the most advanced invader's lead scaled with front length. However, the scaling exponents differed from those predicted by recent simulation studies, likely due to clover's growth morphology.
Spatial Competition: Roughening of an Experimental Interface
Allstadt, Andrew J.; Newman, Jonathan A.; Walter, Jonathan A.; Korniss, G.; Caraco, Thomas
2016-01-01
Limited dispersal distance generates spatial aggregation. Intraspecific interactions are then concentrated within clusters, and between-species interactions occur near cluster boundaries. Spread of a locally dispersing invader can become motion of an interface between the invading and resident species, and spatial competition will produce variation in the extent of invasive advance along the interface. Kinetic roughening theory offers a framework for quantifying the development of these fluctuations, which may structure the interface as a self-affine fractal, and so induce a series of temporal and spatial scaling relationships. For most clonal plants, advance should become spatially correlated along the interface, and width of the interface (where invader and resident compete directly) should increase as a power function of time. Once roughening equilibrates, interface width and the relative location of the most advanced invader should each scale with interface length. We tested these predictions by letting white clover (Trifolium repens) invade ryegrass (Lolium perenne). The spatial correlation of clover growth developed as anticipated by kinetic roughening theory, and both interface width and the most advanced invader’s lead scaled with front length. However, the scaling exponents differed from those predicted by recent simulation studies, likely due to clover’s growth morphology. PMID:27465518
Miller, Christopher B; Bartlett, Delwyn J; Mullins, Anna E; Dodds, Kirsty L; Gordon, Christopher J; Kyle, Simon D; Kim, Jong Won; D'Rozario, Angela L; Lee, Rico S C; Comas, Maria; Marshall, Nathaniel S; Yee, Brendon J; Espie, Colin A; Grunstein, Ronald R
2016-11-01
To empirically derive and evaluate potential clusters of Insomnia Disorder through cluster analysis from polysomnography (PSG). We hypothesized that clusters would differ on neurocognitive performance, sleep-onset measures of quantitative ( q )-EEG and heart rate variability (HRV). Research volunteers with Insomnia Disorder (DSM-5) completed a neurocognitive assessment and overnight PSG measures of total sleep time (TST), wake time after sleep onset (WASO), and sleep onset latency (SOL) were used to determine clusters. From 96 volunteers with Insomnia Disorder, cluster analysis derived at least two clusters from objective sleep parameters: Insomnia with normal objective sleep duration (I-NSD: n = 53) and Insomnia with short sleep duration (I-SSD: n = 43). At sleep onset, differences in HRV between I-NSD and I-SSD clusters suggest attenuated parasympathetic activity in I-SSD (P < 0.05). Preliminary work suggested three clusters by retaining the I-NSD and splitting the I-SSD cluster into two: I-SSD A (n = 29): defined by high WASO and I-SSD B (n = 14): a second I-SSD cluster with high SOL and medium WASO. The I-SSD B cluster performed worse than I-SSD A and I-NSD for sustained attention (P ≤ 0.05). In an exploratory analysis, q -EEG revealed reduced spectral power also in I-SSD B before (Delta, Alpha, Beta-1) and after sleep-onset (Beta-2) compared to I-SSD A and I-NSD (P ≤ 0.05). Two insomnia clusters derived from cluster analysis differ in sleep onset HRV. Preliminary data suggest evidence for three clusters in insomnia with differences for sustained attention and sleep-onset q -EEG. Insomnia 100 sleep study: Australia New Zealand Clinical Trials Registry (ANZCTR) identification number 12612000049875. URL: https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=347742. © 2016 Associated Professional Sleep Societies, LLC.
Kelly, Patrick L; Rodney, Steven A; Treu, Tommaso; Foley, Ryan J; Brammer, Gabriel; Schmidt, Kasper B; Zitrin, Adi; Sonnenfeld, Alessandro; Strolger, Louis-Gregory; Graur, Or; Filippenko, Alexei V; Jha, Saurabh W; Riess, Adam G; Bradac, Marusa; Weiner, Benjamin J; Scolnic, Daniel; Malkan, Matthew A; von der Linden, Anja; Trenti, Michele; Hjorth, Jens; Gavazzi, Raphael; Fontana, Adriano; Merten, Julian C; McCully, Curtis; Jones, Tucker; Postman, Marc; Dressler, Alan; Patel, Brandon; Cenko, S Bradley; Graham, Melissa L; Tucker, Bradley E
2015-03-06
In 1964, Refsdal hypothesized that a supernova whose light traversed multiple paths around a strong gravitational lens could be used to measure the rate of cosmic expansion. We report the discovery of such a system. In Hubble Space Telescope imaging, we have found four images of a single supernova forming an Einstein cross configuration around a redshift z = 0.54 elliptical galaxy in the MACS J1149.6+2223 cluster. The cluster's gravitational potential also creates multiple images of the z = 1.49 spiral supernova host galaxy, and a future appearance of the supernova elsewhere in the cluster field is expected. The magnifications and staggered arrivals of the supernova images probe the cosmic expansion rate, as well as the distribution of matter in the galaxy and cluster lenses. Copyright © 2015, American Association for the Advancement of Science.
Peterson, Leif E
2002-01-01
CLUSFAVOR (CLUSter and Factor Analysis with Varimax Orthogonal Rotation) 5.0 is a Windows-based computer program for hierarchical cluster and principal-component analysis of microarray-based transcriptional profiles. CLUSFAVOR 5.0 standardizes input data; sorts data according to gene-specific coefficient of variation, standard deviation, average and total expression, and Shannon entropy; performs hierarchical cluster analysis using nearest-neighbor, unweighted pair-group method using arithmetic averages (UPGMA), or furthest-neighbor joining methods, and Euclidean, correlation, or jack-knife distances; and performs principal-component analysis. PMID:12184816
DICON: interactive visual analysis of multidimensional clusters.
Cao, Nan; Gotz, David; Sun, Jimeng; Qu, Huamin
2011-12-01
Clustering as a fundamental data analysis technique has been widely used in many analytic applications. However, it is often difficult for users to understand and evaluate multidimensional clustering results, especially the quality of clusters and their semantics. For large and complex data, high-level statistical information about the clusters is often needed for users to evaluate cluster quality while a detailed display of multidimensional attributes of the data is necessary to understand the meaning of clusters. In this paper, we introduce DICON, an icon-based cluster visualization that embeds statistical information into a multi-attribute display to facilitate cluster interpretation, evaluation, and comparison. We design a treemap-like icon to represent a multidimensional cluster, and the quality of the cluster can be conveniently evaluated with the embedded statistical information. We further develop a novel layout algorithm which can generate similar icons for similar clusters, making comparisons of clusters easier. User interaction and clutter reduction are integrated into the system to help users more effectively analyze and refine clustering results for large datasets. We demonstrate the power of DICON through a user study and a case study in the healthcare domain. Our evaluation shows the benefits of the technique, especially in support of complex multidimensional cluster analysis. © 2011 IEEE
Flow Cytometry Data Preparation Guidelines for Improved Automated Phenotypic Analysis.
Jimenez-Carretero, Daniel; Ligos, José M; Martínez-López, María; Sancho, David; Montoya, María C
2018-05-15
Advances in flow cytometry (FCM) increasingly demand adoption of computational analysis tools to tackle the ever-growing data dimensionality. In this study, we tested different data input modes to evaluate how cytometry acquisition configuration and data compensation procedures affect the performance of unsupervised phenotyping tools. An analysis workflow was set up and tested for the detection of changes in reference bead subsets and in a rare subpopulation of murine lymph node CD103 + dendritic cells acquired by conventional or spectral cytometry. Raw spectral data or pseudospectral data acquired with the full set of available detectors by conventional cytometry consistently outperformed datasets acquired and compensated according to FCM standards. Our results thus challenge the paradigm of one-fluorochrome/one-parameter acquisition in FCM for unsupervised cluster-based analysis. Instead, we propose to configure instrument acquisition to use all available fluorescence detectors and to avoid integration and compensation procedures, thereby using raw spectral or pseudospectral data for improved automated phenotypic analysis. Copyright © 2018 by The American Association of Immunologists, Inc.
Cluster Correspondence Analysis.
van de Velden, M; D'Enza, A Iodice; Palumbo, F
2017-03-01
A method is proposed that combines dimension reduction and cluster analysis for categorical data by simultaneously assigning individuals to clusters and optimal scaling values to categories in such a way that a single between variance maximization objective is achieved. In a unified framework, a brief review of alternative methods is provided and we show that the proposed method is equivalent to GROUPALS applied to categorical data. Performance of the methods is appraised by means of a simulation study. The results of the joint dimension reduction and clustering methods are compared with the so-called tandem approach, a sequential analysis of dimension reduction followed by cluster analysis. The tandem approach is conjectured to perform worse when variables are added that are unrelated to the cluster structure. Our simulation study confirms this conjecture. Moreover, the results of the simulation study indicate that the proposed method also consistently outperforms alternative joint dimension reduction and clustering methods.
Towards Effective Clustering Techniques for the Analysis of Electric Power Grids
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hogan, Emilie A.; Cotilla Sanchez, Jose E.; Halappanavar, Mahantesh
2013-11-30
Clustering is an important data analysis technique with numerous applications in the analysis of electric power grids. Standard clustering techniques are oblivious to the rich structural and dynamic information available for power grids. Therefore, by exploiting the inherent topological and electrical structure in the power grid data, we propose new methods for clustering with applications to model reduction, locational marginal pricing, phasor measurement unit (PMU or synchrophasor) placement, and power system protection. We focus our attention on model reduction for analysis based on time-series information from synchrophasor measurement devices, and spectral techniques for clustering. By comparing different clustering techniques onmore » two instances of realistic power grids we show that the solutions are related and therefore one could leverage that relationship for a computational advantage. Thus, by contrasting different clustering techniques we make a case for exploiting structure inherent in the data with implications for several domains including power systems.« less
Walthouwer, Michel Jean Louis; Oenema, Anke; Soetens, Katja; Lechner, Lilian; de Vries, Hein
2014-11-01
Developing nutrition education interventions based on clusters of dietary patterns can only be done adequately when it is clear if distinctive clusters of dietary patterns can be derived and reproduced over time, if cluster membership is stable, and if it is predictable which type of people belong to a certain cluster. Hence, this study aimed to: (1) identify clusters of dietary patterns among Dutch adults, (2) test the reproducibility of these clusters and stability of cluster membership over time, and (3) identify sociodemographic predictors of cluster membership and cluster transition. This study had a longitudinal design with online measurements at baseline (N=483) and 6 months follow-up (N=379). Dietary intake was assessed with a validated food frequency questionnaire. A hierarchical cluster analysis was performed, followed by a K-means cluster analysis. Multinomial logistic regression analyses were conducted to identify the sociodemographic predictors of cluster membership and cluster transition. At baseline and follow-up, a comparable three-cluster solution was derived, distinguishing a healthy, moderately healthy, and unhealthy dietary pattern. Male and lower educated participants were significantly more likely to have a less healthy dietary pattern. Further, 251 (66.2%) participants remained in the same cluster, 45 (11.9%) participants changed to an unhealthier cluster, and 83 (21.9%) participants shifted to a healthier cluster. Men and people living alone were significantly more likely to shift toward a less healthy dietary pattern. Distinctive clusters of dietary patterns can be derived. Yet, cluster membership is unstable and only few sociodemographic factors were associated with cluster membership and cluster transition. These findings imply that clusters based on dietary intake may not be suitable as a basis for nutrition education interventions. Copyright © 2014 Elsevier Ltd. All rights reserved.
Acquisition and processing of advanced sensor data for ERW and UXO detection and classification
NASA Astrophysics Data System (ADS)
Schultz, Gregory M.; Keranen, Joe; Miller, Jonathan S.; Shubitidze, Fridon
2014-06-01
The remediation of explosive remnants of war (ERW) and associated unexploded ordnance (UXO) has seen improvements through the injection of modern technological advances and streamlined standard operating procedures. However, reliable and cost-effective detection and geophysical mapping of sites contaminated with UXO such as cluster munitions, abandoned ordnance, and improvised explosive devices rely on the ability to discriminate hazardous items from metallic clutter. In addition to anthropogenic clutter, handheld and vehicle-based metal detector systems are plagued by natural geologic and environmental noise in many post conflict areas. We present new and advanced electromagnetic induction (EMI) technologies including man-portable and towed EMI arrays and associated data processing software. While these systems feature vastly different form factors and transmit-receive configurations, they all exhibit several fundamental traits that enable successful classification of EMI anomalies. Specifically, multidirectional sampling of scattered magnetic fields from targets and corresponding high volume of unique data provide rich information for extracting useful classification features for clutter rejection analysis. The quality of classification features depends largely on the extent to which the data resolve unique physics-based parameters. To date, most of the advanced sensors enable high quality inversion by producing data that are extremely rich in spatial content through multi-angle illumination and multi-point reception.
X-ray and optical substructures of the DAFT/FADA survey clusters
NASA Astrophysics Data System (ADS)
Guennou, L.; Durret, F.; Adami, C.; Lima Neto, G. B.
2013-04-01
We have undertaken the DAFT/FADA survey with the double aim of setting constraints on dark energy based on weak lensing tomography and of obtaining homogeneous and high quality data for a sample of 91 massive clusters in the redshift range 0.4-0.9 for which there were HST archive data. We have analysed the XMM-Newton data available for 42 of these clusters to derive their X-ray temperatures and luminosities and search for substructures. Out of these, a spatial analysis was possible for 30 clusters, but only 23 had deep enough X-ray data for a really robust analysis. This study was coupled with a dynamical analysis for the 26 clusters having at least 30 spectroscopic galaxy redshifts in the cluster range. Altogether, the X-ray sample of 23 clusters and the optical sample of 26 clusters have 14 clusters in common. We present preliminary results on the coupled X-ray and dynamical analyses of these 14 clusters.
Image-based quantification and mathematical modeling of spatial heterogeneity in ESC colonies.
Herberg, Maria; Zerjatke, Thomas; de Back, Walter; Glauche, Ingmar; Roeder, Ingo
2015-06-01
Pluripotent embryonic stem cells (ESCs) have the potential to differentiate into cells of all three germ layers. This unique property has been extensively studied on the intracellular, transcriptional level. However, ESCs typically form clusters of cells with distinct size and shape, and establish spatial structures that are vital for the maintenance of pluripotency. Even though it is recognized that the cells' arrangement and local interactions play a role in fate decision processes, the relations between transcriptional and spatial patterns have not yet been studied. We present a systems biology approach which combines live-cell imaging, quantitative image analysis, and multiscale, mathematical modeling of ESC growth. In particular, we develop quantitative measures of the morphology and of the spatial clustering of ESCs with different expression levels and apply them to images of both in vitro and in silico cultures. Using the same measures, we are able to compare model scenarios with different assumptions on cell-cell adhesions and intercellular feedback mechanisms directly with experimental data. Applying our methodology to microscopy images of cultured ESCs, we demonstrate that the emerging colonies are highly variable regarding both morphological and spatial fluorescence patterns. Moreover, we can show that most ESC colonies contain only one cluster of cells with high self-renewing capacity. These cells are preferentially located in the interior of a colony structure. The integrated approach combining image analysis with mathematical modeling allows us to reveal potential transcription factor related cellular and intercellular mechanisms behind the emergence of observed patterns that cannot be derived from images directly. © 2015 International Society for Advancement of Cytometry.
Gene Discovery in Bladder Cancer Progression using cDNA Microarrays
Sanchez-Carbayo, Marta; Socci, Nicholas D.; Lozano, Juan Jose; Li, Wentian; Charytonowicz, Elizabeth; Belbin, Thomas J.; Prystowsky, Michael B.; Ortiz, Angel R.; Childs, Geoffrey; Cordon-Cardo, Carlos
2003-01-01
To identify gene expression changes along progression of bladder cancer, we compared the expression profiles of early-stage and advanced bladder tumors using cDNA microarrays containing 17,842 known genes and expressed sequence tags. The application of bootstrapping techniques to hierarchical clustering segregated early-stage and invasive transitional carcinomas into two main clusters. Multidimensional analysis confirmed these clusters and more importantly, it separated carcinoma in situ from papillary superficial lesions and subgroups within early-stage and invasive tumors displaying different overall survival. Additionally, it recognized early-stage tumors showing gene profiles similar to invasive disease. Different techniques including standard t-test, single-gene logistic regression, and support vector machine algorithms were applied to identify relevant genes involved in bladder cancer progression. Cytokeratin 20, neuropilin-2, p21, and p33ING1 were selected among the top ranked molecular targets differentially expressed and validated by immunohistochemistry using tissue microarrays (n = 173). Their expression patterns were significantly associated with pathological stage, tumor grade, and altered retinoblastoma (RB) expression. Moreover, p33ING1 expression levels were significantly associated with overall survival. Analysis of the annotation of the most significant genes revealed the relevance of critical genes and pathways during bladder cancer progression, including the overexpression of oncogenic genes such as DEK in superficial tumors or immune response genes such as Cd86 antigen in invasive disease. Gene profiling successfully classified bladder tumors based on their progression and clinical outcome. The present study has identified molecular biomarkers of potential clinical significance and critical molecular targets associated with bladder cancer progression. PMID:12875971
2013-01-01
Background Multicellular organisms consist of cells of many different types that are established during development. Each type of cell is characterized by the unique combination of expressed gene products as a result of spatiotemporal gene regulation. Currently, a fundamental challenge in regulatory biology is to elucidate the gene expression controls that generate the complex body plans during development. Recent advances in high-throughput biotechnologies have generated spatiotemporal expression patterns for thousands of genes in the model organism fruit fly Drosophila melanogaster. Existing qualitative methods enhanced by a quantitative analysis based on computational tools we present in this paper would provide promising ways for addressing key scientific questions. Results We develop a set of computational methods and open source tools for identifying co-expressed embryonic domains and the associated genes simultaneously. To map the expression patterns of many genes into the same coordinate space and account for the embryonic shape variations, we develop a mesh generation method to deform a meshed generic ellipse to each individual embryo. We then develop a co-clustering formulation to cluster the genes and the mesh elements, thereby identifying co-expressed embryonic domains and the associated genes simultaneously. Experimental results indicate that the gene and mesh co-clusters can be correlated to key developmental events during the stages of embryogenesis we study. The open source software tool has been made available at http://compbio.cs.odu.edu/fly/. Conclusions Our mesh generation and machine learning methods and tools improve upon the flexibility, ease-of-use and accuracy of existing methods. PMID:24373308
Identifying novel phenotypes of acute heart failure using cluster analysis of clinical variables.
Horiuchi, Yu; Tanimoto, Shuzou; Latif, A H M Mahbub; Urayama, Kevin Y; Aoki, Jiro; Yahagi, Kazuyuki; Okuno, Taishi; Sato, Yu; Tanaka, Tetsu; Koseki, Keita; Komiyama, Kota; Nakajima, Hiroyoshi; Hara, Kazuhiro; Tanabe, Kengo
2018-07-01
Acute heart failure (AHF) is a heterogeneous disease caused by various cardiovascular (CV) pathophysiology and multiple non-CV comorbidities. We aimed to identify clinically important subgroups to improve our understanding of the pathophysiology of AHF and inform clinical decision-making. We evaluated detailed clinical data of 345 consecutive AHF patients using non-hierarchical cluster analysis of 77 variables, including age, sex, HF etiology, comorbidities, physical findings, laboratory data, electrocardiogram, echocardiogram and treatment during hospitalization. Cox proportional hazards regression analysis was performed to estimate the association between the clusters and clinical outcomes. Three clusters were identified. Cluster 1 (n=108) represented "vascular failure". This cluster had the highest average systolic blood pressure at admission and lung congestion with type 2 respiratory failure. Cluster 2 (n=89) represented "cardiac and renal failure". They had the lowest ejection fraction (EF) and worst renal function. Cluster 3 (n=148) comprised mostly older patients and had the highest prevalence of atrial fibrillation and preserved EF. Death or HF hospitalization within 12-month occurred in 23% of Cluster 1, 36% of Cluster 2 and 36% of Cluster 3 (p=0.034). Compared with Cluster 1, risk of death or HF hospitalization was 1.74 (95% CI, 1.03-2.95, p=0.037) for Cluster 2 and 1.82 (95% CI, 1.13-2.93, p=0.014) for Cluster 3. Cluster analysis may be effective in producing clinically relevant categories of AHF, and may suggest underlying pathophysiology and potential utility in predicting clinical outcomes. Copyright © 2018 Elsevier B.V. All rights reserved.
Mixture modelling for cluster analysis.
McLachlan, G J; Chang, S U
2004-10-01
Cluster analysis via a finite mixture model approach is considered. With this approach to clustering, the data can be partitioned into a specified number of clusters g by first fitting a mixture model with g components. An outright clustering of the data is then obtained by assigning an observation to the component to which it has the highest estimated posterior probability of belonging; that is, the ith cluster consists of those observations assigned to the ith component (i = 1,..., g). The focus is on the use of mixtures of normal components for the cluster analysis of data that can be regarded as being continuous. But attention is also given to the case of mixed data, where the observations consist of both continuous and discrete variables.
Miller, Christopher B.; Bartlett, Delwyn J.; Mullins, Anna E.; Dodds, Kirsty L.; Gordon, Christopher J.; Kyle, Simon D.; Kim, Jong Won; D'Rozario, Angela L.; Lee, Rico S.C.; Comas, Maria; Marshall, Nathaniel S.; Yee, Brendon J.; Espie, Colin A.; Grunstein, Ronald R.
2016-01-01
Study Objectives: To empirically derive and evaluate potential clusters of Insomnia Disorder through cluster analysis from polysomnography (PSG). We hypothesized that clusters would differ on neurocognitive performance, sleep-onset measures of quantitative (q)-EEG and heart rate variability (HRV). Methods: Research volunteers with Insomnia Disorder (DSM-5) completed a neurocognitive assessment and overnight PSG measures of total sleep time (TST), wake time after sleep onset (WASO), and sleep onset latency (SOL) were used to determine clusters. Results: From 96 volunteers with Insomnia Disorder, cluster analysis derived at least two clusters from objective sleep parameters: Insomnia with normal objective sleep duration (I-NSD: n = 53) and Insomnia with short sleep duration (I-SSD: n = 43). At sleep onset, differences in HRV between I-NSD and I-SSD clusters suggest attenuated parasympathetic activity in I-SSD (P < 0.05). Preliminary work suggested three clusters by retaining the I-NSD and splitting the I-SSD cluster into two: I-SSD A (n = 29): defined by high WASO and I-SSD B (n = 14): a second I-SSD cluster with high SOL and medium WASO. The I-SSD B cluster performed worse than I-SSD A and I-NSD for sustained attention (P ≤ 0.05). In an exploratory analysis, q-EEG revealed reduced spectral power also in I-SSD B before (Delta, Alpha, Beta-1) and after sleep-onset (Beta-2) compared to I-SSD A and I-NSD (P ≤ 0.05). Conclusions: Two insomnia clusters derived from cluster analysis differ in sleep onset HRV. Preliminary data suggest evidence for three clusters in insomnia with differences for sustained attention and sleep-onset q-EEG. Clinical Trial Registration: Insomnia 100 sleep study: Australia New Zealand Clinical Trials Registry (ANZCTR) identification number 12612000049875. URL: https://www.anzctr.org.au/Trial/Registration/TrialReview.aspx?id=347742. Citation: Miller CB, Bartlett DJ, Mullins AE, Dodds KL, Gordon CJ, Kyle SD, Kim JW, D'Rozario AL, Lee RS, Comas M, Marshall NS, Yee BJ, Espie CA, Grunstein RR. Clusters of Insomnia Disorder: an exploratory cluster analysis of objective sleep parameters reveals differences in neurocognitive functioning, quantitative EEG, and heart rate variability. SLEEP 2016;39(11):1993–2004. PMID:27568796
Polymer-based composites for aerospace: An overview of IMAST results
NASA Astrophysics Data System (ADS)
Milella, Eva; Cammarano, Aniello
2016-05-01
This paper gives an overview of technological results, achieved by IMAST, the Technological Cluster on Engineering of Polymeric Composite Materials and Structures, in the completed Research Projects in the aerospace field. In this sector, the Cluster developed different solutions: lightweight multifunctional fiber-reinforced polymer composites for aeronautic structures, advanced manufacturing processes (for the optimization of energy consumption and waste reduction) and multifunctional components (e.g., thermal, electrical, acoustic and fire resistance).
Esplin, M Sean; Manuck, Tracy A; Varner, Michael W; Christensen, Bryce; Biggio, Joseph; Bukowski, Radek; Parry, Samuel; Zhang, Heping; Huang, Hao; Andrews, William; Saade, George; Sadovsky, Yoel; Reddy, Uma M; Ilekis, John
2015-09-01
We sought to use an innovative tool that is based on common biologic pathways to identify specific phenotypes among women with spontaneous preterm birth (SPTB) to enhance investigators' ability to identify and to highlight common mechanisms and underlying genetic factors that are responsible for SPTB. We performed a secondary analysis of a prospective case-control multicenter study of SPTB. All cases delivered a preterm singleton at SPTB ≤34.0 weeks' gestation. Each woman was assessed for the presence of underlying SPTB causes. A hierarchic cluster analysis was used to identify groups of women with homogeneous phenotypic profiles. One of the phenotypic clusters was selected for candidate gene association analysis with the use of VEGAS software. One thousand twenty-eight women with SPTB were assigned phenotypes. Hierarchic clustering of the phenotypes revealed 5 major clusters. Cluster 1 (n = 445) was characterized by maternal stress; cluster 2 (n = 294) was characterized by premature membrane rupture; cluster 3 (n = 120) was characterized by familial factors, and cluster 4 (n = 63) was characterized by maternal comorbidities. Cluster 5 (n = 106) was multifactorial and characterized by infection (INF), decidual hemorrhage (DH), and placental dysfunction (PD). These 3 phenotypes were correlated highly by χ(2) analysis (PD and DH, P < 2.2e-6; PD and INF, P = 6.2e-10; INF and DH, (P = .0036). Gene-based testing identified the INS (insulin) gene as significantly associated with cluster 3 of SPTB. We identified 5 major clusters of SPTB based on a phenotype tool and hierarch clustering. There was significant correlation between several of the phenotypes. The INS gene was associated with familial factors that were underlying SPTB. Copyright © 2015 Elsevier Inc. All rights reserved.
EDITORIAL: Focus on Gravitational Lensing
NASA Astrophysics Data System (ADS)
Jain, Bhuvnesh
2007-11-01
Gravitational lensing emerged as an observational field following the 1979 discovery of a doubly imaged quasar lensed by a foreground galaxy. In the 1980s and '90s dozens of other multiply imaged systems were observed, as well as time delay measurements, weak and strong lensing by galaxies and galaxy clusters, and the discovery of microlensing in our galaxy. The rapid pace of advances has continued into the new century. Lensing is currently one of best techniques for finding and mapping dark matter over a wide range of scales, and also addresses broader cosmological questions such as understanding the nature of dark energy. This focus issue of New Journal of Physics presents a snapshot of current research in some of the exciting areas of lensing. It provides an occasion to look back at the advances of the last decade and ahead to the potential of the coming years. Just about a decade ago, microlensing was discovered through the magnification of stars in our galaxy by invisible objects with masses between that of Jupiter and a tenth the mass of the Sun. Thus a new component of the mass of our galaxy, dubbed MACHOs, was established (though a diffuse, cold dark matter-like component is still needed to make up most of the galaxy mass). More recently, microlensing led to another exciting discovery—of extra-solar planets with masses ranging from about five times that of Earth to that of Neptune. We can expect many more planets to be discovered through ongoing surveys. Microlensing is the best technique for finding Earth mass planets, though it is not as productive overall as other methods and does not allow for follow up observations. Beyond planet hunting, microlensing has enabled us to observe previously inaccessible systems, ranging from the surfaces of other stars to the accretion disks around the black holes powering distant quasars. Galaxies and galaxy clusters at cosmological distances can produce dramatic lensing effects: multiple images of background galaxies or quasars which are strongly magnified and sheared. In the last decade, double and quadruply imaged systems due to galactic lenses have been studied with optical and radio observations. An interesting result obtained from the flux ratio 'anomalies' of quadruply imaged systems is the statistical detection of dark sub-clumps in galaxy halos. More broadly, while we have learned a lot about the mass distribution in lens galaxies and improved time delay constraints on the Hubble constant, the limitations of cosmological studies with strong lensing due to uncertainties in lens mass models have also come to be appreciated. That said, progress will no doubt continue with qualitative advances in observations such as astrometric counterparts to the flux anomalies, clever ideas such as the use of spectroscopic signatures to assemble the SLACS lens sample, and combining optical imaging, spectroscopy and radio data to continue the quest for a set of golden lenses to measure the Hubble constant. Galaxy clusters are a fascinating arena for studying the distribution of dark and baryonic matter. Weak and strong lensing information can be combined with dynamical information from the spectroscopic measurements of member galaxies and x-ray/Sunyaev Zeldovich measurements of the hot ionized gas. Hubble Space Telescope observations have yielded spectacular images of clusters, such as Abell 1689, which has over a hundred multiply imaged arcs. Mass measurements have progressed to the level of 10 percent accuracy for several clusters. Unfortunately, it is unclear if one can do much better for individual clusters given inherent limitations such as unknown projection effects. The statistical study of clusters is likely to remain a promising way to study dark matter, gravity theories, and cosmology. Techniques to combine weak and strong lensing information to obtain the mass distribution of clusters have also advanced, and work continues on parameter-free techniques that are agnostic to the relation of cluster light and mass. An interesting twist in cluster lensing was provided by the post-merger Bullet Cluster (identified as 1E0657-558). In this and other merging clusters, the lensing mass is displaced from the baryonic center of mass, presenting a challenge to theories that attempt to explain away dark matter by positing a modification to the law of gravity. Detailed modeling and multi-wavelength data on these systems will provide interesting limits on dark matter as well as the possibility of a major surprise. Other advances may come from the gravitational telescope effect of galaxy clusters: regions with very high magnification can be used to image proto-galaxies at z ~ 10. Statistical studies of galaxy and cluster lenses and of invisible, diffuse large-scale structures via weak lensing have come into their own in recent years. A census of the mass distribution at low redshift has been made using the technique of galaxy galaxy lensing: the mean mass profiles of galaxies and clusters have been measured using the weak tangential shear imprinted on background galaxies. These can be correlated with a variety of luminous tracers to study galaxy/cluster properties at a level of detail not possible until recently. Equally impressive is the measurement of excess mass correlations out to ~30 Mpc from these halos, requiring measurements of shear signals below 0.01%. These measurements account for the total matter density inferred from the CMB plus other observations, thus providing a direct measure of dark matter in the present day universe. Cosmic shear refers to the more challenging measurement of shear shear correlations without the use of foreground objects to orient the shear. The first detections of such correlations were published in 2001; since then measurements from arcminute to degree scales have been made with much improved accuracy. Theoretical techniques of lensing tomography and advances in analysis methods to eliminate systematic errors have progressed rapidly. That cosmic shear is now regarded as a key element of major missions aimed at probing dark energy is a feat of scientific persuasion—a decade ago not many believed it was realistic to even detect this tiny shear signal, let alone measure it with the percent-level accuracy needed to advance dark energy measurements. If weak lensing measurements deliver on their promise, then, in combination with other imaging and spectroscopic probes, they may well impact fundamental physics and cosmology. For example they may find evidence for an evolving dark energy component or signatures of departures from general relativity. These exciting prospects rest on new optical surveys planned for the next five years which will image a thousand square degrees or more of the sky to redshifts ~1 (compared to about a hundred square degrees imaged currently). Further, through photometric redshifts based on galaxy colors, lensing tomography methods will be applied to learn about the three-dimensional distribution of dark matter. Lensing measurements in other wavelengths, such as planned 21-cm surveys and CMB lensing, would add valuable diversity to measurement techniques. The case for the next generation optical surveys from the ground and space is compelling as well: they will produce another order of magnitude in data quantity and deliver images with minimal distortions due to the atmosphere and telescope optics. The coming decade therefore has the potential for exciting discoveries in gravitational lensing. Focus on Gravitational Lensing Contents A Bayesian approach to strong lensing modelling of galaxy clusters E Jullo, J-P Kneib, M Limousin, Á Elíasdóttir, P J Marshall and T Verdugo Probing dark energy with cluster counts and cosmic shear power spectra: including the full covariance Masahiro Takada and Sarah Bridle How robust are the constraints on cosmology and galaxy evolution from the lens-redshift test? Pedro R Capelo and Priyamvada Natarajan Dark energy constraints from cosmic shear power spectra: impact of intrinsic alignments on photometric redshift requirements Sarah Bridle and Lindsay King An integral-field spectroscopic strong lens survey Adam S Bolton and Scott Burles Is there a quad problem among optical gravitational lenses? Masamune Oguri Cluster mass estimators from CMB temperature and polarization lensing Wayne Hu, Simon DeDeo and Chris Vale
Cluster analysis of the hot subdwarfs in the PG survey
NASA Technical Reports Server (NTRS)
Thejll, Peter; Charache, Darryl; Shipman, Harry L.
1989-01-01
Application of cluster analysis to the hot subdwarfs in the Palomar Green (PG) survey of faint blue high-Galactic-latitude objects is assessed, with emphasis on data noise and the number of clusters to subdivide the data into. The data used in the study are presented, and cluster analysis, using the CLUSTAN program, is applied to it. Distances are calculated using the Euclidean formula, and clustering is done by Ward's method. The results are discussed, and five groups representing natural divisions of the subdwarfs in the PG survey are presented.
Using Machine Learning Techniques in the Analysis of Oceanographic Data
NASA Astrophysics Data System (ADS)
Falcinelli, K. E.; Abuomar, S.
2017-12-01
Acoustic Doppler Current Profilers (ADCPs) are oceanographic tools capable of collecting large amounts of current profile data. Using unsupervised machine learning techniques such as principal component analysis, fuzzy c-means clustering, and self-organizing maps, patterns and trends in an ADCP dataset are found. Cluster validity algorithms such as visual assessment of cluster tendency and clustering index are used to determine the optimal number of clusters in the ADCP dataset. These techniques prove to be useful in analysis of ADCP data and demonstrate potential for future use in other oceanographic applications.
Impact of Sampling Density on the Extent of HIV Clustering
Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor
2014-01-01
Abstract Identifying and monitoring HIV clusters could be useful in tracking the leading edge of HIV transmission in epidemics. Currently, greater specificity in the definition of HIV clusters is needed to reduce confusion in the interpretation of HIV clustering results. We address sampling density as one of the key aspects of HIV cluster analysis. The proportion of viral sequences in clusters was estimated at sampling densities from 1.0% to 70%. A set of 1,248 HIV-1C env gp120 V1C5 sequences from a single community in Botswana was utilized in simulation studies. Matching numbers of HIV-1C V1C5 sequences from the LANL HIV Database were used as comparators. HIV clusters were identified by phylogenetic inference under bootstrapped maximum likelihood and pairwise distance cut-offs. Sampling density below 10% was associated with stochastic HIV clustering with broad confidence intervals. HIV clustering increased linearly at sampling density >10%, and was accompanied by narrowing confidence intervals. Patterns of HIV clustering were similar at bootstrap thresholds 0.7 to 1.0, but the extent of HIV clustering decreased with higher bootstrap thresholds. The origin of sampling (local concentrated vs. scattered global) had a substantial impact on HIV clustering at sampling densities ≥10%. Pairwise distances at 10% were estimated as a threshold for cluster analysis of HIV-1 V1C5 sequences. The node bootstrap support distribution provided additional evidence for 10% sampling density as the threshold for HIV cluster analysis. The detectability of HIV clusters is substantially affected by sampling density. A minimal genotyping density of 10% and sampling density of 50–70% are suggested for HIV-1 V1C5 cluster analysis. PMID:25275430
GraphCrunch 2: Software tool for network modeling, alignment and clustering.
Kuchaiev, Oleksii; Stevanović, Aleksandar; Hayes, Wayne; Pržulj, Nataša
2011-01-19
Recent advancements in experimental biotechnology have produced large amounts of protein-protein interaction (PPI) data. The topology of PPI networks is believed to have a strong link to their function. Hence, the abundance of PPI data for many organisms stimulates the development of computational techniques for the modeling, comparison, alignment, and clustering of networks. In addition, finding representative models for PPI networks will improve our understanding of the cell just as a model of gravity has helped us understand planetary motion. To decide if a model is representative, we need quantitative comparisons of model networks to real ones. However, exact network comparison is computationally intractable and therefore several heuristics have been used instead. Some of these heuristics are easily computable "network properties," such as the degree distribution, or the clustering coefficient. An important special case of network comparison is the network alignment problem. Analogous to sequence alignment, this problem asks to find the "best" mapping between regions in two networks. It is expected that network alignment might have as strong an impact on our understanding of biology as sequence alignment has had. Topology-based clustering of nodes in PPI networks is another example of an important network analysis problem that can uncover relationships between interaction patterns and phenotype. We introduce the GraphCrunch 2 software tool, which addresses these problems. It is a significant extension of GraphCrunch which implements the most popular random network models and compares them with the data networks with respect to many network properties. Also, GraphCrunch 2 implements the GRAph ALigner algorithm ("GRAAL") for purely topological network alignment. GRAAL can align any pair of networks and exposes large, dense, contiguous regions of topological and functional similarities far larger than any other existing tool. Finally, GraphCruch 2 implements an algorithm for clustering nodes within a network based solely on their topological similarities. Using GraphCrunch 2, we demonstrate that eukaryotic and viral PPI networks may belong to different graph model families and show that topology-based clustering can reveal important functional similarities between proteins within yeast and human PPI networks. GraphCrunch 2 is a software tool that implements the latest research on biological network analysis. It parallelizes computationally intensive tasks to fully utilize the potential of modern multi-core CPUs. It is open-source and freely available for research use. It runs under the Windows and Linux platforms.
2012-01-01
Background Decreasing salt consumption can prevent cardiovascular diseases (CVD). Practically, it is difficult to promote people’s awareness of daily salt intake and to change their eating habits in terms of reducing salt intake for better cardiovascular health. Health education programs visualizing daily dietary salt content and intake may promote lifestyle changes in patients at high risk of cardiovascular diseases. Methods/Design This is a cluster randomized trial. A total of 800 high-CVD-risk patients attending diabetes and hypertension clinics at health centers in Muang District, Chiang Rai province, Thailand, will be studied with informed consent. A health center recruiting 100 participants is a cluster, the unit of randomization. Eight clusters will be randomized into intervention and control arms and followed up for 1 year. Within the intervention clusters the following will be undertaken: (1) salt content in the daily diet will be measured and shown to study participants; (2) 24-hour salt intake will be estimated in overnight-collected urine and the results shown to the participants; (3) a dietician will assist small group health education classes in cooking meals with less salt. The primary outcome is blood pressure change at the 1-year follow-up. Secondary outcomes at the 1-year follow-up are estimated 24-hoursalt intake, incidence of CVD events and CVD death. The intention-to-treat analysis will be followed. Blood pressure and estimated 24-hour salt intake will be compared between intervention and control groups at the cluster and individual level at the 1-year follow-up. Clinical CVD events and deaths will be analyzed by time-event analysis. Retinal blood vessel calibers of CVD-risk patients will be assessed cross-sectionally. Behavioral change to reduce salt intake and the influencing factors will be determined by structured equation model (SEM). Multilevel regression analyses will be applied. Finally, the cost effectiveness of the intervention will be analyzed. Discussion This study is unique as it will recruit the individuals most vulnerable to CVD morbidity and mortality by applying the general Framingham CVD risk scoring system. Dietary salt reduction will be applied as a prioritized, community level intervention for the prevention of CVD in a developing country. Trial registration ISRCTN39416277 PMID:22947342
Aung, Myo Nyein; Yuasa, Motoyuki; Moolphate, Saiyud; Nedsuwan, Supalert; Yokokawa, Hidehiro; Kitajima, Tsutomu; Minematsu, Kazuo; Tanimura, Susumu; Fukuda, Hiroshi; Hiratsuka, Yoshimune; Ono, Koichi; Kawai, Sachio; Marui, Eiji
2012-09-04
Decreasing salt consumption can prevent cardiovascular diseases (CVD). Practically, it is difficult to promote people's awareness of daily salt intake and to change their eating habits in terms of reducing salt intake for better cardiovascular health. Health education programs visualizing daily dietary salt content and intake may promote lifestyle changes in patients at high risk of cardiovascular diseases. This is a cluster randomized trial. A total of 800 high-CVD-risk patients attending diabetes and hypertension clinics at health centers in Muang District, Chiang Rai province, Thailand, will be studied with informed consent. A health center recruiting 100 participants is a cluster, the unit of randomization. Eight clusters will be randomized into intervention and control arms and followed up for 1 year. Within the intervention clusters the following will be undertaken: (1) salt content in the daily diet will be measured and shown to study participants; (2) 24-hour salt intake will be estimated in overnight-collected urine and the results shown to the participants; (3) a dietician will assist small group health education classes in cooking meals with less salt. The primary outcome is blood pressure change at the 1-year follow-up. Secondary outcomes at the 1-year follow-up are estimated 24-hoursalt intake, incidence of CVD events and CVD death. The intention-to-treat analysis will be followed.Blood pressure and estimated 24-hour salt intake will be compared between intervention and control groups at the cluster and individual level at the 1-year follow-up. Clinical CVD events and deaths will be analyzed by time-event analysis. Retinal blood vessel calibers of CVD-risk patients will be assessed cross-sectionally. Behavioral change to reduce salt intake and the influencing factors will be determined by structured equation model (SEM). Multilevel regression analyses will be applied. Finally, the cost effectiveness of the intervention will be analyzed. This study is unique as it will recruit the individuals most vulnerable to CVD morbidity and mortality by applying the general Framingham CVD risk scoring system. Dietary salt reduction will be applied as a prioritized, community level intervention for the prevention of CVD in a developing country. ISRCTN39416277.
Bae, Hyoung Won; Rho, Seungsoo; Lee, Hye Sun; Lee, Naeun; Hong, Samin; Seong, Gong Je; Sung, Kyung Rim; Kim, Chan Yun
2014-04-29
To classify medically treated open-angle glaucoma (OAG) by the pattern of progression using hierarchical cluster analysis, and to determine OAG progression characteristics by comparing clusters. Ninety-five eyes of 95 OAG patients who received medical treatment, and who had undergone visual field (VF) testing at least once per year for 5 or more years. OAG was classified into subgroups using hierarchical cluster analysis based on the following five variables: baseline mean deviation (MD), baseline visual field index (VFI), MD slope, VFI slope, and Glaucoma Progression Analysis (GPA) printout. After that, other parameters were compared between clusters. Two clusters were made after a hierarchical cluster analysis. Cluster 1 showed -4.06 ± 2.43 dB baseline MD, 92.58% ± 6.27% baseline VFI, -0.28 ± 0.38 dB per year MD slope, -0.52% ± 0.81% per year VFI slope, and all "no progression" cases in GPA printout, whereas cluster 2 showed -8.68 ± 3.81 baseline MD, 77.54 ± 12.98 baseline VFI, -0.72 ± 0.55 MD slope, -2.22 ± 1.89 VFI slope, and seven "possible" and four "likely" progression cases in GPA printout. There were no significant differences in age, sex, mean IOP, central corneal thickness, and axial length between clusters. However, cluster 2 included more high-tension glaucoma patients and used a greater number of antiglaucoma eye drops significantly compared with cluster 1. Hierarchical cluster analysis of progression patterns divided OAG into slow and fast progression groups, evidenced by assessing the parameters of glaucomatous progression in VF testing. In the fast progression group, the prevalence of high-tension glaucoma was greater and the number of antiglaucoma medications administered was increased versus the slow progression group. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
Elemental Abundances in the Intracluster Gas and the Hot Galactic Coronae in Cluster A194
NASA Technical Reports Server (NTRS)
Forman, William R.
1997-01-01
We have completed the analysis of observations of the Coma cluster and are continuing analysis of A1367 both of which are shown to be merging clusters. Also, we are analyzing observations of the Centaurus cluster which we see as a merger based in both its temperature and surface brightness distributions. Attachment: Another collision for the coma cluster.
A Cluster of Legionella-Associated Pneumonia Cases in a Population of Military Recruits
2007-06-01
this cluster may suggest a previously unrecognized suscep- FIG. 1. Phylogenic analysis of the training center strain (represented by the MCRD consensus...military recruits during population- based surveillance for pneumonia pathogens. Results were confirmed by sequence analysis . Cases cluster tightly...17 April 2007 A Legionella cluster was identified through retrospective PCR analysis of 240 throat swab samples from X-ray-confirmed pneumonia cases
Cluster Headache: Epidemiology, Pathophysiology, Clinical Features, and Diagnosis
Wei, Diana Yi-Ting; Yuan Ong, Jonathan Jia; Goadsby, Peter James
2018-01-01
Cluster headache is a primary headache disorder affecting up to 0.1% of the population. Patients suffer from cluster headache attacks lasting from 15 to 180 min up to 8 times a day. The attacks are characterized by the severe unilateral pain mainly in the first division of the trigeminal nerve, with associated prominent unilateral cranial autonomic symptoms and a sense of agitation and restlessness during the attacks. The male-to-female ratio is approximately 2.5:1. Experimental, clinical, and neuroimaging studies have advanced our understanding of the pathogenesis of cluster headache. The pathophysiology involves activation of the trigeminovascular complex and the trigeminal-autonomic reflex and accounts for the unilateral severe headache, the prominent ipsilateral cranial autonomic symptoms. In addition, the circadian and circannual rhythmicity unique to this condition is postulated to involve the hypothalamus and suprachiasmatic nucleus. Although the clinical features are distinct, it may be misdiagnosed, with patients often presenting to the otolaryngologist or dentist with symptoms. The prognosis of cluster headache remains difficult to predict. Patients with episodic cluster headache can shift to chronic cluster headache and vice versa. Longitudinally, cluster headache tends to remit with age with less frequent bouts and more prolonged periods of remission in between bouts. PMID:29720812
Cluster Headache: Epidemiology, Pathophysiology, Clinical Features, and Diagnosis.
Wei, Diana Yi-Ting; Yuan Ong, Jonathan Jia; Goadsby, Peter James
2018-04-01
Cluster headache is a primary headache disorder affecting up to 0.1% of the population. Patients suffer from cluster headache attacks lasting from 15 to 180 min up to 8 times a day. The attacks are characterized by the severe unilateral pain mainly in the first division of the trigeminal nerve, with associated prominent unilateral cranial autonomic symptoms and a sense of agitation and restlessness during the attacks. The male-to-female ratio is approximately 2.5:1. Experimental, clinical, and neuroimaging studies have advanced our understanding of the pathogenesis of cluster headache. The pathophysiology involves activation of the trigeminovascular complex and the trigeminal-autonomic reflex and accounts for the unilateral severe headache, the prominent ipsilateral cranial autonomic symptoms. In addition, the circadian and circannual rhythmicity unique to this condition is postulated to involve the hypothalamus and suprachiasmatic nucleus. Although the clinical features are distinct, it may be misdiagnosed, with patients often presenting to the otolaryngologist or dentist with symptoms. The prognosis of cluster headache remains difficult to predict. Patients with episodic cluster headache can shift to chronic cluster headache and vice versa. Longitudinally, cluster headache tends to remit with age with less frequent bouts and more prolonged periods of remission in between bouts.
A scoping review of spatial cluster analysis techniques for point-event data.
Fritz, Charles E; Schuurman, Nadine; Robertson, Colin; Lear, Scott
2013-05-01
Spatial cluster analysis is a uniquely interdisciplinary endeavour, and so it is important to communicate and disseminate ideas, innovations, best practices and challenges across practitioners, applied epidemiology researchers and spatial statisticians. In this research we conducted a scoping review to systematically search peer-reviewed journal databases for research that has employed spatial cluster analysis methods on individual-level, address location, or x and y coordinate derived data. To illustrate the thematic issues raised by our results, methods were tested using a dataset where known clusters existed. Point pattern methods, spatial clustering and cluster detection tests, and a locally weighted spatial regression model were most commonly used for individual-level, address location data (n = 29). The spatial scan statistic was the most popular method for address location data (n = 19). Six themes were identified relating to the application of spatial cluster analysis methods and subsequent analyses, which we recommend researchers to consider; exploratory analysis, visualization, spatial resolution, aetiology, scale and spatial weights. It is our intention that researchers seeking direction for using spatial cluster analysis methods, consider the caveats and strengths of each approach, but also explore the numerous other methods available for this type of analysis. Applied spatial epidemiology researchers and practitioners should give special consideration to applying multiple tests to a dataset. Future research should focus on developing frameworks for selecting appropriate methods and the corresponding spatial weighting schemes.
ULTRA-COMPACT DWARFS IN THE COMA CLUSTER
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chiboucas, Kristin; Tully, R. Brent; Marzke, R. O.
2011-08-20
We have undertaken a spectroscopic search for ultra-compact dwarf galaxies (UCDs) in the dense core of the dynamically evolved, massive Coma cluster as part of the Hubble Space Telescope/Advanced Camera for Surveys (HST/ACS) Coma Cluster Treasury Survey. UCD candidates were initially chosen based on color, magnitude, degree of resolution within the ACS images, and the known properties of Fornax and Virgo UCDs. Follow-up spectroscopy with Keck/Low-Resolution Imaging Spectrometer confirmed 27 candidates as members of the Coma cluster, a success rate >60% for targeted objects brighter than M{sub R} = -12. Another 14 candidates may also prove to be Coma members,more » but low signal-to-noise spectra prevent definitive conclusions. An investigation of the properties and distribution of the Coma UCDs finds these objects to be very similar to UCDs discovered in other environments. The Coma UCDs tend to be clustered around giant galaxies in the cluster core and have colors/metallicity that correlate with the host galaxy. With properties and a distribution similar to that of the Coma cluster globular cluster population, we find strong support for a star cluster origin for the majority of the Coma UCDs. However, a few UCDs appear to have stellar population or structural properties which differentiate them from the old star cluster populations found in the Coma cluster, perhaps indicating that UCDs may form through multiple formation channels.« less
NASA Astrophysics Data System (ADS)
Rood, R. T.; Renzini, A.
1997-01-01
The present volume on stellar evolution discusses fundamentals of stellar evolution and star clusters, variable stars, AGB stars and planetary nebulae, white dwarfs, binary star evolution, and stars in galaxies. Attention is given to the stellar population in the Galactic bulge, a photometric study of NGC 458, and HST observations of high-density globular clusters. Other topics addressed include the Cepheid instability strip in external galaxies, Hyades cluster white dwarfs and the initial-final mass relation, element diffusion in novae, mass function of the stars in the solar neighborhood, synthetic spectral indices for elliptical galaxies, and stars at the Galactic center.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mitchell, John; Castillo, Andrew
2016-09-21
This software contains a set of python modules – input, search, cluster, analysis; these modules read input files containing spatial coordinates and associated attributes which can be used to perform nearest neighbor search (spatial indexing via kdtree), cluster analysis/identification, and calculation of spatial statistics for analysis.
Dhayal, Rajendra S; van Zyl, Werner E; Liu, C W
2016-01-19
Metal hydride clusters have historically been studied to unravel their aesthetically pleasing molecular structures and interesting properties, especially toward hydrogen related applications. Central to this work is the hydride ligand, H¯, the smallest closed-shell spherical anion known. Two new developments in polyhydrido nanocluster chemistry include the determination of heretofore unknown hydride coordination modes and novel structural constructs, and conversion from the molecular entities to rhombus-shaped copper nanoparticles (CuNPs). These advances, together with hydrogen evolution and catalysis, have provided both experimentalists and theorists with a rich scientific directive to further explore. The isolation of hexameric [{(Ph3P)CuH}6] (Stryker reagent) could be regarded as the springboard for the recent emergence of polyhydrido copper cluster chemistry due to its utilization in a variety of organic chemical transformations. The stability of clusters of various nuclearity was improved through phosphine, pyridine, and carbene type ligands. Our focus lies with the isolation of novel copper (poly)hydride clusters using mostly the phosphor-1,1-dithiolato type ligands. We found such chalcogen-stabilized clusters to be exceptionally air and moisture stable over a wide range of nuclearities (Cu7 to Cu32). In this Account, we (i) report on state-of-the-art copper hydride cluster chemistry, especially with regards to the diverse and novel structural types generally, and newly discovered hydride coordination modes in particular, (ii) demonstrate the indispensable power of neutron diffraction for the unambiguous assignment and location of hydride ligand(s) within a cluster, and (iii) prove unique transformations that can occur not only between well characterized high nuclearity clusters, but also how such clusters can transform to uniquely shaped nanoparticles of several nanometers in diameter through copper hydride reduction. The increase in the number of low- to high-nuclearity hydride clusters allows for different means by which they can be classified. We chose a classification based on the coordination mode of hydride ligand within the cluster. This includes copper clusters associated with bridging (μ2-H) and capping (μ3-H) hydride modes, followed by an interstitial (μ4-H) hydride mode that was introduced for the first time into octa- and hepta-nuclear copper clusters stabilized by dichalcogen-type ligands. This breakthrough provided a means to explore higher nuclearity polyhydrido nanoclusters, which contain both capping (μ3-H) and interstitial (μ(4-6)-H) hydrides. The presence of bidentate ligands having mixed S/P dative sites led to air- and moisture-stable copper hydride nanoclusters. The formation of rhombus-shaped nanoparticles (CuNPs) from copper polyhydrides in the presence of excess borohydrides suggests the presence of metal hydrides as intermediates during the formation of nanoparticles.
Multivariate statistical analysis of wildfires in Portugal
NASA Astrophysics Data System (ADS)
Costa, Ricardo; Caramelo, Liliana; Pereira, Mário
2013-04-01
Several studies demonstrate that wildfires in Portugal present high temporal and spatial variability as well as cluster behavior (Pereira et al., 2005, 2011). This study aims to contribute to the characterization of the fire regime in Portugal with the multivariate statistical analysis of the time series of number of fires and area burned in Portugal during the 1980 - 2009 period. The data used in the analysis is an extended version of the Rural Fire Portuguese Database (PRFD) (Pereira et al, 2011), provided by the National Forest Authority (Autoridade Florestal Nacional, AFN), the Portuguese Forest Service, which includes information for more than 500,000 fire records. There are many multiple advanced techniques for examining the relationships among multiple time series at the same time (e.g., canonical correlation analysis, principal components analysis, factor analysis, path analysis, multiple analyses of variance, clustering systems). This study compares and discusses the results obtained with these different techniques. Pereira, M.G., Trigo, R.M., DaCamara, C.C., Pereira, J.M.C., Leite, S.M., 2005: "Synoptic patterns associated with large summer forest fires in Portugal". Agricultural and Forest Meteorology. 129, 11-25. Pereira, M. G., Malamud, B. D., Trigo, R. M., and Alves, P. I.: The history and characteristics of the 1980-2005 Portuguese rural fire database, Nat. Hazards Earth Syst. Sci., 11, 3343-3358, doi:10.5194/nhess-11-3343-2011, 2011 This work is supported by European Union Funds (FEDER/COMPETE - Operational Competitiveness Programme) and by national funds (FCT - Portuguese Foundation for Science and Technology) under the project FCOMP-01-0124-FEDER-022692, the project FLAIR (PTDC/AAC-AMB/104702/2008) and the EU 7th Framework Program through FUME (contract number 243888).
Ning, P; Guo, Y F; Sun, T Y; Zhang, H S; Chai, D; Li, X M
2016-09-01
To study the distinct clinical phenotype of chronic airway diseases by hierarchical cluster analysis and two-step cluster analysis. A population sample of adult patients in Donghuamen community, Dongcheng district and Qinghe community, Haidian district, Beijing from April 2012 to January 2015, who had wheeze within the last 12 months, underwent detailed investigation, including a clinical questionnaire, pulmonary function tests, total serum IgE levels, blood eosinophil level and a peak flow diary. Nine variables were chosen as evaluating parameters, including pre-salbutamol forced expired volume in one second(FEV1)/forced vital capacity(FVC) ratio, pre-salbutamol FEV1, percentage of post-salbutamol change in FEV1, residual capacity, diffusing capacity of the lung for carbon monoxide/alveolar volume adjusted for haemoglobin level, peak expiratory flow(PEF) variability, serum IgE level, cumulative tobacco cigarette consumption (pack-years) and respiratory symptoms (cough and expectoration). Subjects' different clinical phenotype by hierarchical cluster analysis and two-step cluster analysis was identified. (1) Four clusters were identified by hierarchical cluster analysis. Cluster 1 was chronic bronchitis in smokers with normal pulmonary function. Cluster 2 was chronic bronchitis or mild chronic obstructive pulmonary disease (COPD) patients with mild airflow limitation. Cluster 3 included COPD patients with heavy smoking, poor quality of life and severe airflow limitation. Cluster 4 recognized atopic patients with mild airflow limitation, elevated serum IgE and clinical features of asthma. Significant differences were revealed regarding pre-salbutamol FEV1/FVC%, pre-salbutamol FEV1% pred, post-salbutamol change in FEV1%, maximal mid-expiratory flow curve(MMEF)% pred, carbon monoxide diffusing capacity per liter of alveolar(DLCO)/(VA)% pred, residual volume(RV)% pred, total serum IgE level, smoking history (pack-years), St.George's respiratory questionnaire(SGRQ) score, acute exacerbation in the past one year, PEF variability and allergic dermatitis (P<0.05). (2) Four clusters were also identified by two-step cluster analysis as followings, cluster 1, COPD patients with moderate to severe airflow limitation; cluster 2, asthma and COPD patients with heavy smoking, airflow limitation and increased airways reversibility; cluster 3, patients having less smoking and normal pulmonary function with wheezing but no chronic cough; cluster 4, chronic bronchitis patients with normal pulmonary function and chronic cough. Significant differences were revealed regarding gender distribution, respiratory symptoms, pre-salbutamol FEV1/FVC%, pre-salbutamol FEV1% pred, post-salbutamol change in FEV1%, MMEF% pred, DLCO/VA% pred, RV% pred, PEF variability, total serum IgE level, cumulative tobacco cigarette consumption (pack-years), and SGRQ score (P<0.05). By different cluster analyses, distinct clinical phenotypes of chronic airway diseases are identified. Thus, individualized treatments may guide doctors to provide based on different phenotypes.
Brägelmann, Johannes; Klümper, Niklas; Offermann, Anne; von Mässenhausen, Anne; Böhm, Diana; Deng, Mario; Queisser, Angela; Sanders, Christine; Syring, Isabella; Merseburger, Axel S; Vogel, Wenzel; Sievers, Elisabeth; Vlasic, Ignacija; Carlsson, Jessica; Andrén, Ove; Brossart, Peter; Duensing, Stefan; Svensson, Maria A; Shaikhibrahim, Zaki; Kirfel, Jutta; Perner, Sven
2017-04-01
Purpose: The Mediator complex is a multiprotein assembly, which serves as a hub for diverse signaling pathways to regulate gene expression. Because gene expression is frequently altered in cancer, a systematic understanding of the Mediator complex in malignancies could foster the development of novel targeted therapeutic approaches. Experimental Design: We performed a systematic deconvolution of the Mediator subunit expression profiles across 23 cancer entities ( n = 8,568) using data from The Cancer Genome Atlas (TCGA). Prostate cancer-specific findings were validated in two publicly available gene expression cohorts and a large cohort of primary and advanced prostate cancer ( n = 622) stained by immunohistochemistry. The role of CDK19 and CDK8 was evaluated by siRNA-mediated gene knockdown and inhibitor treatment in prostate cancer cell lines with functional assays and gene expression analysis by RNAseq. Results: Cluster analysis of TCGA expression data segregated tumor entities, indicating tumor-type-specific Mediator complex compositions. Only prostate cancer was marked by high expression of CDK19 In primary prostate cancer, CDK19 was associated with increased aggressiveness and shorter disease-free survival. During cancer progression, highest levels of CDK19 and of its paralog CDK8 were present in metastases. In vitro , inhibition of CDK19 and CDK8 by knockdown or treatment with a selective CDK8/CDK19 inhibitor significantly decreased migration and invasion. Conclusions: Our analysis revealed distinct transcriptional expression profiles of the Mediator complex across cancer entities indicating differential modes of transcriptional regulation. Moreover, it identified CDK19 and CDK8 to be specifically overexpressed during prostate cancer progression, highlighting their potential as novel therapeutic targets in advanced prostate cancer. Clin Cancer Res; 23(7); 1829-40. ©2016 AACR . ©2016 American Association for Cancer Research.
Brazil, Kevin; Carter, Gillian; Cardwell, Chris; Clarke, Mike; Hudson, Peter; Froggatt, Katherine; McLaughlin, Dorry; Passmore, Peter; Kernohan, W George
2018-03-01
In dementia care, a large number of treatment decisions are made by family carers on behalf of their family member who lacks decisional capacity; advance care planning can support such carers in the decision-making of care goals. However, given the relative importance of advance care planning in dementia care, the prevalence of advance care planning in dementia care is poor. To evaluate the effectiveness of advance care planning with family carers in dementia care homes. Paired cluster randomized controlled trial. The intervention comprised a trained facilitator, family education, family meetings, documentation of advance care planning decisions and intervention orientation for general practitioners and nursing home staff. A total of 24 nursing homes with a dementia nursing category located in Northern Ireland, United Kingdom. Family carers of nursing home residents classified as having dementia and judged as not having decisional capacity to participate in advance care planning discussions. The primary outcome was family carer uncertainty in decision-making about the care of the resident (Decisional Conflict Scale). There was evidence of a reduction in total Decisional Conflict Scale score in the intervention group compared with the usual care group (-10.5, 95% confidence interval: -16.4 to -4.7; p < 0.001). Advance care planning was effective in reducing family carer uncertainty in decision-making concerning the care of their family member and improving perceptions of quality of care in nursing homes. Given the global significance of dementia, the implications for clinicians and policy makers include them recognizing the importance of family carer education and improving communication between family carers and formal care providers.
NASA Astrophysics Data System (ADS)
Oguri, Masamune; Schrabback, Tim; Jullo, Eric; Ota, Naomi; Kochanek, Christopher S.; Dai, Xinyu; Ofek, Eran O.; Richards, Gordon T.; Blandford, Roger D.; Falco, Emilio E.; Fohlmeister, Janine
2013-02-01
We present Hubble Space Telescope (HST) Advanced Camera for Surveys (ACS) and Wide Field Camera 3 (WFC3) observations of SDSS J1029+2623, a three-image quasar lens system produced by a foreground cluster at z = 0.584. Our strong lensing analysis reveals six additional multiply imaged galaxies in addition to the multiply imaged quasar. We confirm the complex nature of the mass distribution of the lensing cluster, with a bimodal dark matter distribution which deviates from the Chandra X-ray surface brightness distribution. The Einstein radius of the lensing cluster is estimated to be θE = 15.2 ± 0.5 arcsec for the quasar redshift of z = 2.197. We derive a radial mass distribution from the combination of strong lensing, HST/ACS weak lensing and Subaru/Suprime-cam weak lensing analysis results, finding a best-fitting virial mass of Mvir = 1.55+ 0.40- 0.35 × 1014 h- 1 M⊙ and a concentration parameter of cvir = 25.7+ 14.1- 7.5. The lensing mass estimate at the outer radius is smaller than the X-ray mass estimate by a factor of ˜2. We ascribe this large mass discrepancy to shock heating of the intracluster gas during a merger, which is also suggested by the complex mass and gas distributions and the high value of the concentration parameter. In the HST image, we also identify a probable galaxy, GX, in the vicinity of the faintest quasar image C. In strong lens models, the inclusion of GX explains the anomalous flux ratios between the quasar images. The morphology of the highly elongated quasar host galaxy is also well reproduced. The best-fitting model suggests large total magnifications of 30 for the quasar and 35 for the quasar host galaxy, and has an AB time delay consistent with the measured value.
Liu, Jun; Wang, Zhuo-Ren; Li, Chuang; Bian, Yin-Bing; Xiao, Yang
2015-06-01
Genetic diversity among 89 Chinese Lentinula edodes cultivars was analyzed by inter-simple sequence repeat (ISSR) and sequence-related amplified polymorphism (SRAP) markers. A 123 out of 126 ISSR loci (97.62%) and 108 out of 129 SRAP loci (83.73%) were polymorphic between two or more strains. A dendrogram constructed by cluster analysis based on the ISSR and SRAP markers separated the L. edodes strains into two major groups, of which group B was further divided into five subgroups. Clustering results also showed a positive correlation with the main agronomic traits of the strains, and that strains with similar traits clustered together into the same groups or subgroups in most cases. The average coefficient of pairwise genetic similarity was 0.820 (range: 0.576-0.988). Compared to the wild strains, Chinese L. edodes cultivars indicated a lower level of genetic diversity. Two preliminary core collections of L. edodes, Core1 and Core2, were established based on the ISSR and SRAP data, respectively. Core1 was constructed by the advanced M (maximization) strategy using the PowerCore version 1.0 software and contained 21 strains, whereas Core2 was created by the allele preferred sampling strategy using the cluster method and contained 18 strains. Both core collections were highly representative of the genetic diversity of the original germplasm, as confirmed by the values of Na (observed number of alleles), Ne (effective number of alleles), H (Nei's gene diversity) and I (Shannon's information index), as well as results of principal coordinate analysis. The loci retention ratio of Core1 (99.61%) was higher than that of Core2 (97.65%). Moreover, Core1 contained strains with more types of agronomic traits than those in Core2. This study builds the basis for further effective protection, management and use of L. edodes germplasm resource. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The Impact of Non-Thermal Processes in the Intracluster Medium on Cosmological Cluster Observables
NASA Astrophysics Data System (ADS)
Battaglia, Nicholas Ambrose
In this thesis we describe the generation and analysis of hydrodynamical simulations of galaxy clusters and their intracluster medium (ICM), using large cosmological boxes to generate large samples, in conjunction with individual cluster computations. The main focus is the exploration of the non-thermal processes in the ICM and the effect they have on the interpretation of observations used for cosmological constraints. We provide an introduction to the cosmological structure formation framework for our computations and an overview of the numerical simulations and observations of galaxy clusters. We explore the cluster magnetic field observables through radio relics, extended entities in the ICM characterized by their of diffuse radio emission. We show that statistical quantities such as radio relic luminosity functions and rotation measure power spectra are sensitive to magnetic field models. The spectral index of the radio relic emission provides information on structure formation shocks, e.g., on their Mach number. We develop a coarse grained stochastic model of active galaxy nucleus (AGN) feed-back in clusters and show the impact of such inhomogeneous feedback on the thermal pressure profile. We explore variations in the pressure profile as a function of cluster mass, redshift, and radius and provide a constrained fitting function for this profile. We measure the degree of the non-thermal pressure in the gas from internal cluster bulk motions and show it has an impact on the slope and scatter of the Sunyaev-Zel'dovich (SZ) scaling relation. We also find that the gross shape of the ICM, as characterized by scaled moment of inertia tensors, affects the SZ scaling relation. We demonstrate that the shape and the amplitude of the SZ angular power spectrum is sensitive to AGN feedback, and this affects the cosmological parameters determined from high resolution ACT and SPT cosmic microwave background data. We compare analytic, semi-analytic, and simulation-based methods for calculating the SZ power spectrum, and characterize their differences. All the methods must rely, one way or another, on high resolution large-scale hydrodynamical simulations with varying assumptions for modelling the gas of the sort presented here. We show how our results can be used to interpret the latest ACT and SPT power spectrum results. We provide an outlook for the future, describing follow-up work we are undertaking to further advance the theory of cluster science.
The Formation and Early Evolution of Embedded Massive Star Clusters
NASA Astrophysics Data System (ADS)
Barnes, Peter
We propose to combine Spitzer, WISE, Herschel, and other archival spacecraft data with an existing ground- and space-based mm-wave to near-IR survey of molecular clouds over a large portion of the Milky Way, in order to systematically study the formation and early evolution of massive stars and star clusters, and provide new observational calibrations for a theoretical paradigm of this key astrophysical problem. Central Objectives: The Galactic Census of High- and Medium-mass Protostars (CHaMP) is a large, unbiased, uniform, and panchromatic survey of massive star and cluster formation and early evolution, covering 20°x6° of the Galactic Plane. Its uniqueness lies in the comprehensive molecular spectroscopy of 303 massive dense clumps, which have also been included in several archival spacecraft surveys. Our objective is a systematic demographic analysis of massive star and cluster formation, one which has not been possible without knowledge of our CHaMP cloud sample, including all clouds with embedded clusters as well as those that have not yet formed massive stars. For proto-clusters deeply embedded within dense molecular clouds, analysis of these space-based data will: 1. Yield a complete census of Young Stellar Objects in each cluster. 2. Allow systematic measurements of embedded cluster properties: spectral energy distributions, luminosity functions, protostellar and disk fractions, and how these vary with cluster mass, age, and density. Combined with other, similarly complete and unbiased infrared and mm data, CHaMP's goals include: 3. A detailed comparison of the embedded stellar populations with their natal dense gas to derive extinction maps, star formation efficiencies and feedback effects, and the kinematics, physics, and chemistry of the gas in and around the clusters. 4. Tying the demographics, age spreads, and timescales of the clusters, based on pre-Main Sequence evolution, to that of the dense gas clumps and Giant Molecular Clouds. 5. A measurement of the local star formation rate per gas mass surface density in the Milky Way, as well as examining arm versus interarm dependencies. Methods and Techniques: We will primarily use archival cryogenic-Spitzer, WISE, and Herschel data, and support this with existing data from ground- and space-based facilities, to conduct a comprehensive assay of critical metrics (as above) and provide observational calibration of theoretical models over the entire massive star formation process. The mm-wave molecular maps of 303 dense gas clumps in multiple species, comprising all the gas above a column density limit of 100 Msun/pc^2, are already inhand. We have also surveyed the embedded stellar content of these clumps, down to subsolar masses, in the near-infrared J, H, and K bands and with deep Warm Spitzer data. Relevance to NASA programs: Analysis to date of the space- and ground-based data has yielded several new insights into evolutionary timescales and the chemical & energy evolution of clumps during the cluster formation process. Investigations as described in this proposal will yield new demographic insights on how the properties and evolution of molecular clouds relate to the properties of massive stars and clusters that form within them, and significantly enhance the science return from these spacecraft missions. The large number of resulting data products are already being made publicly available to the astronomical community, providing crucial information for future NASA science targets. This research will be performed within the framework of a broad international collaboration spanning four continents. This ambitious but practical program will therefore maximise the science payoff from these archival data sets, provide enhanced legacy data for more advanced studies with the next generation of ground- and space-based instruments such as JWST, and open up several new windows into the discovery space of Galactic star formation & interstellar medium studies.
CytoCluster: A Cytoscape Plugin for Cluster Analysis and Visualization of Biological Networks.
Li, Min; Li, Dongyan; Tang, Yu; Wu, Fangxiang; Wang, Jianxin
2017-08-31
Nowadays, cluster analysis of biological networks has become one of the most important approaches to identifying functional modules as well as predicting protein complexes and network biomarkers. Furthermore, the visualization of clustering results is crucial to display the structure of biological networks. Here we present CytoCluster, a cytoscape plugin integrating six clustering algorithms, HC-PIN (Hierarchical Clustering algorithm in Protein Interaction Networks), OH-PIN (identifying Overlapping and Hierarchical modules in Protein Interaction Networks), IPCA (Identifying Protein Complex Algorithm), ClusterONE (Clustering with Overlapping Neighborhood Expansion), DCU (Detecting Complexes based on Uncertain graph model), IPC-MCE (Identifying Protein Complexes based on Maximal Complex Extension), and BinGO (the Biological networks Gene Ontology) function. Users can select different clustering algorithms according to their requirements. The main function of these six clustering algorithms is to detect protein complexes or functional modules. In addition, BinGO is used to determine which Gene Ontology (GO) categories are statistically overrepresented in a set of genes or a subgraph of a biological network. CytoCluster can be easily expanded, so that more clustering algorithms and functions can be added to this plugin. Since it was created in July 2013, CytoCluster has been downloaded more than 9700 times in the Cytoscape App store and has already been applied to the analysis of different biological networks. CytoCluster is available from http://apps.cytoscape.org/apps/cytocluster.
CytoCluster: A Cytoscape Plugin for Cluster Analysis and Visualization of Biological Networks
Li, Min; Li, Dongyan; Tang, Yu; Wang, Jianxin
2017-01-01
Nowadays, cluster analysis of biological networks has become one of the most important approaches to identifying functional modules as well as predicting protein complexes and network biomarkers. Furthermore, the visualization of clustering results is crucial to display the structure of biological networks. Here we present CytoCluster, a cytoscape plugin integrating six clustering algorithms, HC-PIN (Hierarchical Clustering algorithm in Protein Interaction Networks), OH-PIN (identifying Overlapping and Hierarchical modules in Protein Interaction Networks), IPCA (Identifying Protein Complex Algorithm), ClusterONE (Clustering with Overlapping Neighborhood Expansion), DCU (Detecting Complexes based on Uncertain graph model), IPC-MCE (Identifying Protein Complexes based on Maximal Complex Extension), and BinGO (the Biological networks Gene Ontology) function. Users can select different clustering algorithms according to their requirements. The main function of these six clustering algorithms is to detect protein complexes or functional modules. In addition, BinGO is used to determine which Gene Ontology (GO) categories are statistically overrepresented in a set of genes or a subgraph of a biological network. CytoCluster can be easily expanded, so that more clustering algorithms and functions can be added to this plugin. Since it was created in July 2013, CytoCluster has been downloaded more than 9700 times in the Cytoscape App store and has already been applied to the analysis of different biological networks. CytoCluster is available from http://apps.cytoscape.org/apps/cytocluster. PMID:28858211
Acioli, Paulo H.; Jellinek, Julius
2017-07-14
A theoretical/computational description and analysis of the spectra of electron binding energies of Al 12 -, Al 13 - and Al 12Ni- clusters, which differ in size and/or composition by a single atom yet possess strikingly different measured photoelectron spectra, is presented. It is shown that the measured spectra can not only be reproduced computationally with quantitative fidelity – this is achieved through a combination of state-of-the-art density functional theory with a highly accurate scheme for conversion of the Kohn-Sham eigenenergies into electron binding energies – but also explained in terms of the effects of size, structure/symmetry and composition. Furthermore,more » a new methodology is developed and applied that provides for disentanglement and differential assignment of the separate roles played by size, structure/symmetry and composition in defining the observed differences in the measured spectra. The methodology is general and applicable to any finite system, homogeneous or heterogeneous. Finally, we project that in combination with advances in synthesis techniques this methodology will become an indispensable computation-based aid in the design of controlled synthesis protocols for manufacture of nanosystems and nanodevices with precisely desired electronic and other characteristics.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Acioli, Paulo H.; Jellinek, Julius
A theoretical/computational description and analysis of the spectra of electron binding energies of Al 12 -, Al 13 - and Al 12Ni- clusters, which differ in size and/or composition by a single atom yet possess strikingly different measured photoelectron spectra, is presented. It is shown that the measured spectra can not only be reproduced computationally with quantitative fidelity – this is achieved through a combination of state-of-the-art density functional theory with a highly accurate scheme for conversion of the Kohn-Sham eigenenergies into electron binding energies – but also explained in terms of the effects of size, structure/symmetry and composition. Furthermore,more » a new methodology is developed and applied that provides for disentanglement and differential assignment of the separate roles played by size, structure/symmetry and composition in defining the observed differences in the measured spectra. The methodology is general and applicable to any finite system, homogeneous or heterogeneous. Finally, we project that in combination with advances in synthesis techniques this methodology will become an indispensable computation-based aid in the design of controlled synthesis protocols for manufacture of nanosystems and nanodevices with precisely desired electronic and other characteristics.« less
Arnold Anteraper, Sheeba; Guell, Xavier; D'Mello, Anila; Joshi, Neha; Whitfield-Gabrieli, Susan; Joshi, Gagan
2018-06-13
To examine the resting-state functional-connectivity (RsFc) in young adults with high-functioning autism spectrum disorder (HF-ASD) using state-of-the-art fMRI data acquisition and analysis techniques. Simultaneous multi-slice, high temporal resolution fMRI acquisition; unbiased whole-brain connectome-wide multivariate pattern analysis (MVPA) techniques for assessing RsFc; and post-hoc whole-brain seed-to-voxel analyses using MVPA results as seeds. MVPA revealed two clusters of abnormal connectivity in the cerebellum. Whole-brain seed-based functional connectivity analyses informed by MVPA-derived clusters showed significant under connectivity between the cerebellum and social, emotional, and language brain regions in the HF-ASD group compared to healthy controls. The results we report are coherent with existing structural, functional, and RsFc literature in autism, extend previous literature reporting cerebellar abnormalities in the neuropathology of autism, and highlight the cerebellum as a potential target for therapeutic, diagnostic, predictive, and prognostic developments in ASD. The description of functional connectivity abnormalities using whole-brain, data-driven analyses as reported in the present study may crucially advance the development of ASD biomarkers, targets for therapeutic interventions, and neural predictors for measuring treatment response.
Periorbital melasma: Hierarchical cluster analysis of clinical features in Asian patients.
Jung, Y S; Bae, J M; Kim, B J; Kang, J-S; Cho, S B
2017-11-01
Studies have shown melasma lesions to be distributed across the face in centrofacial, malar, and mandibular patterns. Meanwhile, however, melasma lesions of the periorbital area have yet to be thoroughly described. We analyzed normal and ultraviolet light-exposed photographs of patients with melasma. The periorbital melasma lesions were measured according to anatomical reference points and a hierarchical cluster analysis was performed. The periorbital melasma lesions showed clinical features of fine and homogenous melasma pigmentation, involving both the upper and lower eyelids that extended to other anatomical sites with a darker and coarser appearance. The hierarchical cluster analysis indicated that patients with periorbital melasma can be categorized into two clusters according to the surface anatomy of the face. Significant differences between cluster 1 and cluster 2 were found in lateral distance and inferolateral distance, but not in medial distance and superior distance. Comparing the two clusters, patients in cluster 2 were found to be significantly older and more commonly accompanied by melasma lesions of the temple and medial cheek. Our hierarchical cluster analysis of periorbital melasma lesions demonstrated that Asian patients with periorbital melasma can be categorized into two clusters according to the surface anatomy of the face. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Abramyan, Tigran M; Snyder, James A; Thyparambil, Aby A; Stuart, Steven J; Latour, Robert A
2016-08-05
Clustering methods have been widely used to group together similar conformational states from molecular simulations of biomolecules in solution. For applications such as the interaction of a protein with a surface, the orientation of the protein relative to the surface is also an important clustering parameter because of its potential effect on adsorbed-state bioactivity. This study presents cluster analysis methods that are specifically designed for systems where both molecular orientation and conformation are important, and the methods are demonstrated using test cases of adsorbed proteins for validation. Additionally, because cluster analysis can be a very subjective process, an objective procedure for identifying both the optimal number of clusters and the best clustering algorithm to be applied to analyze a given dataset is presented. The method is demonstrated for several agglomerative hierarchical clustering algorithms used in conjunction with three cluster validation techniques. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
2013-01-01
Background More effective methods are needed to implement evidence-based findings into practice. The Advancing Recovery Framework offers a multi-level approach to evidence-based practice implementation by aligning purchasing and regulatory policies at the payer level with organizational change strategies at the organizational level. Methods The Advancing Recovery Buprenorphine Implementation Study is a cluster-randomized controlled trial designed to increase use of the evidence-based practice buprenorphine medication to treat opiate addiction. Ohio Alcohol, Drug Addiction, and Mental Health Services Boards (ADAMHS), who are payers, and their addiction treatment organizations were recruited for a trial to assess the effects of payer and treatment organization changes (using the Advancing Recovery Framework) versus treatment organization changes alone on the use of buprenorphine. A matched-pair randomization, based on county characteristics, was applied, resulting in seven county ADAMHS boards and twenty-five treatment organizations in each arm. Opioid dependent patients are nested within cluster (treatment organization), and treatment organization clusters are nested within ADAMHS county board. The primary outcome is the percentage of individuals with an opioid dependence diagnosis who use buprenorphine during the 24-month intervention period and the 12-month sustainability period. The trial is currently in the baseline data collection stage. Discussion Although addiction treatment providers are under increasing pressure to implement evidence-based practices that have been proven to improve patient outcomes, adoption of these practices lags, compared to other areas of healthcare. Reasons frequently cited for the slow adoption of EBPs in addiction treatment include, regulatory issues, staff, or client resistance and lack of resources. Yet the way addiction treatment is funded, the payer’s role—has not received a lot of attention in research on EBP adoption. This research is unique because it investigates the role of payers in evidence-based practice implementation using a randomized controlled design instead of case examples. The testing of the Advancing Recovery Framework is designed to broaden the understanding of the impact payers have on evidence-based practice (EBP) adoption. Trial registration http://NCT01702142 (ClinicalTrials.gov registry, USA) PMID:23663749
The Use of Cluster Analysis in Typological Research on Community College Students
ERIC Educational Resources Information Center
Bahr, Peter Riley; Bielby, Rob; House, Emily
2011-01-01
One useful and increasingly popular method of classifying students is known commonly as cluster analysis. The variety of techniques that comprise the cluster analytic family are intended to sort observations (for example, students) within a data set into subsets (clusters) that share similar characteristics and differ in meaningful ways from other…
Integrating Data Clustering and Visualization for the Analysis of 3D Gene Expression Data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Data Analysis and Visualization; nternational Research Training Group ``Visualization of Large and Unstructured Data Sets,'' University of Kaiserslautern, Germany; Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
2008-05-12
The recent development of methods for extracting precise measurements of spatial gene expression patterns from three-dimensional (3D) image data opens the way for new analyses of the complex gene regulatory networks controlling animal development. We present an integrated visualization and analysis framework that supports user-guided data clustering to aid exploration of these new complex datasets. The interplay of data visualization and clustering-based data classification leads to improved visualization and enables a more detailed analysis than previously possible. We discuss (i) integration of data clustering and visualization into one framework; (ii) application of data clustering to 3D gene expression data; (iii)more » evaluation of the number of clusters k in the context of 3D gene expression clustering; and (iv) improvement of overall analysis quality via dedicated post-processing of clustering results based on visualization. We discuss the use of this framework to objectively define spatial pattern boundaries and temporal profiles of genes and to analyze how mRNA patterns are controlled by their regulatory transcription factors.« less
The 2013 Clusters, Nanocrystals & Nanostructures Gordon Research Conference/Gordon Research Seminar
DOE Office of Scientific and Technical Information (OSTI.GOV)
Krauss, Todd D.
The fundamental properties of small particles and their potential for groundbreaking applications are among the most exciting areas of study in modern physics, chemistry, and materials science. The Clusters, Nanocrystals & Nanostructures Gordon ResearchConference and Gordon Research Seminar synthesize contributions from these inter-related fields that reflect the pivotal role of nano-particles at the interface between these disciplines. Size-dependent optical, electronic, magnetic and catalytic properties offer prospects for applications in many fields, and possible solutions for many of the grand challenges facing energy generation, consumption, delivery, and storage in the 21st century. The goal of the 2013 Clusters, Nanocrystals & Nanostructuresmore » Gordon Research Conference and Gordon Research Seminar is to continue the historical interdisciplinary tradition of this series and discuss the most recent advances, basic scientific questions, and emerging applications of clusters, nanocrystals, and nanostructures. The Clusters, Nanocrystals & Nanostructures GRC/GRS traditionally brings together the leading scientific groups that have made significant recent advances in one or more fundamental nanoscience or nanotechnology areas. Broad interests of the DOE BES and Solar Photochemistry Program addressed by this meeting include the areas of solar energy to fuels conversion, new photovoltaic systems, fundamental characterization of nanomaterials, magnetism, catalysis, and quantum physics. The vast majority of speakers and attendees will address either directly the topic of nanotechnology for photoinduced charge transfer, charge transport, and catalysis, or will have made significant contributions to related areas that will impact these fields indirectly. These topics have direct relevance to the mission of the DOE BES since it is this cutting-edge basic science that underpins our energy future.« less
The Transcriptome of the Reference Potato Genome Solanum tuberosum Group Phureja Clone DM1-3 516R44
Massa, Alicia N.; Childs, Kevin L.; Lin, Haining; Bryan, Glenn J.; Giuliano, Giovanni; Buell, C. Robin
2011-01-01
Advances in molecular breeding in potato have been limited by its complex biological system, which includes vegetative propagation, autotetraploidy, and extreme heterozygosity. The availability of the potato genome and accompanying gene complement with corresponding gene structure, location, and functional annotation are powerful resources for understanding this complex plant and advancing molecular breeding efforts. Here, we report a reference for the potato transcriptome using 32 tissues and growth conditions from the doubled monoploid Solanum tuberosum Group Phureja clone DM1-3 516R44 for which a genome sequence is available. Analysis of greater than 550 million RNA-Seq reads permitted the detection and quantification of expression levels of over 22,000 genes. Hierarchical clustering and principal component analyses captured the biological variability that accounts for gene expression differences among tissues suggesting tissue-specific gene expression, and genes with tissue or condition restricted expression. Using gene co-expression network analysis, we identified 18 gene modules that represent tissue-specific transcriptional networks of major potato organs and developmental stages. This information provides a powerful resource for potato research as well as studies on other members of the Solanaceae family. PMID:22046362
Advances in molecular quantum chemistry contained in the Q-Chem 4 program package
NASA Astrophysics Data System (ADS)
Shao, Yihan; Gan, Zhengting; Epifanovsky, Evgeny; Gilbert, Andrew T. B.; Wormit, Michael; Kussmann, Joerg; Lange, Adrian W.; Behn, Andrew; Deng, Jia; Feng, Xintian; Ghosh, Debashree; Goldey, Matthew; Horn, Paul R.; Jacobson, Leif D.; Kaliman, Ilya; Khaliullin, Rustam Z.; Kuś, Tomasz; Landau, Arie; Liu, Jie; Proynov, Emil I.; Rhee, Young Min; Richard, Ryan M.; Rohrdanz, Mary A.; Steele, Ryan P.; Sundstrom, Eric J.; Woodcock, H. Lee, III; Zimmerman, Paul M.; Zuev, Dmitry; Albrecht, Ben; Alguire, Ethan; Austin, Brian; Beran, Gregory J. O.; Bernard, Yves A.; Berquist, Eric; Brandhorst, Kai; Bravaya, Ksenia B.; Brown, Shawn T.; Casanova, David; Chang, Chun-Min; Chen, Yunqing; Chien, Siu Hung; Closser, Kristina D.; Crittenden, Deborah L.; Diedenhofen, Michael; DiStasio, Robert A., Jr.; Do, Hainam; Dutoi, Anthony D.; Edgar, Richard G.; Fatehi, Shervin; Fusti-Molnar, Laszlo; Ghysels, An; Golubeva-Zadorozhnaya, Anna; Gomes, Joseph; Hanson-Heine, Magnus W. D.; Harbach, Philipp H. P.; Hauser, Andreas W.; Hohenstein, Edward G.; Holden, Zachary C.; Jagau, Thomas-C.; Ji, Hyunjun; Kaduk, Benjamin; Khistyaev, Kirill; Kim, Jaehoon; Kim, Jihan; King, Rollin A.; Klunzinger, Phil; Kosenkov, Dmytro; Kowalczyk, Tim; Krauter, Caroline M.; Lao, Ka Un; Laurent, Adèle D.; Lawler, Keith V.; Levchenko, Sergey V.; Lin, Ching Yeh; Liu, Fenglai; Livshits, Ester; Lochan, Rohini C.; Luenser, Arne; Manohar, Prashant; Manzer, Samuel F.; Mao, Shan-Ping; Mardirossian, Narbe; Marenich, Aleksandr V.; Maurer, Simon A.; Mayhall, Nicholas J.; Neuscamman, Eric; Oana, C. Melania; Olivares-Amaya, Roberto; O'Neill, Darragh P.; Parkhill, John A.; Perrine, Trilisa M.; Peverati, Roberto; Prociuk, Alexander; Rehn, Dirk R.; Rosta, Edina; Russ, Nicholas J.; Sharada, Shaama M.; Sharma, Sandeep; Small, David W.; Sodt, Alexander; Stein, Tamar; Stück, David; Su, Yu-Chuan; Thom, Alex J. W.; Tsuchimochi, Takashi; Vanovschi, Vitalii; Vogt, Leslie; Vydrov, Oleg; Wang, Tao; Watson, Mark A.; Wenzel, Jan; White, Alec; Williams, Christopher F.; Yang, Jun; Yeganeh, Sina; Yost, Shane R.; You, Zhi-Qiang; Zhang, Igor Ying; Zhang, Xing; Zhao, Yan; Brooks, Bernard R.; Chan, Garnet K. L.; Chipman, Daniel M.; Cramer, Christopher J.; Goddard, William A., III; Gordon, Mark S.; Hehre, Warren J.; Klamt, Andreas; Schaefer, Henry F., III; Schmidt, Michael W.; Sherrill, C. David; Truhlar, Donald G.; Warshel, Arieh; Xu, Xin; Aspuru-Guzik, Alán; Baer, Roi; Bell, Alexis T.; Besley, Nicholas A.; Chai, Jeng-Da; Dreuw, Andreas; Dunietz, Barry D.; Furlani, Thomas R.; Gwaltney, Steven R.; Hsu, Chao-Ping; Jung, Yousung; Kong, Jing; Lambrecht, Daniel S.; Liang, WanZhen; Ochsenfeld, Christian; Rassolov, Vitaly A.; Slipchenko, Lyudmila V.; Subotnik, Joseph E.; Van Voorhis, Troy; Herbert, John M.; Krylov, Anna I.; Gill, Peter M. W.; Head-Gordon, Martin
2015-01-01
A summary of the technical advances that are incorporated in the fourth major release of the Q-Chem quantum chemistry program is provided, covering approximately the last seven years. These include developments in density functional theory methods and algorithms, nuclear magnetic resonance (NMR) property evaluation, coupled cluster and perturbation theories, methods for electronically excited and open-shell species, tools for treating extended environments, algorithms for walking on potential surfaces, analysis tools, energy and electron transfer modelling, parallel computing capabilities, and graphical user interfaces. In addition, a selection of example case studies that illustrate these capabilities is given. These include extensive benchmarks of the comparative accuracy of modern density functionals for bonded and non-bonded interactions, tests of attenuated second order Møller-Plesset (MP2) methods for intermolecular interactions, a variety of parallel performance benchmarks, and tests of the accuracy of implicit solvation models. Some specific chemical examples include calculations on the strongly correlated Cr2 dimer, exploring zeolite-catalysed ethane dehydrogenation, energy decomposition analysis of a charged ter-molecular complex arising from glycerol photoionisation, and natural transition orbitals for a Frenkel exciton state in a nine-unit model of a self-assembling nanotube.
SMMRNA: a database of small molecule modulators of RNA
Mehta, Ankita; Sonam, Surabhi; Gouri, Isha; Loharch, Saurabh; Sharma, Deepak K.; Parkesh, Raman
2014-01-01
We have developed SMMRNA, an interactive database, available at http://www.smmrna.org, with special focus on small molecule ligands targeting RNA. Currently, SMMRNA consists of ∼770 unique ligands along with structural images of RNA molecules. Each ligand in the SMMRNA contains information such as Kd, Ki, IC50, ΔTm, molecular weight (MW), hydrogen donor and acceptor count, XlogP, number of rotatable bonds, number of aromatic rings and 2D and 3D structures. These parameters can be explored using text search, advanced search, substructure and similarity-based analysis tools that are embedded in SMMRNA. A structure editor is provided for 3D visualization of ligands. Advance analysis can be performed using substructure and OpenBabel-based chemical similarity fingerprints. Upload facility for both RNA and ligands is also provided. The physicochemical properties of the ligands were further examined using OpenBabel descriptors, hierarchical clustering, binning partition and multidimensional scaling. We have also generated a 3D conformation database of ligands to support the structure and ligand-based screening. SMMRNA provides comprehensive resource for further design, development and refinement of small molecule modulators for selective targeting of RNA molecules. PMID:24163098
Simultaneous Two-Way Clustering of Multiple Correspondence Analysis
ERIC Educational Resources Information Center
Hwang, Heungsun; Dillon, William R.
2010-01-01
A 2-way clustering approach to multiple correspondence analysis is proposed to account for cluster-level heterogeneity of both respondents and variable categories in multivariate categorical data. Specifically, in the proposed method, multiple correspondence analysis is combined with k-means in a unified framework in which "k"-means is…
Cluster Analysis of Minnesota School Districts. A Research Report.
ERIC Educational Resources Information Center
Cleary, James
The term "cluster analysis" refers to a set of statistical methods that classify entities with similar profiles of scores on a number of measured dimensions, in order to create empirically based typologies. A 1980 Minnesota House Research Report employed cluster analysis to categorize school districts according to their relative mixtures…
Mao, J.; Fang, X.; Lan, Y.; Schimmelmann, A.; Mastalerz, Maria; Xu, L.; Schmidt-Rohr, K.
2010-01-01
We have used advanced and quantitative solid-state nuclear magnetic resonance (NMR) techniques to investigate structural changes in a series of type II kerogen samples from the New Albany Shale across a range of maturity (vitrinite reflectance R0 from 0.29% to 1.27%). Specific functional groups such as CH3, CH2, alkyl CH, aromatic CH, aromatic C-O, and other nonprotonated aromatics, as well as "oil prone" and "gas prone" carbons, have been quantified by 13C NMR; atomic H/C and O/C ratios calculated from the NMR data agree with elemental analysis. Relationships between NMR structural parameters and vitrinite reflectance, a proxy for thermal maturity, were evaluated. The aromatic cluster size is probed in terms of the fraction of aromatic carbons that are protonated (???30%) and the average distance of aromatic C from the nearest protons in long-range H-C dephasing, both of which do not increase much with maturation, in spite of a great increase in aromaticity. The aromatic clusters in the most mature sample consist of ???30 carbons, and of ???20 carbons in the least mature samples. Proof of many links between alkyl chains and aromatic rings is provided by short-range and long-range 1H-13C correlation NMR. The alkyl segments provide most H in the samples; even at a carbon aromaticity of 83%, the fraction of aromatic H is only 38%. While aromaticity increases with thermal maturity, most other NMR structural parameters, including the aromatic C-O fractions, decrease. Aromaticity is confirmed as an excellent NMR structural parameter for assessing thermal maturity. In this series of samples, thermal maturation mostly increases aromaticity by reducing the length of the alkyl chains attached to the aromatic cores, not by pronounced growth of the size of the fused aromatic ring clusters. ?? 2010 Elsevier Ltd. All rights reserved.
NASA Technical Reports Server (NTRS)
Hasler, Nicole; Bulbul, Esra; Bonamente, Massimiliano; Carlstrom, John E.; Culverhouse, Thomas L.; Gralla, Megan; Greer, Christopher; Lamb, James W.; Hawkins, David; Hennessy, Ryan;
2012-01-01
We perform a joint analysis of X-ray and Sunyaev-Zel'dovich effect data using an analytic model that describes the gas properties of galaxy clusters. The joint analysis allows the measurement of the cluster gas mass fraction profile and Hubble constant independent of cosmological parameters. Weak cosmological priors are used to calculate the overdensity radius within which the gas mass fractions are reported. Such an analysis can provide direct constraints on the evolution of the cluster gas mass fraction with redshift. We validate the model and the joint analysis on high signal-to-noise data from the Chandra X-ray Observatory and the Sunyaev-Zel'dovich Array for two clusters, A2631 and A2204.
Description and typology of intensive Chios dairy sheep farms in Greece.
Gelasakis, A I; Valergakis, G E; Arsenos, G; Banos, G
2012-06-01
The aim was to assess the intensified dairy sheep farming systems of the Chios breed in Greece, establishing a typology that may properly describe and characterize them. The study included the total of the 66 farms of the Chios sheep breeders' cooperative Macedonia. Data were collected using a structured direct questionnaire for in-depth interviews, including questions properly selected to obtain a general description of farm characteristics and overall management practices. A multivariate statistical analysis was used on the data to obtain the most appropriate typology. Initially, principal component analysis was used to produce uncorrelated variables (principal components), which would be used for the consecutive cluster analysis. The number of clusters was decided using hierarchical cluster analysis, whereas, the farms were allocated in 4 clusters using k-means cluster analysis. The identified clusters were described and afterward compared using one-way ANOVA or a chi-squared test. The main differences were evident on land availability and use, facility and equipment availability and type, expansion rates, and application of preventive flock health programs. In general, cluster 1 included newly established, intensive, well-equipped, specialized farms and cluster 2 included well-established farms with balanced sheep and feed/crop production. In cluster 3 were assigned small flock farms focusing more on arable crops than on sheep farming with a tendency to evolve toward cluster 2, whereas cluster 4 included farms representing a rather conservative form of Chios sheep breeding with low/intermediate inputs and choosing not to focus on feed/crop production. In the studied set of farms, 4 different farmer attitudes were evident: 1) farming disrupts sheep breeding; feed should be purchased and economies of scale will decrease costs (mainly cluster 1), 2) only exercise/pasture land is necessary; at least part of the feed (pasture) must be home-grown to decrease costs (clusters 1 and 4), 3) providing pasture to sheep is essential; on-farm feed production decreases costs (mainly cluster 3), and 4) large-scale farming (feed production and cash crops) does not disrupt sheep breeding; all feed must be produced on-farm to decrease costs (mainly cluster 3). Conducting a profitability analysis among different clusters, exploring and discovering the most beneficial levels of intensified management and capital investment should now be considered. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Vigre, Håkan; Domingues, Ana Rita Coutinho Calado; Pedersen, Ulrik Bo; Hald, Tine
2016-03-01
The aim of the project as the cluster analysis was to in part to develop a generic structured quantitative microbiological risk assessment (QMRA) model of human salmonellosis due to pork consumption in EU member states (MSs), and the objective of the cluster analysis was to group the EU MSs according to the relative contribution of different pathways of Salmonella in the farm-to-consumption chain of pork products. In the development of the model, by selecting a case study MS from each cluster the model was developed to represent different aspects of pig production, pork production, and consumption of pork products across EU states. The objective of the cluster analysis was to aggregate MSs into groups of countries with similar importance of different pathways of Salmonella in the farm-to-consumption chain using available, and where possible, universal register data related to the pork production and consumption in each country. Based on MS-specific information about distribution of (i) small and large farms, (ii) small and large slaughterhouses, (iii) amount of pork meat consumed, and (iv) amount of sausages consumed we used nonhierarchical and hierarchical cluster analysis to group the MSs. The cluster solutions were validated internally using statistic measures and externally by comparing the clustered MSs with an estimated human incidence of salmonellosis due to pork products in the MSs. Finally, each cluster was characterized qualitatively using the centroids of the clusters. © 2016 Society for Risk Analysis.
The Scale Sizes of Globular Clusters: Tidal Limits, Evolution, and the Outer Halo
NASA Astrophysics Data System (ADS)
Harris, William
2011-10-01
The physical factors that determine the linear sizes of massive star clusters are not well understood. Their scale sizes were long thought to be governed by the tidal field of the parent galaxy, but major questions are now emerging. Globular clusters, for example, have mean sizes nearly independent of location in the halo. Paradoxically, the recently discovered "anomalous extended clusters" in M31 and elsewhere have scale sizes that fit much better with tidal theory, but they are puzzlingly rare. Lastly, the persistent size difference between metal-poor and metal-rich clusters still lacks a quantitative explanation. Many aspects of these observations call for better modelling of dynamical evolution in the outskirts of clusters, and also their conditions of formation including the early rapid mass loss phase of protoclusters. A new set of accurate measurements of scale sizes and structural parameters, for a large and homogeneous set of globular clusters, would represent a major advance in this subject. We propose to carry out a {WFC3+ACS} imaging survey of the globular clusters in the supergiant Virgo elliptical M87 to cover the complete run of the halo. M87 is an optimum target system because of its huge numbers of clusters and HST's ability to resolve the cluster profiles accurately. We will derive cluster effective radii, central concentrations, luminosities, and colors for more than 4000 clusters using PSF-convolved King-model profile fitting. In parallel, we are developing theoretical tools to model the expected distribution of cluster sizes versus galactocentric distance as functions of cluster mass, concentration, and orbital anisotropy.
Statistical Significance for Hierarchical Clustering
Kimes, Patrick K.; Liu, Yufeng; Hayes, D. Neil; Marron, J. S.
2017-01-01
Summary Cluster analysis has proved to be an invaluable tool for the exploratory and unsupervised analysis of high dimensional datasets. Among methods for clustering, hierarchical approaches have enjoyed substantial popularity in genomics and other fields for their ability to simultaneously uncover multiple layers of clustering structure. A critical and challenging question in cluster analysis is whether the identified clusters represent important underlying structure or are artifacts of natural sampling variation. Few approaches have been proposed for addressing this problem in the context of hierarchical clustering, for which the problem is further complicated by the natural tree structure of the partition, and the multiplicity of tests required to parse the layers of nested clusters. In this paper, we propose a Monte Carlo based approach for testing statistical significance in hierarchical clustering which addresses these issues. The approach is implemented as a sequential testing procedure guaranteeing control of the family-wise error rate. Theoretical justification is provided for our approach, and its power to detect true clustering structure is illustrated through several simulation studies and applications to two cancer gene expression datasets. PMID:28099990
The detection methods of dynamic objects
NASA Astrophysics Data System (ADS)
Knyazev, N. L.; Denisova, L. A.
2018-01-01
The article deals with the application of cluster analysis methods for solving the task of aircraft detection on the basis of distribution of navigation parameters selection into groups (clusters). The modified method of cluster analysis for search and detection of objects and then iterative combining in clusters with the subsequent count of their quantity for increase in accuracy of the aircraft detection have been suggested. The course of the method operation and the features of implementation have been considered. In the conclusion the noted efficiency of the offered method for exact cluster analysis for finding targets has been shown.
Latent class analysis of diagnostic science assessment data using Bayesian networks
NASA Astrophysics Data System (ADS)
Steedle, Jeffrey Thomas
2008-10-01
Diagnostic science assessments seek to draw inferences about student understanding by eliciting evidence about the mental models that underlie students' reasoning about physical systems. Measurement techniques for analyzing data from such assessments embody one of two contrasting assessment programs: learning progressions and facet-based assessments. Learning progressions assume that students have coherent theories that they apply systematically across different problem contexts. In contrast, the facet approach makes no such assumption, so students should not be expected to reason systematically across different problem contexts. A systematic comparison of these two approaches is of great practical value to assessment programs such as the National Assessment of Educational Progress as they seek to incorporate small clusters of related items in their tests for the purpose of measuring depth of understanding. This dissertation describes an investigation comparing learning progression and facet models. Data comprised student responses to small clusters of multiple-choice diagnostic science items focusing on narrow aspects of understanding of Newtonian mechanics. Latent class analysis was employed using Bayesian networks in order to model the relationship between students' science understanding and item responses. Separate models reflecting the assumptions of the learning progression and facet approaches were fit to the data. The technical qualities of inferences about student understanding resulting from the two models were compared in order to determine if either modeling approach was more appropriate. Specifically, models were compared on model-data fit, diagnostic reliability, diagnostic certainty, and predictive accuracy. In addition, the effects of test length were evaluated for both models in order to inform the number of items required to obtain adequately reliable latent class diagnoses. Lastly, changes in student understanding over time were studied with a longitudinal model in order to provide educators and curriculum developers with a sense of how students advance in understanding over the course of instruction. Results indicated that expected student response patterns rarely reflected the assumptions of the learning progression approach. That is, students tended not to systematically apply a coherent set of ideas across different problem contexts. Even those students expected to express scientifically-accurate understanding had substantial probabilities of reporting certain problematic ideas. The learning progression models failed to make as many substantively-meaningful distinctions among students as the facet models. In statistical comparisons, model-data fit was better for the facet model, but the models were quite comparable on all other statistical criteria. Studying the effects of test length revealed that approximately 8 items are needed to obtain adequate diagnostic certainty, but more items are needed to obtain adequate diagnostic reliability. The longitudinal analysis demonstrated that students either advance in their understanding (i.e., switch to the more advanced latent class) over a short period of instruction or stay at the same level. There was no significant relationship between the probability of changing latent classes and time between testing occasions. In all, this study is valuable because it provides evidence informing decisions about modeling and reporting on student understanding, it assesses the quality of measurement available from short clusters of diagnostic multiple-choice items, and it provides educators with knowledge of the paths that student may take as they advance from novice to expert understanding over the course of instruction.
Cluster analysis of particulate matter (PM10) and black carbon (BC) concentrations
NASA Astrophysics Data System (ADS)
Žibert, Janez; Pražnikar, Jure
2012-09-01
The monitoring of air-pollution constituents like particulate matter (PM10) and black carbon (BC) can provide information about air quality and the dynamics of emissions. Air quality depends on natural and anthropogenic sources of emissions as well as the weather conditions. For a one-year period the diurnal concentrations of PM10 and BC in the Port of Koper were analysed by clustering days into similar groups according to the similarity of the BC and PM10 hourly derived day-profiles without any prior assumptions about working and non-working days, weather conditions or hot and cold seasons. The analysis was performed by using k-means clustering with the squared Euclidean distance as the similarity measure. The analysis showed that 10 clusters in the BC case produced 3 clusters with just one member day and 7 clusters that encompasses more than one day with similar BC profiles. Similar results were found in the PM10 case, where one cluster has a single-member day, while 7 clusters contain several member days. The clustering analysis revealed that the clusters with less pronounced bimodal patterns and low hourly and average daily concentrations for both types of measurements include the most days in the one-year analysis. A typical day profile of the BC measurements includes a bimodal pattern with morning and evening peaks, while the PM10 measurements reveal a less pronounced bimodality. There are also clusters with single-peak day-profiles. The BC data in such cases exhibit morning peaks, while the PM10 data consist of noon or afternoon single peaks. Single pronounced peaks can be explained by appropriate cluster wind speed profiles. The analysis also revealed some special day-profiles. The BC cluster with a high midnight peak at 30/04/2010 and the PM10 cluster with the highest observed concentration of PM10 at 01/05/2010 (208.0 μg m-3) coincide with 1 May, which is a national holiday in Slovenia and has very strong tradition of bonfire parties. The clustering of the diurnal concentration showed that various different day-profiles are presented in a cold period, while this is not the case for the hot season. Additional analysis of ship traffic and rain fall data showed that there is no statistically significant difference between the ship gross (bruto) registered tonnage (BRT) values in the case of BC and PM10 clusters, but that there is statistically significant differences between the rain fall in the BC and PM10 clusters. The wind-rose for clusters which included most days in the sampling period indicating that emitted PM10 and BC from Port of Koper were manly transported in the west direction over the sea and in the east direction, where there is in no populated area. Presented analysis showed that both BC and PM10 concentrations were driven by rain intensity and wind speed.
Using Cluster Analysis and ICP-MS to Identify Groups of Ecstasy Tablets in Sao Paulo State, Brazil.
Maione, Camila; de Oliveira Souza, Vanessa Cristina; Togni, Loraine Rezende; da Costa, José Luiz; Campiglia, Andres Dobal; Barbosa, Fernando; Barbosa, Rommel Melgaço
2017-11-01
The variations found in the elemental composition in ecstasy samples result in spectral profiles with useful information for data analysis, and cluster analysis of these profiles can help uncover different categories of the drug. We provide a cluster analysis of ecstasy tablets based on their elemental composition. Twenty-five elements were determined by ICP-MS in tablets apprehended by Sao Paulo's State Police, Brazil. We employ the K-means clustering algorithm along with C4.5 decision tree to help us interpret the clustering results. We found a better number of two clusters within the data, which can refer to the approximated number of sources of the drug which supply the cities of seizures. The C4.5 model was capable of differentiating the ecstasy samples from the two clusters with high prediction accuracy using the leave-one-out cross-validation. The model used only Nd, Ni, and Pb concentration values in the classification of the samples. © 2017 American Academy of Forensic Sciences.
Prime movers: Advanced practice professionals in the role of stroke coordinator.
Rattray, Nicholas A; Damush, Teresa M; Luckhurst, Cherie; Bauer-Martinez, Catherine J; Homoya, Barbara J; Miech, Edward J
2017-07-01
Following a stroke quality improvement clustered randomized trial and a national acute ischemic stroke (AIS) directive in the Veterans Health Administration in 2011, this comparative case study examined the role of advanced practice professionals (APPs) in quality improvement activities among stroke teams. Semistructured interviews were conducted at 11 Veterans Affairs medical centers annually over a 3-year period. A multidisciplinary team analyzed interviews from clinical providers through a mixed-methods, data matrix approach linking APPs (nurse practitioners and physician assistants) with Consolidated Framework for Implementation Research constructs and a group organization measure. Five of 11 facilities independently chose to staff stroke coordinator positions with APPs. Analysis indicated that APPs emerged as boundary spanners across services and disciplines who played an important role in coordinating evidence-based, facility-level approaches to AIS care. The presence of APPs was related to engaging in group-based evaluation of performance data, implementing stroke protocols, monitoring care through data audit, convening interprofessional meetings involving planning activities, and providing direct care. The presence of APPs appears to be an influential feature of local context crucial in developing an advanced, facility-wide approach to stroke care because of their boundary spanning capabilities. ©2017 American Association of Nurse Practitioners.
NASA Technical Reports Server (NTRS)
Zhu, Dongming; Nesbitt, James A.; McCue, Terry R.; Barrett, Charles A.; Miller, Robert A.
2002-01-01
Ceramic thermal barrier coatings will play an increasingly important role in advanced gas turbine engines because of their ability to enable further increases in engine temperatures. However, the coating performance and durability become a major concern under the increasingly harsh thermal cycling conditions. Advanced zirconia- and hafnia-based cluster oxide thermal barrier coatings with lower thermal conductivity and improved thermal stability are being developed using a high-heat-flux laser-rig based test approach. Although the new composition coatings were not yet optimized for cyclic durability, an initial durability screening of numerous candidate coating materials was carried out using conventional furnace cyclic tests. In this paper, furnace thermal cyclic behavior of the advanced plasma-sprayed zirconia-yttria-based thermal barrier coatings that were co-doped with multi-component rare earth oxides was investigated at 1163 C using 45 min hot cycles. The ceramic coating failure mechanisms were studied by using scanning electron microscopy combined with X-ray diffraction phase analysis after the furnace tests. The coating cyclic lifetime will be discussed in relation to coating phase structures, total dopant concentrations, and other properties.
Advances in Machine Learning and Data Mining for Astronomy
NASA Astrophysics Data System (ADS)
Way, Michael J.; Scargle, Jeffrey D.; Ali, Kamal M.; Srivastava, Ashok N.
2012-03-01
Advances in Machine Learning and Data Mining for Astronomy documents numerous successful collaborations among computer scientists, statisticians, and astronomers who illustrate the application of state-of-the-art machine learning and data mining techniques in astronomy. Due to the massive amount and complexity of data in most scientific disciplines, the material discussed in this text transcends traditional boundaries between various areas in the sciences and computer science. The book's introductory part provides context to issues in the astronomical sciences that are also important to health, social, and physical sciences, particularly probabilistic and statistical aspects of classification and cluster analysis. The next part describes a number of astrophysics case studies that leverage a range of machine learning and data mining technologies. In the last part, developers of algorithms and practitioners of machine learning and data mining show how these tools and techniques are used in astronomical applications. With contributions from leading astronomers and computer scientists, this book is a practical guide to many of the most important developments in machine learning, data mining, and statistics. It explores how these advances can solve current and future problems in astronomy and looks at how they could lead to the creation of entirely new algorithms within the data mining community.
Spearson Goulet, Jo-Annie; Tardif, Monique
2018-06-05
Very few studies have taken a specific interest in the various sexual dimensions, beyond delinquent sexual behavior, of adolescents who have engaged in sexual abuse (AESA). Those that went beyond delinquent sexual behavior have report mixed results, suggesting they are a heterogeneous group. The current study used cluster analysis to examine the sexuality profiles of AESA, which included information on several sexual dimensions (atypical and normative fantasies and experiences, drive, body image, pornography, first masturbation, onset of sexual interest and first exposure to sex). Participants (N = 136) are adolescents who have engaged in sexual abuse involving physical contact, for which at least one parent also participated in the study. They were recruited from six specialized treatment centers and three youth centers in Quebec (Canada). Cluster analyses were performed to identify specific sexual profiles. Results suggest three clusters of AESA: 1- Discordant sexuality pertaining to adolescents who show mostly normative sexual interests, 2- Constrictive sexuality, characterizing adolescents who seem to be less invested/interested in their sexuality and 3- Overinvested sexuality for adolescents showing an exacerbated sexuality, including atypical sexual interest. Additional analyses (ANOVAs and Chi-square tests) reveal that five delinquency and offense characteristics were significantly more likely to be present in the Overinvested than the Constrictive cluster: non-sexual offenses, three or more victims, peer victims and alcohol and drug consumption. Advancing our knowledge on this topic can provide relevant data for clinicians to better target interventions. Copyright © 2018. Published by Elsevier Ltd.
Vibration control of a cluster of buildings through the Vibrating Barrier
NASA Astrophysics Data System (ADS)
Tombari, A.; Garcia Espinosa, M.; Alexander, N. A.; Cacciola, P.
2018-02-01
A novel device, called Vibrating Barrier (ViBa), that aims to reduce the vibrations of adjacent structures subjected to ground motion waves has been recently proposed. The ViBa is a structure buried in the soil and detached from surrounding buildings that is able to absorb a significant portion of the dynamic energy arising from the ground motion. The working principle exploits the dynamic interaction among vibrating structures due to the propagation of waves through the soil, namely the structure-soil-structure interaction. In this paper the efficiency of the ViBa is investigated to control the vibrations of a cluster of buildings. To this aim, a discrete model of structures-site interaction involving multiple buildings and the ViBa is developed where the effects of the soil on the structures, i.e. the soil-structure interaction (SSI), the structure-soil-structure interaction (SSSI) as well as the ViBa-soil-structures interaction are taken into account by means of linear elastic springs. Closed-form solutions are derived to design the ViBa in the case of harmonic excitation from the analysis of the discrete model. Advanced finite element numerical simulations are performed in order to assess the efficiency of the ViBa for protecting more than a single building. Parametric studies are also conducted to identify beneficial/adverse effects in the use of the proposed vibration control strategy to protect cluster of buildings. Finally, experimental shake table tests are performed to a prototype of a cluster of two buildings protected by the ViBa device for validating the proposed numerical models.
Batch Computed Tomography Analysis of Projectiles
2016-05-01
error calculation. Projectiles are then grouped together according to the similarity of their components. Also discussed is graphical- cluster analysis...ballistic, armor, grouping, clustering 16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF ABSTRACT UU 18. NUMBER OF...Fig. 10 Graphical structure of 15 clusters of the jacket/core radii profiles with plots of the profiles contained within each cluster . The size of
ERIC Educational Resources Information Center
Raker, Jeffrey R.; Holme, Thomas A.
2014-01-01
A cluster analysis was conducted with a set of survey data on chemistry faculty familiarity with 13 assessment terms. Cluster groupings suggest a high, middle, and low overall familiarity with the terminology and an independent high and low familiarity with terms related to fundamental statistics. The six resultant clusters were found to be…
NASA Technical Reports Server (NTRS)
Ormsby, J. P.
1982-01-01
An examination of the possibilities of using Landsat data to simulate NOAA-6 Advanced Very High Resolution Radiometer (AVHRR) data on two channels, as well as using actual NOAA-6 imagery, for large-scale hydrological studies is presented. A running average was obtained of 18 consecutive pixels of 1 km resolution taken by the Landsat scanners were scaled up to 8-bit data and investigated for different gray levels. AVHRR data comprising five channels of 10-bit, band-interleaved information covering 10 deg latitude were analyzed and a suitable pixel grid was chosen for comparison with the Landsat data in a supervised classification format, an unsupervised mode, and with ground truth. Landcover delineation was explored by removing snow, water, and cloud features from the cluster analysis, and resulted in less than 10% difference. Low resolution large-scale data was determined useful for characterizing some landcover features if weekly and/or monthly updates are maintained.
NeAT: a toolbox for the analysis of biological networks, clusters, classes and pathways.
Brohée, Sylvain; Faust, Karoline; Lima-Mendez, Gipsi; Sand, Olivier; Janky, Rekin's; Vanderstocken, Gilles; Deville, Yves; van Helden, Jacques
2008-07-01
The network analysis tools (NeAT) (http://rsat.ulb.ac.be/neat/) provide a user-friendly web access to a collection of modular tools for the analysis of networks (graphs) and clusters (e.g. microarray clusters, functional classes, etc.). A first set of tools supports basic operations on graphs (comparison between two graphs, neighborhood of a set of input nodes, path finding and graph randomization). Another set of programs makes the connection between networks and clusters (graph-based clustering, cliques discovery and mapping of clusters onto a network). The toolbox also includes programs for detecting significant intersections between clusters/classes (e.g. clusters of co-expression versus functional classes of genes). NeAT are designed to cope with large datasets and provide a flexible toolbox for analyzing biological networks stored in various databases (protein interactions, regulation and metabolism) or obtained from high-throughput experiments (two-hybrid, mass-spectrometry and microarrays). The web interface interconnects the programs in predefined analysis flows, enabling to address a series of questions about networks of interest. Each tool can also be used separately by entering custom data for a specific analysis. NeAT can also be used as web services (SOAP/WSDL interface), in order to design programmatic workflows and integrate them with other available resources.
Identification and characterization of near-fatal asthma phenotypes by cluster analysis.
Serrano-Pariente, J; Rodrigo, G; Fiz, J A; Crespo, A; Plaza, V
2015-09-01
Near-fatal asthma (NFA) is a heterogeneous clinical entity and several profiles of patients have been described according to different clinical, pathophysiological and histological features. However, there are no previous studies that identify in a unbiased way--using statistical methods such as clusters analysis--different phenotypes of NFA. Therefore, the aim of the present study was to identify and to characterize phenotypes of near fatal asthma using a cluster analysis. Over a period of 2 years, 33 Spanish hospitals enrolled 179 asthmatics admitted for an episode of NFA. A cluster analysis using two-steps algorithm was performed from data of 84 of these cases. The analysis defined three clusters of patients with NFA: cluster 1, the largest, including older patients with clinical and therapeutic criteria of severe asthma; cluster 2, with an high proportion of respiratory arrest (68%), impaired consciousness level (82%) and mechanical ventilation (93%); and cluster 3, which included younger patients, characterized by an insufficient anti-inflammatory treatment and frequent sensitization to Alternaria alternata and soybean. These results identify specific asthma phenotypes involved in NFA, confirming in part previous findings observed in studies with a clinical approach. The identification of patients with a specific NFA phenotype could suggest interventions to prevent future severe asthma exacerbations. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Chen, Shan; Li, Xiao-ning; Liang, Yi-zeng; Zhang, Zhi-min; Liu, Zhao-xia; Zhang, Qi-ming; Ding, Li-xia; Ye, Fei
2010-08-01
During Raman spectroscopy analysis, the organic molecules and contaminations will obscure or swamp Raman signals. The present study starts from Raman spectra of prednisone acetate tablets and glibenclamide tables, which are acquired from the BWTek i-Raman spectrometer. The background is corrected by R package baselineWavelet. Then principle component analysis and random forests are used to perform clustering analysis. Through analyzing the Raman spectra of two medicines, the accurate and validity of this background-correction algorithm is checked and the influences of fluorescence background on Raman spectra clustering analysis is discussed. Thus, it is concluded that it is important to correct fluorescence background for further analysis, and an effective background correction solution is provided for clustering or other analysis.
A Survey of Popular R Packages for Cluster Analysis
ERIC Educational Resources Information Center
Flynt, Abby; Dean, Nema
2016-01-01
Cluster analysis is a set of statistical methods for discovering new group/class structure when exploring data sets. This article reviews the following popular libraries/commands in the R software language for applying different types of cluster analysis: from the stats library, the kmeans, and hclust functions; the mclust library; the poLCA…
Using Cluster Analysis for Data Mining in Educational Technology Research
ERIC Educational Resources Information Center
Antonenko, Pavlo D.; Toy, Serkan; Niederhauser, Dale S.
2012-01-01
Cluster analysis is a group of statistical methods that has great potential for analyzing the vast amounts of web server-log data to understand student learning from hyperlinked information resources. In this methodological paper we provide an introduction to cluster analysis for educational technology researchers and illustrate its use through…
Cluster Analysis of the Luria-Nebraska Neuropsychological Battery with Learning Disabled Adults.
ERIC Educational Resources Information Center
McCue, Michael; And Others
The study reports a cluster analysis of Luria-Nebraska Neuropsychological Battery sources of 25 learning disabled adults. The cluster analysis suggested the presence of three subgroups within this sample, one having high elevations on the Rhythm, Writing, Reading, and Arithmetic Rhythm scales, the second having an extremely high evelation on the…
A nonparametric clustering technique which estimates the number of clusters
NASA Technical Reports Server (NTRS)
Ramey, D. B.
1983-01-01
In applications of cluster analysis, one usually needs to determine the number of clusters, K, and the assignment of observations to each cluster. A clustering technique based on recursive application of a multivariate test of bimodality which automatically estimates both K and the cluster assignments is presented.
Subgroups of physically abusive parents based on cluster analysis of parenting behavior and affect.
Haskett, Mary E; Smith Scott, Susan; Sabourin Ward, Caryn
2004-10-01
Cluster analysis of observed parenting and self-reported discipline was used to categorize 83 abusive parents into subgroups. A 2-cluster solution received support for validity. Cluster 1 parents were relatively warm, positive, sensitive, and engaged during interactions with their children, whereas Cluster 2 parents were relatively negative, disengaged or intrusive, and insensitive. Further, clusters differed in emotional health, parenting stress, perceptions of children, and problem solving. Children of parents in the 2 clusters differed on several indexes of social adjustment. Cluster 1 parents were similar to nonabusive parents (n = 66) on parenting and related constructs, but Cluster 2 parents differed from nonabusive parents on all clustering variables and many validation variables. Results highlight clinically relevant diversity in parenting practices and functioning among abusive parents. ((c) 2004 APA, all rights reserved).
Sputum neutrophil counts are associated with more severe asthma phenotypes using cluster analysis.
Moore, Wendy C; Hastie, Annette T; Li, Xingnan; Li, Huashi; Busse, William W; Jarjour, Nizar N; Wenzel, Sally E; Peters, Stephen P; Meyers, Deborah A; Bleecker, Eugene R
2014-06-01
Clinical cluster analysis from the Severe Asthma Research Program (SARP) identified 5 asthma subphenotypes that represent the severity spectrum of early-onset allergic asthma, late-onset severe asthma, and severe asthma with chronic obstructive pulmonary disease characteristics. Analysis of induced sputum from a subset of SARP subjects showed 4 sputum inflammatory cellular patterns. Subjects with concurrent increases in eosinophil (≥2%) and neutrophil (≥40%) percentages had characteristics of very severe asthma. To better understand interactions between inflammation and clinical subphenotypes, we integrated inflammatory cellular measures and clinical variables in a new cluster analysis. Participants in SARP who underwent sputum induction at 3 clinical sites were included in this analysis (n = 423). Fifteen variables, including clinical characteristics and blood and sputum inflammatory cell assessments, were selected using factor analysis for unsupervised cluster analysis. Four phenotypic clusters were identified. Cluster A (n = 132) and B (n = 127) subjects had mild-to-moderate early-onset allergic asthma with paucigranulocytic or eosinophilic sputum inflammatory cell patterns. In contrast, these inflammatory patterns were present in only 7% of cluster C (n = 117) and D (n = 47) subjects who had moderate-to-severe asthma with frequent health care use despite treatment with high doses of inhaled or oral corticosteroids and, in cluster D, reduced lung function. The majority of these subjects (>83%) had sputum neutrophilia either alone or with concurrent sputum eosinophilia. Baseline lung function and sputum neutrophil percentages were the most important variables determining cluster assignment. This multivariate approach identified 4 asthma subphenotypes representing the severity spectrum from mild-to-moderate allergic asthma with minimal or eosinophil-predominant sputum inflammation to moderate-to-severe asthma with neutrophil-predominant or mixed granulocytic inflammation. Published by Mosby, Inc.
Sputum neutrophils are associated with more severe asthma phenotypes using cluster analysis
Moore, Wendy C.; Hastie, Annette T.; Li, Xingnan; Li, Huashi; Busse, William W.; Jarjour, Nizar N.; Wenzel, Sally E.; Peters, Stephen P.; Meyers, Deborah A.; Bleecker, Eugene R.
2013-01-01
Background Clinical cluster analysis from the Severe Asthma Research Program (SARP) identified five asthma subphenotypes that represent the severity spectrum of early onset allergic asthma, late onset severe asthma and severe asthma with COPD characteristics. Analysis of induced sputum from a subset of SARP subjects showed four sputum inflammatory cellular patterns. Subjects with concurrent increases in eosinophils (≥2%) and neutrophils (≥40%) had characteristics of very severe asthma. Objective To better understand interactions between inflammation and clinical subphenotypes we integrated inflammatory cellular measures and clinical variables in a new cluster analysis. Methods Participants in SARP at three clinical sites who underwent sputum induction were included in this analysis (n=423). Fifteen variables including clinical characteristics and blood and sputum inflammatory cell assessments were selected by factor analysis for unsupervised cluster analysis. Results Four phenotypic clusters were identified. Cluster A (n=132) and B (n=127) subjects had mild-moderate early onset allergic asthma with paucigranulocytic or eosinophilic sputum inflammatory cell patterns. In contrast, these inflammatory patterns were present in only 7% of Cluster C (n=117) and D (n=47) subjects who had moderate-severe asthma with frequent health care utilization despite treatment with high doses of inhaled or oral corticosteroids, and in Cluster D, reduced lung function. The majority these subjects (>83%) had sputum neutrophilia either alone or with concurrent sputum eosinophilia. Baseline lung function and sputum neutrophils were the most important variables determining cluster assignment. Conclusion This multivariate approach identified four asthma subphenotypes representing the severity spectrum from mild-moderate allergic asthma with minimal or eosinophilic predominant sputum inflammation to moderate-severe asthma with neutrophilic predominant or mixed granulocytic inflammation. PMID:24332216
Telescope Scientist on the Advanced X-ray Astrophysics Observatory
NASA Technical Reports Server (NTRS)
Smith, Carl M. (Technical Monitor); VanSpeybroeck, Leon; Tananbaum, Harvey D.
2004-01-01
In this period, the Chandra X-ray Observatory continued to perform exceptionally well, with many scientific observations and spectacular results. The HRMA performance continues to be essentially identical to that predicted from ground calibration data. The Telescope Scientist Team has improved the mirror model to provide a more accurate description to the Chandra observers, enabling them to reduce the systematic errors and uncertainties in their data reduction. There also has been good progress in the scientific program. Using the Telescope Scientist GTO time, we carried out an extensive Chandra program to observe distant clusters of galaxies. The goals of this program were to use clusters to derive cosmological constraints and to investigate the physics and evolution of clusters. A total of 71 clusters were observed with ACIS-I; the last observations were completed in December 2003.
Xavier, Paulrajpillai Lourdu; Chaudhari, Kamalesh; Baksi, Ananya; Pradeep, Thalappil
2012-01-01
Noble metal quantum clusters (NMQCs) are the missing link between isolated noble metal atoms and nanoparticles. NMQCs are sub-nanometer core sized clusters composed of a group of atoms, most often luminescent in the visible region, and possess intriguing photo-physical and chemical properties. A trend is observed in the use of ligands, ranging from phosphines to functional proteins, for the synthesis of NMQCs in the liquid phase. In this review, we briefly overview recent advancements in the synthesis of protein protected NMQCs with special emphasis on their structural and photo-physical properties. In view of the protein protection, coupled with direct synthesis and easy functionalization, this hybrid QC-protein system is expected to have numerous optical and bioimaging applications in the future, pointers in this direction are visible in the literature. PMID:22312454
Identification and validation of asthma phenotypes in Chinese population using cluster analysis.
Wang, Lei; Liang, Rui; Zhou, Ting; Zheng, Jing; Liang, Bing Miao; Zhang, Hong Ping; Luo, Feng Ming; Gibson, Peter G; Wang, Gang
2017-10-01
Asthma is a heterogeneous airway disease, so it is crucial to clearly identify clinical phenotypes to achieve better asthma management. To identify and prospectively validate asthma clusters in a Chinese population. Two hundred eighty-four patients were consecutively recruited and 18 sociodemographic and clinical variables were collected. Hierarchical cluster analysis was performed by the Ward method followed by k-means cluster analysis. Then, a prospective 12-month cohort study was used to validate the identified clusters. Five clusters were successfully identified. Clusters 1 (n = 71) and 3 (n = 81) were mild asthma phenotypes with slight airway obstruction and low exacerbation risk, but with a sex differential. Cluster 2 (n = 65) described an "allergic" phenotype, cluster 4 (n = 33) featured a "fixed airflow limitation" phenotype with smoking, and cluster 5 (n = 34) was a "low socioeconomic status" phenotype. Patients in clusters 2, 4, and 5 had distinctly lower socioeconomic status and more psychological symptoms. Cluster 2 had a significantly increased risk of exacerbations (risk ratio [RR] 1.13, 95% confidence interval [CI] 1.03-1.25), unplanned visits for asthma (RR 1.98, 95% CI 1.07-3.66), and emergency visits for asthma (RR 7.17, 95% CI 1.26-40.80). Cluster 4 had an increased risk of unplanned visits (RR 2.22, 95% CI 1.02-4.81), and cluster 5 had increased emergency visits (RR 12.72, 95% CI 1.95-69.78). Kaplan-Meier analysis confirmed that cluster grouping was predictive of time to the first asthma exacerbation, unplanned visit, emergency visit, and hospital admission (P < .0001 for all comparisons). We identified 3 clinical clusters as "allergic asthma," "fixed airflow limitation," and "low socioeconomic status" phenotypes that are at high risk of severe asthma exacerbations and that have management implications for clinical practice in developing countries. Copyright © 2017 American College of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Network Analysis Tools: from biological networks to clusters and pathways.
Brohée, Sylvain; Faust, Karoline; Lima-Mendez, Gipsi; Vanderstocken, Gilles; van Helden, Jacques
2008-01-01
Network Analysis Tools (NeAT) is a suite of computer tools that integrate various algorithms for the analysis of biological networks: comparison between graphs, between clusters, or between graphs and clusters; network randomization; analysis of degree distribution; network-based clustering and path finding. The tools are interconnected to enable a stepwise analysis of the network through a complete analytical workflow. In this protocol, we present a typical case of utilization, where the tasks above are combined to decipher a protein-protein interaction network retrieved from the STRING database. The results returned by NeAT are typically subnetworks, networks enriched with additional information (i.e., clusters or paths) or tables displaying statistics. Typical networks comprising several thousands of nodes and arcs can be analyzed within a few minutes. The complete protocol can be read and executed in approximately 1 h.
Cochrane, Rachel V K; Sanichar, Randy; Lambkin, Gareth R; Reiz, Béla; Xu, Wei; Tang, Yi; Vederas, John C
2016-01-11
The antimalarial agent cladosporin is a nanomolar inhibitor of the Plasmodium falciparum lysyl-tRNA synthetase, and exhibits activity against both blood- and liver-stage infection. Cladosporin can be isolated from the fungus Cladosporium cladosporioides, where it is biosynthesized by a highly reducing (HR) and a non-reducing (NR) iterative type I polyketide synthase (PKS) pair. Genome sequencing of the host organism and subsequent heterologous expression of these enzymes in Saccharomyces cerevisiae produced cladosporin, confirming the identity of the putative gene cluster. Incorporation of a pentaketide intermediate analogue indicated a 5+3 assembly by the HR PKS Cla2 and the NR PKS Cla3 during cladosporin biosynthesis. Advanced-intermediate analogues were synthesized and incorporated by Cla3 to furnish new cladosporin analogues. A putative lysyl-tRNA synthetase resistance gene was identified in the cladosporin gene cluster. Analysis of the active site emphasizes key structural features thought to be important in resistance to cladosporin. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Lu, Jing; Chen, Lei; Yin, Jun; Huang, Tao; Bi, Yi; Kong, Xiangyin; Zheng, Mingyue; Cai, Yu-Dong
2016-01-01
Lung cancer, characterized by uncontrolled cell growth in the lung tissue, is the leading cause of global cancer deaths. Until now, effective treatment of this disease is limited. Many synthetic compounds have emerged with the advancement of combinatorial chemistry. Identification of effective lung cancer candidate drug compounds among them is a great challenge. Thus, it is necessary to build effective computational methods that can assist us in selecting for potential lung cancer drug compounds. In this study, a computational method was proposed to tackle this problem. The chemical-chemical interactions and chemical-protein interactions were utilized to select candidate drug compounds that have close associations with approved lung cancer drugs and lung cancer-related genes. A permutation test and K-means clustering algorithm were employed to exclude candidate drugs with low possibilities to treat lung cancer. The final analysis suggests that the remaining drug compounds have potential anti-lung cancer activities and most of them have structural dissimilarity with approved drugs for lung cancer.
Evidence for a Significant Intermediate-Age Population in the M31 Halo from Main Sequence Photometry
NASA Technical Reports Server (NTRS)
Brown, Thomas M.; Ferguson, Henry C.; Smith, Ed; Kimble, Randy A.; Sweigart, Allen V.; Renzini, Alvio; Rich, R. Michael; Vandenberg, Don A.
2003-01-01
We present a color-magnitude diagram (CMD) for a minor-axis field in the halo of the Andromeda galaxy (M3l), 51 arcmin (11 kpc) from the nucleus. These observations, taken with the Advanced Camera for Surveys (ACS) on the Hubble Space Telescope, are the deepest optical images yet obtained, attaining 50% completeness at m(sub v) = 30.7 mag. The CMD, constructed from approx. 3 x 10(exp 5) stars, reaches more than 1.5 mag fainter than the old main-sequence turnoff. Our analysis is based on direct comparisons to ACS observations of four globular clusters through the same filters, as well as chi square fitting to a finely-spaced grid of calibrated stellar-population models. We find that the M31 halo contains a major (approx. 30% by mass) intermediate-age (6-8 Gyr) metal-rich ([Fe/H] greater than -0.5) population, as well as a significant globular-cluster age (11-13.5 Gyr) metal-poor population. These findings support the idea that galaxy mergers played an important role in the formation of the M31 halo.
ERIC Educational Resources Information Center
Viriyangkura, Yuwadee
2014-01-01
Through a secondary analysis of statewide data from Colorado, people with intellectual and related developmental disabilities (ID/DD) were classified into five clusters based on their support needs characteristics using cluster analysis techniques. Prior latent factor models of support needs in the field of ID/DD were examined to investigate the…
Lazzeri, Giacomo; Panatto, Donatella; Domnich, Alexander; Arata, Lucia; Pammolli, Andrea; Simi, Rita; Giacchi, Mariano Vincenzo; Amicizia, Daniela; Gasparini, Roberto
2018-01-01
Abstract Background A huge amount of literature suggests that adolescents’ health-related behaviors tend to occur in clusters, and the understanding of such behavioral clustering may have direct implications for the effective tailoring of health-promotion interventions. Despite the usefulness of analyzing clustering, Italian data on this topic are scant. This study aimed to evaluate the clustering patterns of health-related behaviors. Methods The present study is based on data from the Health Behaviors in School-aged Children (HBSC) study conducted in Tuscany in 2010, which involved 3291 11-, 13- and 15-year olds. To aggregate students’ data on 22 health-related behaviors, factor analysis and subsequent cluster analysis were performed. Results Factor analysis revealed eight factors, which were dubbed in accordance with their main traits: ‘Alcohol drinking’, ‘Smoking’, ‘Physical activity’, ‘Screen time’, ‘Signs & symptoms’, ‘Healthy eating’, ‘Violence’ and ‘Sweet tooth’. These factors explained 67% of variance and underwent cluster analysis. A six-cluster κ-means solution was established with a 93.8% level of classification validity. The between-cluster differences in both mean age and gender distribution were highly statistically significant. Conclusions Health-compromising behaviors are common among Tuscan teens and occur in distinct clusters. These results may be used by schools, health-promotion authorities and other stakeholders to design and implement tailored preventive interventions in Tuscany. PMID:27908972
Lazzeri, Giacomo; Panatto, Donatella; Domnich, Alexander; Arata, Lucia; Pammolli, Andrea; Simi, Rita; Giacchi, Mariano Vincenzo; Amicizia, Daniela; Gasparini, Roberto
2018-03-01
A huge amount of literature suggests that adolescents' health-related behaviors tend to occur in clusters, and the understanding of such behavioral clustering may have direct implications for the effective tailoring of health-promotion interventions. Despite the usefulness of analyzing clustering, Italian data on this topic are scant. This study aimed to evaluate the clustering patterns of health-related behaviors. The present study is based on data from the Health Behaviors in School-aged Children (HBSC) study conducted in Tuscany in 2010, which involved 3291 11-, 13- and 15-year olds. To aggregate students' data on 22 health-related behaviors, factor analysis and subsequent cluster analysis were performed. Factor analysis revealed eight factors, which were dubbed in accordance with their main traits: 'Alcohol drinking', 'Smoking', 'Physical activity', 'Screen time', 'Signs & symptoms', 'Healthy eating', 'Violence' and 'Sweet tooth'. These factors explained 67% of variance and underwent cluster analysis. A six-cluster κ-means solution was established with a 93.8% level of classification validity. The between-cluster differences in both mean age and gender distribution were highly statistically significant. Health-compromising behaviors are common among Tuscan teens and occur in distinct clusters. These results may be used by schools, health-promotion authorities and other stakeholders to design and implement tailored preventive interventions in Tuscany.
Henry, David; Dymnicki, Allison B.; Mohatt, Nathaniel; Allen, James; Kelly, James G.
2016-01-01
Qualitative methods potentially add depth to prevention research, but can produce large amounts of complex data even with small samples. Studies conducted with culturally distinct samples often produce voluminous qualitative data, but may lack sufficient sample sizes for sophisticated quantitative analysis. Currently lacking in mixed methods research are methods allowing for more fully integrating qualitative and quantitative analysis techniques. Cluster analysis can be applied to coded qualitative data to clarify the findings of prevention studies by aiding efforts to reveal such things as the motives of participants for their actions and the reasons behind counterintuitive findings. By clustering groups of participants with similar profiles of codes in a quantitative analysis, cluster analysis can serve as a key component in mixed methods research. This article reports two studies. In the first study, we conduct simulations to test the accuracy of cluster assignment using three different clustering methods with binary data as produced when coding qualitative interviews. Results indicated that hierarchical clustering, K-Means clustering, and latent class analysis produced similar levels of accuracy with binary data, and that the accuracy of these methods did not decrease with samples as small as 50. Whereas the first study explores the feasibility of using common clustering methods with binary data, the second study provides a “real-world” example using data from a qualitative study of community leadership connected with a drug abuse prevention project. We discuss the implications of this approach for conducting prevention research, especially with small samples and culturally distinct communities. PMID:25946969
Gonzalez, Robert; Suppes, Trisha; Zeitzer, Jamie; McClung, Colleen; Tamminga, Carol; Tohen, Mauricio; Forero, Angelica; Dwivedi, Alok; Alvarado, Andres
2018-02-19
Multiple types of chronobiological disturbances have been reported in bipolar disorder, including characteristics associated with general activity levels, sleep, and rhythmicity. Previous studies have focused on examining the individual relationships between affective state and chronobiological characteristics. The aim of this study was to conduct a variable cluster analysis in order to ascertain how mood states are associated with chronobiological traits in bipolar I disorder (BDI). We hypothesized that manic symptomatology would be associated with disturbances of rhythm. Variable cluster analysis identified five chronobiological clusters in 105 BDI subjects. Cluster 1, comprising subjective sleep quality was associated with both mania and depression. Cluster 2, which comprised variables describing the degree of rhythmicity, was associated with mania. Significant associations between mood state and cluster analysis-identified chronobiological variables were noted. Disturbances of mood were associated with subjectively assessed sleep disturbances as opposed to objectively determined, actigraphy-based sleep variables. No associations with general activity variables were noted. Relationships between gender and medication classes in use and cluster analysis-identified chronobiological characteristics were noted. Exploratory analyses noted that medication class had a larger impact on these relationships than the number of psychiatric medications in use. In a BDI sample, variable cluster analysis was able to group related chronobiological variables. The results support our primary hypothesis that mood state, particularly mania, is associated with chronobiological disturbances. Further research is required in order to define these relationships and to determine the directionality of the associations between mood state and chronobiological characteristics.
Henry, David; Dymnicki, Allison B; Mohatt, Nathaniel; Allen, James; Kelly, James G
2015-10-01
Qualitative methods potentially add depth to prevention research but can produce large amounts of complex data even with small samples. Studies conducted with culturally distinct samples often produce voluminous qualitative data but may lack sufficient sample sizes for sophisticated quantitative analysis. Currently lacking in mixed-methods research are methods allowing for more fully integrating qualitative and quantitative analysis techniques. Cluster analysis can be applied to coded qualitative data to clarify the findings of prevention studies by aiding efforts to reveal such things as the motives of participants for their actions and the reasons behind counterintuitive findings. By clustering groups of participants with similar profiles of codes in a quantitative analysis, cluster analysis can serve as a key component in mixed-methods research. This article reports two studies. In the first study, we conduct simulations to test the accuracy of cluster assignment using three different clustering methods with binary data as produced when coding qualitative interviews. Results indicated that hierarchical clustering, K-means clustering, and latent class analysis produced similar levels of accuracy with binary data and that the accuracy of these methods did not decrease with samples as small as 50. Whereas the first study explores the feasibility of using common clustering methods with binary data, the second study provides a "real-world" example using data from a qualitative study of community leadership connected with a drug abuse prevention project. We discuss the implications of this approach for conducting prevention research, especially with small samples and culturally distinct communities.
Temperature Map of the Perseus Cluster of Galaxies Observed with ASCA
NASA Technical Reports Server (NTRS)
Furusho, T.; Yamasaki, N. Y.; Ohashi, T.; Shibata, R.; Ezawa, H.; White, Nicholas E. (Technical Monitor)
2000-01-01
We present two-dimensional temperature map of the Perseus cluster based on multi-pointing observations with the Advanced Spacecraft for Cosmology Astrophysics (ASCA) Gas Imaging Spectrometer (GIS), covering a region with a diameter of approximately 2 deg. By correcting for the effect of the X-ray telescope response, the temperatures were estimated from hardness ratios and the complete temperature structure of the cluster with a spatial resolution of about 100 kpc was obtained for the first time. There is an extended cool region with a diameter of approximately 20 arcmin and kT approx. 5 keV at about 20 arcmin east from the cluster center. This region also shows higher surface brightness and is surrounded by a large ring-like hot region with kT approx. > 7 keV, and likely to be a remnant of a merger with a poor cluster. Another extended cool region is extending outward from the IC 310 subcluster. These features and the presence of several other hot and cool blobs suggest that this rich cluster has been formed as a result of a repetition of many subcluster mergers.
NASA Astrophysics Data System (ADS)
Vančová, Viera; Čambál, Miloš; Cagáňová, Dagmar
2012-12-01
Nowadays, the opportunity for companies to be involved in cluster initiatives and international business associations is a major factor that contributes to the increase of their innovative potential. Companies organized in technological clusters have greater access to mutual business contacts, faster information transfer and deployment of advanced technologies. These companies cooperate more frequently with universities and research - development institutions on innovative projects. An important benefit of cluster associations is that they create a suitable environment for innovation and the transfer of knowledge by means of international cooperation and networking. This supportive environment is not easy to access for different small and mediumsized companies, who are not members of any clusters or networks. Supplier-customer business channels expand by means of transnational networks and exchanges of experience. Knowledge potential is broadened and joint innovative projects are developed. Reflecting the growing importance of clusters as driving forces of economic and regional development, a number of cluster policies and initiatives have emerged in the last few decades, oriented to encourage the establishment of new clusters, to support existing clusters, or to assist the development of transnational cooperation. To achieve the goals of the Europe 2020 Strategy, European countries should have an interest in building strong clusters and developing cluster cooperation by sharing specialized research infrastructures and testing facilities and facilitating knowledge transfer for crossborder cooperation. This requires developing a long term joint strategy in order to facilitate the development of open global clusters and innovative small and medium entrepreneurs.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stello, Dennis; Huber, Daniel; Bedding, Timothy R.
Studying star clusters offers significant advances in stellar astrophysics due to the combined power of having many stars with essentially the same distance, age, and initial composition. This makes clusters excellent test benches for verification of stellar evolution theory. To fully exploit this potential, it is vital that the star sample is uncontaminated by stars that are not members of the cluster. Techniques for determining cluster membership therefore play a key role in the investigation of clusters. We present results on three clusters in the Kepler field of view based on a newly established technique that uses asteroseismology to identifymore » fore- or background stars in the field, which demonstrates advantages over classical methods such as kinematic and photometry measurements. Four previously identified seismic non-members in NGC 6819 are confirmed in this study, and three additional non-members are found-two in NGC 6819 and one in NGC 6791. We further highlight which stars are, or might be, affected by blending, which needs to be taken into account when analyzing these Kepler data.« less
Obstructive Sleep Apnea: A Cluster Analysis at Time of Diagnosis
Grillet, Yves; Richard, Philippe; Stach, Bruno; Vivodtzev, Isabelle; Timsit, Jean-Francois; Lévy, Patrick; Tamisier, Renaud; Pépin, Jean-Louis
2016-01-01
Background The classification of obstructive sleep apnea is on the basis of sleep study criteria that may not adequately capture disease heterogeneity. Improved phenotyping may improve prognosis prediction and help select therapeutic strategies. Objectives: This study used cluster analysis to investigate the clinical clusters of obstructive sleep apnea. Methods An ascending hierarchical cluster analysis was performed on baseline symptoms, physical examination, risk factor exposure and co-morbidities from 18,263 participants in the OSFP (French national registry of sleep apnea). The probability for criteria to be associated with a given cluster was assessed using odds ratios, determined by univariate logistic regression. Results: Six clusters were identified, in which patients varied considerably in age, sex, symptoms, obesity, co-morbidities and environmental risk factors. The main significant differences between clusters were minimally symptomatic versus sleepy obstructive sleep apnea patients, lean versus obese, and among obese patients different combinations of co-morbidities and environmental risk factors. Conclusions Our cluster analysis identified six distinct clusters of obstructive sleep apnea. Our findings underscore the high degree of heterogeneity that exists within obstructive sleep apnea patients regarding clinical presentation, risk factors and consequences. This may help in both research and clinical practice for validating new prevention programs, in diagnosis and in decisions regarding therapeutic strategies. PMID:27314230
Calibrating the Planck cluster mass scale with CLASH
NASA Astrophysics Data System (ADS)
Penna-Lima, M.; Bartlett, J. G.; Rozo, E.; Melin, J.-B.; Merten, J.; Evrard, A. E.; Postman, M.; Rykoff, E.
2017-08-01
We determine the mass scale of Planck galaxy clusters using gravitational lensing mass measurements from the Cluster Lensing And Supernova survey with Hubble (CLASH). We have compared the lensing masses to the Planck Sunyaev-Zeldovich (SZ) mass proxy for 21 clusters in common, employing a Bayesian analysis to simultaneously fit an idealized CLASH selection function and the distribution between the measured observables and true cluster mass. We used a tiered analysis strategy to explicitly demonstrate the importance of priors on weak lensing mass accuracy. In the case of an assumed constant bias, bSZ, between true cluster mass, M500, and the Planck mass proxy, MPL, our analysis constrains 1-bSZ = 0.73 ± 0.10 when moderate priors on weak lensing accuracy are used, including a zero-mean Gaussian with standard deviation of 8% to account for possible bias in lensing mass estimations. Our analysis explicitly accounts for possible selection bias effects in this calibration sourced by the CLASH selection function. Our constraint on the cluster mass scale is consistent with recent results from the Weighing the Giants program and the Canadian Cluster Comparison Project. It is also consistent, at 1.34σ, with the value needed to reconcile the Planck SZ cluster counts with Planck's base ΛCDM model fit to the primary cosmic microwave background anisotropies.
NASA Technical Reports Server (NTRS)
Milner, G. Martin; Black, Mike; Hovenga, Mike; Mcclure, Paul; Miller, Patrice
1988-01-01
The application of vibration monitoring to the rotating machinery typical of ECLSS components in advanced NASA spacecraft was studied. It is found that the weighted summation of the accelerometer power spectrum is the most successful detection scheme for a majority of problem types. Other detection schemes studied included high-frequency demodulation, cepstrum, clustering, and amplitude processing.
Another collision for the Coma cluster
NASA Technical Reports Server (NTRS)
Vikhlinin, A.; Forman, W.; Jones, C.
1996-01-01
The wavelet transform analysis of the Rosat position sensitive proportional counter (PSPC) images of the Coma cluster are presented. The analysis shows, on small scales, a substructure dominated by two extended sources surrounding the two bright clusters NGC 4874 and NGC 4889. On scales of about 2 arcmin to 3 arcmin, the analysis reveals a tail of X-ray emission originating near the cluster center, curving to the south and east for approximately 25 arcmin and ending near the galaxy NGC 4911. The results are interpreted in terms of a merger of a group, having a core mass of approximately 10(exp 13) solar mass, with the main body of the Coma cluster.
Leung, S C; Fung, W K; Wong, K H
1999-01-01
The relative bit density variation graphs of 207 specimen credit cards processed by 12 encoding machines were examined first visually, and then classified by means of hierarchical cluster analysis. Twenty-nine credit cards being treated as 'questioned' samples were tested by way of cluster analysis against 'controls' derived from known encoders. It was found that hierarchical cluster analysis provided a high accuracy of identification with all 29 'questioned' samples classified correctly. On the other hand, although visual comparison of jitter graphs was less discriminating, it was nevertheless capable of giving a reasonably accurate result.