Braschel, Melissa C; Svec, Ivana; Darlington, Gerarda A; Donner, Allan
2016-04-01
Many investigators rely on previously published point estimates of the intraclass correlation coefficient rather than on their associated confidence intervals to determine the required size of a newly planned cluster randomized trial. Although confidence interval methods for the intraclass correlation coefficient that can be applied to community-based trials have been developed for a continuous outcome variable, fewer methods exist for a binary outcome variable. The aim of this study is to evaluate confidence interval methods for the intraclass correlation coefficient applied to binary outcomes in community intervention trials enrolling a small number of large clusters. Existing methods for confidence interval construction are examined and compared to a new ad hoc approach based on dividing clusters into a large number of smaller sub-clusters and subsequently applying existing methods to the resulting data. Monte Carlo simulation is used to assess the width and coverage of confidence intervals for the intraclass correlation coefficient based on Smith's large sample approximation of the standard error of the one-way analysis of variance estimator, an inverted modified Wald test for the Fleiss-Cuzick estimator, and intervals constructed using a bootstrap-t applied to a variance-stabilizing transformation of the intraclass correlation coefficient estimate. In addition, a new approach is applied in which clusters are randomly divided into a large number of smaller sub-clusters with the same methods applied to these data (with the exception of the bootstrap-t interval, which assumes large cluster sizes). These methods are also applied to a cluster randomized trial on adolescent tobacco use for illustration. When applied to a binary outcome variable in a small number of large clusters, existing confidence interval methods for the intraclass correlation coefficient provide poor coverage. However, confidence intervals constructed using the new approach combined with Smith's method provide nominal or close to nominal coverage when the intraclass correlation coefficient is small (<0.05), as is the case in most community intervention trials. This study concludes that when a binary outcome variable is measured in a small number of large clusters, confidence intervals for the intraclass correlation coefficient may be constructed by dividing existing clusters into sub-clusters (e.g. groups of 5) and using Smith's method. The resulting confidence intervals provide nominal or close to nominal coverage across a wide range of parameters when the intraclass correlation coefficient is small (<0.05). Application of this method should provide investigators with a better understanding of the uncertainty associated with a point estimator of the intraclass correlation coefficient used for determining the sample size needed for a newly designed community-based trial. © The Author(s) 2015.
Knox, Stephanie A; Chondros, Patty
2004-01-01
Background Cluster sample study designs are cost effective, however cluster samples violate the simple random sample assumption of independence of observations. Failure to account for the intra-cluster correlation of observations when sampling through clusters may lead to an under-powered study. Researchers therefore need estimates of intra-cluster correlation for a range of outcomes to calculate sample size. We report intra-cluster correlation coefficients observed within a large-scale cross-sectional study of general practice in Australia, where the general practitioner (GP) was the primary sampling unit and the patient encounter was the unit of inference. Methods Each year the Bettering the Evaluation and Care of Health (BEACH) study recruits a random sample of approximately 1,000 GPs across Australia. Each GP completes details of 100 consecutive patient encounters. Intra-cluster correlation coefficients were estimated for patient demographics, morbidity managed and treatments received. Intra-cluster correlation coefficients were estimated for descriptive outcomes and for associations between outcomes and predictors and were compared across two independent samples of GPs drawn three years apart. Results Between April 1999 and March 2000, a random sample of 1,047 Australian general practitioners recorded details of 104,700 patient encounters. Intra-cluster correlation coefficients for patient demographics ranged from 0.055 for patient sex to 0.451 for language spoken at home. Intra-cluster correlations for morbidity variables ranged from 0.005 for the management of eye problems to 0.059 for management of psychological problems. Intra-cluster correlation for the association between two variables was smaller than the descriptive intra-cluster correlation of each variable. When compared with the April 2002 to March 2003 sample (1,008 GPs) the estimated intra-cluster correlation coefficients were found to be consistent across samples. Conclusions The demonstrated precision and reliability of the estimated intra-cluster correlations indicate that these coefficients will be useful for calculating sample sizes in future general practice surveys that use the GP as the primary sampling unit. PMID:15613248
Wang, Z; Wang, W H; Wang, S L; Jin, J; Song, Y W; Liu, Y P; Ren, H; Fang, H; Tang, Y; Chen, B; Qi, S N; Lu, N N; Li, N; Tang, Y; Liu, X F; Yu, Z H; Li, Y X
2016-06-23
To find phenotypic subgroups of patients with pT1-2N0 invasive breast cancer by means of cluster analysis and estimate the prognosis and clinicopathological features of these subgroups. From 1999 to 2013, 4979 patients with pT1-2N0 invasive breast cancer were recruited for hierarchical clustering analysis. Age (≤40, 41-70, 70+ years), size of primary tumor, pathological type, grade of differentiation, microvascular invasion, estrogen receptor (ER), progesterone receptor (PR) and human epidermal growth factor receptor 2 (HER-2) were chosen as distance metric between patients. Hierarchical cluster analysis was performed using Ward's method. Cophenetic correlation coefficient (CPCC) and Spearman correlation coefficient were used to validate clustering structures. The CPCC was 0.603. The Spearman correlation coefficient was 0.617 (P<0.001), which indicated a good fit of hierarchy to the data. A twelve-cluster model seemed to best illustrate our patient cohort. Patients in cluster 5, 9 and 12 had best prognosis and were characterized by age >40 years, smaller primary tumor, lower histologic grade, positive ER and PR status, and mainly negative HER-2. Patients in the cluster 1 and 11 had the worst prognosis, The cluster 1 was characterized by a larger tumor, higher grade and negative ER and PR status, while the cluster 11 was characterized by positive microvascular invasion. Patients in other 7 clusters had a moderate prognosis, and patients in each cluster had distinctive clinicopathological features and recurrent patterns. This study identified distinctive clinicopathologic phenotypes in a large cohort of patients with pT1-2N0 breast cancer through hierarchical clustering and revealed different prognosis. This integrative model may help physicians to make more personalized decisions regarding adjuvant therapy.
Suppression of vacancy cluster growth in concentrated solid solution alloys
Zhao, Shijun; Velisa, Gihan; Xue, Haizhou; ...
2016-12-13
Large vacancy clusters, such as stacking-fault tetrahedra, are detrimental vacancy-type defects in ion-irradiated structural alloys. Suppression of vacancy cluster formation and growth is highly desirable to improve the irradiation tolerance of these materials. In this paper, we demonstrate that vacancy cluster growth can be inhibited in concentrated solid solution alloys by modifying cluster migration pathways and diffusion kinetics. The alloying effects of Fe and Cr on the migration of vacancy clusters in Ni concentrated alloys are investigated by molecular dynamics simulations and ion irradiation experiment. While the diffusion coefficients of small vacancy clusters in Ni-based binary and ternary solid solutionmore » alloys are higher than in pure Ni, they become lower for large clusters. This observation suggests that large clusters can easily migrate and grow to very large sizes in pure Ni. In contrast, cluster growth is suppressed in solid solution alloys owing to the limited mobility of large vacancy clusters. Finally, the differences in cluster sizes and mobilities in Ni and in solid solution alloys are consistent with the results from ion irradiation experiments.« less
Atomistic modeling of dropwise condensation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sikarwar, B. S., E-mail: bssikarwar@amity.edu; Singh, P. L.; Muralidhar, K.
The basic aim of the atomistic modeling of condensation of water is to determine the size of the stable cluster and connect phenomena occurring at atomic scale to the macroscale. In this paper, a population balance model is described in terms of the rate equations to obtain the number density distribution of the resulting clusters. The residence time is taken to be large enough so that sufficient time is available for all the adatoms existing in vapor-phase to loose their latent heat and get condensed. The simulation assumes clusters of a given size to be formed from clusters of smallermore » sizes, but not by the disintegration of the larger clusters. The largest stable cluster size in the number density distribution is taken to be representative of the minimum drop radius formed in a dropwise condensation process. A numerical confirmation of this result against predictions based on a thermodynamic model has been obtained. Results show that the number density distribution is sensitive to the surface diffusion coefficient and the rate of vapor flux impinging on the substrate. The minimum drop radius increases with the diffusion coefficient and the impinging vapor flux; however, the dependence is weak. The minimum drop radius predicted from thermodynamic considerations matches the prediction of the cluster model, though the former does not take into account the effect of the surface properties on the nucleation phenomena. For a chemically passive surface, the diffusion coefficient and the residence time are dependent on the surface texture via the coefficient of friction. Thus, physical texturing provides a means of changing, within limits, the minimum drop radius. The study reveals that surface texturing at the scale of the minimum drop radius does not provide controllability of the macro-scale dropwise condensation at large timescales when a dynamic steady-state is reached.« less
Cluster structure in the correlation coefficient matrix can be characterized by abnormal eigenvalues
NASA Astrophysics Data System (ADS)
Nie, Chun-Xiao
2018-02-01
In a large number of previous studies, the researchers found that some of the eigenvalues of the financial correlation matrix were greater than the predicted values of the random matrix theory (RMT). Here, we call these eigenvalues as abnormal eigenvalues. In order to reveal the hidden meaning of these abnormal eigenvalues, we study the toy model with cluster structure and find that these eigenvalues are related to the cluster structure of the correlation coefficient matrix. In this paper, model-based experiments show that in most cases, the number of abnormal eigenvalues of the correlation matrix is equal to the number of clusters. In addition, empirical studies show that the sum of the abnormal eigenvalues is related to the clarity of the cluster structure and is negatively correlated with the correlation dimension.
Higher-order clustering in networks
NASA Astrophysics Data System (ADS)
Yin, Hao; Benson, Austin R.; Leskovec, Jure
2018-05-01
A fundamental property of complex networks is the tendency for edges to cluster. The extent of the clustering is typically quantified by the clustering coefficient, which is the probability that a length-2 path is closed, i.e., induces a triangle in the network. However, higher-order cliques beyond triangles are crucial to understanding complex networks, and the clustering behavior with respect to such higher-order network structures is not well understood. Here we introduce higher-order clustering coefficients that measure the closure probability of higher-order network cliques and provide a more comprehensive view of how the edges of complex networks cluster. Our higher-order clustering coefficients are a natural generalization of the traditional clustering coefficient. We derive several properties about higher-order clustering coefficients and analyze them under common random graph models. Finally, we use higher-order clustering coefficients to gain new insights into the structure of real-world networks from several domains.
Thompson, Jennifer A; Fielding, Katherine; Hargreaves, James; Copas, Andrew
2017-12-01
Background/Aims We sought to optimise the design of stepped wedge trials with an equal allocation of clusters to sequences and explored sample size comparisons with alternative trial designs. Methods We developed a new expression for the design effect for a stepped wedge trial, assuming that observations are equally correlated within clusters and an equal number of observations in each period between sequences switching to the intervention. We minimised the design effect with respect to (1) the fraction of observations before the first and after the final sequence switches (the periods with all clusters in the control or intervention condition, respectively) and (2) the number of sequences. We compared the design effect of this optimised stepped wedge trial to the design effects of a parallel cluster-randomised trial, a cluster-randomised trial with baseline observations, and a hybrid trial design (a mixture of cluster-randomised trial and stepped wedge trial) with the same total cluster size for all designs. Results We found that a stepped wedge trial with an equal allocation to sequences is optimised by obtaining all observations after the first sequence switches and before the final sequence switches to the intervention; this means that the first sequence remains in the control condition and the last sequence remains in the intervention condition for the duration of the trial. With this design, the optimal number of sequences is [Formula: see text], where [Formula: see text] is the cluster-mean correlation, [Formula: see text] is the intracluster correlation coefficient, and m is the total cluster size. The optimal number of sequences is small when the intracluster correlation coefficient and cluster size are small and large when the intracluster correlation coefficient or cluster size is large. A cluster-randomised trial remains more efficient than the optimised stepped wedge trial when the intracluster correlation coefficient or cluster size is small. A cluster-randomised trial with baseline observations always requires a larger sample size than the optimised stepped wedge trial. The hybrid design can always give an equally or more efficient design, but will be at most 5% more efficient. We provide a strategy for selecting a design if the optimal number of sequences is unfeasible. For a non-optimal number of sequences, the sample size may be reduced by allowing a proportion of observations before the first or after the final sequence has switched. Conclusion The standard stepped wedge trial is inefficient. To reduce sample sizes when a hybrid design is unfeasible, stepped wedge trial designs should have no observations before the first sequence switches or after the final sequence switches.
Theory of scattering of electromagnetic waves of the microwave range in a turbid medium
NASA Astrophysics Data System (ADS)
Konstantinov, O. V.; Matveentsev, A. V.
2013-02-01
The coefficient of extinction of electromagnetic waves of the microwave range due to their scattering from clusters suspended in an amorphous medium and responsible for turbidity is calculated. Turbidity resembles the case when butter clusters transform water into milk. In the case under investigation, the clusters are conductors (metallic or semiconducting). The extinction coefficient is connected in a familiar way with the cross section of light scattering from an individual cluster. A new formula is derived for the light scattering cross section in the case when damping of oscillations of an electron is due only to spontaneous emission of light quanta. In this case, the resonant scattering cross section for light can be very large. It is shown that this can be observed only in a whisker nanocluster. In addition, the phonon energy on a whisker segment must be higher than the photon energy, which is close to the spacing between the electron energy levels in the cluster.
Berenguer, Roberto; Pastor-Juan, María Del Rosario; Canales-Vázquez, Jesús; Castro-García, Miguel; Villas, María Victoria; Legorburo, Francisco Mansilla; Sabater, Sebastià
2018-04-24
Purpose To identify the reproducible and nonredundant radiomics features (RFs) for computed tomography (CT). Materials and Methods Two phantoms were used to test RF reproducibility by using test-retest analysis, by changing the CT acquisition parameters (hereafter, intra-CT analysis), and by comparing five different scanners with the same CT parameters (hereafter, inter-CT analysis). Reproducible RFs were selected by using the concordance correlation coefficient (as a measure of the agreement between variables) and the coefficient of variation (defined as the ratio of the standard deviation to the mean). Redundant features were grouped by using hierarchical cluster analysis. Results A total of 177 RFs including intensity, shape, and texture features were evaluated. The test-retest analysis showed that 91% (161 of 177) of the RFs were reproducible according to concordance correlation coefficient. Reproducibility of intra-CT RFs, based on coefficient of variation, ranged from 89.3% (151 of 177) to 43.1% (76 of 177) where the pitch factor and the reconstruction kernel were modified, respectively. Reproducibility of inter-CT RFs, based on coefficient of variation, also showed large material differences, from 85.3% (151 of 177; wood) to only 15.8% (28 of 177; polyurethane). Ten clusters were identified after the hierarchical cluster analysis and one RF per cluster was chosen as representative. Conclusion Many RFs were redundant and nonreproducible. If all the CT parameters are fixed except field of view, tube voltage, and milliamperage, then the information provided by the analyzed RFs can be summarized in only 10 RFs (each representing a cluster) because of redundancy. © RSNA, 2018 Online supplemental material is available for this article.
Clustering Coefficients for Correlation Networks.
Masuda, Naoki; Sakaki, Michiko; Ezaki, Takahiro; Watanabe, Takamitsu
2018-01-01
Graph theory is a useful tool for deciphering structural and functional networks of the brain on various spatial and temporal scales. The clustering coefficient quantifies the abundance of connected triangles in a network and is a major descriptive statistics of networks. For example, it finds an application in the assessment of small-worldness of brain networks, which is affected by attentional and cognitive conditions, age, psychiatric disorders and so forth. However, it remains unclear how the clustering coefficient should be measured in a correlation-based network, which is among major representations of brain networks. In the present article, we propose clustering coefficients tailored to correlation matrices. The key idea is to use three-way partial correlation or partial mutual information to measure the strength of the association between the two neighboring nodes of a focal node relative to the amount of pseudo-correlation expected from indirect paths between the nodes. Our method avoids the difficulties of previous applications of clustering coefficient (and other) measures in defining correlational networks, i.e., thresholding on the correlation value, discarding of negative correlation values, the pseudo-correlation problem and full partial correlation matrices whose estimation is computationally difficult. For proof of concept, we apply the proposed clustering coefficient measures to functional magnetic resonance imaging data obtained from healthy participants of various ages and compare them with conventional clustering coefficients. We show that the clustering coefficients decline with the age. The proposed clustering coefficients are more strongly correlated with age than the conventional ones are. We also show that the local variants of the proposed clustering coefficients (i.e., abundance of triangles around a focal node) are useful in characterizing individual nodes. In contrast, the conventional local clustering coefficients were strongly correlated with and therefore may be confounded by the node's connectivity. The proposed methods are expected to help us to understand clustering and lack thereof in correlational brain networks, such as those derived from functional time series and across-participant correlation in neuroanatomical properties.
Clustering Coefficients for Correlation Networks
Masuda, Naoki; Sakaki, Michiko; Ezaki, Takahiro; Watanabe, Takamitsu
2018-01-01
Graph theory is a useful tool for deciphering structural and functional networks of the brain on various spatial and temporal scales. The clustering coefficient quantifies the abundance of connected triangles in a network and is a major descriptive statistics of networks. For example, it finds an application in the assessment of small-worldness of brain networks, which is affected by attentional and cognitive conditions, age, psychiatric disorders and so forth. However, it remains unclear how the clustering coefficient should be measured in a correlation-based network, which is among major representations of brain networks. In the present article, we propose clustering coefficients tailored to correlation matrices. The key idea is to use three-way partial correlation or partial mutual information to measure the strength of the association between the two neighboring nodes of a focal node relative to the amount of pseudo-correlation expected from indirect paths between the nodes. Our method avoids the difficulties of previous applications of clustering coefficient (and other) measures in defining correlational networks, i.e., thresholding on the correlation value, discarding of negative correlation values, the pseudo-correlation problem and full partial correlation matrices whose estimation is computationally difficult. For proof of concept, we apply the proposed clustering coefficient measures to functional magnetic resonance imaging data obtained from healthy participants of various ages and compare them with conventional clustering coefficients. We show that the clustering coefficients decline with the age. The proposed clustering coefficients are more strongly correlated with age than the conventional ones are. We also show that the local variants of the proposed clustering coefficients (i.e., abundance of triangles around a focal node) are useful in characterizing individual nodes. In contrast, the conventional local clustering coefficients were strongly correlated with and therefore may be confounded by the node's connectivity. The proposed methods are expected to help us to understand clustering and lack thereof in correlational brain networks, such as those derived from functional time series and across-participant correlation in neuroanatomical properties. PMID:29599714
Mobility of large clusters on a semiconductor surface: Kinetic Monte Carlo simulation results
NASA Astrophysics Data System (ADS)
M, Esen; A, T. Tüzemen; M, Ozdemir
2016-01-01
The mobility of clusters on a semiconductor surface for various values of cluster size is studied as a function of temperature by kinetic Monte Carlo method. The cluster resides on the surface of a square grid. Kinetic processes such as the diffusion of single particles on the surface, their attachment and detachment to/from clusters, diffusion of particles along cluster edges are considered. The clusters considered in this study consist of 150-6000 atoms per cluster on average. A statistical probability of motion to each direction is assigned to each particle where a particle with four nearest neighbors is assumed to be immobile. The mobility of a cluster is found from the root mean square displacement of the center of mass of the cluster as a function of time. It is found that the diffusion coefficient of clusters goes as D = A(T)Nα where N is the average number of particles in the cluster, A(T) is a temperature-dependent constant and α is a parameter with a value of about -0.64 < α < -0.75. The value of α is found to be independent of cluster sizes and temperature values (170-220 K) considered in this study. As the diffusion along the perimeter of the cluster becomes prohibitive, the exponent approaches a value of -0.5. The diffusion coefficient is found to change by one order of magnitude as a function of cluster size.
Moerbeek, Mirjam; van Schie, Sander
2016-07-11
The number of clusters in a cluster randomized trial is often low. It is therefore likely random assignment of clusters to treatment conditions results in covariate imbalance. There are no studies that quantify the consequences of covariate imbalance in cluster randomized trials on parameter and standard error bias and on power to detect treatment effects. The consequences of covariance imbalance in unadjusted and adjusted linear mixed models are investigated by means of a simulation study. The factors in this study are the degree of imbalance, the covariate effect size, the cluster size and the intraclass correlation coefficient. The covariate is binary and measured at the cluster level; the outcome is continuous and measured at the individual level. The results show covariate imbalance results in negligible parameter bias and small standard error bias in adjusted linear mixed models. Ignoring the possibility of covariate imbalance while calculating the sample size at the cluster level may result in a loss in power of at most 25 % in the adjusted linear mixed model. The results are more severe for the unadjusted linear mixed model: parameter biases up to 100 % and standard error biases up to 200 % may be observed. Power levels based on the unadjusted linear mixed model are often too low. The consequences are most severe for large clusters and/or small intraclass correlation coefficients since then the required number of clusters to achieve a desired power level is smallest. The possibility of covariate imbalance should be taken into account while calculating the sample size of a cluster randomized trial. Otherwise more sophisticated methods to randomize clusters to treatments should be used, such as stratification or balance algorithms. All relevant covariates should be carefully identified, be actually measured and included in the statistical model to avoid severe levels of parameter and standard error bias and insufficient power levels.
Kim, Jiyu; Jung, Inkyung
2017-01-01
Spatial scan statistics with circular or elliptic scanning windows are commonly used for cluster detection in various applications, such as the identification of geographical disease clusters from epidemiological data. It has been pointed out that the method may have difficulty in correctly identifying non-compact, arbitrarily shaped clusters. In this paper, we evaluated the Gini coefficient for detecting irregularly shaped clusters through a simulation study. The Gini coefficient, the use of which in spatial scan statistics was recently proposed, is a criterion measure for optimizing the maximum reported cluster size. Our simulation study results showed that using the Gini coefficient works better than the original spatial scan statistic for identifying irregularly shaped clusters, by reporting an optimized and refined collection of clusters rather than a single larger cluster. We have provided a real data example that seems to support the simulation results. We think that using the Gini coefficient in spatial scan statistics can be helpful for the detection of irregularly shaped clusters. PMID:28129368
Genome-scale cluster analysis of replicated microarrays using shrinkage correlation coefficient.
Yao, Jianchao; Chang, Chunqi; Salmi, Mari L; Hung, Yeung Sam; Loraine, Ann; Roux, Stanley J
2008-06-18
Currently, clustering with some form of correlation coefficient as the gene similarity metric has become a popular method for profiling genomic data. The Pearson correlation coefficient and the standard deviation (SD)-weighted correlation coefficient are the two most widely-used correlations as the similarity metrics in clustering microarray data. However, these two correlations are not optimal for analyzing replicated microarray data generated by most laboratories. An effective correlation coefficient is needed to provide statistically sufficient analysis of replicated microarray data. In this study, we describe a novel correlation coefficient, shrinkage correlation coefficient (SCC), that fully exploits the similarity between the replicated microarray experimental samples. The methodology considers both the number of replicates and the variance within each experimental group in clustering expression data, and provides a robust statistical estimation of the error of replicated microarray data. The value of SCC is revealed by its comparison with two other correlation coefficients that are currently the most widely-used (Pearson correlation coefficient and SD-weighted correlation coefficient) using statistical measures on both synthetic expression data as well as real gene expression data from Saccharomyces cerevisiae. Two leading clustering methods, hierarchical and k-means clustering were applied for the comparison. The comparison indicated that using SCC achieves better clustering performance. Applying SCC-based hierarchical clustering to the replicated microarray data obtained from germinating spores of the fern Ceratopteris richardii, we discovered two clusters of genes with shared expression patterns during spore germination. Functional analysis suggested that some of the genetic mechanisms that control germination in such diverse plant lineages as mosses and angiosperms are also conserved among ferns. This study shows that SCC is an alternative to the Pearson correlation coefficient and the SD-weighted correlation coefficient, and is particularly useful for clustering replicated microarray data. This computational approach should be generally useful for proteomic data or other high-throughput analysis methodology.
Slow-Down in Diffusion in Crowded Protein Solutions Correlates with Transient Cluster Formation.
Nawrocki, Grzegorz; Wang, Po-Hung; Yu, Isseki; Sugita, Yuji; Feig, Michael
2017-12-14
For a long time, the effect of a crowded cellular environment on protein dynamics has been largely ignored. Recent experiments indicate that proteins diffuse more slowly in a living cell than in a diluted solution, and further studies suggest that the diffusion depends on the local surroundings. Here, detailed insight into how diffusion depends on protein-protein contacts is presented based on extensive all-atom molecular dynamics simulations of concentrated villin headpiece solutions. After force field adjustments in the form of increased protein-water interactions to reproduce experimental data, translational and rotational diffusion was analyzed in detail. Although internal protein dynamics remained largely unaltered, rotational diffusion was found to slow down more significantly than translational diffusion as the protein concentration increased. The decrease in diffusion is interpreted in terms of a transient formation of protein clusters. These clusters persist on sub-microsecond time scales and follow distributions that increasingly shift toward larger cluster size with increasing protein concentrations. Weighting diffusion coefficients estimated for different clusters extracted from the simulations with the distribution of clusters largely reproduces the overall observed diffusion rates, suggesting that transient cluster formation is a primary cause for a slow-down in diffusion upon crowding with other proteins.
Stochastic theory of log-periodic patterns
NASA Astrophysics Data System (ADS)
Canessa, Enrique
2000-12-01
We introduce an analytical model based on birth-death clustering processes to help in understanding the empirical log-periodic corrections to power law scaling and the finite-time singularity as reported in several domains including rupture, earthquakes, world population and financial systems. In our stochastic theory log-periodicities are a consequence of transient clusters induced by an entropy-like term that may reflect the amount of co-operative information carried by the state of a large system of different species. The clustering completion rates for the system are assumed to be given by a simple linear death process. The singularity at t0 is derived in terms of birth-death clustering coefficients.
Optimizing the maximum reported cluster size in the spatial scan statistic for ordinal data.
Kim, Sehwi; Jung, Inkyung
2017-01-01
The spatial scan statistic is an important tool for spatial cluster detection. There have been numerous studies on scanning window shapes. However, little research has been done on the maximum scanning window size or maximum reported cluster size. Recently, Han et al. proposed to use the Gini coefficient to optimize the maximum reported cluster size. However, the method has been developed and evaluated only for the Poisson model. We adopt the Gini coefficient to be applicable to the spatial scan statistic for ordinal data to determine the optimal maximum reported cluster size. Through a simulation study and application to a real data example, we evaluate the performance of the proposed approach. With some sophisticated modification, the Gini coefficient can be effectively employed for the ordinal model. The Gini coefficient most often picked the optimal maximum reported cluster sizes that were the same as or smaller than the true cluster sizes with very high accuracy. It seems that we can obtain a more refined collection of clusters by using the Gini coefficient. The Gini coefficient developed specifically for the ordinal model can be useful for optimizing the maximum reported cluster size for ordinal data and helpful for properly and informatively discovering cluster patterns.
Optimizing the maximum reported cluster size in the spatial scan statistic for ordinal data
Kim, Sehwi
2017-01-01
The spatial scan statistic is an important tool for spatial cluster detection. There have been numerous studies on scanning window shapes. However, little research has been done on the maximum scanning window size or maximum reported cluster size. Recently, Han et al. proposed to use the Gini coefficient to optimize the maximum reported cluster size. However, the method has been developed and evaluated only for the Poisson model. We adopt the Gini coefficient to be applicable to the spatial scan statistic for ordinal data to determine the optimal maximum reported cluster size. Through a simulation study and application to a real data example, we evaluate the performance of the proposed approach. With some sophisticated modification, the Gini coefficient can be effectively employed for the ordinal model. The Gini coefficient most often picked the optimal maximum reported cluster sizes that were the same as or smaller than the true cluster sizes with very high accuracy. It seems that we can obtain a more refined collection of clusters by using the Gini coefficient. The Gini coefficient developed specifically for the ordinal model can be useful for optimizing the maximum reported cluster size for ordinal data and helpful for properly and informatively discovering cluster patterns. PMID:28753674
Genetic diversity of popcorn genotypes using molecular analysis.
Resh, F S; Scapim, C A; Mangolin, C A; Machado, M F P S; do Amaral, A T; Ramos, H C C; Vivas, M
2015-08-19
In this study, we analyzed dominant molecular markers to estimate the genetic divergence of 26 popcorn genotypes and evaluate whether using various dissimilarity coefficients with these dominant markers influences the results of cluster analysis. Fifteen random amplification of polymorphic DNA primers produced 157 amplified fragments, of which 65 were monomorphic and 92 were polymorphic. To calculate the genetic distances among the 26 genotypes, the complements of the Jaccard, Dice, and Rogers and Tanimoto similarity coefficients were used. A matrix of Dij values (dissimilarity matrix) was constructed, from which the genetic distances among genotypes were represented in a more simplified manner as a dendrogram generated using the unweighted pair-group method with arithmetic average. Clusters determined by molecular analysis generally did not group material from the same parental origin together. The largest genetic distance was between varieties 17 (UNB-2) and 18 (PA-091). In the identification of genotypes with the smallest genetic distance, the 3 coefficients showed no agreement. The 3 dissimilarity coefficients showed no major differences among their grouping patterns because agreement in determining the genotypes with large, medium, and small genetic distances was high. The largest genetic distances were observed for the Rogers and Tanimoto dissimilarity coefficient (0.74), followed by the Jaccard coefficient (0.65) and the Dice coefficient (0.48). The 3 coefficients showed similar estimations for the cophenetic correlation coefficient. Correlations among the matrices generated using the 3 coefficients were positive and had high magnitudes, reflecting strong agreement among the results obtained using the 3 evaluated dissimilarity coefficients.
Yang, Huan; Goudeli, Eirini; Hogan, Christopher J.
2018-04-24
In gas phase synthesis systems, clusters form and grow via condensation, in which a monomer binds to an existing cluster. While a hard sphere equation is frequently used to predict the condensation rate coefficient, this equation neglects the influences of potential interactions and cluster internal energy on the condensation process. Here, we present a collision rate theory-Molecular Dynamics simulation approach to calculate condensation probabilities and condensation rate coefficients; we use this approach to examine atomic condensation onto 6-56 atom Au and Mg clusters. The probability of condensation depends upon the initial relative velocity ( v) between atom and cluster andmore » the initial impact parameter ( b). In all cases there is a well-defined region of b-v space where condensation is highly probable, and outside of which the condensation probability drops to zero. For Au clusters with more than 10 atoms, we find that at gas temperatures in the 300-1200 K range, the condensation rate coefficient exceeds the hard sphere rate coefficient by a factor of 1.5-2.0. Conversely, for Au clusters with 10 or fewer atoms, and for 14 atom and 28 atom Mg clusters, as cluster equilibration temperature increases the condensation rate coefficient drops to values below the hard sphere rate coefficient. Calculations also yield the self-dissociation rate coefficient, which is found to vary considerably with gas temperature. Finally, calculations results reveal that grazing (high b) atom-cluster collisions at elevated velocity (> 1000 m s -1) can result in the colliding atom rebounding (bounce) from the cluster surface or binding while another atom dissociates (replacement). In conclusion, the presented method can be applied in developing rate equations to predict material formation and growth rates in vapor phase systems.« less
Yang, Huan; Goudeli, Eirini; Hogan, Christopher J
2018-04-28
In gas phase synthesis systems, clusters form and grow via condensation, in which a monomer binds to an existing cluster. While a hard-sphere equation is frequently used to predict the condensation rate coefficient, this equation neglects the influences of potential interactions and cluster internal energy on the condensation process. Here, we present a collision rate theory-molecular dynamics simulation approach to calculate condensation probabilities and condensation rate coefficients. We use this approach to examine atomic condensation onto 6-56-atom Au and Mg clusters. The probability of condensation depends upon the initial relative velocity (v) between atom and cluster and the initial impact parameter (b). In all cases, there is a well-defined region of b-v space where condensation is highly probable, and outside of which the condensation probability drops to zero. For Au clusters with more than 10 atoms, we find that at gas temperatures in the 300-1200 K range, the condensation rate coefficient exceeds the hard-sphere rate coefficient by a factor of 1.5-2.0. Conversely, for Au clusters with 10 or fewer atoms and for 14- and 28-atom Mg clusters, as cluster equilibration temperature increases, the condensation rate coefficient drops to values below the hard-sphere rate coefficient. Calculations also yield the self-dissociation rate coefficient, which is found to vary considerably with gas temperature. Finally, calculations results reveal that grazing (high b) atom-cluster collisions at elevated velocity (>1000 m s -1 ) can result in the colliding atom rebounding (bounce) from the cluster surface or binding while another atom dissociates (replacement). The presented method can be applied in developing rate equations to predict material formation and growth rates in vapor phase systems.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Huan; Goudeli, Eirini; Hogan, Christopher J.
In gas phase synthesis systems, clusters form and grow via condensation, in which a monomer binds to an existing cluster. While a hard sphere equation is frequently used to predict the condensation rate coefficient, this equation neglects the influences of potential interactions and cluster internal energy on the condensation process. Here, we present a collision rate theory-Molecular Dynamics simulation approach to calculate condensation probabilities and condensation rate coefficients; we use this approach to examine atomic condensation onto 6-56 atom Au and Mg clusters. The probability of condensation depends upon the initial relative velocity ( v) between atom and cluster andmore » the initial impact parameter ( b). In all cases there is a well-defined region of b-v space where condensation is highly probable, and outside of which the condensation probability drops to zero. For Au clusters with more than 10 atoms, we find that at gas temperatures in the 300-1200 K range, the condensation rate coefficient exceeds the hard sphere rate coefficient by a factor of 1.5-2.0. Conversely, for Au clusters with 10 or fewer atoms, and for 14 atom and 28 atom Mg clusters, as cluster equilibration temperature increases the condensation rate coefficient drops to values below the hard sphere rate coefficient. Calculations also yield the self-dissociation rate coefficient, which is found to vary considerably with gas temperature. Finally, calculations results reveal that grazing (high b) atom-cluster collisions at elevated velocity (> 1000 m s -1) can result in the colliding atom rebounding (bounce) from the cluster surface or binding while another atom dissociates (replacement). In conclusion, the presented method can be applied in developing rate equations to predict material formation and growth rates in vapor phase systems.« less
NASA Astrophysics Data System (ADS)
Yang, Huan; Goudeli, Eirini; Hogan, Christopher J.
2018-04-01
In gas phase synthesis systems, clusters form and grow via condensation, in which a monomer binds to an existing cluster. While a hard-sphere equation is frequently used to predict the condensation rate coefficient, this equation neglects the influences of potential interactions and cluster internal energy on the condensation process. Here, we present a collision rate theory-molecular dynamics simulation approach to calculate condensation probabilities and condensation rate coefficients. We use this approach to examine atomic condensation onto 6-56-atom Au and Mg clusters. The probability of condensation depends upon the initial relative velocity (v) between atom and cluster and the initial impact parameter (b). In all cases, there is a well-defined region of b-v space where condensation is highly probable, and outside of which the condensation probability drops to zero. For Au clusters with more than 10 atoms, we find that at gas temperatures in the 300-1200 K range, the condensation rate coefficient exceeds the hard-sphere rate coefficient by a factor of 1.5-2.0. Conversely, for Au clusters with 10 or fewer atoms and for 14- and 28-atom Mg clusters, as cluster equilibration temperature increases, the condensation rate coefficient drops to values below the hard-sphere rate coefficient. Calculations also yield the self-dissociation rate coefficient, which is found to vary considerably with gas temperature. Finally, calculations results reveal that grazing (high b) atom-cluster collisions at elevated velocity (>1000 m s-1) can result in the colliding atom rebounding (bounce) from the cluster surface or binding while another atom dissociates (replacement). The presented method can be applied in developing rate equations to predict material formation and growth rates in vapor phase systems.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Jianbao; Ma, Zhongjun, E-mail: mzj1234402@163.com; Chen, Guanrong
All edges in the classical Watts and Strogatz's small-world network model are unweighted and cooperative (positive). By introducing competitive (negative) inter-cluster edges and assigning edge weights to mimic more realistic networks, this paper develops a modified model which possesses co-competitive weighted couplings and cluster structures while maintaining the common small-world network properties of small average shortest path lengths and large clustering coefficients. Based on theoretical analysis, it is proved that the new model with inter-cluster co-competition balance has an important dynamical property of robust cluster synchronous pattern formation. More precisely, clusters will neither merge nor split regardless of adding ormore » deleting nodes and edges, under the condition of inter-cluster co-competition balance. Numerical simulations demonstrate the robustness of the model against the increase of the coupling strength and several topological variations.« less
NASA Astrophysics Data System (ADS)
Zhang, Jianbao; Ma, Zhongjun; Chen, Guanrong
2014-06-01
All edges in the classical Watts and Strogatz's small-world network model are unweighted and cooperative (positive). By introducing competitive (negative) inter-cluster edges and assigning edge weights to mimic more realistic networks, this paper develops a modified model which possesses co-competitive weighted couplings and cluster structures while maintaining the common small-world network properties of small average shortest path lengths and large clustering coefficients. Based on theoretical analysis, it is proved that the new model with inter-cluster co-competition balance has an important dynamical property of robust cluster synchronous pattern formation. More precisely, clusters will neither merge nor split regardless of adding or deleting nodes and edges, under the condition of inter-cluster co-competition balance. Numerical simulations demonstrate the robustness of the model against the increase of the coupling strength and several topological variations.
Sparse subspace clustering for data with missing entries and high-rank matrix completion.
Fan, Jicong; Chow, Tommy W S
2017-09-01
Many methods have recently been proposed for subspace clustering, but they are often unable to handle incomplete data because of missing entries. Using matrix completion methods to recover missing entries is a common way to solve the problem. Conventional matrix completion methods require that the matrix should be of low-rank intrinsically, but most matrices are of high-rank or even full-rank in practice, especially when the number of subspaces is large. In this paper, a new method called Sparse Representation with Missing Entries and Matrix Completion is proposed to solve the problems of incomplete-data subspace clustering and high-rank matrix completion. The proposed algorithm alternately computes the matrix of sparse representation coefficients and recovers the missing entries of a data matrix. The proposed algorithm recovers missing entries through minimizing the representation coefficients, representation errors, and matrix rank. Thorough experimental study and comparative analysis based on synthetic data and natural images were conducted. The presented results demonstrate that the proposed algorithm is more effective in subspace clustering and matrix completion compared with other existing methods. Copyright © 2017 Elsevier Ltd. All rights reserved.
The degree-related clustering coefficient and its application to link prediction
NASA Astrophysics Data System (ADS)
Liu, Yangyang; Zhao, Chengli; Wang, Xiaojie; Huang, Qiangjuan; Zhang, Xue; Yi, Dongyun
2016-07-01
Link prediction plays a significant role in explaining the evolution of networks. However it is still a challenging problem that has been addressed only with topological information in recent years. Based on the belief that network nodes with a great number of common neighbors are more likely to be connected, many similarity indices have achieved considerable accuracy and efficiency. Motivated by the natural assumption that the effect of missing links on the estimation of a node's clustering ability could be related to node degree, in this paper, we propose a degree-related clustering coefficient index to quantify the clustering ability of nodes. Unlike the classical clustering coefficient, our new coefficient is highly robust when the observed bias of links is considered. Furthermore, we propose a degree-related clustering ability path (DCP) index, which applies the proposed coefficient to the link prediction problem. Experiments on 12 real-world networks show that our proposed method is highly accurate and robust compared with four common-neighbor-based similarity indices (Common Neighbors(CN), Adamic-Adar(AA), Resource Allocation(RA), and Preferential Attachment(PA)), and the recently introduced clustering ability (CA) index.
Effects of global financial crisis on network structure in a local stock market
NASA Astrophysics Data System (ADS)
Nobi, Ashadun; Maeng, Seong Eun; Ha, Gyeong Gyun; Lee, Jae Woo
2014-08-01
This study considers the effects of the 2008 global financial crisis on threshold networks of a local Korean financial market around the time of the crisis. Prices of individual stocks belonging to KOSPI 200 (Korea Composite Stock Price Index 200) are considered for three time periods, namely before, during, and after the crisis. Threshold networks are constructed from fully connected cross-correlation networks, and thresholds of cross-correlation coefficients are assigned to obtain threshold networks. At the high threshold, only one large cluster consisting of firms in the financial sector, heavy industry, and construction is observed during the crisis. However, before and after the crisis, there are several fragmented clusters belonging to various sectors. The power law of the degree distribution in threshold networks is observed within the limited range of thresholds. Threshold networks are fatter during the crisis than before or after the crisis. The clustering coefficient of the threshold network follows the power law in the scaling range.
Network community structure and loop coefficient method
NASA Astrophysics Data System (ADS)
Vragović, I.; Louis, E.
2006-07-01
A modular structure, in which groups of tightly connected nodes could be resolved as separate entities, is a property that can be found in many complex networks. In this paper, we propose a algorithm for identifying communities in networks. It is based on a local measure, so-called loop coefficient that is a generalization of the clustering coefficient. Nodes with a large loop coefficient tend to be core inner community nodes, while other vertices are usually peripheral sites at the borders of communities. Our method gives satisfactory results for both artificial and real-world graphs, if they have a relatively pronounced modular structure. This type of algorithm could open a way of interpreting the role of nodes in communities in terms of the local loop coefficient, and could be used as a complement to other methods.
Social Network Analysis in Frontier Capital Markets
2012-06-01
in a network [Bor03]. 3.6 Clustering Coefficient The clustering coefficient developed by Watts and Strogatz measures the extent to which clusters or...Distance 2.6255 2.6754 2.4074 Fragmentation 0.2812 0.2263 0.0442 Clustering Coefficient Watts- Strogatz 0.8039 0.8222 0.7227 Total Degree Centralization... Strogatz 0.5281 0.6607 0.6360 Total Degree Centralization 0.0153 0.0360 0.0171 Betweenness Centralization 0.1133 0.1574 0.1849 Closeness Centralization
Automated flow cytometric analysis across large numbers of samples and cell types.
Chen, Xiaoyi; Hasan, Milena; Libri, Valentina; Urrutia, Alejandra; Beitz, Benoît; Rouilly, Vincent; Duffy, Darragh; Patin, Étienne; Chalmond, Bernard; Rogge, Lars; Quintana-Murci, Lluis; Albert, Matthew L; Schwikowski, Benno
2015-04-01
Multi-parametric flow cytometry is a key technology for characterization of immune cell phenotypes. However, robust high-dimensional post-analytic strategies for automated data analysis in large numbers of donors are still lacking. Here, we report a computational pipeline, called FlowGM, which minimizes operator input, is insensitive to compensation settings, and can be adapted to different analytic panels. A Gaussian Mixture Model (GMM)-based approach was utilized for initial clustering, with the number of clusters determined using Bayesian Information Criterion. Meta-clustering in a reference donor permitted automated identification of 24 cell types across four panels. Cluster labels were integrated into FCS files, thus permitting comparisons to manual gating. Cell numbers and coefficient of variation (CV) were similar between FlowGM and conventional gating for lymphocyte populations, but notably FlowGM provided improved discrimination of "hard-to-gate" monocyte and dendritic cell (DC) subsets. FlowGM thus provides rapid high-dimensional analysis of cell phenotypes and is amenable to cohort studies. Copyright © 2015. Published by Elsevier Inc.
Modeling of the Modulation by Buffers of Ca2+ Release through Clusters of IP3 Receptors
Zeller, S.; Rüdiger, S.; Engel, H.; Sneyd, J.; Warnecke, G.; Parker, I.; Falcke, M.
2009-01-01
Abstract Intracellular Ca2+ release is a versatile second messenger system. It is modeled here by reaction-diffusion equations for the free Ca2+ and Ca2+ buffers, with spatially discrete clusters of stochastic IP3 receptor channels (IP3Rs) controlling the release of Ca2+ from the endoplasmic reticulum. IP3Rs are activated by a small rise of the cytosolic Ca2+ concentration and inhibited by large concentrations. Buffering of cytosolic Ca2+ shapes global Ca2+ transients. Here we use a model to investigate the effect of buffers with slow and fast reaction rates on single release spikes. We find that, depending on their diffusion coefficient, fast buffers can either decouple clusters or delay inhibition. Slow buffers have little effect on Ca2+ release, but affect the time course of the signals from the fluorescent Ca2+ indicator mainly by competing for Ca2+. At low [IP3], fast buffers suppress fluorescence signals, slow buffers increase the contrast between bulk signals and signals at open clusters, and large concentrations of buffers, either fast or slow, decouple clusters. PMID:19686646
Wedge sampling for computing clustering coefficients and triangle counts on large graphs
Seshadhri, C.; Pinar, Ali; Kolda, Tamara G.
2014-05-08
Graphs are used to model interactions in a variety of contexts, and there is a growing need to quickly assess the structure of such graphs. Some of the most useful graph metrics are based on triangles, such as those measuring social cohesion. Despite the importance of these triadic measures, algorithms to compute them can be extremely expensive. We discuss the method of wedge sampling. This versatile technique allows for the fast and accurate approximation of various types of clustering coefficients and triangle counts. Furthermore, these techniques are extensible to counting directed triangles in digraphs. Our methods come with provable andmore » practical time-approximation tradeoffs for all computations. We provide extensive results that show our methods are orders of magnitude faster than the state of the art, while providing nearly the accuracy of full enumeration.« less
Segregation of large granules from close-packed cluster of small granules due to buoyancy.
Yang, Xian-qing; Zhou, Kun; Qiu, Kang; Zhao, Yue-min
2006-03-01
Segregation of large granules in a vibrofluidized granular bed with inhomogeneous granular number density distribution is studied by an event-driven algorithm. Simulation results show that the mean vertical position of large granules decreases with the increase of the density ration of the large granules to the small ones. This conclusion is consistent with the explanation that the net pressure due to the small surrounding particle impacts balances the large granular weight, and indict that the upward movement of the large granules is driven by the buoyancy. The values of temperature, density, and pressure of the systems are also computed by changing the conditions such as heating temperature on the bottom and restitution coefficient of particles. These results indicate that the segregation of large granules also happen in the systems with density inversion or even close-packed cluster of particles floating on a low-density fluid, due to the buoyancy. An equation of state is proposed to explain the buoyancy.
Srinivasan, A.; Galbán, C.J.; Johnson, T.D.; Chenevert, T.L.; Ross, B.D.; Mukherji, S.K.
2014-01-01
Purpose The objective of our study was to analyze the differences between apparent diffusion coefficient (ADC) partitions (created using the K-Means algorithm) between benign and malignant neck lesions and evaluate its benefit in distinguishing these entities. Material and methods MRI studies of 10 benign and 10 malignant proven neck pathologies were post-processed on a PC using in-house software developed in MATLAB (The MathWorks, Inc., Natick, MA). Lesions were manually contoured by two neuroradiologists with the ADC values within each lesion clustered into two (low ADC-ADCL, high ADC-ADCH) and three partitions (ADCL, intermediate ADC-ADCI, ADCH) using the K-Means clustering algorithm. An unpaired two-tailed Student’s t-test was performed for all metrics to determine statistical differences in the means between the benign and malignant pathologies. Results Statistically significant difference between the mean ADCL clusters in benign and malignant pathologies was seen in the 3 cluster models of both readers (p=0.03, 0.022 respectively) and the 2 cluster model of reader 2 (p=0.04) with the other metrics (ADCH, ADCI, whole lesion mean ADC) not revealing any significant differences. Receiver operating characteristics curves demonstrated the quantitative difference in mean ADCH and ADCL in both the 2 and 3 cluster models to be predictive of malignancy (2 clusters: p=0.008, area under curve=0.850, 3 clusters: p=0.01, area under curve=0.825). Conclusion The K-Means clustering algorithm that generates partitions of large datasets may provide a better characterization of neck pathologies and may be of additional benefit in distinguishing benign and malignant neck pathologies compared to whole lesion mean ADC alone. PMID:20007723
Designing Large-Scale Multisite and Cluster-Randomized Studies of Professional Development
ERIC Educational Resources Information Center
Kelcey, Ben; Spybrook, Jessaca; Phelps, Geoffrey; Jones, Nathan; Zhang, Jiaqi
2017-01-01
We develop a theoretical and empirical basis for the design of teacher professional development studies. We build on previous work by (a) developing estimates of intraclass correlation coefficients for teacher outcomes using two- and three-level data structures, (b) developing estimates of the variance explained by covariates, and (c) modifying…
NASA Astrophysics Data System (ADS)
Antoni, R.; Passard, C.; Perot, B.; Guillaumin, F.; Mazy, C.; Batifol, M.; Grassi, G.
2018-07-01
AREVA NC is preparing to process, characterize and compact old used fuel metallic waste stored at La Hague reprocessing plant in view of their future storage ("Haute Activité Oxyde" HAO project). For a large part of these historical wastes, the packaging is planned in CSD-C canisters ("Colis Standard de Déchets Compacté s") in the ACC hulls and nozzles compaction facility ("Atelier de Compactage des Coques et embouts"). . This paper presents a new method to take into account the possible presence of fissile material clusters, which may have a significant impact in the active neutron interrogation (Differential Die-away Technique) measurement of the CSD-C canisters, in the industrial neutron measurement station "P2-2". A matrix effect correction has already been investigated to predict the prompt fission neutron calibration coefficient (which provides the fissile mass) from an internal "drum flux monitor" signal provided during the active measurement by a boron-coated proportional counter located in the measurement cavity, and from a "drum transmission signal" recorded in passive mode by the detection blocks, in presence of an AmBe point source in the measurement cell. Up to now, the relationship between the calibration coefficient and these signals was obtained from a factorial design that did not consider the potential for occurrence of fissile material clusters. The interrogative neutron self-shielding in these clusters was treated separately and resulted in a penalty coefficient larger than 20% to prevent an underestimation of the fissile mass within the drum. In this work, we have shown that the incorporation of a new parameter in the factorial design, representing the fissile mass fraction in these clusters, provides an alternative to the penalty coefficient. This new approach finally does not degrade the uncertainty of the original prediction, which was calculated without taking into consideration the possible presence of clusters. Consequently, the accuracy of the fissile mass assessment is improved by this new method, and this last should be extended to similar DDT measurement stations of larger drums, also using an internal monitor for matrix effect correction.
Hao, Dapeng; Ren, Cong; Li, Chuanxing
2012-05-01
A central idea in biology is the hierarchical organization of cellular processes. A commonly used method to identify the hierarchical modular organization of network relies on detecting a global signature known as variation of clustering coefficient (so-called modularity scaling). Although several studies have suggested other possible origins of this signature, it is still widely used nowadays to identify hierarchical modularity, especially in the analysis of biological networks. Therefore, a further and systematical investigation of this signature for different types of biological networks is necessary. We analyzed a variety of biological networks and found that the commonly used signature of hierarchical modularity is actually the reflection of spoke-like topology, suggesting a different view of network architecture. We proved that the existence of super-hubs is the origin that the clustering coefficient of a node follows a particular scaling law with degree k in metabolic networks. To study the modularity of biological networks, we systematically investigated the relationship between repulsion of hubs and variation of clustering coefficient. We provided direct evidences for repulsion between hubs being the underlying origin of the variation of clustering coefficient, and found that for biological networks having no anti-correlation between hubs, such as gene co-expression network, the clustering coefficient doesn't show dependence of degree. Here we have shown that the variation of clustering coefficient is neither sufficient nor exclusive for a network to be hierarchical. Our results suggest the existence of spoke-like modules as opposed to "deterministic model" of hierarchical modularity, and suggest the need to reconsider the organizational principle of biological hierarchy.
Cascading failure in scale-free networks with tunable clustering
NASA Astrophysics Data System (ADS)
Zhang, Xue-Jun; Gu, Bo; Guan, Xiang-Min; Zhu, Yan-Bo; Lv, Ren-Li
2016-02-01
Cascading failure is ubiquitous in many networked infrastructure systems, such as power grids, Internet and air transportation systems. In this paper, we extend the cascading failure model to a scale-free network with tunable clustering and focus on the effect of clustering coefficient on system robustness. It is found that the network robustness undergoes a nonmonotonic transition with the increment of clustering coefficient: both highly and lowly clustered networks are fragile under the intentional attack, and the network with moderate clustering coefficient can better resist the spread of cascading. We then provide an extensive explanation for this constructive phenomenon via the microscopic point of view and quantitative analysis. Our work can be useful to the design and optimization of infrastructure systems.
Melnikov, Sergey M; Stein, Matthias
2018-03-15
CO 2 sequestration from anthropogenic resources is a challenge to the design of environmental processes at a large scale. Reversible chemical absorption by amine-based solvents is one of the most efficient methods of CO 2 removal. Molecular simulation techniques are very useful tools to investigate CO 2 binding by aqueous alkanolamine molecules for further technological application. In the present work, we have performed detailed atomistic molecular dynamics simulations of aqueous solutions of three prototype amines: monoethanolamine (MEA) as a standard, 3-aminopropanol (MPA), 2-methylaminoethanol (MMEA), and 4-diethylamino-2-butanol (DEAB) as potential novel CO 2 absorptive solvents. Solvent densities, radial distribution functions, cluster size distributions, hydrogen-bonding statistics, and diffusion coefficients for a full range of mixture compositions have been obtained. The solvent densities and diffusion coefficients from simulations are in good agreement with those in the experiment. In aqueous solution, MEA, MPA, and MMEA molecules prefer to be fully solvated by water molecules, whereas DEAB molecules tend to self-aggregate. In a range from 30/70-50/50 (w/w) alkanolamine/water mixtures, they form a bicontinuous phase (both alkanolamine and water are organized in two mutually percolating clusters). Among the studied aqueous alkanolamine solutions, the diffusion coefficients decrease in the following order MEA > MPA = MMEA > DEAB. With an increase of water content, the diffusion coefficients increase for all studied alkanolamines. The presented results are a first step for process-scale simulation and provide important qualitative and quantitative information for the design and engineering of efficient new CO 2 removal processes.
On Learning Cluster Coefficient of Private Networks
Wang, Yue; Wu, Xintao; Zhu, Jun; Xiang, Yang
2013-01-01
Enabling accurate analysis of social network data while preserving differential privacy has been challenging since graph features such as clustering coefficient or modularity often have high sensitivity, which is different from traditional aggregate functions (e.g., count and sum) on tabular data. In this paper, we treat a graph statistics as a function f and develop a divide and conquer approach to enforce differential privacy. The basic procedure of this approach is to first decompose the target computation f into several less complex unit computations f1, …, fm connected by basic mathematical operations (e.g., addition, subtraction, multiplication, division), then perturb the output of each fi with Laplace noise derived from its own sensitivity value and the distributed privacy threshold εi, and finally combine those perturbed fi as the perturbed output of computation f. We examine how various operations affect the accuracy of complex computations. When unit computations have large global sensitivity values, we enforce the differential privacy by calibrating noise based on the smooth sensitivity, rather than the global sensitivity. By doing this, we achieve the strict differential privacy guarantee with smaller magnitude noise. We illustrate our approach by using clustering coefficient, which is a popular statistics used in social network analysis. Empirical evaluations on five real social networks and various synthetic graphs generated from three random graph models show the developed divide and conquer approach outperforms the direct approach. PMID:24429843
Intra-class correlation estimates for assessment of vitamin A intake in children.
Agarwal, Girdhar G; Awasthi, Shally; Walter, Stephen D
2005-03-01
In many community-based surveys, multi-level sampling is inherent in the design. In the design of these studies, especially to calculate the appropriate sample size, investigators need good estimates of intra-class correlation coefficient (ICC), along with the cluster size, to adjust for variation inflation due to clustering at each level. The present study used data on the assessment of clinical vitamin A deficiency and intake of vitamin A-rich food in children in a district in India. For the survey, 16 households were sampled from 200 villages nested within eight randomly-selected blocks of the district. ICCs and components of variances were estimated from a three-level hierarchical random effects analysis of variance model. Estimates of ICCs and variance components were obtained at village and block levels. Between-cluster variation was evident at each level of clustering. In these estimates, ICCs were inversely related to cluster size, but the design effect could be substantial for large clusters. At the block level, most ICC estimates were below 0.07. At the village level, many ICC estimates ranged from 0.014 to 0.45. These estimates may provide useful information for the design of epidemiological studies in which the sampled (or allocated) units range in size from households to large administrative zones.
Information Filtering via Clustering Coefficients of User-Object Bipartite Networks
NASA Astrophysics Data System (ADS)
Guo, Qiang; Leng, Rui; Shi, Kerui; Liu, Jian-Guo
The clustering coefficient of user-object bipartite networks is presented to evaluate the overlap percentage of neighbors rating lists, which could be used to measure interest correlations among neighbor sets. The collaborative filtering (CF) information filtering algorithm evaluates a given user's interests in terms of his/her friends' opinions, which has become one of the most successful technologies for recommender systems. In this paper, different from the object clustering coefficient, users' clustering coefficients of user-object bipartite networks are introduced to improve the user similarity measurement. Numerical results for MovieLens and Netflix data sets show that users' clustering effects could enhance the algorithm performance. For MovieLens data set, the algorithmic accuracy, measured by the average ranking score, can be improved by 12.0% and the diversity could be improved by 18.2% and reach 0.649 when the recommendation list equals to 50. For Netflix data set, the accuracy could be improved by 14.5% at the optimal case and the popularity could be reduced by 13.4% comparing with the standard CF algorithm. Finally, we investigate the sparsity effect on the performance. This work indicates the user clustering coefficients is an effective factor to measure the user similarity, meanwhile statistical properties of user-object bipartite networks should be investigated to estimate users' tastes.
Clustering of change patterns using Fourier coefficients.
Kim, Jaehee; Kim, Haseong
2008-01-15
To understand the behavior of genes, it is important to explore how the patterns of gene expression change over a time period because biologically related gene groups can share the same change patterns. Many clustering algorithms have been proposed to group observation data. However, because of the complexity of the underlying functions there have not been many studies on grouping data based on change patterns. In this study, the problem of finding similar change patterns is induced to clustering with the derivative Fourier coefficients. The sample Fourier coefficients not only provide information about the underlying functions, but also reduce the dimension. In addition, as their limiting distribution is a multivariate normal, a model-based clustering method incorporating statistical properties would be appropriate. This work is aimed at discovering gene groups with similar change patterns that share similar biological properties. We developed a statistical model using derivative Fourier coefficients to identify similar change patterns of gene expression. We used a model-based method to cluster the Fourier series estimation of derivatives. The model-based method is advantageous over other methods in our proposed model because the sample Fourier coefficients asymptotically follow the multivariate normal distribution. Change patterns are automatically estimated with the Fourier representation in our model. Our model was tested in simulations and on real gene data sets. The simulation results showed that the model-based clustering method with the sample Fourier coefficients has a lower clustering error rate than K-means clustering. Even when the number of repeated time points was small, the same results were obtained. We also applied our model to cluster change patterns of yeast cell cycle microarray expression data with alpha-factor synchronization. It showed that, as the method clusters with the probability-neighboring data, the model-based clustering with our proposed model yielded biologically interpretable results. We expect that our proposed Fourier analysis with suitably chosen smoothing parameters could serve as a useful tool in classifying genes and interpreting possible biological change patterns. The R program is available upon the request.
Distance correlation methods for discovering associations in large astrophysical databases
DOE Office of Scientific and Technical Information (OSTI.GOV)
Martínez-Gómez, Elizabeth; Richards, Mercedes T.; Richards, Donald St. P., E-mail: elizabeth.martinez@itam.mx, E-mail: mrichards@astro.psu.edu, E-mail: richards@stat.psu.edu
2014-01-20
High-dimensional, large-sample astrophysical databases of galaxy clusters, such as the Chandra Deep Field South COMBO-17 database, provide measurements on many variables for thousands of galaxies and a range of redshifts. Current understanding of galaxy formation and evolution rests sensitively on relationships between different astrophysical variables; hence an ability to detect and verify associations or correlations between variables is important in astrophysical research. In this paper, we apply a recently defined statistical measure called the distance correlation coefficient, which can be used to identify new associations and correlations between astrophysical variables. The distance correlation coefficient applies to variables of any dimension,more » can be used to determine smaller sets of variables that provide equivalent astrophysical information, is zero only when variables are independent, and is capable of detecting nonlinear associations that are undetectable by the classical Pearson correlation coefficient. Hence, the distance correlation coefficient provides more information than the Pearson coefficient. We analyze numerous pairs of variables in the COMBO-17 database with the distance correlation method and with the maximal information coefficient. We show that the Pearson coefficient can be estimated with higher accuracy from the corresponding distance correlation coefficient than from the maximal information coefficient. For given values of the Pearson coefficient, the distance correlation method has a greater ability than the maximal information coefficient to resolve astrophysical data into highly concentrated horseshoe- or V-shapes, which enhances classification and pattern identification. These results are observed over a range of redshifts beyond the local universe and for galaxies from elliptical to spiral.« less
Report on simulation of fission gas and fission product diffusion in UO 2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Andersson, Anders David; Perriot, Romain Thibault; Pastore, Giovanni
2016-07-22
In UO 2 nuclear fuel, the retention and release of fission gas atoms such as xenon (Xe) are important for nuclear fuel performance by, for example, reducing the fuel thermal conductivity, causing fuel swelling that leads to mechanical interaction with the clad, increasing the plenum pressure and reducing the fuel–clad gap thermal conductivity. We use multi-scale simulations to determine fission gas diffusion mechanisms as well as the corresponding rates in UO 2 under both intrinsic and irradiation conditions. In addition to Xe and Kr, the fission products Zr, Ru, Ce, Y, La, Sr and Ba have been investigated. Density functionalmore » theory (DFT) calculations are used to study formation, binding and migration energies of small clusters of Xe atoms and vacancies. Empirical potential calculations enable us to determine the corresponding entropies and attempt frequencies for migration as well as investigate the properties of large clusters or small fission gas bubbles. A continuum reaction-diffusion model is developed for Xe and point defects based on the mechanisms and rates obtained from atomistic simulations. Effective fission gas diffusivities are then obtained by solving this set of equations for different chemical and irradiation conditions using the MARMOT phase field code. The predictions are compared to available experimental data. The importance of the large Xe U3O cluster (a Xe atom in a uranium + oxygen vacancy trap site with two bound uranium vacancies) is emphasized, which is a consequence of its high mobility and high binding energy. We find that the Xe U3O cluster gives Xe diffusion coefficients that are higher for intrinsic conditions than under irradiation over a wide range of temperatures. Under irradiation the fast-moving Xe U3O cluster recombines quickly with irradiation-induced interstitial U ions, while this mechanism is less important for intrinsic conditions. The net result is higher concentration of the Xe U3O cluster for intrinsic conditions than under irradiation. We speculate that differences in the irradiation conditions and their impact on the Xe U3O cluster can explain the wide range of diffusivities reported in experimental studies. However, all vacancy-mediated mechanisms underestimate the Xe diffusivity compared to the empirical radiation-enhanced rate used in most fission gas release models. We investigate the possibility that diffusion of small fission gas bubbles or extended Xe-vacancy clusters may give rise to the observed radiation-enhanced diffusion coefficient. These studies highlight the importance of U divacancies and an octahedron coordination of uranium vacancies encompassing a Xe fission gas atom. The latter cluster can migrate via a multistep mechanism with a rather low effective barrier, which together with irradiation-induced clusters of uranium vacancies, gives rise to the irradiation-enhanced diffusion coefficient observed in experiments.« less
Rumor Diffusion in an Interests-Based Dynamic Social Network
Mao, Xinjun; Guessoum, Zahia; Zhou, Huiping
2013-01-01
To research rumor diffusion in social friend network, based on interests, a dynamic friend network is proposed, which has the characteristics of clustering and community, and a diffusion model is also proposed. With this friend network and rumor diffusion model, based on the zombie-city model, some simulation experiments to analyze the characteristics of rumor diffusion in social friend networks have been conducted. The results show some interesting observations: (1) positive information may evolve to become a rumor through the diffusion process that people may modify the information by word of mouth; (2) with the same average degree, a random social network has a smaller clustering coefficient and is more beneficial for rumor diffusion than the dynamic friend network; (3) a rumor is spread more widely in a social network with a smaller global clustering coefficient than in a social network with a larger global clustering coefficient; and (4) a network with a smaller clustering coefficient has a larger efficiency. PMID:24453911
Rumor diffusion in an interests-based dynamic social network.
Tang, Mingsheng; Mao, Xinjun; Guessoum, Zahia; Zhou, Huiping
2013-01-01
To research rumor diffusion in social friend network, based on interests, a dynamic friend network is proposed, which has the characteristics of clustering and community, and a diffusion model is also proposed. With this friend network and rumor diffusion model, based on the zombie-city model, some simulation experiments to analyze the characteristics of rumor diffusion in social friend networks have been conducted. The results show some interesting observations: (1) positive information may evolve to become a rumor through the diffusion process that people may modify the information by word of mouth; (2) with the same average degree, a random social network has a smaller clustering coefficient and is more beneficial for rumor diffusion than the dynamic friend network; (3) a rumor is spread more widely in a social network with a smaller global clustering coefficient than in a social network with a larger global clustering coefficient; and (4) a network with a smaller clustering coefficient has a larger efficiency.
Carlson, Matthew T.; Sonderegger, Morgan; Bane, Max
2014-01-01
We explored how phonological network structure influences the age of words’ first appearance in children’s (14–50 months) speech, using a large, longitudinal corpus of spontaneous child-caregiver interactions. We represent the caregiver lexicon as a network in which each word is connected to all of its phonological neighbors, and consider both words’ local neighborhood density (degree), and also their embeddedness among interconnected neighborhoods (clustering coefficient and coreness). The larger-scale structure reflected in the latter two measures is implicated in current theories of lexical development and processing, but its role in lexical development has not yet been explored. Multilevel discrete-time survival analysis revealed that children are more likely to produce new words whose network properties support lexical access for production: high degree, but low clustering coefficient and coreness. These effects appear to be strongest at earlier ages and largely absent from 30 months on. These results suggest that both a word’s local connectivity in the lexicon and its position in the lexicon as a whole influences when it is learned, and they underscore how general lexical processing mechanisms contribute to productive vocabulary development. PMID:25089073
Scale-free Graphs for General Aviation Flight Schedules
NASA Technical Reports Server (NTRS)
Alexandov, Natalia M. (Technical Monitor); Kincaid, Rex K.
2003-01-01
In the late 1990s a number of researchers noticed that networks in biology, sociology, and telecommunications exhibited similar characteristics unlike standard random networks. In particular, they found that the cummulative degree distributions of these graphs followed a power law rather than a binomial distribution and that their clustering coefficients tended to a nonzero constant as the number of nodes, n, became large rather than O(1/n). Moreover, these networks shared an important property with traditional random graphs as n becomes large the average shortest path length scales with log n. This latter property has been coined the small-world property. When taken together these three properties small-world, power law, and constant clustering coefficient describe what are now most commonly referred to as scale-free networks. Since 1997 at least six books and over 400 articles have been written about scale-free networks. In this manuscript an overview of the salient characteristics of scale-free networks. Computational experience will be provided for two mechanisms that grow (dynamic) scale-free graphs. Additional computational experience will be given for constructing (static) scale-free graphs via a tabu search optimization approach. Finally, a discussion of potential applications to general aviation networks is given.
Directed clustering coefficient as a measure of systemic risk in complex banking networks
NASA Astrophysics Data System (ADS)
Tabak, Benjamin M.; Takami, Marcelo; Rocha, Jadson M. C.; Cajueiro, Daniel O.; Souza, Sergio R. S.
2014-01-01
Recent literature has focused on the study of systemic risk in complex networks. It is clear now, after the crisis of 2008, that the aggregate behavior of the interaction among agents is not straightforward and it is very difficult to predict. Contributing to this debate, this paper shows that the directed clustering coefficient may be used as a measure of systemic risk in complex networks. Furthermore, using data from the Brazilian interbank network, we show that the directed clustering coefficient is negatively correlated with domestic interest rates.
Modeling online social signed networks
NASA Astrophysics Data System (ADS)
Li, Le; Gu, Ke; Zeng, An; Fan, Ying; Di, Zengru
2018-04-01
People's online rating behavior can be modeled by user-object bipartite networks directly. However, few works have been devoted to reveal the hidden relations between users, especially from the perspective of signed networks. We analyze the signed monopartite networks projected by the signed user-object bipartite networks, finding that the networks are highly clustered with obvious community structure. Interestingly, the positive clustering coefficient is remarkably higher than the negative clustering coefficient. Then, a Signed Growing Network model (SGN) based on local preferential attachment is proposed to generate a user's signed network that has community structure and high positive clustering coefficient. Other structural properties of the modeled networks are also found to be similar to the empirical networks.
Ke, Tracy; Fan, Jianqing; Wu, Yichao
2014-01-01
This paper explores the homogeneity of coefficients in high-dimensional regression, which extends the sparsity concept and is more general and suitable for many applications. Homogeneity arises when regression coefficients corresponding to neighboring geographical regions or a similar cluster of covariates are expected to be approximately the same. Sparsity corresponds to a special case of homogeneity with a large cluster of known atom zero. In this article, we propose a new method called clustering algorithm in regression via data-driven segmentation (CARDS) to explore homogeneity. New mathematics are provided on the gain that can be achieved by exploring homogeneity. Statistical properties of two versions of CARDS are analyzed. In particular, the asymptotic normality of our proposed CARDS estimator is established, which reveals better estimation accuracy for homogeneous parameters than that without homogeneity exploration. When our methods are combined with sparsity exploration, further efficiency can be achieved beyond the exploration of sparsity alone. This provides additional insights into the power of exploring low-dimensional structures in high-dimensional regression: homogeneity and sparsity. Our results also shed lights on the properties of the fussed Lasso. The newly developed method is further illustrated by simulation studies and applications to real data. Supplementary materials for this article are available online. PMID:26085701
Determinates of clustering across America's national parks: An application of the Gini coefficients
R. Geoffrey Lacher; Matthew T.J. Brownlee
2012-01-01
The changes in the clustering of visitation across National Park Service (NPS) sites have not been well documented or widely studied. This paper investigates the changes in the dispersion of visitation across NPS sites with the Gini coefficient, a popular measure of inequality used primarily in the field of economics. To calculate the degree of clustering nationally,...
Hsu, David
2015-09-27
Clustering methods are often used to model energy consumption for two reasons. First, clustering is often used to process data and to improve the predictive accuracy of subsequent energy models. Second, stable clusters that are reproducible with respect to non-essential changes can be used to group, target, and interpret observed subjects. However, it is well known that clustering methods are highly sensitive to the choice of algorithms and variables. This can lead to misleading assessments of predictive accuracy and mis-interpretation of clusters in policymaking. This paper therefore introduces two methods to the modeling of energy consumption in buildings: clusterwise regression,more » also known as latent class regression, which integrates clustering and regression simultaneously; and cluster validation methods to measure stability. Using a large dataset of multifamily buildings in New York City, clusterwise regression is compared to common two-stage algorithms that use K-means and model-based clustering with linear regression. Predictive accuracy is evaluated using 20-fold cross validation, and the stability of the perturbed clusters is measured using the Jaccard coefficient. These results show that there seems to be an inherent tradeoff between prediction accuracy and cluster stability. This paper concludes by discussing which clustering methods may be appropriate for different analytical purposes.« less
NASA Astrophysics Data System (ADS)
Wu, Zhihao; Lin, Youfang; Zhao, Yiji; Yan, Hongyan
2018-02-01
Networks can represent a wide range of complex systems, such as social, biological and technological systems. Link prediction is one of the most important problems in network analysis, and has attracted much research interest recently. Many link prediction methods have been proposed to solve this problem with various techniques. We can note that clustering information plays an important role in solving the link prediction problem. In previous literatures, we find node clustering coefficient appears frequently in many link prediction methods. However, node clustering coefficient is limited to describe the role of a common-neighbor in different local networks, because it cannot distinguish different clustering abilities of a node to different node pairs. In this paper, we shift our focus from nodes to links, and propose the concept of asymmetric link clustering (ALC) coefficient. Further, we improve three node clustering based link prediction methods via the concept of ALC. The experimental results demonstrate that ALC-based methods outperform node clustering based methods, especially achieving remarkable improvements on food web, hamster friendship and Internet networks. Besides, comparing with other methods, the performance of ALC-based methods are very stable in both globalized and personalized top-L link prediction tasks.
NASA Technical Reports Server (NTRS)
Kalton, G.
1983-01-01
A number of surveys were conducted to study the relationship between the level of aircraft or traffic noise exposure experienced by people living in a particular area and their annoyance with it. These surveys generally employ a clustered sample design which affects the precision of the survey estimates. Regression analysis of annoyance on noise measures and other variables is often an important component of the survey analysis. Formulae are presented for estimating the standard errors of regression coefficients and ratio of regression coefficients that are applicable with a two- or three-stage clustered sample design. Using a simple cost function, they also determine the optimum allocation of the sample across the stages of the sample design for the estimation of a regression coefficient.
Nickel-aluminum alloy clusters -- structural and dynamical properties
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jellinek, J.; Krissinel, E.B.
1997-08-01
Structural and dynamical properties of mixed Ni{sub n}Al{sub m} alloy clusters mimicked by a many-body potential are studied computationally for all the possible compositions n and m such that n + m = 13. It is shown that the manifold of the usually very large number of the different possible structural forms can be systematized by introducing classes of structures corresponding to the same concentration of the components, geometry and type of the central atom. General definitions of mixing energy and mixing coefficient are introduced, and it is shown that the energy ordering of the structural forms within each classmore » is governed by the mixing coefficient. The peculiarities of the solid-to-liquid-like transition are described as a function of the concentration of the two types of atoms. These peculiarities are correlated with and explained in terms of the energy spectra of the structural forms. Class-dependent features of the dynamics are described and analyzed.« less
NASA Astrophysics Data System (ADS)
Mahanta, Upakul; Goswami, Aruna; Duorah, Hiralal; Duorah, Kalpana
2017-08-01
Elemental abundance patterns of globular cluster stars can provide important clues for understanding cluster formation and early chemical evolution. The origin of the abundance patterns, however, still remains poorly understood. We have studied the impact of p-capture reaction cycles on the abundances of oxygen, sodium and aluminium considering nuclear reaction cycles of carbon-nitrogen-oxygen-fluorine, neon-sodium and magnesium-aluminium in massive stars in stellar conditions of temperature range 2×107 to 10×107 K and typical density of 102 gm cc-1. We have estimated abundances of oxygen, sodium and aluminium with respect to Fe, which are then assumed to be ejected from those stars because of rotation reaching a critical limit. These ejected abundances of elements are then compared with their counterparts that have been observed in some metal-poor evolved stars, mainly giants and red giants, of globular clusters M3, M4, M13 and NGC 6752. We observe an excellent agreement with [O/Fe] between the estimated and observed abundance values for globular clusters M3 and M4 with a correlation coefficient above 0.9 and a strong linear correlation for the remaining two clusters with a correlation coefficient above 0.7. The estimated [Na/Fe] is found to have a correlation coefficient above 0.7, thus implying a strong correlation for all four globular clusters. As far as [Al/Fe] is concerned, it also shows a strong correlation between the estimated abundance and the observed abundance for globular clusters M13 and NGC 6752, since here also the correlation coefficient is above 0.7 whereas for globular cluster M4 there is a moderate correlation found with a correlation coefficient above 0.6. Possible sources of these discrepancies are discussed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wiesner, Matthew P.; Lin, Huan; Soares-Santos, Marcelle
We present galaxy cluster mass–richness relations found in the Sloan Digital Sky Survey Stripe 82 co-add using clusters found using a Voronoi tessellation cluster finder. These relations were found using stacked weak lensing shear observed in a large sample of galaxy clusters. These mass–richness relations are presented for four redshift bins, 0.1 < z ≤ 0.4, 0.4 < z ≤ 0.7, 0.7 < z ≤ 1.0 and 0.1 < z ≤ 1.0. We describe the sample of galaxy clusters and explain how these clusters were found using a Voronoi tessellation cluster finder. We fit a Navarro-Frenk-White profile to the stackedmore » weak lensing shear signal in redshift and richness bins in order to measure virial mass (M 200). We describe several effects that can bias weak lensing measurements, including photometric redshift bias, the effect of the central BCG, halo miscentering, photometric redshift uncertainty and foreground galaxy contamination. We present mass–richness relations using richness measure N VT with each of these effects considered separately as well as considered altogether. We also examine redshift evolution of the mass–richness relation. As a result, we present measurements of the mass coefficient (M 200|20) and the power-law slope (α) for power-law fits to the mass and richness values in each of the redshift bins. We find values of the mass coefficient of 8.49 ± 0.526, 14.1 ± 1.78, 30.2 ± 8.74 and 9.23 ± 0.525 × 10 13 h –1 M ⊙ for each of the four redshift bins, respectively. As a result, we find values of the power-law slope of 0.905 ± 0.0585, 0.948 ± 0.100, 1.33 ± 0.260 and 0.883 ± 0.0500, respectively.« less
Wiesner, Matthew P.; Lin, Huan; Soares-Santos, Marcelle
2015-07-08
We present galaxy cluster mass–richness relations found in the Sloan Digital Sky Survey Stripe 82 co-add using clusters found using a Voronoi tessellation cluster finder. These relations were found using stacked weak lensing shear observed in a large sample of galaxy clusters. These mass–richness relations are presented for four redshift bins, 0.1 < z ≤ 0.4, 0.4 < z ≤ 0.7, 0.7 < z ≤ 1.0 and 0.1 < z ≤ 1.0. We describe the sample of galaxy clusters and explain how these clusters were found using a Voronoi tessellation cluster finder. We fit a Navarro-Frenk-White profile to the stackedmore » weak lensing shear signal in redshift and richness bins in order to measure virial mass (M 200). We describe several effects that can bias weak lensing measurements, including photometric redshift bias, the effect of the central BCG, halo miscentering, photometric redshift uncertainty and foreground galaxy contamination. We present mass–richness relations using richness measure N VT with each of these effects considered separately as well as considered altogether. We also examine redshift evolution of the mass–richness relation. As a result, we present measurements of the mass coefficient (M 200|20) and the power-law slope (α) for power-law fits to the mass and richness values in each of the redshift bins. We find values of the mass coefficient of 8.49 ± 0.526, 14.1 ± 1.78, 30.2 ± 8.74 and 9.23 ± 0.525 × 10 13 h –1 M ⊙ for each of the four redshift bins, respectively. As a result, we find values of the power-law slope of 0.905 ± 0.0585, 0.948 ± 0.100, 1.33 ± 0.260 and 0.883 ± 0.0500, respectively.« less
Zhang, Juping; Yang, Chan; Jin, Zhen; Li, Jia
2018-07-14
In this paper, the correlation coefficients between nodes in states are used as dynamic variables, and we construct SIR epidemic dynamic models with correlation coefficients by using the pair approximation method in static networks and dynamic networks, respectively. Considering the clustering coefficient of the network, we analytically investigate the existence and the local asymptotic stability of each equilibrium of these models and derive threshold values for the prevalence of diseases. Additionally, we obtain two equivalent epidemic thresholds in dynamic networks, which are compared with the results of the mean field equations. Copyright © 2018 Elsevier Ltd. All rights reserved.
Gujt, Jure; Podlipnik, Črtomir; Bešter-Rogač, Marija; Spohr, Eckhard
2014-09-28
The relative position of the hydroxylic and the carboxylic group in the isomeric hydroxybenzoate (HB) anions is known to have a large impact on transport properties of this species. It also influences crucially the self-organisation of cationic surfactants. In this article a systematic investigation of aqueous solutions of the ortho, meta, and para isomers of the HB anion is presented. Molecular dynamics simulations of all three HB isomers were conducted for two different concentrations at 298.15 K and using two separate water models. From the resulting trajectories we calculated the self-diffusion coefficient of each isomer. According to the calculated self-diffusion coefficients, isomers were ranked in the order o-HB > m-HB > p-HB at both concentrations for both the used SPC and SPC/E water models, which agrees very well with the experiment. The structural analysis revealed that at lower concentration, where the tendency for dimerisation or cluster formation is low, hydrogen bonding with water determines the mobility of the HB anion. o-HB forms the least hydrogen bonds and is therefore the most mobile, and p-HB, which forms the most hydrogen bonds with water, is the least mobile isomer. At higher concentration the formation of clusters also needs to be considered. The ortho isomer predominantly forms dimers with 2 hydrogen bonds per dimer between one OH and one carboxylate group of each anion. m-HB mostly forms clusters of sizes around 5 and p-HB forms clusters of sizes even larger than 10, which can be either rings or chains.
Identification of structural damage using wavelet-based data classification
NASA Astrophysics Data System (ADS)
Koh, Bong-Hwan; Jeong, Min-Joong; Jung, Uk
2008-03-01
Predicted time-history responses from a finite-element (FE) model provide a baseline map where damage locations are clustered and classified by extracted damage-sensitive wavelet coefficients such as vertical energy threshold (VET) positions having large silhouette statistics. Likewise, the measured data from damaged structure are also decomposed and rearranged according to the most dominant positions of wavelet coefficients. Having projected the coefficients to the baseline map, the true localization of damage can be identified by investigating the level of closeness between the measurement and predictions. The statistical confidence of baseline map improves as the number of prediction cases increases. The simulation results of damage detection in a truss structure show that the approach proposed in this study can be successfully applied for locating structural damage even in the presence of a considerable amount of process and measurement noise.
Coevolutionary dynamics of opinion propagation and social balance: The key role of small-worldness
NASA Astrophysics Data System (ADS)
Chen, Yan; Chen, Lixue; Sun, Xian; Zhang, Kai; Zhang, Jie; Li, Ping
2014-03-01
The propagation of various opinions in social networks, which influences human inter-relationships and even social structure, and hence is a most important part of social life. We have incorporated social balance into opinion propagation in social networks are influenced by social balance. The edges in networks can represent both friendly or hostile relations, and change with the opinions of individual nodes. We introduce a model to characterize the coevolutionary dynamics of these two dynamical processes on Watts-Strogatz (WS) small-world network. We employ two distinct evolution rules (i) opinion renewal; and (ii) relation adjustment. By changing the rewiring probability, and thus the small-worldness of the WS network, we found that the time for the system to reach balanced states depends critically on both the average path length and clustering coefficient of the network, which is different than other networked process like epidemic spreading. In particular, the system equilibrates most quickly when the underlying network demonstrates strong small-worldness, i.e., small average path lengths and large clustering coefficient. We also find that opinion clusters emerge in the process of the network approaching the global equilibrium, and a measure of global contrariety is proposed to quantify the balanced state of a social network.
Zhang, Jiang; Li, Yuyao; Chen, Huafu; Ding, Jurong; Yuan, Zhen
2016-11-04
In this study, small-world network analysis was performed to identify the similarities and differences between functional brain networks for right- and left-hand motor imageries (MIs). First, Pearson correlation coefficients among the nodes within the functional brain networks from healthy subjects were calculated. Then, small-world network indicators, including the clustering coefficient, the average path length, the global efficiency, the local efficiency, the average node degree, and the small-world index, were generated for the functional brain networks during both right- and left-hand MIs. We identified large differences in the small-world network indicators between the functional networks during MI and in the random networks. More importantly, the functional brain networks underlying the right- and left-hand MIs exhibited similar small-world properties in terms of the clustering coefficient, the average path length, the global efficiency, and the local efficiency. By contrast, the right- and left-hand MI brain networks showed differences in small-world characteristics, including indicators such as the average node degree and the small-world index. Interestingly, our findings also suggested that the differences in the activity intensity and range, the average node degree, and the small-world index of brain networks between the right- and left-hand MIs were associated with the asymmetry of brain functions.
Kinetics of proton migration in liquid water.
Chen, Hanning; Voth, Gregory A; Agmon, Noam
2010-01-14
We have utilized multistate empirical valence bond (MS-EVB3) simulations of protonated liquid water to calculate the relative mean-square displacement (MSD) and the history-independent time correlation function, c(t), of the hydrated proton center of excess charge (CEC) with respect to the water molecule on which it has initially resided. The MSD is nonlinear for the first 15 ps, suggesting that the relative diffusion coefficient increases from a small value, D(0), at short separations to its larger bulk value, D(infinity), at large separations. With the ensuing distance-dependent diffusion coefficient, D(r), the time dependence of both the MSD and c(t) agrees quantitatively with the solution of a diffusion equation for reversible geminate recombination. This suggests that the relative motion of the CEC is not independent from the nearby water molecules, in agreement with theoretical and experimental observations that large water clusters participate in the mechanism of proton mobility.
Balouchestani, Mohammadreza; Krishnan, Sridhar
2014-01-01
Long-term recording of Electrocardiogram (ECG) signals plays an important role in health care systems for diagnostic and treatment purposes of heart diseases. Clustering and classification of collecting data are essential parts for detecting concealed information of P-QRS-T waves in the long-term ECG recording. Currently used algorithms do have their share of drawbacks: 1) clustering and classification cannot be done in real time; 2) they suffer from huge energy consumption and load of sampling. These drawbacks motivated us in developing novel optimized clustering algorithm which could easily scan large ECG datasets for establishing low power long-term ECG recording. In this paper, we present an advanced K-means clustering algorithm based on Compressed Sensing (CS) theory as a random sampling procedure. Then, two dimensionality reduction methods: Principal Component Analysis (PCA) and Linear Correlation Coefficient (LCC) followed by sorting the data using the K-Nearest Neighbours (K-NN) and Probabilistic Neural Network (PNN) classifiers are applied to the proposed algorithm. We show our algorithm based on PCA features in combination with K-NN classifier shows better performance than other methods. The proposed algorithm outperforms existing algorithms by increasing 11% classification accuracy. In addition, the proposed algorithm illustrates classification accuracy for K-NN and PNN classifiers, and a Receiver Operating Characteristics (ROC) area of 99.98%, 99.83%, and 99.75% respectively.
Social Network Analysis in Frontier Capital Markets
2012-06-01
developed by Watts and Strogatz measures the extent to which clusters or cliques exist in a network [WS98]. The clustering coefficent of each individual...Coefficient Watts- Strogatz 0.8039 0.8222 0.7227 Total Degree Centralization 0.0618 0.0940 0.0612 Betweenness Centralization 0.0909 0.1256 0.0646 Closeness...Fragmentation 0.6099 0.5304 0.5308 Clustering Coefficient Watts- Strogatz 0.5281 0.6607 0.6360 Total Degree Centralization 0.0153 0.0360 0.0171
Recombination of electrons with NH4/+/-/NH3/n-series ions
NASA Technical Reports Server (NTRS)
Huang, C.-M.; Biondi, M. A.; Johnsen, R.
1976-01-01
The paper examines the recombination of electrons with ammonium-series cluster ions, NH4(+)-(NH3)n, for two reasons: (1) NH4(+) may be a significant ion in the lower atmospheres of the earth and the outer planets, and (2) to investigate the weak temperature dependence of the cluster ion's recombination coefficient. A microwave afterglow mass spectrometer was used to determine the recombination coefficients for the first five members of the ammonium series, (18+) through (86+), at temperatures between 200 and 410 K. The electron temperature dependence of the recombination coefficient was determined for (35+) and (52+), the n = 1 and 2 cluster ions, over the temperature range 300-3000 K.
Heat conduction in diatomic chains with correlated disorder
NASA Astrophysics Data System (ADS)
Savin, Alexander V.; Zolotarevskiy, Vadim; Gendelman, Oleg V.
2017-01-01
The paper considers heat transport in diatomic one-dimensional lattices, containing equal amounts of particles with different masses. Ordering of the particles in the chain is governed by single correlation parameter - the probability for two neighboring particles to have the same mass. As this parameter grows from zero to unity, the structure of the chain varies from regular staggering chain to completely random configuration, and then - to very long clusters of particles with equal masses. Therefore, this correlation parameter allows a control of typical cluster size in the chain. In order to explore different regimes of the heat transport, two interatomic potentials are considered. The first one is an infinite potential wall, corresponding to instantaneous elastic collisions between the neighboring particles. In homogeneous chains such interaction leads to an anomalous heat transport. The other one is classical Lennard-Jones interatomic potential, which leads to a normal heat transport. The simulations demonstrate that the correlated disorder of the particle arrangement does not change the convergence properties of the heat conduction coefficient, but essentially modifies its value. For the collision potential, one observes essential growth of the coefficient for fixed chain length as the limit of large homogeneous clusters is approached. The thermal transport in these models remains superdiffusive. In the Lennard-Jones chain the effect of correlation appears to be not monotonous in the limit of low temperatures. This behavior stems from the competition between formation of long clusters mentioned above, and Anderson localization close to the staggering ordered state.
Surface properties for α-cluster nuclear matter
NASA Astrophysics Data System (ADS)
Castro, J. J.; Soto, J. R.; Yépez, E.
2013-03-01
We introduce a new microscopic model for α-cluster matter, which simulates the properties of ordinary nuclear matter and α-clustering in a curved surface of a large but finite nucleus. The model is based on a nested icosahedral fullerene-like multiple-shell structure, where each vertex is occupied by a microscopic α-particle. The novel aspect of this model is that it allows a consistent description of nuclear surface properties from microscopic parameters to be made without using the leptodermous expansion. In particular, we show that the calculated surface energy is in excellent agreement with the corresponding coefficient of the Bethe-Weizäcker semi-empirical mass formula. We discuss the properties of the surface α-cluster state, which resembles an ultra cold bosonic quantum gas trapped in an optical lattice. By comparing the surface and interior states we are able to estimate the α preformation probability. Possible extensions of this model to study nuclear dynamics through surface vibrations and departures from approximate sphericity are mentioned.
Sample size calculations for stepped wedge and cluster randomised trials: a unified approach
Hemming, Karla; Taljaard, Monica
2016-01-01
Objectives To clarify and illustrate sample size calculations for the cross-sectional stepped wedge cluster randomized trial (SW-CRT) and to present a simple approach for comparing the efficiencies of competing designs within a unified framework. Study Design and Setting We summarize design effects for the SW-CRT, the parallel cluster randomized trial (CRT), and the parallel cluster randomized trial with before and after observations (CRT-BA), assuming cross-sectional samples are selected over time. We present new formulas that enable trialists to determine the required cluster size for a given number of clusters. We illustrate by example how to implement the presented design effects and give practical guidance on the design of stepped wedge studies. Results For a fixed total cluster size, the choice of study design that provides the greatest power depends on the intracluster correlation coefficient (ICC) and the cluster size. When the ICC is small, the CRT tends to be more efficient; when the ICC is large, the SW-CRT tends to be more efficient and can serve as an alternative design when the CRT is an infeasible design. Conclusion Our unified approach allows trialists to easily compare the efficiencies of three competing designs to inform the decision about the most efficient design in a given scenario. PMID:26344808
The Influence of the Phonological Neighborhood Clustering Coefficient on Spoken Word Recognition
ERIC Educational Resources Information Center
Chan, Kit Ying; Vitevitch, Michael S.
2009-01-01
Clustering coefficient--a measure derived from the new science of networks--refers to the proportion of phonological neighbors of a target word that are also neighbors of each other. Consider the words "bat", "hat", and "can", all of which are neighbors of the word "cat"; the words "bat" and…
Effects of the bipartite structure of a network on performance of recommenders
NASA Astrophysics Data System (ADS)
Wang, Qing-Xian; Li, Jian; Luo, Xin; Xu, Jian-Jun; Shang, Ming-Sheng
2018-02-01
Recommender systems aim to predict people's preferences for online items by analyzing their historical behaviors. A recommender can be modeled as a high-dimensional and sparse bipartite network, where the key issue is to understand the relation between the network structure and a recommender's performance. To address this issue, we choose three network characteristics, clustering coefficient, network density and user-item ratio, as the analyzing targets. For the cluster coefficient, we adopt the Degree-preserving rewiring algorithm to obtain a series of bipartite network with varying cluster coefficient, while the degree of user and item keep unchanged. Furthermore, five state-of-the-art recommenders are applied on two real datasets. The performances of recommenders are measured by both numerical and physical metrics. These results show that a recommender's performance is positively related to the clustering coefficient of a bipartite network. Meanwhile, higher density of a bipartite network can provide more accurate but less diverse or novel recommendations. Furthermore, the user-item ratio is positively correlated with the accuracy metrics but negatively correlated with the diverse and novel metrics.
A Real-Time PCR with Melting Curve Analysis for Molecular Typing of Vibrio parahaemolyticus.
He, Peiyan; Wang, Henghui; Luo, Jianyong; Yan, Yong; Chen, Zhongwen
2018-05-23
Foodborne disease caused by Vibrio parahaemolyticus is a serious public health problem in many countries. Molecular typing has a great scientific significance and application value for epidemiological research of V. parahaemolyticus. In this study, a real-time PCR with melting curve analysis was established for molecular typing of V. parahaemolyticus. Eighteen large variably presented gene clusters (LVPCs) of V. parahaemolyticus which have different distributions in the genome of different strains were selected as targets. Primer pairs of 18 LVPCs were distributed into three tubes. To validate this newly developed assay, we tested 53 Vibrio parahaemolyticus strains, which were classified in 13 different types. Furthermore, cluster analysis using NTSYS PC 2.02 software could divide 53 V. parahaemolyticus strains into six clusters at a relative similarity coefficient of 0.85. This method is fast, simple, and conveniently for molecular typing of V. parahaemolyticus.
Generic Feature Selection with Short Fat Data
Clarke, B.; Chu, J.-H.
2014-01-01
SUMMARY Consider a regression problem in which there are many more explanatory variables than data points, i.e., p ≫ n. Essentially, without reducing the number of variables inference is impossible. So, we group the p explanatory variables into blocks by clustering, evaluate statistics on the blocks and then regress the response on these statistics under a penalized error criterion to obtain estimates of the regression coefficients. We examine the performance of this approach for a variety of choices of n, p, classes of statistics, clustering algorithms, penalty terms, and data types. When n is not large, the discrimination over number of statistics is weak, but computations suggest regressing on approximately [n/K] statistics where K is the number of blocks formed by a clustering algorithm. Small deviations from this are observed when the blocks of variables are of very different sizes. Larger deviations are observed when the penalty term is an Lq norm with high enough q. PMID:25346546
Functional connectivity and graph theory in preclinical Alzheimer's disease.
Brier, Matthew R; Thomas, Jewell B; Fagan, Anne M; Hassenstab, Jason; Holtzman, David M; Benzinger, Tammie L; Morris, John C; Ances, Beau M
2014-04-01
Alzheimer's disease (AD) has a long preclinical phase in which amyloid and tau cerebral pathology accumulate without producing cognitive symptoms. Resting state functional connectivity magnetic resonance imaging has demonstrated that brain networks degrade during symptomatic AD. It is unclear to what extent these degradations exist before symptomatic onset. In this study, we investigated graph theory metrics of functional integration (path length), functional segregation (clustering coefficient), and functional distinctness (modularity) as a function of disease severity. Further, we assessed whether these graph metrics were affected in cognitively normal participants with cerebrospinal fluid evidence of preclinical AD. Clustering coefficient and modularity, but not path length, were reduced in AD. Cognitively normal participants who harbored AD biomarker pathology also showed reduced values in these graph measures, demonstrating brain changes similar to, but smaller than, symptomatic AD. Only modularity was significantly affected by age. We also demonstrate that AD has a particular effect on hub-like regions in the brain. We conclude that AD causes large-scale disconnection that is present before onset of symptoms. Copyright © 2014 Elsevier Inc. All rights reserved.
Boriollo, Marcelo Fabiano Gomes; Rosa, Edvaldo Antonio Ribeiro; Gonçalves, Reginaldo Bruno; Höfling, José Francisco
2006-03-01
The typing of C. albicans by MLEE (multilocus enzyme electrophoresis) is dependent on the interpretation of enzyme electrophoretic patterns, and the study of the epidemiological relationships of these yeasts can be conducted by cluster analysis. Therefore, the aims of the present study were to first determine the discriminatory power of genetic interpretation (deduction of the allelic composition of diploid organisms) and numerical interpretation (mere determination of the presence and absence of bands) of MLEE patterns, and then to determine the concordance (Pearson product-moment correlation coefficient) and similarity (Jaccard similarity coefficient) of the groups of strains generated by three cluster analysis models, and the discriminatory power of such models as well [model A: genetic interpretation, genetic distance matrix of Nei (d(ij)) and UPGMA dendrogram; model B: genetic interpretation, Dice similarity matrix (S(D1)) and UPGMA dendrogram; model C: numerical interpretation, Dice similarity matrix (S(D2)) and UPGMA dendrogram]. MLEE was found to be a powerful and reliable tool for the typing of C. albicans due to its high discriminatory power (>0.9). Discriminatory power indicated that numerical interpretation is a method capable of discriminating a greater number of strains (47 versus 43 subtypes), but also pointed to model B as a method capable of providing a greater number of groups, suggesting its use for the typing of C. albicans by MLEE and cluster analysis. Very good agreement was only observed between the elements of the matrices S(D1) and S(D2), but a large majority of the groups generated in the three UPGMA dendrograms showed similarity S(J) between 4.8% and 75%, suggesting disparities in the conclusions obtained by the cluster assays.
Grieve, Richard; Nixon, Richard; Thompson, Simon G
2010-01-01
Cost-effectiveness analyses (CEA) may be undertaken alongside cluster randomized trials (CRTs) where randomization is at the level of the cluster (for example, the hospital or primary care provider) rather than the individual. Costs (and outcomes) within clusters may be correlated so that the assumption made by standard bivariate regression models, that observations are independent, is incorrect. This study develops a flexible modeling framework to acknowledge the clustering in CEA that use CRTs. The authors extend previous Bayesian bivariate models for CEA of multicenter trials to recognize the specific form of clustering in CRTs. They develop new Bayesian hierarchical models (BHMs) that allow mean costs and outcomes, and also variances, to differ across clusters. They illustrate how each model can be applied using data from a large (1732 cases, 70 primary care providers) CRT evaluating alternative interventions for reducing postnatal depression. The analyses compare cost-effectiveness estimates from BHMs with standard bivariate regression models that ignore the data hierarchy. The BHMs show high levels of cost heterogeneity across clusters (intracluster correlation coefficient, 0.17). Compared with standard regression models, the BHMs yield substantially increased uncertainty surrounding the cost-effectiveness estimates, and altered point estimates. The authors conclude that ignoring clustering can lead to incorrect inferences. The BHMs that they present offer a flexible modeling framework that can be applied more generally to CEA that use CRTs.
Nuclear States with Abnormally Large Radii (size Isomers)
NASA Astrophysics Data System (ADS)
Ogloblin, A. A.; Demyanova, A. S.; Danilov, A. N.; Belyaeva, T. L.; Goncharov, S. A.
2015-06-01
Application of the methods of measuring the radii of the short-lived excited states (Modified diffraction model MDM, Inelastic nuclear rainbow scattering method INRS, Asymptotic normalization coefficients method ANC) to the analysis of some nuclear reactions provide evidence of existing in 9Be, 11B, 12C, 13C the excited states whose radii exceed those of the corresponding ground states by ~ 30%. Two types of structure of these "size isomers" were identified: neutron halo an α-clusters.
Estrada, Lisbell D; Duran, Elizabeth; Cisterna, Matias; Echeverria, Cesar; Zheng, Zhiping; Borgna, Vincenzo; Arancibia-Miranda, Nicolas; Ramírez-Tagle, Rodrigo
2018-03-24
Tumorigenic cell lines are more susceptible to [Re 6 Se 8 I 6 ] 3- cluster-induced death than normal cells, becoming a novel candidate for cancer treatment. Still, the feasibility of using this type of molecules in human patients remains unclear and further pharmacokinetics analysis is needed. Using coupled plasma optical emission spectroscopy, we determined the Re-cluster tissue content in injected mice, as a biodistribution measurement. Our results show that the Re-cluster successfully reaches different tissues, accumulating mainly in heart and liver. In order to dissect the mechanism underlying cluster biodistribution, we used three different experimental approaches. First, we evaluate the degree of lipophilicity by determining the octanol/water partition coefficient. The cluster mostly remained in the octanol fraction, with a coefficient of 1.86 ± 0.02, which indicates it could potentially cross cell membranes. Then, we measured the biological membrane penetration through a parallel artificial membrane permeability assays (PAMPA) assay. The Re-cluster crosses the artificial membrane, with a coefficient of 122 nm/s that is considered highly permeable. To evaluate a potential application of the Re-cluster in central nervous system (CNS) tumors, we analyzed the cluster's brain penetration by exposing cultured blood-brain-barrier (BBB) cells to increasing concentrations of the cluster. The Re-cluster effectively penetrates the BBB, reaching nearly 30% of the brain side after 24 h. Thus, our results indicate that the Re-cluster penetrates biological membranes reaching different target organs-most probably due to its lipophilic properties-becoming a promising anti-cancer drug with high potential for CNS cancer's diagnosis and treatment.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Popescu, Bogdan; Hanson, M. M.
2010-04-10
We present Monte Carlo models of open stellar clusters with the purpose of mapping out the behavior of integrated colors with mass and age. Our cluster simulation package allows for stochastic variations in the stellar mass function to evaluate variations in integrated cluster properties. We find that UBVK colors from our simulations are consistent with simple stellar population (SSP) models, provided the cluster mass is large, M {sub cluster} {>=} 10{sup 6} M {sub sun}. Below this mass, our simulations show two significant effects. First, the mean value of the distribution of integrated colors moves away from the SSP predictionsmore » and is less red, in the first 10{sup 7} to 10{sup 8} years in UBV colors, and for all ages in (V - K). Second, the 1{sigma} dispersion of observed colors increases significantly with lower cluster mass. We attribute the former to the reduced number of red luminous stars in most of the lower mass clusters and the latter to the increased stochastic effect of a few of these stars on lower mass clusters. This latter point was always assumed to occur, but we now provide the first public code able to quantify this effect. We are completing a more extensive database of magnitudes and colors as a function of stellar cluster age and mass that will allow the determination of the correlation coefficients among different bands, and improve estimates of cluster age and mass from integrated photometry.« less
Raman, Abhinav S; Li, Huiyong; Chiew, Y C
2018-01-07
Supercritical oxygen, a cryogenic fluid, is widely used as an oxidizer in jet propulsion systems and is therefore of paramount importance in gaining physical insights into processes such as transcritical and supercritical vaporization. It is well established in the scientific literature that the supercritical state is not homogeneous but, in fact, can be demarcated into regions with liquid-like and vapor-like properties, separated by the "Widom line." In this study, we identified the Widom line for oxygen, constituted by the loci of the extrema of thermodynamic response functions (heat capacity, volumetric thermal expansion coefficient, and isothermal compressibility) in the supercritical region, via atomistic molecular dynamics simulations. We found that the Widom lines derived from these response functions all coincide near the critical point until about 25 bars and 15-20 K, beyond which the isothermal compressibility line begins to deviate. We also obtained the crossover from liquid-like to vapor-like behavior of the translational diffusion coefficient, shear viscosity, and rotational relaxation time of supercritical oxygen. While the crossover of the translational diffusion coefficient and shear viscosity coincided with the Widom lines, the rotational relaxation time showed a crossover that was largely independent of the Widom line. Further, we characterized the clustering behavior and percolation transition of supercritical oxygen molecules, identified the percolation threshold based on the fractal dimension of the largest cluster and the probability of finding a cluster that spans the system in all three dimensions, and found that the locus of the percolation threshold also coincided with the isothermal compressibility Widom line. It is therefore clear that supercritical oxygen is far more complex than originally perceived and that the Widom line, dynamical crossovers, and percolation transitions serve as useful routes to better our understanding of the supercritical state.
NASA Astrophysics Data System (ADS)
Raman, Abhinav S.; Li, Huiyong; Chiew, Y. C.
2018-01-01
Supercritical oxygen, a cryogenic fluid, is widely used as an oxidizer in jet propulsion systems and is therefore of paramount importance in gaining physical insights into processes such as transcritical and supercritical vaporization. It is well established in the scientific literature that the supercritical state is not homogeneous but, in fact, can be demarcated into regions with liquid-like and vapor-like properties, separated by the "Widom line." In this study, we identified the Widom line for oxygen, constituted by the loci of the extrema of thermodynamic response functions (heat capacity, volumetric thermal expansion coefficient, and isothermal compressibility) in the supercritical region, via atomistic molecular dynamics simulations. We found that the Widom lines derived from these response functions all coincide near the critical point until about 25 bars and 15-20 K, beyond which the isothermal compressibility line begins to deviate. We also obtained the crossover from liquid-like to vapor-like behavior of the translational diffusion coefficient, shear viscosity, and rotational relaxation time of supercritical oxygen. While the crossover of the translational diffusion coefficient and shear viscosity coincided with the Widom lines, the rotational relaxation time showed a crossover that was largely independent of the Widom line. Further, we characterized the clustering behavior and percolation transition of supercritical oxygen molecules, identified the percolation threshold based on the fractal dimension of the largest cluster and the probability of finding a cluster that spans the system in all three dimensions, and found that the locus of the percolation threshold also coincided with the isothermal compressibility Widom line. It is therefore clear that supercritical oxygen is far more complex than originally perceived and that the Widom line, dynamical crossovers, and percolation transitions serve as useful routes to better our understanding of the supercritical state.
Efficient sampling of complex network with modified random walk strategies
NASA Astrophysics Data System (ADS)
Xie, Yunya; Chang, Shuhua; Zhang, Zhipeng; Zhang, Mi; Yang, Lei
2018-02-01
We present two novel random walk strategies, choosing seed node (CSN) random walk and no-retracing (NR) random walk. Different from the classical random walk sampling, the CSN and NR strategies focus on the influences of the seed node choice and path overlap, respectively. Three random walk samplings are applied in the Erdös-Rényi (ER), Barabási-Albert (BA), Watts-Strogatz (WS), and the weighted USAir networks, respectively. Then, the major properties of sampled subnets, such as sampling efficiency, degree distributions, average degree and average clustering coefficient, are studied. The similar conclusions can be reached with these three random walk strategies. Firstly, the networks with small scales and simple structures are conducive to the sampling. Secondly, the average degree and the average clustering coefficient of the sampled subnet tend to the corresponding values of original networks with limited steps. And thirdly, all the degree distributions of the subnets are slightly biased to the high degree side. However, the NR strategy performs better for the average clustering coefficient of the subnet. In the real weighted USAir networks, some obvious characters like the larger clustering coefficient and the fluctuation of degree distribution are reproduced well by these random walk strategies.
Liao, Fuyuan; Jan, Yih-Kuen
2012-06-01
This paper presents a recurrence network approach for the analysis of skin blood flow dynamics in response to loading pressure. Recurrence is a fundamental property of many dynamical systems, which can be explored in phase spaces constructed from observational time series. A visualization tool of recurrence analysis called recurrence plot (RP) has been proved to be highly effective to detect transitions in the dynamics of the system. However, it was found that delay embedding can produce spurious structures in RPs. Network-based concepts have been applied for the analysis of nonlinear time series recently. We demonstrate that time series with different types of dynamics exhibit distinct global clustering coefficients and distributions of local clustering coefficients and that the global clustering coefficient is robust to the embedding parameters. We applied the approach to study skin blood flow oscillations (BFO) response to loading pressure. The results showed that global clustering coefficients of BFO significantly decreased in response to loading pressure (p<0.01). Moreover, surrogate tests indicated that such a decrease was associated with a loss of nonlinearity of BFO. Our results suggest that the recurrence network approach can practically quantify the nonlinear dynamics of BFO.
Cross-entropy clustering framework for catchment classification
NASA Astrophysics Data System (ADS)
Tongal, Hakan; Sivakumar, Bellie
2017-09-01
There is an increasing interest in catchment classification and regionalization in hydrology, as they are useful for identification of appropriate model complexity and transfer of information from gauged catchments to ungauged ones, among others. This study introduces a nonlinear cross-entropy clustering (CEC) method for classification of catchments. The method specifically considers embedding dimension (m), sample entropy (SampEn), and coefficient of variation (CV) to represent dimensionality, complexity, and variability of the time series, respectively. The method is applied to daily streamflow time series from 217 gauging stations across Australia. The results suggest that a combination of linear and nonlinear parameters (i.e. m, SampEn, and CV), representing different aspects of the underlying dynamics of streamflows, could be useful for determining distinct patterns of flow generation mechanisms within a nonlinear clustering framework. For the 217 streamflow time series, nine hydrologically homogeneous clusters that have distinct patterns of flow regime characteristics and specific dominant hydrological attributes with different climatic features are obtained. Comparison of the results with those obtained using the widely employed k-means clustering method (which results in five clusters, with the loss of some information about the features of the clusters) suggests the superiority of the cross-entropy clustering method. The outcomes from this study provide a useful guideline for employing the nonlinear dynamic approaches based on hydrologic signatures and for gaining an improved understanding of streamflow variability at a large scale.
Synchrony in broadband fluctuation and the 2008 financial crisis.
Lin, Der Chyan
2013-01-01
We propose phase-like characteristics in scale-free broadband processes and consider fluctuation synchrony based on the temporal signature of significant amplitude fluctuation. Using wavelet transform, successful captures of similar fluctuation pattern between such broadband processes are demonstrated. The application to the financial data leading to the 2008 financial crisis reveals the transition towards a qualitatively different dynamical regime with many equity price in fluctuation synchrony. Further analysis suggests an underlying scale free "price fluctuation network" with large clustering coefficient.
Double Cluster Heads Model for Secure and Accurate Data Fusion in Wireless Sensor Networks
Fu, Jun-Song; Liu, Yun
2015-01-01
Secure and accurate data fusion is an important issue in wireless sensor networks (WSNs) and has been extensively researched in the literature. In this paper, by combining clustering techniques, reputation and trust systems, and data fusion algorithms, we propose a novel cluster-based data fusion model called Double Cluster Heads Model (DCHM) for secure and accurate data fusion in WSNs. Different from traditional clustering models in WSNs, two cluster heads are selected after clustering for each cluster based on the reputation and trust system and they perform data fusion independently of each other. Then, the results are sent to the base station where the dissimilarity coefficient is computed. If the dissimilarity coefficient of the two data fusion results exceeds the threshold preset by the users, the cluster heads will be added to blacklist, and the cluster heads must be reelected by the sensor nodes in a cluster. Meanwhile, feedback is sent from the base station to the reputation and trust system, which can help us to identify and delete the compromised sensor nodes in time. Through a series of extensive simulations, we found that the DCHM performed very well in data fusion security and accuracy. PMID:25608211
Connick, Mark J; Beckman, Emma; Vanlandewijck, Yves; Malone, Laurie A; Blomqvist, Sven; Tweedy, Sean M
2017-11-25
The Para athletics wheelchair-racing classification system employs best practice to ensure that classes comprise athletes whose impairments cause a comparable degree of activity limitation. However, decision-making is largely subjective and scientific evidence which reduces this subjectivity is required. To evaluate whether isometric strength tests were valid for the purposes of classifying wheelchair racers and whether cluster analysis of the strength measures produced a valid classification structure. Thirty-two international level, male wheelchair racers from classes T51-54 completed six isometric strength tests evaluating elbow extensors, shoulder flexors, trunk flexors and forearm pronators and two wheelchair performance tests-Top-Speed (0-15 m) and Top-Speed (absolute). Strength tests significantly correlated with wheelchair performance were included in a cluster analysis and the validity of the resulting clusters was assessed. All six strength tests correlated with performance (r=0.54-0.88). Cluster analysis yielded four clusters with reasonable overall structure (mean silhouette coefficient=0.58) and large intercluster strength differences. Six athletes (19%) were allocated to clusters that did not align with their current class. While the mean wheelchair racing performance of the resulting clusters was unequivocally hierarchical, the mean performance of current classes was not, with no difference between current classes T53 and T54. Cluster analysis of isometric strength tests produced classes comprising athletes who experienced a similar degree of activity limitation. The strength tests reported can provide the basis for a new, more transparent, less subjective wheelchair racing classification system, pending replication of these findings in a larger, representative sample. This paper also provides guidance for development of evidence-based systems in other Para sports. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
NASA Astrophysics Data System (ADS)
Chai, Bing-Bing; Vass, Jozsef; Zhuang, Xinhua
1997-04-01
Recent success in wavelet coding is mainly attributed to the recognition of importance of data organization. There has been several very competitive wavelet codecs developed, namely, Shapiro's Embedded Zerotree Wavelets (EZW), Servetto et. al.'s Morphological Representation of Wavelet Data (MRWD), and Said and Pearlman's Set Partitioning in Hierarchical Trees (SPIHT). In this paper, we propose a new image compression algorithm called Significant-Linked Connected Component Analysis (SLCCA) of wavelet coefficients. SLCCA exploits both within-subband clustering of significant coefficients and cross-subband dependency in significant fields. A so-called significant link between connected components is designed to reduce the positional overhead of MRWD. In addition, the significant coefficients' magnitude are encoded in bit plane order to match the probability model of the adaptive arithmetic coder. Experiments show that SLCCA outperforms both EZW and MRWD, and is tied with SPIHT. Furthermore, it is observed that SLCCA generally has the best performance on images with large portion of texture. When applied to fingerprint image compression, it outperforms FBI's wavelet scalar quantization by about 1 dB.
NASA Astrophysics Data System (ADS)
Susilo; Setyaningsih, M.
2018-01-01
Solanum melongena (eggplant) is one of the diversity of the Solanum family which is grown and widely spread in Indonesia and widely used by the community. This research explored the genetic diversity of four local Indonesian eggplant species namely leuca, tekokak, gelatik and kopek by using RAPD (Random Amplified Polymorphic DNA). The samples were obtained from Agricultural Technology Assessment Institute (BPTP) Bogor, Indonesia. The result of data observation was in the form of Solanum melongena plant’s DNA profile analyzed descriptively and quantitatively. 30 DNA bands (28 polymorphic and 2 monomorphic) were successfully scored by using four primers (OPF-01, OPF-02, OPF-03, and OPF-04). The Primers were used able to amplify all of the four eggplant samples. The result of PCR-RAPD visualization produces bands of 300-1500 bp. The result of cluster analysis showed the existence of three clusters (A, B, and C). Cluster A (coefficient of equal to 49%) consisted of a gelatik, cluster B (coefficient of 65% equilibrium) consisted of TPU (Kopek) and TK (Tekokak), and cluster C (55% equilibrium coefficient) consisted of LC (Leunca). These results indicated that the closest proximity is found in samples of TK (Tekokak) and TPU (Kopek).
Smiga, Szymon; Fabiano, Eduardo
2017-11-15
We have developed a simplified coupled cluster (SCC) methodology, using the basic idea of scaled MP2 methods. The scheme has been applied to the coupled cluster double equations and implemented in three different non-iterative variants. This new method (especially the SCCD[3] variant, which utilizes a spin-resolved formalism) has been found to be very efficient and to yield an accurate approximation of the reference CCD results for both total and interaction energies of different atoms and molecules. Furthermore, we demonstrate that the equations determining the scaling coefficients for the SCCD[3] approach can generate non-empirical SCS-MP2 scaling coefficients which are in good agreement with previous theoretical investigations.
Local-world and cluster-growing weighted networks with controllable clustering
NASA Astrophysics Data System (ADS)
Yang, Chun-Xia; Tang, Min-Xuan; Tang, Hai-Qiang; Deng, Qiang-Qiang
2014-12-01
We constructed an improved weighted network model by introducing local-world selection mechanism and triangle coupling mechanism based on the traditional BBV model. The model gives power-law distributions of degree, strength and edge weight and presents the linear relationship both between the degree and strength and between the degree and the clustering coefficient. Particularly, the model is equipped with an ability to accelerate the speed increase of strength exceeding that of degree. Besides, the model is more sound and efficient in tuning clustering coefficient than the original BBV model. Finally, based on our improved model, we analyze the virus spread process and find that reducing the size of local-world has a great inhibited effect on virus spread.
H. Li; X. Deng; Andy Dolloff; E. P. Smith
2015-01-01
A novel clustering method for bivariate functional data is proposed to group streams based on their waterâair temperature relationship. A distance measure is developed for bivariate curves by using a time-varying coefficient model and a weighting scheme. This distance is also adjusted by spatial correlation of streams via the variogram. Therefore, the proposed...
Catchment classification by runoff behaviour with self-organizing maps (SOM)
NASA Astrophysics Data System (ADS)
Ley, R.; Casper, M. C.; Hellebrand, H.; Merz, R.
2011-09-01
Catchments show a wide range of response behaviour, even if they are adjacent. For many purposes it is necessary to characterise and classify them, e.g. for regionalisation, prediction in ungauged catchments, model parameterisation. In this study, we investigate hydrological similarity of catchments with respect to their response behaviour. We analyse more than 8200 event runoff coefficients (ERCs) and flow duration curves of 53 gauged catchments in Rhineland-Palatinate, Germany, for the period from 1993 to 2008, covering a huge variability of weather and runoff conditions. The spatio-temporal variability of event-runoff coefficients and flow duration curves are assumed to represent how different catchments "transform" rainfall into runoff. From the runoff coefficients and flow duration curves we derive 12 signature indices describing various aspects of catchment response behaviour to characterise each catchment. Hydrological similarity of catchments is defined by high similarities of their indices. We identify, analyse and describe hydrologically similar catchments by cluster analysis using Self-Organizing Maps (SOM). As a result of the cluster analysis we get five clusters of similarly behaving catchments where each cluster represents one differentiated class of catchments. As catchment response behaviour is supposed to be dependent on its physiographic and climatic characteristics, we compare groups of catchments clustered by response behaviour with clusters of catchments based on catchment properties. Results show an overlap of 67% between these two pools of clustered catchments which can be improved using the topologic correctness of SOMs.
Catchment classification by runoff behaviour with self-organizing maps (SOM)
NASA Astrophysics Data System (ADS)
Ley, R.; Casper, M. C.; Hellebrand, H.; Merz, R.
2011-03-01
Catchments show a wide range of response behaviour, even if they are adjacent. For many purposes it is necessary to characterise and classify them, e.g. for regionalisation, prediction in ungauged catchments, model parameterisation. In this study, we investigate hydrological similarity of catchments with respect to their response behaviour. We analyse more than 8200 event runoff coefficients (ERCs) and flow duration curves of 53 gauged catchments in Rhineland-Palatinate, Germany, for the period from 1993 to 2008, covering a huge variability of weather and runoff conditions. The spatio-temporal variability of event-runoff coefficients and flow duration curves are assumed to represent how different catchments "transform" rainfall into runoff. From the runoff coefficients and flow duration curves we derive 12 signature indices describing various aspects of catchment response behaviour to characterise each catchment. Hydrological similarity of catchments is defined by high similarities of their indices. We identify, analyse and describe hydrologically similar catchments by cluster analysis using Self-Organizing Maps (SOM). As a result of the cluster analysis we get five clusters of similarly behaving catchments where each cluster represents one differentiated class of catchments. As catchment response behaviour is supposed to be dependent on its physiographic and climatic characteristics, we compare groups of catchments clustered by response behaviour with clusters of catchments based on catchment properties. Results show an overlap of 67% between these two pools of clustered catchments which can be improved using the topologic correctness of SOMs.
Clustering stocks using partial correlation coefficients
NASA Astrophysics Data System (ADS)
Jung, Sean S.; Chang, Woojin
2016-11-01
A partial correlation analysis is performed on the Korean stock market (KOSPI). The difference between Pearson correlation and the partial correlation is analyzed and it is found that when conditioned on the market return, Pearson correlation coefficients are generally greater than those of the partial correlation, which implies that the market return tends to drive up the correlation between stock returns. A clustering analysis is then performed to study the market structure given by the partial correlation analysis and the members of the clusters are compared with the Global Industry Classification Standard (GICS). The initial hypothesis is that the firms in the same GICS sector are clustered together since they are in a similar business and environment. However, the result is inconsistent with the hypothesis and most clusters are a mix of multiple sectors suggesting that the traditional approach of using sectors to determine the proximity between stocks may not be sufficient enough to diversify a portfolio.
Analysis of ligand-protein exchange by Clustering of Ligand Diffusion Coefficient Pairs (CoLD-CoP)
NASA Astrophysics Data System (ADS)
Snyder, David A.; Chantova, Mihaela; Chaudhry, Saadia
2015-06-01
NMR spectroscopy is a powerful tool in describing protein structures and protein activity for pharmaceutical and biochemical development. This study describes a method to determine weak binding ligands in biological systems by using hierarchic diffusion coefficient clustering of multidimensional data obtained with a 400 MHz Bruker NMR. Comparison of DOSY spectrums of ligands of the chemical library in the presence and absence of target proteins show translational diffusion rates for small molecules upon interaction with macromolecules. For weak binders such as compounds found in fragment libraries, changes in diffusion rates upon macromolecular binding are on the order of the precision of DOSY diffusion measurements, and identifying such subtle shifts in diffusion requires careful statistical analysis. The "CoLD-CoP" (Clustering of Ligand Diffusion Coefficient Pairs) method presented here uses SAHN clustering to identify protein-binders in a chemical library or even a not fully characterized metabolite mixture. We will show how DOSY NMR and the "CoLD-CoP" method complement each other in identifying the most suitable candidates for lysozyme and wheat germ acid phosphatase.
Deterministic Joint Remote Preparation of an Arbitrary Sevenqubit Cluster-type State
NASA Astrophysics Data System (ADS)
Ding, MengXiao; Jiang, Min
2017-06-01
In this paper, we propose a scheme for joint remotely preparing an arbitrary seven-qubit cluster-type state by using several GHZ entangled states as the quantum channel. The coefficients of the prepared states can be not only real, but also complex. Firstly, Alice performs a three-qubit projective measurement according to the amplitude coefficients of the target state, and then Bob carries out another three-qubit projective measurement based on its phase coefficients. Next, one three-qubit state containing all information of the target state is prepared with suitable operation. Finally, the target seven-qubit cluster-type state can be prepared by introducing four auxiliary qubits and performing appropriate local unitary operations based on the prepared three-qubit state in a deterministic way. The receiver's all recovery operations are summarized into a concise formula. Furthermore, it's worth noting that our scheme is more novel and feasible with the present technologies than most other previous schemes.
Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G; Gu, C Charles
2014-11-01
Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of custom correlation coefficient (CCC) between single nucleotide polymorphisms (SNPs) that address genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs identified; and (3) frequencies of these clusters in disease cases and controls compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (six genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles were found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. © 2014 WILEY PERIODICALS, INC.
Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G.; Gu, C. Charles
2014-01-01
Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of Custom Correlation Coefficient (CCC) between single nucleotide polymorphisms (SNPs) that address genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs identified; and (3) frequencies of these clusters in disease cases and controls compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (6 genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles were found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. PMID:25168954
Clustering change patterns using Fourier transformation with time-course gene expression data.
Kim, Jaehee
2011-01-01
To understand the behavior of genes, it is important to explore how the patterns of gene expression change over a period of time because biologically related gene groups can share the same change patterns. In this study, the problem of finding similar change patterns is induced to clustering with the derivative Fourier coefficients. This work is aimed at discovering gene groups with similar change patterns which share similar biological properties. We developed a statistical model using derivative Fourier coefficients to identify similar change patterns of gene expression. We used a model-based method to cluster the Fourier series estimation of derivatives. We applied our model to cluster change patterns of yeast cell cycle microarray expression data with alpha-factor synchronization. It showed that, as the method clusters with the probability-neighboring data, the model-based clustering with our proposed model yielded biologically interpretable results. We expect that our proposed Fourier analysis with suitably chosen smoothing parameters could serve as a useful tool in classifying genes and interpreting possible biological change patterns.
Transmission Loss Calculation using A and B Loss Coefficients in Dynamic Economic Dispatch Problem
NASA Astrophysics Data System (ADS)
Jethmalani, C. H. Ram; Dumpa, Poornima; Simon, Sishaj P.; Sundareswaran, K.
2016-04-01
This paper analyzes the performance of A-loss coefficients while evaluating transmission losses in a Dynamic Economic Dispatch (DED) Problem. The performance analysis is carried out by comparing the losses computed using nominal A loss coefficients and nominal B loss coefficients in reference with load flow solution obtained by standard Newton-Raphson (NR) method. Density based clustering method based on connected regions with sufficiently high density (DBSCAN) is employed in identifying the best regions of A and B loss coefficients. Based on the results obtained through cluster analysis, a novel approach in improving the accuracy of network loss calculation is proposed. Here, based on the change in per unit load values between the load intervals, loss coefficients are updated for calculating the transmission losses. The proposed algorithm is tested and validated on IEEE 6 bus system, IEEE 14 bus, system IEEE 30 bus system and IEEE 118 bus system. All simulations are carried out using SCILAB 5.4 (www.scilab.org) which is an open source software.
A Symmetric Time-Varying Cluster Rate of Descent Model
NASA Technical Reports Server (NTRS)
Ray, Eric S.
2015-01-01
A model of the time-varying rate of descent of the Orion vehicle was developed based on the observed correlation between canopy projected area and drag coefficient. This initial version of the model assumes cluster symmetry and only varies the vertical component of velocity. The cluster fly-out angle is modeled as a series of sine waves based on flight test data. The projected area of each canopy is synchronized with the primary fly-out angle mode. The sudden loss of projected area during canopy collisions is modeled at minimum fly-out angles, leading to brief increases in rate of descent. The cluster geometry is converted to drag coefficient using empirically derived constants. A more complete model is under development, which computes the aerodynamic response of each canopy to its local incidence angle.
Subspace Clustering via Learning an Adaptive Low-Rank Graph.
Yin, Ming; Xie, Shengli; Wu, Zongze; Zhang, Yun; Gao, Junbin
2018-08-01
By using a sparse representation or low-rank representation of data, the graph-based subspace clustering has recently attracted considerable attention in computer vision, given its capability and efficiency in clustering data. However, the graph weights built using the representation coefficients are not the exact ones as the traditional definition is in a deterministic way. The two steps of representation and clustering are conducted in an independent manner, thus an overall optimal result cannot be guaranteed. Furthermore, it is unclear how the clustering performance will be affected by using this graph. For example, the graph parameters, i.e., the weights on edges, have to be artificially pre-specified while it is very difficult to choose the optimum. To this end, in this paper, a novel subspace clustering via learning an adaptive low-rank graph affinity matrix is proposed, where the affinity matrix and the representation coefficients are learned in a unified framework. As such, the pre-computed graph regularizer is effectively obviated and better performance can be achieved. Experimental results on several famous databases demonstrate that the proposed method performs better against the state-of-the-art approaches, in clustering.
NASA Astrophysics Data System (ADS)
Meng, Xiaocheng; Che, Renfei; Gao, Shi; He, Juntao
2018-04-01
With the advent of large data age, power system research has entered a new stage. At present, the main application of large data in the power system is the early warning analysis of the power equipment, that is, by collecting the relevant historical fault data information, the system security is improved by predicting the early warning and failure rate of different kinds of equipment under certain relational factors. In this paper, a method of line failure rate warning is proposed. Firstly, fuzzy dynamic clustering is carried out based on the collected historical information. Considering the imbalance between the attributes, the coefficient of variation is given to the corresponding weights. And then use the weighted fuzzy clustering to deal with the data more effectively. Then, by analyzing the basic idea and basic properties of the relational analysis model theory, the gray relational model is improved by combining the slope and the Deng model. And the incremental composition and composition of the two sequences are also considered to the gray relational model to obtain the gray relational degree between the various samples. The failure rate is predicted according to the principle of weighting. Finally, the concrete process is expounded by an example, and the validity and superiority of the proposed method are verified.
Peyrard, N; Dieckmann, U; Franc, A
2008-05-01
Models of infectious diseases are characterized by a phase transition between extinction and persistence. A challenge in contemporary epidemiology is to understand how the geometry of a host's interaction network influences disease dynamics close to the critical point of such a transition. Here we address this challenge with the help of moment closures. Traditional moment closures, however, do not provide satisfactory predictions close to such critical points. We therefore introduce a new method for incorporating longer-range correlations into existing closures. Our method is technically simple, remains computationally tractable and significantly improves the approximation's performance. Our extended closures thus provide an innovative tool for quantifying the influence of interaction networks on spatially or socially structured disease dynamics. In particular, we examine the effects of a network's clustering coefficient, as well as of new geometrical measures, such as a network's square clustering coefficients. We compare the relative performance of different closures from the literature, with or without our long-range extension. In this way, we demonstrate that the normalized version of the Bethe approximation-extended to incorporate long-range correlations according to our method-is an especially good candidate for studying influences of network structure. Our numerical results highlight the importance of the clustering coefficient and the square clustering coefficient for predicting disease dynamics at low and intermediate values of transmission rate, and demonstrate the significance of path redundancy for disease persistence.
Towards universal potentials for (H2)2 and isotopic variants: post-Born-Oppenheimer contributions.
Diniz, Leonardo G; Mohallem, José R
2008-06-07
Adiabatic corrections are evaluated for the interaction of two hydrogen molecules (H(2))(2) and isotopic variants. Their contribution to the cluster formation amount up to 10% of the interaction energy. Added to the best ab initio Born-Oppenheimer isotropic potential, they correct especially its short range repulsive part. Calculations of second virial coefficients are improved in general, with an impressive agreement with experiments for gaseous D(2) in a large range of temperatures. The potentials are available in both analytical and numerical forms.
The topology of a causal network for the Chinese financial system
NASA Astrophysics Data System (ADS)
Gao, Bo; Ren, Ruo-en
2013-07-01
The paper builds a causal network for the Chinese financial system based on the Granger causality of company risks, studies its different topologies in crisis and bull period, and applies the centrality to explain individual risk and prevent systemic risk. The results show that this causal network possesses both small-world phenomenon and scale-free property, and has a little different average distance, clustering coefficient, and degree distribution in different periods, and financial institutions with high centrality not only have large individual risk, but also are important for systemic risk immunization.
Nucleation and growth in cluster dynamics: A quantitative test of the classical kinetic approach
NASA Astrophysics Data System (ADS)
Gránásy, László; James, Peter F.
2000-12-01
Nucleation and size dependent growth of nanometer sized crystalline particles in glassy media have been studied by numerically solving the Turnbull-Fisher master equations that describe the time evolution of cluster population. Time dependencies of the formation rate and number density are determined for large clusters (built of up to 2×105 formula units, containing 1.8×106 atoms). We demonstrate that the formation rate and number density of such clusters are well approximated by Shneidman's asymptotically exact analytical solution. A quantitative test of the kinetic Turnbull-Fisher model has been performed: Evaluating the kinetic coefficients and interfacial parameters from the transient time and steady-state nucleation rates measured on six stoichiometric oxide glass compositions (lithium-disilicate, barium-disilicate, lithium-diborate, wollastonite, 1:2:3 and 2:1:3 soda-lime-silica glass compositions), we calculated the macroscopic growth rates and compared with experiments. For wollastonite, lithium-diborate and the 1:2:3 soda-lime-silica glass, differences of 2 to 4 orders of magnitude have been observed between theory and experiment. This inadequacy of the microscopic kinetic parameters in describing macroscopic growth cannot be explained by either the curvature effect on the interfacial free energy or the self-consistency correction for the cluster free energy. The origin of the discrepancy is discussed.
Aguirre von Wobeser, Eneas; Ibelings, Bas W.; Bok, Jasper; Krasikov, Vladimir; Huisman, Jef; Matthijs, Hans C.P.
2011-01-01
Physiological adaptation and genome-wide expression profiles of the cyanobacterium Synechocystis sp. strain PCC 6803 in response to gradual transitions between nitrogen-limited and light-limited growth conditions were measured in continuous cultures. Transitions induced changes in pigment composition, light absorption coefficient, photosynthetic electron transport, and specific growth rate. Physiological changes were accompanied by reproducible changes in the expression of several hundred open reading frames, genes with functions in photosynthesis and respiration, carbon and nitrogen assimilation, protein synthesis, phosphorus metabolism, and overall regulation of cell function and proliferation. Cluster analysis of the nearly 1,600 regulated open reading frames identified eight clusters, each showing a different temporal response during the transitions. Two large clusters mirrored each other. One cluster included genes involved in photosynthesis, which were up-regulated during light-limited growth but down-regulated during nitrogen-limited growth. Conversely, genes in the other cluster were down-regulated during light-limited growth but up-regulated during nitrogen-limited growth; this cluster included several genes involved in nitrogen uptake and assimilation. These results demonstrate complementary regulation of gene expression for two major metabolic activities of cyanobacteria. Comparison with batch-culture experiments revealed interesting differences in gene expression between batch and continuous culture and illustrates that continuous-culture experiments can pick up subtle changes in cell physiology and gene expression. PMID:21205618
Long-term surface EMG monitoring using K-means clustering and compressive sensing
NASA Astrophysics Data System (ADS)
Balouchestani, Mohammadreza; Krishnan, Sridhar
2015-05-01
In this work, we present an advanced K-means clustering algorithm based on Compressed Sensing theory (CS) in combination with the K-Singular Value Decomposition (K-SVD) method for Clustering of long-term recording of surface Electromyography (sEMG) signals. The long-term monitoring of sEMG signals aims at recording of the electrical activity produced by muscles which are very useful procedure for treatment and diagnostic purposes as well as for detection of various pathologies. The proposed algorithm is examined for three scenarios of sEMG signals including healthy person (sEMG-Healthy), a patient with myopathy (sEMG-Myopathy), and a patient with neuropathy (sEMG-Neuropathr), respectively. The proposed algorithm can easily scan large sEMG datasets of long-term sEMG recording. We test the proposed algorithm with Principal Component Analysis (PCA) and Linear Correlation Coefficient (LCC) dimensionality reduction methods. Then, the output of the proposed algorithm is fed to K-Nearest Neighbours (K-NN) and Probabilistic Neural Network (PNN) classifiers in order to calclute the clustering performance. The proposed algorithm achieves a classification accuracy of 99.22%. This ability allows reducing 17% of Average Classification Error (ACE), 9% of Training Error (TE), and 18% of Root Mean Square Error (RMSE). The proposed algorithm also reduces 14% clustering energy consumption compared to the existing K-Means clustering algorithm.
Richards, Todd L; Abbott, Robert D; Yagle, Kevin; Peterson, Dan; Raskind, Wendy; Berninger, Virginia W
2017-01-01
To understand mental self-government of the developing reading and writing brain, correlations of clustering coefficients on fMRI reading or writing tasks with BASC 2 Adaptivity ratings (time 1 only) or working memory components (time 1 before and time 2 after instruction previously shown to improve achievement and change magnitude of fMRI connectivity) were investigated in 39 students in grades 4 to 9 who varied along a continuum of reading and writing skills. A Philips 3T scanner measured connectivity during six leveled fMRI reading tasks (subword-letters and sounds, word-word-specific spellings or affixed words, syntax comprehension-with and without homonym foils or with and without affix foils, and text comprehension) and three fMRI writing tasks-writing next letter in alphabet, adding missing letter in word spelling, and planning for composing. The Brain Connectivity Toolbox generated clustering coefficients based on the cingulo-opercular (CO) network; after controlling for multiple comparisons and movement, significant fMRI connectivity clustering coefficients for CO were identified in 8 brain regions bilaterally (cingulate gyrus, superior frontal gyrus, middle frontal gyrus, inferior frontal gyrus, superior temporal gyrus, insula, cingulum-cingulate gyrus, and cingulum-hippocampus). BASC2 Parent Ratings for Adaptivity were correlated with CO clustering coefficients on three reading tasks (letter-sound, word affix judgments and sentence comprehension) and one writing task (writing next letter in alphabet). Before instruction, each behavioral working memory measure (phonology, orthography, morphology, and syntax coding, phonological and orthographic loops for integrating internal language and output codes, and supervisory focused and switching attention) correlated significantly with at least one CO clustering coefficient. After instruction, the patterning of correlations changed with new correlations emerging. Results show that the reading and writing brain's mental government, supported by both CO Adaptive Control and multiple working memory components, had changed in response to instruction during middle childhood/early adolescence.
Bradshaw, Peter L.; Colville, Jonathan F.; Linder, H. Peter
2015-01-01
We used a very large dataset (>40% of all species) from the endemic-rich Cape Floristic Region (CFR) to explore the impact of different weighting techniques, coefficients to calculate similarity among the cells, and clustering approaches on biogeographical regionalisation. The results were used to revise the biogeographical subdivision of the CFR. We show that weighted data (down-weighting widespread species), similarity calculated using Kulczinsky’s second measure, and clustering using UPGMA resulted in the optimal classification. This maximized the number of endemic species, the number of centres recognized, and operational geographic units assigned to centres of endemism (CoEs). We developed a dendrogram branch order cut-off (BOC) method to locate the optimal cut-off points on the dendrogram to define candidate clusters. Kulczinsky’s second measure dendrograms were combined using consensus, identifying areas of conflict which could be due to biotic element overlap or transitional areas. Post-clustering GIS manipulation substantially enhanced the endemic composition and geographic size of candidate CoEs. Although there was broad spatial congruence with previous phytogeographic studies, our techniques allowed for the recovery of additional phytogeographic detail not previously described for the CFR. PMID:26147438
NASA Astrophysics Data System (ADS)
Omenzetter, Piotr; de Lautour, Oliver R.
2010-04-01
Developed for studying long, periodic records of various measured quantities, time series analysis methods are inherently suited and offer interesting possibilities for Structural Health Monitoring (SHM) applications. However, their use in SHM can still be regarded as an emerging application and deserves more studies. In this research, Autoregressive (AR) models were used to fit experimental acceleration time histories from two experimental structural systems, a 3- storey bookshelf-type laboratory structure and the ASCE Phase II SHM Benchmark Structure, in healthy and several damaged states. The coefficients of the AR models were chosen as damage sensitive features. Preliminary visual inspection of the large, multidimensional sets of AR coefficients to check the presence of clusters corresponding to different damage severities was achieved using Sammon mapping - an efficient nonlinear data compression technique. Systematic classification of damage into states based on the analysis of the AR coefficients was achieved using two supervised classification techniques: Nearest Neighbor Classification (NNC) and Learning Vector Quantization (LVQ), and one unsupervised technique: Self-organizing Maps (SOM). This paper discusses the performance of AR coefficients as damage sensitive features and compares the efficiency of the three classification techniques using experimental data.
NASA Astrophysics Data System (ADS)
Sneath, P. H. A.
A BASIC program is presented for significance tests to determine whether a dendrogram is derived from clustering of points that belong to a single multivariate normal distribution. The significance tests are based on statistics of the Kolmogorov—Smirnov type, obtained by comparing the observed cumulative graph of branch levels with a graph for the hypothesis of multivariate normality. The program also permits testing whether the dendrogram could be from a cluster of lower dimensionality due to character correlations. The program makes provision for three similarity coefficients, (1) Euclidean distances, (2) squared Euclidean distances, and (3) Simple Matching Coefficients, and for five cluster methods (1) WPGMA, (2) UPGMA, (3) Single Linkage (or Minimum Spanning Trees), (4) Complete Linkage, and (5) Ward's Increase in Sums of Squares. The program is entitled DENBRAN.
NASA Astrophysics Data System (ADS)
Kirchner-Bossi, Nicolas; Befort, Daniel J.; Wild, Simon B.; Ulbrich, Uwe; Leckebusch, Gregor C.
2016-04-01
Time-clustered winter storms are responsible for a majority of the wind-induced losses in Europe. Over last years, different atmospheric and oceanic large-scale mechanisms as the North Atlantic Oscillation (NAO) or the Meridional Overturning Circulation (MOC) have been proven to drive some significant portion of the windstorm variability over Europe. In this work we systematically investigate the influence of different large-scale natural variability modes: more than 20 indices related to those mechanisms with proven or potential influence on the windstorm frequency variability over Europe - mostly SST- or pressure-based - are derived by means of ECMWF ERA-20C reanalysis during the last century (1902-2009), and compared to the windstorm variability for the European winter (DJF). Windstorms are defined and tracked as in Leckebusch et al. (2008). The derived indices are then employed to develop a statistical procedure including a stepwise Multiple Linear Regression (MLR) and an Artificial Neural Network (ANN), aiming to hindcast the inter-annual (DJF) regional windstorm frequency variability in a case study for the British Isles. This case study reveals 13 indices with a statistically significant coupling with seasonal windstorm counts. The Scandinavian Pattern (SCA) showed the strongest correlation (0.61), followed by the NAO (0.48) and the Polar/Eurasia Pattern (0.46). The obtained indices (standard-normalised) are selected as predictors for a windstorm variability hindcast model applied for the British Isles. First, a stepwise linear regression is performed, to identify which mechanisms can explain windstorm variability best. Finally, the indices retained by the stepwise regression are used to develop a multlayer perceptron-based ANN that hindcasted seasonal windstorm frequency and clustering. Eight indices (SCA, NAO, EA, PDO, W.NAtl.SST, AMO (unsmoothed), EA/WR and Trop.N.Atl SST) are retained by the stepwise regression. Among them, SCA showed the highest linear coefficient, followed by SST in western Atlantic, AMO and NAO. The explanatory regression model (considering all time steps) provided a Coefficient of Determination (R^2) of 0.75. A predictive version of the linear model applying a leave-one-out cross-validation (LOOCV) shows an R2 of 0.56 and a relative RMSE of 4.67 counts/season. An ANN-based nonlinear hindcast model for the seasonal windstorm frequency is developed with the aim to improve the stepwise hindcast ability and thus better predict a time-clustered season over the case study. A 7 node-hidden layer perceptron is set, and the LOOCV procedure reveals a R2 of 0.71. In comparison to the stepwise MLR the RMSE is reduced a 20%. This work shows that for the British Isles case study, most of the interannual variability can be explained by certain large-scale mechanisms, considering also nonlinear effects (ANN). This allows to discern a time-clustered season from a non-clustered one - a key issue for applications e.g., in the (re)insurance industry.
Property relationships of the physical infrastructure and the traffic flow networks
NASA Astrophysics Data System (ADS)
Zhou, Ta; Zou, Sheng-Rong; He, Da-Ren
2010-03-01
We studied both empirically and analytically the correlation between the degrees or the clustering coefficients, respectively, of the networks in the physical infrastructure and the traffic flow layers in three Chinese transportation systems. The systems are bus transportation systems in Beijing and Hangzhou, and the railway system in the mainland. It is found that the correlation between the degrees obey a linear function; while the correlation between the clustering coefficients obey a power law. A possible dynamic explanation on the rules is presented.
Greedy bases in rank 2 quantum cluster algebras
Lee, Kyungyong; Li, Li; Rupel, Dylan; Zelevinsky, Andrei
2014-01-01
We identify a quantum lift of the greedy basis for rank 2 coefficient-free cluster algebras. Our main result is that our construction does not depend on the choice of initial cluster, that it builds all cluster monomials, and that it produces bar-invariant elements. We also present several conjectures related to this quantum greedy basis and the triangular basis of Berenstein and Zelevinsky. PMID:24982182
Patterns in the English language: phonological networks, percolation and assembly models
NASA Astrophysics Data System (ADS)
Stella, Massimo; Brede, Markus
2015-05-01
In this paper we provide a quantitative framework for the study of phonological networks (PNs) for the English language by carrying out principled comparisons to null models, either based on site percolation, randomization techniques, or network growth models. In contrast to previous work, we mainly focus on null models that reproduce lower order characteristics of the empirical data. We find that artificial networks matching connectivity properties of the English PN are exceedingly rare: this leads to the hypothesis that the word repertoire might have been assembled over time by preferentially introducing new words which are small modifications of old words. Our null models are able to explain the ‘power-law-like’ part of the degree distributions and generally retrieve qualitative features of the PN such as high clustering, high assortativity coefficient and small-world characteristics. However, the detailed comparison to expectations from null models also points out significant differences, suggesting the presence of additional constraints in word assembly. Key constraints we identify are the avoidance of large degrees, the avoidance of triadic closure and the avoidance of large non-percolating clusters.
Srinivasan, A; Galbán, C J; Johnson, T D; Chenevert, T L; Ross, B D; Mukherji, S K
2010-04-01
Does the K-means algorithm do a better job of differentiating benign and malignant neck pathologies compared to only mean ADC? The objective of our study was to analyze the differences between ADC partitions to evaluate whether the K-means technique can be of additional benefit to whole-lesion mean ADC alone in distinguishing benign and malignant neck pathologies. MR imaging studies of 10 benign and 10 malignant proved neck pathologies were postprocessed on a PC by using in-house software developed in Matlab. Two neuroradiologists manually contoured the lesions, with the ADC values within each lesion clustered into 2 (low, ADC-ADC(L); high, ADC-ADC(H)) and 3 partitions (ADC(L); intermediate, ADC-ADC(I); ADC(H)) by using the K-means clustering algorithm. An unpaired 2-tailed Student t test was performed for all metrics to determine statistical differences in the means of the benign and malignant pathologies. A statistically significant difference between the mean ADC(L) clusters in benign and malignant pathologies was seen in the 3-cluster models of both readers (P = .03 and .022, respectively) and the 2-cluster model of reader 2 (P = .04), with the other metrics (ADC(H), ADC(I); whole-lesion mean ADC) not revealing any significant differences. ROC curves demonstrated the quantitative differences in mean ADC(H) and ADC(L) in both the 2- and 3-cluster models to be predictive of malignancy (2 clusters: P = .008, area under curve = 0.850; 3 clusters: P = .01, area under curve = 0.825). The K-means clustering algorithm that generates partitions of large datasets may provide a better characterization of neck pathologies and may be of additional benefit in distinguishing benign and malignant neck pathologies compared with whole-lesion mean ADC alone.
Competing Contact Processes on Homogeneous Networks with Tunable Clusterization
NASA Astrophysics Data System (ADS)
Rybak, Marcin; Kułakowski, Krzysztof
2013-03-01
We investigate two homogeneous networks: the Watts-Strogatz network with mean degree ⟨k⟩ = 4 and the Erdös-Rényi network with ⟨k⟩ = 10. In both kinds of networks, the clustering coefficient C is a tunable control parameter. The network is an area of two competing contact processes, where nodes can be in two states, S or D. A node S becomes D with probability 1 if at least two its mutually linked neighbors are D. A node D becomes S with a given probability p if at least one of its neighbors is S. The competition between the processes is described by a phase diagram, where the critical probability pc depends on the clustering coefficient C. For p > pc the rate of state S increases in time, seemingly to dominate in the whole system. Below pc, the majority of nodes is in the D-state. The numerical results indicate that for the Watts-Strogatz network the D-process is activated at the finite value of the clustering coefficient C, close to 0.3. On the contrary, for the Erdös-Rényi network the transition is observed at the whole investigated range of C.
Analysis of ligand-protein exchange by Clustering of Ligand Diffusion Coefficient Pairs (CoLD-CoP).
Snyder, David A; Chantova, Mihaela; Chaudhry, Saadia
2015-06-01
NMR spectroscopy is a powerful tool in describing protein structures and protein activity for pharmaceutical and biochemical development. This study describes a method to determine weak binding ligands in biological systems by using hierarchic diffusion coefficient clustering of multidimensional data obtained with a 400 MHz Bruker NMR. Comparison of DOSY spectrums of ligands of the chemical library in the presence and absence of target proteins show translational diffusion rates for small molecules upon interaction with macromolecules. For weak binders such as compounds found in fragment libraries, changes in diffusion rates upon macromolecular binding are on the order of the precision of DOSY diffusion measurements, and identifying such subtle shifts in diffusion requires careful statistical analysis. The "CoLD-CoP" (Clustering of Ligand Diffusion Coefficient Pairs) method presented here uses SAHN clustering to identify protein-binders in a chemical library or even a not fully characterized metabolite mixture. We will show how DOSY NMR and the "CoLD-CoP" method complement each other in identifying the most suitable candidates for lysozyme and wheat germ acid phosphatase. Copyright © 2015 Elsevier Inc. All rights reserved.
Cruz, Antonio M; Barr, Cameron; Puñales-Pozo, Elsa
2008-01-01
This research's main goals were to build a predictor for a turnaround time (TAT) indicator for estimating its values and use a numerical clustering technique for finding possible causes of undesirable TAT values. The following stages were used: domain understanding, data characterisation and sample reduction and insight characterisation. Building the TAT indicator multiple linear regression predictor and clustering techniques were used for improving corrective maintenance task efficiency in a clinical engineering department (CED). The indicator being studied was turnaround time (TAT). Multiple linear regression was used for building a predictive TAT value model. The variables contributing to such model were clinical engineering department response time (CE(rt), 0.415 positive coefficient), stock service response time (Stock(rt), 0.734 positive coefficient), priority level (0.21 positive coefficient) and service time (0.06 positive coefficient). The regression process showed heavy reliance on Stock(rt), CE(rt) and priority, in that order. Clustering techniques revealed the main causes of high TAT values. This examination has provided a means for analysing current technical service quality and effectiveness. In doing so, it has demonstrated a process for identifying areas and methods of improvement and a model against which to analyse these methods' effectiveness.
ERIC Educational Resources Information Center
Hale, Robert L.; Dougherty, Donna
1988-01-01
Compared the efficacy of two methods of cluster analysis, the unweighted pair-groups method using arithmetic averages (UPGMA) and Ward's method, for students grouped on intelligence, achievement, and social adjustment by both clustering methods. Found UPGMA more efficacious based on output, on cophenetic correlation coefficients generated by each…
Clustering coefficients of protein-protein interaction networks
NASA Astrophysics Data System (ADS)
Miller, Gerald A.; Shi, Yi Y.; Qian, Hong; Bomsztyk, Karol
2007-05-01
The properties of certain networks are determined by hidden variables that are not explicitly measured. The conditional probability (propagator) that a vertex with a given value of the hidden variable is connected to k other vertices determines all measurable properties. We study hidden variable models and find an averaging approximation that enables us to obtain a general analytical result for the propagator. Analytic results showing the validity of the approximation are obtained. We apply hidden variable models to protein-protein interaction networks (PINs) in which the hidden variable is the association free energy, determined by distributions that depend on biochemistry and evolution. We compute degree distributions as well as clustering coefficients of several PINs of different species; good agreement with measured data is obtained. For the human interactome two different parameter sets give the same degree distributions, but the computed clustering coefficients differ by a factor of about 2. This shows that degree distributions are not sufficient to determine the properties of PINs.
Generalization of Clustering Coefficients to Signed Correlation Networks
Costantini, Giulio; Perugini, Marco
2014-01-01
The recent interest in network analysis applications in personality psychology and psychopathology has put forward new methodological challenges. Personality and psychopathology networks are typically based on correlation matrices and therefore include both positive and negative edge signs. However, some applications of network analysis disregard negative edges, such as computing clustering coefficients. In this contribution, we illustrate the importance of the distinction between positive and negative edges in networks based on correlation matrices. The clustering coefficient is generalized to signed correlation networks: three new indices are introduced that take edge signs into account, each derived from an existing and widely used formula. The performances of the new indices are illustrated and compared with the performances of the unsigned indices, both on a signed simulated network and on a signed network based on actual personality psychology data. The results show that the new indices are more resistant to sample variations in correlation networks and therefore have higher convergence compared with the unsigned indices both in simulated networks and with real data. PMID:24586367
Sample size calculations for the design of cluster randomized trials: A summary of methodology.
Gao, Fei; Earnest, Arul; Matchar, David B; Campbell, Michael J; Machin, David
2015-05-01
Cluster randomized trial designs are growing in popularity in, for example, cardiovascular medicine research and other clinical areas and parallel statistical developments concerned with the design and analysis of these trials have been stimulated. Nevertheless, reviews suggest that design issues associated with cluster randomized trials are often poorly appreciated and there remain inadequacies in, for example, describing how the trial size is determined and the associated results are presented. In this paper, our aim is to provide pragmatic guidance for researchers on the methods of calculating sample sizes. We focus attention on designs with the primary purpose of comparing two interventions with respect to continuous, binary, ordered categorical, incidence rate and time-to-event outcome variables. Issues of aggregate and non-aggregate cluster trials, adjustment for variation in cluster size and the effect size are detailed. The problem of establishing the anticipated magnitude of between- and within-cluster variation to enable planning values of the intra-cluster correlation coefficient and the coefficient of variation are also described. Illustrative examples of calculations of trial sizes for each endpoint type are included. Copyright © 2015 Elsevier Inc. All rights reserved.
Pavlik, Valory N; Chan, Wenyaw; Hyman, David J; Feldman, Penny; Ogedegbe, Gbenga; Schwartz, Joseph E; McDonald, Margaret; Einhorn, Paula; Tobin, Jonathan N
2015-01-01
African-Americans (AAs) have a high prevalence of hypertension and their blood pressure (BP) control on treatment still lags behind other groups. In 2004, NHLBI funded five projects that aimed to evaluate clinically feasible interventions to effect changes in medical care delivery leading to an increased proportion of AA patients with controlled BP. Three of the groups performed a pooled analysis of trial results to determine: 1) the magnitude of the combined intervention effect; and 2) how the pooled results could inform the methodology for future health-system level BP interventions. Using a cluster randomized design, the trials enrolled AAs with uncontrolled hypertension to test interventions targeting a combination of patient and clinician behaviors. The 12-month Systolic BP (SBP) and Diastolic BP (DBP) effects of intervention or control cluster assignment were assessed using mixed effects longitudinal regression modeling. 2,015 patients representing 352 clusters participated across the three trials. Pooled BP slopes followed a quadratic pattern, with an initial decline, followed by a rise toward baseline, and did not differ significantly between intervention and control clusters: SBP linear coefficient = -2.60±0.21 mmHg per month, p<0.001; quadratic coefficient = 0.167± 0.02 mmHg/month, p<0.001; group by time interaction group by time group x linear time coefficient=0.145 ± 0.293, p=0.622; group x quadratic time coefficient= -0.017 ± 0.026, p=0.525). RESULTS were similar for DBP. The individual sites did not have significant intervention effects when analyzed separately. Investigators planning behavioral trials to improve BP control in health systems serving AAs should plan for small effect sizes and employ a "run-in" period in which BP can be expected to improve in both experimental and control clusters.
Bruno, Andrew E.; Ruby, Amanda M.; Luft, Joseph R.; Grant, Thomas D.; Seetharaman, Jayaraman; Montelione, Gaetano T.; Hunt, John F.; Snell, Edward H.
2014-01-01
Many bioscience fields employ high-throughput methods to screen multiple biochemical conditions. The analysis of these becomes tedious without a degree of automation. Crystallization, a rate limiting step in biological X-ray crystallography, is one of these fields. Screening of multiple potential crystallization conditions (cocktails) is the most effective method of probing a proteins phase diagram and guiding crystallization but the interpretation of results can be time-consuming. To aid this empirical approach a cocktail distance coefficient was developed to quantitatively compare macromolecule crystallization conditions and outcome. These coefficients were evaluated against an existing similarity metric developed for crystallization, the C6 metric, using both virtual crystallization screens and by comparison of two related 1,536-cocktail high-throughput crystallization screens. Hierarchical clustering was employed to visualize one of these screens and the crystallization results from an exopolyphosphatase-related protein from Bacteroides fragilis, (BfR192) overlaid on this clustering. This demonstrated a strong correlation between certain chemically related clusters and crystal lead conditions. While this analysis was not used to guide the initial crystallization optimization, it led to the re-evaluation of unexplained peaks in the electron density map of the protein and to the insertion and correct placement of sodium, potassium and phosphate atoms in the structure. With these in place, the resulting structure of the putative active site demonstrated features consistent with active sites of other phosphatases which are involved in binding the phosphoryl moieties of nucleotide triphosphates. The new distance coefficient, CDcoeff, appears to be robust in this application, and coupled with hierarchical clustering and the overlay of crystallization outcome, reveals information of biological relevance. While tested with a single example the potential applications related to crystallography appear promising and the distance coefficient, clustering, and hierarchal visualization of results undoubtedly have applications in wider fields. PMID:24971458
The study of RMB exchange rate complex networks based on fluctuation mode
NASA Astrophysics Data System (ADS)
Yao, Can-Zhong; Lin, Ji-Nan; Zheng, Xu-Zhou; Liu, Xiao-Feng
2015-10-01
In the paper, we research on the characteristics of RMB exchange rate time series fluctuation with methods of symbolization and coarse gaining. First, based on fluctuation features of RMB exchange rate, we define the first type of fluctuation mode as one specific foreign currency against RMB in four days' fluctuating situations, and the second type as four different foreign currencies against RMB in one day's fluctuating situation. With the transforming method, we construct the unique-currency and multi-currency complex networks. Further, through analyzing the topological features including out-degree, betweenness centrality and clustering coefficient of fluctuation-mode complex networks, we find that the out-degree distribution of both types of fluctuation mode basically follows power-law distributions with exponents between 1 and 2. The further analysis reveals that the out-degree and the clustering coefficient generally obey the approximated negative correlation. With this result, we confirm previous observations showing that the RMB exchange rate exhibits a characteristic of long-range memory. Finally, we analyze the most probable transmission route of fluctuation modes, and provide probability prediction matrix. The transmission route for RMB exchange rate fluctuation modes exhibits the characteristics of partially closed loop, repeat and reversibility, which lays a solid foundation for predicting RMB exchange rate fluctuation patterns with large volume of data.
Graph theoretical analysis of EEG functional connectivity during music perception.
Wu, Junjie; Zhang, Junsong; Liu, Chu; Liu, Dongwei; Ding, Xiaojun; Zhou, Changle
2012-11-05
The present study evaluated the effect of music on large-scale structure of functional brain networks using graph theoretical concepts. While most studies on music perception used Western music as an acoustic stimulus, Guqin music, representative of Eastern music, was selected for this experiment to increase our knowledge of music perception. Electroencephalography (EEG) was recorded from non-musician volunteers in three conditions: Guqin music, noise and silence backgrounds. Phase coherence was calculated in the alpha band and between all pairs of EEG channels to construct correlation matrices. Each resulting matrix was converted into a weighted graph using a threshold, and two network measures: the clustering coefficient and characteristic path length were calculated. Music perception was found to display a higher level mean phase coherence. Over the whole range of thresholds, the clustering coefficient was larger while listening to music, whereas the path length was smaller. Networks in music background still had a shorter characteristic path length even after the correction for differences in mean synchronization level among background conditions. This topological change indicated a more optimal structure under music perception. Thus, prominent small-world properties are confirmed in functional brain networks. Furthermore, music perception shows an increase of functional connectivity and an enhancement of small-world network organizations. Copyright © 2012 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Alfarizy, A. D.; Indahwati; Sartono, B.
2017-03-01
Indonesia is the largest Hollywood movie industry target market in Southeast Asia in 2015. Hollywood movies distributed in Indonesia targeted people in all range of ages including children. Low awareness of guiding children while watching movies make them could watch any rated films even the unsuitable ones for their ages. Even after being translated into Bahasa and passed the censorship phase, words that uncomfortable for children to watch still exist. The purpose of this research is to cluster box office Hollywood movies based on Indonesian subtitle, revenue, IMDb user rating and genres as one of the reference for adults to choose right movies for their children to watch. Text mining is used to extract words from the subtitles and count the frequency for three group of words (bad words, sexual words and terror words), while Partition Around Medoids (PAM) Algorithm with Gower similarity coefficient as proximity matrix is used as clustering method. We clustered 624 movies from 2006 until first half of 2016 from IMDb. Cluster with highest silhouette coefficient value (0.36) is the one with 5 clusters. Animation, Adventure and Comedy movies with high revenue like in cluster 5 is recommended for children to watch, while Comedy movies with high revenue like in cluster 4 should be avoided to watch.
An Empirical Taxonomy of Youths' Fears: Cluster Analysis of the American Fear Survey Schedule
ERIC Educational Resources Information Center
Burnham, Joy J.; Schaefer, Barbara A.; Giesen, Judy
2006-01-01
Fears profiles among children and adolescents were explored using the Fear Survey Schedule for Children-American version (FSSC-AM; J.J. Burnham, 1995, 2005). Eight cluster profiles were identified via multistage Euclidean grouping and supported by homogeneity coefficients and replication. Four clusters reflected overall level of fears (i.e., very…
ERIC Educational Resources Information Center
Yan, Jun; Aseltine, Robert H., Jr.; Harel, Ofer
2013-01-01
Comparing regression coefficients between models when one model is nested within another is of great practical interest when two explanations of a given phenomenon are specified as linear models. The statistical problem is whether the coefficients associated with a given set of covariates change significantly when other covariates are added into…
Transport coefficients of Quark-Gluon plasma with full QCD potential
NASA Astrophysics Data System (ADS)
J. P., Prasanth; Bannur, Vishnu M.
2018-05-01
The shear viscosity η, bulk viscosity ζ and their ratio with the entropy density, η / s, ζ / s have been studied in a quark-gluon plasma (QGP) within the cluster expansion method. The cluster expansion method allows us to include the interaction between the partons in the deconfined phase and to calculate the equation of state of quark-gluon plasma. It has been argued that the interactions present in the equation of state, the modified Cornell potential significantly contributes to the viscosity. The results obtained within our approaches agree with lattice quantum chromodynamics (LQCD) equation of state. We obtained η / s ≈ 0 . 128 within the temperature range T /Tc ∈ [ 0 . 9 , 1 . 5 ] which is very close to the theoretical lower bound η / s ≥ 1 /(4 π) in Yang-Mills theory. We also demonstrate that the effects of ζ / s at freezeout are possibly large.
Hydration of nonelectrolytes in binary aqueous solutions
NASA Astrophysics Data System (ADS)
Rudakov, A. M.; Sergievskii, V. V.
2010-10-01
Literature data on the thermodynamic properties of binary aqueous solutions of nonelectrolytes that show negative deviations from Raoult's law due largely to the contribution of the hydration of the solute are briefly surveyed. Attention is focused on simulating the thermodynamic properties of solutions using equations of the cluster model. It is shown that the model is based on the assumption that there exists a distribution of stoichiometric hydrates over hydration numbers. In terms of the theory of ideal associated solutions, the equations for activity coefficients, osmotic coefficients, vapor pressure, and excess thermodynamic functions (volume, Gibbs energy, enthalpy, entropy) are obtained in analytical form. Basic parameters in the equations are the hydration numbers of the nonelectrolyte (the mathematical expectation of the distribution of hydrates) and the dispersions of the distribution. It is concluded that the model equations adequately describe the thermodynamic properties of a wide range of nonelectrolytes partly or completely soluble in water.
Theoretical modelling on thermal expansion of Al, Ag and Cu nanomaterials
NASA Astrophysics Data System (ADS)
Manu, Mehul; Dubey, Vikash
2018-05-01
A simple theoretical model is developed for the calculating the coefficient of volume thermal expansion (CTE) and volume thermal expansion (VTE) of Al, Ag and Cu nanomaterials by considering the cubo-octahedral structure with the change of temperature and the cluster size. At the room temperature, the coefficient of volume thermal expansion decreases sharply below 20-25 nm and the decrement of the coefficient of volume thermal expansion becomes slower above 20-25 nm. We also saw a variation in the volume thermal expansion with the variation of temperature and cluster size. At a fixed cluster size, the volume thermal expansion increases with an increase of temperature at below the melting temperature and show a linear relation of volume thermal expansion with the temperature. At a constant temperature, the volume thermal expansion decreases rapidly with an increase in cluster size below 20-25 nm and after 20-25 nm the decrement of volume thermal expansion becomes slower with the increase of the size of the cluster. Thermal expansion is due to the anharmonicity of the atom interaction. As the temperature rises the amplitude of crystal lattice vibration increases, but the equilibrium distance shifts as the atom spend more time at distance greater than the original spacing due as the repulsion at short distance greater than the corresponding attraction at farther distance. In considering the cubo- octahedral structure with the cluster order, the model prediction on the CTE and the VTE are in good agreement with the available experimental data which demonstrate the validity of our work.
Saxena, Raghvendra; Chandra, Amaresh
2011-11-01
Transferability of sequence-tagged-sites (STS) markers was assessed for genetic relationships study among accessions of marvel grass (Dichanthium annulatum Forsk.). In total, 17 STS primers of Stylosanthes origin were tested for their reactivity with thirty accessions of Dichanthium annulatum. Of these, 14 (82.4%) reacted and a total 106 (84 polymorphic) bands were scored. The number of bands generated by individual primer pairs ranged from 4 to 11 with an average of 7.57 bands, whereas polymorphic bands ranged from 4 to 9 with an average of 6.0 bands accounts to an average polymorphism of 80.1%. Polymorphic information content (PIC) ranged from 0.222 to 0.499 and marker index (MI) from 1.33 to 4.49. Utilizing Dice coefficient of genetic similarity dendrogram was generated through un-weighted pairgroup method with arithmetic mean (UPGMA) algorithm. Further, clustering through sequential agglomerative hierarchical and nested (SAHN) method resulted three main clusters constituted all accessions except IGBANG-D-2. Though there was intermixing of few accessions of one agro-climatic region to another, largely groupings of accessions were with their regions of collections. Bootstrap analysis at 1000 scale also showed large number of nodes (11 to 17) having strong clustering (> 50). Thus, results demonstrate the utility of STS markers of Stylosanthes in studying the genetic relationships among accessions of Dichanthium.
NASA Technical Reports Server (NTRS)
Scott, Carl D.; Smalley, Richard E.
2003-01-01
The high-pressure carbon monoxide (HiPco) process for producing single-wall carbon nanotubes (SWNTs) uses iron pentacarbonyl as the source of iron for catalyzing the Boudouard reaction. Attempts using nickel tetracarbonyl led to no production of SWNTs. This paper discusses simulations at a constant condition of 1300 K and 30 atm in which the chemical rate equations are solved for different reaction schemes. A lumped cluster model is developed to limit the number of species in the models, yet it includes fairly large clusters. Reaction rate coefficients in these schemes are based on bond energies of iron and nickel species and on estimates of chemical rates for formation of SWNTs. SWNT growth is measured by the conformation of CO2. It is shown that the production of CO2 is significantly greater for FeCO because of its lower bond energy as compared with that of NiCO. It is also shown that the dissociation and evaporation rates of atoms from small metal clusters have a significant effect on CO2 production. A high rate of evaporation leads to a smaller number of metal clusters available to catalyze the Boudouard reaction. This suggests that if CO reacts with metal clusters and removes atoms from them by forming MeCO, this has the effect of enhancing the evaporation rate and reducing SWNT production. The study also investigates some other reactions in the model that have a less dramatic influence.
NASA Technical Reports Server (NTRS)
Scott, Carl D.; Smalley, Richard E.
2002-01-01
The high-pressure carbon monoxide (HiPco) process for producing single-wall carbon nanotubes (SWNT) uses iron pentacarbonyl as the source of iron for catalyzing the Boudouard reaction. Attempts using nickel tetracarbonyl led to no production of SWNTs. This paper discusses simulations at a constant condition of 1300 K and 30 atm in which the chemical rate equations are solved for different reaction schemes. A lumped cluster model is developed to limit the number of species in the models, yet it includes fairly large clusters. Reaction rate coefficients in these schemes are based on bond energies of iron and nickel species and on estimates of chemical rates for formation of SWNTs. SWNT growth is measured by the co-formation of CO2. It is shown that the production of CO2 is significantly greater for FeCO due to its lower bond energy as compared with that ofNiCO. It is also shown that the dissociation and evaporation rates of atoms from small metal clusters have a significant effect on CO2 production. A high rate of evaporation leads to a smaller number of metal clusters available to catalyze the Boudouard reaction. This suggests that if CO reacts with metal clusters and removes atoms from them by forming MeCO, this has the effect of enhancing the evaporation rate and reducing SWNT production. The study also investigates some other reactions in the model that have a less dramatic influence.
Modular and hierarchical structure of social contact networks
NASA Astrophysics Data System (ADS)
Ge, Yuanzheng; Song, Zhichao; Qiu, Xiaogang; Song, Hongbin; Wang, Yong
2013-10-01
Social contact networks exhibit overlapping qualities of communities, hierarchical structure and spatial-correlated nature. We propose a mixing pattern of modular and growing hierarchical structures to reconstruct social contact networks by using an individual’s geospatial distribution information in the real world. The hierarchical structure of social contact networks is defined based on the spatial distance between individuals, and edges among individuals are added in turn from the modular layer to the highest layer. It is a gradual process to construct the hierarchical structure: from the basic modular model up to the global network. The proposed model not only shows hierarchically increasing degree distribution and large clustering coefficients in communities, but also exhibits spatial clustering features of individual distributions. As an evaluation of the method, we reconstruct a hierarchical contact network based on the investigation data of a university. Transmission experiments of influenza H1N1 are carried out on the generated social contact networks, and results show that the constructed network is efficient to reproduce the dynamic process of an outbreak and evaluate interventions. The reproduced spread process exhibits that the spatial clustering of infection is accordant with the clustering of network topology. Moreover, the effect of individual topological character on the spread of influenza is analyzed, and the experiment results indicate that the spread is limited by individual daily contact patterns and local clustering topology rather than individual degree.
Shyamalamma, S; Chandra, S B C; Hegde, M; Naryanswamy, P
2008-07-22
Artocarpus heterophyllus Lam., commonly called jackfruit, is a medium-sized evergreen tree that bears high yields of the largest known edible fruit. Yet, it has been little explored commercially due to wide variation in fruit quality. The genetic diversity and genetic relatedness of 50 jackfruit accessions were studied using amplified fragment length polymorphism markers. Of 16 primer pairs evaluated, eight were selected for screening of genotypes based on the number and quality of polymorphic fragments produced. These primer combinations produced 5976 bands, 1267 (22%) of which were polymorphic. Among the jackfruit accessions, the similarity coefficient ranged from 0.137 to 0.978; the accessions also shared a large number of monomorphic fragments (78%). Cluster analysis and principal component analysis grouped all jackfruit genotypes into three major clusters. Cluster I included the genotypes grown in a jackfruit region of Karnataka, called Tamaka, with very dry conditions; cluster II contained the genotypes collected from locations having medium to heavy rainfall in Karnataka; cluster III grouped the genotypes in distant locations with different environmental conditions. Strong coincidence of these amplified fragment length polymorphism-based groupings with geographical localities as well as morphological characters was observed. We found moderate genetic diversity in these jackfruit accessions. This information should be useful for tree breeding programs, as part of our effort to popularize jackfruit as a commercial crop.
Differential dynamic microscopy of weakly scattering and polydisperse protein-rich clusters
NASA Astrophysics Data System (ADS)
Safari, Mohammad S.; Vorontsova, Maria A.; Poling-Skutvik, Ryan; Vekilov, Peter G.; Conrad, Jacinta C.
2015-10-01
Nanoparticle dynamics impact a wide range of biological transport processes and applications in nanomedicine and natural resource engineering. Differential dynamic microscopy (DDM) was recently developed to quantify the dynamics of submicron particles in solutions from fluctuations of intensity in optical micrographs. Differential dynamic microscopy is well established for monodisperse particle populations, but has not been applied to solutions containing weakly scattering polydisperse biological nanoparticles. Here we use bright-field DDM (BDDM) to measure the dynamics of protein-rich liquid clusters, whose size ranges from tens to hundreds of nanometers and whose total volume fraction is less than 10-5. With solutions of two proteins, hemoglobin A and lysozyme, we evaluate the cluster diffusion coefficients from the dependence of the diffusive relaxation time on the scattering wave vector. We establish that for weakly scattering populations, an optimal thickness of the sample chamber exists at which the BDDM signal is maximized at the smallest sample volume. The average cluster diffusion coefficient measured using BDDM is consistently lower than that obtained from dynamic light scattering at a scattering angle of 90∘. This apparent discrepancy is due to Mie scattering from the polydisperse cluster population, in which larger clusters preferentially scatter more light in the forward direction.
Mawditt, Claire; Sacker, Amanda; Britton, Annie; Kelly, Yvonne; Cable, Noriko
2018-05-01
Building upon evidence linking socio-economic position (SEP) in childhood and adulthood with health-related behaviours (HRB) in adulthood, we examined how pre-adolescent SEP predicted membership of three HRB clusters: "Risky", "Moderate Smokers" and "Mainstream" (the latter pattern consisting of more beneficial HRBs), that were detected in our previous work. Data were taken from two British cohorts (born in 1958 and 1970) in pre-adolescence (age 11 and 10, respectively) and adulthood (age 33 and 34). SEP constructs in pre-adolescence and adulthood were derived through Confirmatory Factor Analysis. Conceptualised paths from pre-adolescent SEP to HRB cluster membership via adult SEP in our path models were tested for statistical significance separately by gender and cohort. Adult SEP mediated the path between pre-adolescent SEP and adult HRB clusters. More disadvantaged SEP in pre-adolescence predicted more disadvantaged SEP in adulthood which was associated with membership of the "Risky" and "Moderate Smokers" clusters compared to the "Mainstream" cluster. For example, large positive indirect effects between pre-adolescent SEP and adult HRB via adult SEP were present (coefficient 1958 Women = 0.39; 1970 Women = 0.36, 1958 Men = 0.51; 1970 Men = 0.39; p < 0.01) when comparing "Risky" and "Mainstream" cluster membership. Amongst men we found a small significant direct association (p < 0.001) between pre-adolescent SEP and HRB cluster membership. Our findings suggest that associations between adult SEP and HRBs are not likely to be pre-determined by earlier social circumstances, providing optimism for interventions relevant to reducing social gradients in HRBs. Observing consistent findings across the cohorts implies the social patterning of adult lifestyles may persist across time. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Direct construction of mesoscopic models from microscopic simulations
NASA Astrophysics Data System (ADS)
Lei, Huan; Caswell, Bruce; Karniadakis, George Em
2010-02-01
Starting from microscopic molecular-dynamics (MD) simulations of constrained Lennard-Jones (LJ) clusters (with constant radius of gyration Rg ), we construct two mesoscopic models [Langevin dynamics and dissipative particle dynamics (DPD)] by coarse graining the LJ clusters into single particles. Both static and dynamic properties of the coarse-grained models are investigated and compared with the MD results. The effective mean force field is computed as a function of the intercluster distance, and the corresponding potential scales linearly with the number of particles per cluster and the temperature. We verify that the mean force field can reproduce the equation of state of the atomistic systems within a wide density range but the radial distribution function only within the dilute and the semidilute regime. The friction force coefficients for both models are computed directly from the time-correlation function of the random force field of the microscopic system. For high density or a large cluster size the friction force is overestimated and the diffusivity underestimated due to the omission of many-body effects as a result of the assumed pairwise form of the coarse-grained force field. When the many-body effect is not as pronounced (e.g., smaller Rg or semidilute system), the DPD model can reproduce the dynamic properties of the MD system.
NASA Astrophysics Data System (ADS)
Thanos, Konstantinos-Georgios; Thomopoulos, Stelios C. A.
2014-06-01
The study in this paper belongs to a more general research of discovering facial sub-clusters in different ethnicity face databases. These new sub-clusters along with other metadata (such as race, sex, etc.) lead to a vector for each face in the database where each vector component represents the likelihood of participation of a given face to each cluster. This vector is then used as a feature vector in a human identification and tracking system based on face and other biometrics. The first stage in this system involves a clustering method which evaluates and compares the clustering results of five different clustering algorithms (average, complete, single hierarchical algorithm, k-means and DIGNET), and selects the best strategy for each data collection. In this paper we present the comparative performance of clustering results of DIGNET and four clustering algorithms (average, complete, single hierarchical and k-means) on fabricated 2D and 3D samples, and on actual face images from various databases, using four different standard metrics. These metrics are the silhouette figure, the mean silhouette coefficient, the Hubert test Γ coefficient, and the classification accuracy for each clustering result. The results showed that, in general, DIGNET gives more trustworthy results than the other algorithms when the metrics values are above a specific acceptance threshold. However when the evaluation results metrics have values lower than the acceptance threshold but not too low (too low corresponds to ambiguous results or false results), then it is necessary for the clustering results to be verified by the other algorithms.
Gulliford, Martin C; van Staa, Tjeerd P; McDermott, Lisa; McCann, Gerard; Charlton, Judith; Dregan, Alex
2014-06-11
There is growing interest in conducting clinical and cluster randomized trials through electronic health records. This paper reports on the methodological issues identified during the implementation of two cluster randomized trials using the electronic health records of the Clinical Practice Research Datalink (CPRD). Two trials were completed in primary care: one aimed to reduce inappropriate antibiotic prescribing for acute respiratory infection; the other aimed to increase physician adherence with secondary prevention interventions after first stroke. The paper draws on documentary records and trial datasets to report on the methodological experience with respect to research ethics and research governance approval, general practice recruitment and allocation, sample size calculation and power, intervention implementation, and trial analysis. We obtained research governance approvals from more than 150 primary care organizations in England, Wales, and Scotland. There were 104 CPRD general practices recruited to the antibiotic trial and 106 to the stroke trial, with the target number of practices being recruited within six months. Interventions were installed into practice information systems remotely over the internet. The mean number of participants per practice was 5,588 in the antibiotic trial and 110 in the stroke trial, with the coefficient of variation of practice sizes being 0.53 and 0.56 respectively. Outcome measures showed substantial correlations between the 12 months before, and after intervention, with coefficients ranging from 0.42 for diastolic blood pressure to 0.91 for proportion of consultations with antibiotics prescribed, defining practice and participant eligibility for analysis requires careful consideration. Cluster randomized trials may be performed efficiently in large samples from UK general practices using the electronic health records of a primary care database. The geographical dispersal of trial sites presents a difficulty for research governance approval and intervention implementation. Pretrial data analyses should inform trial design and analysis plans. Current Controlled Trials ISRCTN 47558792 and ISRCTN 35701810 (both registered on 17 March 2010).
2014-01-01
Background There is growing interest in conducting clinical and cluster randomized trials through electronic health records. This paper reports on the methodological issues identified during the implementation of two cluster randomized trials using the electronic health records of the Clinical Practice Research Datalink (CPRD). Methods Two trials were completed in primary care: one aimed to reduce inappropriate antibiotic prescribing for acute respiratory infection; the other aimed to increase physician adherence with secondary prevention interventions after first stroke. The paper draws on documentary records and trial datasets to report on the methodological experience with respect to research ethics and research governance approval, general practice recruitment and allocation, sample size calculation and power, intervention implementation, and trial analysis. Results We obtained research governance approvals from more than 150 primary care organizations in England, Wales, and Scotland. There were 104 CPRD general practices recruited to the antibiotic trial and 106 to the stroke trial, with the target number of practices being recruited within six months. Interventions were installed into practice information systems remotely over the internet. The mean number of participants per practice was 5,588 in the antibiotic trial and 110 in the stroke trial, with the coefficient of variation of practice sizes being 0.53 and 0.56 respectively. Outcome measures showed substantial correlations between the 12 months before, and after intervention, with coefficients ranging from 0.42 for diastolic blood pressure to 0.91 for proportion of consultations with antibiotics prescribed, defining practice and participant eligibility for analysis requires careful consideration. Conclusions Cluster randomized trials may be performed efficiently in large samples from UK general practices using the electronic health records of a primary care database. The geographical dispersal of trial sites presents a difficulty for research governance approval and intervention implementation. Pretrial data analyses should inform trial design and analysis plans. Trial registration Current Controlled Trials ISRCTN 47558792 and ISRCTN 35701810 (both registered on 17 March 2010). PMID:24919485
Link prediction with node clustering coefficient
NASA Astrophysics Data System (ADS)
Wu, Zhihao; Lin, Youfang; Wang, Jing; Gregory, Steve
2016-06-01
Predicting missing links in incomplete complex networks efficiently and accurately is still a challenging problem. The recently proposed Cannistrai-Alanis-Ravai (CAR) index shows the power of local link/triangle information in improving link-prediction accuracy. Inspired by the idea of employing local link/triangle information, we propose a new similarity index with more local structure information. In our method, local link/triangle structure information can be conveyed by clustering coefficient of common-neighbors directly. The reason why clustering coefficient has good effectiveness in estimating the contribution of a common-neighbor is that it employs links existing between neighbors of a common-neighbor and these links have the same structural position with the candidate link to this common-neighbor. In our experiments, three estimators: precision, AUP and AUC are used to evaluate the accuracy of link prediction algorithms. Experimental results on ten tested networks drawn from various fields show that our new index is more effective in predicting missing links than CAR index, especially for networks with low correlation between number of common-neighbors and number of links between common-neighbors.
Low Temperature Kinetics of the First Steps of Water Cluster Formation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bourgalais, J.; Roussel, V.; Capron, M.
2016-03-01
We present a combined experimental and theoretical low temperature kinetic study of water cluster formation. Water cluster growth takes place in low temperature (23-69 K) supersonic flows. The observed kinetics of formation of water clusters are reproduced with a kinetic model based on theoretical predictions for the first steps of clusterization. The temperature-and pressure-dependent association and dissociation rate coefficients are predicted with an ab initio transition state theory based master equation approach over a wide range of temperatures (20-100 K) and pressures (10(-6) - 10 bar).
Large-scale brain networks are distinctly affected in right and left mesial temporal lobe epilepsy.
de Campos, Brunno Machado; Coan, Ana Carolina; Lin Yasuda, Clarissa; Casseb, Raphael Fernandes; Cendes, Fernando
2016-09-01
Mesial temporal lobe epilepsy (MTLE) with hippocampus sclerosis (HS) is associated with functional and structural alterations extending beyond the temporal regions and abnormal pattern of brain resting state networks (RSNs) connectivity. We hypothesized that the interaction of large-scale RSNs is differently affected in patients with right- and left-MTLE with HS compared to controls. We aimed to determine and characterize these alterations through the analysis of 12 RSNs, functionally parceled in 70 regions of interest (ROIs), from resting-state functional-MRIs of 99 subjects (52 controls, 26 right- and 21 left-MTLE patients with HS). Image preprocessing and statistical analysis were performed using UF(2) C-toolbox, which provided ROI-wise results for intranetwork and internetwork connectivity. Intranetwork abnormalities were observed in the dorsal default mode network (DMN) in both groups of patients and in the posterior salience network in right-MTLE. Both groups showed abnormal correlation between the dorsal-DMN and the posterior salience, as well as between the dorsal-DMN and the executive-control network. Patients with left-MTLE also showed reduced correlation between the dorsal-DMN and visuospatial network and increased correlation between bilateral thalamus and the posterior salience network. The ipsilateral hippocampus stood out as a central area of abnormalities. Alterations on left-MTLE expressed a low cluster coefficient, whereas the altered connections on right-MTLE showed low cluster coefficient in the DMN but high in the posterior salience regions. Both right- and left-MTLE patients with HS have widespread abnormal interactions of large-scale brain networks; however, all parameters evaluated indicate that left-MTLE has a more intricate bihemispheric dysfunction compared to right-MTLE. Hum Brain Mapp 37:3137-3152, 2016. © 2016 The Authors Human Brain Mapping Published by Wiley Periodicals, Inc. © 2016 The Authors Human Brain Mapping Published by Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Mandelbaum, Rachel; Slosar, Anže; Baldauf, Tobias; Seljak, Uroš; Hirata, Christopher M.; Nakajima, Reiko; Reyes, Reinabelle; Smith, Robert E.
2013-06-01
Recent studies have shown that the cross-correlation coefficient between galaxies and dark matter is very close to unity on scales outside a few virial radii of galaxy haloes, independent of the details of how galaxies populate dark matter haloes. This finding makes it possible to determine the dark matter clustering from measurements of galaxy-galaxy weak lensing and galaxy clustering. We present new cosmological parameter constraints based on large-scale measurements of spectroscopic galaxy samples from the Sloan Digital Sky Survey (SDSS) data release 7. We generalize the approach of Baldauf et al. to remove small-scale information (below 2 and 4 h-1 Mpc for lensing and clustering measurements, respectively), where the cross-correlation coefficient differs from unity. We derive constraints for three galaxy samples covering 7131 deg2, containing 69 150, 62 150 and 35 088 galaxies with mean redshifts of 0.11, 0.28 and 0.40. We clearly detect scale-dependent galaxy bias for the more luminous galaxy samples, at a level consistent with theoretical expectations. When we vary both σ8 and Ωm (and marginalize over non-linear galaxy bias) in a flat Λ cold dark matter model, the best-constrained quantity is σ8(Ωm/0.25)0.57 = 0.80 ± 0.05 (1σ, stat. + sys.), where statistical and systematic errors (photometric redshift and shear calibration) have comparable contributions, and we have fixed ns = 0.96 and h = 0.7. These strong constraints on the matter clustering suggest that this method is competitive with cosmic shear in current data, while having very complementary and in some ways less serious systematics. We therefore expect that this method will play a prominent role in future weak lensing surveys. When we combine these data with Wilkinson Microwave Anisotropy Probe 7-year (WMAP7) cosmic microwave background (CMB) data, constraints on σ8, Ωm, H0, wde and ∑mν become 30-80 per cent tighter than with CMB data alone, since our data break several parameter degeneracies.
A Hybrid Approach for CpG Island Detection in the Human Genome.
Yang, Cheng-Hong; Lin, Yu-Da; Chiang, Yi-Cheng; Chuang, Li-Yeh
2016-01-01
CpG islands have been demonstrated to influence local chromatin structures and simplify the regulation of gene activity. However, the accurate and rapid determination of CpG islands for whole DNA sequences remains experimentally and computationally challenging. A novel procedure is proposed to detect CpG islands by combining clustering technology with the sliding-window method (PSO-based). Clustering technology is used to detect the locations of all possible CpG islands and process the data, thus effectively obviating the need for the extensive and unnecessary processing of DNA fragments, and thus improving the efficiency of sliding-window based particle swarm optimization (PSO) search. This proposed approach, named ClusterPSO, provides versatile and highly-sensitive detection of CpG islands in the human genome. In addition, the detection efficiency of ClusterPSO is compared with eight CpG island detection methods in the human genome. Comparison of the detection efficiency for the CpG islands in human genome, including sensitivity, specificity, accuracy, performance coefficient (PC), and correlation coefficient (CC), ClusterPSO revealed superior detection ability among all of the test methods. Moreover, the combination of clustering technology and PSO method can successfully overcome their respective drawbacks while maintaining their advantages. Thus, clustering technology could be hybridized with the optimization algorithm method to optimize CpG island detection. The prediction accuracy of ClusterPSO was quite high, indicating the combination of CpGcluster and PSO has several advantages over CpGcluster and PSO alone. In addition, ClusterPSO significantly reduced implementation time.
Ritenberga, Olga; Sofiev, Mikhail; Siljamo, Pilvi; Saarto, Annika; Dahl, Aslog; Ekebom, Agneta; Sauliene, Ingrida; Shalaboda, Valentina; Severova, Elena; Hoebeke, Lucie; Ramfjord, Hallvard
2018-02-15
The paper suggests a methodology for predicting next-year seasonal pollen index (SPI, a sum of daily-mean pollen concentrations) over large regions and demonstrates its performance for birch in Northern and North-Eastern Europe. A statistical model is constructed using meteorological, geophysical and biological characteristics of the previous year). A cluster analysis of multi-annual data of European Aeroallergen Network (EAN) revealed several large regions in Europe, where the observed SPI exhibits similar patterns of the multi-annual variability. We built the model for the northern cluster of stations, which covers Finland, Sweden, Baltic States, part of Belarus, and, probably, Russia and Norway, where the lack of data did not allow for conclusive analysis. The constructed model was capable of predicting the SPI with correlation coefficient reaching up to 0.9 for some stations, odds ratio is infinitely high for 50% of sites inside the region and the fraction of prediction falling within factor of 2 from observations, stays within 40-70%. In particular, model successfully reproduced both the bi-annual cycle of the SPI and years when this cycle breaks down. Copyright © 2017 Elsevier B.V. All rights reserved.
Search for Directed Networks by Different Random Walk Strategies
NASA Astrophysics Data System (ADS)
Zhu, Zi-Qi; Jin, Xiao-Ling; Huang, Zhi-Long
2012-03-01
A comparative study is carried out on the efficiency of five different random walk strategies searching on directed networks constructed based on several typical complex networks. Due to the difference in search efficiency of the strategies rooted in network clustering, the clustering coefficient in a random walker's eye on directed networks is defined and computed to be half of the corresponding undirected networks. The search processes are performed on the directed networks based on Erdös—Rényi model, Watts—Strogatz model, Barabási—Albert model and clustered scale-free network model. It is found that self-avoiding random walk strategy is the best search strategy for such directed networks. Compared to unrestricted random walk strategy, path-iteration-avoiding random walks can also make the search process much more efficient. However, no-triangle-loop and no-quadrangle-loop random walks do not improve the search efficiency as expected, which is different from those on undirected networks since the clustering coefficient of directed networks are smaller than that of undirected networks.
Optimized data fusion for K-means Laplacian clustering
Yu, Shi; Liu, Xinhai; Tranchevent, Léon-Charles; Glänzel, Wolfgang; Suykens, Johan A. K.; De Moor, Bart; Moreau, Yves
2011-01-01
Motivation: We propose a novel algorithm to combine multiple kernels and Laplacians for clustering analysis. The new algorithm is formulated on a Rayleigh quotient objective function and is solved as a bi-level alternating minimization procedure. Using the proposed algorithm, the coefficients of kernels and Laplacians can be optimized automatically. Results: Three variants of the algorithm are proposed. The performance is systematically validated on two real-life data fusion applications. The proposed Optimized Kernel Laplacian Clustering (OKLC) algorithms perform significantly better than other methods. Moreover, the coefficients of kernels and Laplacians optimized by OKLC show some correlation with the rank of performance of individual data source. Though in our evaluation the K values are predefined, in practical studies, the optimal cluster number can be consistently estimated from the eigenspectrum of the combined kernel Laplacian matrix. Availability: The MATLAB code of algorithms implemented in this paper is downloadable from http://homes.esat.kuleuven.be/~sistawww/bioi/syu/oklc.html. Contact: shiyu@uchicago.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20980271
Oxygen diffusion in alpha-Al2O3. Ph.D. Thesis
NASA Technical Reports Server (NTRS)
Cawley, J. D.; Halloran, J. W.; Cooper, A. R.
1984-01-01
Oxygen self diffusion coefficients were determined in single crystal alpha-Al2O3 using the gas exchange technique. The samples were semi-infinite slabs cut from five different boules with varying background impurities. The diffusion direction was parallel to the c-axis. The tracer profiles were determined by two techniques, single spectrum proton activation and secondary ion mass spectrometry. The SIMS proved to be a more useful tool. The determined diffusion coefficients, which were insensitive to impurity levels and oxygen partial pressure, could be described by D = .00151 exp (-572kJ/RT) sq m/s. The insensitivities are discussed in terms of point defect clustering. Two independent models are consistent with the findings, the first considers the clusters as immobile point defect traps which buffer changes in the defect chemistry. The second considers clusters to be mobile and oxygen diffusion to be intrinsic behavior, the mechanism for oxygen transport involving neutral clusters of Schottky quintuplets.
Amatore, Christian; Oleinick, Alexander I; Klymenko, Oleksiy V; Svir, Irina
2009-07-13
Breaking of symmetry is often required in biology in order to produce a specific function. In this work we address the problem of protein diffusion over a spherical vesicle surface towards one pole of the vesicle in order to produce ultimately an active protein cluster performing a specific biological function. Such a process is, for example, prerequisite for the assembling of proteins which then cooperatively catalyze the polymerization of actin monomers to sustain the growth of actin tails as occurs in natural vesicles such as those contained in Xenopus eggs. By this process such vesicles may propel themselves within the cell by the principle of action-reaction. In this work the physicochemical treatment of diffusion of large biomolecules within a cellular membrane is extended to encompass the case when proteins may be transiently poised by corral-like structures partitioning the membrane as has been recently documented in the literature. In such case the exchange of proteins between adjacent corrals occurs by energy-gated transitions instead of classical Brownian motion, yet the present analysis shows that long-range movements of the biomolecules may still be described by a classical diffusion law though the diffusion coefficient has then a different physical meaning. Such a model explains why otherwise classical diffusion of proteins may give rise to too small diffusion coefficients compared to predictions based on the protein dimension. This model is implemented to examine the rate of proteins clustering at one pole of a spherical vesicle and its outcome is discussed in relevance to the mechanism of actin comet tails growth.
Geng, Shujie; Liu, Xiangyu; Biswal, Bharat B; Niu, Haijing
2017-01-01
As an emerging brain imaging technique, functional near infrared spectroscopy (fNIRS) has attracted widespread attention for advancing resting-state functional connectivity (FC) and graph theoretical analyses of brain networks. However, it remains largely unknown how the duration of the fNIRS signal scanning is related to stable and reproducible functional brain network features. To answer this question, we collected resting-state fNIRS signals (10-min duration, two runs) from 18 participants and then truncated the hemodynamic time series into 30-s time bins that ranged from 1 to 10 min. Measures of nodal efficiency, nodal betweenness, network local efficiency, global efficiency, and clustering coefficient were computed for each subject at each fNIRS signal acquisition duration. Analyses of the stability and between-run reproducibility were performed to identify optimal time length for each measure. We found that the FC, nodal efficiency and nodal betweenness stabilized and were reproducible after 1 min of fNIRS signal acquisition, whereas network clustering coefficient, local and global efficiencies stabilized after 1 min and were reproducible after 5 min of fNIRS signal acquisition for only local and global efficiencies. These quantitative results provide direct evidence regarding the choice of the resting-state fNIRS scanning duration for functional brain connectivity and topological metric stability of brain network connectivity.
NASA Astrophysics Data System (ADS)
Teramae, Tatsuya; Kushida, Daisuke; Takemori, Fumiaki; Kitamura, Akira
Authors proposed the estimation method combining k-means algorithm and NN for evaluating massage. However, this estimation method has a problem that discrimination ratio is decreased to new user. There are two causes of this problem. One is that generalization of NN is bad. Another one is that clustering result by k-means algorithm has not high correlation coefficient in a class. Then, this research proposes k-means algorithm according to correlation coefficient and incremental learning for NN. The proposed k-means algorithm is method included evaluation function based on correlation coefficient. Incremental learning is method that NN is learned by new data and initialized weight based on the existing data. The effect of proposed methods are verified by estimation result using EEG data when testee is given massage.
Garcia, Danilo; MacDonald, Shane; Archer, Trevor
2015-01-01
Background. The notion of the affective system as being composed of two dimensions led Archer and colleagues to the development of the affective profiles model. The model consists of four different profiles based on combinations of individuals' experience of high/low positive and negative affect: self-fulfilling, low affective, high affective, and self-destructive. During the past 10 years, an increasing number of studies have used this person-centered model as the backdrop for the investigation of between and within individual differences in ill-being and well-being. The most common approach to this profiling is by dividing individuals' scores of self-reported affect using the median of the population as reference for high/low splits. However, scores just-above and just-below the median might become high and low by arbitrariness, not by reality. Thus, it is plausible to criticize the validity of this variable-oriented approach. Our aim was to compare the median splits approach with a person-oriented approach, namely, cluster analysis. Method. The participants (N = 2, 225) were recruited through Amazons' Mechanical Turk and asked to self-report affect using the Positive Affect Negative Affect Schedule. We compared the profiles' homogeneity and Silhouette coefficients to discern differences in homogeneity and heterogeneity between approaches. We also conducted exact cell-wise analyses matching the profiles from both approaches and matching profiles and gender to investigate profiling agreement with respect to affectivity levels and affectivity and gender. All analyses were conducted using the ROPstat software. Results. The cluster approach (weighted average of cluster homogeneity coefficients = 0.62, Silhouette coefficients = 0.68) generated profiles with greater homogeneity and more distinctive from each other compared to the median splits approach (weighted average of cluster homogeneity coefficients = 0.75, Silhouette coefficients = 0.59). Most of the participants (n = 1,736, 78.0%) were allocated to the same profile (Rand Index = .83), however, 489 (21.98%) were allocated to different profiles depending on the approach. Both approaches allocated females and males similarly in three of the four profiles. Only the cluster analysis approach classified men significantly more often than chance to a self-fulfilling profile (type) and females less often than chance to this very same profile (antitype). Conclusions. Although the question whether one approach is more appropriate than the other is still without answer, the cluster method allocated individuals to profiles that are more in accordance with the conceptual basis of the model and also to expected gender differences. More importantly, regardless of the approach, our findings suggest that the model mirrors a complex and dynamic adaptive system.
Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach
Kudisthalert, Wasu
2018-01-01
Machine learning techniques are becoming popular in virtual screening tasks. One of the powerful machine learning algorithms is Extreme Learning Machine (ELM) which has been applied to many applications and has recently been applied to virtual screening. We propose the Weighted Similarity ELM (WS-ELM) which is based on a single layer feed-forward neural network in a conjunction of 16 different similarity coefficients as activation function in the hidden layer. It is known that the performance of conventional ELM is not robust due to random weight selection in the hidden layer. Thus, we propose a Clustering-based WS-ELM (CWS-ELM) that deterministically assigns weights by utilising clustering algorithms i.e. k-means clustering and support vector clustering. The experiments were conducted on one of the most challenging datasets–Maximum Unbiased Validation Dataset–which contains 17 activity classes carefully selected from PubChem. The proposed algorithms were then compared with other machine learning techniques such as support vector machine, random forest, and similarity searching. The results show that CWS-ELM in conjunction with support vector clustering yields the best performance when utilised together with Sokal/Sneath(1) coefficient. Furthermore, ECFP_6 fingerprint presents the best results in our framework compared to the other types of fingerprints, namely ECFP_4, FCFP_4, and FCFP_6. PMID:29652912
Properties of highly clustered networks
NASA Astrophysics Data System (ADS)
Newman, M. E.
2003-08-01
We propose and solve exactly a model of a network that has both a tunable degree distribution and a tunable clustering coefficient. Among other things, our results indicate that increased clustering leads to a decrease in the size of the giant component of the network. We also study susceptible/infective/recovered type epidemic processes within the model and find that clustering decreases the size of epidemics, but also decreases the epidemic threshold, making it easier for diseases to spread. In addition, clustering causes epidemics to saturate sooner, meaning that they infect a near-maximal fraction of the network for quite low transmission rates.
NASA Astrophysics Data System (ADS)
Lai, King C.; Liu, Da-Jiang; Evans, James W.
2017-12-01
For diffusion of two-dimensional homoepitaxial clusters of N atoms on metal (100) surfaces mediated by edge atom hopping, macroscale continuum theory suggests that the diffusion coefficient scales like DN˜ N-β with β =3 /2 . However, we find quite different and diverse behavior in multiple size regimes. These include: (i) facile diffusion for small sizes N <9 ; (ii) slow nucleation-mediated diffusion with small β <1 for "perfect" sizes N = Np= L2 or L (L +1 ) , for L =3 ,4 , ... having unique ground-state shapes, for moderate sizes 9 ≤N ≤O (102) ; the same also applies for N =Np+3 , Np+ 4 , ... (iii) facile diffusion but with large β >2 for N =Np+1 and Np+2 also for moderate sizes 9 ≤N ≤O (102) ; (iv) merging of the above distinct branches and subsequent anomalous scaling with 1 ≲β <3 /2 , reflecting the quasifacetted structure of clusters, for larger N =O (102) to N =O (103) ; (v) classic scaling with β =3 /2 for very large N =O (103) and above. The specified size ranges apply for typical model parameters. We focus on the moderate size regime where we show that diffusivity cycles quasiperiodically from the slowest branch for Np+3 (not Np) to the fastest branch for Np+1 . Behavior is quantified by kinetic Monte Carlo simulation of an appropriate stochastic lattice-gas model. However, precise analysis must account for a strong enhancement of diffusivity for short time increments due to back correlation in the cluster motion. Further understanding of this enhancement, of anomalous size scaling behavior, and of the merging of various branches, is facilitated by combinatorial analysis of the number of the ground-state and low-lying excited state cluster configurations, and also of kink populations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lai, King C.; Liu, Da -Jiang; Evans, James W.
For diffusion of two-dimensional homoepitaxial clusters of N atoms on metal(100) surfaces mediated by edge atom hopping, macroscale continuum theory suggests that the diffusion coefficient scales like DN ~ N -β with β = 3/2. However, we find quite different and diverse behavior in multiple size regimes. These include: (i) facile diffusion for small sizes N < 9; (ii) slow nucleation-mediated diffusion with small β < 1 for “perfect” sizes N = N p = L 2 or L(L+1), for L = 3, 4,… having unique ground state shapes, for moderate sizes 9 ≤ N ≤ O(10 2); the samemore » also applies for N = N p +3, N p + 4,… (iii) facile diffusion but with large β > 2 for N = Np + 1 and N p + 2 also for moderate sizes 9 ≤ N ≤ O(10 2); (iv) merging of the above distinct branches and subsequent anomalous scaling with 1 ≲ β < 3/2, reflecting the quasi-facetted structure of clusters, for larger N = O(10 2) to N = O(10 3); and (v) classic scaling with β = 3/2 for very large N = O(103) and above. The specified size ranges apply for typical model parameters. We focus on the moderate size regime where show that diffusivity cycles quasi-periodically from the slowest branch for N p + 3 (not Np) to the fastest branch for Np + 1. Behavior is quantified by Kinetic Monte Carlo simulation of an appropriate stochastic lattice-gas model. However, precise analysis must account for a strong enhancement of diffusivity for short time increments due to back-correlation in the cluster motion. Further understanding of this enhancement, of anomalous size scaling behavior, and of the merging of various branches, is facilitated by combinatorial analysis of the number of the ground state and low-lying excited state cluster configurations, and also of kink populations.« less
Lai, King C.; Liu, Da -Jiang; Evans, James W.
2017-12-05
For diffusion of two-dimensional homoepitaxial clusters of N atoms on metal(100) surfaces mediated by edge atom hopping, macroscale continuum theory suggests that the diffusion coefficient scales like DN ~ N -β with β = 3/2. However, we find quite different and diverse behavior in multiple size regimes. These include: (i) facile diffusion for small sizes N < 9; (ii) slow nucleation-mediated diffusion with small β < 1 for “perfect” sizes N = N p = L 2 or L(L+1), for L = 3, 4,… having unique ground state shapes, for moderate sizes 9 ≤ N ≤ O(10 2); the samemore » also applies for N = N p +3, N p + 4,… (iii) facile diffusion but with large β > 2 for N = Np + 1 and N p + 2 also for moderate sizes 9 ≤ N ≤ O(10 2); (iv) merging of the above distinct branches and subsequent anomalous scaling with 1 ≲ β < 3/2, reflecting the quasi-facetted structure of clusters, for larger N = O(10 2) to N = O(10 3); and (v) classic scaling with β = 3/2 for very large N = O(103) and above. The specified size ranges apply for typical model parameters. We focus on the moderate size regime where show that diffusivity cycles quasi-periodically from the slowest branch for N p + 3 (not Np) to the fastest branch for Np + 1. Behavior is quantified by Kinetic Monte Carlo simulation of an appropriate stochastic lattice-gas model. However, precise analysis must account for a strong enhancement of diffusivity for short time increments due to back-correlation in the cluster motion. Further understanding of this enhancement, of anomalous size scaling behavior, and of the merging of various branches, is facilitated by combinatorial analysis of the number of the ground state and low-lying excited state cluster configurations, and also of kink populations.« less
Hudes, Mark L; McCann, Joyce C; Ames, Bruce N
2009-03-01
A simple statistical method is described to test whether data are consistent with minimum statistical variability expected in a biological experiment. The method is applied to data presented in data tables in a subset of 84 articles among more than 200 published by 3 investigators in a small medical biochemistry department at a major university in India and to 29 "control" articles selected by key word PubMed searches. Major conclusions include: 1) unusual clustering of coefficients of variation (CVs) was observed for data from the majority of articles analyzed that were published by the 3 investigators from 2000-2007; unusual clustering was not observed for data from any of their articles examined that were published between 1992 and 1999; and 2) among a group of 29 control articles retrieved by PubMed key word, title, or title/abstract searches, unusually clustered CVs were observed in 3 articles. Two of these articles were coauthored by 1 of the 3 investigators, and 1 was from the same university but a different department. We are unable to offer a statistical or biological explanation for the unusual clustering observed.
Li, Yan; Rui, Xue; Li, Shuyu; Pu, Fang
2014-11-01
Graph theoretical analysis has recently become a popular research tool in neuroscience, however, there have been very few studies on brain responses to music perception, especially when culturally different styles of music are involved. Electroencephalograms were recorded from ten subjects listening to Chinese traditional music, light music and western classical music. For event-related potentials, phase coherence was calculated in the alpha band and then constructed into correlation matrices. Clustering coefficients and characteristic path lengths were evaluated for global properties, while clustering coefficients and efficiency were assessed for local network properties. Perception of light music and western classical music manifested small-world network properties, especially with a relatively low proportion of weights of correlation matrices. For local analysis, efficiency was more discernible than clustering coefficient. Nevertheless, there was no significant discrimination between Chinese traditional and western classical music perception. Perception of different styles of music introduces different network properties, both globally and locally. Research into both global and local network properties has been carried out in other areas; however, this is a preliminary investigation aimed at suggesting a possible new approach to brain network properties in music perception. Copyright © 2014 Elsevier Ltd. All rights reserved.
Effect of sharp maximum in ion diffusivity for liquid xenon
NASA Astrophysics Data System (ADS)
Lankin, A. V.; Orekhov, M. A.
2016-11-01
Ion diffusion in a liquid usually could be treated as a movement of an ion cluster in a viscous media. For small ions this leads to a special feature: diffusion coefficient is either independent of the ion size or increases with it. We find a different behavior for small ions in liquid xenon. Calculation of the dependence of an ion diffusion coefficient in liquid xenon on the ion size is carried out. Classical molecular dynamics method is applied. Calculated dependence of the ion diffusion coefficient on its radius has sharp maximums at the ion radiuses 1.75 and 2 Å. Every maximum is placed between two regions with different stable ion cluster configurations. This leads to the instability of these configurations in a small region between them. Consequently ion with radius near 1.75 or 2 Å could jump from one configuration to another. This increases the speed of the diffusion. A simple qualitative model for this effect is suggested. The decomposition of the ion movement into continuous and jump diffusion shows that continuous part of the diffusion is the same as for the ion cluster in the stable region.
Long-term variability of global statistical properties of epileptic brain networks
NASA Astrophysics Data System (ADS)
Kuhnert, Marie-Therese; Elger, Christian E.; Lehnertz, Klaus
2010-12-01
We investigate the influence of various pathophysiologic and physiologic processes on global statistical properties of epileptic brain networks. We construct binary functional networks from long-term, multichannel electroencephalographic data recorded from 13 epilepsy patients, and the average shortest path length and the clustering coefficient serve as global statistical network characteristics. For time-resolved estimates of these characteristics we observe large fluctuations over time, however, with some periodic temporal structure. These fluctuations can—to a large extent—be attributed to daily rhythms while relevant aspects of the epileptic process contribute only marginally. Particularly, we could not observe clear cut changes in network states that can be regarded as predictive of an impending seizure. Our findings are of particular relevance for studies aiming at an improved understanding of the epileptic process with graph-theoretical approaches.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Renslow, Ryan S.; Majors, Paul D.; McLean, Jeffrey S.
2010-08-15
Diffusive mass transfer in biofilms is characterized by the effective diffusion coefficient. It is well-documented that the effective diffusion coefficient can vary by location in a biofilm. The current literature is dominated by effective diffusion coefficient measurements for distinct cell clusters and stratified biofilms showing this spatial variation. Regardless of whether distinct cell clusters or surface-averaging methods are used, position-dependent measurements of the effective diffusion coefficient are currently: 1) invasive to the biofilm, 2) performed under unnatural conditions, 3) lethal to cells, and/or 4) spatially restricted to only certain regions of the biofilm. Invasive measurements can lead to inaccurate resultsmore » and prohibit further (time dependent) measurements which are important for the mathematical modeling of biofilms. In this study our goals were to: 1) measure the effective diffusion coefficient for water in live biofilms, 2) monitor how the effective diffusion coefficient changes over time under growth conditions, and 3) correlate the effective diffusion coefficient with depth in the biofilm. We measured in situ two-dimensional effective diffusion coefficient maps within Shewanella oneidensis MR-1biofilms using pulsed-field gradient nuclear magnetic resonance methods, and used them to calculate surface-averaged relative effective diffusion coefficient (Drs) profiles. We found that 1) Drs decreased from the top of the biofilm to the bottom, 2) Drs profiles differed for biofilms of different ages, 3) Drs profiles changed over time and generally decreased with time, 4) all the biofilms showed very similar Drs profiles near the top of the biofilm, and 5) the Drs profile near the bottom of the biofilm was different for each biofilm. Practically, our results demonstrate that advanced biofilm models should use a variable effective diffusivity which changes with time and location in the biofilm.« less
Peterson, Leif E
2002-01-01
CLUSFAVOR (CLUSter and Factor Analysis with Varimax Orthogonal Rotation) 5.0 is a Windows-based computer program for hierarchical cluster and principal-component analysis of microarray-based transcriptional profiles. CLUSFAVOR 5.0 standardizes input data; sorts data according to gene-specific coefficient of variation, standard deviation, average and total expression, and Shannon entropy; performs hierarchical cluster analysis using nearest-neighbor, unweighted pair-group method using arithmetic averages (UPGMA), or furthest-neighbor joining methods, and Euclidean, correlation, or jack-knife distances; and performs principal-component analysis. PMID:12184816
The second virial coefficient of system ((nitrogen-water))
NASA Astrophysics Data System (ADS)
Podmurnaya, O. A.
2004-01-01
The virial coefficient data of various components of atmosphere are interesting because permit to evaluate a deviation from ideal gas model. These data may be useful while investigating the clusters generation and determination their contribution in absorption. The second cross virial coefficient Baw for system ((nitrogen water)) has been calculated form +9°C to +50°C using the last experimental data about water vapor mole fraction. The reliability of this coefficient has been tested by analysing of errors sources and by comparing the results with other available experimental data.
Coarsening behavior of γ' and γ″ phases in GH4169 superalloy by electric field treatment
NASA Astrophysics Data System (ADS)
Wang, Lei; Wang, Yao; Liu, Yang; Song, Xiu; Lü, Xu-dong; Zhang, Bei-jiang
2013-09-01
The coarsening behaviors of γ' and γ″ phases in GH4169 alloy aged at 1023 and 1073 K with electric field treatment (EFT) were investigated by transmission electron microscopy (TEM) and positron annihilation lifetime spectroscopy (PALS). It is demonstrated that precipitation coarsening occurs, and the growth activation energies of γ' and γ″ phases can be decreased to 115.6 and 198.1 kJ·mol-1, respectively, by applying the electric field. The formation of a large number of vacancies in the matrix is induced by EFT. Due to the occurrence of vacancy migration, the diffusion coefficients of Al and Nb atoms are increased to be 1.6-5.0 times larger than those without EFT at 1023 or 1073 K. Furthermore, the formation of vacancy clusters is promoted by EFT, and the increase in strain energy for the coarsening of γ' and γ″ phases can be counterbalanced by the formation of vacancy clusters.
NASA Astrophysics Data System (ADS)
Blaszczuk, Artur; Nowak, Wojciech
2016-10-01
In the present work, the heat transfer study focuses on assessment of the impact of bed temperature on the local heat transfer characteristic between a fluidized bed and vertical rifled tubes (38mm-O.D.) in a commercial circulating fluidized bed (CFB) boiler. Heat transfer behavior in a 1296t/h supercritical CFB furnace has been analyzed for Geldart B particle with Sauter mean diameter of 0.219 and 0.246mm. The heat transfer experiments were conducted for the active heat transfer surface in the form of membrane tube with a longitudinal fin at the tube crest under the normal operating conditions of CFB boiler. A heat transfer analysis of CFB boiler with detailed consideration of the bed-to-wall heat transfer coefficient and the contribution of heat transfer mechanisms inside furnace chamber were investigated using mechanistic heat transfer model based on cluster renewal approach. The predicted values of heat transfer coefficient are compared with empirical correlation for CFB units in large-scale.
Analysis of the correlation dimension for inertial particles
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gustavsson, Kristian; Department of Physics, Göteborg University, 41296 Gothenburg; Mehlig, Bernhard
2015-07-15
We obtain an implicit equation for the correlation dimension which describes clustering of inertial particles in a complex flow onto a fractal measure. Our general equation involves a propagator of a nonlinear stochastic process in which the velocity gradient of the fluid appears as additive noise. When the long-time limit of the propagator is considered our equation reduces to an existing large-deviation formalism from which it is difficult to extract concrete results. In the short-time limit, however, our equation reduces to a solvability condition on a partial differential equation. In the case where the inertial particles are much denser thanmore » the fluid, we show how this approach leads to a perturbative expansion of the correlation dimension, for which the coefficients can be obtained exactly and in principle to any order. We derive the perturbation series for the correlation dimension of inertial particles suspended in three-dimensional spatially smooth random flows with white-noise time correlations, obtaining the first 33 non-zero coefficients exactly.« less
On polynomial preconditioning for indefinite Hermitian matrices
NASA Technical Reports Server (NTRS)
Freund, Roland W.
1989-01-01
The minimal residual method is studied combined with polynomial preconditioning for solving large linear systems (Ax = b) with indefinite Hermitian coefficient matrices (A). The standard approach for choosing the polynomial preconditioners leads to preconditioned systems which are positive definite. Here, a different strategy is studied which leaves the preconditioned coefficient matrix indefinite. More precisely, the polynomial preconditioner is designed to cluster the positive, resp. negative eigenvalues of A around 1, resp. around some negative constant. In particular, it is shown that such indefinite polynomial preconditioners can be obtained as the optimal solutions of a certain two parameter family of Chebyshev approximation problems. Some basic results are established for these approximation problems and a Remez type algorithm is sketched for their numerical solution. The problem of selecting the parameters such that the resulting indefinite polynomial preconditioners speeds up the convergence of minimal residual method optimally is also addressed. An approach is proposed based on the concept of asymptotic convergence factors. Finally, some numerical examples of indefinite polynomial preconditioners are given.
Using earthquake clusters to identify fracture zones at Puna geothermal field, Hawaii
NASA Astrophysics Data System (ADS)
Lucas, A.; Shalev, E.; Malin, P.; Kenedi, C. L.
2010-12-01
The actively producing Puna geothermal system (PGS) is located on the Kilauea East Rift Zone (ERZ), which extends out from the active Kilauea volcano on Hawaii. In the Puna area the rift trend is identified as NE-SW from surface expressions of normal faulting with a corresponding strike; at PGS the surface expression offsets in a left step, but no rift perpendicular faulting is observed. An eight station borehole seismic network has been installed in the area of the geothermal system. Since June 2006, a total of 6162 earthquakes have been located close to or inside the geothermal system. The spread of earthquake locations follows the rift trend, but down rift to the NE of PGS almost no earthquakes are observed. Most earthquakes located within the PGS range between 2-3 km depth. Up rift to the SW of PGS the number of events decreases and the depth range increases to 3-4 km. All initial locations used Hypoinverse71 and showed no trends other than the dominant rift parallel. Double difference relocation of all earthquakes, using both catalog and cross-correlation, identified one large cluster but could not conclusively identify trends within the cluster. A large number of earthquake waveforms showed identifiable shear wave splitting. For five stations out of the six where shear wave splitting was observed, the dominant polarization direction was rift parallel. Two of the five stations also showed a smaller rift perpendicular signal. The sixth station (located close to the area of the rift offset) displayed a N-S polarization, approximately halfway between rift parallel and perpendicular. The shear wave splitting time delays indicate that fracture density is higher at the PGS compared to the surrounding ERZ. Correlation co-efficient clustering with independent P and S wave windows was used to identify clusters based on similar earthquake waveforms. In total, 40 localized clusters containing ten or more events were identified. The largest cluster was located in the production area for the power plant. Most of the clusters had linear features when their Hypoinverse locations were plotted. The concentration of individual linear features was higher in the PGS than the surrounding ERZ. The resolution of the features was resolved further by relocating each individual cluster through the catalog double difference method. Mapping of the linear features showed that a number of the larger features ran rift parallel. However a large number of rift perpendicular features were also identified. In the area where the anomalous (N-S) shear wave polarization was observed, a number of linear features with a similar orientation were identified. We assume that events occurring on the same fracture zone have similar source mechanisms and thus similar waveforms. It is concluded that the linear features identified by earthquake clustering are fracture zones. The orientation and concentration of the fracture zones is consistent with that of the shear wave splitting polarizations.
Sample Size Estimation in Cluster Randomized Educational Trials: An Empirical Bayes Approach
ERIC Educational Resources Information Center
Rotondi, Michael A.; Donner, Allan
2009-01-01
The educational field has now accumulated an extensive literature reporting on values of the intraclass correlation coefficient, a parameter essential to determining the required size of a planned cluster randomized trial. We propose here a simple simulation-based approach including all relevant information that can facilitate this task. An…
ERIC Educational Resources Information Center
Raykov, Tenko
2011-01-01
Interval estimation of intraclass correlation coefficients in hierarchical designs is discussed within a latent variable modeling framework. A method accomplishing this aim is outlined, which is applicable in two-level studies where participants (or generally lower-order units) are clustered within higher-order units. The procedure can also be…
Thermodynamic Behavior of Nano-sized Gold Clusters on the (001) Surface
NASA Technical Reports Server (NTRS)
Paik, Sun M.; Yoo, Sung M.; Namkung, Min; Wincheski, Russell A.; Bushnell, Dennis M. (Technical Monitor)
2001-01-01
We have studied thermal expansion of the surface layers of the hexagonally reconstructed Au (001) surface using a classical Molecular Dynamics (MD) simulation technique with an Embedded Atomic Method (EAM) type many-body potential. We find that the top-most hexagonal layer contracts as temperature increases, whereas the second layer expands or contracts depending on the system size. The magnitude of expansion coefficient of the top layer is much larger than that of the other layers. The calculated thermal expansion coefficients of the top-most layer are about -4.93 x 10(exp -5)Angstroms/Kelvin for the (262 x 227)Angstrom cluster and -3.05 x 10(exp -5)Angstroms/Kelvin for (101 x 87)Angstrom cluster. The Fast Fourier Transform (FFT) image of the atomic density shows that there exists a rotated domain of the top-most hexagonal cluster with rotation angle close to 1 degree at temperature T less than 1000Kelvin. As the temperature increases this domain undergoes a surface orientational phase transition. These predictions are in good agreement with previous phenomenological theories and experimental studies.
Luoto, Riitta; Kinnunen, Tarja I.; Aittasalo, Minna; Kolu, Päivi; Raitanen, Jani; Ojala, Katriina; Mansikkamäki, Kirsi; Lamberg, Satu; Vasankari, Tommi; Komulainen, Tanja; Tulokas, Sirkku
2011-01-01
Background Our objective was to examine whether gestational diabetes mellitus (GDM) or newborns' high birthweight can be prevented by lifestyle counseling in pregnant women at high risk of GDM. Method and Findings We conducted a cluster-randomized trial, the NELLI study, in 14 municipalities in Finland, where 2,271 women were screened by oral glucose tolerance test (OGTT) at 8–12 wk gestation. Euglycemic (n = 399) women with at least one GDM risk factor (body mass index [BMI] ≥25 kg/m2, glucose intolerance or newborn's macrosomia (≥4,500 g) in any earlier pregnancy, family history of diabetes, age ≥40 y) were included. The intervention included individual intensified counseling on physical activity and diet and weight gain at five antenatal visits. Primary outcomes were incidence of GDM as assessed by OGTT (maternal outcome) and newborns' birthweight adjusted for gestational age (neonatal outcome). Secondary outcomes were maternal weight gain and the need for insulin treatment during pregnancy. Adherence to the intervention was evaluated on the basis of changes in physical activity (weekly metabolic equivalent task (MET) minutes) and diet (intake of total fat, saturated and polyunsaturated fatty acids, saccharose, and fiber). Multilevel analyses took into account cluster, maternity clinic, and nurse level influences in addition to age, education, parity, and prepregnancy BMI. 15.8% (34/216) of women in the intervention group and 12.4% (22/179) in the usual care group developed GDM (absolute effect size 1.36, 95% confidence interval [CI] 0.71–2.62, p = 0.36). Neonatal birthweight was lower in the intervention than in the usual care group (absolute effect size −133 g, 95% CI −231 to −35, p = 0.008) as was proportion of large-for-gestational-age (LGA) newborns (26/216, 12.1% versus 34/179, 19.7%, p = 0.042). Women in the intervention group increased their intake of dietary fiber (adjusted coefficient 1.83, 95% CI 0.30–3.25, p = 0.023) and polyunsaturated fatty acids (adjusted coefficient 0.37, 95% CI 0.16–0.57, p<0.001), decreased their intake of saturated fatty acids (adjusted coefficient −0.63, 95% CI −1.12 to −0.15, p = 0.01) and intake of saccharose (adjusted coefficient −0.83, 95% CI −1.55 to −0.11, p = 0.023), and had a tendency to a smaller decrease in MET minutes/week for at least moderate intensity activity (adjusted coefficient 91, 95% CI −37 to 219, p = 0.17) than women in the usual care group. In subgroup analysis, adherent women in the intervention group (n = 55/229) had decreased risk of GDM (27.3% versus 33.0%, p = 0.43) and LGA newborns (7.3% versus 19.5%, p = 0.03) compared to women in the usual care group. Conclusions The intervention was effective in controlling birthweight of the newborns, but failed to have an effect on maternal GDM. Trial registration Current Controlled Trials ISRCTN33885819 Please see later in the article for the Editors' Summary PMID:21610860
NASA Astrophysics Data System (ADS)
Huby, Nolwenn; Bigeon, John; Lagneaux, Quentin; Amela-Cortes, Maria; Garreau, Alexandre; Molard, Yann; Fade, Julien; Desert, Anthony; Faulques, Eric; Bêche, Bruno; Duvail, Jean-Luc; Cordier, Stéphane
2016-02-01
Integration of stable emissive entities into organic waveguide with minimum scattering is essential to design efficient optically active devices. Here we present a new class of doped nanocomposite waveguides exploiting 1-nm diameter metallic cluster-based building blocks as red-NIR luminescent dyes embedded in a SU8 polymeric matrix, a reference photoresist for organic photonics. These building blocks are [Mo6Ii8(OOCC2F5)a6]2- cluster anionic units with unique chemical and physical features well suited for optical nanocomposites such as a ligand-promoted dispersibility, a large Stokes shift with a broad absorption window and an emission window in the range 600-900 nm. A whole investigation of the nanocomposite has been first performed. Optical characterizations of Cs2[Mo6Ii8(OOCCnF2n+1)a6]@SU8 nanocomposites thin film and waveguiding structures show their relevance as active layers in integrated structures with a significant increase of the refractive index of 3 × 10-2 when the cluster concentration increases up to 4 wt%, while keeping high values for the transmitted power, as shown for different waveguide dimensions and clusters concentrations. The efficiency of photoluminescence propagation is investigated as a function of clusters concentration in the excitation area for several waveguides dimensions. Attenuation coefficient ranges between 5 and 18 dB/cm, values of the same order of magnitude as those obtained in polymeric waveguide doped with QDs or organic dyes. This original, stable and efficient nanocomposite is promising for downscaling complex nanosources and active waveguides in the visible and NIR range.
Effects of amyloid and small vessel disease on white matter network disruption.
Kim, Hee Jin; Im, Kiho; Kwon, Hunki; Lee, Jong Min; Ye, Byoung Seok; Kim, Yeo Jin; Cho, Hanna; Choe, Yearn Seong; Lee, Kyung Han; Kim, Sung Tae; Kim, Jae Seung; Lee, Jae Hong; Na, Duk L; Seo, Sang Won
2015-01-01
There is growing evidence that the human brain is a large scale complex network. The structural network is reported to be disrupted in cognitively impaired patients. However, there have been few studies evaluating the effects of amyloid and small vessel disease (SVD) markers, the common causes of cognitive impairment, on structural networks. Thus, we evaluated the association between amyloid and SVD burdens and structural networks using diffusion tensor imaging (DTI). Furthermore, we determined if network parameters predict cognitive impairments. Graph theoretical analysis was applied to DTI data from 232 cognitively impaired patients with varying degrees of amyloid and SVD burdens. All patients underwent Pittsburgh compound-B (PiB) PET to detect amyloid burden, MRI to detect markers of SVD, including the volume of white matter hyperintensities and the number of lacunes, and detailed neuropsychological testing. The whole-brain network was assessed by network parameters of integration (shortest path length, global efficiency) and segregation (clustering coefficient, transitivity, modularity). PiB retention ratio was not associated with any white matter network parameters. Greater white matter hyperintensity volumes or lacunae numbers were significantly associated with decreased network integration (increased shortest path length, decreased global efficiency) and increased network segregation (increased clustering coefficient, increased transitivity, increased modularity). Decreased network integration or increased network segregation were associated with poor performances in attention, language, visuospatial, memory, and frontal-executive functions. Our results suggest that SVD alters white matter network integration and segregation, which further predicts cognitive dysfunction.
Relationships among and variation within rare breeds of swine.
Roberts, K S; Lamberson, W R
2015-08-01
Extinction of rare breeds of livestock threatens to reduce the total genetic variation available for selection in the face of the changing environment and new diseases. Swine breeds facing extinction typically share characteristics such as small size, slow growth rate, and high fat percentage, which limit them from contributing to commercial production. Compounding the risk of loss of variation is the lack of pedigree information for many rare breeds due to inadequate herd books, which increases the chance that producers are breeding closely related individuals. By making genetic data available, producers can make more educated breeding decisions to preserve genetic diversity in future generations, and conservation organizations can prioritize investments in breed preservation. The objective of this study was to characterize genetic variation within and among breeds of swine and prioritize heritage breeds for preservation. Genotypes from the Illumina PorcineSNP60 BeadChip (GeneSeek, Lincoln, NE) were obtained for Guinea, Ossabaw Island, Red Wattle, American Saddleback, Mulefoot, British Saddleback, Duroc, Landrace, Large White, Pietrain, and Tamworth pigs. A whole-genome analysis toolset was used to construct a genomic relationship matrix and to calculate inbreeding coefficients for the animals within each breed. Relatedness and average inbreeding coefficient differed among breeds, and pigs from rare breeds were generally more closely related and more inbred ( < 0.05). A multidimensional scaling diagram was constructed based on the SNP genotypes. Animals within breeds clustered tightly together except for 2 Guinea pigs. Tamworth, Duroc, and Mulefoot tended to not cluster with the other 7 breeds.
NASA Technical Reports Server (NTRS)
Whitaker, M.; Biondi, M. A.; Johnsen, R.
1981-01-01
The dependence on electron temperature of the coefficients for electron recombination with molecular cluster ions of the carbon monoxide series, CO(+).(CO)n, is determined. A microwave discharge lasting approximately 0.1 msec was applied in 5-20 Torr neon containing a few tenths percent CO in an afterglow mass spectrometer apparatus, and the time histories of the various afterglow ions were measured. Expressions for the dependence of the recombination coefficients of the dimer and trimer ions CO(+).CO and CO(+).(CO)2 are obtained which are found to be significantly different from those previously obtained for hydronium and ammonium series polar cluster ions, but similar to those of simple diatomic ions.
McNicol, L A; De, S P; Kaper, J B; West, P A; Colwell, R R
1983-01-01
A total of 165 strains of vibrios isolated from clinical and environmental sources in the United States, India, and Bangladesh, 11 reference cultures, and 4 duplicated cultures were compared in a numerical taxonomic study using 83 unit characters. Similarity between strains was computed by using the simple matching coefficient and the Jaccard coefficient. Strains were clustered by unweighted average linkage and single linkage algorithms. All methods gave similar cluster compositions. The estimated probability of error in the study was obtained from a comparison of the results of duplicated strains and was within acceptable limits. A total of 174 of the 180 organisms studied were divided into eight major clusters. Two clusters were identified as Vibrio cholerae, one as Vibrio mimicus, one as Vibrio parahaemolyticus, three as Vibrio species, and one as Aeromonas hydrophila. The V. mimicus cluster could be further divided into two subclusters, and the major V. cholerae group could be split into seven minor subclusters. Phenotypic traits routinely used to identify clinical isolates of V. cholerae can be used to identify environmental V. cholerae isolates. No distinction was found between strains of V. cholerae isolated from regions endemic for cholera and strains from nonendemic regions. PMID:6874901
NASA Astrophysics Data System (ADS)
Błaszczuk, Artur; Krzywański, Jarosław
2017-03-01
The interrelation between fuzzy logic and cluster renewal approaches for heat transfer modeling in a circulating fluidized bed (CFB) has been established based on a local furnace data. The furnace data have been measured in a 1296 t/h CFB boiler with low level of flue gas recirculation. In the present study, the bed temperature and suspension density were treated as experimental variables along the furnace height. The measured bed temperature and suspension density were varied in the range of 1131-1156 K and 1.93-6.32 kg/m3, respectively. Using the heat transfer coefficient for commercial CFB combustor, two empirical heat transfer correlation were developed in terms of important operating parameters including bed temperature and also suspension density. The fuzzy logic results were found to be in good agreement with the corresponding experimental heat transfer data obtained based on cluster renewal approach. The predicted bed-to-wall heat transfer coefficient covered a range of 109-241 W/(m2K) and 111-240 W/(m2K), for fuzzy logic and cluster renewal approach respectively. The divergence in calculated heat flux recovery along the furnace height between fuzzy logic and cluster renewal approach did not exceeded ±2%.
Constructing a Watts-Strogatz network from a small-world network with symmetric degree distribution.
Menezes, Mozart B C; Kim, Seokjin; Huang, Rongbing
2017-01-01
Though the small-world phenomenon is widespread in many real networks, it is still challenging to replicate a large network at the full scale for further study on its structure and dynamics when sufficient data are not readily available. We propose a method to construct a Watts-Strogatz network using a sample from a small-world network with symmetric degree distribution. Our method yields an estimated degree distribution which fits closely with that of a Watts-Strogatz network and leads into accurate estimates of network metrics such as clustering coefficient and degree of separation. We observe that the accuracy of our method increases as network size increases.
NASA Astrophysics Data System (ADS)
Cerbino, Roberto; Piotti, Davide; Buscaglia, Marco; Giavazzi, Fabio
2018-01-01
Micro- and nanoscale objects with anisotropic shape are key components of a variety of biological systems and inert complex materials, and represent fundamental building blocks of novel self-assembly strategies. The time scale of their thermal motion is set by their translational and rotational diffusion coefficients, whose measurement may become difficult for relatively large particles with small optical contrast. Here we show that dark field differential dynamic microscopy is the ideal tool for probing the roto-translational Brownian motion of anisotropic shaped particles. We demonstrate our approach by successful application to aqueous dispersions of non-motile bacteria and of colloidal aggregates of spherical particles.
Dealing with Dependence (Part I): Understanding the Effects of Clustered Data
ERIC Educational Resources Information Center
McCoach, D. Betsy; Adelson, Jill L.
2010-01-01
This article provides a conceptual introduction to the issues surrounding the analysis of clustered (nested) data. We define the intraclass correlation coefficient (ICC) and the design effect, and we explain their effect on the standard error. When the ICC is greater than 0, then the design effect is greater than 1. In such a scenario, the…
Teleportation of Three-Qubit State via Six-qubit Cluster State
NASA Astrophysics Data System (ADS)
Yu, Li-zhi; Sun, Shao-xin
2015-05-01
A scheme of probabilistic teleportation was proposed. In this scheme, we took a six-qubit nonmaximally cluster state as the quantum channel to teleport an unknown three-qubit entangled state. Based on Bob's three times Bell state measurement (BSM) results, the receiver Bob can by introducing an auxiliary particle and the appropriate transformation to reconstruct the initial state with a certain probability. We found that, the successful transmission probability depend on the absolute value of coefficients of two of six particle cluster state minimum.
Failure tolerance of spike phase synchronization in coupled neural networks
NASA Astrophysics Data System (ADS)
Jalili, Mahdi
2011-09-01
Neuronal synchronization plays an important role in the various functionality of nervous system such as binding, cognition, information processing, and computation. In this paper, we investigated how random and intentional failures in the nodes of a network influence its phase synchronization properties. We considered both artificially constructed networks using models such as preferential attachment, Watts-Strogatz, and Erdős-Rényi as well as a number of real neuronal networks. The failure strategy was either random or intentional based on properties of the nodes such as degree, clustering coefficient, betweenness centrality, and vulnerability. Hindmarsh-Rose model was considered as the mathematical model for the individual neurons, and the phase synchronization of the spike trains was monitored as a function of the percentage/number of removed nodes. The numerical simulations were supplemented by considering coupled non-identical Kuramoto oscillators. Failures based on the clustering coefficient, i.e., removing the nodes with high values of the clustering coefficient, had the least effect on the spike synchrony in all of the networks. This was followed by errors where the nodes were removed randomly. However, the behavior of the other three attack strategies was not uniform across the networks, and different strategies were the most influential in different network structure.
A Systematic Study of Kelvin-Helmholtz Instability in Galaxy Clusters
NASA Astrophysics Data System (ADS)
Su, Yuanyuan
2017-09-01
Kelvin-Helmholtz instabilities (KHI) were observed at cold fronts in a handful of clusters. KHI are predicted at all cold fronts in hydro simulation of intracluster medium (ICM). Their presence and absence provides a unique probe of transport processes in the hot plasma, which are essential to the dissipation and redistribution of the energy in the ICM. We propose the first systematic study of the prevalence of KHI in galaxy clusters by analyzing the archived Chandra observations of a sample of 50 nearby galaxy clusters. We will associate the occurrence and properties of KHI rolls with various cluster parameters such as their gas temperature and density, and put constraints on effective transport coefficients in the ICM
An Electroencephalography Network and Connectivity Analysis for Deception in Instructed Lying Tasks
Wang, Yue; Ng, Wu Chun; Ng, Khoon Siong; Yu, Ke; Wu, Tiecheng; Li, Xiaoping
2015-01-01
Deception is an impactful social event that has been the focus of an abundance of researches over recent decades. In this paper, an electroencephalography (EEG) study is presented regarding the cognitive processes of an instructed liar/truth-teller during the time window of stimulus (question) delivery period (SDP) prior to their deceptive/truthful responses towards questions related to authentic (WE: with prior experience) and fictional experience (NE: no prior experience). To investigate deception in non-experienced events, the subjects were given stimuli in a mock interview scenario that induced them to fabricate lies. To analyze the data, frequency domain network and connectivity analysis was performed in the source space in order to provide a more systematic level understanding of deception during SDP. This study reveals several groups of neuronal generators underlying both the instructed lying (IL) and the instructed truth-telling (IT) conditions for both tasks during the SDP. Despite the similarities existed in these group components, significant differences were found in the intra- and inter-group connectivity between the IL and IT conditions in either task. Additionally, the response time was found to be positively correlated with the clustering coefficient of the inferior frontal gyrus (44R) in the WE-IL condition and positively correlated with the clustering coefficient of the precuneus (7L) and the angular gyrus (39R) in the WE-IT condition. However, the response time was found to be marginally negatively correlated with the clustering coefficient of the secondary auditory cortex (42L) in the NE-IL condition and negatively correlated with the clustering coefficient of the somatosensory association cortex (5L, R) in the NE-IT condition. Therefore, these results provide complementary and intuitive evidence for the differences between the IL and IT conditions in SDP for two types of deception tasks, thus elucidating the electrophysiological mechanisms underlying SDP of deception from regional, inter-regional, network, and inter-network scale analyses. PMID:25679784
Auplish, Aashima; Clarke, Alison S; Van Zanten, Trent; Abel, Kate; Tham, Charmaine; Bhutia, Thinlay N; Wilks, Colin R; Stevenson, Mark A; Firestone, Simon M
2017-05-01
Educational initiatives targeting at-risk populations have long been recognized as a mainstay of ongoing rabies control efforts. Cluster-based studies are often utilized to assess levels of knowledge, attitudes and practices of a population in response to education campaigns. The design of cluster-based studies requires estimates of intra-cluster correlation coefficients obtained from previous studies. This study estimates the school-level intra-cluster correlation coefficient (ICC) for rabies knowledge change following an educational intervention program. A cross-sectional survey was conducted with 226 students from 7 schools in Sikkim, India, using cluster sampling. In order to assess knowledge uptake, rabies education sessions with pre- and post-session questionnaires were administered. Paired differences of proportions were estimated for questions answered correctly. A mixed effects logistic regression model was developed to estimate school-level and student-level ICCs and to test for associations between gender, age, school location and educational level. The school- and student-level ICCs for rabies knowledge and awareness were 0.04 (95% CI: 0.01, 0.19) and 0.05 (95% CI: 0.2, 0.09), respectively. These ICCs suggest design effect multipliers of 5.45 schools and 1.05 students per school, will be required when estimating sample sizes and designing future cluster randomized trials. There was a good baseline level of rabies knowledge (mean pre-session score 71%), however, key knowledge gaps were identified in understanding appropriate behavior around scared dogs, potential sources of rabies and how to correctly order post rabies exposure precaution steps. After adjusting for the effect of gender, age, school location and education level, school and individual post-session test scores improved by 19%, with similar performance amongst boys and girls attending schools in urban and rural regions. The proportion of participants that were able to correctly order post-exposure precautionary steps following educational intervention increased by 87%. The ICC estimates presented in this study will aid in designing cluster-based studies evaluating educational interventions as part of disease control programs. This study demonstrates the likely benefits of educational intervention incorporating bite prevention and rabies education. Copyright © 2017 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Song, Bo; Waldrop, Jonathan M.; Wang, Xiaopo; Patkowski, Konrad
2018-01-01
We have developed a new krypton-krypton interaction-induced isotropic dipole polarizability curve based on high-level ab initio methods. The determination was carried out using the coupled-cluster singles and doubles plus perturbative triples method with very large basis sets up to augmented correlation-consistent sextuple zeta as well as the corrections for core-core and core-valence correlation and relativistic effects. The analytical function of polarizability and our recently constructed reference interatomic potential [J. M. Waldrop et al., J. Chem. Phys. 142, 204307 (2015)] were used to predict the thermophysical and electromagnetic properties of krypton gas. The second pressure, acoustic, and dielectric virial coefficients were computed for the temperature range of 116 K-5000 K using classical statistical mechanics supplemented with high-order quantum corrections. The virial coefficients calculated were compared with the generally less precise available experimental data as well as with values computed from other potentials in the literature {in particular, the recent highly accurate potential of Jäger et al. [J. Chem. Phys. 144, 114304 (2016)]}. The detailed examination in this work suggests that the present theoretical prediction can be applied as reference values in disciplines involving thermophysical and electromagnetic properties of krypton gas.
Diffusion and mobility of atomic particles in a liquid
NASA Astrophysics Data System (ADS)
Smirnov, B. M.; Son, E. E.; Tereshonok, D. V.
2017-11-01
The diffusion coefficient of a test atom or molecule in a liquid is determined for the mechanism where the displacement of the test molecule results from the vibrations and motion of liquid molecules surrounding the test molecule and of the test particle itself. This leads to a random change in the coordinate of the test molecule, which eventually results in the diffusion motion of the test particle in space. Two models parameters of interaction of a particle and a liquid are used to find the activation energy of the diffusion process under consideration: the gas-kinetic cross section for scattering of test molecules in the parent gas and the Wigner-Seitz radius for test molecules. In the context of this approach, we have calculated the diffusion coefficient of atoms and molecules in water, where based on experimental data, we have constructed the dependence of the activation energy for the diffusion of test molecules in water on the interaction parameter and the temperature dependence for diffusion coefficient of atoms or molecules in water within the models considered. The statistically averaged difference of the activation energies for the diffusion coefficients of different test molecules in water that we have calculated based on each of the presented models does not exceed 10% of the diffusion coefficient itself. We have considered the diffusion of clusters in water and present the dependence of the diffusion coefficient on the cluster size. The accuracy of the presented formulas for the diffusion coefficient of atomic particles in water is estimated to be 50%.
Online writer identification using alphabetic information clustering
NASA Astrophysics Data System (ADS)
Tan, Guo Xian; Viard-Gaudin, Christian; Kot, Alex C.
2009-01-01
Writer identification is a topic of much renewed interest today because of its importance in applications such as writer adaptation, routing of documents and forensic document analysis. Various algorithms have been proposed to handle such tasks. Of particular interests are the approaches that use allographic features [1-3] to perform a comparison of the documents in question. The allographic features are used to define prototypes that model the unique handwriting styles of the individual writers. This paper investigates a novel perspective that takes alphabetic information into consideration when the allographic features are clustered into prototypes at the character level. We hypothesize that alphabetic information provides additional clues which help in the clustering of allographic prototypes. An alphabet information coefficient (AIC) has been introduced in our study and the effect of this coefficient is presented. Our experiments showed an increase of writer identification accuracy from 66.0% to 87.0% when alphabetic information was used in conjunction with allographic features on a database of 200 reference writers.
Divisibility patterns of natural numbers on a complex network.
Shekatkar, Snehal M; Bhagwat, Chandrasheel; Ambika, G
2015-09-16
Investigation of divisibility properties of natural numbers is one of the most important themes in the theory of numbers. Various tools have been developed over the centuries to discover and study the various patterns in the sequence of natural numbers in the context of divisibility. In the present paper, we study the divisibility of natural numbers using the framework of a growing complex network. In particular, using tools from the field of statistical inference, we show that the network is scale-free but has a non-stationary degree distribution. Along with this, we report a new kind of similarity pattern for the local clustering, which we call "stretching similarity", in this network. We also show that the various characteristics like average degree, global clustering coefficient and assortativity coefficient of the network vary smoothly with the size of the network. Using analytical arguments we estimate the asymptotic behavior of global clustering and average degree which is validated using numerical analysis.
Alam, Md Ferdous; Haque, Asadul
2017-10-18
An accurate determination of particle-level fabric of granular soils from tomography data requires a maximum correct separation of particles. The popular marker-controlled watershed separation method is widely used to separate particles. However, the watershed method alone is not capable of producing the maximum separation of particles when subjected to boundary stresses leading to crushing of particles. In this paper, a new separation method, named as Monash Particle Separation Method (MPSM), has been introduced. The new method automatically determines the optimal contrast coefficient based on cluster evaluation framework to produce the maximum accurate separation outcomes. Finally, the particles which could not be separated by the optimal contrast coefficient were separated by integrating cuboid markers generated from the clustering by Gaussian mixture models into the routine watershed method. The MPSM was validated on a uniformly graded sand volume subjected to one-dimensional compression loading up to 32 MPa. It was demonstrated that the MPSM is capable of producing the best possible separation of particles required for the fabric analysis.
NASA Astrophysics Data System (ADS)
Li, Mengtian; Zhang, Ruisheng; Hu, Rongjing; Yang, Fan; Yao, Yabing; Yuan, Yongna
2018-03-01
Identifying influential spreaders is a crucial problem that can help authorities to control the spreading process in complex networks. Based on the classical degree centrality (DC), several improved measures have been presented. However, these measures cannot rank spreaders accurately. In this paper, we first calculate the sum of the degrees of the nearest neighbors of a given node, and based on the calculated sum, a novel centrality named clustered local-degree (CLD) is proposed, which combines the sum and the clustering coefficients of nodes to rank spreaders. By assuming that the spreading process in networks follows the susceptible-infectious-recovered (SIR) model, we perform extensive simulations on a series of real networks to compare the performances between the CLD centrality and other six measures. The results show that the CLD centrality has a competitive performance in distinguishing the spreading ability of nodes, and exposes the best performance to identify influential spreaders accurately.
ERIC Educational Resources Information Center
Gray, Heewon Lee; Burgermaster, Marissa; Tipton, Elizabeth; Contento, Isobel R.; Koch, Pamela A.; Di Noia, Jennifer
2016-01-01
Objective: Sample size and statistical power calculation should consider clustering effects when schools are the unit of randomization in intervention studies. The objective of the current study was to investigate how student outcomes are clustered within schools in an obesity prevention trial. Method: Baseline data from the Food, Health &…
A liquid-He cryostat for structural and thermal disorder studies by X-ray absorption.
Bouamrane, F; Ribbens, M; Fonda, E; Adjouri, C; Traverse, A
2003-07-01
A new device operating from 4.2 to 300 K is now installed on the hard X-ray station of the DCI ring in LURE in order to measure absorption coefficients. This liquid-He bath device has three optical windows. One allows the incident beam to impinge on the sample, one located at 180 degrees with respect to the sample allows transmitted beams to be detected, and another located at 90 degrees is used to detect emitted photons. Total electron yield detection mode is also possible thanks to a specific sample holder equipped with an electrode that collects the charges created by the emitted electrons in the He gas brought from the He bath around the sample. The performance of the cryostat is described by measurements of the absorption coefficients versus the temperature for Cu and Co foils. For comparison, the absorption coefficient is also measured for Cu clusters. As expected from dimension effects, the Debye temperature obtained for the clusters is lower than that of bulk Cu.
Modelling clustering of vertically aligned carbon nanotube arrays.
Schaber, Clemens F; Filippov, Alexander E; Heinlein, Thorsten; Schneider, Jörg J; Gorb, Stanislav N
2015-08-06
Previous research demonstrated that arrays of vertically aligned carbon nanotubes (VACNTs) exhibit strong frictional properties. Experiments indicated a strong decrease of the friction coefficient from the first to the second sliding cycle in repetitive measurements on the same VACNT spot, but stable values in consecutive cycles. VACNTs form clusters under shear applied during friction tests, and self-organization stabilizes the mechanical properties of the arrays. With increasing load in the range between 300 µN and 4 mN applied normally to the array surface during friction tests the size of the clusters increases, while the coefficient of friction decreases. To better understand the experimentally obtained results, we formulated and numerically studied a minimalistic model, which reproduces the main features of the system with a minimum of adjustable parameters. We calculate the van der Waals forces between the spherical friction probe and bunches of the arrays using the well-known Morse potential function to predict the number of clusters, their size, instantaneous and mean friction forces and the behaviour of the VACNTs during consecutive sliding cycles and at different normal loads. The data obtained by the model calculations coincide very well with the experimental data and can help in adapting VACNT arrays for biomimetic applications.
Unlearning of Mixed States in the Hopfield Model —Extensive Loading Case—
NASA Astrophysics Data System (ADS)
Hayashi, Kao; Hashimoto, Chinami; Kimoto, Tomoyuki; Uezu, Tatsuya
2018-05-01
We study the unlearning of mixed states in the Hopfield model for the extensive loading case. Firstly, we focus on case I, where several embedded patterns are correlated with each other, whereas the rest are uncorrelated. Secondly, we study case II, where patterns are divided into clusters in such a way that patterns in any cluster are correlated but those in two different clusters are not correlated. By using the replica method, we derive the saddle point equations for order parameters under the ansatz of replica symmetry. The same equations are also derived by self-consistent signal-to-noise analysis in case I. In both cases I and II, we find that when the correlation between patterns is large, the network loses its ability to retrieve the embedded patterns and, depending on the parameters, a confused memory, which is a mixed state and/or spin glass state, emerges. By unlearning the mixed state, the network acquires the ability to retrieve the embedded patterns again in some parameter regions. We find that to delete the mixed state and to retrieve the embedded patterns, the coefficient of unlearning should be chosen appropriately. We perform Markov chain Monte Carlo simulations and find that the simulation and theoretical results agree reasonably well, except for the spin glass solution in a parameter region due to the replica symmetry breaking. Furthermore, we find that the existence of many correlated clusters reduces the stabilities of both embedded patterns and mixed states.
The Nature and Origin of UCDs in the Coma Cluster
NASA Astrophysics Data System (ADS)
Chiboucas, Kristin; Tully, R. Brent; Madrid, Juan; Phillipps, Steven; Carter, David; Peng, Eric
2018-01-01
UCDs are super massive star clusters found largely in dense regions but have also been found around individual galaxies and in smaller groups. Their origin is still under debate but currently favored scenarios include formation as giant star clusters, either as the brightest globular clusters or through mergers of super star clusters, themselves formed during major galaxy mergers, or as remnant nuclei from tidal stripping of nucleated dwarf ellipticals. Establishing the nature of these enigmatic objects has important implications for our understanding of star formation, star cluster formation, the missing satellite problem, and galaxy evolution. We are attempting to disentangle these competing formation scenarios with a large survey of UCDs in the Coma cluster. Using ACS two-passband imaging from the HST/ACS Coma Cluster Treasury Survey, we are using colors and sizes to identify the UCD cluster members. With a large size limited sample of the UCD population within the core region of the Coma cluster, we are investigating the population size, properties, and spatial distribution, and comparing that with the Coma globular cluster and nuclear star cluster populations to discriminate between the threshing and globular cluster scenarios. In previous work, we had found a possible correlation of UCD colors with host galaxy and a possible excess of UCDs around a non-central giant galaxy with an unusually large globular cluster population, both suggestive of a globular cluster origin. With a larger sample size and additional imaging fields that encompass the regions around these giant galaxies, we have found that the color correlation with host persists and the giant galaxy with unusually large globular cluster population does appear to host a large UCD population as well. We present the current status of the survey.
NASA Technical Reports Server (NTRS)
Onstott, Robert G.; Gineris, Denise J.
1990-01-01
This is the third in a series of three reports which address the statistical description of ground clutter at an airport and in the surrounding area. These data are being utilized in a program to detect microbursts. Synthetic aperture radar (SAR) data were collected at the Denver Stapleton Airport using a set of parameters which closely match those which are anticipated to be utilized by an aircraft on approach to an airport. These data and the results of the clutter study are described. Scenes of 13 x 10 km were imaged at 9.38 GHz and HH-, VV-, and HV-polarizations, and contain airport grounds and facilities (up to 14 percent), cultural areas (more than 50 percent), and rural areas (up to 6 percent). Incidence angles range from 40 to 84 deg. At the largest depression angles the distributed targets, such as forest, fields, water, and residential, rarely had mean scattering coefficients greater than -10 dB. From 30 to 80 percent of an image had scattering coefficients less than -20 dB. About 1 to 10 percent of the scattering coefficients exceeded 0 dB, and from 0 to 1 percent above 10 dB. In examining the average backscatter coefficients at large angles, the clutter types cluster according to the following groups: (1) terminals (-3 dB), (2) city and industrial (-7 dB), (3) warehouse (-10 dB), (4) urban and residential (-14 dB), and (5) grass (-24 dB).
Selection of intracellular calcium patterns in a model with clustered Ca2+ release channels
NASA Astrophysics Data System (ADS)
Shuai, J. W.; Jung, P.
2003-03-01
A two-dimensional model is proposed for intracellular Ca2+ waves, which incorporates both the discrete nature of Ca2+ release sites in the endoplasmic reticulum membrane and the stochastic dynamics of the clustered inositol 1,4,5-triphosphate (IP3) receptors. Depending on the Ca2+ diffusion coefficient and concentration of IP3, various spontaneous Ca2+ patterns, such as calcium puffs, local waves, abortive waves, global oscillation, and tide waves, can be observed. We further investigate the speed of the global waves as a function of the IP3 concentration and the Ca2+ diffusion coefficient and under what conditions the spatially averaged Ca2+ response can be described by a simple set of ordinary differential equations.
Deterministic Joint Remote Preparation of Arbitrary Four-Qubit Cluster-Type State Using EPR Pairs
NASA Astrophysics Data System (ADS)
Li, Wenqian; Chen, Hanwu; Liu, Zhihao
2017-02-01
Using four Einstein-Podolsky-Rosen (EPR) pairs as the pre-shared quantum channel, an economic and feasible scheme for deterministic joint remote preparation of the four-particle cluster-type state is presented. In the scheme, one of the senders performs a four-qubit projective measurement based on a set of ingeniously constructed vectors with real coefficients, while the other performs the bipartite projective measurements in terms of the imaginary coefficients. Followed with some appropriate unitary operations and controlled-NOT operations, the receiver can reconstruct the desired state. Compared with other analogous JRSP schemes, our scheme can not only reconstruct the original state (to be prepared remotely) with unit successful probability, but also ensure greater efficiency.
Raja, Suresh; Valsaraj, Kalliat T
2004-12-01
Uptake of aromatic hydrocarbon vapors (benzene and phenanthrene) by typical micrometer-sized fog-water droplets was studied using a falling droplet reactor at temperatures between 296 and 316 K. Uptake of phenanthrene vapor greater than that predicted by bulk (air-water)-phase equilibrium was observed for diameters less than 200 microm, and this was attributed to surface adsorption. The experimental values of the droplet-vapor partition constant were used to obtain the overall mass transfer coefficient and the mass accommodation coefficient for both benzene and phenanthrene. Mass transfer of phenanthrene was dependent only on gas-phase diffusion and mass accommodation at the interface. However, for benzene, the mass transfer was limited by liquid-phase diffusion and mass accommodation. A large value of the mass accommodation coefficient, alpha = (1.4 +/- 0.4) x 10(-2) was observed for the highly surface-active (hydrophobic) phenanthrene, whereas a small alpha = (9.7 +/- 1.8) x 10(-5) was observed for the less hydrophobic benzene. Critical cluster numbers ranging from 2 for benzene to 5.7 for phenanthrene were deduced using the critical cluster nucleation theory for mass accommodation. The enthalpy of mass accommodation was more negative for phenanthrene than it was for benzene. Consequently, the temperature effect was more pronounced for phenanthrene. A linear correlation was observed for the enthalpy of accommodation with the excess enthalpy of solution. A natural organic carbon surrogate (Suwannee Fulvic acid) in the water droplet increased the uptake for phenanthrene and benzene, the effect being more marked for phenanthrene. A characteristic time constant analysis showed that uptake and droplet scavenging would compete for the fog deposition of phenanthrene, whereas deposition would be unimpeded by the uptake rate for benzene vapor. For both compounds, the characteristic atmospheric reaction times were much larger and would not impact fog deposition.
Clustering stock market companies via chaotic map synchronization
NASA Astrophysics Data System (ADS)
Basalto, N.; Bellotti, R.; De Carlo, F.; Facchi, P.; Pascazio, S.
2005-01-01
A pairwise clustering approach is applied to the analysis of the Dow Jones index companies, in order to identify similar temporal behavior of the traded stock prices. To this end, the chaotic map clustering algorithm is used, where a map is associated to each company and the correlation coefficients of the financial time series to the coupling strengths between maps. The simulation of a chaotic map dynamics gives rise to a natural partition of the data, as companies belonging to the same industrial branch are often grouped together. The identification of clusters of companies of a given stock market index can be exploited in the portfolio optimization strategies.
Ab initio calculation of one-nucleon halo states
NASA Astrophysics Data System (ADS)
Rodkin, D. M.; Tchuvil'sky, Yu M.
2018-02-01
We develop an approach to microscopic and ab initio description of clustered systems, states with halo nucleon and one-nucleon resonances. For these purposes a basis combining ordinary shell-model components and cluster-channel terms is built up. The transformation of clustered wave functions to the uniform Slater-determinant type is performed using the concept of cluster coefficients. The resulting basis of orthonormalized wave functions is used for calculating the eigenvalues and the eigenvectors of Hamiltonians built in the framework of ab initio approaches. Calculations of resonance and halo states of 5He, 9Be and 9B nuclei demonstrate that the approach is workable and labor-saving.
Large-Angular-Scale Clustering as a Clue to the Source of UHECRs
NASA Astrophysics Data System (ADS)
Berlind, Andreas A.; Farrar, Glennys R.
We explore what can be learned about the sources of UHECRs from their large-angular-scale clustering (referred to as their "bias" by the cosmology community). Exploiting the clustering on large scales has the advantage over small-scale correlations of being insensitive to uncertainties in source direction from magnetic smearing or measurement error. In a Cold Dark Matter cosmology, the amplitude of large-scale clustering depends on the mass of the system, with more massive systems such as galaxy clusters clustering more strongly than less massive systems such as ordinary galaxies or AGN. Therefore, studying the large-scale clustering of UHECRs can help determine a mass scale for their sources, given the assumption that their redshift depth is as expected from the GZK cutoff. We investigate the constraining power of a given UHECR sample as a function of its cutoff energy and number of events. We show that current and future samples should be able to distinguish between the cases of their sources being galaxy clusters, ordinary galaxies, or sources that are uncorrelated with the large-scale structure of the universe.
NASA Astrophysics Data System (ADS)
Traore, Harouna; Crouzet, Olivier; Mamy, Laure; Sireyjol, Christine; Rossard, Virginie; Servien, Remy; Latrille, Eric; Benoit, Pierre
2017-04-01
The understanding of the fate of pesticides and their environmental impacts largely relies on their molecular properties. We recently developed 'TyPol' (Typology of Pollutants), a clustering method based on statistical analyses combining several environmental endpoints (i.e. environmental parameters such as sorption coefficient, degradation half-life) and one ecotoxicological one (bioconcentration factor), and structural molecular descriptors (number of atoms in the molecule, molecular surface, dipole moment, energy of orbitals…). TyPol has been conceived on the available knowledge on QSAR of a wide diversity of organic compounds (Mamy et al., 2015). This approach also allows to focus on transformation products present in different clusters and to infer possible changes in environmental fate consecutively to different degradation processes (Servien et al., 2014; Benoit et al., 2016). The initial version of TyPol did not include any ecotoxicological parameters except the bioconcentration factor (BCF), which informs more on the transfer along the trophic chain rather than on the effects on non-target organisms. The objective was to implement the TyPol database with a data set of ecotoxicological data concerning pesticides and several aquatic and terrestrial organisms, in order to test the possibility to extend TyPol to ecotoxicological effects on various organisms. The data analysis (available literature and databases) revealed that relevant ecotoxicological endpoints for terrestrial organisms such as soil microorganisms and macroinvertebrates are lacking compared to aquatic organisms. We have added seven parameters for acute (EC50, LC50) and chronic (NOEC) toxicological effects for the following organisms: Daphnia, Algae, Lemna and Earthworm. In this new configuration, TyPol was used to classify about 45 pesticides in different behavioural and ecotoxicity clusters. The clustering results were analyzed to reveals relationships between molecular descriptors, environmental parameters and the added toxicological parameters. Some trends between soil adsorption or Kow coefficient and the acute toxicity towards earthworms or algae were highlighted, and discussed on the basis of the concept of contaminant bioavailability. This proof-of-concept study also showed that the in silico clustering method TyPol can successfully address new questions and can be expanded with other parameters of interest. Keywords: pesticides, toxicity, QSAR, clustering, PLS References : Servien R., Mamy L., Li Z., Rossard V., Latrille E., Bessac F., Patureau D., Benoit P., 2014. TyPol - A new methodology for organic compounds clustering based on their molecular characteristics and environmental behavior. Chemosphere, 111, 613-622. Mamy L., Patureau D., Barriuso E., Bedos C., Bessac F., Louchart X., Martin-Laurent F., Miege C., Benoit P., 2015. Prediction of the fate of organic compounds in the environment from their molecular properties: A review. Critical Reviews in Environmental Science and Technology, 45, 12, 1277-1377 (Open access). Benoit P., Mamy L., Servien R., Li Z., Latrille E., Rossard V., Bessac F., Patureau D., Martin-Laurent F. 2017. Categorizing chlordecone potential degradation products to explore their environmental fate. Science of the Total Environment, 574, 781-795.
Topic modeling for cluster analysis of large biological and medical datasets
2014-01-01
Background The big data moniker is nowhere better deserved than to describe the ever-increasing prodigiousness and complexity of biological and medical datasets. New methods are needed to generate and test hypotheses, foster biological interpretation, and build validated predictors. Although multivariate techniques such as cluster analysis may allow researchers to identify groups, or clusters, of related variables, the accuracies and effectiveness of traditional clustering methods diminish for large and hyper dimensional datasets. Topic modeling is an active research field in machine learning and has been mainly used as an analytical tool to structure large textual corpora for data mining. Its ability to reduce high dimensionality to a small number of latent variables makes it suitable as a means for clustering or overcoming clustering difficulties in large biological and medical datasets. Results In this study, three topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, are proposed and tested on the cluster analysis of three large datasets: Salmonella pulsed-field gel electrophoresis (PFGE) dataset, lung cancer dataset, and breast cancer dataset, which represent various types of large biological or medical datasets. All three various methods are shown to improve the efficacy/effectiveness of clustering results on the three datasets in comparison to traditional methods. A preferable cluster analysis method emerged for each of the three datasets on the basis of replicating known biological truths. Conclusion Topic modeling could be advantageously applied to the large datasets of biological or medical research. The three proposed topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, yield clustering improvements for the three different data types. Clusters more efficaciously represent truthful groupings and subgroupings in the data than traditional methods, suggesting that topic model-based methods could provide an analytic advancement in the analysis of large biological or medical datasets. PMID:25350106
Topic modeling for cluster analysis of large biological and medical datasets.
Zhao, Weizhong; Zou, Wen; Chen, James J
2014-01-01
The big data moniker is nowhere better deserved than to describe the ever-increasing prodigiousness and complexity of biological and medical datasets. New methods are needed to generate and test hypotheses, foster biological interpretation, and build validated predictors. Although multivariate techniques such as cluster analysis may allow researchers to identify groups, or clusters, of related variables, the accuracies and effectiveness of traditional clustering methods diminish for large and hyper dimensional datasets. Topic modeling is an active research field in machine learning and has been mainly used as an analytical tool to structure large textual corpora for data mining. Its ability to reduce high dimensionality to a small number of latent variables makes it suitable as a means for clustering or overcoming clustering difficulties in large biological and medical datasets. In this study, three topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, are proposed and tested on the cluster analysis of three large datasets: Salmonella pulsed-field gel electrophoresis (PFGE) dataset, lung cancer dataset, and breast cancer dataset, which represent various types of large biological or medical datasets. All three various methods are shown to improve the efficacy/effectiveness of clustering results on the three datasets in comparison to traditional methods. A preferable cluster analysis method emerged for each of the three datasets on the basis of replicating known biological truths. Topic modeling could be advantageously applied to the large datasets of biological or medical research. The three proposed topic model-derived clustering methods, highest probable topic assignment, feature selection and feature extraction, yield clustering improvements for the three different data types. Clusters more efficaciously represent truthful groupings and subgroupings in the data than traditional methods, suggesting that topic model-based methods could provide an analytic advancement in the analysis of large biological or medical datasets.
Steven's orbital reduction factor in ionic clusters
NASA Astrophysics Data System (ADS)
Gajek, Z.; Mulak, J.
1985-11-01
General expressions for reduction coefficients of matrix elements of angular momentum operator in ionic clusters or molecular systems have been derived. The reduction in this approach results from overlap and covalency effects and plays an important role in the reconciling of magnetic and spectroscopic experimental data. The formulated expressions make possible a phenomenological description of the effect with two independent parameters for typical equidistant clusters. Some detailed calculations also suggest the possibility of a one-parameter description. The results of these calculations for some ionic uranium compounds are presented as an example.
Algorithms of maximum likelihood data clustering with applications
NASA Astrophysics Data System (ADS)
Giada, Lorenzo; Marsili, Matteo
2002-12-01
We address the problem of data clustering by introducing an unsupervised, parameter-free approach based on maximum likelihood principle. Starting from the observation that data sets belonging to the same cluster share a common information, we construct an expression for the likelihood of any possible cluster structure. The likelihood in turn depends only on the Pearson's coefficient of the data. We discuss clustering algorithms that provide a fast and reliable approximation to maximum likelihood configurations. Compared to standard clustering methods, our approach has the advantages that (i) it is parameter free, (ii) the number of clusters need not be fixed in advance and (iii) the interpretation of the results is transparent. In order to test our approach and compare it with standard clustering algorithms, we analyze two very different data sets: time series of financial market returns and gene expression data. We find that different maximization algorithms produce similar cluster structures whereas the outcome of standard algorithms has a much wider variability.
Perilli, Miriam L L; Lima, Fernando; Rodrigues, Flávio H G; Cavalcanti, Sandra M C
2016-01-01
Large cats feeding habits have been studied through two main methods: scat analysis and the carcasses of prey killed by monitored animals. From November 2001 to April 2004, we studied jaguar predation patterns using GPS telemetry location clusters on a cattle ranch in southern Pantanal. During this period, we recorded 431 carcasses of animals preyed upon by monitored jaguars. Concurrently, we collected 125 jaguar scats opportunistically. We compared the frequencies of prey found through each method. We also compared the prey communities using Bray-Curtis similarity coefficient. These comparisons allowed us to evaluate the use of scat analysis as a means to describe jaguar feeding habits. Both approaches identified prey communities with high similarity (Bray-Curtis coefficient > 70). According to either method, jaguars consume three main prey: cattle (Bos taurus), caiman (Caiman yacare) and peccaries (Tayassu pecari and Pecari tajacu). The two methods did not differ in the frequency of the three main prey over dry and wet seasons or years sampled. Our results show that scat analysis is effective and capable of describing jaguar feeding habits.
Estimation of the prevalence of adverse drug reactions from social media.
Nguyen, Thin; Larsen, Mark E; O'Dea, Bridianne; Phung, Dinh; Venkatesh, Svetha; Christensen, Helen
2017-06-01
This work aims to estimate the degree of adverse drug reactions (ADR) for psychiatric medications from social media, including Twitter, Reddit, and LiveJournal. Advances in lightning-fast cluster computing was employed to process large scale data, consisting of 6.4 terabytes of data containing 3.8 billion records from all the media. Rates of ADR were quantified using the SIDER database of drugs and side-effects, and an estimated ADR rate was based on the prevalence of discussion in the social media corpora. Agreement between these measures for a sample of ten popular psychiatric drugs was evaluated using the Pearson correlation coefficient, r, with values between 0.08 and 0.50. Word2vec, a novel neural learning framework, was utilized to improve the coverage of variants of ADR terms in the unstructured text by identifying syntactically or semantically similar terms. Improved correlation coefficients, between 0.29 and 0.59, demonstrates the capability of advanced techniques in machine learning to aid in the discovery of meaningful patterns from medical data, and social media data, at scale. Copyright © 2017 Elsevier B.V. All rights reserved.
The weak lensing analysis of the CFHTLS and NGVS RedGOLD galaxy clusters
NASA Astrophysics Data System (ADS)
Parroni, C.; Mei, S.; Erben, T.; Van Waerbeke, L.; Raichoor, A.; Ford, J.; Licitra, R.; Meneghetti, M.; Hildebrandt, H.; Miller, L.; Côté, P.; Covone, G.; Cuillandre, J.-C.; Duc, P.-A.; Ferrarese, L.; Gwyn, S. D. J.; Puzia, T. H.
2017-12-01
An accurate estimation of galaxy cluster masses is essential for their use in cosmological and astrophysical studies. We studied the accuracy of the optical richness obtained by our RedGOLD cluster detection algorithm tep{licitra2016a, licitra2016b} as a mass proxy, using weak lensing and X-ray mass measurements. We measured stacked weak lensing cluster masses for a sample of 1323 galaxy clusters in the Canada-France-Hawaii Telescope Legacy Survey W1 and the Next Generation Virgo Cluster Survey at 0.2
Impact of heuristics in clustering large biological networks.
Shafin, Md Kishwar; Kabir, Kazi Lutful; Ridwan, Iffatur; Anannya, Tasmiah Tamzid; Karim, Rashid Saadman; Hoque, Mohammad Mozammel; Rahman, M Sohel
2015-12-01
Traditional clustering algorithms often exhibit poor performance for large networks. On the contrary, greedy algorithms are found to be relatively efficient while uncovering functional modules from large biological networks. The quality of the clusters produced by these greedy techniques largely depends on the underlying heuristics employed. Different heuristics based on different attributes and properties perform differently in terms of the quality of the clusters produced. This motivates us to design new heuristics for clustering large networks. In this paper, we have proposed two new heuristics and analyzed the performance thereof after incorporating those with three different combinations in a recently celebrated greedy clustering algorithm named SPICi. We have extensively analyzed the effectiveness of these new variants. The results are found to be promising. Copyright © 2015 Elsevier Ltd. All rights reserved.
An association between neighbourhood wealth inequality and HIV prevalence in sub-Saharan Africa.
Brodish, Paul Henry
2015-05-01
This paper investigates whether community-level wealth inequality predicts HIV serostatus using DHS household survey and HIV biomarker data for men and women ages 15-59 pooled from six sub-Saharan African countries with HIV prevalence rates exceeding 5%. The analysis relates the binary dependent variable HIV-positive serostatus and two weighted aggregate predictors generated from the DHS Wealth Index: the Gini coefficient, and the ratio of the wealth of households in the top 20% wealth quintile to that of those in the bottom 20%. In separate multilevel logistic regression models, wealth inequality is used to predict HIV prevalence within each statistical enumeration area, controlling for known individual-level demographic predictors of HIV serostatus. Potential individual-level sexual behaviour mediating variables are added to assess attenuation, and ordered logit models investigate whether the effect is mediated through extramarital sexual partnerships. Both the cluster-level wealth Gini coefficient and wealth ratio significantly predict positive HIV serostatus: a 1 point increase in the cluster-level Gini coefficient and in the cluster-level wealth ratio is associated with a 2.35 and 1.3 times increased likelihood of being HIV positive, respectively, controlling for individual-level demographic predictors, and associations are stronger in models including only males. Adding sexual behaviour variables attenuates the effects of both inequality measures. Reporting eleven plus lifetime sexual partners increases the odds of being HIV positive over five-fold. The likelihood of having more extramarital partners is significantly higher in clusters with greater wealth inequality measured by the wealth ratio. Disaggregating logit models by sex indicates important risk behaviour differences. Household wealth inequality within DHS clusters predicts HIV serostatus, and the relationship is partially mediated by more extramarital partners. These results emphasize the importance of incorporating higher-level contextual factors, investigating behavioural mediators, and disaggregating by sex in assessing HIV risk in order to uncover potential mechanisms of action and points of preventive intervention.
An association between neighborhood wealth inequality and HIV prevalence in sub-Saharan Africa
Brodish, Paul Henry
2016-01-01
Summary This paper investigates whether community-level wealth inequality predicts HIV serostatus, using DHS household survey and HIV biomarker data for men and women ages 15-59 pooled from six sub-Saharan African countries with HIV prevalence rates exceeding five percent. The analysis relates the binary dependent variable HIV positive serostatus and two weighted aggregate predictors generated from the DHS Wealth Index: the Gini coefficient, and the ratio of the wealth of households in the top 20% wealth quintile to that of those in the bottom 20%. In separate multilevel logistic regression models, wealth inequality is used to predict HIV prevalence within each SEA, controlling for known individual-level demographic predictors of HIV serostatus. Potential individual-level sexual behavior mediating variables are added to assess attenuation, and ordered logit models investigate whether the effect is mediated through extramarital sexual partnerships. Both the cluster-level wealth Gini coefficient and wealth ratio significantly predict positive HIV serostatus: a 1 point increase in the cluster-level Gini coefficient and in the cluster-level wealth ratio is associated with a 2.35 and 1.3 times increased likelihood of being HIV positive, respectively, controlling for individual-level demographic predictors, and associations are stronger in models including only males. Adding sexual behavior variables attenuates the effects of both inequality measures. Reporting 11 plus lifetime sexual partners increases the odds of being HIV positive over five-fold. The likelihood of having more extramarital partners is significantly higher in clusters with greater wealth inequality measured by the wealth ratio. Disaggregating logit models by sex indicates important risk behavior differences. Household wealth inequality within DHS clusters predicts HIV serostatus, and the relationship is partially mediated by more extramarital partners. These results emphasize the importance of incorporating higher-level contextual factors, investigating behavioral mediators, and disaggregating by sex in assessing HIV risk in order to uncover potential mechanisms of action and points of preventive intervention PMID:24406021
Optical mapping of prefrontal brain connectivity and activation during emotion anticipation.
Wang, Meng-Yun; Lu, Feng-Mei; Hu, Zhishan; Zhang, Juan; Yuan, Zhen
2018-09-17
Accumulated neuroimaging evidence shows that the dorsal lateral prefrontal cortex (dlPFC) is activated during emotion anticipation. The aim of this work is to examine the brain connectivity and activation differences in dlPFC between the positive, neutral and negative emotion anticipation by using functional near-infrared spectroscopy (fNIRS). The hemodynamic responses were first assessed for all subjects during the performance of various emotion anticipation tasks. And then small-world analysis was performed, in which the small-world network indicators including the clustering coefficient, average path length, average node degree, and measure of small-world index were calculated for the functional brain networks associated with the positive, neutral and negative emotion anticipation, respectively. We discovered that compared to negative and neutral emotion anticipation, the positive one exhibited enhanced brain activation in the left dlPFC. Although the functional brain networks for the three emotion anticipation cases manifested the small-world properties regarding the clustering coefficient, average path length, average node degree, and measure of small-world index, the positive one showed significantly higher clustering coefficient and shorter average path length than those from the neutral and negative cases. Consequently, the small-world network indicators and brain activation in dlPPC were able to distinguish well between the positive, neutral and negative emotion anticipation. Copyright © 2018 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Bai, Yulong; Yang, Bo; Guo, Fei; Lu, Qingshan; Zhao, Shifeng
2017-11-01
Cluster-assembled SmCo alloy films were prepared by low energy cluster beam deposition. The structure, magnetic domain, magnetization, and magnetostriction of the films were characterized. It is shown that the as-prepared films are assembled in compact and uniformly distributed spherical cluster nanoparticles, most of which, after vacuum in situ annealing at 700 K, aggregated to form cluster islands. These cluster islands result in transformations from superparamagnetic states to magnetic single domain (MSD) states in the films. Such MSD structures contribute to the enhanced magnetostrictive behaviors with a saturation magnetostrictive coefficient of 160 × 10-6 in comparison to 105 × 10-6 for the as-prepared films. This work demonstrates candidate materials that could be applied in nano-electro-mechanical systems, low power information storage, and weak magnetic detecting devices.
Thermophysical properties of krypton-helium gas mixtures from ab initio pair potentials
2017-01-01
A new potential energy curve for the krypton-helium atom pair was developed using supermolecular ab initio computations for 34 interatomic distances. Values for the interaction energies at the complete basis set limit were obtained from calculations with the coupled-cluster method with single, double, and perturbative triple excitations and correlation consistent basis sets up to sextuple-zeta quality augmented with mid-bond functions. Higher-order coupled-cluster excitations up to the full quadruple level were accounted for in a scheme of successive correction terms. Core-core and core-valence correlation effects were included. Relativistic corrections were considered not only at the scalar relativistic level but also using full four-component Dirac–Coulomb and Dirac–Coulomb–Gaunt calculations. The fitted analytical pair potential function is characterized by a well depth of 31.42 K with an estimated standard uncertainty of 0.08 K. Statistical thermodynamics was applied to compute the krypton-helium cross second virial coefficients. The results show a very good agreement with the best experimental data. Kinetic theory calculations based on classical and quantum-mechanical approaches for the underlying collision dynamics were utilized to compute the transport properties of krypton-helium mixtures in the dilute-gas limit for a large temperature range. The results were analyzed with respect to the orders of approximation of kinetic theory and compared with experimental data. Especially the data for the binary diffusion coefficient confirm the predictive quality of the new potential. Furthermore, inconsistencies between two empirical pair potential functions for the krypton-helium system from the literature could be resolved. PMID:28595411
Features and heterogeneities in growing network models
NASA Astrophysics Data System (ADS)
Ferretti, Luca; Cortelezzi, Michele; Yang, Bin; Marmorini, Giacomo; Bianconi, Ginestra
2012-06-01
Many complex networks from the World Wide Web to biological networks grow taking into account the heterogeneous features of the nodes. The feature of a node might be a discrete quantity such as a classification of a URL document such as personal page, thematic website, news, blog, search engine, social network, etc., or the classification of a gene in a functional module. Moreover the feature of a node can be a continuous variable such as the position of a node in the embedding space. In order to account for these properties, in this paper we provide a generalization of growing network models with preferential attachment that includes the effect of heterogeneous features of the nodes. The main effect of heterogeneity is the emergence of an “effective fitness” for each class of nodes, determining the rate at which nodes acquire new links. The degree distribution exhibits a multiscaling behavior analogous to the the fitness model. This property is robust with respect to variations in the model, as long as links are assigned through effective preferential attachment. Beyond the degree distribution, in this paper we give a full characterization of the other relevant properties of the model. We evaluate the clustering coefficient and show that it disappears for large network size, a property shared with the Barabási-Albert model. Negative degree correlations are also present in this class of models, along with nontrivial mixing patterns among features. We therefore conclude that both small clustering coefficients and disassortative mixing are outcomes of the preferential attachment mechanism in general growing networks.
Thermophysical properties of krypton-helium gas mixtures from ab initio pair potentials
NASA Astrophysics Data System (ADS)
Jäger, Benjamin; Bich, Eckard
2017-06-01
A new potential energy curve for the krypton-helium atom pair was developed using supermolecular ab initio computations for 34 interatomic distances. Values for the interaction energies at the complete basis set limit were obtained from calculations with the coupled-cluster method with single, double, and perturbative triple excitations and correlation consistent basis sets up to sextuple-zeta quality augmented with mid-bond functions. Higher-order coupled-cluster excitations up to the full quadruple level were accounted for in a scheme of successive correction terms. Core-core and core-valence correlation effects were included. Relativistic corrections were considered not only at the scalar relativistic level but also using full four-component Dirac-Coulomb and Dirac-Coulomb-Gaunt calculations. The fitted analytical pair potential function is characterized by a well depth of 31.42 K with an estimated standard uncertainty of 0.08 K. Statistical thermodynamics was applied to compute the krypton-helium cross second virial coefficients. The results show a very good agreement with the best experimental data. Kinetic theory calculations based on classical and quantum-mechanical approaches for the underlying collision dynamics were utilized to compute the transport properties of krypton-helium mixtures in the dilute-gas limit for a large temperature range. The results were analyzed with respect to the orders of approximation of kinetic theory and compared with experimental data. Especially the data for the binary diffusion coefficient confirm the predictive quality of the new potential. Furthermore, inconsistencies between two empirical pair potential functions for the krypton-helium system from the literature could be resolved.
GraphCrunch 2: Software tool for network modeling, alignment and clustering.
Kuchaiev, Oleksii; Stevanović, Aleksandar; Hayes, Wayne; Pržulj, Nataša
2011-01-19
Recent advancements in experimental biotechnology have produced large amounts of protein-protein interaction (PPI) data. The topology of PPI networks is believed to have a strong link to their function. Hence, the abundance of PPI data for many organisms stimulates the development of computational techniques for the modeling, comparison, alignment, and clustering of networks. In addition, finding representative models for PPI networks will improve our understanding of the cell just as a model of gravity has helped us understand planetary motion. To decide if a model is representative, we need quantitative comparisons of model networks to real ones. However, exact network comparison is computationally intractable and therefore several heuristics have been used instead. Some of these heuristics are easily computable "network properties," such as the degree distribution, or the clustering coefficient. An important special case of network comparison is the network alignment problem. Analogous to sequence alignment, this problem asks to find the "best" mapping between regions in two networks. It is expected that network alignment might have as strong an impact on our understanding of biology as sequence alignment has had. Topology-based clustering of nodes in PPI networks is another example of an important network analysis problem that can uncover relationships between interaction patterns and phenotype. We introduce the GraphCrunch 2 software tool, which addresses these problems. It is a significant extension of GraphCrunch which implements the most popular random network models and compares them with the data networks with respect to many network properties. Also, GraphCrunch 2 implements the GRAph ALigner algorithm ("GRAAL") for purely topological network alignment. GRAAL can align any pair of networks and exposes large, dense, contiguous regions of topological and functional similarities far larger than any other existing tool. Finally, GraphCruch 2 implements an algorithm for clustering nodes within a network based solely on their topological similarities. Using GraphCrunch 2, we demonstrate that eukaryotic and viral PPI networks may belong to different graph model families and show that topology-based clustering can reveal important functional similarities between proteins within yeast and human PPI networks. GraphCrunch 2 is a software tool that implements the latest research on biological network analysis. It parallelizes computationally intensive tasks to fully utilize the potential of modern multi-core CPUs. It is open-source and freely available for research use. It runs under the Windows and Linux platforms.
NASA Astrophysics Data System (ADS)
Jeong, Donghui; Desjacques, Vincent; Schmidt, Fabian
2018-01-01
Here, we briefly introduce the key results of the recent review (arXiv:1611.09787), whose abstract is as following. This review presents a comprehensive overview of galaxy bias, that is, the statistical relation between the distribution of galaxies and matter. We focus on large scales where cosmic density fields are quasi-linear. On these scales, the clustering of galaxies can be described by a perturbative bias expansion, and the complicated physics of galaxy formation is absorbed by a finite set of coefficients of the expansion, called bias parameters. The review begins with a detailed derivation of this very important result, which forms the basis of the rigorous perturbative description of galaxy clustering, under the assumptions of General Relativity and Gaussian, adiabatic initial conditions. Key components of the bias expansion are all leading local gravitational observables, which include the matter density but also tidal fields and their time derivatives. We hence expand the definition of local bias to encompass all these contributions. This derivation is followed by a presentation of the peak-background split in its general form, which elucidates the physical meaning of the bias parameters, and a detailed description of the connection between bias parameters and galaxy (or halo) statistics. We then review the excursion set formalism and peak theory which provide predictions for the values of the bias parameters. In the remainder of the review, we consider the generalizations of galaxy bias required in the presence of various types of cosmological physics that go beyond pressureless matter with adiabatic, Gaussian initial conditions: primordial non-Gaussianity, massive neutrinos, baryon-CDM isocurvature perturbations, dark energy, and modified gravity. Finally, we discuss how the description of galaxy bias in the galaxies' rest frame is related to clustering statistics measured from the observed angular positions and redshifts in actual galaxy catalogs.
NASA Astrophysics Data System (ADS)
Desjacques, Vincent; Jeong, Donghui; Schmidt, Fabian
2018-02-01
This review presents a comprehensive overview of galaxy bias, that is, the statistical relation between the distribution of galaxies and matter. We focus on large scales where cosmic density fields are quasi-linear. On these scales, the clustering of galaxies can be described by a perturbative bias expansion, and the complicated physics of galaxy formation is absorbed by a finite set of coefficients of the expansion, called bias parameters. The review begins with a detailed derivation of this very important result, which forms the basis of the rigorous perturbative description of galaxy clustering, under the assumptions of General Relativity and Gaussian, adiabatic initial conditions. Key components of the bias expansion are all leading local gravitational observables, which include the matter density but also tidal fields and their time derivatives. We hence expand the definition of local bias to encompass all these contributions. This derivation is followed by a presentation of the peak-background split in its general form, which elucidates the physical meaning of the bias parameters, and a detailed description of the connection between bias parameters and galaxy statistics. We then review the excursion-set formalism and peak theory which provide predictions for the values of the bias parameters. In the remainder of the review, we consider the generalizations of galaxy bias required in the presence of various types of cosmological physics that go beyond pressureless matter with adiabatic, Gaussian initial conditions: primordial non-Gaussianity, massive neutrinos, baryon-CDM isocurvature perturbations, dark energy, and modified gravity. Finally, we discuss how the description of galaxy bias in the galaxies' rest frame is related to clustering statistics measured from the observed angular positions and redshifts in actual galaxy catalogs.
DeGroote, John P; Sugumaran, Ramanathan; Ecker, Mark
2014-11-01
After several years of low West Nile virus (WNV) occurrence in the United States of America (USA), 2012 witnessed large outbreaks in several parts of the country. In order to understand the outbreak dynamics, spatial clustering and landscape, demographic and climatic associations with WNV occurrence were investigated at a regional level in the USA. Previous research has demonstrated that there are a handful of prominent WNV mosquito vectors with varying ecological requirements responsible for WNV transmission in the USA. Published range maps of these important vectors were georeferenced and used to define eight functional ecological regions in the coterminous USA. The number of human WNV cases and human populations by county were attained in order to calculate a WNV rate for each county in 2012. Additionally, a binary value (high/low) was calculated for each county based on whether the county WNV rate was above or below the rate for the region it fell in. Global Moran's I and Anselin Local Moran's I statistics of spatial association were used per region to examine and visualize clustering of the WNV rate and the high/low rating. Spatial data on landscape, demographic and climatic variables were compiled and derived from a variety of sources and then investigated in relation to human WNV using both Spearman rho correlation coefficients and Poisson regression models. Findings demonstrated significant spatial clustering of WNV and substantial inter-regional differences in relationships between WNV occurrence and landscape, demographic and climatically related variables. The regional associations were consistent with the ecologies of the dominant vectors for those regions. The large outbreak in the Southeast region was preceded by higher than normal winter and spring precipitation followed by dry and hot conditions in the summer.
Deterministic Joint Remote Preparation of a Four-Qubit Cluster-Type State via GHZ States
NASA Astrophysics Data System (ADS)
Wang, Hai-bin; Zhou, Xiao-Yan; An, Xing-xing; Cui, Meng-Meng; Fu, De-sheng
2016-08-01
A scheme for the deterministic joint remote preparation of a four-qubit cluster-type state using only two Greenberger-Horne-Zeilinger (GHZ) states as quantum channels is presented. In this scheme, the first sender performs a two-qubit projective measurement according to the real coefficient of the desired state. Then, the other sender utilizes the measurement result and the complex coefficient to perform another projective measurement. To obtain the desired state, the receiver applies appropriate unitary operations to his/her own two qubits and two CNOT operations to the two ancillary ones. Most interestingly, our scheme can achieve unit success probability, i.e., P s u c =1. Furthermore, comparison reveals that the efficiency is higher than that of most other analogous schemes.
Pressure-induced positive electrical resistivity coefficient in Ni-Nb-Zr-H glassy alloy
NASA Astrophysics Data System (ADS)
Fukuhara, M.; Gangli, C.; Matsubayashi, K.; Uwatoko, Y.
2012-06-01
Measurements under hydrostatic pressure of the electrical resistivity of (Ni0.36Nb0.24Zr0.40)100-xHx (x = 9.8, 11.5, and 14) glassy alloys have been made in the range of 0-8 GPa and 0.5-300 K. The resistivity of the (Ni0.36Nb0.24Zr0.40)86H14 alloy changed its sign from negative to positive under application of 2-8 GPa in the temperature range of 300-22 K, coming from electron-phonon interaction in the cluster structure under pressure, accompanied by deformation of the clusters. In temperature region below 22 K, the resistivity showed negative thermal coefficient resistance by Debye-Waller factor contribution, and superconductivity was observed at 1.5 K.
Semisupervised Clustering by Iterative Partition and Regression with Neuroscience Applications
Qian, Guoqi; Wu, Yuehua; Ferrari, Davide; Qiao, Puxue; Hollande, Frédéric
2016-01-01
Regression clustering is a mixture of unsupervised and supervised statistical learning and data mining method which is found in a wide range of applications including artificial intelligence and neuroscience. It performs unsupervised learning when it clusters the data according to their respective unobserved regression hyperplanes. The method also performs supervised learning when it fits regression hyperplanes to the corresponding data clusters. Applying regression clustering in practice requires means of determining the underlying number of clusters in the data, finding the cluster label of each data point, and estimating the regression coefficients of the model. In this paper, we review the estimation and selection issues in regression clustering with regard to the least squares and robust statistical methods. We also provide a model selection based technique to determine the number of regression clusters underlying the data. We further develop a computing procedure for regression clustering estimation and selection. Finally, simulation studies are presented for assessing the procedure, together with analyzing a real data set on RGB cell marking in neuroscience to illustrate and interpret the method. PMID:27212939
Sensitivity evaluation of dynamic speckle activity measurements using clustering methods.
Etchepareborda, Pablo; Federico, Alejandro; Kaufmann, Guillermo H
2010-07-01
We evaluate and compare the use of competitive neural networks, self-organizing maps, the expectation-maximization algorithm, K-means, and fuzzy C-means techniques as partitional clustering methods, when the sensitivity of the activity measurement of dynamic speckle images needs to be improved. The temporal history of the acquired intensity generated by each pixel is analyzed in a wavelet decomposition framework, and it is shown that the mean energy of its corresponding wavelet coefficients provides a suited feature space for clustering purposes. The sensitivity obtained by using the evaluated clustering techniques is also compared with the well-known methods of Konishi-Fujii, weighted generalized differences, and wavelet entropy. The performance of the partitional clustering approach is evaluated using simulated dynamic speckle patterns and also experimental data.
Martens, Jonas; Daly, Daniel; Deschamps, Kevin; Staes, Filip; Fernandes, Ricardo J
2016-12-01
Variability of electromyographic (EMG) recordings is a complex phenomenon rarely examined in swimming. Our purposes were to investigate inter-individual variability in muscle activation patterns during front crawl swimming and assess if there were clusters of sub patterns present. Bilateral muscle activity of rectus abdominis (RA) and deltoideus medialis (DM) was recorded using wireless surface EMG in 15 adult male competitive swimmers. The amplitude of the median EMG trial of six upper arm movement cycles was used for the inter-individual variability assessment, quantified with the coefficient of variation, coefficient of quartile variation, the variance ratio and mean deviation. Key features were selected based on qualitative and quantitative classification strategies to enter in a k-means cluster analysis to examine the presence of strong sub patterns. Such strong sub patterns were found when clustering in two, three and four clusters. Inter-individual variability in a group of highly skilled swimmers was higher compared to other cyclic movements which is in contrast to what has been reported in the previous 50years of EMG research in swimming. This leads to the conclusion that coaches should be careful in using overall reference EMG information to enhance the individual swimming technique of their athletes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Anatomical relationships between serotonin 5-HT2A and dopamine D2 receptors in living human brain.
Ishii, Tatsuya; Kimura, Yasuyuki; Ichise, Masanori; Takahata, Keisuke; Kitamura, Soichiro; Moriguchi, Sho; Kubota, Manabu; Zhang, Ming-Rong; Yamada, Makiko; Higuchi, Makoto; Okubo, Yoshinori; Suhara, Tetsuya
2017-01-01
Seven healthy volunteers underwent PET scans with [18F]altanserin and [11C]FLB 457 for 5-HT2A and D2 receptors, respectively. As a measure of receptor density, a binding potential (BP) was calculated from PET data for 76 cerebral cortical regions. A correlation matrix was calculated between the binding potentials of [18F]altanserin and [11C]FLB 457 for those regions. The regional relationships were investigated using a bicluster analysis of the correlation matrix with an iterative signature algorithm. We identified two clusters of regions. The first cluster identified a distinct profile of correlation coefficients between 5-HT2A and D2 receptors, with the former in regions related to sensorimotor integration (supplementary motor area, superior parietal gyrus, and paracentral lobule) and the latter in most cortical regions. The second cluster identified another distinct profile of correlation coefficients between 5-HT2A receptors in the bilateral hippocampi and D2 receptors in most cortical regions. The observation of two distinct clusters in the correlation matrix suggests regional interactions between 5-HT2A and D2 receptors in sensorimotor integration and hippocampal function. A bicluster analysis of the correlation matrix of these neuroreceptors may be beneficial in understanding molecular networks in the human brain.
The Large-scale Structure of the Universe: Probes of Cosmology and Structure Formation
NASA Astrophysics Data System (ADS)
Noh, Yookyung
The usefulness of large-scale structure as a probe of cosmology and structure formation is increasing as large deep surveys in multi-wavelength bands are becoming possible. The observational analysis of large-scale structure guided by large volume numerical simulations are beginning to offer us complementary information and crosschecks of cosmological parameters estimated from the anisotropies in Cosmic Microwave Background (CMB) radiation. Understanding structure formation and evolution and even galaxy formation history is also being aided by observations of different redshift snapshots of the Universe, using various tracers of large-scale structure. This dissertation work covers aspects of large-scale structure from the baryon acoustic oscillation scale, to that of large scale filaments and galaxy clusters. First, I discuss a large- scale structure use for high precision cosmology. I investigate the reconstruction of Baryon Acoustic Oscillation (BAO) peak within the context of Lagrangian perturbation theory, testing its validity in a large suite of cosmological volume N-body simulations. Then I consider galaxy clusters and the large scale filaments surrounding them in a high resolution N-body simulation. I investigate the geometrical properties of galaxy cluster neighborhoods, focusing on the filaments connected to clusters. Using mock observations of galaxy clusters, I explore the correlations of scatter in galaxy cluster mass estimates from multi-wavelength observations and different measurement techniques. I also examine the sources of the correlated scatter by considering the intrinsic and environmental properties of clusters.
Aćimović, Jugoslava; Mäki-Marttunen, Tuomo; Linne, Marja-Leena
2015-01-01
We developed a two-level statistical model that addresses the question of how properties of neurite morphology shape the large-scale network connectivity. We adopted a low-dimensional statistical description of neurites. From the neurite model description we derived the expected number of synapses, node degree, and the effective radius, the maximal distance between two neurons expected to form at least one synapse. We related these quantities to the network connectivity described using standard measures from graph theory, such as motif counts, clustering coefficient, minimal path length, and small-world coefficient. These measures are used in a neuroscience context to study phenomena from synaptic connectivity in the small neuronal networks to large scale functional connectivity in the cortex. For these measures we provide analytical solutions that clearly relate different model properties. Neurites that sparsely cover space lead to a small effective radius. If the effective radius is small compared to the overall neuron size the obtained networks share similarities with the uniform random networks as each neuron connects to a small number of distant neurons. Large neurites with densely packed branches lead to a large effective radius. If this effective radius is large compared to the neuron size, the obtained networks have many local connections. In between these extremes, the networks maximize the variability of connection repertoires. The presented approach connects the properties of neuron morphology with large scale network properties without requiring heavy simulations with many model parameters. The two-steps procedure provides an easier interpretation of the role of each modeled parameter. The model is flexible and each of its components can be further expanded. We identified a range of model parameters that maximizes variability in network connectivity, the property that might affect network capacity to exhibit different dynamical regimes.
Tracing Gas Motions in the Centaurus Cluster
DOE Office of Scientific and Technical Information (OSTI.GOV)
Graham, James; Fabian, A.C.; Sanders, J.S.
2006-03-01
We apply the stochastic model of iron transport developed by Rebusco et al. (2005) to the Centaurus cluster. Using this model, we find that an effective diffusion coefficient D in the range 2 x 10{sup 28} - 4 x 10{sup 28} cm{sup 2}s{sup -1} can approximately reproduce the observed abundance distribution. Reproducing the flat central profile and sharp drop around 30-70 kpc, however, requires a diffusion coefficient that drops rapidly with radius so that D > 4 x 10{sup 28} cm{sup 2}s{sup -1} only inside about 25 kpc. Assuming that all transport is due to fully-developed turbulence, which is alsomore » responsible for offsetting cooling in the cluster core, we calculate the length and velocity scales of energy injection. These length scales are found to be up to a factor of {approx} 10 larger than expected if the turbulence is due to the inflation and rising of a bubble. We also calculate the turbulent thermal conductivity and find it is unlikely to be significant in preventing cooling.« less
A New Cluster Analysis-Marker-Controlled Watershed Method for Separating Particles of Granular Soils
Alam, Md Ferdous
2017-01-01
An accurate determination of particle-level fabric of granular soils from tomography data requires a maximum correct separation of particles. The popular marker-controlled watershed separation method is widely used to separate particles. However, the watershed method alone is not capable of producing the maximum separation of particles when subjected to boundary stresses leading to crushing of particles. In this paper, a new separation method, named as Monash Particle Separation Method (MPSM), has been introduced. The new method automatically determines the optimal contrast coefficient based on cluster evaluation framework to produce the maximum accurate separation outcomes. Finally, the particles which could not be separated by the optimal contrast coefficient were separated by integrating cuboid markers generated from the clustering by Gaussian mixture models into the routine watershed method. The MPSM was validated on a uniformly graded sand volume subjected to one-dimensional compression loading up to 32 MPa. It was demonstrated that the MPSM is capable of producing the best possible separation of particles required for the fabric analysis. PMID:29057823
NASA Astrophysics Data System (ADS)
Yokoyama, Tadashi; Sakuma, Hiroshi
2018-03-01
Silicon (Si) is the most abundant cation in crustal rocks. The charge and degree of polymerization of dissolved Si significantly change depending on solution pH and Si concentration. We used molecular dynamics (MD) simulations to predict the self-diffusion coefficients of dissolved Si, DSi, for 15 monomeric and polymeric species at ambient temperature. The results showed that DSi decreased with increasing negative charge and increasing degree of polymerization. The relationship between DSi and charge (Z) can be expressed by DSi/10-6 = 2.0 + 9.8e0.47Z, and that between DSi and number of polymerization (NSi) by DSi/10-6 = 9.7/NSi0.56. The results also revealed that multiple Si molecules assembled into a cluster and D decreased as the cluster size increased. Experiments to evaluate the diffusivity of Si in pore water revealed that the diffusion coefficient decreased with increasing Si concentration, a result consistent with the MD simulations. Simulation results can now be used to quantitatively assess water-rock interactions and water-concrete reactions over a wide range of environmentally relevant conditions.
Sector Identification in a Set of Stock Return Time Series Traded at the London Stock Exchange
NASA Astrophysics Data System (ADS)
Coronnello, C.; Tumminello, M.; Lillo, F.; Micciche, S.; Mantegna, R. N.
2005-09-01
We compare some methods recently used in the literature to detect the existence of a certain degree of common behavior of stock returns belonging to the same economic sector. Specifically, we discuss methods based on random matrix theory and hierarchical clustering techniques. We apply these methods to a portfolio of stocks traded at the London Stock Exchange. The investigated time series are recorded both at a daily time horizon and at a 5-minute time horizon. The correlation coefficient matrix is very different at different time horizons confirming that more structured correlation coefficient matrices are observed for long time horizons. All the considered methods are able to detect economic information and the presence of clusters characterized by the economic sector of stocks. However, different methods present a different degree of sensitivity with respect to different sectors. Our comparative analysis suggests that the application of just a single method could not be able to extract all the economic information present in the correlation coefficient matrix of a stock portfolio.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Steenbergen, K. G., E-mail: kgsteen@gmail.com; Gaston, N.
2014-02-14
Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement formore » a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.« less
Ab Initio Molecular Dynamics Studies of Pb m Sb n ( m + n ≤ 9) Alloy Clusters
NASA Astrophysics Data System (ADS)
Song, Bingyi; Xu, Baoqiang; Yang, Bin; Jiang, Wenlong; Chen, Xiumin; Xu, Na; Liu, Dachun; Dai, Yongnian
2017-10-01
Structure, stability, and dynamics of Pb m Sb n ( m + n ≤ 9) clusters were investigated using ab initio molecular dynamics. Size dependence of binding energies, the second-order energy difference of clusters, dissociation energy, HOMO-LUMO gaps, Mayer bond order, and the diffusion coefficient of Pb m Sb n clusters were discussed. Results suggest that Pb3Sb2, Pb4Sb2, and Pb5Sb4 ( n = 2 or 4) clusters have higher stability than other clusters, which is consistent with previous findings. In case of Pb-Sb alloy, the dynamics results show that Pb4Sb2 (Pb-22.71 wt pct Sb) can exist in gas phase at 1073 K (800 °C), which reasonably explains the azeotropic phenomenon, and the calculated values are in agreement with the experimental results (Pb-22 wt pct Sb).
Steenbergen, K G; Gaston, N
2014-02-14
Inspired by methods of remote sensing image analysis, we analyze structural variation in cluster molecular dynamics (MD) simulations through a unique application of the principal component analysis (PCA) and Pearson Correlation Coefficient (PCC). The PCA analysis characterizes the geometric shape of the cluster structure at each time step, yielding a detailed and quantitative measure of structural stability and variation at finite temperature. Our PCC analysis captures bond structure variation in MD, which can be used to both supplement the PCA analysis as well as compare bond patterns between different cluster sizes. Relying only on atomic position data, without requirement for a priori structural input, PCA and PCC can be used to analyze both classical and ab initio MD simulations for any cluster composition or electronic configuration. Taken together, these statistical tools represent powerful new techniques for quantitative structural characterization and isomer identification in cluster MD.
Pascual-García, Alberto; Abia, David; Ortiz, Angel R; Bastolla, Ugo
2009-03-01
Structural classifications of proteins assume the existence of the fold, which is an intrinsic equivalence class of protein domains. Here, we test in which conditions such an equivalence class is compatible with objective similarity measures. We base our analysis on the transitive property of the equivalence relationship, requiring that similarity of A with B and B with C implies that A and C are also similar. Divergent gene evolution leads us to expect that the transitive property should approximately hold. However, if protein domains are a combination of recurrent short polypeptide fragments, as proposed by several authors, then similarity of partial fragments may violate the transitive property, favouring the continuous view of the protein structure space. We propose a measure to quantify the violations of the transitive property when a clustering algorithm joins elements into clusters, and we find out that such violations present a well defined and detectable cross-over point, from an approximately transitive regime at high structure similarity to a regime with large transitivity violations and large differences in length at low similarity. We argue that protein structure space is discrete and hierarchic classification is justified up to this cross-over point, whereas at lower similarities the structure space is continuous and it should be represented as a network. We have tested the qualitative behaviour of this measure, varying all the choices involved in the automatic classification procedure, i.e., domain decomposition, alignment algorithm, similarity score, and clustering algorithm, and we have found out that this behaviour is quite robust. The final classification depends on the chosen algorithms. We used the values of the clustering coefficient and the transitivity violations to select the optimal choices among those that we tested. Interestingly, this criterion also favours the agreement between automatic and expert classifications. As a domain set, we have selected a consensus set of 2,890 domains decomposed very similarly in SCOP and CATH. As an alignment algorithm, we used a global version of MAMMOTH developed in our group, which is both rapid and accurate. As a similarity measure, we used the size-normalized contact overlap, and as a clustering algorithm, we used average linkage. The resulting automatic classification at the cross-over point was more consistent than expert ones with respect to the structure similarity measure, with 86% of the clusters corresponding to subsets of either SCOP or CATH superfamilies and fewer than 5% containing domains in distinct folds according to both SCOP and CATH. Almost 15% of SCOP superfamilies and 10% of CATH superfamilies were split, consistent with the notion of fold change in protein evolution. These results were qualitatively robust for all choices that we tested, although we did not try to use alignment algorithms developed by other groups. Folds defined in SCOP and CATH would be completely joined in the regime of large transitivity violations where clustering is more arbitrary. Consistently, the agreement between SCOP and CATH at fold level was lower than their agreement with the automatic classification obtained using as a clustering algorithm, respectively, average linkage (for SCOP) or single linkage (for CATH). The networks representing significant evolutionary and structural relationships between clusters beyond the cross-over point may allow us to perform evolutionary, structural, or functional analyses beyond the limits of classification schemes. These networks and the underlying clusters are available at http://ub.cbm.uam.es/research/ProtNet.php.
Statistical indicators of collective behavior and functional clusters in gene networks of yeast
NASA Astrophysics Data System (ADS)
Živković, J.; Tadić, B.; Wick, N.; Thurner, S.
2006-03-01
We analyze gene expression time-series data of yeast (S. cerevisiae) measured along two full cell-cycles. We quantify these data by using q-exponentials, gene expression ranking and a temporal mean-variance analysis. We construct gene interaction networks based on correlation coefficients and study the formation of the corresponding giant components and minimum spanning trees. By coloring genes according to their cell function we find functional clusters in the correlation networks and functional branches in the associated trees. Our results suggest that a percolation point of functional clusters can be identified on these gene expression correlation networks.
NASA Astrophysics Data System (ADS)
Leon, Stéphane; Bergond, Gilles; Vallenari, Antonella
1999-04-01
We present the tidal tail distributions of a sample of candidate binary clusters located in the bar of the Large Magellanic Cloud (LMC). One isolated cluster, SL 268, is presented in order to study the effect of the LMC tidal field. All the candidate binary clusters show tidal tails, confirming that the pairs are formed by physically linked objects. The stellar mass in the tails covers a large range, from 1.8x 10(3) to 3x 10(4) \\msun. We derive a total mass estimate for SL 268 and SL 356. At large radii, the projected density profiles of SL 268 and SL 356 fall off as r(-gamma ) , with gamma = 2.27 and gamma =3.44, respectively. Out of 4 pairs or multiple systems, 2 are older than the theoretical survival time of binary clusters (going from a few 10(6) years to 10(8) years). A pair shows too large age difference between the components to be consistent with classical theoretical models of binary cluster formation (Fujimoto & Kumai \\cite{fujimoto97}). We refer to this as the ``overmerging'' problem. A different scenario is proposed: the formation proceeds in large molecular complexes giving birth to groups of clusters over a few 10(7) years. In these groups the expected cluster encounter rate is larger, and tidal capture has higher probability. Cluster pairs are not born together through the splitting of the parent cloud, but formed later by tidal capture. For 3 pairs, we tentatively identify the star cluster group (SCG) memberships. The SCG formation, through the recent cluster starburst triggered by the LMC-SMC encounter, in contrast with the quiescent open cluster formation in the Milky Way can be an explanation to the paucity of binary clusters observed in our Galaxy. Based on observations collected at the European Southern Observatory, La Silla, Chile}
The geometry of chaotic dynamics — a complex network perspective
NASA Astrophysics Data System (ADS)
Donner, R. V.; Heitzig, J.; Donges, J. F.; Zou, Y.; Marwan, N.; Kurths, J.
2011-12-01
Recently, several complex network approaches to time series analysis have been developed and applied to study a wide range of model systems as well as real-world data, e.g., geophysical or financial time series. Among these techniques, recurrence-based concepts and prominently ɛ-recurrence networks, most faithfully represent the geometrical fine structure of the attractors underlying chaotic (and less interestingly non-chaotic) time series. In this paper we demonstrate that the well known graph theoretical properties local clustering coefficient and global (network) transitivity can meaningfully be exploited to define two new local and two new global measures of dimension in phase space: local upper and lower clustering dimension as well as global upper and lower transitivity dimension. Rigorous analytical as well as numerical results for self-similar sets and simple chaotic model systems suggest that these measures are well-behaved in most non-pathological situations and that they can be estimated reasonably well using ɛ-recurrence networks constructed from relatively short time series. Moreover, we study the relationship between clustering and transitivity dimensions on the one hand, and traditional measures like pointwise dimension or local Lyapunov dimension on the other hand. We also provide further evidence that the local clustering coefficients, or equivalently the local clustering dimensions, are useful for identifying unstable periodic orbits and other dynamically invariant objects from time series. Our results demonstrate that ɛ-recurrence networks exhibit an important link between dynamical systems and graph theory.
Re-estimating sample size in cluster randomised trials with active recruitment within clusters.
van Schie, S; Moerbeek, M
2014-08-30
Often only a limited number of clusters can be obtained in cluster randomised trials, although many potential participants can be recruited within each cluster. Thus, active recruitment is feasible within the clusters. To obtain an efficient sample size in a cluster randomised trial, the cluster level and individual level variance should be known before the study starts, but this is often not the case. We suggest using an internal pilot study design to address this problem of unknown variances. A pilot can be useful to re-estimate the variances and re-calculate the sample size during the trial. Using simulated data, it is shown that an initially low or high power can be adjusted using an internal pilot with the type I error rate remaining within an acceptable range. The intracluster correlation coefficient can be re-estimated with more precision, which has a positive effect on the sample size. We conclude that an internal pilot study design may be used if active recruitment is feasible within a limited number of clusters. Copyright © 2014 John Wiley & Sons, Ltd.
Genetic analysis of floating Enteromorpha prolifera in the Yellow Sea with AFLP marker
NASA Astrophysics Data System (ADS)
Liu, Cui; Zhang, Jing; Sun, Xiaoyu; Li, Jian; Zhang, Xi; Liu, Tao
2011-09-01
Extremely large accumulation of green algae Enteromorpha prolifera floated along China' coastal region of the Yellow Sea ever since the summer of 2008. Amplified Fragment Length Polymorphism (AFLP) analysis was applied to assess the genetic diversity and relationships among E. prolifera samples collected from 9 affected areas of the Yellow Sea. Two hundred reproducible fragments were generated with 8 AFLP primer combinations, of which 194 (97%) were polymorphic. The average Nei's genetic diversity, the coefficiency of genetic differentiation (Gst), and the average gene flow estimated from Gst in the 9 populations were 0.4018, 0.6404 and 0.2807 respectively. Cluster analysis based on the unweighed pair group method with arithmetic averages (UPGMA) showed that the genetic relationships within one population or among different populations were all related to their collecting locations and sampling time. Large genetic differentiation was detected among the populations. The E. prolifera originated from different areas and were undergoing a course of mixing.
Transport Coefficients from Large Deviation Functions
NASA Astrophysics Data System (ADS)
Gao, Chloe; Limmer, David
2017-10-01
We describe a method for computing transport coefficients from the direct evaluation of large deviation function. This method is general, relying on only equilibrium fluctuations, and is statistically efficient, employing trajectory based importance sampling. Equilibrium fluctuations of molecular currents are characterized by their large deviation functions, which is a scaled cumulant generating function analogous to the free energy. A diffusion Monte Carlo algorithm is used to evaluate the large deviation functions, from which arbitrary transport coefficients are derivable. We find significant statistical improvement over traditional Green-Kubo based calculations. The systematic and statistical errors of this method are analyzed in the context of specific transport coefficient calculations, including the shear viscosity, interfacial friction coefficient, and thermal conductivity.
NASA Astrophysics Data System (ADS)
Ostroushko, A. A.; Gagarin, I. D.; Tonkushina, M. O.; Grzhegorzhevskii, K. V.; Danilova, I. G.; Gette, I. F.; Kim, G. A.
2017-09-01
The possibility of iontophoretic transport through the native membranes of biologically active substances (vitamin B1 and insulin) associated with porous clusters Mo72Fe30 polyoxometalate of the Keplerate type is demonstrated for the first time in an experimental setup. The diffusion coefficient is estimated. The possibility of transferring Keplerate ions with a protective coating of biocompatible polymer polyvinylpyrrolidone is also shown.
Folksonomies and clustering in the collaborative system CiteULike
NASA Astrophysics Data System (ADS)
Capocci, Andrea; Caldarelli, Guido
2008-06-01
We analyze CiteULike, an online collaborative tagging system where users bookmark and annotate scientific papers. Such a system can be naturally represented as a tri-partite graph whose nodes represent papers, users and tags connected by individual tag assignments. The semantics of tags is studied here, in order to uncover the hidden relationships between tags. We find that the clustering coefficient can be used to analyze the semantical patterns among tags.
Properties of a new small-world network with spatially biased random shortcuts
NASA Astrophysics Data System (ADS)
Matsuzawa, Ryo; Tanimoto, Jun; Fukuda, Eriko
2017-11-01
This paper introduces a small-world (SW) network with a power-law distance distribution that differs from conventional models in that it uses completely random shortcuts. By incorporating spatial constraints, we analyze the divergence of the proposed model from conventional models in terms of fundamental network properties such as clustering coefficient, average path length, and degree distribution. We find that when the spatial constraint more strongly prohibits a long shortcut, the clustering coefficient is improved and the average path length increases. We also analyze the spatial prisoner's dilemma (SPD) games played on our new SW network in order to understand its dynamical characteristics. Depending on the basis graph, i.e., whether it is a one-dimensional ring or a two-dimensional lattice, and the parameter controlling the prohibition of long-distance shortcuts, the emergent results can vastly differ.
Siudem, Grzegorz; Fronczak, Agata; Fronczak, Piotr
2016-10-10
In this paper, we provide the exact expression for the coefficients in the low-temperature series expansion of the partition function of the two-dimensional Ising model on the infinite square lattice. This is equivalent to exact determination of the number of spin configurations at a given energy. With these coefficients, we show that the ferromagnetic-to-paramagnetic phase transition in the square lattice Ising model can be explained through equivalence between the model and the perfect gas of energy clusters model, in which the passage through the critical point is related to the complete change in the thermodynamic preferences on the size of clusters. The combinatorial approach reported in this article is very general and can be easily applied to other lattice models.
Siudem, Grzegorz; Fronczak, Agata; Fronczak, Piotr
2016-01-01
In this paper, we provide the exact expression for the coefficients in the low-temperature series expansion of the partition function of the two-dimensional Ising model on the infinite square lattice. This is equivalent to exact determination of the number of spin configurations at a given energy. With these coefficients, we show that the ferromagnetic–to–paramagnetic phase transition in the square lattice Ising model can be explained through equivalence between the model and the perfect gas of energy clusters model, in which the passage through the critical point is related to the complete change in the thermodynamic preferences on the size of clusters. The combinatorial approach reported in this article is very general and can be easily applied to other lattice models. PMID:27721435
Large-scale motions in the universe: Using clusters of galaxies as tracers
NASA Technical Reports Server (NTRS)
Gramann, Mirt; Bahcall, Neta A.; Cen, Renyue; Gott, J. Richard
1995-01-01
Can clusters of galaxies be used to trace the large-scale peculiar velocity field of the universe? We answer this question by using large-scale cosmological simulations to compare the motions of rich clusters of galaxies with the motion of the underlying matter distribution. Three models are investigated: Omega = 1 and Omega = 0.3 cold dark matter (CDM), and Omega = 0.3 primeval baryonic isocurvature (PBI) models, all normalized to the Cosmic Background Explorer (COBE) background fluctuations. We compare the cluster and mass distribution of peculiar velocities, bulk motions, velocity dispersions, and Mach numbers as a function of scale for R greater than or = 50/h Mpc. We also present the large-scale velocity and potential maps of clusters and of the matter. We find that clusters of galaxies trace well the large-scale velocity field and can serve as an efficient tool to constrain cosmological models. The recently reported bulk motion of clusters 689 +/- 178 km/s on approximately 150/h Mpc scale (Lauer & Postman 1994) is larger than expected in any of the models studied (less than or = 190 +/- 78 km/s).
Ages of intermediate-age Magellanic Cloud star clusters
NASA Technical Reports Server (NTRS)
Flower, P. J.
1984-01-01
Ages of intermediate-age Large Magellanic Cloud star clusters have been estimated without locating the faint, unevolved portion of cluster main sequences. Six clusters with established color-magnitude diagrams were selected for study: SL 868, NGC 1783, NGC 1868, NGC 2121, NGC 2209, and NGC 2231. Since red giant photometry is more accurate than the necessarily fainter main-sequence photometry, the distributions of red giants on the cluster color-magnitude diagrams were compared to a grid of 33 stellar evolutionary tracks, evolved from the main sequence through core-helium exhaustion, spanning the expected mass and metallicity range for Magellanic Cloud cluster red giants. The time-dependent behavior of the luminosity of the model red giants was used to estimate cluster ages from the observed cluster red giant luminosities. Except for the possibility of SL 868 being an old globular cluster, all clusters studied were found to have ages less than 10 to the 9th yr. It is concluded that there is currently no substantial evidence for a major cluster population of large, populous clusters greater than 10 to the 9th yr old in the Large Magellanic Cloud.
Hyperfine excitation of C2H in collisions with ortho- and para-H2
NASA Astrophysics Data System (ADS)
Dagdigian, Paul J.
2018-06-01
Accurate estimation of the abundance of the ethynyl (C2H) radical requires accurate radiative and collisional rate coefficients. Hyperfine-resolved rate coefficients for (de-)excitation of C2H in collisions with ortho- and para-H2 are presented in this work. These rate coefficients were computed in time-independent close-coupling quantum scattering calculations that employed a potential energy surface recently computed at the coupled-clusters level of theory that describes the interaction of C2H with H2. Rate coefficients for temperatures from 10 to 300 K were computed for all transitions among the first 40 hyperfine energy levels of C2H in collisions with ortho- and para-H2. These rate coefficients were employed in simple radiative transfer calculations to simulate the excitation of C2H in typical molecular clouds.
Mediator and RNA polymerase II clusters associate in transcription-dependent condensates.
Cho, Won-Ki; Spille, Jan-Hendrik; Hecht, Micca; Lee, Choongman; Li, Charles; Grube, Valentin; Cisse, Ibrahim I
2018-06-21
Models of gene control have emerged from genetic and biochemical studies, with limited consideration of the spatial organization and dynamics of key components in living cells. Here we used live cell super-resolution and light sheet imaging to study the organization and dynamics of the Mediator coactivator and RNA polymerase II (Pol II) directly. Mediator and Pol II each form small transient and large stable clusters in living embryonic stem cells. Mediator and Pol II are colocalized in the stable clusters, which associate with chromatin, have properties of phase-separated condensates, and are sensitive to transcriptional inhibitors. We suggest that large clusters of Mediator, recruited by transcription factors at large or clustered enhancer elements, interact with large Pol II clusters in transcriptional condensates in vivo. Copyright © 2018, American Association for the Advancement of Science.
R package to estimate intracluster correlation coefficient with confidence interval for binary data.
Chakraborty, Hrishikesh; Hossain, Akhtar
2018-03-01
The Intracluster Correlation Coefficient (ICC) is a major parameter of interest in cluster randomized trials that measures the degree to which responses within the same cluster are correlated. There are several types of ICC estimators and its confidence intervals (CI) suggested in the literature for binary data. Studies have compared relative weaknesses and advantages of ICC estimators as well as its CI for binary data and suggested situations where one is advantageous in practical research. The commonly used statistical computing systems currently facilitate estimation of only a very few variants of ICC and its CI. To address the limitations of current statistical packages, we developed an R package, ICCbin, to facilitate estimating ICC and its CI for binary responses using different methods. The ICCbin package is designed to provide estimates of ICC in 16 different ways including analysis of variance methods, moments based estimation, direct probabilistic methods, correlation based estimation, and resampling method. CI of ICC is estimated using 5 different methods. It also generates cluster binary data using exchangeable correlation structure. ICCbin package provides two functions for users. The function rcbin() generates cluster binary data and the function iccbin() estimates ICC and it's CI. The users can choose appropriate ICC and its CI estimate from the wide selection of estimates from the outputs. The R package ICCbin presents very flexible and easy to use ways to generate cluster binary data and to estimate ICC and it's CI for binary response using different methods. The package ICCbin is freely available for use with R from the CRAN repository (https://cran.r-project.org/package=ICCbin). We believe that this package can be a very useful tool for researchers to design cluster randomized trials with binary outcome. Copyright © 2017 Elsevier B.V. All rights reserved.
Enhanced diffusion on oscillating surfaces through synchronization
NASA Astrophysics Data System (ADS)
Wang, Jin; Cao, Wei; Ma, Ming; Zheng, Quanshui
2018-02-01
The diffusion of molecules and clusters under nanoscale confinement or absorbed on surfaces is the key controlling factor in dynamical processes such as transport, chemical reaction, or filtration. Enhancing diffusion could benefit these processes by increasing their transport efficiency. Using a nonlinear Langevin equation with an extensive number of simulations, we find a large enhancement in diffusion through surface oscillation. For helium confined in a narrow carbon nanotube, the diffusion enhancement is estimated to be over three orders of magnitude. A synchronization mechanism between the kinetics of the particles and the oscillating surface is revealed. Interestingly, a highly nonlinear negative correlation between diffusion coefficient and temperature is predicted based on this mechanism, and further validated by simulations. Our results provide a general and efficient method for enhancing diffusion, especially at low temperatures.
A Bimodal Hybrid Model for Time-Dependent Probabilistic Seismic Hazard Analysis
NASA Astrophysics Data System (ADS)
Yaghmaei-Sabegh, Saman; Shoaeifar, Nasser; Shoaeifar, Parva
2018-03-01
The evaluation of evidence provided by geological studies and historical catalogs indicates that in some seismic regions and faults, multiple large earthquakes occur in cluster. Then, the occurrences of large earthquakes confront with quiescence and only the small-to-moderate earthquakes take place. Clustering of large earthquakes is the most distinguishable departure from the assumption of constant hazard of random occurrence of earthquakes in conventional seismic hazard analysis. In the present study, a time-dependent recurrence model is proposed to consider a series of large earthquakes that occurs in clusters. The model is flexible enough to better reflect the quasi-periodic behavior of large earthquakes with long-term clustering, which can be used in time-dependent probabilistic seismic hazard analysis with engineering purposes. In this model, the time-dependent hazard results are estimated by a hazard function which comprises three parts. A decreasing hazard of last large earthquake cluster and an increasing hazard of the next large earthquake cluster, along with a constant hazard of random occurrence of small-to-moderate earthquakes. In the final part of the paper, the time-dependent seismic hazard of the New Madrid Seismic Zone at different time intervals has been calculated for illustrative purpose.
Optimizing Cluster Heads for Energy Efficiency in Large-Scale Heterogeneous Wireless Sensor Networks
Gu, Yi; Wu, Qishi; Rao, Nageswara S. V.
2010-01-01
Many complex sensor network applications require deploying a large number of inexpensive and small sensors in a vast geographical region to achieve quality through quantity. Hierarchical clustering is generally considered as an efficient and scalable way to facilitate the management and operation of such large-scale networks and minimize the total energy consumption for prolonged lifetime. Judicious selection of cluster heads for data integration and communication is critical to the success of applications based on hierarchical sensor networks organized as layered clusters. We investigate the problem of selecting sensor nodes in a predeployed sensor network to be the cluster heads tomore » minimize the total energy needed for data gathering. We rigorously derive an analytical formula to optimize the number of cluster heads in sensor networks under uniform node distribution, and propose a Distance-based Crowdedness Clustering algorithm to determine the cluster heads in sensor networks under general node distribution. The results from an extensive set of experiments on a large number of simulated sensor networks illustrate the performance superiority of the proposed solution over the clustering schemes based on k -means algorithm.« less
NASA Astrophysics Data System (ADS)
Budak, S.; Heidary, K.; Johnson, R. B.; Colon, T.; Muntele, C.; Ila, D.
2014-08-01
The performance of thermoelectric materials and devices is characterized by a dimensionless figure of merit, ZT = S2σT/K, where, S and σ denote, respectively, the Seebeck coefficient and electrical conductivity, T is the absolute temperature in Kelvin and K represents the thermal conductivity. The figure of merit may be improved by means of raising either S or σ or by lowering K. In our laboratory, we have fabricated and characterized the performance of a large variety of thermoelectric generators (TEG). Two TEG groups comprised of 50 and 100 alternating layers of Si/Si + Ge multi-nanolayered superlattice films have been fabricated and thoroughly characterized. Ion beam assisted deposition (IBAD) was utilized to assemble the alternating sandwiched layers, resulting in total thickness of 300 nm and 317 nm for 50 and 100 layer devices, respectively. Rutherford Backscattering Spectroscopy (RBS) was employed in order to monitor the precise quantity of Si and Ge utilized in the construction of specific multilayer thin films. The material layers were subsequently impregnated with quantum dots and/or quantum clusters, in order to concurrently reduce the cross plane thermal conductivity, increase the cross plane Seebeck coefficient and raise the cross plane electrical conductivity. The quantum dots/clusters were implanted via the 5 MeV Si ion bombardment which was performed using a Pelletron high energy ion beam accelerator. We have achieved remarkable results for the thermoelectric and optical properties of the Si/Si + Ge multilayer thin film TEG systems. We have demonstrated that with optimal setting of the 5 MeV Si ion beam bombardment fluences, one can fabricate TEG systems with figures of merits substantially higher than the values previously reported.
Brain Network Architecture and Global Intelligence in Children with Focal Epilepsy.
Paldino, M J; Golriz, F; Chapieski, M L; Zhang, W; Chu, Z D
2017-02-01
The biologic basis for intelligence rests to a large degree on the capacity for efficient integration of information across the cerebral network. We aimed to measure the relationship between network architecture and intelligence in the pediatric, epileptic brain. Patients were retrospectively identified with the following: 1) focal epilepsy; 2) brain MR imaging at 3T, including resting-state functional MR imaging; and 3) full-scale intelligence quotient measured by a pediatric neuropsychologist. The cerebral cortex was parcellated into approximately 700 gray matter network "nodes." The strength of a connection between 2 nodes was defined by the correlation between their blood oxygen level-dependent time-series. We calculated the following topologic properties: clustering coefficient, transitivity, modularity, path length, and global efficiency. A machine learning algorithm was used to measure the independent contribution of each metric to the intelligence quotient after adjusting for all other metrics. Thirty patients met the criteria (4-18 years of age); 20 patients required anesthesia during MR imaging. After we accounted for age and sex, clustering coefficient and path length were independently associated with full-scale intelligence quotient. Neither motion parameters nor general anesthesia was an important variable with regard to accurate intelligence quotient prediction by the machine learning algorithm. A longer history of epilepsy was associated with shorter path lengths ( P = .008), consistent with reorganization of the network on the basis of seizures. Considering only patients receiving anesthesia during machine learning did not alter the patterns of network architecture contributing to global intelligence. These findings support the physiologic relevance of imaging-based metrics of network architecture in the pathologic, developing brain. © 2017 by American Journal of Neuroradiology.
OGLE Collection of Star Clusters. New Objects in the Outskirts of the Large Magellanic Cloud
NASA Astrophysics Data System (ADS)
Sitek, M.; Szymański, M. K.; Skowron, D. M.; Udalski, A.; Kostrzewa-Rutkowska, Z.; Skowron, J.; Karczmarek, P.; Cieślar, M.; Wyrzykowski, Ł.; Kozłowski, S.; Pietrukowicz, P.; Soszyński, I.; Mróz, P.; Pawlak, M.; Poleski, R.; Ulaczyk, K.
2016-09-01
The Magellanic System (MS), consisting of the Large Magellanic Cloud (LMC), the Small Magellanic Cloud (SMC) and the Magellanic Bridge (MBR), contains diverse sample of star clusters. Their spatial distribution, ages and chemical abundances may provide important information about the history of formation of the whole System. We use deep photometric maps derived from the images collected during the fourth phase of the Optical Gravitational Lensing Experiment (OGLE-IV) to construct the most complete catalog of star clusters in the Large Magellanic Cloud using the homogeneous photometric data. In this paper we present the collection of star clusters found in the area of about 225 square degrees in the outer regions of the LMC. Our sample contains 679 visually identified star cluster candidates, 226 of which were not listed in any of the previously published catalogs. The new clusters are mainly young small open clusters or clusters similar to associations.
A phase cell cluster expansion for Euclidean field theories
NASA Astrophysics Data System (ADS)
Battle, Guy A., III; Federbush, Paul
1982-08-01
We adapt the cluster expansion first used to treat infrared problems for lattice models (a mass zero cluster expansion) to the usual field theory situation. The field is expanded in terms of special block spin functions and the cluster expansion given in terms of the expansion coefficients (phase cell variables); the cluster expansion expresses correlation functions in terms of contributions from finite coupled subsets of these variables. Most of the present work is carried through in d space time dimensions (for φ24 the details of the cluster expansion are pursued and convergence is proven). Thus most of the results in the present work will apply to a treatment of φ34 to which we hope to return in a succeeding paper. Of particular interest in this paper is a substitute for the stability of the vacuum bound appropriate to this cluster expansion (for d = 2 and d = 3), and a new method for performing estimates with tree graphs. The phase cell cluster expansions have the renormalization group incorporated intimately into their structure. We hope they will be useful ultimately in treating four dimensional field theories.
NASA Astrophysics Data System (ADS)
Shan, Jiajia; Wang, Xue; Zhou, Hao; Han, Shuqing; Riza, Dimas Firmanda Al; Kondo, Naoshi
2018-04-01
Synchronous fluorescence spectra, combined with multivariate analysis were used to predict flavonoids content in green tea rapidly and nondestructively. This paper presented a new and efficient spectral intervals selection method called clustering based partial least square (CL-PLS), which selected informative wavelengths by combining clustering concept and partial least square (PLS) methods to improve models’ performance by synchronous fluorescence spectra. The fluorescence spectra of tea samples were obtained and k-means and kohonen-self organizing map clustering algorithms were carried out to cluster full spectra into several clusters, and sub-PLS regression model was developed on each cluster. Finally, CL-PLS models consisting of gradually selected clusters were built. Correlation coefficient (R) was used to evaluate the effect on prediction performance of PLS models. In addition, variable influence on projection partial least square (VIP-PLS), selectivity ratio partial least square (SR-PLS), interval partial least square (iPLS) models and full spectra PLS model were investigated and the results were compared. The results showed that CL-PLS presented the best result for flavonoids prediction using synchronous fluorescence spectra.
Shan, Jiajia; Wang, Xue; Zhou, Hao; Han, Shuqing; Riza, Dimas Firmanda Al; Kondo, Naoshi
2018-03-13
Synchronous fluorescence spectra, combined with multivariate analysis were used to predict flavonoids content in green tea rapidly and nondestructively. This paper presented a new and efficient spectral intervals selection method called clustering based partial least square (CL-PLS), which selected informative wavelengths by combining clustering concept and partial least square (PLS) methods to improve models' performance by synchronous fluorescence spectra. The fluorescence spectra of tea samples were obtained and k-means and kohonen-self organizing map clustering algorithms were carried out to cluster full spectra into several clusters, and sub-PLS regression model was developed on each cluster. Finally, CL-PLS models consisting of gradually selected clusters were built. Correlation coefficient (R) was used to evaluate the effect on prediction performance of PLS models. In addition, variable influence on projection partial least square (VIP-PLS), selectivity ratio partial least square (SR-PLS), interval partial least square (iPLS) models and full spectra PLS model were investigated and the results were compared. The results showed that CL-PLS presented the best result for flavonoids prediction using synchronous fluorescence spectra.
Robustness and structure of complex networks
NASA Astrophysics Data System (ADS)
Shao, Shuai
This dissertation covers the two major parts of my PhD research on statistical physics and complex networks: i) modeling a new type of attack -- localized attack, and investigating robustness of complex networks under this type of attack; ii) discovering the clustering structure in complex networks and its influence on the robustness of coupled networks. Complex networks appear in every aspect of our daily life and are widely studied in Physics, Mathematics, Biology, and Computer Science. One important property of complex networks is their robustness under attacks, which depends crucially on the nature of attacks and the structure of the networks themselves. Previous studies have focused on two types of attack: random attack and targeted attack, which, however, are insufficient to describe many real-world damages. Here we propose a new type of attack -- localized attack, and study the robustness of complex networks under this type of attack, both analytically and via simulation. On the other hand, we also study the clustering structure in the network, and its influence on the robustness of a complex network system. In the first part, we propose a theoretical framework to study the robustness of complex networks under localized attack based on percolation theory and generating function method. We investigate the percolation properties, including the critical threshold of the phase transition pc and the size of the giant component Pinfinity. We compare localized attack with random attack and find that while random regular (RR) networks are more robust against localized attack, Erdoḧs-Renyi (ER) networks are equally robust under both types of attacks. As for scale-free (SF) networks, their robustness depends crucially on the degree exponent lambda. The simulation results show perfect agreement with theoretical predictions. We also test our model on two real-world networks: a peer-to-peer computer network and an airline network, and find that the real-world networks are much more vulnerable to localized attack compared with random attack. In the second part, we extend the tree-like generating function method to incorporating clustering structure in complex networks. We study the robustness of a complex network system, especially a network of networks (NON) with clustering structure in each network. We find that the system becomes less robust as we increase the clustering coefficient of each network. For a partially dependent network system, we also find that the influence of the clustering coefficient on network robustness decreases as we decrease the coupling strength, and the critical coupling strength qc, at which the first-order phase transition changes to second-order, increases as we increase the clustering coefficient.
Masood, Mohd; Reidpath, Daniel D
2016-01-07
Measuring the intraclass correlation coefficient (ICC) and design effect (DE) may help to modify the public health interventions for body mass index (BMI), physical activity and diet according to geographic targeting of interventions in different countries. The purpose of this study was to quantify the level of clustering and DE in BMI, physical activity and diet in 56 low-income, middle-income and high-income countries. Cross-sectional study design. Multicountry national survey data. The World Health Survey (WHS), 2003, data were used to examine clustering in BMI, physical activity in metabolic equivalent of task (MET) and diet in fruits and vegetables intake (FVI) from low-income, middle-income and high-income countries. Multistage sampling in the WHS used geographical clusters as primary sampling units (PSU). These PSUs were used as a clustering or grouping variable in this analysis. Multilevel intercept only regression models were used to calculate the ICC and DE for each country. The median ICC (0.039) and median DE (1.82) for BMI were low; however, FVI had a higher median ICC (0.189) and median DE (4.16). For MET, the median ICC was 0.141 and median DE was 4.59. In some countries, however, the ICC and DE for BMI were large. For instance, South Africa had the highest ICC (0.39) and DE (11.9) for BMI, whereas Uruguay had the highest ICC (0.434) for MET and Ethiopia had the highest ICC (0.471) for FVI. This study shows that across a wide range of countries, there was low area level clustering for BMI, whereas MET and FVI showed high area level clustering. These results suggested that the country level clustering effect should be considered in developing preventive approaches for BMI, as well as improving physical activity and healthy diets for each country. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
NASA Astrophysics Data System (ADS)
Kong, Xiangzhen; He, Wei; Qin, Ning; He, Qishuang; Yang, Bin; Ouyang, Huiling; Wang, Qingmei; Xu, Fuliu
2013-03-01
Trajectory cluster analysis, including the two-stage cluster method based on Euclidean metrics and the one-stage clustering method based on Mahalanobis metrics and self-organizing maps (SOM), was applied and compared to identify the transport pathways of PM10 for the cities of Chaohu and Hefei, both located near Lake Chaohu in China. The two-stage cluster method was modified to further investigate the long trajectories in the second stage in order to eliminate the observed disaggregation among them. Twelve trajectory clusters were identified for both cities. The one-stage clustering method based on Mahalanobis metrics gives the best performance regarding the variances within clusters. The results showed that local PM10 emission was one of the most important sources in both cities and that the local emission in Hefei was higher than in Chaohu. In addition, Chaohu suffered greater effects from the eastern region (Yangtze River Delta, YRD) than Hefei. On the other hand, the long-range transportation from the northwestern pathway had a higher influence on the PM10 level in Hefei. Receptor models, including potential source contribution function (PSCF) and residence time weighted concentrations (RTWC), were utilized to identify the potential source locations of PM10 for both cities. However, the combined PSCF and RTWC results for the two cities provided PM10 source locations that were more consistent with the results of transport pathways and the total anthropogenic PM10 emission inventory. This indicates that the combined method's ability to identify the source regions is superior to that of the individual PSCF or RTWC methods. Henan and Shanxi Provinces and the YRD were important PM10 source regions for the two cities, but the Henan and Shanxi area was more important for Hefei than for Chaohu, while the YRD region was less important. In addition, the PSCF, RTWC and the combined results all had higher correlation coefficients with PM10 emission from traffic than from industry, electricity generation or residential sources, suggesting the relatively higher contribution of traffic emissions to the PM10 pollution in Lake Chaohu.
NASA Astrophysics Data System (ADS)
Matsumoto, Yoshiteru; Yoshiura, Ryuto; Honma, Kenji
2017-07-01
We investigated the crystalline structures of jet-cooled acetylene (C2H2) large clusters by laser spectroscopy and chemometrics. The CH stretching vibrations of the C2H2 large clusters were observed by infrared (IR) cavity ringdown spectroscopy. The IR spectra of C2H2 clusters were measured under the conditions of various concentrations of C2H2/He mixture gas for supersonic jets. Upon increasing the gas concentration from 1% to 10%, we observed a rapid intensity enhancement for a band in the IR spectra. The strong dependence of the intensity on the gas concentration indicates that the band was assigned to CH stretching vibrations of the large clusters. An analysis of the IR spectra by two-dimensional correlation spectroscopy revealed that the IR absorption due to the C2H2 large cluster is decomposed into two CH stretching vibrations. The vibrational frequencies of the two bands are almost equivalent to the IR absorption of the pure- and poly-crystalline orthorhombic structures in the aerosol particles. The characteristic temperature behavior of the IR spectra implies the existence of the other large cluster, which is discussed in terms of the phase transition of a bulk crystal.
Obaid, Ramiz; Abu-Qaoud, Hassan; Arafeh, Rami
2014-09-03
Eight accessions of olive trees from three common varieties in Palestine, Nabali Baladi, Nabali Mohassan and Surri, were genetically evaluated using five simple sequence repeat (SSR) markers. A total of 17 alleles from 5 loci were observed in which 15 (88.2%) were polymorphic and 2 (11.8%) were monomorphic. An average of 3.4 alleles per locus was found ranging from 2.0 alleles with the primers GAPU-103 and DCA-9 to 5.0 alleles with U9932 and DCA-16. The smallest amplicon size observed was 50 bp with the primer DCA-16, whereas the largest one (450 bp) with the primer U9932. Cluster analysis with the unweighted pair group method with arithmetic average (UPGMA) showed three clusters: a cluster with four accessions from the 'Nabali Baladi' cultivar, another cluster with three accessions that represents the 'Nabali Mohassen' cultivar and finally the 'Surri' cultivar. The similarity coefficient for the eight olive tree samples ranged from a maximum of 100% between two accessions from Nabali Baladi and also in two other samples from Nabali Mohassan, to a minimum similarity coefficient (0.315) between the Surri and two Nabali Baladi accessions. The results in this investigation clearly highlight the genetic dissimilarity between the three main olive cultivars that have been misidentified and mixed up in the past, based on conventional morphological characters.
An improved global dynamic routing strategy for scale-free network with tunable clustering
NASA Astrophysics Data System (ADS)
Sun, Lina; Huang, Ning; Zhang, Yue; Bai, Yannan
2016-08-01
An efficient routing strategy can deliver packets quickly to improve the network capacity. Node congestion and transmission path length are inevitable real-time factors for a good routing strategy. Existing dynamic global routing strategies only consider the congestion of neighbor nodes and the shortest path, which ignores other key nodes’ congestion on the path. With the development of detection methods and techniques, global traffic information is readily available and important for the routing choice. Reasonable use of this information can effectively improve the network routing. So, an improved global dynamic routing strategy is proposed, which considers the congestion of all nodes on the shortest path and incorporates the waiting time of the most congested node into the path. We investigate the effectiveness of the proposed routing for scale-free network with different clustering coefficients. The shortest path routing strategy and the traffic awareness routing strategy only considering the waiting time of neighbor node are analyzed comparatively. Simulation results show that network capacity is greatly enhanced compared with the shortest path; congestion state increase is relatively slow compared with the traffic awareness routing strategy. Clustering coefficient increase will not only reduce the network throughput, but also result in transmission average path length increase for scale-free network with tunable clustering. The proposed routing is favorable to ease network congestion and network routing strategy design.
Clustering in complex directed networks
NASA Astrophysics Data System (ADS)
Fagiolo, Giorgio
2007-08-01
Many empirical networks display an inherent tendency to cluster, i.e., to form circles of connected nodes. This feature is typically measured by the clustering coefficient (CC). The CC, originally introduced for binary, undirected graphs, has been recently generalized to weighted, undirected networks. Here we extend the CC to the case of (binary and weighted) directed networks and we compute its expected value for random graphs. We distinguish between CCs that count all directed triangles in the graph (independently of the direction of their edges) and CCs that only consider particular types of directed triangles (e.g., cycles). The main concepts are illustrated by employing empirical data on world-trade flows.
Relaxation dynamics of maximally clustered networks
NASA Astrophysics Data System (ADS)
Klaise, Janis; Johnson, Samuel
2018-01-01
We study the relaxation dynamics of fully clustered networks (maximal number of triangles) to an unclustered state under two different edge dynamics—the double-edge swap, corresponding to degree-preserving randomization of the configuration model, and single edge replacement, corresponding to full randomization of the Erdős-Rényi random graph. We derive expressions for the time evolution of the degree distribution, edge multiplicity distribution and clustering coefficient. We show that under both dynamics networks undergo a continuous phase transition in which a giant connected component is formed. We calculate the position of the phase transition analytically using the Erdős-Rényi phenomenology.
Correlation between structural and transport properties of electron beam irradiated PrMnO3 compounds
NASA Astrophysics Data System (ADS)
Christopher, Benedict; Rao, Ashok; Nagaraja, B. S.; Shyam Prasad, K.; Okram, G. S.; Sanjeev, Ganesh; Petwal, Vikash Chandra; Verma, Vijay Pal; Dwivedi, Jishnu; Poornesh, P.
2018-02-01
The structural, electrical, magnetic, and thermal properties of electron beam (EB) irradiated PrMnO3 manganites were investigated in the present communication. X-ray diffraction data reveals that all samples are single phased with orthorhombic distorted structure (Pbnm). Furthermore, the diffracted data are analyzed in detail using Rietveld refinement technique. It is observed that the EB dosage feebly disturbs the MnO6 octahedra. The electrical resistivity of all the samples exhibits semiconducting behavior. Small polaron hopping model is conveniently employed to investigate the semiconducting nature of the pristine as well as EB irradiated samples. The Seebeck coefficient (S) of the pristine as well as the irradiated samples exhibit large positive values at lower temperatures, signifying holes as the dominant charge carriers. The analysis of Seebeck coefficient data confirms that the small polaron hopping mechanism assists the thermoelectric transport property in the high temperature region. The magnetic measurements confirm the existence of paramagnetic (PM) to ferromagnetic (FM) behavior for the pristine and irradiated samples. In the lower temperature regime, coexistence of FM clusters and AFM matrix is dominating. Thus, the complex magnetic behavior of the compound has been explained in terms of rearrangement of antiferromagnetically coupled ionic moments.
Rotationally Invariant Image Representation for Viewing Direction Classification in Cryo-EM
Zhao, Zhizhen; Singer, Amit
2014-01-01
We introduce a new rotationally invariant viewing angle classification method for identifying, among a large number of cryo-EM projection images, similar views without prior knowledge of the molecule. Our rotationally invariant features are based on the bispectrum. Each image is denoised and compressed using steerable principal component analysis (PCA) such that rotating an image is equivalent to phase shifting the expansion coefficients. Thus we are able to extend the theory of bispectrum of 1D periodic signals to 2D images. The randomized PCA algorithm is then used to efficiently reduce the dimensionality of the bispectrum coefficients, enabling fast computation of the similarity between any pair of images. The nearest neighbors provide an initial classification of similar viewing angles. In this way, rotational alignment is only performed for images with their nearest neighbors. The initial nearest neighbor classification and alignment are further improved by a new classification method called vector diffusion maps. Our pipeline for viewing angle classification and alignment is experimentally shown to be faster and more accurate than reference-free alignment with rotationally invariant K-means clustering, MSA/MRA 2D classification, and their modern approximations. PMID:24631969
Perilli, Miriam L. L.; Lima, Fernando; Rodrigues, Flávio H. G.; Cavalcanti, Sandra M. C.
2016-01-01
Large cats feeding habits have been studied through two main methods: scat analysis and the carcasses of prey killed by monitored animals. From November 2001 to April 2004, we studied jaguar predation patterns using GPS telemetry location clusters on a cattle ranch in southern Pantanal. During this period, we recorded 431 carcasses of animals preyed upon by monitored jaguars. Concurrently, we collected 125 jaguar scats opportunistically. We compared the frequencies of prey found through each method. We also compared the prey communities using Bray-Curtis similarity coefficient. These comparisons allowed us to evaluate the use of scat analysis as a means to describe jaguar feeding habits. Both approaches identified prey communities with high similarity (Bray-Curtis coefficient > 70). According to either method, jaguars consume three main prey: cattle (Bos taurus), caiman (Caiman yacare) and peccaries (Tayassu pecari and Pecari tajacu). The two methods did not differ in the frequency of the three main prey over dry and wet seasons or years sampled. Our results show that scat analysis is effective and capable of describing jaguar feeding habits. PMID:27002524
Dhawan, S S; Rai, G K; Darokar, M P; Lal, R K; Misra, H O; Khanuja, S P S
2011-09-15
Velvet bean (Mucuna pruriens) seeds contain the catecholic amino acid L-DoPA (L-3,4-dihydroxyphenylalanine), which is a neurotransmitter precursor and used for the treatment of Parkinson's disease and mental disorders. The great demand for L-DoPA is largely met by the pharmaceutical industry through extraction of the compound from wild populations of this plant; commercial exploitation of this compound is hampered because of its limited availability. The trichomes present on the pods can cause severe itching, blisters and dermatitis, discouraging cultivation. We screened genetic stocks of velvet bean for the trichome-less trait, along with high seed yield and L-DoPA content. The highest yielding trichome-less elite strain was selected and indentified on the basis of a PCR-based DNA fingerprinting method (RAPD), using deca-nucleotide primers. A genetic similarity index matrix was obtained through multivariant analysis using Nei and Li's coefficient. The similarity coefficients were used to generate a tree for cluster analysis using the UPGMA method. Analysis of amplification spectra of 408 bands obtained with 56 primers allowed us to distinguish a trichome-less elite strain of M. pruriens.
Segura-Correa, J C; Domínguez-Díaz, D; Avalos-Ramírez, R; Argaez-Sosa, J
2010-09-01
Knowledge of the intraherd correlation coefficient (ICC) and design (D) effect for infectious diseases could be of interest in sample size calculation and to provide the correct standard errors of prevalence estimates in cluster or two-stage samplings surveys. Information on 813 animals from 48 non-vaccinated cow-calf herds from North-eastern Mexico was used. The ICC for the bovine viral diarrhoea (BVD), infectious bovine rhinotracheitis (IBR), leptospirosis and neosporosis diseases were calculated using a Bayesian approach adjusting for the sensitivity and specificity of the diagnostic tests. The ICC and D values for BVD, IBR, leptospirosis and neosporosis were 0.31 and 5.91, 0.18 and 3.88, 0.22 and 4.53, and 0.11 and 2.68, respectively. The ICC and D values were different from 0 and D greater than 1, therefore large sample sizes are required to obtain the same precision in prevalence estimates than for a random simple sampling design. The report of ICC and D values is of great help in planning and designing two-stage sampling studies. 2010 Elsevier B.V. All rights reserved.
A study of the threshold method utilizing raingage data
NASA Technical Reports Server (NTRS)
Short, David A.; Wolff, David B.; Rosenfeld, Daniel; Atlas, David
1993-01-01
The threshold method for estimation of area-average rain rate relies on determination of the fractional area where rain rate exceeds a preset level of intensity. Previous studies have shown that the optimal threshold level depends on the climatological rain-rate distribution (RRD). It has also been noted, however, that the climatological RRD may be composed of an aggregate of distributions, one for each of several distinctly different synoptic conditions, each having its own optimal threshold. In this study, the impact of RRD variations on the threshold method is shown in an analysis of 1-min rainrate data from a network of tipping-bucket gauges in Darwin, Australia. Data are analyzed for two distinct regimes: the premonsoon environment, having isolated intense thunderstorms, and the active monsoon rains, having organized convective cell clusters that generate large areas of stratiform rain. It is found that a threshold of 10 mm/h results in the same threshold coefficient for both regimes, suggesting an alternative definition of optimal threshold as that which is least sensitive to distribution variations. The observed behavior of the threshold coefficient is well simulated by assumption of lognormal distributions with different scale parameters and same shape parameters.
Caeyenberghs, Karen; Taymans, Tom; Wilson, Peter H; Vanderstraeten, Guy; Hosseini, Hadi; van Waelvelde, Hilde
2016-07-01
Children with autism spectrum disorders (ASD) often exhibit motor clumsiness (Developmental Coordination Disorder, DCD), i.e. they struggle with everyday tasks that require motor coordination like dressing, self-care, and participating in sport and leisure activities. Previous studies in these neurodevelopmental disorders have demonstrated functional abnormalities and alterations of white matter microstructural integrity in specific brain regions. These findings suggest that the global organization of brain networks is affected in DCD and ASD and support the hypothesis of a 'dys-connectivity syndrome' from a network perspective. No studies have compared the structural covariance networks between ASD and DCD in order to look for the signature of DCD independent of comorbid autism. Here, we aimed to address the question of whether abnormal connectivity in DCD overlaps that seen in autism or comorbid DCD-autism. Using graph theoretical analysis, we investigated differences in global and regional topological properties of structural brain networks in 53 children: 8 ASD children with DCD (DCD+ASD), 15 ASD children without DCD (ASD), 11 with DCD only, and 19 typically developing (TD) children. We constructed separate structural correlation networks based on cortical thickness derived from Freesurfer. The children were assessed on the Movement-ABC and the Beery Test of Visual Motor Integration. Behavioral results demonstrated that the DCD group and DCD+ASD group scored on average poorer than the TD and ASD groups on various motor measures. Furthermore, although the brain networks of all groups exhibited small-world properties, the topological architecture of the networks was significantly altered in children with ASD compared with DCD and TD. ASD children showed increased normalized path length and higher values of clustering coefficient. Also, paralimbic regions exhibited nodal clustering coefficient alterations in singular disorders. These changes were disorder-specific, and included alterations in clustering coefficient in the isthmus of the right cingulate gyrus and the pars orbitalis of the right inferior frontal gyrus in ASD children, and DCD-related increases in the lateral orbitofrontal cortex. Children meeting criteria for both DCD and ASD exhibited topological changes that were more widespread from those seen in children with only DCD, i.e. children with DCD+ASD showed alterations of clustering coefficient in (para)limbic regions, primary areas, and association areas. The DCD+ASD group showed changes in clustering coefficient in the left association cortex relative to the ASD group. Finally, the DCD+ASD group shared ASD-specific abnormalities in the pars orbitalis of right inferior frontal gyrus, which was hypothesized to reflect atypical emotional-cognitive processing. Our results provide evidence that DCD and ASD are neurodevelopmental disorders with a low degree of overlap in abnormalities in connectivity. The co-occurrence of DCD+ASD was also associated with a distinct topological pattern, highlighting the unique neural signature of comorbid neurodevelopmental disorders. © 2016 John Wiley & Sons Ltd.
Mining subspace clusters from DNA microarray data using large itemset techniques.
Chang, Ye-In; Chen, Jiun-Rung; Tsai, Yueh-Chi
2009-05-01
Mining subspace clusters from the DNA microarrays could help researchers identify those genes which commonly contribute to a disease, where a subspace cluster indicates a subset of genes whose expression levels are similar under a subset of conditions. Since in a DNA microarray, the number of genes is far larger than the number of conditions, those previous proposed algorithms which compute the maximum dimension sets (MDSs) for any two genes will take a long time to mine subspace clusters. In this article, we propose the Large Itemset-Based Clustering (LISC) algorithm for mining subspace clusters. Instead of constructing MDSs for any two genes, we construct only MDSs for any two conditions. Then, we transform the task of finding the maximal possible gene sets into the problem of mining large itemsets from the condition-pair MDSs. Since we are only interested in those subspace clusters with gene sets as large as possible, it is desirable to pay attention to those gene sets which have reasonable large support values in the condition-pair MDSs. From our simulation results, we show that the proposed algorithm needs shorter processing time than those previous proposed algorithms which need to construct gene-pair MDSs.
Active Brownian agents with concentration-dependent chemotactic sensitivity.
Meyer, Marcel; Schimansky-Geier, Lutz; Romanczuk, Pawel
2014-02-01
We study a biologically motivated model of overdamped, autochemotactic Brownian agents with concentration-dependent chemotactic sensitivity. The agents in our model move stochastically and produce a chemical ligand at their current position. The ligand concentration obeys a reaction-diffusion equation and acts as a chemoattractant for the agents, which bias their motion towards higher concentrations of the dynamically altered chemical field. We explore the impact of concentration-dependent response to chemoattractant gradients on large-scale pattern formation, by deriving a coarse-grained macroscopic description of the individual-based model, and compare the conditions for emergence of inhomogeneous solutions for different variants of the chemotactic sensitivity. We focus primarily on the so-called receptor-law sensitivity, which models a nonlinear decrease of chemotactic sensitivity with increasing ligand concentration. Our results reveal qualitative differences between the receptor law, the constant chemotactic response, and the so-called log law, with respect to stability of the homogeneous solution, as well as the emergence of different patterns (labyrinthine structures, clusters, and bubbles) via spinodal decomposition or nucleation. We discuss two limiting cases, where the model can be reduced to the dynamics of single species: (I) the agent density governed by a density-dependent effective diffusion coefficient and (II) the ligand field with an effective bistable, time-dependent reaction rate. In the end, we turn to single clusters of agents, studying domain growth and determining mean characteristics of the stationary inhomogeneous state. Analytical results are confirmed and extended by large-scale GPU simulations of the individual based model.
Individual and group-level job resources and their relationships with individual work engagement.
Füllemann, Désirée; Brauchli, Rebecca; Jenny, Gregor J; Bauer, Georg F
2016-06-16
This study adds a multilevel perspective to the well-researched individual-level relationship between job resources and work engagement. In addition, we explored whether individual job resources cluster within work groups because of a shared psychosocial environment and investigated whether a resource-rich psychosocial work group environment is beneficial for employee engagement over and above the beneficial effect of individual job resources and independent of their variability within groups. Data of 1,219 employees nested in 103 work groups were obtained from a baseline employee survey of a large stress management intervention project implemented in six medium and large-sized organizations in diverse sectors. A variety of important job resources were assessed and grouped to an overall job resource factor with three subfactors (manager behavior, peer behavior, and task-related resources). Data were analyzed using multilevel random coefficient modeling. The results indicated that job resources cluster within work groups and can be aggregated to a group-level job resources construct. However, a resource-rich environment, indicated by high group-level job resources, did not additionally benefit employee work engagement but on the contrary, was negatively related to it. On the basis of this unexpected result, replication studies are encouraged and suggestions for future studies on possible underlying within-group processes are discussed. The study supports the presumed value of integrating work group as a relevant psychosocial environment into the motivational process and indicates a need to further investigate emergent processes involved in aggregation procedures across levels.
Fu, Chuanbo; Dan, Li; Chen, Youlong; Tang, Jiaxiang
2015-08-01
The long-term observational data of sunshine duration (SD) and diffuse radiation percentage (defined as diffuse solar radiation/total solar radiation, DRP) on sunny days during 1960-2005 were analyzed in 7 urban agglomerations and the whole of China. The results show that the sunny sunshine duration (SSD) has decreased significantly except at a few stations over northwestern China in the past 46 years. An obvious decrease of the SSD is found in eastern China, with the trend coefficients lower than -0.8. Accompanied by the SSD decline, the sunny diffuse radiation percentage (SDRP) in most stations shows obvious increasing trends during the 46 years. The averaged SDRP over China has increased 2.33% per decade, while the averaged SSD shows a decrease of -0.13 hr/day per decade. The correlation coefficient between SDRP and SSD is -0.88. SSD decreased over urban agglomerations (small to large city clusters) in the past 46 years, especially in large cities and medium cities, due to the strong anthropogenic activities and air pollution represented by aerosol option depth (AOD) and tropospheric column NO2 (TroNO2). On the regional scale, SSD has an opposite trend from SDRP during 1960 to 2005, and the variation trends of regional mean values of SSD and SDRP in southeastern China are more pronounced than those in northwestern China. Copyright © 2015. Published by Elsevier B.V.
SCUD: fast structure clustering of decoys using reference state to remove overall rotation.
Li, Hongzhi; Zhou, Yaoqi
2005-08-01
We developed a method for fast decoy clustering by using reference root-mean-squared distance (rRMSD) rather than commonly used pairwise RMSD (pRMSD) values. For 41 proteins with 2000 decoys each, the computing efficiency increases nine times without a significant change in the accuracy of near-native selections. Tests on additional protein decoys based on different reference conformations confirmed this result. Further analysis indicates that the pRMSD and rRMSD values are highly correlated (with an average correlation coefficient of 0.82) and the clusters obtained from pRMSD and rRMSD values are highly similar (the representative structures of the top five largest clusters from the two methods are 74% identical). SCUD (Structure ClUstering of Decoys) with an automatic cutoff value is available at http://theory.med.buffalo.edu. (c) 2005 Wiley Periodicals, Inc.
Numerical taxonomy and ecology of petroleum-degrading bacteria.
Austin, B; Calomiris, J J; Walker, J D; Colwell, R R
1977-01-01
A total of 99 strains of petroleum-degrading bacteria isolated from Chesapeake Bay water and sediment were identified by using numerical taxonomy procedures. The isolates, together with 33 reference cultures, were examined for 48 biochemical, cultural, morphological, and physiological characters. The data were analyzed by computer, using both the simple matching and the Jaccard coefficients. Clustering was achieved by the unweighted average linkage method. From the sorted similarity matrix and dendrogram, 14 phenetic groups, comprising 85 of the petroleum-degrading bacteria, were defined at the 80 to 85% similarity level. These groups were identified as actinomycetes (mycelial forms, four clusters), coryneforms, Enterobacteriaceae, Klebsiella aerogenes, Micrococcus spp. (two clusters), Nocardia species (two clusters), Pseudomonas spp. (two clusters), and Sphaerotilus natans. It is concluded that the degradation of petroleum is accomplished by a diverse range of bacterial taxa, some of which were isolated only at given sampling stations and, more specifically, from sediment collected at a given station. PMID:889329
NASA Technical Reports Server (NTRS)
Haskins, Justin B.; Bauschlicher, Charles W.; Lawson, John W.
2015-01-01
Zero-temperature density functional theory (DFT), density functional theory molecular dynamics (DFT-MD), and classical molecular dynamics using polarizable force fields (PFF-MD) are employed to evaluate the influence of Lithium ion on the structure, transport, and electrochemical stability of three potential ionic liquid electrolytes: N--methyl-N-butylpyrrolidinium bis(trifluoromethanesulfonyl)imide ([pyr14][TFSI]), N--methyl-N-propylpyrrolidinium bis(fluorosulfonyl)imide ([pyr13][FSI]), and 1-ethyl-3--methylimidazolium boron tetrafluoride ([EMIM][BF4]). We characterize the Lithium ion solvation shell through zero-temperature DFT simulations of [Li(Anion)sub n](exp n-1) -clusters, DFT-MD simulations of isolated lithium ions in small ionic liquid systems, and PFF-MD simulations with high Li-doping levels in large ionic liquid systems. At low levels of Li-salt doping, highly stable solvation shells having 2-3 anions are seen in both [pyr14][TFSI] and [pyr13][FSI], while solvation shells with 4 anions dominate in [EMIM][BF sub 4]. At higher levels of doping, we find the formation of complex Li-network structures that increase the frequency of 4 anion-coordinated solvation shells. A comparison of computational and experimental Raman spectra for a wide range of [Li(Anion) sub n](exp n -1) - clusters shows that our proposed structures are consistent with experiment. We estimate the ion diffusion coefficients and quantify both size and simulation time effects. We find estimates of lithium ion diffusion are a reasonable order of magnitude and can be corrected for simulation time effects. Simulation size, on the other hand, is also important, with diffusion coefficients from long PFF-MD simulations of small cells having 20-40% error compared to large-cell values. Finally, we compute the electrochemical window using differences in electronic energy levels of both isolated cation/anion pairs and small ionic liquid systems with Li-salt doping. The single pair and liquid-phase systems provide similar estimates of electrochemical window, while Li-doping in the liquid-phase systems results in electrochemical windows little changed from the neat systems. Pure and hybrid functionals systematically provide an upper and lower bound, respectively, to the experimental electrochemical window for the systems studied here.
Bansal, Ravi; Peterson, Bradley S
2018-06-01
Identifying regional effects of interest in MRI datasets usually entails testing a priori hypotheses across many thousands of brain voxels, requiring control for false positive findings in these multiple hypotheses testing. Recent studies have suggested that parametric statistical methods may have incorrectly modeled functional MRI data, thereby leading to higher false positive rates than their nominal rates. Nonparametric methods for statistical inference when conducting multiple statistical tests, in contrast, are thought to produce false positives at the nominal rate, which has thus led to the suggestion that previously reported studies should reanalyze their fMRI data using nonparametric tools. To understand better why parametric methods may yield excessive false positives, we assessed their performance when applied both to simulated datasets of 1D, 2D, and 3D Gaussian Random Fields (GRFs) and to 710 real-world, resting-state fMRI datasets. We showed that both the simulated 2D and 3D GRFs and the real-world data contain a small percentage (<6%) of very large clusters (on average 60 times larger than the average cluster size), which were not present in 1D GRFs. These unexpectedly large clusters were deemed statistically significant using parametric methods, leading to empirical familywise error rates (FWERs) as high as 65%: the high empirical FWERs were not a consequence of parametric methods failing to model spatial smoothness accurately, but rather of these very large clusters that are inherently present in smooth, high-dimensional random fields. In fact, when discounting these very large clusters, the empirical FWER for parametric methods was 3.24%. Furthermore, even an empirical FWER of 65% would yield on average less than one of those very large clusters in each brain-wide analysis. Nonparametric methods, in contrast, estimated distributions from those large clusters, and therefore, by construct rejected the large clusters as false positives at the nominal FWERs. Those rejected clusters were outlying values in the distribution of cluster size but cannot be distinguished from true positive findings without further analyses, including assessing whether fMRI signal in those regions correlates with other clinical, behavioral, or cognitive measures. Rejecting the large clusters, however, significantly reduced the statistical power of nonparametric methods in detecting true findings compared with parametric methods, which would have detected most true findings that are essential for making valid biological inferences in MRI data. Parametric analyses, in contrast, detected most true findings while generating relatively few false positives: on average, less than one of those very large clusters would be deemed a true finding in each brain-wide analysis. We therefore recommend the continued use of parametric methods that model nonstationary smoothness for cluster-level, familywise control of false positives, particularly when using a Cluster Defining Threshold of 2.5 or higher, and subsequently assessing rigorously the biological plausibility of the findings, even for large clusters. Finally, because nonparametric methods yielded a large reduction in statistical power to detect true positive findings, we conclude that the modest reduction in false positive findings that nonparametric analyses afford does not warrant a re-analysis of previously published fMRI studies using nonparametric techniques. Copyright © 2018 Elsevier Inc. All rights reserved.
Kozyra, Paweł; Góra-Marek, Kinga; Datka, Jerzy
2015-02-05
The values of extinction coefficients of CC and CC IR bands of ethyne and ethene interacting with Cu+ and Ag+ in zeolites were determined in quantitative IR experiments and also by quantumchemical DFT calculations with QM/MM method. Both experimental and calculated values were in very good agreement validating the reliability of calculations. The values of extinction coefficients of ethyne and ethene interacting with bare cations and cations embedded in zeolite-like clusters were calculated. The interaction of organic molecules with Cu+ and Ag+ in zeolites ZSM-5 and especially charge transfers between molecule, cation and zeolite framework was also discussed in relation to the values of extinction coefficients. Copyright © 2014 Elsevier B.V. All rights reserved.
A Stationary Wavelet Entropy-Based Clustering Approach Accurately Predicts Gene Expression
Nguyen, Nha; Vo, An; Choi, Inchan
2015-01-01
Abstract Studying epigenetic landscapes is important to understand the condition for gene regulation. Clustering is a useful approach to study epigenetic landscapes by grouping genes based on their epigenetic conditions. However, classical clustering approaches that often use a representative value of the signals in a fixed-sized window do not fully use the information written in the epigenetic landscapes. Clustering approaches to maximize the information of the epigenetic signals are necessary for better understanding gene regulatory environments. For effective clustering of multidimensional epigenetic signals, we developed a method called Dewer, which uses the entropy of stationary wavelet of epigenetic signals inside enriched regions for gene clustering. Interestingly, the gene expression levels were highly correlated with the entropy levels of epigenetic signals. Dewer separates genes better than a window-based approach in the assessment using gene expression and achieved a correlation coefficient above 0.9 without using any training procedure. Our results show that the changes of the epigenetic signals are useful to study gene regulation. PMID:25383910
NASA Astrophysics Data System (ADS)
Zhou, Xiaomei; Zheng, Yun; Zhang, Tingting; Zhang, Xiaoqian; Ma, Mengli; Meng, Hengling; Wang, Tiantao; Lu, Bingyue
2018-06-01
In order to provide useful information for protection and utilization of red-grained rice landraces from Hani's terraced fields, the phenotypic diversity of 61 red-grained rice landraces were assessed based 20 quantitative traits. The results indicated that the phenotypic diversity was abundant in red-grained rice landraces. Coefficients of variation (CV) ranged from 4.878% to 72.878%, and the largest of CV was the panicle neck length, while grain width was smallest. Shannon-Weaver diversity index (H') of 20 traits ranged from 1.464 to 2.165, the largest and the smallest H' values were observed in filled grain number and chalkiness, respectively. Cluster analysis based on unweighted pair group method showed 61 red-grain rice landraces grouped into eight clusters at a cut-off value of 6.2631. The first cluster included 11 landraces, the main cluster II involved 42 landraces, and the cluster IV included 3 landraces. Laopinzhonghongmi, Chena2, Laojingnuo, Bianhao6 and Baimi were separated from the main clusters.
NASA Astrophysics Data System (ADS)
Russell, Brock Richard
X-ray astrophysics provides a great many opportunities to study astronomical structures with large energies or high temperatures. This dissertation will describe two such applications: the use of Swift X-ray Telescope (XRT) data to analyze the interaction between a supernova shock and the circumstellar medium, and the use of a straightforward computer simulation to model the dynamics of intracluster gas in clusters of galaxies and constrain the thermal conduction coefficient. Stars emit stellar wind at varying rates throughout their lifetimes. This wind populates the circumstellar medium (CSM) with gas. When the supernova explodes, the shock wave propogates outward through this CSM and heats it to X-ray emitting temperatures. By analyzing X-ray observations of the immediate post-supernova environment, we are able to determine whether any significant CSM is present. By stacking a large number of Swift observations of SNe Ia, we increase the sensitivity. We find no X-rays, with an upper limit of 1.7 x 1038 erg s-1 and a 3 sigma upper limit on the mass loss rate of progenitor systems 1.1 x 10-6 solar masses per year x (vw)/(10 km s -1). This low upper limit precludes a massive progenitor as the binary companion in the supernova progenitor system, unless that star is in Roche lobe overflow. The hot Intracluster Medium (ICM) is composed of tenuous gas which is gravitationally-bound to the cluster of galaxies. This gas is not initially of uniform temperature, and experiences thermal conduction while maintaining hydrostatic equilibrium. However, magnetic field lines present in the ionized gas inhibit the full thermal conduction. In this dissertation, we present the results of a new one-dimensional simulation that models this conduction (and includes cooling while maintaining hydrostatic equilibrium). By comparing the results of this model with the observed gas temperature profiles and recent accurate constraints on the scatter of the gas fraction, we are able to constrain the thermal conductivity. Our results suggest that conduction factors are not higher than 10% of full Spitzer conduction for hot, relaxed clusters.
Gaillard, F O; Boudin, C; Chau, N P; Robert, V; Pichon, G
2003-11-01
Previous experimental gametocyte infections of Anopheles arabiensis on 3 volunteers naturally infected with Plasmodium falciparum were conducted in Senegal. They showed that gametocyte counts in the mosquitoes are, like macroparasite intakes, heterogeneous (overdispersed). They followed a negative binomial distribution, the overdispersion coefficient seeming constant (k = 3.1). To try to explain this heterogeneity, we used an individual-based model (IBM), simulating the behaviour of gametocytes in the human blood circulation and their ingestion by mosquitoes. The hypothesis was that there exists a clustering of the gametocytes in the capillaries. From a series of simulations, in the case of clustering the following results were obtained: (i) the distribution of the gametocytes ingested by the mosquitoes followed a negative binomial, (ii) the k coefficient significantly increased with the density of circulating gametocytes. To validate this model result, 2 more experiments were conducted in Cameroon. Pooled experiments showed a distinct density dependency of the k-values. The simulation results and the experimental results were thus in agreement and suggested that an aggregation process at the microscopic level might produce the density-dependent overdispersion at the macroscopic level. Simulations also suggested that the clustering of gametocytes might facilitate fertilization of gametes.
A social network's changing statistical properties and the quality of human innovation
NASA Astrophysics Data System (ADS)
Uzzi, Brian
2008-06-01
We examined the entire network of creative artists that made Broadway musicals, in the post-War period, a collaboration network of international acclaim and influence, with an eye to investigating how the network's structural features condition the relationship between individual artistic talent and the success of their musicals. Our findings show that some of the evolving topographical qualities of degree distributions, path lengths and assortativity are relatively stable with time even as collaboration patterns shift, which suggests their changes are only minimally associated with the ebb and flux of the success of new productions. In contrast, the clustering coefficient changed substantially over time and we found that it had a nonlinear association with the production of financially and artistically successful shows. When the clustering coefficient ratio is low or high, the financial and artistic success of the industry is low, while an intermediate level of clustering is associated with successful shows. We supported these findings with sociological theory on the relationship between social structure and collaboration and with tests of statistical inference. Our discussion focuses on connecting the statistical properties of social networks to their performance and the performance of the actors embedded within them.
Saki, Sahar; Bagheri, Hedayat; Deljou, Ali; Zeinalabedini, Mehrshad
2016-01-01
Descurainia sophia is a valuable medicinal plant in family of Brassicaceae. To determine the range of diversity amongst D. sophia in Iran, 32 naturally distributed plants belonging to six natural populations of the Iranian plateau were investigated by inter-simple sequence repeat (ISSR) markers. The average percentage of polymorphism produced by 12 ISSR primers was 86 %. The PIC values for primers ranged from 0.22 to 0.40 and Rp values ranged between 6.5 and 19.9. The relative genetic diversity of the populations was not high (Gst =0.32). However, the value of gene flow revealed by the ISSR marker was high (Nm = 1.03). UPGMA clustering method based on Jaccard similarity coefficient grouped the genotypes into two major clusters. Graph results from Neighbor-Net Network generated after a 1000 bootstrap test using Jaccard coefficient, and STRUCTURE analysis confirmed the UPGMA clustering. The first three PCAs represented 57.31 % of the total variation. The high levels of genetic diversity were observed within populations, which is useful in breeding and conservation programs. ISSR is found to be an eligible marker to study genetic diversity of D. sophia.
Smetana, K; Jirásková, I; Malasková, V; Cermák, J
2002-01-01
Nucleoli were studied in the proliferation as well as maturation granulopoietic compartment in patients suffering from refractory anemia with excess blasts (RAEB) of the myelodysplastic syndrome (MDS) by means of simple cytochemical procedures for the demonstration of nucleolar RNA and silver stained proteins of nucleolus organizer regions. Regardless of the procedure used for the nucleolar visualization, early stages of the granulopoietic compartment and particularly myeloblasts of RAEB patients were characterized by reduction of the nucleolar number expressed by the nucleolar coefficient the values of which resembled those described previously in acute myeloid leukemias. The reduced values of the nucleolar coefficient of these cells in silver stained specimens of RAEB patients were accompanied by a decreased number of clusters of silver stained particles representing interphasic silver stained nucleolus organizer regions (AgNORs). The reduction of these clusters was also described previously in leukemic cells. In addition, the differences in the values of the nucleolar coefficient of granulocytic precursors between specimens stained for RNA and those stained with the silver reaction might reflect changing composition and proportions of nucleolar components in the course of the granulocytic development.
Hierarchical coefficient of a multifractal based network
NASA Astrophysics Data System (ADS)
Moreira, Darlan A.; Lucena, Liacir dos Santos; Corso, Gilberto
2014-02-01
The hierarchical property for a general class of networks stands for a power-law relation between clustering coefficient, CC and connectivity k: CC∝kβ. This relation is empirically verified in several biologic and social networks, as well as in random and deterministic network models, in special for hierarchical networks. In this work we show that the hierarchical property is also present in a Lucena network. To create a Lucena network we use the dual of a multifractal lattice ML, the vertices are the sites of the ML and links are established between neighbouring lattices, therefore this network is space filling and planar. Besides a Lucena network shows a scale-free distribution of connectivity. We deduce a relation for the maximal local clustering coefficient CCimax of a vertex i in a planar graph. This condition expresses that the number of links among neighbour, N△, of a vertex i is equal to its connectivity ki, that means: N△=ki. The Lucena network fulfils the condition N△≃ki independent of ki and the anisotropy of ML. In addition, CCmax implies the threshold β=1 for the hierarchical property for any scale-free planar network.
Race Factors Affecting Performance Times in Elite Long-Track Speed Skating.
Noordhof, Dionne A; Mulder, Roy C; de Koning, Jos J; Hopkins, Will G
2016-05-01
Analysis of sport performance can provide effects of environmental and other venue-specific factors in addition to estimates of within-athlete variability between competitions, which determines smallest worthwhile effects. To analyze elite long-track speed-skating events. Log-transformed performance times were analyzed with a mixed linear model that estimated percentage mean effects for altitude, barometric pressure, type of rink, and competition importance. In addition, coefficients of variation representing residual venue-related differences and within-athlete variability between races within clusters spanning ~8 d were determined. Effects and variability were assessed with magnitude-based inference. A 1000-m increase in altitude resulted in very large mean performance improvements of 2.8% in juniors and 2.1% in seniors. An increase in barometric pressure of 100 hPa resulted in a moderate reduction in performance of 1.1% for juniors but an unclear effect for seniors. Only juniors competed at open rinks, resulting in a very large reduction in performance of 3.4%. Juniors and seniors showed small performance improvements (0.4% and 0.3%) at the more important competitions. After accounting for these effects, residual venue-related variability was still moderate to large. The within-athlete within-cluster race-to-race variability was 0.3-1.3%, with a small difference in variability between male (0.8%) and female juniors (1.0%) and no difference between male and female seniors (both 0.6%). The variability in performance times of skaters is similar to that of athletes in other sports in which air or water resistance limits speed. A performance enhancement of 0.1-0.4% by top-10 athletes is necessary to increase medal-winning chances by 10%.
Tuberculosis outbreaks predicted by characteristics of first patients in a DNA fingerprint cluster.
Kik, Sandra V; Verver, Suzanne; van Soolingen, Dick; de Haas, Petra E W; Cobelens, Frank G; Kremer, Kristin; van Deutekom, Henk; Borgdorff, Martien W
2008-07-01
Some clusters of patients who have Mycobacterium tuberculosis isolates with identical DNA fingerprint patterns grow faster than others. It is unclear what predictors determine cluster growth. To assess whether the development of a tuberculosis (TB) outbreak can be predicted by the characteristics of its first two patients. Demographic and clinical data of all culture-confirmed patients with TB in the Netherlands from 1993 through 2004 were combined with DNA fingerprint data. Clusters were restricted to cluster episodes of 2 years to only detect newly arising clusters. Characteristics of the first two patients were compared between small (2-4 cases) and large (5 or more cases) cluster episodes. Of 5,454 clustered cases, 1,756 (32%) were part of a cluster episode of 2 years. Of 622 cluster episodes, 54 (9%) were large and 568 (91%) were small episodes. Independent predictors for large cluster episodes were as follows: less than 3 months' time between the diagnosis of the first two patients, one or both patients were young (<35 yr), both patients lived in an urban area, and both patients came from sub-Saharan Africa. In the Netherlands, patients in new cluster episodes should be screened for these risk factors. When the risk pattern applies, targeted interventions (e.g., intensified contact investigation) should be considered to prevent further cluster expansion.
SOMBI: Bayesian identification of parameter relations in unstructured cosmological data
NASA Astrophysics Data System (ADS)
Frank, Philipp; Jasche, Jens; Enßlin, Torsten A.
2016-11-01
This work describes the implementation and application of a correlation determination method based on self organizing maps and Bayesian inference (SOMBI). SOMBI aims to automatically identify relations between different observed parameters in unstructured cosmological or astrophysical surveys by automatically identifying data clusters in high-dimensional datasets via the self organizing map neural network algorithm. Parameter relations are then revealed by means of a Bayesian inference within respective identified data clusters. Specifically such relations are assumed to be parametrized as a polynomial of unknown order. The Bayesian approach results in a posterior probability distribution function for respective polynomial coefficients. To decide which polynomial order suffices to describe correlation structures in data, we include a method for model selection, the Bayesian information criterion, to the analysis. The performance of the SOMBI algorithm is tested with mock data. As illustration we also provide applications of our method to cosmological data. In particular, we present results of a correlation analysis between galaxy and active galactic nucleus (AGN) properties provided by the SDSS catalog with the cosmic large-scale-structure (LSS). The results indicate that the combined galaxy and LSS dataset indeed is clustered into several sub-samples of data with different average properties (for example different stellar masses or web-type classifications). The majority of data clusters appear to have a similar correlation structure between galaxy properties and the LSS. In particular we revealed a positive and linear dependency between the stellar mass, the absolute magnitude and the color of a galaxy with the corresponding cosmic density field. A remaining subset of data shows inverted correlations, which might be an artifact of non-linear redshift distortions.
A comparison of hair colour measurement by digital image analysis with reflective spectrophotometry.
Vaughn, Michelle R; van Oorschot, Roland A H; Baindur-Hudson, Swati
2009-01-10
While reflective spectrophotometry is an established method for measuring macroscopic hair colour, it can be cumbersome to use on a large number of individuals and not all reflective spectrophotometry instruments are easily portable. This study investigates the use of digital photographs to measure hair colour and compares its use to reflective spectrophotometry. An understanding of the accuracy of colour determination by these methods is of relevance when undertaking specific investigations, such as those on the genetics of hair colour. Measurements of hair colour may also be of assistance in cases where a photograph is the only evidence of hair colour available (e.g. surveillance). Using the CIE L(*)a(*)b(*) colour space, the hair colour of 134 individuals of European ancestry was measured by both reflective spectrophotometry and by digital image analysis (in V++). A moderate correlation was found along all three colour axes, with Pearson correlation coefficients of 0.625, 0.593 and 0.513 for L(*), a(*) and b(*) respectively (p-values=0.000), with means being significantly overestimated by digital image analysis for all three colour components (by an average of 33.42, 3.38 and 8.00 for L(*), a(*) and b(*) respectively). When using digital image data to group individuals into clusters previously determined by reflective spectrophotometric analysis using a discriminant analysis, individuals were classified into the correct clusters 85.8% of the time when there were two clusters. The percentage of cases correctly classified decreases as the number of clusters increases. It is concluded that, although more convenient, hair colour measurement from digital images has limited use in situations requiring accurate and consistent measurements.
Balasubramaniam, Krishna N; Beisner, Brianne A; Berman, Carol M; De Marco, Arianna; Duboscq, Julie; Koirala, Sabina; Majolo, Bonaventura; MacIntosh, Andrew J; McFarland, Richard; Molesti, Sandra; Ogawa, Hideshi; Petit, Odile; Schino, Gabriele; Sosa, Sebastian; Sueur, Cédric; Thierry, Bernard; de Waal, Frans B M; McCowan, Brenda
2018-01-01
Among nonhuman primates, the evolutionary underpinnings of variation in social structure remain debated, with both ancestral relationships and adaptation to current conditions hypothesized to play determining roles. Here we assess whether interspecific variation in higher-order aspects of female macaque (genus: Macaca) dominance and grooming social structure show phylogenetic signals, that is, greater similarity among more closely-related species. We use a social network approach to describe higher-order characteristics of social structure, based on both direct interactions and secondary pathways that connect group members. We also ask whether network traits covary with each other, with species-typical social style grades, and/or with sociodemographic characteristics, specifically group size, sex-ratio, and current living condition (captive vs. free-living). We assembled 34-38 datasets of female-female dyadic aggression and allogrooming among captive and free-living macaques representing 10 species. We calculated dominance (transitivity, certainty), and grooming (centrality coefficient, Newman's modularity, clustering coefficient) network traits as aspects of social structure. Computations of K statistics and randomization tests on multiple phylogenies revealed moderate-strong phylogenetic signals in dominance traits, but moderate-weak signals in grooming traits. GLMMs showed that grooming traits did not covary with dominance traits and/or social style grade. Rather, modularity and clustering coefficient, but not centrality coefficient, were strongly predicted by group size and current living condition. Specifically, larger groups showed more modular networks with sparsely-connected clusters than smaller groups. Further, this effect was independent of variation in living condition, and/or sampling effort. In summary, our results reveal that female dominance networks were more phylogenetically conserved across macaque species than grooming networks, which were more labile to sociodemographic factors. Such findings narrow down the processes that influence interspecific variation in two core aspects of macaque social structure. Future directions should include using phylogeographic approaches, and addressing challenges in examining the effects of socioecological factors on primate social structure. © 2017 Wiley Periodicals, Inc.
Hollingsworth, John M.; Funk, Russell J.; Garrison, Spencer A.; Owen-Smith, Jason; Kaufman, Samuel A.; Pagani, Francis D.; Nallamothu, Brahmajee K.
2017-01-01
Background Patients undergoing coronary artery bypass grafting (CABG) must often see multiple providers dispersed across many care locations. To test whether “teamwork” (assessed with the bipartite clustering coefficient) among these physicians is a determinant of surgical outcomes, we examined national Medicare data from patients undergoing CABG. Methods and Results Among Medicare beneficiaries who underwent CABG between 2008 and 2011, we mapped relationships between all physicians who treated them during their surgical episodes, including both surgeons and nonsurgeons. After aggregating across CABG episodes in a year to construct the physician social networks serving each health system, we then assessed the level of physician teamwork in these networks with the bipartite clustering coefficient. Finally, we fit a series of multivariable regression models to evaluate associations between a health system’s teamwork level and its 60-day surgical outcomes. We observed substantial variation in the level of teamwork between health systems performing CABG (standard deviation for the bipartite clustering coefficient was 0.09). While health systems with high and low teamwork levels treated beneficiaries with comparable comorbidity scores, these health systems differed over several sociocultural and healthcare capacity factors (e.g., physician staff size, surgical caseload). After controlling for these differences, health systems with higher teamwork levels had significantly lower 60-day rates of emergency department visit, readmission, and mortality. Conclusions Health systems with physicians who tend to work together in tightly knit groups during CABG episodes realize better surgical outcomes. As such, delivery system reforms focused on building teamwork may have positive effects on surgical care. PMID:28263939
Hollingsworth, John M; Funk, Russell J; Garrison, Spencer A; Owen-Smith, Jason; Kaufman, Samuel A; Pagani, Francis D; Nallamothu, Brahmajee K
2016-11-01
Patients undergoing coronary artery bypass grafting (CABG) must often see multiple providers dispersed across many care locations. To test whether teamwork (assessed with the bipartite clustering coefficient) among these physicians is a determinant of surgical outcomes, we examined national Medicare data from patients undergoing CABG. Among Medicare beneficiaries who underwent CABG between 2008 and 2011, we mapped relationships between all physicians who treated them during their surgical episodes, including both surgeons and nonsurgeons. After aggregating across CABG episodes in a year to construct the physician social networks serving each health system, we then assessed the level of physician teamwork in these networks with the bipartite clustering coefficient. Finally, we fit a series of multivariable regression models to evaluate associations between a health system's teamwork level and its 60-day surgical outcomes. We observed substantial variation in the level of teamwork between health systems performing CABG (SD for the bipartite clustering coefficient was 0.09). Although health systems with high and low teamwork levels treated beneficiaries with comparable comorbidity scores, these health systems differed over several sociocultural and healthcare capacity factors (eg, physician staff size and surgical caseload). After controlling for these differences, health systems with higher teamwork levels had significantly lower 60-day rates of emergency department visit, readmission, and mortality. Health systems with physicians who tend to work together in tightly-knit groups during CABG episodes realize better surgical outcomes. As such, delivery system reforms focused on building teamwork may have positive effects on surgical care. © 2016 American Heart Association, Inc.
Bharath, Rose D; Panda, Rajanikant; Reddam, Venkateswara Reddy; Bhaskar, M V; Gohel, Suril; Bhardwaj, Sujas; Prajapati, Arvind; Pal, Pramod Kumar
2017-01-01
Background and Purpose : Repetitive transcranial magnetic stimulation (rTMS) induces widespread changes in brain connectivity. As the network topology differences induced by a single session of rTMS are less known we undertook this study to ascertain whether the network alterations had a small-world morphology using multi-modal graph theory analysis of simultaneous EEG-fMRI. Method : Simultaneous EEG-fMRI was acquired in duplicate before (R1) and after (R2) a single session of rTMS in 14 patients with Writer's Cramp (WC). Whole brain neuronal and hemodynamic network connectivity were explored using the graph theory measures and clustering coefficient, path length and small-world index were calculated for EEG and resting state fMRI (rsfMRI). Multi-modal graph theory analysis was used to evaluate the correlation of EEG and fMRI clustering coefficients. Result : A single session of rTMS was found to increase the clustering coefficient and small-worldness significantly in both EEG and fMRI ( p < 0.05). Multi-modal graph theory analysis revealed significant modulations in the fronto-parietal regions immediately after rTMS. The rsfMRI revealed additional modulations in several deep brain regions including cerebellum, insula and medial frontal lobe. Conclusion : Multi-modal graph theory analysis of simultaneous EEG-fMRI can supplement motor physiology methods in understanding the neurobiology of rTMS in vivo . Coinciding evidence from EEG and rsfMRI reports small-world morphology for the acute phase network hyper-connectivity indicating changes ensuing low-frequency rTMS is probably not "noise".
Hashmi, Javeria Ali; Kong, Jian; Spaeth, Rosa; Khan, Sheraz; Kaptchuk, Ted J; Gollub, Randy L
2014-03-12
Placebo analgesia is an indicator of how efficiently the brain translates psychological signals conveyed by a treatment procedure into pain relief. It has been demonstrated that functional connectivity between distributed brain regions predicts placebo analgesia in chronic back pain patients. Greater network efficiency in baseline brain networks may allow better information transfer and facilitate adaptive physiological responses to psychological aspects of treatment. Here, we theorized that topological network alignments in resting state scans predict psychologically conditioned analgesic responses to acupuncture treatment in chronic knee osteoarthritis pain patients (n = 45). Analgesia was induced by building positive expectations toward acupuncture treatment with verbal suggestion and heat pain conditioning on a test site of the arm. This procedure induced significantly more analgesia after sham or real acupuncture on the test site than in a control site. The psychologically conditioned analgesia was invariant to sham versus real treatment. Efficiency of information transfer within local networks calculated with graph-theoretic measures (local efficiency and clustering coefficients) significantly predicted conditioned analgesia. Clustering coefficients in regions associated with memory, motivation, and pain modulation were closely involved in predicting analgesia. Moreover, women showed higher clustering coefficients and marginally greater pain reduction than men. Overall, analgesic response to placebo cues can be predicted from a priori resting state data by observing local network topology. Such low-cost synchronizations may represent preparatory resources that facilitate subsequent performance of brain circuits in responding to adaptive environmental cues. This suggests a potential utility of network measures in predicting placebo response for clinical use.
Sleep stages identification in patients with sleep disorder using k-means clustering
NASA Astrophysics Data System (ADS)
Fadhlullah, M. U.; Resahya, A.; Nugraha, D. F.; Yulita, I. N.
2018-05-01
Data mining is a computational intelligence discipline where a large dataset processed using a certain method to look for patterns within the large dataset. This pattern then used for real time application or to develop some certain knowledge. This is a valuable tool to solve a complex problem, discover new knowledge, data analysis and decision making. To be able to get the pattern that lies inside the large dataset, clustering method is used to get the pattern. Clustering is basically grouping data that looks similar so a certain pattern can be seen in the large data set. Clustering itself has several algorithms to group the data into the corresponding cluster. This research used data from patients who suffer sleep disorders and aims to help people in the medical world to reduce the time required to classify the sleep stages from a patient who suffers from sleep disorders. This study used K-Means algorithm and silhouette evaluation to find out that 3 clusters are the optimal cluster for this dataset which means can be divided to 3 sleep stages.
NASA Astrophysics Data System (ADS)
Tisch, J. W. G.; Hay, N.; Springate, E.; Gumbrell, E. T.; Hutchinson, M. H. R.; Marangos, J. P.
1999-10-01
We present measurements of ion energies from the interaction of intense, femtosecond laser pulses with large mixed-species clusters. Multi-keV protons and ~100-keV iodine ions are observed from the explosion of HI clusters produced in a gas jet operated at room temperature. Clusters formed from molecular gases such as HI are thus seen to extend the advantages of the laser-cluster interaction to elements that do not readily form single-species clusters. In the light of recently reported nuclear fusion in laser-heated clusters, we also examine the possibility of boosting the explosion energies of low-Z ions through the use of mixed species clusters.
How networks split when rival leaders emerge
NASA Astrophysics Data System (ADS)
Krawczyk, Malgorzata J.; Kułakowski, Krzysztof
2018-02-01
In a model social network, two hubs are appointed as leaders. Consecutive cutting of links on a shortest path between them splits the network in two. Next, the network is growing again till the initial size. Both processes are cyclically repeated. We investigate the size of a cluster containing the largest hub, the degree, the clustering coefficient, the closeness centrality and the betweenness centrality of the largest hub, as dependent on the number of cycles. The results are interpreted in terms of the leader's benefits from conflicts.
Using neighborhood cohesiveness to infer interactions between protein domains.
Segura, Joan; Sorzano, C O S; Cuenca-Alba, Jesus; Aloy, Patrick; Carazo, J M
2015-08-01
In recent years, large-scale studies have been undertaken to describe, at least partially, protein-protein interaction maps, or interactomes, for a number of relevant organisms, including human. However, current interactomes provide a somehow limited picture of the molecular details involving protein interactions, mostly because essential experimental information, especially structural data, is lacking. Indeed, the gap between structural and interactomics information is enlarging and thus, for most interactions, key experimental information is missing. We elaborate on the observation that many interactions between proteins involve a pair of their constituent domains and, thus, the knowledge of how protein domains interact adds very significant information to any interactomic analysis. In this work, we describe a novel use of the neighborhood cohesiveness property to infer interactions between protein domains given a protein interaction network. We have shown that some clustering coefficients can be extended to measure a degree of cohesiveness between two sets of nodes within a network. Specifically, we used the meet/min coefficient to measure the proportion of interacting nodes between two sets of nodes and the fraction of common neighbors. This approach extends previous works where homolog coefficients were first defined around network nodes and later around edges. The proposed approach substantially increases both the number of predicted domain-domain interactions as well as its accuracy as compared with current methods. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Abualhaj, Bedor; Weng, Guoyang; Ong, Melissa; Attarwala, Ali Asgar; Molina, Flavia; Büsing, Karen; Glatting, Gerhard
2017-01-01
Dynamic [ 18 F]fluoro-ethyl-L-tyrosine positron emission tomography ([ 18 F]FET-PET) is used to identify tumor lesions for radiotherapy treatment planning, to differentiate glioma recurrence from radiation necrosis and to classify gliomas grading. To segment different regions in the brain k-means cluster analysis can be used. The main disadvantage of k-means is that the number of clusters must be pre-defined. In this study, we therefore compared different cluster validity indices for automated and reproducible determination of the optimal number of clusters based on the dynamic PET data. The k-means algorithm was applied to dynamic [ 18 F]FET-PET images of 8 patients. Akaike information criterion (AIC), WB, I, modified Dunn's and Silhouette indices were compared on their ability to determine the optimal number of clusters based on requirements for an adequate cluster validity index. To check the reproducibility of k-means, the coefficients of variation CVs of the objective function values OFVs (sum of squared Euclidean distances within each cluster) were calculated using 100 random centroid initialization replications RCI 100 for 2 to 50 clusters. k-means was performed independently on three neighboring slices containing tumor for each patient to investigate the stability of the optimal number of clusters within them. To check the independence of the validity indices on the number of voxels, cluster analysis was applied after duplication of a slice selected from each patient. CVs of index values were calculated at the optimal number of clusters using RCI 100 to investigate the reproducibility of the validity indices. To check if the indices have a single extremum, visual inspection was performed on the replication with minimum OFV from RCI 100 . The maximum CV of OFVs was 2.7 × 10 -2 from all patients. The optimal number of clusters given by modified Dunn's and Silhouette indices was 2 or 3 leading to a very poor segmentation. WB and I indices suggested in median 5, [range 4-6] and 4, [range 3-6] clusters, respectively. For WB, I, modified Dunn's and Silhouette validity indices the suggested optimal number of clusters was not affected by the number of the voxels. The maximum coefficient of variation of WB, I, modified Dunn's, and Silhouette validity indices were 3 × 10 -2 , 1, 2 × 10 -1 and 3 × 10 -3 , respectively. WB-index showed a single global maximum, whereas the other indices showed also local extrema. From the investigated cluster validity indices, the WB-index is best suited for automated determination of the optimal number of clusters for [ 18 F]FET-PET brain images for the investigated image reconstruction algorithm and the used scanner: it yields meaningful results allowing better differentiation of tissues with higher number of clusters, it is simple, reproducible and has an unique global minimum. © 2016 American Association of Physicists in Medicine.
A rapid method to extract Seebeck coefficient under a large temperature difference
NASA Astrophysics Data System (ADS)
Zhu, Qing; Kim, Hee Seok; Ren, Zhifeng
2017-09-01
The Seebeck coefficient is one of the three important properties in thermoelectric materials. Since thermoelectric materials usually work under large temperature difference in real applications, we propose a quasi-steady state method to accurately measure the Seebeck coefficient under large temperature gradient. Compared to other methods, this method is not only highly accurate but also less time consuming. It can measure the Seebeck coefficient in both the temperature heating up and cooling down processes. In this work, a Zintl material (Mg3.15Nb0.05Sb1.5Bi0.49Te0.01) was tested to extract the Seebeck coefficient from room temperature to 573 K. Compared with a commercialized Seebeck coefficient measurement device (ZEM-3), there is ±5% difference between those from ZEM-3 and this method.
Atlas-guided cluster analysis of large tractography datasets.
Ros, Christian; Güllmar, Daniel; Stenzel, Martin; Mentzel, Hans-Joachim; Reichenbach, Jürgen Rainer
2013-01-01
Diffusion Tensor Imaging (DTI) and fiber tractography are important tools to map the cerebral white matter microstructure in vivo and to model the underlying axonal pathways in the brain with three-dimensional fiber tracts. As the fast and consistent extraction of anatomically correct fiber bundles for multiple datasets is still challenging, we present a novel atlas-guided clustering framework for exploratory data analysis of large tractography datasets. The framework uses an hierarchical cluster analysis approach that exploits the inherent redundancy in large datasets to time-efficiently group fiber tracts. Structural information of a white matter atlas can be incorporated into the clustering to achieve an anatomically correct and reproducible grouping of fiber tracts. This approach facilitates not only the identification of the bundles corresponding to the classes of the atlas; it also enables the extraction of bundles that are not present in the atlas. The new technique was applied to cluster datasets of 46 healthy subjects. Prospects of automatic and anatomically correct as well as reproducible clustering are explored. Reconstructed clusters were well separated and showed good correspondence to anatomical bundles. Using the atlas-guided cluster approach, we observed consistent results across subjects with high reproducibility. In order to investigate the outlier elimination performance of the clustering algorithm, scenarios with varying amounts of noise were simulated and clustered with three different outlier elimination strategies. By exploiting the multithreading capabilities of modern multiprocessor systems in combination with novel algorithms, our toolkit clusters large datasets in a couple of minutes. Experiments were conducted to investigate the achievable speedup and to demonstrate the high performance of the clustering framework in a multiprocessing environment.
Besga, Ariadna; Chyzhyk, Darya; Gonzalez-Ortega, Itxaso; Echeveste, Jon; Graña-Lecuona, Marina; Graña, Manuel; Gonzalez-Pinto, Ana
2017-01-01
Background: Late Onset Bipolar Disorder (LOBD) is the development of Bipolar Disorder (BD) at an age above 50 years old. It is often difficult to differentiate from other aging dementias, such as Alzheimer's Disease (AD), because they share cognitive and behavioral impairment symptoms. Objectives: We look for WM tract voxel clusters showing significant differences when comparing of AD vs. LOBD, and its correlations with systemic blood plasma biomarkers (inflammatory, neurotrophic factors, and oxidative stress). Materials: A sample of healthy controls (HC) ( n = 19), AD patients ( n = 35), and LOBD patients ( n = 24) was recruited at the Alava University Hospital. Blood plasma samples were obtained at recruitment time and analyzed to extract the inflammatory, oxidative stress, and neurotrophic factors. Several modalities of MRI were acquired for each subject, Methods: Fractional anisotropy (FA) coefficients are obtained from diffusion weighted imaging (DWI). Tract based spatial statistics (TBSS) finds FA skeleton clusters of WM tract voxels showing significant differences for all possible contrasts between HC, AD, and LOBD. An ANOVA F -test over all contrasts is carried out. Results of F -test are used to mask TBSS detected clusters for the AD > LOBD and LOBD > AD contrast to select the image clusters used for correlation analysis. Finally, Pearson's correlation coefficients between FA values at cluster sites and systemic blood plasma biomarker values are computed. Results: The TBSS contrasts with by ANOVA F -test has identified strongly significant clusters in the forceps minor, inferior longitudinal fasciculus, inferior fronto-occipital fasciculus, and cingulum gyrus. The correlation analysis of these tract clusters found strong negative correlation of AD with the nerve growth factor (NGF) and brain derived neurotrophic factor (BDNF) blood biomarkers. Negative correlation of AD and positive correlation of LOBD with inflammation biomarker IL6 was also found. Conclusion: TBSS voxel clusters tract atlas localizations are consistent with greater behavioral impairment and mood disorders in LOBD than in AD. Correlation analysis confirms that neurotrophic factors (i.e., NGF, BDNF) play a great role in AD while are absent in LOBD pathophysiology. Also, correlation results of IL1 and IL6 suggest stronger inflammatory effects in LOBD than in AD.
Identification of lethal cluster of genes in the yeast transcription network
NASA Astrophysics Data System (ADS)
Rho, K.; Jeong, H.; Kahng, B.
2006-05-01
Identification of essential or lethal genes would be one of the ultimate goals in drug designs. Here we introduce an in silico method to select the cluster with a high population of lethal genes, called lethal cluster, through microarray assay. We construct a gene transcription network based on the microarray expression level. Links are added one by one in the descending order of the Pearson correlation coefficients between two genes. As the link density p increases, two meaningful link densities pm and ps are observed. At pm, which is smaller than the percolation threshold, the number of disconnected clusters is maximum, and the lethal genes are highly concentrated in a certain cluster that needs to be identified. Thus the deletion of all genes in that cluster could efficiently lead to a lethal inviable mutant. This lethal cluster can be identified by an in silico method. As p increases further beyond the percolation threshold, the power law behavior in the degree distribution of a giant cluster appears at ps. We measure the degree of each gene at ps. With the information pertaining to the degrees of each gene at ps, we return to the point pm and calculate the mean degree of genes of each cluster. We find that the lethal cluster has the largest mean degree.
NASA Technical Reports Server (NTRS)
Moog, R. D.; Bacchus, D. L.; Utreja, L. R.
1979-01-01
The aerodynamic performance characteristics have been determined for the Space Shuttle Solid Rocket Booster drogue, main, and pilot parachutes. The performance evaluation on the 20-degree conical ribbon parachutes is based primarily on air drop tests of full scale prototype parachutes. In addition, parametric wind tunnel tests were performed and used in parachute configuration development and preliminary performance assessments. The wind tunnel test data are compared to the drop test results and both sets of data are used to determine the predicted performance of the Solid Rocket Booster flight parachutes. Data from other drop tests of large ribbon parachutes are also compared with the Solid Rocket Booster parachute performance characteristics. Parameters assessed include full open terminal drag coefficients, reefed drag area, opening characteristics, clustering effects, and forebody interference.
NASA Technical Reports Server (NTRS)
Schowalter, D. G.; DeCroix, D. S.; Lin, Y. L.; Arya, S. P.; Kaplan, M. L.
1996-01-01
It was found that the homogeneity of the surface drag coefficient plays an important role in the large scale structure of turbulence in large-eddy simulation of the convective atmospheric boundary layer. Particularly when a ground surface temperature was specified, large horizontal anisotropies occurred when the drag coefficient depended upon local velocities and heat fluxes. This was due to the formation of streamwise roll structures in the boundary layer. In reality, these structures have been found to form when shear is approximately balanced by buoyancy. The present cases, however, were highly convective. The formation was caused by particularly low values of the drag coefficient at the entrance to thermal plume structures.
Power and money in cluster randomized trials: when is it worth measuring a covariate?
Moerbeek, Mirjam
2006-08-15
The power to detect a treatment effect in cluster randomized trials can be increased by increasing the number of clusters. An alternative is to include covariates into the regression model that relates treatment condition to outcome. In this paper, formulae are derived in order to evaluate both strategies on basis of their costs. It is shown that the strategy that uses covariates is more cost-efficient in detecting a treatment effect when the costs to measure these covariates are small and the correlation between the covariates and outcome is sufficiently large. The minimum required correlation depends on the cluster size, and the costs to recruit a cluster and to measure the covariate, relative to the costs to recruit a person. Measuring a covariate that varies at the person level only is recommended when cluster sizes are small and the costs to recruit and measure a cluster are large. Measuring a cluster level covariate is recommended when cluster sizes are large and the costs to recruit and measure a cluster are small. An illustrative example shows the use of the formulae in a practical setting. Copyright 2006 John Wiley & Sons, Ltd.
NASA Technical Reports Server (NTRS)
Thomas, K. L.; Keller, L. P.; Klock, W.; Warren, J.; Blanford, G. E.; Mckay, David S.
1994-01-01
The objective of this study was to determine whether or not cluster particles are sufficiently homogeneous to enable observations from one fragment of the cluster to be extrapolated to the entire cluster. We report on the results of a consortium study of the fragments of a large cluster particle. Multiple fragments from one large cluster were distributed to several research groups and were subjected to a variety of mineralogical and chemical analyses including: SEM, TEM, ion probe, SXRF, noble gas measurements, and microprobe laser mass spectrometry of individual fragments.
NASA Astrophysics Data System (ADS)
Wagner-Kaiser, R.; Mackey, Dougal; Sarajedini, Ata; Cohen, Roger E.; Geisler, Doug; Yang, Soung-Chul; Grocholski, Aaron J.; Cummings, Jeffrey D.
2018-03-01
We leverage new high-quality data from Hubble Space Telescope program GO-14164 to explore the variation in horizontal branch morphology among globular clusters in the Large Magellanic Cloud (LMC). Our new observations lead to photometry with a precision commensurate with that available for the Galactic globular cluster population. Our analysis indicates that, once metallicity is accounted for, clusters in the LMC largely share similar horizontal branch morphologies regardless of their location within the system. Furthermore, the LMC clusters possess, on average, slightly redder morphologies than most of the inner halo Galactic population; we find, instead, that their characteristics tend to be more similar to those exhibited by clusters in the outer Galactic halo. Our results are consistent with previous studies, showing a correlation between horizontal branch morphology and age.
NASA Astrophysics Data System (ADS)
Portegies Zwart, S. F.; Chen, H.-C.
2008-06-01
We reconstruct the initial two-body relaxation time at the half mass radius for a sample of young ⪉ 300 Myr star clusters in the Large Magellanic cloud. We achieve this by simulating star clusters with 12288 to 131072 stars using direct N-body integration. The equations of motion of all stars are calculated with high precision direct N-body simulations which include the effects of the evolution of single stars and binaries. We find that the initial relaxation times of the sample of observed clusters in the Large Magellanic Cloud ranges from about 200 Myr to about 2 Gyr. The reconstructed initial half-mass relaxation times for these clusters have a much narrower distribution than the currently observed distribution, which ranges over more than two orders of magnitude.
Scalable cluster administration - Chiba City I approach and lessons learned.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Navarro, J. P.; Evard, R.; Nurmi, D.
2002-07-01
Systems administrators of large clusters often need to perform the same administrative activity hundreds or thousands of times. Often such activities are time-consuming, especially the tasks of installing and maintaining software. By combining network services such as DHCP, TFTP, FTP, HTTP, and NFS with remote hardware control, cluster administrators can automate all administrative tasks. Scalable cluster administration addresses the following challenge: What systems design techniques can cluster builders use to automate cluster administration on very large clusters? We describe the approach used in the Mathematics and Computer Science Division of Argonne National Laboratory on Chiba City I, a 314-node Linuxmore » cluster; and we analyze the scalability, flexibility, and reliability benefits and limitations from that approach.« less
Perturbative two- and three-loop coefficients from large β Monte Carlo
NASA Astrophysics Data System (ADS)
Lepage, G. P.; Mackenzie, P. B.; Shakespeare, N. H.; Trottier, H. D.
Perturbative coefficients for Wilson loops and the static quark self-energy are extracted from Monte Carlo simulations at large β on finite volumes, where all the lattice momenta are large. The Monte Carlo results are in excellent agreement with perturbation theory through second order. New results for third order coefficients are reported. Twisted boundary conditions are used to eliminate zero modes and to suppress Z3 tunneling.
Perturbative two- and three-loop coefficients from large b Monte Carlo
DOE Office of Scientific and Technical Information (OSTI.GOV)
G.P. Lepage; P.B. Mackenzie; N.H. Shakespeare
1999-10-18
Perturbative coefficients for Wilson loops and the static quark self-energy are extracted from Monte Carlo simulations at large {beta} on finite volumes, where all the lattice momenta are large. The Monte Carlo results are in excellent agreement with perturbation theory through second order. New results for third order coefficients are reported. Twisted boundary conditions are used to eliminate zero modes and to suppress Z{sub 3} tunneling.
Evolution of the properties of Al(n)N(n) clusters with size.
Costales, Aurora; Blanco, M A; Francisco, E; Pandey, Ravindra; Martín Pendás, A
2005-12-29
A global optimization of stoichiometric (AlN)(n) clusters (n = 1-25, 30, 35, ..., 95, 100) has been performed using the basin-hopping (BH) method and describing the interactions with simple and yet realistic interatomic potentials. The results for the smaller isomers agree with those of previous electronic structure calculations, thus validating the present scheme. The lowest-energy isomers found can be classified in three different categories according to their structural motifs: (i) small clusters (n = 2-5), with planar ring structures and 2-fold coordination, (ii) medium clusters (n = 6-40), where a competition between stacked rings and globular-like empty cages exists, and (iii) large clusters (n > 40), large enough to mix different elements of the previous stage. All the atoms in small and medium-sized clusters are in the surface, while large clusters start to display interior atoms. Large clusters display a competition between tetrahedral and octahedral-like features: the former lead to a lower energy interior in the cluster, while the latter allow for surface terminations with a lower energy. All of the properties studied present different regimes according to the above classification. It is of particular interest that the local properties of the interior atoms do converge to the bulk limit. The isomers with n = 6 and 12 are specially stable with respect to the gain or loss of AlN molecules.
2017-05-05
dependent density functional theory (TD-DFT). The size of the clusters considered is relatively large compared to those considered in previous studies...are characterized by many different geometries, which potentially can be optimized with respect to specific materials design criteria, i.e., molecular...SixOy molecular clusters using density functional theory (DFT). The size of the clusters considered, however, is relatively large compared to those
Irradiation-induced microchemical changes in highly irradiated 316 stainless steel
NASA Astrophysics Data System (ADS)
Fujii, K.; Fukuya, K.
2016-02-01
Cold-worked 316 stainless steel specimens irradiated to 74 dpa in a pressurized water reactor (PWR) were analyzed by atom probe tomography (APT) to extend knowledge of solute clusters and segregation at higher doses. The analyses confirmed that those clusters mainly enriched in Ni-Si or Ni-Si-Mn were formed at high number density. The clusters were divided into three types based on their size and Mn content; small Ni-Si clusters (3-4 nm in diameter), and large Ni-Si and Ni-Si-Mn clusters (8-10 nm in diameter). The total cluster number density was 7.7 × 1023 m-3. The fraction of large clusters was almost 1/10 of the total density. The average composition (in at%) for small clusters was: Fe, 54; Cr, 12; Mn, 1; Ni, 22; Si, 11; Mo, 1, and for large clusters it was: Fe, 44; Cr, 9; Mn, 2; Ni, 29; Si, 14; Mo,1. It was likely that some of the Ni-Si clusters correspond to γ‧ phase precipitates while the Ni-Si-Mn clusters were precursors of G phase precipitates. The APT analyses at grain boundaries confirmed enrichment of Ni, Si, P and Cu and depletion of Fe, Cr, Mo and Mn. The segregation behavior was consistent with previous knowledge of radiation induced segregation.
Anharmonic effects in the quantum cluster equilibrium method
NASA Astrophysics Data System (ADS)
von Domaros, Michael; Perlt, Eva
2017-03-01
The well-established quantum cluster equilibrium (QCE) model provides a statistical thermodynamic framework to apply high-level ab initio calculations of finite cluster structures to macroscopic liquid phases using the partition function. So far, the harmonic approximation has been applied throughout the calculations. In this article, we apply an important correction in the evaluation of the one-particle partition function and account for anharmonicity. Therefore, we implemented an analytical approximation to the Morse partition function and the derivatives of its logarithm with respect to temperature, which are required for the evaluation of thermodynamic quantities. This anharmonic QCE approach has been applied to liquid hydrogen chloride and cluster distributions, and the molar volume, the volumetric thermal expansion coefficient, and the isobaric heat capacity have been calculated. An improved description for all properties is observed if anharmonic effects are considered.
Obaid, Ramiz; Abu-Qaoud, Hassan; Arafeh, Rami
2014-01-01
Eight accessions of olive trees from three common varieties in Palestine, Nabali Baladi, Nabali Mohassan and Surri, were genetically evaluated using five simple sequence repeat (SSR) markers. A total of 17 alleles from 5 loci were observed in which 15 (88.2%) were polymorphic and 2 (11.8%) were monomorphic. An average of 3.4 alleles per locus was found ranging from 2.0 alleles with the primers GAPU-103 and DCA-9 to 5.0 alleles with U9932 and DCA-16. The smallest amplicon size observed was 50 bp with the primer DCA-16, whereas the largest one (450 bp) with the primer U9932. Cluster analysis with the unweighted pair group method with arithmetic average (UPGMA) showed three clusters: a cluster with four accessions from the ‘Nabali Baladi’ cultivar, another cluster with three accessions that represents the ‘Nabali Mohassen’ cultivar and finally the ‘Surri’ cultivar. The similarity coefficient for the eight olive tree samples ranged from a maximum of 100% between two accessions from Nabali Baladi and also in two other samples from Nabali Mohassan, to a minimum similarity coefficient (0.315) between the Surri and two Nabali Baladi accessions. The results in this investigation clearly highlight the genetic dissimilarity between the three main olive cultivars that have been misidentified and mixed up in the past, based on conventional morphological characters. PMID:26019564
Assessment of the vision-specific quality of life using clustered visual field in glaucoma patients.
Sawada, Hideko; Yoshino, Takaiko; Fukuchi, Takeo; Abe, Haruki
2014-02-01
To investigate the significance of vision-specific quality of life (QOL) in glaucoma patients based on the location of visual field defects. We examined 336 eyes of 168 patients. The 25-item National Eye Institute Visual Function Questionnaire was used to evaluate patients' QOL. Visual field testing was performed using the Humphrey Field Analyzer; the visual field was divided into 10 clusters. We defined the eye with better mean deviation as the better eye and the fellow eye as the worse eye. A single linear regression analysis was applied to assess the significance of the relationship between QOL and the clustered visual field. The strongest correlation was observed in the lower paracentral visual field in the better eye. The lower peripheral visual field in the better eye also showed a good correlation. Correlation coefficients in the better eye were generally higher than those in the worse eye. For driving, the upper temporal visual field in the better eye was the most strongly correlated (r=0.509). For role limitation and peripheral vision, the lower peripheral visual field in the better eye had the highest correlation coefficients at 0.459 and 0.425, respectively. Overall, clusters in the lower hemifield in the better eye were more strongly correlated with QOL than those in the worse eye. In particular, the lower paracentral visual field in the better eye was correlated most strongly of all. Driving, however, strongly correlated with the upper hemifield in the better eye.
Technical Efficiency of Automotive Industry Cluster in Chennai
NASA Astrophysics Data System (ADS)
Bhaskaran, E.
2012-07-01
Chennai is also called as Detroit of India due to its automotive industry presence producing over 40 % of the India's vehicle and components. During 2001-2002, diagnostic study was conducted on the Automotive Component Industries (ACI) in Ambattur Industrial Estate, Chennai and in SWOT analysis it was found that it had faced problems on infrastructure, technology, procurement, production and marketing. In the year 2004-2005 under the cluster development approach (CDA), they formed Chennai auto cluster, under public private partnership concept, received grant from Government of India, Government of Tamil Nadu, Ambattur Municipality, bank loans and stake holders. This results development in infrastructure, technology, procurement, production and marketing interrelationships among ACI. The objective is to determine the correlation coefficient, regression equation, technical efficiency, peer weights, slack variables and return to scale of cluster before and after the CDA. The methodology adopted is collection of primary data from ACI and analyzing using data envelopment analysis (DEA) of input oriented Banker-Charnes-Cooper model. There is significant increase in correlation coefficient and the regression analysis reveals that for one percent increase in employment and net worth, the gross output increases significantly after the CDA. The DEA solver gives the technical efficiency of ACI by taking shift, employment, net worth as input data and quality, gross output and export ratio as output data. From the technical score and ranking of ACI, it is found that there is significant increase in technical efficiency of ACI when compared to CDA. The slack variables obtained clearly reveals the excess employment and net worth and no shortage of gross output. To conclude there is increase in technical efficiency of not only Chennai auto cluster in general but also Chennai auto components industries in particular.
Open star clusters and Galactic structure
NASA Astrophysics Data System (ADS)
Joshi, Yogesh C.
2018-04-01
In order to understand the Galactic structure, we perform a statistical analysis of the distribution of various cluster parameters based on an almost complete sample of Galactic open clusters yet available. The geometrical and physical characteristics of a large number of open clusters given in the MWSC catalogue are used to study the spatial distribution of clusters in the Galaxy and determine the scale height, solar offset, local mass density and distribution of reddening material in the solar neighbourhood. We also explored the mass-radius and mass-age relations in the Galactic open star clusters. We find that the estimated parameters of the Galactic disk are largely influenced by the choice of cluster sample.
Atlas-Guided Cluster Analysis of Large Tractography Datasets
Ros, Christian; Güllmar, Daniel; Stenzel, Martin; Mentzel, Hans-Joachim; Reichenbach, Jürgen Rainer
2013-01-01
Diffusion Tensor Imaging (DTI) and fiber tractography are important tools to map the cerebral white matter microstructure in vivo and to model the underlying axonal pathways in the brain with three-dimensional fiber tracts. As the fast and consistent extraction of anatomically correct fiber bundles for multiple datasets is still challenging, we present a novel atlas-guided clustering framework for exploratory data analysis of large tractography datasets. The framework uses an hierarchical cluster analysis approach that exploits the inherent redundancy in large datasets to time-efficiently group fiber tracts. Structural information of a white matter atlas can be incorporated into the clustering to achieve an anatomically correct and reproducible grouping of fiber tracts. This approach facilitates not only the identification of the bundles corresponding to the classes of the atlas; it also enables the extraction of bundles that are not present in the atlas. The new technique was applied to cluster datasets of 46 healthy subjects. Prospects of automatic and anatomically correct as well as reproducible clustering are explored. Reconstructed clusters were well separated and showed good correspondence to anatomical bundles. Using the atlas-guided cluster approach, we observed consistent results across subjects with high reproducibility. In order to investigate the outlier elimination performance of the clustering algorithm, scenarios with varying amounts of noise were simulated and clustered with three different outlier elimination strategies. By exploiting the multithreading capabilities of modern multiprocessor systems in combination with novel algorithms, our toolkit clusters large datasets in a couple of minutes. Experiments were conducted to investigate the achievable speedup and to demonstrate the high performance of the clustering framework in a multiprocessing environment. PMID:24386292
Method and system for data clustering for very large databases
NASA Technical Reports Server (NTRS)
Livny, Miron (Inventor); Zhang, Tian (Inventor); Ramakrishnan, Raghu (Inventor)
1998-01-01
Multi-dimensional data contained in very large databases is efficiently and accurately clustered to determine patterns therein and extract useful information from such patterns. Conventional computer processors may be used which have limited memory capacity and conventional operating speed, allowing massive data sets to be processed in a reasonable time and with reasonable computer resources. The clustering process is organized using a clustering feature tree structure wherein each clustering feature comprises the number of data points in the cluster, the linear sum of the data points in the cluster, and the square sum of the data points in the cluster. A dense region of data points is treated collectively as a single cluster, and points in sparsely occupied regions can be treated as outliers and removed from the clustering feature tree. The clustering can be carried out continuously with new data points being received and processed, and with the clustering feature tree being restructured as necessary to accommodate the information from the newly received data points.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ben-Naim, Eli; Krapivsky, Paul
Here we generalize the ordinary aggregation process to allow for choice. In ordinary aggregation, two random clusters merge and form a larger aggregate. In our implementation of choice, a target cluster and two candidate clusters are randomly selected and the target cluster merges with the larger of the two candidate clusters.We study the long-time asymptotic behavior and find that as in ordinary aggregation, the size density adheres to the standard scaling form. However, aggregation with choice exhibits a number of different features. First, the density of the smallest clusters exhibits anomalous scaling. Second, both the small-size and the large-size tailsmore » of the density are overpopulated, at the expense of the density of moderate-size clusters. Finally, we also study the complementary case where the smaller candidate cluster participates in the aggregation process and find an abundance of moderate clusters at the expense of small and large clusters. Additionally, we investigate aggregation processes with choice among multiple candidate clusters and a symmetric implementation where the choice is between two pairs of clusters.« less
Graph characterization via Ihara coefficients.
Ren, Peng; Wilson, Richard C; Hancock, Edwin R
2011-02-01
The novel contributions of this paper are twofold. First, we demonstrate how to characterize unweighted graphs in a permutation-invariant manner using the polynomial coefficients from the Ihara zeta function, i.e., the Ihara coefficients. Second, we generalize the definition of the Ihara coefficients to edge-weighted graphs. For an unweighted graph, the Ihara zeta function is the reciprocal of a quasi characteristic polynomial of the adjacency matrix of the associated oriented line graph. Since the Ihara zeta function has poles that give rise to infinities, the most convenient numerically stable representation is to work with the coefficients of the quasi characteristic polynomial. Moreover, the polynomial coefficients are invariant to vertex order permutations and also convey information concerning the cycle structure of the graph. To generalize the representation to edge-weighted graphs, we make use of the reduced Bartholdi zeta function. We prove that the computation of the Ihara coefficients for unweighted graphs is a special case of our proposed method for unit edge weights. We also present a spectral analysis of the Ihara coefficients and indicate their advantages over other graph spectral methods. We apply the proposed graph characterization method to capturing graph-class structure and clustering graphs. Experimental results reveal that the Ihara coefficients are more effective than methods based on Laplacian spectra.
Current State of an Intelligent System to Aid in Tephra Layer Correlation
NASA Astrophysics Data System (ADS)
Hanson-Hedgecock, S.; Bursik, M.; Rogova, G.
2007-12-01
We are developing a computer based intelligent system to correlate tephra layers by using the lithologic, mineralogic, and geochemical characteristics of field samples, to aid geologists in interpreting eruption patterns of volcanic chains and fields. The intelligent system is used to define groups of tephra source vents by utilizing geochemical data, and to correlate tephra layers based on lithostratigraphic characteristics. Understanding the eruption history of a volcano from stratigraphic studies is important for forecasting future eruptive behavior and hazards. In volcanic chains and fields with a complex eruptive history and no central vent, determining the spatio- temporal eruption patterns is difficult. Sedimentologic and chemical variability, and sparse sampling often result in relatively large variances and imprecision in the dataset. Lithostratigraphic and geochemical interpretation also depends on ones' level of expertise and can be subjective. The processing of lithostratigraphic features is conducted by a hybrid classifier, composed of supervised artificial neural networks (ANNs) combined within the framework of the Dempster-Shafer theory of evidence. Since lithostratigraphic features vary with distance from source, hypothetical vent locations are determined by using expert domain knowledge and geostatistical methods. Geochemical data are processed by a suit of fuzzy k- means classifiers. Each fuzzy k-means classifier assigns observations to multiple clusters with various degrees, called membership coefficients. The assignment minimizes a function of the total distance between the centers of clusters and the individual geochemical data patterns weighed by the membership coefficients. Improved clustering results of geochemical data are achieved by the fusion of individual clustering results with an evidential combination method. Lithostratigraphic data from individual tephra beds of the North Mono eruption sequence are used to test the effectiveness of the intelligent system for tephra layer correlation. Geochemical data from tephra bedsets of the Mono and Inyo Craters, CA, are used to test the effectiveness of the intelligent system for eruption sequence correlation. The intelligent system aids correlation by showing matches and disparities between data patterns from different outcrops that may have been overlooked in initial interpretations. Initial results show that the lithostratigraphic classifier is able to accurately differentiate known layers 76% of the time. Output from the lithostratigraphic classifier can furthermore be plotted directly as isopleth maps that can aid in rapid recognition of tephra layers as well as determination of eruption characteristics, e.g. eruption volume, plume height, etc. The intelligent system produces a useful recognition result, while dealing with the uncertainty from sparse data and the imprecise description of layer characteristics.
Gupta, Mayetri; Cheung, Ching-Lung; Hsu, Yi-Hsiang; Demissie, Serkalem; Cupples, L Adrienne; Kiel, Douglas P; Karasik, David
2011-06-01
Genome-wide association studies (GWAS) using high-density genotyping platforms offer an unbiased strategy to identify new candidate genes for osteoporosis. It is imperative to be able to clearly distinguish signal from noise by focusing on the best phenotype in a genetic study. We performed GWAS of multiple phenotypes associated with fractures [bone mineral density (BMD), bone quantitative ultrasound (QUS), bone geometry, and muscle mass] with approximately 433,000 single-nucleotide polymorphisms (SNPs) and created a database of resulting associations. We performed analysis of GWAS data from 23 phenotypes by a novel modification of a block clustering algorithm followed by gene-set enrichment analysis. A data matrix of standardized regression coefficients was partitioned along both axes--SNPs and phenotypes. Each partition represents a distinct cluster of SNPs that have similar effects over a particular set of phenotypes. Application of this method to our data shows several SNP-phenotype connections. We found a strong cluster of association coefficients of high magnitude for 10 traits (BMD at several skeletal sites, ultrasound measures, cross-sectional bone area, and section modulus of femoral neck and shaft). These clustered traits were highly genetically correlated. Gene-set enrichment analyses indicated the augmentation of genes that cluster with the 10 osteoporosis-related traits in pathways such as aldosterone signaling in epithelial cells, role of osteoblasts, osteoclasts, and chondrocytes in rheumatoid arthritis, and Parkinson signaling. In addition to several known candidate genes, we also identified PRKCH and SCNN1B as potential candidate genes for multiple bone traits. In conclusion, our mining of GWAS results revealed the similarity of association results between bone strength phenotypes that may be attributed to pleiotropic effects of genes. This knowledge may prove helpful in identifying novel genes and pathways that underlie several correlated phenotypes, as well as in deciphering genetic and phenotypic modularity underlying osteoporosis risk. Copyright © 2011 American Society for Bone and Mineral Research.
Mullan, B F; Madsen, M T; Messerle, L; Kolesnichenko, V; Kruger, J
2000-04-01
The purpose of this study was to examine the radiologic attenuation properties of the parent cluster compounds, particularly attenuation as a function of discrete photon energy, before investigating ligand substitutions, which are necessary to improve cluster biocompatibility and to impart desirable physicochemical properties. The linear attenuation coefficients for solutions of the cluster compounds Ta6Br14, K8Ta6O19, and (H3O)2W6Cl14 were determined at 60, 80, 103, 122, and 140 keV from gamma-ray transmission measurements with americium-241, xenon-133, gadolinium-153, cobalt-57, and technetium-99m radioactive sources. Transmission measurements were obtained for a fixed time interval that ensured a statistically accurate count distribution exceeding 20,000 counts through the sample for each trial. On a strictly mole per liter basis, a 0.075 mol/L aqueous solution of K8Ta6O19 showed 1.08 times the attenuation of 0.063 mol/L aqueous iohexol at 60 keV and 3.30 times the attenuation at 80 keV. Similarly, a 0.05 mol/L methanolic solution of (H3O)2W6Cl4 showed 0.96 times (96%) the attenuation of 0.063 mol/L aqueous iohexol at 60 keV but 3.09 times the attenuation of the iohexol solution at 80 keV. Attenuations of 0.063 mol/L aqueous iohexol and 0.0125 mol/L Ta6Br14 (ie, at approximately one-fifth the iohexol concentration) were comparable at greater than 60 keV. These results confirm the theoretic potential for use of early transition metal cluster compounds as radiographic contrast agents. At higher x-ray energies, cluster compounds demonstrate multiplied x-ray attenuation relative to iodinated contrast agents.
NASA Astrophysics Data System (ADS)
Zhao, P.; Peng, Z.
2008-12-01
We systemically identify repeating earthquakes and investigate spatio-temporal variations of fault zone properties associated with the 2004 Mw6.0 Parkfield earthquake along the Parkfield section of the San Andreas fault, and the 1984 Mw6.2 Morgan Hill earthquake along the central Calaveras fault. The procedure for identifying repeating earthquakes is based on overlapping of the source regions and the waveform similarity, and is briefly described as follows. First, we estimate the source radius of each event based on a circular crack model and a normal stress drop of 3 MPa. Next, we compute inter-hypocentral distance for events listed in the relocated catalog of Thurber et al. (2006) around Parkfield, and Schaff et al. (2002) along the Calaveras fault. Then, we group all events into 'initial' clusters by requiring the separation distance between each event pair to be less than the source radius of larger event, and their magnitude difference to be less than 1. Next, we calculate the correlation coefficients between every event pair within each 'initial' cluster using a 3-s time window around the direct P waves for all available stations. The median value of the correlation coefficients is used as a measure of similarity between each event pair. We drop an event if the median similarity to the rest events in that cluster is less than 0.9. After identifying repeating clusters in both regions, our next step is to apply a sliding window waveform cross-correlation technique (Niu et al., 2003; Peng and Ben-Zion, 2006) to calculate the delay time and decorrelation index for each repeating cluster. By measuring temporal changes in waveforms of repeating clusters at different locations and depth, we hope to obtain a better constraint on spatio-temporal variations of fault zone properties and near-surface layers associated with the occurrence of major earthquakes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, C; Noid, G; Dalah, E
2015-06-15
Purpose: It has been reported recently that the change of CT number (CTN) during and after radiation therapy (RT) may be used to assess RT response. The purpose of this work is to develop a tool to automatically segment the regions with differentiating CTN and/or with change of CTN in a series of CTs. Methods: A software tool was developed to identify regions with differentiating CTN using K-mean Cluster of CT numbers and to automatically delineate these regions using convex hull enclosing method. Pre- and Post-RT CT, PET, or MRI images acquired for sample lung and pancreatic cancer cases weremore » used to test the software tool. K-mean cluster of CT numbers within the gross tumor volumes (GTVs) delineated based on PET SUV (standard uptake value of fludeoxyglucose) and/or MRI ADC (apparent diffusion coefficient) map was analyzed. The cluster centers with higher value were considered as active tumor volumes (ATV). The convex hull contours enclosing preset clusters were used to delineate these ATVs with color washed displays. The CTN defined ATVs were compared with the SUV- or ADC-defined ATVs. Results: CTN stability of the CT scanner used to acquire the CTs in this work is less than 1.5 Hounsfield Unit (HU) variation annually. K-mean cluster centers in the GTV have difference of ∼20 HU, much larger than variation due to CTN stability, for the lung cancer cases studied. The dice coefficient between the ATVs delineated based on convex hull enclosure of high CTN centers and the PET defined GTVs based on SUV cutoff value of 2.5 was 90(±5)%. Conclusion: A software tool was developed using K-mean cluster and convex hull contour to automatically segment high CTN regions which may not be identifiable using a simple threshold method. These CTN regions were reasonably overlapped with the PET or MRI defined GTVs.« less
EXTENDED STAR FORMATION IN THE INTERMEDIATE-AGE LARGE MAGELLANIC CLOUD STAR CLUSTER NGC 2209
DOE Office of Scientific and Technical Information (OSTI.GOV)
Keller, Stefan C.; Mackey, A. Dougal; Da Costa, Gary S.
2012-12-10
We present observations of the 1 Gyr old star cluster NGC 2209 in the Large Magellanic Cloud made with the GMOS imager on the Gemini South Telescope. These observations show that the cluster exhibits a main-sequence turnoff that spans a broader range in luminosity than can be explained by a single-aged stellar population. This places NGC 2209 amongst a growing list of intermediate-age (1-3 Gyr) clusters that show evidence for extended or multiple epochs of star formation of between 50 and 460 Myr in extent. The extended main-sequence turnoff observed in NGC 2209 is a confirmation of the prediction inmore » Keller et al. made on the basis of the cluster's large core radius. We propose that secondary star formation is a defining feature of the evolution of massive star clusters. Dissolution of lower mass clusters through evaporation results in only clusters that have experienced secondary star formation surviving for a Hubble time, thus providing a natural connection between the extended main-sequence turnoff phenomenon and the ubiquitous light-element abundance ranges seen in the ancient Galactic globular clusters.« less
Gene essentiality and the topology of protein interaction networks
Coulomb, Stéphane; Bauer, Michel; Bernard, Denis; Marsolier-Kergoat, Marie-Claude
2005-01-01
The mechanistic bases for gene essentiality and for cell mutational resistance have long been disputed. The recent availability of large protein interaction databases has fuelled the analysis of protein interaction networks and several authors have proposed that gene dispensability could be strongly related to some topological parameters of these networks. However, many results were based on protein interaction data whose biases were not taken into account. In this article, we show that the essentiality of a gene in yeast is poorly related to the number of interactants (or degree) of the corresponding protein and that the physiological consequences of gene deletions are unrelated to several other properties of proteins in the interaction networks, such as the average degrees of their nearest neighbours, their clustering coefficients or their relative distances. We also found that yeast protein interaction networks lack degree correlation, i.e. a propensity for their vertices to associate according to their degrees. Gene essentiality and more generally cell resistance against mutations thus seem largely unrelated to many parameters of protein network topology. PMID:16087428
Sizing the star cluster population of the Large Magellanic Cloud
NASA Astrophysics Data System (ADS)
Piatti, Andrés E.
2018-04-01
The number of star clusters that populate the Large Magellanic Cloud (LMC) at deprojected distances <4 deg has been recently found to be nearly double the known size of the system. Because of the unprecedented consequences of this outcome in our knowledge of the LMC cluster formation and dissolution histories, we closely revisited such a compilation of objects and found that only ˜35 per cent of the previously known catalogued clusters have been included. The remaining entries are likely related to stellar overdensities of the LMC composite star field, because there is a remarkable enhancement of objects with assigned ages older than log(t yr-1) ˜ 9.4, which contrasts with the existence of the LMC cluster age gap; the assumption of a cluster formation rate similar to that of the LMC star field does not help to conciliate so large amount of clusters either; and nearly 50 per cent of them come from cluster search procedures known to produce more than 90 per cent of false detections. The lack of further analyses to confirm the physical reality as genuine star clusters of the identified overdensities also glooms those results. We support that the actual size of the LMC main body cluster population is close to that previously known.
Understanding network concepts in modules
2007-01-01
Background Network concepts are increasingly used in biology and genetics. For example, the clustering coefficient has been used to understand network architecture; the connectivity (also known as degree) has been used to screen for cancer targets; and the topological overlap matrix has been used to define modules and to annotate genes. Dozens of potentially useful network concepts are known from graph theory. Results Here we study network concepts in special types of networks, which we refer to as approximately factorizable networks. In these networks, the pairwise connection strength (adjacency) between 2 network nodes can be factored into node specific contributions, named node 'conformity'. The node conformity turns out to be highly related to the connectivity. To provide a formalism for relating network concepts to each other, we define three types of network concepts: fundamental-, conformity-based-, and approximate conformity-based concepts. Fundamental concepts include the standard definitions of connectivity, density, centralization, heterogeneity, clustering coefficient, and topological overlap. The approximate conformity-based analogs of fundamental network concepts have several theoretical advantages. First, they allow one to derive simple relationships between seemingly disparate networks concepts. For example, we derive simple relationships between the clustering coefficient, the heterogeneity, the density, the centralization, and the topological overlap. The second advantage of approximate conformity-based network concepts is that they allow one to show that fundamental network concepts can be approximated by simple functions of the connectivity in module networks. Conclusion Using protein-protein interaction, gene co-expression, and simulated data, we show that a) many networks comprised of module nodes are approximately factorizable and b) in these types of networks, simple relationships exist between seemingly disparate network concepts. Our results are implemented in freely available R software code, which can be downloaded from the following webpage: http://www.genetics.ucla.edu/labs/horvath/ModuleConformity/ModuleNetworks PMID:17547772
Rowland, Jared A; Stapleton-Kotloski, Jennifer R; Dobbins, Dorothy L; Rogers, Emily; Godwin, Dwayne W; Taber, Katherine H
2018-05-01
Cross-sectional and longitudinal studies in active duty and veteran cohorts have both demonstrated that deployment-acquired traumatic brain injury (TBI) is an independent risk factor for developing post-traumatic stress disorder (PTSD), beyond confounds such as combat exposure, physical injury, predeployment TBI, and pre-deployment psychiatric symptoms. This study investigated how resting-state brain networks differ between individuals who developed PTSD and those who did not following deployment-acquired TBI. Participants included postdeployment veterans with deployment-acquired TBI history both with and without current PTSD diagnosis. Graph metrics, including small-worldness, clustering coefficient, and modularity, were calculated from individually constructed whole-brain networks based on 5-min eyes-open resting-state magnetoencephalography (MEG) recordings. Analyses were adjusted for age and premorbid IQ. Results demonstrated that participants with current PTSD displayed higher levels of small-worldness, F(1,12) = 5.364, p < 0.039, partial eta squared = 0.309, and Cohen's d = 0.972, and clustering coefficient, F(1, 12) = 12.204, p < 0.004, partial eta squared = 0.504, and Cohen's d = 0.905, than participants without current PTSD. There were no between-group differences in modularity or the number of modules present. These findings are consistent with a hyperconnectivity hypothesis of the effect of TBI history on functional networks rather than a disconnection hypothesis, demonstrating increased levels of clustering coefficient rather than a decrease as might be expected; however, these results do not account for potential changes in brain structure. These results demonstrate the potential pathological sequelae of changes in functional brain networks following deployment-acquired TBI and represent potential neurobiological changes associated with deployment-acquired TBI that may increase the risk of subsequently developing PTSD.
Discrete Cosine Transform Image Coding With Sliding Block Codes
NASA Astrophysics Data System (ADS)
Divakaran, Ajay; Pearlman, William A.
1989-11-01
A transform trellis coding scheme for images is presented. A two dimensional discrete cosine transform is applied to the image followed by a search on a trellis structured code. This code is a sliding block code that utilizes a constrained size reproduction alphabet. The image is divided into blocks by the transform coding. The non-stationarity of the image is counteracted by grouping these blocks in clusters through a clustering algorithm, and then encoding the clusters separately. Mandela ordered sequences are formed from each cluster i.e identically indexed coefficients from each block are grouped together to form one dimensional sequences. A separate search ensues on each of these Mandela ordered sequences. Padding sequences are used to improve the trellis search fidelity. The padding sequences absorb the error caused by the building up of the trellis to full size. The simulations were carried out on a 256x256 image ('LENA'). The results are comparable to any existing scheme. The visual quality of the image is enhanced considerably by the padding and clustering.
Numerical taxonomy of heavy metal tolerant bacteria isolated from the estuarine environment
DOE Office of Scientific and Technical Information (OSTI.GOV)
Allen, D.A.; Austin, B.; Mills, A.L.
1977-01-01
Metal tolerant bacteria, totalling 301 strains, were isolated from water and sediment samples collected from Chesapeake Bay. Growth in the presence of 100 ppm cadmium, chromium, cobalt, lead, mercury and molybdenum was tested. In addition, the strains were examined for 118 biochemical, cultural, morphological, nutritional and physiological, characters and the data were analyzed by computer, using the simple matching and Jaccard coefficients. From sorted similarity matrices, 293 strains, 97% of the total, were removed in 12 clusters defined at the 80 to 85% similarity level. The clusters included Bacillus and Pseudomonas spp. and genera and species of Enterobacteriaceae. Three clusters,more » containing gram negative rods, were not identified. Several of the clusters were composed of strains exhibiting tolerance to a wide range of heavy metals, whereas three of the clusters contained bacteria that were capable of growth in the presence of only a few of the metals examined in this study. Antibiotic resistance of the metal resistant strains has also been examined.« less
Numerical taxonomy and ecology of petroleum-degrading bacteria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Austin, B.; Calomiris, J.J.; Walker, J.D.
1977-07-01
A total of 99 strains of petroleum-degrading bacteria isolated from Chesapeake Bay water and sediment were identified by using numerical taxonomy procedures. The isolates, together with 33 reference cultures, were examined for 48 biochemical, cultural, morphological, and physiological characters. The data were analyzed by computer, using both the simple matching and the Jaccard coefficients. Clustering was achieved by the unweighted average linkage method. From the sorted similarity matrix and dendrogram, 14 phenetic groups, comprising 85 of the petroleum-degrading bacteria, were defined at the 80 to 85% similarity level. These groups were identified as actinomycetes (mycelial forms, four clusters), coryneforms, Enterobacteriaceae,more » Klebsiella aerogenes, Micrococcus spp. (two clusters), Nocardia species (two clusters), Pseudomonas spp. (two clusters), and Sphaerotilus natans. It is concluded that the degradation of petroleum is accomplished by a diverse range of bacterial taxa, some of which were isolated only at given sampling stations and, more specifically, from sediment collected at a given station.« less
Classification of arrhythmia using hybrid networks.
Haseena, Hassan H; Joseph, Paul K; Mathew, Abraham T
2011-12-01
Reliable detection of arrhythmias based on digital processing of Electrocardiogram (ECG) signals is vital in providing suitable and timely treatment to a cardiac patient. Due to corruption of ECG signals with multiple frequency noise and presence of multiple arrhythmic events in a cardiac rhythm, computerized interpretation of abnormal ECG rhythms is a challenging task. This paper focuses a Fuzzy C- Mean (FCM) clustered Probabilistic Neural Network (PNN) and Multi Layered Feed Forward Network (MLFFN) for the discrimination of eight types of ECG beats. Parameters such as fourth order Auto Regressive (AR) coefficients along with Spectral Entropy (SE) are extracted from each ECG beat and feature reduction has been carried out using FCM clustering. The cluster centers form the input of neural network classifiers. The extensive analysis of Massachusetts Institute of Technology- Beth Israel Hospital (MIT-BIH) arrhythmia database shows that FCM clustered PNNs is superior in cardiac arrhythmia classification than FCM clustered MLFFN with an overall accuracy of 99.05%, 97.14%, respectively.
Electrical Load Profile Analysis Using Clustering Techniques
NASA Astrophysics Data System (ADS)
Damayanti, R.; Abdullah, A. G.; Purnama, W.; Nandiyanto, A. B. D.
2017-03-01
Data mining is one of the data processing techniques to collect information from a set of stored data. Every day the consumption of electricity load is recorded by Electrical Company, usually at intervals of 15 or 30 minutes. This paper uses a clustering technique, which is one of data mining techniques to analyse the electrical load profiles during 2014. The three methods of clustering techniques were compared, namely K-Means (KM), Fuzzy C-Means (FCM), and K-Means Harmonics (KHM). The result shows that KHM is the most appropriate method to classify the electrical load profile. The optimum number of clusters is determined using the Davies-Bouldin Index. By grouping the load profile, the demand of variation analysis and estimation of energy loss from the group of load profile with similar pattern can be done. From the group of electric load profile, it can be known cluster load factor and a range of cluster loss factor that can help to find the range of values of coefficients for the estimated loss of energy without performing load flow studies.
New potential energy surface for the HCS{sup +}–He system and inelastic rate coefficients
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dubernet, Marie-Lise; Quintas-Sánchez, Ernesto; Tuckey, Philip
2015-07-28
A new high quality potential energy surface is calculated at a coupled-cluster single double triple level with an aug-cc-pV5Z basis set for the HCS{sup +}–He system. This potential energy surface is used in low energy quantum scattering calculations to provide a set of (de)-excitation cross sections and rate coefficients among the first 20 rotational levels of HCS{sup +} by He in the range of temperature from 5 K to 100 K. The paper discusses the impact of the new ab initio potential energy surface on the cross sections at low energy and provides a comparison with the HCO{sup +}–He system.more » The HCS{sup +}–He rate coefficients for the strongest transitions differ by factors of up to 2.5 from previous rate coefficients; thus, analysis of astrophysical spectra should be reconsidered with the new rate coefficients.« less
NASA Astrophysics Data System (ADS)
Mackey, A. D.; Gilmore, G. F.
2003-01-01
We have compiled a pseudo-snapshot data set of two-colour observations from the Hubble Space Telescope archive for a sample of 53 rich LMC clusters with ages of 106-1010 yr. We present surface brightness profiles for the entire sample, and derive structural parameters for each cluster, including core radii, and luminosity and mass estimates. Because we expect the results presented here to form the basis for several further projects, we describe in detail the data reduction and surface brightness profile construction processes, and compare our results with those of previous ground-based studies. The surface brightness profiles show a large amount of detail, including irregularities in the profiles of young clusters (such as bumps, dips and sharp shoulders), and evidence for both double clusters and post-core-collapse (PCC) clusters. In particular, we find power-law profiles in the inner regions of several candidate PCC clusters, with slopes of approximately -0.7, but showing considerable variation. We estimate that 20 +/- 7 per cent of the old cluster population of the Large Magellanic Cloud (LMC) has entered PCC evolution, a similar fraction to that for the Galactic globular cluster system. In addition, we examine the profile of R136 in detail and show that it is probably not a PCC cluster. We also observe a trend in core radius with age that has been discovered and discussed in several previous publications by different authors. Our diagram has better resolution, however, and appears to show a bifurcation at several hundred Myr. We argue that this observed relationship reflects true physical evolution in LMC clusters, with some experiencing small-scale core expansion owing to mass loss, and others large-scale expansion owing to some unidentified characteristic or physical process.
The observed clustering of damaging extra-tropical cyclones in Europe
NASA Astrophysics Data System (ADS)
Cusack, S.
2015-12-01
The clustering of severe European windstorms on annual timescales has substantial impacts on the re/insurance industry. Management of the risk is impaired by large uncertainties in estimates of clustering from historical storm datasets typically covering the past few decades. The uncertainties are unusually large because clustering depends on the variance of storm counts. Eight storm datasets are gathered for analysis in this study in order to reduce these uncertainties. Six of the datasets contain more than 100~years of severe storm information to reduce sampling errors, and the diversity of information sources and analysis methods between datasets sample observational errors. All storm severity measures used in this study reflect damage, to suit re/insurance applications. It is found that the shortest storm dataset of 42 years in length provides estimates of clustering with very large sampling and observational errors. The dataset does provide some useful information: indications of stronger clustering for more severe storms, particularly for southern countries off the main storm track. However, substantially different results are produced by removal of one stormy season, 1989/1990, which illustrates the large uncertainties from a 42-year dataset. The extended storm records place 1989/1990 into a much longer historical context to produce more robust estimates of clustering. All the extended storm datasets show a greater degree of clustering with increasing storm severity and suggest clustering of severe storms is much more material than weaker storms. Further, they contain signs of stronger clustering in areas off the main storm track, and weaker clustering for smaller-sized areas, though these signals are smaller than uncertainties in actual values. Both the improvement of existing storm records and development of new historical storm datasets would help to improve management of this risk.
[Genetic diversity of wild Cynodon dactylon germplasm detected by SRAP markers].
Yi, Yang-Jie; Zhang, Xin-Quan; Huang, Lin-Kai; Ling, Yao; Ma, Xiao; Liu, Wei
2008-01-01
Sequence-related amplified polymorphism (SRAP) molecular markers were used to detect the genetic diversity of 32 wild accessions of Cynodon dactylon collected from Sichuan, Chongqing, Guizhou and Tibet, China. The following results were obtained. (1) Fourteen primer pairs produced 132 polymorphic bands, averaged 9.4 bands per primer pair. The percentage of polymorphic bands in average was 79.8%. The Nei's genetic similarity coefficient of the tested accessions ranged from 0.591 to 0.957, and the average Nei's coefficient was 0.759. These results suggested that there was rich genetic diversity among the wild resources of Cynodon dactylon tested. (2) Thirty two wild accessions were clustered into four groups. Moreover, the accessions from the same origin frequently clustered into one group. The findings implied that a correlation among the wild resources, geographical and ecological environment. (3) Genetic differentiation between and within six eco-geographical groups of C. dactylon was estimated by Shannon's diversity index, which showed that 65.56% genetic variance existed within group, and 34.44% genetic variance was among groups. (4) Based on Nei's unbiased measures of genetic identity, UPGMA cluster analysis measures of six eco-geographical groups of Cynodon dactylon, indicated that there was a correlation between genetic differentiation and eco-geographical habits among the groups.
Pulse-coupled mixed-mode oscillators: Cluster states and extreme noise sensitivity
NASA Astrophysics Data System (ADS)
Karamchandani, Avinash J.; Graham, James N.; Riecke, Hermann
2018-04-01
Motivated by rhythms in the olfactory system of the brain, we investigate the synchronization of all-to-all pulse-coupled neuronal oscillators exhibiting various types of mixed-mode oscillations (MMOs) composed of sub-threshold oscillations (STOs) and action potentials ("spikes"). We focus particularly on the impact of the delay in the interaction. In the weak-coupling regime, we reduce the system to a Kuramoto-type equation with non-sinusoidal phase coupling and the associated Fokker-Planck equation. Its linear stability analysis identifies the appearance of various cluster states. Their type depends sensitively on the delay and the width of the pulses. Interestingly, long delays do not imply slow population rhythms, and the number of emerging clusters only loosely depends on the number of STOs. Direct simulations of the oscillator equations reveal that for quantitative agreement of the weak-coupling theory the coupling strength and the noise have to be extremely small. Even moderate noise leads to significant skipping of STO cycles, which can enhance the diffusion coefficient in the Fokker-Planck equation by two orders of magnitude. Introducing an effective diffusion coefficient extends the range of agreement significantly. Numerical simulations of the Fokker-Planck equation reveal bistability and solutions with oscillatory order parameters that result from nonlinear mode interactions. These are confirmed in simulations of the full spiking model.
Tian, Lixia; Wang, Jinhui; Yan, Chaogan; He, Yong
2011-01-01
We employed resting-state functional MRI (R-fMRI) to investigate hemisphere- and gender-related differences in the topological organization of human brain functional networks. Brain networks were first constructed by measuring inter-regional temporal correlations of R-fMRI data within each hemisphere in 86 young, healthy, right-handed adults (38 males and 48 females) followed by a graph-theory analysis. The hemispheric networks exhibit small-world attributes (high clustering and short paths) that are compatible with previous results in the whole-brain functional networks. Furthermore, we found that compared with females, males have a higher normalized clustering coefficient in the right hemispheric network but a lower clustering coefficient in the left hemispheric network, suggesting a gender-hemisphere interaction. Moreover, we observed significant hemisphere-related differences in the regional nodal characteristics in various brain regions, such as the frontal and occipital regions (leftward asymmetry) and the temporal regions (rightward asymmetry), findings that are consistent with previous studies of brain structural and functional asymmetries. Together, our results suggest that the topological organization of human brain functional networks is associated with gender and hemispheres, and they provide insights into the understanding of functional substrates underlying individual differences in behaviors and cognition. Copyright © 2010 Elsevier Inc. All rights reserved.
Anomaly detection of flight routes through optimal waypoint
NASA Astrophysics Data System (ADS)
Pusadan, M. Y.; Buliali, J. L.; Ginardi, R. V. H.
2017-01-01
Deciding factor of flight, one of them is the flight route. Flight route determined by coordinate (latitude and longitude). flight routed is determined by its coordinates (latitude and longitude) as defined is waypoint. anomaly occurs, if the aircraft is flying outside the specified waypoint area. In the case of flight data, anomalies occur by identifying problems of the flight route based on data ADS-B. This study has an aim of to determine the optimal waypoints of the flight route. The proposed methods: i) Agglomerative Hierarchical Clustering (AHC) in several segments based on range area coordinates (latitude and longitude) in every waypoint; ii) The coefficient cophenetics correlation (c) to determine the correlation between the members in each cluster; iii) cubic spline interpolation as a graphic representation of the has connected between the coordinates on every waypoint; and iv). Euclidean distance to measure distances between waypoints with 2 centroid result of clustering AHC. The experiment results are value of coefficient cophenetics correlation (c): 0,691≤ c ≤ 0974, five segments the generated of the range area waypoint coordinates, and the shortest and longest distance between the centroid with waypoint are 0.46 and 2.18. Thus, concluded that the shortest distance is used as the reference coordinates of optimal waypoint, and farthest distance can be indicated potentially detected anomaly.
Friction Durability of Extremely Thin Diamond-Like Carbon Films at High Temperature
Miyake, Shojiro; Suzuki, Shota; Miyake, Masatoshi
2017-01-01
To clarify the friction durability, both during and after the high-temperature heating of nanometer-thick diamond-like carbon (DLC) films, deposited using filtered cathodic vacuum arc (FCVA) and plasma chemical vapor deposition (P-CVD) methods, the dependence of the friction coefficient on the load and sliding cycles of the DLC films, were evaluated. Cluster-I consisted of a low friction area in which the DLC film was effective, while cluster-II consisted of a high friction area in which the lubricating effect of the DLC film was lost. The friction durability of the films was evaluated by statistical cluster analysis. Extremely thin FCVA-DLC films exhibited an excellent wear resistance at room temperature, but their friction durability was decreased at high temperatures. In contrast, the durability of the P-CVD-DLC films was increased at high temperatures when compared with that observed at room temperature. This inverse dependence on temperature corresponded to the nano-friction results obtained by atomic force microscopy. The decrease in the friction durability of the FCVA-DLC films at high temperatures, was caused by a complex effect of temperature and friction. The tribochemical reaction produced by the P-CVD-DLC films reduced their friction coefficient, increasing their durability at high temperatures. PMID:28772520
Friction Durability of Extremely Thin Diamond-Like Carbon Films at High Temperature.
Miyake, Shojiro; Suzuki, Shota; Miyake, Masatoshi
2017-02-10
To clarify the friction durability, both during and after the high-temperature heating of nanometer-thick diamond-like carbon (DLC) films, deposited using filtered cathodic vacuum arc (FCVA) and plasma chemical vapor deposition (P-CVD) methods, the dependence of the friction coefficient on the load and sliding cycles of the DLC films, were evaluated. Cluster-I consisted of a low friction area in which the DLC film was effective, while cluster-II consisted of a high friction area in which the lubricating effect of the DLC film was lost. The friction durability of the films was evaluated by statistical cluster analysis. Extremely thin FCVA-DLC films exhibited an excellent wear resistance at room temperature, but their friction durability was decreased at high temperatures. In contrast, the durability of the P-CVD-DLC films was increased at high temperatures when compared with that observed at room temperature. This inverse dependence on temperature corresponded to the nano-friction results obtained by atomic force microscopy. The decrease in the friction durability of the FCVA-DLC films at high temperatures, was caused by a complex effect of temperature and friction. The tribochemical reaction produced by the P-CVD-DLC films reduced their friction coefficient, increasing their durability at high temperatures.
Li, Sheng; Zöllner, Frank G; Merrem, Andreas D; Peng, Yinghong; Roervik, Jarle; Lundervold, Arvid; Schad, Lothar R
2012-03-01
Renal diseases can lead to kidney failure that requires life-long dialysis or renal transplantation. Early detection and treatment can prevent progression towards end stage renal disease. MRI has evolved into a standard examination for the assessment of the renal morphology and function. We propose a wavelet-based clustering to group the voxel time courses and thereby, to segment the renal compartments. This approach comprises (1) a nonparametric, discrete wavelet transform of the voxel time course, (2) thresholding of the wavelet coefficients using Stein's Unbiased Risk estimator, and (3) k-means clustering of the wavelet coefficients to segment the kidneys. Our method was applied to 3D dynamic contrast enhanced (DCE-) MRI data sets of human kidney in four healthy volunteers and three patients. On average, the renal cortex in the healthy volunteers could be segmented at 88%, the medulla at 91%, and the pelvis at 98% accuracy. In the patient data, with aberrant voxel time courses, the segmentation was also feasible with good results for the kidney compartments. In conclusion wavelet based clustering of DCE-MRI of kidney is feasible and a valuable tool towards automated perfusion and glomerular filtration rate quantification. Copyright © 2011 Elsevier Ltd. All rights reserved.
Mutoru, J W; Smith, W; O'Hern, C S; Firoozabadi, A
2013-01-14
Understanding the transport properties of molecular fluids in the critical region is important for a number of industrial and natural systems. In the literature, there are conflicting reports on the behavior of the self diffusion coefficient D(s) in the critical region of single-component molecular systems. For example, D(s) could decrease to zero, reach a maximum, or remain unchanged and finite at the critical point. Moreover, there is no molecular-scale understanding of the behavior of diffusion coefficients in molecular fluids in the critical regime. We perform extensive molecular dynamics simulations in the critical region of single-component fluids composed of medium-chain n-alkanes-n-pentane, n-decane, and n-dodecane-that interact via anisotropic united-atom potentials. For each system, we calculate D(s), and average molecular cluster sizes κ(cl) and numbers N(cl) at various cluster lifetimes τ, as a function of density ρ in the range 0.2ρ(c) ≤ ρ ≤ 2.0ρ(c) at the critical temperature T(c). We find that D(s) decreases with increasing ρ but remains finite at the critical point. Moreover, for any given τ < 1.2 × 10(-12) s, κ(cl) increases with increasing ρ but is also finite at the critical point.
Wang, Hongxiang; Wang, Wei; Hu, Dandan; Luo, Min; Xue, Chaozhuang; Li, Dongsheng; Wu, Tao
2018-06-04
Reported here is a unique crystalline semiconductor open-framework material built from the large-sized supertetrahedral T4 and T5 clusters with the Mn-In-S compositions. The hybrid assembly between T4 and T5 clusters by sharing terminal μ 2 -S 2- is for the first time observed among the cluster-based chalcogenide open frameworks. Such three-dimensional structure displays non-interpenetrated diamond-type topology with extra-large nonframework volume of 82%. Moreover, ion exchange, CO 2 adsorption, as well as photoluminescence properties of the title compound are also investigated.
Robustness of serial clustering of extra-tropical cyclones to the choice of tracking method
NASA Astrophysics Data System (ADS)
Pinto, Joaquim G.; Ulbrich, Sven; Karremann, Melanie K.; Stephenson, David B.; Economou, Theodoros; Shaffrey, Len C.
2016-04-01
Cyclone families are a frequent synoptic weather feature in the Euro-Atlantic area in winter. Given appropriate large-scale conditions, the occurrence of such series (clusters) of storms may lead to large socio-economic impacts and cumulative losses. Recent studies analyzing Reanalysis data using single cyclone tracking methods have shown that serial clustering of cyclones occurs on both flanks and downstream regions of the North Atlantic storm track. This study explores the sensitivity of serial clustering to the choice of tracking method. With this aim, the IMILAST cyclone track database based on ERA-interim data is analysed. Clustering is estimated by the dispersion (ratio of variance to mean) of winter (DJF) cyclones passages near each grid point over the Euro-Atlantic area. Results indicate that while the general pattern of clustering is identified for all methods, there are considerable differences in detail. This can primarily be attributed to the differences in the variance of cyclone counts between the methods, which range up to one order of magnitude. Nevertheless, clustering over the Eastern North Atlantic and Western Europe can be identified for all methods and can thus be generally considered as a robust feature. The statistical links between large-scale patterns like the NAO and clustering are obtained for all methods, though with different magnitudes. We conclude that the occurrence of cyclone clustering over the Eastern North Atlantic and Western Europe is largely independent from the choice of tracking method and hence from the definition of a cyclone.
Parallel Clustering Algorithm for Large-Scale Biological Data Sets
Wang, Minchao; Zhang, Wu; Ding, Wang; Dai, Dongbo; Zhang, Huiran; Xie, Hao; Chen, Luonan; Guo, Yike; Xie, Jiang
2014-01-01
Backgrounds Recent explosion of biological data brings a great challenge for the traditional clustering algorithms. With increasing scale of data sets, much larger memory and longer runtime are required for the cluster identification problems. The affinity propagation algorithm outperforms many other classical clustering algorithms and is widely applied into the biological researches. However, the time and space complexity become a great bottleneck when handling the large-scale data sets. Moreover, the similarity matrix, whose constructing procedure takes long runtime, is required before running the affinity propagation algorithm, since the algorithm clusters data sets based on the similarities between data pairs. Methods Two types of parallel architectures are proposed in this paper to accelerate the similarity matrix constructing procedure and the affinity propagation algorithm. The memory-shared architecture is used to construct the similarity matrix, and the distributed system is taken for the affinity propagation algorithm, because of its large memory size and great computing capacity. An appropriate way of data partition and reduction is designed in our method, in order to minimize the global communication cost among processes. Result A speedup of 100 is gained with 128 cores. The runtime is reduced from serval hours to a few seconds, which indicates that parallel algorithm is capable of handling large-scale data sets effectively. The parallel affinity propagation also achieves a good performance when clustering large-scale gene data (microarray) and detecting families in large protein superfamilies. PMID:24705246
Study of clustering structures through breakup reactions
NASA Astrophysics Data System (ADS)
Capel, Pierre
2014-12-01
Models for the description of breakup reactions used to study the structure of exotic cluster structures like halos are reviewed. The sensitivity of these models to the projectile description is presented. Calculations are sensitive to the projectile ground state mostly through its asymptotic normalisation coefficient (ANC). They also probe the continuum of the projectile. This enables studying not only resonant states of the projectile but also its non-resonant continuum both resonant and non-resonant. This opens the possibility to study correlations between both halo neutrons in two-neutron halo nuclei.
Mass to Luminosity Ratios of Some Clusters in the Large Magellanic Cloud
NASA Astrophysics Data System (ADS)
Sohn, Young-Jong; Chun, Mun-Suk
1990-12-01
Luminosity profiles and dynamical parameters of 12 globular clusters in the Large Magellanic Cloud(SB(s)m) are obtained from the concentric aperture photoelectric photometry of 3 different aged clusters and the collected photometric data of 9 clusters. The total masses of the globular clusters are the calculated using the equation M = Mrt3(4¥Ø2-k2), which is derived from the theoretical rotation curve for the exponential disk(Chun 1987). These masses lie between 0.3 x 104 and 15.8 x 104 M . From the determined total mass and luminosity ratios are also derived. The M/L ratio of a cluster increases with the cluster age; about 0.03 for the youngest clusters(SWB ¥°) and about 0.24 for the oldest clusters(SUB ¥¶). There is a difference in M/L by a factor of 10 between the galactic globular clusters and the old globular clusters in the LCM.
NASA Astrophysics Data System (ADS)
Liu, Tingguang; Xia, Shuang; Bai, Qin; Zhou, Bangxin; Zhang, Lefu; Lu, Yonghao; Shoji, Tetsuo
2018-01-01
The intergranular cracks and grain boundary (GB) network of a GB-engineered 316 stainless steel after stress corrosion cracking (SCC) test in high temperature high pressure water of reactor environment were investigated by two-dimensional and three-dimensional (3D) characterization in order to expose the mechanism that GB-engineering mitigates intergranular SCC. The 3D microstructure shown that the essential characteristic of the GB-engineered microstructure is formation of many large twin-boundaries as a result of multiple-twinning, which results in the formation of large grain-clusters. The large grain-clusters played a key role to the improvement of intergranular SCC resistance by GB-engineering. The main intergranular cracks propagated in a zigzag along the outer boundaries of these large grain-clusters because all inner boundaries of the grain-clusters were twin-boundaries (∑3) or twin-related boundaries (∑3n) which had much lower susceptibility to SCC than random boundaries. These large grain-clusters had tree-ring-shaped topology structure and very complex morphology. They got tangled so that difficult to be separated during SCC, resulting in some large crack-bridges retained in the crack surface.
"A Richness Study of 14 Distant X-Ray Clusters from the 160 Square Degree Survey"
NASA Technical Reports Server (NTRS)
Jones, Christine; West, Donald (Technical Monitor)
2001-01-01
We have measured the surface density of galaxies toward 14 X-ray-selected cluster candidates at redshifts z(sub i) 0.46, and we show that they are associated with rich galaxy concentrations. These clusters, having X-ray luminosities of Lx(0.5-2 keV) approx. (0.5 - 2.6) x 10(exp 44) ergs/ sec are among the most distant and luminous in our 160 deg(exp 2) ROSAT Position Sensitive Proportional Counter cluster survey. We find that the clusters range between Abell richness classes 0 and 2 and have a most probable richness class of 1. We compare the richness distribution of our distant clusters to those for three samples of nearby clusters with similar X-ray luminosities. We find that the nearby and distant samples have similar richness distributions, which shows that clusters have apparently not evolved substantially in richness since redshift z=0.5. There is, however, a marginal tendency for the distant clusters to be slightly poorer than nearby clusters, although deeper multicolor data for a large sample would be required to confirm this trend. We compare the distribution of distant X-ray clusters in the L(sub X)-richness plane to the distribution of optically selected clusters from the Palomar Distant Cluster Survey. The optically selected clusters appear overly rich for their X-ray luminosities, when compared to X-ray-selected clusters. Apparently, X-ray and optical surveys do not necessarily sample identical mass concentrations at large redshifts. This may indicate the existence of a population of optically rich clusters with anomalously low X-ray emission, More likely, however, it reflects the tendency for optical surveys to select unvirialized mass concentrations, as might be expected when peering along large-scale filaments.
Cluster galaxy dynamics and the effects of large-scale environment
NASA Astrophysics Data System (ADS)
White, Martin; Cohn, J. D.; Smit, Renske
2010-11-01
Advances in observational capabilities have ushered in a new era of multi-wavelength, multi-physics probes of galaxy clusters and ambitious surveys are compiling large samples of cluster candidates selected in different ways. We use a high-resolution N-body simulation to study how the influence of large-scale structure in and around clusters causes correlated signals in different physical probes and discuss some implications this has for multi-physics probes of clusters (e.g. richness, lensing, Compton distortion and velocity dispersion). We pay particular attention to velocity dispersions, matching galaxies to subhaloes which are explicitly tracked in the simulation. We find that not only do haloes persist as subhaloes when they fall into a larger host, but groups of subhaloes retain their identity for long periods within larger host haloes. The highly anisotropic nature of infall into massive clusters, and their triaxiality, translates into an anisotropic velocity ellipsoid: line-of-sight galaxy velocity dispersions for any individual halo show large variance depending on viewing angle. The orientation of the velocity ellipsoid is correlated with the large-scale structure, and thus velocity outliers correlate with outliers caused by projection in other probes. We quantify this orientation uncertainty and give illustrative examples. Such a large variance suggests that velocity dispersion estimators will work better in an ensemble sense than for any individual cluster, which may inform strategies for obtaining redshifts of cluster members. We similarly find that the ability of substructure indicators to find kinematic substructures is highly viewing angle dependent. While groups of subhaloes which merge with a larger host halo can retain their identity for many Gyr, they are only sporadically picked up by substructure indicators. We discuss the effects of correlated scatter on scaling relations estimated through stacking, both analytically and in the simulations, showing that the strong correlation of measures with mass and the large scatter in mass at fixed observable mitigate line-of-sight projections.
Individual and group-level job resources and their relationships with individual work engagement
Füllemann, Désirée; Brauchli, Rebecca; Jenny, Gregor J.; Bauer, Georg F.
2016-01-01
Objectives: This study adds a multilevel perspective to the well-researched individual-level relationship between job resources and work engagement. In addition, we explored whether individual job resources cluster within work groups because of a shared psychosocial environment and investigated whether a resource-rich psychosocial work group environment is beneficial for employee engagement over and above the beneficial effect of individual job resources and independent of their variability within groups. Methods: Data of 1,219 employees nested in 103 work groups were obtained from a baseline employee survey of a large stress management intervention project implemented in six medium and large-sized organizations in diverse sectors. A variety of important job resources were assessed and grouped to an overall job resource factor with three subfactors (manager behavior, peer behavior, and task-related resources). Data were analyzed using multilevel random coefficient modeling. Results: The results indicated that job resources cluster within work groups and can be aggregated to a group-level job resources construct. However, a resource-rich environment, indicated by high group-level job resources, did not additionally benefit employee work engagement but on the contrary, was negatively related to it. Conclusions: On the basis of this unexpected result, replication studies are encouraged and suggestions for future studies on possible underlying within-group processes are discussed. The study supports the presumed value of integrating work group as a relevant psychosocial environment into the motivational process and indicates a need to further investigate emergent processes involved in aggregation procedures across levels. PMID:27108639
Seman, Ali; Sapawi, Azizian Mohd; Salleh, Mohd Zaki
2015-06-01
Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84-1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.
Spatial scan statistics for detection of multiple clusters with arbitrary shapes.
Lin, Pei-Sheng; Kung, Yi-Hung; Clayton, Murray
2016-12-01
In applying scan statistics for public health research, it would be valuable to develop a detection method for multiple clusters that accommodates spatial correlation and covariate effects in an integrated model. In this article, we connect the concepts of the likelihood ratio (LR) scan statistic and the quasi-likelihood (QL) scan statistic to provide a series of detection procedures sufficiently flexible to apply to clusters of arbitrary shape. First, we use an independent scan model for detection of clusters and then a variogram tool to examine the existence of spatial correlation and regional variation based on residuals of the independent scan model. When the estimate of regional variation is significantly different from zero, a mixed QL estimating equation is developed to estimate coefficients of geographic clusters and covariates. We use the Benjamini-Hochberg procedure (1995) to find a threshold for p-values to address the multiple testing problem. A quasi-deviance criterion is used to regroup the estimated clusters to find geographic clusters with arbitrary shapes. We conduct simulations to compare the performance of the proposed method with other scan statistics. For illustration, the method is applied to enterovirus data from Taiwan. © 2016, The International Biometric Society.
ERIC Educational Resources Information Center
Robertson, C. T.
1973-01-01
Discusses theories underlying the phenomena of solution viscosities, involving the Jones and Dole equation, B-coefficient determination, and flickering cluster model. Indicates that viscosity measurements provide a basis for the study of the structural effects of ions in aqueous solutions and are applicable in teaching high school chemistry. (CC)
Jin, Yonghong; Zhang, Qi; Shan, Lifei; Li, Sai-Ping
2015-01-01
Financial networks have been extensively studied as examples of real world complex networks. In this paper, we establish and study the network of venture capital (VC) firms in China. We compute and analyze the statistical properties of the network, including parameters such as degrees, mean lengths of the shortest paths, clustering coefficient and robustness. We further study the topology of the network and find that it has small-world behavior. A multiple linear regression model is introduced to study the relation between network parameters and major regional economic indices in China. From the result of regression, we find that, economic aggregate (including the total GDP, investment, consumption and net export), upgrade of industrial structure, employment and remuneration of a region are all positively correlated with the degree and the clustering coefficient of the VC sub-network of the region, which suggests that the development of the VC industry has substantial effects on regional economy in China.
Jin, Yonghong; Zhang, Qi; Shan, Lifei; Li, Sai-Ping
2015-01-01
Financial networks have been extensively studied as examples of real world complex networks. In this paper, we establish and study the network of venture capital (VC) firms in China. We compute and analyze the statistical properties of the network, including parameters such as degrees, mean lengths of the shortest paths, clustering coefficient and robustness. We further study the topology of the network and find that it has small-world behavior. A multiple linear regression model is introduced to study the relation between network parameters and major regional economic indices in China. From the result of regression, we find that, economic aggregate (including the total GDP, investment, consumption and net export), upgrade of industrial structure, employment and remuneration of a region are all positively correlated with the degree and the clustering coefficient of the VC sub-network of the region, which suggests that the development of the VC industry has substantial effects on regional economy in China. PMID:26340555
Coevolving complex networks in the model of social interactions
NASA Astrophysics Data System (ADS)
Raducha, Tomasz; Gubiec, Tomasz
2017-04-01
We analyze Axelrod's model of social interactions on coevolving complex networks. We introduce four extensions with different mechanisms of edge rewiring. The models are intended to catch two kinds of interactions-preferential attachment, which can be observed in scientists or actors collaborations, and local rewiring, which can be observed in friendship formation in everyday relations. Numerical simulations show that proposed dynamics can lead to the power-law distribution of nodes' degree and high value of the clustering coefficient, while still retaining the small-world effect in three models. All models are characterized by two phase transitions of a different nature. In case of local rewiring we obtain order-disorder discontinuous phase transition even in the thermodynamic limit, while in case of long-distance switching discontinuity disappears in the thermodynamic limit, leaving one continuous phase transition. In addition, we discover a new and universal characteristic of the second transition point-an abrupt increase of the clustering coefficient, due to formation of many small complete subgraphs inside the network.
Is It Feasible to Identify Natural Clusters of TSC-Associated Neuropsychiatric Disorders (TAND)?
Leclezio, Loren; Gardner-Lubbe, Sugnet; de Vries, Petrus J
2018-04-01
Tuberous sclerosis complex (TSC) is a genetic disorder with multisystem involvement. The lifetime prevalence of TSC-Associated Neuropsychiatric Disorders (TAND) is in the region of 90% in an apparently unique, individual pattern. This "uniqueness" poses significant challenges for diagnosis, psycho-education, and intervention planning. To date, no studies have explored whether there may be natural clusters of TAND. The purpose of this feasibility study was (1) to investigate the practicability of identifying natural TAND clusters, and (2) to identify appropriate multivariate data analysis techniques for larger-scale studies. TAND Checklist data were collected from 56 individuals with a clinical diagnosis of TSC (n = 20 from South Africa; n = 36 from Australia). Using R, the open-source statistical platform, mean squared contingency coefficients were calculated to produce a correlation matrix, and various cluster analyses and exploratory factor analysis were examined. Ward's method rendered six TAND clusters with good face validity and significant convergence with a six-factor exploratory factor analysis solution. The "bottom-up" data-driven strategies identified a "scholastic" cluster of TAND manifestations, an "autism spectrum disorder-like" cluster, a "dysregulated behavior" cluster, a "neuropsychological" cluster, a "hyperactive/impulsive" cluster, and a "mixed/mood" cluster. These feasibility results suggest that a combination of cluster analysis and exploratory factor analysis methods may be able to identify clinically meaningful natural TAND clusters. Findings require replication and expansion in larger dataset, and could include quantification of cluster or factor scores at an individual level. Copyright © 2018 Elsevier Inc. All rights reserved.
Lack of small-scale clustering in 21-cm intensity maps crossed with 2dF galaxy densities at z ~ 0.08
NASA Astrophysics Data System (ADS)
Anderson, Christopher; Luciw, Nicholas; Li, Yi-Chao; Kuo, Cheng-Yu; Yadav, Jaswant; Masui, Kiyoshi; Chang, Tzu-Ching; Chen, Xuelei; Oppermann, Niels; Pen, Ue-Li; Timbie, Peter T.
2017-06-01
I report results from 21-cm intensity maps acquired from the Parkes radio telescope and cross-correlated with galaxy maps from the 2dF galaxy survey. The data span the redshift range 0.057
NASA Astrophysics Data System (ADS)
Chuan, Zun Liang; Ismail, Noriszura; Shinyie, Wendy Ling; Lit Ken, Tan; Fam, Soo-Fen; Senawi, Azlyna; Yusoff, Wan Nur Syahidah Wan
2018-04-01
Due to the limited of historical precipitation records, agglomerative hierarchical clustering algorithms widely used to extrapolate information from gauged to ungauged precipitation catchments in yielding a more reliable projection of extreme hydro-meteorological events such as extreme precipitation events. However, identifying the optimum number of homogeneous precipitation catchments accurately based on the dendrogram resulted using agglomerative hierarchical algorithms are very subjective. The main objective of this study is to propose an efficient regionalized algorithm to identify the homogeneous precipitation catchments for non-stationary precipitation time series. The homogeneous precipitation catchments are identified using average linkage hierarchical clustering algorithm associated multi-scale bootstrap resampling, while uncentered correlation coefficient as the similarity measure. The regionalized homogeneous precipitation is consolidated using K-sample Anderson Darling non-parametric test. The analysis result shows the proposed regionalized algorithm performed more better compared to the proposed agglomerative hierarchical clustering algorithm in previous studies.
Sharafi, Zahra
2017-01-01
Background The purpose of this study was to evaluate the effectiveness of two methods of detecting differential item functioning (DIF) in the presence of multilevel data and polytomously scored items. The assessment of DIF with multilevel data (e.g., patients nested within hospitals, hospitals nested within districts) from large-scale assessment programs has received considerable attention but very few studies evaluated the effect of hierarchical structure of data on DIF detection for polytomously scored items. Methods The ordinal logistic regression (OLR) and hierarchical ordinal logistic regression (HOLR) were utilized to assess DIF in simulated and real multilevel polytomous data. Six factors (DIF magnitude, grouping variable, intraclass correlation coefficient, number of clusters, number of participants per cluster, and item discrimination parameter) with a fully crossed design were considered in the simulation study. Furthermore, data of Pediatric Quality of Life Inventory™ (PedsQL™) 4.0 collected from 576 healthy school children were analyzed. Results Overall, results indicate that both methods performed equivalently in terms of controlling Type I error and detection power rates. Conclusions The current study showed negligible difference between OLR and HOLR in detecting DIF with polytomously scored items in a hierarchical structure. Implications and considerations while analyzing real data were also discussed. PMID:29312463
Sharafi, Zahra; Mousavi, Amin; Ayatollahi, Seyyed Mohammad Taghi; Jafari, Peyman
2017-01-01
The purpose of this study was to evaluate the effectiveness of two methods of detecting differential item functioning (DIF) in the presence of multilevel data and polytomously scored items. The assessment of DIF with multilevel data (e.g., patients nested within hospitals, hospitals nested within districts) from large-scale assessment programs has received considerable attention but very few studies evaluated the effect of hierarchical structure of data on DIF detection for polytomously scored items. The ordinal logistic regression (OLR) and hierarchical ordinal logistic regression (HOLR) were utilized to assess DIF in simulated and real multilevel polytomous data. Six factors (DIF magnitude, grouping variable, intraclass correlation coefficient, number of clusters, number of participants per cluster, and item discrimination parameter) with a fully crossed design were considered in the simulation study. Furthermore, data of Pediatric Quality of Life Inventory™ (PedsQL™) 4.0 collected from 576 healthy school children were analyzed. Overall, results indicate that both methods performed equivalently in terms of controlling Type I error and detection power rates. The current study showed negligible difference between OLR and HOLR in detecting DIF with polytomously scored items in a hierarchical structure. Implications and considerations while analyzing real data were also discussed.
López-Carr, David; Davis, Jason; Jankowska, Marta; Grant, Laura; López-Carr, Anna Carla; Clark, Matthew
2013-01-01
The relative role of space and place has long been debated in geography. Yet modeling efforts applied to coupled human-natural systems seemingly favor models assuming continuous spatial relationships. We examine the relative importance of placebased hierarchical versus spatial clustering influences in tropical land use/cover change (LUCC). Guatemala was chosen as our study site given its high rural population growth and deforestation in recent decades. We test predictors of 2009 forest cover and forest cover change from 2001-2009 across Guatemala's 331 municipalities and 22 departments using spatial and multi-level statistical models. Our results indicate the emergence of several socio-economic predictors of LUCC regardless of model choice. Hierarchical model results suggest that significant differences exist at the municipal and departmental levels but largely maintain the magnitude and direction of single-level model coefficient estimates. They are also intervention-relevant since policies tend to be applicable to distinct political units rather than to continuous space. Spatial models complement hierarchical approaches by indicating where and to what magnitude significant negative and positive clustering associations emerge. Appreciating the comparative advantages and limitations of spatial and nested models enhances a holistic approach to geographical analysis of tropical LUCC and human-environment interactions. PMID:24013908
Wavelet investigation of preferential concentration in particle-laden turbulence
NASA Astrophysics Data System (ADS)
Bassenne, Maxime; Urzay, Javier; Schneider, Kai; Moin, Parviz
2017-11-01
Direct numerical simulations of particle-laden homogeneous-isotropic turbulence are employed in conjunction with wavelet multi-resolution analyses to study preferential concentration in both physical and spectral spaces. Spatially-localized energy spectra for velocity, vorticity and particle-number density are computed, along with their spatial fluctuations that enable the quantification of scale-dependent probability density functions, intermittency and inter-phase conditional statistics. The main result is that particles are found in regions of lower turbulence spectral energy than the corresponding mean. This suggests that modeling the subgrid-scale turbulence intermittency is required for capturing the small-scale statistics of preferential concentration in large-eddy simulations. Additionally, a method is defined that decomposes a particle number-density field into the sum of a coherent and an incoherent components. The coherent component representing the clusters can be sparsely described by at most 1.6% of the total number of wavelet coefficients. An application of the method, motivated by radiative-heat-transfer simulations, is illustrated in the form of a grid-adaptation algorithm that results in non-uniform meshes refined around particle clusters. It leads to a reduction of the number of control volumes by one to two orders of magnitude. PSAAP-II Center at Stanford (Grant DE-NA0002373).
Development and Characterization of 1,906 EST-SSR Markers from Unigenes in Jute (Corchorus spp.)
Zhang, Liwu; Li, Yanru; Tao, Aifen; Fang, Pingping; Qi, Jianmin
2015-01-01
Jute, comprising white and dark jute, is the second important natural fiber crop after cotton worldwide. However, the lack of expressed sequence tag-derived simple sequence repeat (EST-SSR) markers has resulted in a large gap in the improvement of jute. Previously, de novo 48,914 unigenes from white jute were assembled. In this study, 1,906 EST-SSRs were identified from these assembled uingenes. Among these markers, di-, tri- and tetra-nucleotide repeat types were the abundant types (12.0%, 56.9% and 21.6% respectively). The AG-rich or GA-rich nucleotide repeats were the predominant. Subsequently, a sample of 116 SSRs, located in genes encoding transcription factors and cellulose synthases, were selected to survey polymorphisms among12 diverse jute accessions. Of these, 83.6% successfully amplified at least one fragment and detected polymorphism among the 12diverse genotypes, indicating that the newly developed SSRs are of good quality. Furthermore, the genetic similarity coefficients of all the 12 accessions were evaluated using 97 polymorphic SSRs. The cluster analysis divided the jute accessions into two main groups with genetic similarity coefficient of 0.61. These EST-SSR markers not only enrich molecular markers of jute genome, but also facilitate genetic and genomic researches in jute. PMID:26512891
Operational foreshock forecasting: Fifteen years after
NASA Astrophysics Data System (ADS)
Ogata, Y.
2010-12-01
We are concerned with operational forecasting of the probability that events are foreshocks of a forthcoming earthquake that is significantly larger (mainshock). Specifically, we define foreshocks as the preshocks substantially smaller than the mainshock by a magnitude gap of 0.5 or larger. The probability gain of foreshock forecast is extremely high compare to long-term forecast by renewal processes or various alarm-based intermediate-term forecasts because of a large event’s low occurrence rate in a short period and a narrow target region. Thus, it is desired to establish operational foreshock probability forecasting as seismologists have done for aftershocks. When a series of earthquakes occurs in a region, we attempt to discriminate foreshocks from a swarm or mainshock-aftershock sequence. Namely, after real time identification of an earthquake cluster using methods such as the single-link algorithm, the probability is calculated by applying statistical features that discriminate foreshocks from other types of clusters, by considering the events' stronger proximity in time and space and tendency towards chronologically increasing magnitudes. These features were modeled for probability forecasting and the coefficients of the model were estimated in Ogata et al. (1996) for the JMA hypocenter data (M≧4, 1926-1993). Currently, fifteen years has passed since the publication of the above-stated work so that we are able to present the performance and validation of the forecasts (1994-2009) by using the same model. Taking isolated events into consideration, the probability of the first events in a potential cluster being a foreshock vary in a range between 0+% and 10+% depending on their locations. This conditional forecasting performs significantly better than the unconditional (average) foreshock probability of 3.7% throughout Japan region. Furthermore, when we have the additional events in a cluster, the forecast probabilities range more widely from nearly 0% to about 40% depending on the discrimination features among the events in the cluster. This conditional forecasting further performs significantly better than the unconditional foreshock probability of 7.3%, which is the average probability of the plural events in the earthquake clusters. Indeed, the frequency ratios of the actual foreshocks are consistent with the forecasted probabilities. Reference: Ogata, Y., Utsu, T. and Katsura, K. (1996). Statistical discrimination of foreshocks from other earthquake clusters, Geophys. J. Int. 127, 17-30.
Characterising large-scale structure with the REFLEX II cluster survey
NASA Astrophysics Data System (ADS)
Chon, Gayoung
2016-10-01
We study the large-scale structure with superclusters from the REFLEX X-ray cluster survey together with cosmological N-body simulations. It is important to construct superclusters with criteria such that they are homogeneous in their properties. We lay out our theoretical concept considering future evolution of superclusters in their definition, and show that the X-ray luminosity and halo mass functions of clusters in superclusters are found to be top-heavy, different from those of clusters in the field. We also show a promising aspect of using superclusters to study the local cluster bias and mass scaling relation with simulations.
The impact of baryonic matter on gravitational lensing by galaxy clusters
NASA Astrophysics Data System (ADS)
Lee, Brandyn E.; King, Lindsay; Applegate, Douglas; McCarthy, Ian
2017-01-01
Since the bulk of the matter comprising galaxy clusters exists in the form of dark matter, gravitational N-body simulations have historically been an effective way to investigate large scale structure formation and the astrophysics of galaxy clusters. However, upcoming telescopes such as the Large Synoptic Survey Telescope are expected to have lower systematic errors than older generations, reducing measurement uncertainties and requiring that astrophysicists better quantify the impact of baryonic matter on the cluster lensing signal. Here we outline the effects of baryonic processes on cluster density profiles and on weak lensing mass and concentration estimates. Our analysis is done using clusters grown in the suite of cosmological hydrodynamical simulations known as cosmo-OWLS.
NASA Astrophysics Data System (ADS)
Moran, Sean; Smith, G.; Haines, C.; Egami, E.; Hardegree-Ullman, E.; Heckman, T.
2010-01-01
We present results from LoCuSS, the Local Cluster Substructure Survey, on the distribution and abundance of cluster galaxies showing signatures of recently quenched star formation, within a sample of 15 z 0.2 clusters. Combining LoCuSS' wide-field UV through NIR photometry with weak-lensing derived mass maps for these clusters, we identify passive galaxies that have undergone recent quenching via both rapid (100Myr) and slow (1Gyr) mechanisms. By studying their abundance in a statistically significant sample of z 0.2 clusters, we explore how the effectiveness of environmental quenching of star formation varies as a function of the level of cluster substructure, in addition to global cluster characteristics such as mass or X-ray luminosity and temperature, with the aim of understanding the role that pre-processing of galaxies within groups and filaments plays in the overall buildup of the morphology-density and SFR-density relations. We find that clusters with large levels of substructure indicative of recent assembly or cluster-cluster mergers host a higher fraction of galaxies with signs of recent ram-pressure stripping by the hot intra-cluster gas. In addition, we find that the fraction of post-starburst galaxies increases with cluster mass (M500), but fractions of optically-selected AGN and GALEX-defined "Green Valley" galaxies show the opposite trend, being most abundant in rather low-mass clusters. These trends suggest a picture where quenching of star formation occurs most vigorously in actively assembling structures, with comparatively little activity in the most massive structures where most of the nearby large-scale structure has already been accreted and Virialized into the main cluster body.
Shi, Weifang; Zeng, Weihua
2013-01-01
Reducing human vulnerability to chemical hazards in the industrialized city is a matter of great urgency. Vulnerability mapping is an alternative approach for providing vulnerability-reducing interventions in a region. This study presents a method for mapping human vulnerability to chemical hazards by using clustering analysis for effective vulnerability reduction. Taking the city of Shanghai as the study area, we measure human exposure to chemical hazards by using the proximity model with additionally considering the toxicity of hazardous substances, and capture the sensitivity and coping capacity with corresponding indicators. We perform an improved k-means clustering approach on the basis of genetic algorithm by using a 500 m × 500 m geographical grid as basic spatial unit. The sum of squared errors and silhouette coefficient are combined to measure the quality of clustering and to determine the optimal clustering number. Clustering result reveals a set of six typical human vulnerability patterns that show distinct vulnerability dimension combinations. The vulnerability mapping of the study area reflects cluster-specific vulnerability characteristics and their spatial distribution. Finally, we suggest specific points that can provide new insights in rationally allocating the limited funds for the vulnerability reduction of each cluster. PMID:23787337
Unequal cluster sizes in stepped-wedge cluster randomised trials: a systematic review
Morris, Tom; Gray, Laura
2017-01-01
Objectives To investigate the extent to which cluster sizes vary in stepped-wedge cluster randomised trials (SW-CRT) and whether any variability is accounted for during the sample size calculation and analysis of these trials. Setting Any, not limited to healthcare settings. Participants Any taking part in an SW-CRT published up to March 2016. Primary and secondary outcome measures The primary outcome is the variability in cluster sizes, measured by the coefficient of variation (CV) in cluster size. Secondary outcomes include the difference between the cluster sizes assumed during the sample size calculation and those observed during the trial, any reported variability in cluster sizes and whether the methods of sample size calculation and methods of analysis accounted for any variability in cluster sizes. Results Of the 101 included SW-CRTs, 48% mentioned that the included clusters were known to vary in size, yet only 13% of these accounted for this during the calculation of the sample size. However, 69% of the trials did use a method of analysis appropriate for when clusters vary in size. Full trial reports were available for 53 trials. The CV was calculated for 23 of these: the median CV was 0.41 (IQR: 0.22–0.52). Actual cluster sizes could be compared with those assumed during the sample size calculation for 14 (26%) of the trial reports; the cluster sizes were between 29% and 480% of that which had been assumed. Conclusions Cluster sizes often vary in SW-CRTs. Reporting of SW-CRTs also remains suboptimal. The effect of unequal cluster sizes on the statistical power of SW-CRTs needs further exploration and methods appropriate to studies with unequal cluster sizes need to be employed. PMID:29146637
Kinetics of Aggregation with Choice
Ben-Naim, Eli; Krapivsky, Paul
2016-12-01
Here we generalize the ordinary aggregation process to allow for choice. In ordinary aggregation, two random clusters merge and form a larger aggregate. In our implementation of choice, a target cluster and two candidate clusters are randomly selected and the target cluster merges with the larger of the two candidate clusters.We study the long-time asymptotic behavior and find that as in ordinary aggregation, the size density adheres to the standard scaling form. However, aggregation with choice exhibits a number of different features. First, the density of the smallest clusters exhibits anomalous scaling. Second, both the small-size and the large-size tailsmore » of the density are overpopulated, at the expense of the density of moderate-size clusters. Finally, we also study the complementary case where the smaller candidate cluster participates in the aggregation process and find an abundance of moderate clusters at the expense of small and large clusters. Additionally, we investigate aggregation processes with choice among multiple candidate clusters and a symmetric implementation where the choice is between two pairs of clusters.« less
Simultaneous multi-species tracking in live cells with quantum dot conjugates.
Clausen, Mathias P; Arnspang, Eva C; Ballou, Byron; Bear, James E; Lagerholm, B Christoffer
2014-01-01
Quantum dots are available in a range of spectrally separated emission colors and with a range of water-stabilizing surface coatings that offers great flexibility for enabling bio-specificity. In this study, we have taken advantage of this flexibility to demonstrate that it is possible to perform a simultaneous investigation of the lateral dynamics in the plasma membrane of i) the transmembrane epidermal growth factor receptor, ii) the glucosylphospatidylinositol-anchored protein CD59, and iii) ganglioside GM1-cholera toxin subunit B clusters in a single cell. We show that a large number of the trajectories are longer than 50 steps, which we by simulations show to be sufficient for robust single trajectory analysis. This analysis shows that the populations of the diffusion coefficients are heterogeneously distributed for all three species, but differ between the different species. We further show that the heterogeneity is decreased upon treating the cells with methyl-β-cyclodextrin.
Polaronic Effect on Electrical Conductivity and Thermoelectric Power in Ga(Cu)V4S8
NASA Astrophysics Data System (ADS)
Naik, I.
2018-01-01
Polycrystalline V4-cluster compounds of GaV4S8 and its derivatives Ga0.90 Cu0.10V4S8 and Ga0.90Cu0.20V4S8 have been prepared at 800°C by solid-state reaction method. Although the cubic-rhombohedral phase transformation at 45 K was found to be absent in the derivatives of GaV4S8, low-temperature hopping conduction occurred in all the materials. In the present context, we explain the conduction mechanism for all the materials using polaron theory. The polaron size was found to be large above 260 K but small below 260 K in GaV4S8, as confirmed by the Seebeck coefficient. From the activation energies and polaron size, the anomaly at 260 K is interpreted as associated with crossover from thermally activated to nearest-neighbor hopping upon cooling.
Wavelet Based Protection Scheme for Multi Terminal Transmission System with PV and Wind Generation
NASA Astrophysics Data System (ADS)
Manju Sree, Y.; Goli, Ravi kumar; Ramaiah, V.
2017-08-01
A hybrid generation is a part of large power system in which number of sources usually attached to a power electronic converter and loads are clustered can operate independent of the main power system. The protection scheme is crucial against faults based on traditional over current protection since there are adequate problems due to fault currents in the mode of operation. This paper adopts a new approach for detection, discrimination of the faults for multi terminal transmission line protection in presence of hybrid generation. Transient current based protection scheme is developed with discrete wavelet transform. Fault indices of all phase currents at all terminals are obtained by analyzing the detail coefficients of current signals using bior 1.5 mother wavelet. This scheme is tested for different types of faults and is found effective for detection and discrimination of fault with various fault inception angle and fault impedance.
Taxonomic study of extreme halophilic archaea isolated from the "Salar de Atacama", Chile.
Lizama, C; Monteoliva-Sánchez, M; Prado, B; Ramos-Cormenzana, A; Weckesser, J; Campos, V
2001-11-01
A large number of halophilic bacteria were isolated in 1984-1992 from the Atacama Saltern (North of Chile). For this study 82 strains of extreme halophilic archaea were selected. The characterization was performed by using the phenotypic characters including morphological, physiological, biochemical, nutritional and antimicrobial susceptibility test. The results, together with those from reference strains, were subjected to numerical analysis, using the Simple Matching (S(SM)) coefficient and clustered by the unweighted pair group method of association (UPGMA). Fifteen phena were obtained at an 70% similarity level. The results obtained reveal a high diversity among the halophilic archaea isolated. Representative strains from the phena were chosen to determine their DNA base composition and the percentage of DNA-DNA similarity compared to reference strains. The 16S rRNA studies showed that some of these strains constitutes a new taxa of extreme halophilic archaea.
Analysis of the Chinese air route network as a complex network
NASA Astrophysics Data System (ADS)
Cai, Kai-Quan; Zhang, Jun; Du, Wen-Bo; Cao, Xian-Bin
2012-02-01
The air route network, which supports all the flight activities of the civil aviation, is the most fundamental infrastructure of air traffic management system. In this paper, we study the Chinese air route network (CARN) within the framework of complex networks. We find that CARN is a geographical network possessing exponential degree distribution, low clustering coefficient, large shortest path length and exponential spatial distance distribution that is obviously different from that of the Chinese airport network (CAN). Besides, via investigating the flight data from 2002 to 2010, we demonstrate that the topology structure of CARN is homogeneous, howbeit the distribution of flight flow on CARN is rather heterogeneous. In addition, the traffic on CARN keeps growing in an exponential form and the increasing speed of west China is remarkably larger than that of east China. Our work will be helpful to better understand Chinese air traffic systems.
Weak signal transmission in complex networks and its application in detecting connectivity.
Liang, Xiaoming; Liu, Zonghua; Li, Baowen
2009-10-01
We present a network model of coupled oscillators to study how a weak signal is transmitted in complex networks. Through both theoretical analysis and numerical simulations, we find that the response of other nodes to the weak signal decays exponentially with their topological distance to the signal source and the coupling strength between two neighboring nodes can be figured out by the responses. This finding can be conveniently used to detect the topology of unknown network, such as the degree distribution, clustering coefficient and community structure, etc., by repeatedly choosing different nodes as the signal source. Through four typical networks, i.e., the regular one dimensional, small world, random, and scale-free networks, we show that the features of network can be approximately given by investigating many fewer nodes than the network size, thus our approach to detect the topology of unknown network may be efficient in practical situations with large network size.
Modelling students' knowledge organisation: Genealogical conceptual networks
NASA Astrophysics Data System (ADS)
Koponen, Ismo T.; Nousiainen, Maija
2018-04-01
Learning scientific knowledge is largely based on understanding what are its key concepts and how they are related. The relational structure of concepts also affects how concepts are introduced in teaching scientific knowledge. We model here how students organise their knowledge when they represent their understanding of how physics concepts are related. The model is based on assumptions that students use simple basic linking-motifs in introducing new concepts and mostly relate them to concepts that were introduced a few steps earlier, i.e. following a genealogical ordering. The resulting genealogical networks have relatively high local clustering coefficients of nodes but otherwise resemble networks obtained with an identical degree distribution of nodes but with random linking between them (i.e. the configuration-model). However, a few key nodes having a special structural role emerge and these nodes have a higher than average communicability betweenness centralities. These features agree with the empirically found properties of students' concept networks.
Research on the precise positioning of customers in large data environment
NASA Astrophysics Data System (ADS)
Zhou, Xu; He, Lili
2018-04-01
Customer positioning has always been a problem that enterprises focus on. In this paper, FCM clustering algorithm is used to cluster customer groups. However, due to the traditional FCM clustering algorithm, which is susceptible to the influence of the initial clustering center and easy to fall into the local optimal problem, the short board of FCM is solved by the gray optimization algorithm (GWO) to achieve efficient and accurate handling of a large number of retailer data.
Pagoto, Sherry L; Schneider, Kristin L; Oleski, Jessica; Bodenlos, Jamie S; Merriam, Philip; Ma, Yunsheng
2009-02-05
Skin cancer is the most prevalent yet most preventable cancer in the US. While protecting oneself from ultraviolet radiation (UVR) can largely reduce risk, rates of unprotected sun exposure remain high. Because the desire to be tan often outweighs health concerns among sunbathers, very few interventions have been successful at reducing sunbathing behavior. Sunless tanning (self-tanners and spray tans), a method of achieving the suntanned look without UVR exposure, might be an effective supplement to prevention interventions. This cluster randomized trial will examine whether a beach-based intervention that promotes sunless tanning as a substitute for sunbathing and includes sun damage imaging and sun safety recommendations is superior to a questionnaire only control group in reducing sunbathing frequency. Female beach visitors (N = 250) will be recruited from 2 public beaches in eastern Massachusetts. Beach site will be the unit of randomization. Follow-up assessment will occur at the end of the summer (1-month following intervention) and 1 year later. The primary outcome is average sunbathing time per week. The study was designed to provide 90% power for detecting a difference of .70 hours between conditions (standard deviation of 2.0) at 1-year with an intra-cluster correlation coefficient of 0.01 and assuming a 25% rate of loss to follow-up. Secondary outcomes include frequency of sunburns, use of sunless tanning products, and sun protection behavior. Interventions might be improved by promoting behavioral substitutes for sun exposure, such as sunless tanners, that create a tanned look without exposure to UVR. NCT00403377.
NASA Astrophysics Data System (ADS)
Lian, Liping; Song, Weiguo; Richard, Yuen Kwok Kit; Ma, Jian; Telesca, Luciano
2017-03-01
Pedestrian stampede happened more and more often during these years, such as Love Parade disaster in Germany 2010, trampling in Shanghai bund 2014 and crowd stampede in pilgrimages. Love Parade disaster 2010 stands out for well recorded videos, which are HD quality and available for researchers. There were totally seven surveillance cameras capturing the whole festival progress and the video we study is just before the disaster happened. Pedestrian motion was special and a small disturbance would lead the group to an avalanche in this kind of critical situation. Here we focus on the individual movement pattern. The trajectories of each pedestrian involved were extracted by a mean-shift algorithm. We analyzed the space-time patterns of the pedestrians involved in the Love Parade stampede by using the detrended fluctuation analysis and the coefficient of variation. Our results reveal that the pedestrians' movement in crowd-quakes is persistent in space, globally time-clusterized but locally regular or quasi-periodic behavior. Pedestrian movement was treated as stop and go state by point process-based representation. When the threshold increases, this means that the "go" state is longer and pedestrians keep on walking in several consecutive time frames; this is difficult in crowded situations and lead to special time-clustering behavior of the sequence of "go" events. The study reveals pedestrian motion characteristics in critical situations, which will enhance the understanding of pedestrian behaviors and supply early warning features for not only Love Parade Disaster, but also other similar large events.
The Mass Function of Abell Clusters
NASA Astrophysics Data System (ADS)
Chen, J.; Huchra, J. P.; McNamara, B. R.; Mader, J.
1998-12-01
The velocity dispersion and mass functions for rich clusters of galaxies provide important constraints on models of the formation of Large-Scale Structure (e.g., Frenk et al. 1990). However, prior estimates of the velocity dispersion or mass function for galaxy clusters have been based on either very small samples of clusters (Bahcall and Cen 1993; Zabludoff et al. 1994) or large but incomplete samples (e.g., the Girardi et al. (1998) determination from a sample of clusters with more than 30 measured galaxy redshifts). In contrast, we approach the problem by constructing a volume-limited sample of Abell clusters. We collected individual galaxy redshifts for our sample from two major galaxy velocity databases, the NASA Extragalactic Database, NED, maintained at IPAC, and ZCAT, maintained at SAO. We assembled a database with velocity information for possible cluster members and then selected cluster members based on both spatial and velocity data. Cluster velocity dispersions and masses were calculated following the procedures of Danese, De Zotti, and di Tullio (1980) and Heisler, Tremaine, and Bahcall (1985), respectively. The final velocity dispersion and mass functions were analyzed in order to constrain cosmological parameters by comparison to the results of N-body simulations. Our data for the cluster sample as a whole and for the individual clusters (spatial maps and velocity histograms) in our sample is available on-line at http://cfa-www.harvard.edu/ huchra/clusters. This website will be updated as more data becomes available in the master redshift compilations, and will be expanded to include more clusters and large groups of galaxies.
Nuclear Rings in the IR: Hidden Super Star Clusters
NASA Astrophysics Data System (ADS)
Maoz, Dan
1997-07-01
We propose NICMOS broad-band {F160W, F187W} and Paschen Alpha {F187N} imaging of nuclear starburst rings in two nearby galaxies. We already have UV {F220W} FOC data, and are scheduled to obtain WFPC2 images in U, V, I, and Halpha+[NII] of these rings. The rings contain large populations of super star clusters similar to those recently discovered in other types of starburst systems. Nuclear rings contain large numbers of these clusters in relatively unobscured starburst environments. Measurement of the age, size, and stellar contents of the clusters can test the hypothesis that super star clusters are young globular clusters. Together with our UV and optical data, NICMOS images will provide the SED of numerous super star clusters over a decade in wavelength. Our already-approved observations will allow us to estimate, by comparison with evolutionary synthesis models, the masses and ages of the clusters. The proposed IR data will be sensitive to the number of supergiants {1.6 micron} and O-stars {Paschen Alpha} in each of the clusters. The observations will provide an independent determination of the reddening, mass, and age of each cluster. We expect to see in the IR numerous clusters that are obscured in the UV and optical. These clusters may be the younger ones, which are still embedded in their molecular clouds. By measuring the mass, age, and size of a large number of clusters, we can actually obtain an evolutionary picture of these objects at different stages in their lives.
Evaluating Mixture Modeling for Clustering: Recommendations and Cautions
ERIC Educational Resources Information Center
Steinley, Douglas; Brusco, Michael J.
2011-01-01
This article provides a large-scale investigation into several of the properties of mixture-model clustering techniques (also referred to as latent class cluster analysis, latent profile analysis, model-based clustering, probabilistic clustering, Bayesian classification, unsupervised learning, and finite mixture models; see Vermunt & Magdison,…
DISPLACEMENT CASCADE SIMULATION IN TUNGSTEN UP TO 200 KEV OF DAMAGE ENERGY AT 300, 1025, AND 2050 K
DOE Office of Scientific and Technical Information (OSTI.GOV)
Setyawan, Wahyu; Nandipati, Giridhar; Roche, Kenneth J.
2015-09-22
We generated molecular dynamics database of primary defects that adequately covers the range of tungsten recoil energy imparted by 14-MeV neutrons. During this semi annual period, cascades at 150 and 200 keV at 300 and 1025 K were simulated. Overall, we included damage energy up to 200 keV at 300 and 1025 K, and up to 100 keV at 2050 K. We report the number of surviving Frenkel pairs (NF) and the size distribution of defect clusters. The slope of the NF curve versus cascade damage energy (EMD), on a log-log scale, changes at a transition energy (μ). For EMDmore » > μ, the cascade forms interconnected damage regions that facilitate the formation of large clusters of defects. At 300 K and EMD = 200 keV, the largest size of interstitial cluster and vacancy cluster is 266 and 335, respectively. Similarly, at 1025 K and EMD = 200 keV, the largest size of interstitial cluster and vacancy cluster is 296 and 338, respectively. At 2050 K, large interstitial clusters also routinely form, but practically no large vacancy clusters do« less
Dahms, Sven O.; Kuester, Miriam; Streb, Carsten; Roth, Christian; Sträter, Norbert; Than, Manuel E.
2013-01-01
Heavy-atom clusters (HA clusters) containing a large number of specifically arranged electron-dense scatterers are especially useful for experimental phase determination of large complex structures, weakly diffracting crystals or structures with large unit cells. Often, the determination of the exact orientation of the HA cluster and hence of the individual heavy-atom positions proves to be the critical step in successful phasing and subsequent structure solution. Here, it is demonstrated that molecular replacement (MR) with either anomalous or isomorphous differences is a useful strategy for the correct placement of HA cluster compounds. The polyoxometallate cluster hexasodium α-metatungstate (HMT) was applied in phasing the structure of death receptor 6. Even though the HA cluster is bound in alternate partially occupied orientations and is located at a special position, its correct localization and orientation could be determined at resolutions as low as 4.9 Å. The broad applicability of this approach was demonstrated for five different derivative crystals that included the compounds tantalum tetradecabromide and trisodium phosphotungstate in addition to HMT. The correct placement of the HA cluster depends on the length of the intramolecular vectors chosen for MR, such that both a larger cluster size and the optimal choice of the wavelength used for anomalous data collection strongly affect the outcome. PMID:23385464
Small-Scale Drop-Size Variability: Empirical Models for Drop-Size-Dependent Clustering in Clouds
NASA Technical Reports Server (NTRS)
Marshak, Alexander; Knyazikhin, Yuri; Larsen, Michael L.; Wiscombe, Warren J.
2005-01-01
By analyzing aircraft measurements of individual drop sizes in clouds, it has been shown in a companion paper that the probability of finding a drop of radius r at a linear scale l decreases as l(sup D(r)), where 0 less than or equals D(r) less than or equals 1. This paper shows striking examples of the spatial distribution of large cloud drops using models that simulate the observed power laws. In contrast to currently used models that assume homogeneity and a Poisson distribution of cloud drops, these models illustrate strong drop clustering, especially with larger drops. The degree of clustering is determined by the observed exponents D(r). The strong clustering of large drops arises naturally from the observed power-law statistics. This clustering has vital consequences for rain physics, including how fast rain can form. For radiative transfer theory, clustering of large drops enhances their impact on the cloud optical path. The clustering phenomenon also helps explain why remotely sensed cloud drop size is generally larger than that measured in situ.
Modelling wildfire activity in Iberia with different Atmospheric Circulation WTs
NASA Astrophysics Data System (ADS)
Sousa, P. M.; Trigo, R.; Pereira, M. G.; Rasilla, D.; Gouveia, C.
2012-04-01
This work focuses on the spatial and temporal variability of burnt area (BA) for the entire Iberian Peninsula (IP) and on the construction of statistical models to reproduce the inter-annual variability, based on Weather Types Classification (WTC). A common BA dataset was assembled for the first time for the entire Iberian Peninsula, by merging BA records for the 66 administrative regions of Portugal and Spain. A normalization procedure was then applied to the various size regions before performing a k-means cluster analysis to identify large areas characterized by similar fire regimes. The most compelling results were obtained for 4 clusters (Northwestern, Northern, Southwestern and Eastern) whose spatial patterns and seasonal fire regimes are shown to be related with constraining factors such as topography, vegetation cover and climate conditions. The response of fire burnt surface at monthly time scales to both long-term climatic pre-conditions and short-term synoptic forcing was assessed through correlation and regression analysis using: (i) temperature and precipitation from 2 to 7 months in advance to fire peak season; (ii) synoptic weather patterns derived from 11 distinct classifications derived under the COSTaction-733. Different responses were obtained for each of the considered regions: (i) a relevant link between BA and short-term synoptic forcing (represented by monthly frequencies of WTC) was identified for all clusters; (ii) long-term climatic preconditioning was relevant for all but one cluster (Northern). Taking into account these links, we developed stepwise regression models with the aim of reproducing the observed BA series (i.e. in hindcast mode). These models were based on the best climatic and synoptic circulation predictors identified previously. All models were cross-validated and their performance varies between clusters, though models exclusively based on WTCs tend to better reproduce annual BA time series than those only based on pre-conditioning climatic information. Nevertheless, the best results are attained when both synoptic and climatic predictors are used simultaneously as predictors, in particular for the two western clusters, where correlation coefficient values are higher than 0.7. Finally, we have used WTC composite maps to characterize the typical synoptic configurations that favor high values of BA. These patterns correspond to dry and warm fluxes, associated with anticyclonic regimes, which foster fire ignition (Pereira et al., 2005). Pereira, M.G., Trigo, R.M., DaCamara, C.C., Pereira, J.M.C., Leite, S.M., 2005: "Synoptic patterns associated with large summer forest fires in Portugal". Agricultural and Forest Meteorology. 129, 11-25. COST733, 2011: "COST 733 Wiki - Harmonisation and Applications of Weather Type Classifications for European regions or COST733 spatial domains for Europe". Available at http://geo21.geo.uni-augsburg.de/cost733wiki/Cost733_Wiki_Main [accessed 1 September 2011].
Using BMDP and SPSS for a Q factor analysis.
Tanner, B A; Koning, S M
1980-12-01
While Euclidean distances and Q factor analysis may sometimes be preferred to correlation coefficients and cluster analysis for developing a typology, commercially available software does not always facilitate their use. Commands are provided for using BMDP and SPSS in a Q factor analysis with Euclidean distances.
Mitrano, Peter P; Garzó, Vicente; Hilger, Andrew M; Ewasko, Christopher J; Hrenya, Christine M
2012-04-01
An intriguing phenomenon displayed by granular flows and predicted by kinetic-theory-based models is the instability known as particle "clustering," which refers to the tendency of dissipative grains to form transient, loose regions of relatively high concentration. In this work, we assess a modified-Sonine approximation recently proposed [Garzó, Santos, and Montanero, Physica A 376, 94 (2007)] for a granular gas via an examination of system stability. In particular, we determine the critical length scale associated with the onset of two types of instabilities--vortices and clusters--via stability analyses of the Navier-Stokes-order hydrodynamic equations by using the expressions of the transport coefficients obtained from both the standard and the modified-Sonine approximations. We examine the impact of both Sonine approximations over a range of solids fraction φ<0.2 for small restitution coefficients e = 0.25-0.4, where the standard and modified theories exhibit discrepancies. The theoretical predictions for the critical length scales are compared to molecular dynamics (MD) simulations, of which a small percentage were not considered due to inelastic collapse. Results show excellent quantitative agreement between MD and the modified-Sonine theory, while the standard theory loses accuracy for this highly dissipative parameter space. The modified theory also remedies a high-dissipation qualitative mismatch between the standard theory and MD for the instability that forms more readily. Furthermore, the evolution of cluster size is briefly examined via MD, indicating that domain-size clusters may remain stable or halve in size, depending on system parameters.
Subramaniyam, Narayan Puthanmadam; Hyttinen, Jari
2015-02-01
Recently Andrezejak et al. combined the randomness and nonlinear independence test with iterative amplitude adjusted Fourier transform (iAAFT) surrogates to distinguish between the dynamics of seizure-free intracranial electroencephalographic (EEG) signals recorded from epileptogenic (focal) and nonepileptogenic (nonfocal) brain areas of epileptic patients. However, stationarity is a part of the null hypothesis for iAAFT surrogates and thus nonstationarity can violate the null hypothesis. In this work we first propose the application of the randomness and nonlinear independence test based on recurrence network measures to distinguish between the dynamics of focal and nonfocal EEG signals. Furthermore, we combine these tests with both iAAFT and truncated Fourier transform (TFT) surrogate methods, which also preserves the nonstationarity of the original data in the surrogates along with its linear structure. Our results indicate that focal EEG signals exhibit an increased degree of structural complexity and interdependency compared to nonfocal EEG signals. In general, we find higher rejections for randomness and nonlinear independence tests for focal EEG signals compared to nonfocal EEG signals. In particular, the univariate recurrence network measures, the average clustering coefficient C and assortativity R, and the bivariate recurrence network measure, the average cross-clustering coefficient C(cross), can successfully distinguish between the focal and nonfocal EEG signals, even when the analysis is restricted to nonstationary signals, irrespective of the type of surrogates used. On the other hand, we find that the univariate recurrence network measures, the average path length L, and the average betweenness centrality BC fail to distinguish between the focal and nonfocal EEG signals when iAAFT surrogates are used. However, these two measures can distinguish between focal and nonfocal EEG signals when TFT surrogates are used for nonstationary signals. We also report an improvement in the performance of nonlinear prediction error N and nonlinear interdependence measure L used by Andrezejak et al., when TFT surrogates are used for nonstationary EEG signals. We also find that the outcome of the nonlinear independence test based on the average cross-clustering coefficient C(cross) is independent of the outcome of the randomness test based on the average clustering coefficient C. Thus, the univariate and bivariate recurrence network measures provide independent information regarding the dynamics of the focal and nonfocal EEG signals. In conclusion, recurrence network analysis combined with nonstationary surrogates can be applied to derive reliable biomarkers to distinguish between epileptogenic and nonepileptogenic brain areas using EEG signals.
NASA Astrophysics Data System (ADS)
Subramaniyam, Narayan Puthanmadam; Hyttinen, Jari
2015-02-01
Recently Andrezejak et al. combined the randomness and nonlinear independence test with iterative amplitude adjusted Fourier transform (iAAFT) surrogates to distinguish between the dynamics of seizure-free intracranial electroencephalographic (EEG) signals recorded from epileptogenic (focal) and nonepileptogenic (nonfocal) brain areas of epileptic patients. However, stationarity is a part of the null hypothesis for iAAFT surrogates and thus nonstationarity can violate the null hypothesis. In this work we first propose the application of the randomness and nonlinear independence test based on recurrence network measures to distinguish between the dynamics of focal and nonfocal EEG signals. Furthermore, we combine these tests with both iAAFT and truncated Fourier transform (TFT) surrogate methods, which also preserves the nonstationarity of the original data in the surrogates along with its linear structure. Our results indicate that focal EEG signals exhibit an increased degree of structural complexity and interdependency compared to nonfocal EEG signals. In general, we find higher rejections for randomness and nonlinear independence tests for focal EEG signals compared to nonfocal EEG signals. In particular, the univariate recurrence network measures, the average clustering coefficient C and assortativity R , and the bivariate recurrence network measure, the average cross-clustering coefficient Ccross, can successfully distinguish between the focal and nonfocal EEG signals, even when the analysis is restricted to nonstationary signals, irrespective of the type of surrogates used. On the other hand, we find that the univariate recurrence network measures, the average path length L , and the average betweenness centrality BC fail to distinguish between the focal and nonfocal EEG signals when iAAFT surrogates are used. However, these two measures can distinguish between focal and nonfocal EEG signals when TFT surrogates are used for nonstationary signals. We also report an improvement in the performance of nonlinear prediction error N and nonlinear interdependence measure L used by Andrezejak et al., when TFT surrogates are used for nonstationary EEG signals. We also find that the outcome of the nonlinear independence test based on the average cross-clustering coefficient Ccross is independent of the outcome of the randomness test based on the average clustering coefficient C . Thus, the univariate and bivariate recurrence network measures provide independent information regarding the dynamics of the focal and nonfocal EEG signals. In conclusion, recurrence network analysis combined with nonstationary surrogates can be applied to derive reliable biomarkers to distinguish between epileptogenic and nonepileptogenic brain areas using EEG signals.
2012-01-01
Background A discrete choice experiment (DCE) is a preference survey which asks participants to make a choice among product portfolios comparing the key product characteristics by performing several choice tasks. Analyzing DCE data needs to account for within-participant correlation because choices from the same participant are likely to be similar. In this study, we empirically compared some commonly-used statistical methods for analyzing DCE data while accounting for within-participant correlation based on a survey of patient preference for colorectal cancer (CRC) screening tests conducted in Hamilton, Ontario, Canada in 2002. Methods A two-stage DCE design was used to investigate the impact of six attributes on participants' preferences for CRC screening test and willingness to undertake the test. We compared six models for clustered binary outcomes (logistic and probit regressions using cluster-robust standard error (SE), random-effects and generalized estimating equation approaches) and three models for clustered nominal outcomes (multinomial logistic and probit regressions with cluster-robust SE and random-effects multinomial logistic model). We also fitted a bivariate probit model with cluster-robust SE treating the choices from two stages as two correlated binary outcomes. The rank of relative importance between attributes and the estimates of β coefficient within attributes were used to assess the model robustness. Results In total 468 participants with each completing 10 choices were analyzed. Similar results were reported for the rank of relative importance and β coefficients across models for stage-one data on evaluating participants' preferences for the test. The six attributes ranked from high to low as follows: cost, specificity, process, sensitivity, preparation and pain. However, the results differed across models for stage-two data on evaluating participants' willingness to undertake the tests. Little within-patient correlation (ICC ≈ 0) was found in stage-one data, but substantial within-patient correlation existed (ICC = 0.659) in stage-two data. Conclusions When small clustering effect presented in DCE data, results remained robust across statistical models. However, results varied when larger clustering effect presented. Therefore, it is important to assess the robustness of the estimates via sensitivity analysis using different models for analyzing clustered data from DCE studies. PMID:22348526
NASA Astrophysics Data System (ADS)
Krogh-Madsen, Trine; Kold Taylor, Louise; Skriver, Anne D.; Schaffer, Peter; Guevara, Michael R.
2017-09-01
The transmembrane potential is recorded from small isopotential clusters of 2-4 embryonic chick ventricular cells spontaneously generating action potentials. We analyze the cycle-to-cycle fluctuations in the time between successive action potentials (the interbeat interval or IBI). We also convert an existing model of electrical activity in the cluster, which is formulated as a Hodgkin-Huxley-like deterministic system of nonlinear ordinary differential equations describing five individual ionic currents, into a stochastic model consisting of a population of ˜20 000 independently and randomly gating ionic channels, with the randomness being set by a real physical stochastic process (radio static). This stochastic model, implemented using the Clay-DeFelice algorithm, reproduces the fluctuations seen experimentally: e.g., the coefficient of variation (standard deviation/mean) of IBI is 4.3% in the model vs. the 3.9% average value of the 17 clusters studied. The model also replicates all but one of several other quantitative measures of the experimental results, including the power spectrum and correlation integral of the voltage, as well as the histogram, Poincaré plot, serial correlation coefficients, power spectrum, detrended fluctuation analysis, approximate entropy, and sample entropy of IBI. The channel noise from one particular ionic current (IKs), which has channel kinetics that are relatively slow compared to that of the other currents, makes the major contribution to the fluctuations in IBI. Reproduction of the experimental coefficient of variation of IBI by adding a Gaussian white noise-current into the deterministic model necessitates using an unrealistically high noise-current amplitude. Indeed, a major implication of the modelling results is that, given the wide range of time-scales over which the various species of channels open and close, only a cell-specific stochastic model that is formulated taking into consideration the widely different ranges in the frequency content of the channel-noise produced by the opening and closing of several different types of channels will be able to reproduce precisely the various effects due to membrane noise seen in a particular electrophysiological preparation.
Generalized parastatistical systems
NASA Astrophysics Data System (ADS)
Satriawan, Mirza
2002-01-01
In the first chapter we consider systems of n-identical particles whose Hilbert spaces are invariant under the "particle permutation group" Sn, and which obey cluster decomposition. The classification of such systems by Hartle, Stolt, and Taylor in terms of state symmetry types is used, together with an additional classification based on the allowed observables in the system. We have indistinguishable (p, q)- statistics, indistinguishable infinite statistics, distinguishable (p, q)-statistics, and distinguishable infinite statistics. We refer to all of these as generalized parastatistical systems. We obtain a closed form for the grand canonical partition function (GCPF) for a non-interacting gas of particles obeying indistinguishable ( p, q)-statistics (for any p and q). As a special case we have the GCPF for the usual parabose statistics of order p, solving a 50 year old problem. Except for indistinguishable (1,1)-statistics, our results are not suitable for calculating the GCPF in the continuum energy limit. However, for indistinguishable (1,1)-statistics we calculate the continuum limit and obtain some simple thermodynamic results. In particular, we show that a system of free particles in any spatial dimension d ≥ 2 that obeys indistinguishable (1,1)-statistics will exhibit Bose-like condensation. In the second chapter we consider systems similar to those in the first chapter, but now assuming that the GCPF factorizes so that we obtain an extensive system. It turns out that having such an extensive GCPF is equivalent to the factorization of the counting function, and also to the factorization of the cluster coefficients and to the strong cluster condition on the counting coefficients. We calculate several simple thermodynamic quantities for such systems, where the results are given in terms of the cluster coefficients. In the third chapter, we give a second quantized realization of generalized parastatistical systems in the form of scalar product requirements on the Fock space F . We also give realizations of these scalar product requirements in terms of creation and annihilation operator algebras, and find that the Govorkov algebra and Greenberg's q-mutator algebra for q = 0 are the only ones from the literature that correspond to special cases of our systems.
Accreting Black Hole Binaries in Globular Clusters
NASA Astrophysics Data System (ADS)
Kremer, Kyle; Chatterjee, Sourav; Rodriguez, Carl L.; Rasio, Frederic A.
2018-01-01
We explore the formation of mass-transferring binary systems containing black holes (BHs) within globular clusters (GC). We show that it is possible to form mass-transferring BH binaries with main sequence, giant, and white dwarf companions with a variety of orbital parameters in GCs spanning a large range in present-day properties. All mass-transferring BH binaries found in our models at late times are dynamically created. The BHs in these systems experienced a median of ∼30 dynamical encounters within the cluster before and after acquiring the donor. Furthermore, we show that the presence of mass-transferring BH systems has little correlation with the total number of BHs within the cluster at any time. This is because the net rate of formation of BH–non-BH binaries in a cluster is largely independent of the total number of retained BHs. Our results suggest that the detection of a mass-transferring BH binary in a GC does not necessarily indicate that the host cluster contains a large BH population.
Large Scale Structure Studies: Final Results from a Rich Cluster Redshift Survey
NASA Astrophysics Data System (ADS)
Slinglend, K.; Batuski, D.; Haase, S.; Hill, J.
1995-12-01
The results from the COBE satellite show the existence of structure on scales on the order of 10% or more of the horizon scale of the universe. Rich clusters of galaxies from the Abell-ACO catalogs show evidence of structure on scales of 100 Mpc and hold the promise of confirming structure on the scale of the COBE result. Unfortunately, until now, redshift information has been unavailable for a large percentage of these clusters, so present knowledge of their three dimensional distribution has quite large uncertainties. Our approach in this effort has been to use the MX multifiber spectrometer on the Steward 2.3m to measure redshifts of at least ten galaxies in each of 88 Abell cluster fields with richness class R>= 1 and mag10 <= 16.8 (estimated z<= 0.12) and zero or one measured redshifts. This work has resulted in a deeper, 95% complete and more reliable sample of 3-D positions of rich clusters. The primary intent of this survey has been to constrain theoretical models for the formation of the structure we see in the universe today through 2-pt. spatial correlation function and other analyses of the large scale structures traced by these clusters. In addition, we have obtained enough redshifts per cluster to greatly improve the quality and size of the sample of reliable cluster velocity dispersions available for use in other studies of cluster properties. This new data has also allowed the construction of an updated and more reliable supercluster candidate catalog. Our efforts have resulted in effectively doubling the volume traced by these clusters. Presented here is the resulting 2-pt. spatial correlation function, as well as density plots and several other figures quantifying the large scale structure from this much deeper and complete sample. Also, with 10 or more redshifts in most of our cluster fields, we have investigated the extent of projection effects within the Abell catalog in an effort to quantify and understand how this may effect the Abell sample.
NASA Astrophysics Data System (ADS)
Yamashita, S.; Nakajo, T.; Naruse, H.
2009-12-01
In this study, we statistically classified the grain size distribution of the bottom surface sediment on a microtidal sand flat to analyze the depositional processes of the sediment. Multiple classification analysis revealed that two types of sediment populations exist in the bottom surface sediment. Then, we employed the sediment trend model developed by Gao and Collins (1992) for the estimation of sediment transport pathways. As a result, we found that statistical discrimination of the bottom surface sediment provides useful information for the sediment trend model while dealing with various types of sediment transport processes. The microtidal sand flat along the Kushida River estuary, Ise Bay, central Japan, was investigated, and 102 bottom surface sediment samples were obtained. Then, their grain size distribution patterns were measured by the settling tube method, and each grain size distribution parameter (mud and gravel contents, mean grain size, coefficient of variance (CV), skewness, kurtosis, 5, 25, 50, 75, and 95 percentile) was calculated. Here, CV is the normalized sorting value divided by the mean grain size. Two classical statistical methods—principal component analysis (PCA) and fuzzy cluster analysis—were applied. The results of PCA showed that the bottom surface sediment of the study area is mainly characterized by grain size (mean grain size and 5-95 percentile) and the CV value, indicating predominantly large absolute values of factor loadings in primal component (PC) 1. PC1 is interpreted as being indicative of the grain-size trend, in which a finer grain-size distribution indicates better size sorting. The frequency distribution of PC1 has a bimodal shape and suggests the existence of two types of sediment populations. Therefore, we applied fuzzy cluster analysis, the results of which revealed two groupings of the sediment (Cluster 1 and Cluster 2). Cluster 1 shows a lower value of PC1, indicating coarse and poorly sorted sediments. Cluster 1 sediments are distributed around the branched channel from Kushida River and show an expanding distribution from the river mouth toward the northeast direction. Cluster 2 shows a higher value of PC1, indicating fine and well-sorted sediments; this cluster is distributed in a distant area from the river mouth, including the offshore region. Therefore, Cluster 1 and Cluster 2 are interpreted as being deposited by fluvial and wave processes, respectively. Finally, on the basis of this distribution pattern, the sediment trend model was applied in areas dominated separately by fluvial and wave processes. Resultant sediment transport patterns showed good agreement with those obtained by field observations. The results of this study provide an important insight into the numerical models of sediment transport.
Approximate kernel competitive learning.
Wu, Jian-Sheng; Zheng, Wei-Shi; Lai, Jian-Huang
2015-03-01
Kernel competitive learning has been successfully used to achieve robust clustering. However, kernel competitive learning (KCL) is not scalable for large scale data processing, because (1) it has to calculate and store the full kernel matrix that is too large to be calculated and kept in the memory and (2) it cannot be computed in parallel. In this paper we develop a framework of approximate kernel competitive learning for processing large scale dataset. The proposed framework consists of two parts. First, it derives an approximate kernel competitive learning (AKCL), which learns kernel competitive learning in a subspace via sampling. We provide solid theoretical analysis on why the proposed approximation modelling would work for kernel competitive learning, and furthermore, we show that the computational complexity of AKCL is largely reduced. Second, we propose a pseudo-parallelled approximate kernel competitive learning (PAKCL) based on a set-based kernel competitive learning strategy, which overcomes the obstacle of using parallel programming in kernel competitive learning and significantly accelerates the approximate kernel competitive learning for large scale clustering. The empirical evaluation on publicly available datasets shows that the proposed AKCL and PAKCL can perform comparably as KCL, with a large reduction on computational cost. Also, the proposed methods achieve more effective clustering performance in terms of clustering precision against related approximate clustering approaches. Copyright © 2014 Elsevier Ltd. All rights reserved.
Clustering high dimensional data using RIA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Aziz, Nazrina
2015-05-15
Clustering may simply represent a convenient method for organizing a large data set so that it can easily be understood and information can efficiently be retrieved. However, identifying cluster in high dimensionality data sets is a difficult task because of the curse of dimensionality. Another challenge in clustering is some traditional functions cannot capture the pattern dissimilarity among objects. In this article, we used an alternative dissimilarity measurement called Robust Influence Angle (RIA) in the partitioning method. RIA is developed using eigenstructure of the covariance matrix and robust principal component score. We notice that, it can obtain cluster easily andmore » hence avoid the curse of dimensionality. It is also manage to cluster large data sets with mixed numeric and categorical value.« less
Hydration of a Large Anionic Charge Distribution - Naphthalene-Water Cluster Anions
NASA Astrophysics Data System (ADS)
Weber, J. Mathias; Adams, Christopher L.
2010-06-01
We report the infrared spectra of anionic clusters of naphthalene with up to three water molecules. Comparison of the experimental infrared spectra with theoretically predicted spectra from quantum chemistry calculations allow conclusions regarding the structures of the clusters under study. The first water molecule forms two hydrogen bonds with the π electron system of the naphthalene moiety. Subsequent water ligands interact with both the naphthalene and the other water ligands to form hydrogen bonded networks, similar to other hydrated anion clusters. Naphthalene-water anion clusters illustrate how water interacts with negative charge delocalized over a large π electron system. The clusters are interesting model systems that are discussed in the context of wetting of graphene surfaces and polyaromatic hydrocarbons.
A Genetic Algorithm That Exchanges Neighboring Centers for Fuzzy c-Means Clustering
ERIC Educational Resources Information Center
Chahine, Firas Safwan
2012-01-01
Clustering algorithms are widely used in pattern recognition and data mining applications. Due to their computational efficiency, partitional clustering algorithms are better suited for applications with large datasets than hierarchical clustering algorithms. K-means is among the most popular partitional clustering algorithm, but has a major…
Statistical Analysis of Large Scale Structure by the Discrete Wavelet Transform
NASA Astrophysics Data System (ADS)
Pando, Jesus
1997-10-01
The discrete wavelet transform (DWT) is developed as a general statistical tool for the study of large scale structures (LSS) in astrophysics. The DWT is used in all aspects of structure identification including cluster analysis, spectrum and two-point correlation studies, scale-scale correlation analysis and to measure deviations from Gaussian behavior. The techniques developed are demonstrated on 'academic' signals, on simulated models of the Lymanα (Lyα) forests, and on observational data of the Lyα forests. This technique can detect clustering in the Ly-α clouds where traditional techniques such as the two-point correlation function have failed. The position and strength of these clusters in both real and simulated data is determined and it is shown that clusters exist on scales as large as at least 20 h-1 Mpc at significance levels of 2-4 σ. Furthermore, it is found that the strength distribution of the clusters can be used to distinguish between real data and simulated samples even where other traditional methods have failed to detect differences. Second, a method for measuring the power spectrum of a density field using the DWT is developed. All common features determined by the usual Fourier power spectrum can be calculated by the DWT. These features, such as the index of a power law or typical scales, can be detected even when the samples are geometrically complex, the samples are incomplete, or the mean density on larger scales is not known (the infrared uncertainty). Using this method the spectra of Ly-α forests in both simulated and real samples is calculated. Third, a method for measuring hierarchical clustering is introduced. Because hierarchical evolution is characterized by a set of rules of how larger dark matter halos are formed by the merging of smaller halos, scale-scale correlations of the density field should be one of the most sensitive quantities in determining the merging history. We show that these correlations can be completely determined by the correlations between discrete wavelet coefficients on adjacent scales and at nearly the same spatial position, Cj,j+12/cdot2. Scale-scale correlations on two samples of the QSO Ly-α forests absorption spectra are computed. Lastly, higher order statistics are developed to detect deviations from Gaussian behavior. These higher order statistics are necessary to fully characterize the Ly-α forests because the usual 2nd order statistics, such as the two-point correlation function or power spectrum, give inconclusive results. It is shown how this technique takes advantage of the locality of the DWT to circumvent the central limit theorem. A non-Gaussian spectrum is defined and this spectrum reveals not only the magnitude, but the scales of non-Gaussianity. When applied to simulated and observational samples of the Ly-α clouds, it is found that different popular models of structure formation have different spectra while two, independent observational data sets, have the same spectra. Moreover, the non-Gaussian spectra of real data sets are significantly different from the spectra of various possible random samples. (Abstract shortened by UMI.)
NASA Astrophysics Data System (ADS)
Choi, Hyun-Jung; Lee, Hwa Woon; Sung, Kyoung-Hee; Kim, Min-Jung; Kim, Yoo-Keun; Jung, Woo-Sik
In order to incorporate correctly the large or local scale circulation in the model, a nudging term is introduced into the equation of motion. Nudging effects should be included properly in the model to reduce the uncertainties and improve the air flow field. To improve the meteorological components, the nudging coefficient should perform the adequate influence on complex area for the model initialization technique which related to data reliability and error suppression. Several numerical experiments have been undertaken in order to evaluate the effects on air quality modeling by comparing the performance of the meteorological result with variable nudging coefficient experiment. All experiments are calculated by the upper wind conditions (synoptic or asynoptic condition), respectively. Consequently, it is important to examine the model response to nudging effect of wind and mass information. The MM5-CMAQ model was used to assess the ozone differences in each case, during the episode day in Seoul, Korea and we revealed that there were large differences in the ozone concentration for each run. These results suggest that for the appropriate simulation of large or small-scale circulations, nudging considering the synoptic and asynoptic nudging coefficient does have a clear advantage over dynamic initialization, so appropriate limitation of these nudging coefficient values on its upper wind conditions is necessary before making an assessment. The statistical verifications showed that adequate nudging coefficient for both wind and temperature data throughout the model had a consistently positive impact on the atmospheric and air quality field. On the case dominated by large-scale circulation, a large nudging coefficient shows a minor improvement in the atmospheric and air quality field. However, when small-scale convection is present, the large nudging coefficient produces consistent improvement in the atmospheric and air quality field.
DICON: interactive visual analysis of multidimensional clusters.
Cao, Nan; Gotz, David; Sun, Jimeng; Qu, Huamin
2011-12-01
Clustering as a fundamental data analysis technique has been widely used in many analytic applications. However, it is often difficult for users to understand and evaluate multidimensional clustering results, especially the quality of clusters and their semantics. For large and complex data, high-level statistical information about the clusters is often needed for users to evaluate cluster quality while a detailed display of multidimensional attributes of the data is necessary to understand the meaning of clusters. In this paper, we introduce DICON, an icon-based cluster visualization that embeds statistical information into a multi-attribute display to facilitate cluster interpretation, evaluation, and comparison. We design a treemap-like icon to represent a multidimensional cluster, and the quality of the cluster can be conveniently evaluated with the embedded statistical information. We further develop a novel layout algorithm which can generate similar icons for similar clusters, making comparisons of clusters easier. User interaction and clutter reduction are integrated into the system to help users more effectively analyze and refine clustering results for large datasets. We demonstrate the power of DICON through a user study and a case study in the healthcare domain. Our evaluation shows the benefits of the technique, especially in support of complex multidimensional cluster analysis. © 2011 IEEE
Efficiency of parallel direct optimization
NASA Technical Reports Server (NTRS)
Janies, D. A.; Wheeler, W. C.
2001-01-01
Tremendous progress has been made at the level of sequential computation in phylogenetics. However, little attention has been paid to parallel computation. Parallel computing is particularly suited to phylogenetics because of the many ways large computational problems can be broken into parts that can be analyzed concurrently. In this paper, we investigate the scaling factors and efficiency of random addition and tree refinement strategies using the direct optimization software, POY, on a small (10 slave processors) and a large (256 slave processors) cluster of networked PCs running LINUX. These algorithms were tested on several data sets composed of DNA and morphology ranging from 40 to 500 taxa. Various algorithms in POY show fundamentally different properties within and between clusters. All algorithms are efficient on the small cluster for the 40-taxon data set. On the large cluster, multibuilding exhibits excellent parallel efficiency, whereas parallel building is inefficient. These results are independent of data set size. Branch swapping in parallel shows excellent speed-up for 16 slave processors on the large cluster. However, there is no appreciable speed-up for branch swapping with the further addition of slave processors (>16). This result is independent of data set size. Ratcheting in parallel is efficient with the addition of up to 32 processors in the large cluster. This result is independent of data set size. c2001 The Willi Hennig Society.
Cluster-to-cluster transformation among Au6, Au8 and Au11 nanoclusters.
Ren, Xiuqing; Fu, Junhong; Lin, Xinzhang; Fu, Xuemei; Yan, Jinghui; Wu, Ren'an; Liu, Chao; Huang, Jiahui
2018-05-22
We present the cluster-to-cluster transformations among three gold nanoclusters, [Au6(dppp)4]2+ (Au6), [Au8(dppp)4Cl2]2+ (Au8) and [Au11(dppp)5]3+ (Au11). The conversion process follows a rule that states that the transformation of a small cluster to a large cluster is achieved through an oxidation process with an oxidizing agent (H2O2) or with heating, while the conversion of a large cluster to a small one occurs through a reduction process with a reducing agent (NaBH4). All the reactions were monitored using UV-Vis spectroscopy and ESI-MS. This work may provide an alternative approach to the synthesis of novel gold nanoclusters and a further understanding of the structural transformation relationship of gold nanoclusters.
NASA Technical Reports Server (NTRS)
Romanishin, W.
1988-01-01
Preliminary results are given for a program to measure color gradients in the central galaxies in clusters with a variety of cooling flow rates. The objectives are to search for extended blue continuum regions indicative of star formation, to study the spatial distribution of star formation, and to make a quantitative measure of the amount of light from young stars, which can lead to a measure of the star formation rate (for an assumed initial mass function). Four clusters with large masses and large cluster H-alpha emission fluxes are found to have an excess of blue light concentrated to the centers of the cluster central galaxy. Assumption of a disk IMF leads to the conclusion that the starlight might play a major role in ionizing the emission line gas in these clusters.
Mapping Dark Matter in Simulated Galaxy Clusters
NASA Astrophysics Data System (ADS)
Bowyer, Rachel
2018-01-01
Galaxy clusters are the most massive bound objects in the Universe with most of their mass being dark matter. Cosmological simulations of structure formation show that clusters are embedded in a cosmic web of dark matter filaments and large scale structure. It is thought that these filaments are found preferentially close to the long axes of clusters. We extract galaxy clusters from the simulations "cosmo-OWLS" in order to study their properties directly and also to infer their properties from weak gravitational lensing signatures. We investigate various stacking procedures to enhance the signal of the filaments and large scale structure surrounding the clusters to better understand how the filaments of the cosmic web connect with galaxy clusters. This project was supported in part by the NSF REU grant AST-1358980 and by the Nantucket Maria Mitchell Association.
Small-scale Conformity of the Virgo Cluster Galaxies
NASA Astrophysics Data System (ADS)
Lee, Hye-Ran; Lee, Joon Hyeop; Jeong, Hyunjin; Park, Byeong-Gon
2016-06-01
We investigate the small-scale conformity in color between bright galaxies and their faint companions in the Virgo Cluster. Cluster member galaxies are spectroscopically determined using the Extended Virgo Cluster Catalog and the Sloan Digital Sky Survey Data Release 12. We find that the luminosity-weighted mean color of faint galaxies depends on the color of adjacent bright galaxy as well as on the cluster-scale environment (gravitational potential index). From this result for the entire area of the Virgo Cluster, it is not distinguishable whether the small-scale conformity is genuine or if it is artificially produced due to cluster-scale variation of galaxy color. To disentangle this degeneracy, we divide the Virgo Cluster area into three sub-areas so that the cluster-scale environmental dependence is minimized: A1 (central), A2 (intermediate), and A3 (outermost). We find conformity in color between bright galaxies and their faint companions (color-color slope significance S ˜ 2.73σ and correlation coefficient {cc}˜ 0.50) in A2, where the cluster-scale environmental dependence is almost negligible. On the other hand, the conformity is not significant or very marginal (S ˜ 1.75σ and {cc}˜ 0.27) in A1. The conformity is not significant either in A3 (S ˜ 1.59σ and {cc}˜ 0.44), but the sample size is too small in this area. These results are consistent with a scenario in which the small-scale conformity in a cluster is a vestige of infallen groups and these groups lose conformity as they come closer to the cluster center.
A cluster merging method for time series microarray with production values.
Chira, Camelia; Sedano, Javier; Camara, Monica; Prieto, Carlos; Villar, Jose R; Corchado, Emilio
2014-09-01
A challenging task in time-course microarray data analysis is to cluster genes meaningfully combining the information provided by multiple replicates covering the same key time points. This paper proposes a novel cluster merging method to accomplish this goal obtaining groups with highly correlated genes. The main idea behind the proposed method is to generate a clustering starting from groups created based on individual temporal series (representing different biological replicates measured in the same time points) and merging them by taking into account the frequency by which two genes are assembled together in each clustering. The gene groups at the level of individual time series are generated using several shape-based clustering methods. This study is focused on a real-world time series microarray task with the aim to find co-expressed genes related to the production and growth of a certain bacteria. The shape-based clustering methods used at the level of individual time series rely on identifying similar gene expression patterns over time which, in some models, are further matched to the pattern of production/growth. The proposed cluster merging method is able to produce meaningful gene groups which can be naturally ranked by the level of agreement on the clustering among individual time series. The list of clusters and genes is further sorted based on the information correlation coefficient and new problem-specific relevant measures. Computational experiments and results of the cluster merging method are analyzed from a biological perspective and further compared with the clustering generated based on the mean value of time series and the same shape-based algorithm.
Galaxy clusters and cold dark matter - A low-density unbiased universe?
NASA Technical Reports Server (NTRS)
Bahcall, Neta A.; Cen, Renyue
1992-01-01
Large-scale simulations of a universe dominated by cold dark matter (CDM) are tested against two fundamental properties of clusters of galaxies: the cluster mass function and the cluster correlation function. We find that standard biased CDM models are inconsistent with these observations for any bias parameter b. A low-density, low-bias CDM-type model, with or without a cosmological constant, appears to be consistent with both the cluster mass function and the cluster correlations. The low-density model agrees well with the observed correlation function of the Abell, Automatic Plate Measuring Facility (APM), and Edinburgh-Durham cluster catalogs. The model is in excellent agreement with the observed dependence of the correlation strength on cluster mean separation, reproducing the measured universal dimensionless cluster correlation. The low-density model is also consistent with other large-scale structure observations, including the APM angular galaxy-correlations, and for lambda = 1-Omega with the COBE results of the microwave background radiation fluctuations.
Coherent Power Analysis in Multilevel Studies Using Parameters from Surveys
ERIC Educational Resources Information Center
Rhoads, Christopher
2017-01-01
Researchers designing multisite and cluster randomized trials of educational interventions will usually conduct a power analysis in the planning stage of the study. To conduct the power analysis, researchers often use estimates of intracluster correlation coefficients and effect sizes derived from an analysis of survey data. When there is…
USDA-ARS?s Scientific Manuscript database
The analyses methods of Pearson correlation coefficient (PCC), hierarchical cluster analysis and diversity index were used to study the relevance between citrus huanglongbing (HLB) and the endophytic bacteria in different branches and leaves as well as roots of huanglongbing (HLB)-affected citrus tr...
Armah, Frederick A; Obiri, Samuel; Yawson, David O; Onumah, Edward E; Yengoh, Genesis T; Afrifa, Ernest K A; Odoi, Justice O
2010-11-01
The levels of heavy metals in surface water and their potential origin (natural and anthropogenic) were respectively determined and analysed for the Obuasi mining area in Ghana. Using Hawth's tool an extension in ArcGIS 9.2 software, a total of 48 water sample points in Obuasi and its environs were randomly selected for study. The magnitude of As, Cu, Mn, Fe, Pb, Hg, Zn and Cd in surface water from the sampling sites were measured by flame Atomic Absorption Spectrophotometry (AAS). Water quality parameters including conductivity, pH, total dissolved solids and turbidity were also evaluated. Principal component analysis and cluster analysis, coupled with correlation coefficient analysis, were used to identify possible sources of these heavy metals. Pearson correlation coefficients among total metal concentrations and selected water properties showed a number of strong associations. The results indicate that apart from tap water, surface water in Obuasi has elevated heavy metal concentrations, especially Hg, Pb, As, Cu and Cd, which are above the Ghana Environmental Protection Agency (GEPA) and World Health Organisation (WHO) permissible levels; clearly demonstrating anthropogenic impact. The mean heavy metal concentrations in surface water divided by the corresponding background values of surface water in Obuasi decrease in the order of Cd > Cu > As > Pb > Hg > Zn > Mn > Fe. The results also showed that Cu, Mn, Cd and Fe are largely responsible for the variations in the data, explaining 72% of total variance; while Pb, As and Hg explain only 18.7% of total variance. Three main sources of these heavy metals were identified. As originates from nature (oxidation of sulphide minerals particularly arsenopyrite-FeAsS). Pb derives from water carrying drainage from towns and mine machinery maintenance yards. Cd, Zn, Fe and Mn mainly emanate from industry sources. Hg mainly originates from artisanal small-scale mining. It cannot be said that the difference in concentration of heavy metals might be attributed to difference in proximity to mining-related activities because this is inconsistent with the cluster analysis. Based on cluster analysis SN32, SN42 and SN43 all belong to group one and are spatially similar. But the maximum Cu concentration was found in SN32 while the minimum Cu concentration was found in SN42 and SN43.
Effect of dose and size on defect engineering in carbon cluster implanted silicon wafers
NASA Astrophysics Data System (ADS)
Okuyama, Ryosuke; Masada, Ayumi; Shigematsu, Satoshi; Kadono, Takeshi; Hirose, Ryo; Koga, Yoshihiro; Okuda, Hidehiko; Kurita, Kazunari
2018-01-01
Carbon-cluster-ion-implanted defects were investigated by high-resolution cross-sectional transmission electron microscopy toward achieving high-performance CMOS image sensors. We revealed that implantation damage formation in the silicon wafer bulk significantly differs between carbon-cluster and monomer ions after implantation. After epitaxial growth, small and large defects were observed in the implanted region of carbon clusters. The electron diffraction pattern of both small and large defects exhibits that from bulk crystalline silicon in the implanted region. On the one hand, we assumed that the silicon carbide structure was not formed in the implanted region, and small defects formed because of the complex of carbon and interstitial silicon. On the other hand, large defects were hypothesized to originate from the recrystallization of the amorphous layer formed by high-dose carbon-cluster implantation. These defects are considered to contribute to the powerful gettering capability required for high-performance CMOS image sensors.
Aikawa, Ken; Kataoka, Masao; Ogawa, Soichiro; Akaihata, Hidenori; Sato, Yuichi; Yabe, Michihiro; Hata, Junya; Koguchi, Tomoyuki; Kojima, Yoshiyuki; Shiragasawa, Chihaya; Kobayashi, Toshimitsu; Yamaguchi, Osamu
2015-08-01
To present a new grouping of male patients with lower urinary tract symptoms (LUTS) based on symptom patterns and clarify whether the therapeutic effect of α1-blocker differs among the groups. We performed secondary analysis of anonymous data from 4815 patients enrolled in a postmarketing surveillance study of tamsulosin in Japan. Data on 7 International Prostate Symptom Score (IPSS) items at the initial visit were used in the cluster analysis. IPSS and quality of life (QOL) scores before and after tamsulosin treatment for 12 weeks were assessed in each cluster. Partial correlation coefficients were also obtained for IPSS and QOL scores based on changes before and after treatment. Five symptom groups were identified by cluster analysis of IPSS. On their symptom profile, each cluster was labeled as minimal type (cluster 1), multiple severe type (cluster 2), weak stream type (cluster 3), storage type (cluster 4), and voiding type (cluster 5). Prevalence and the mean symptom score were significantly improved in almost all symptoms in all clusters by tamsulosin treatment. Nocturia and weak stream had the strongest effect on QOL in clusters 1, 2, and 4 and clusters 3 and 5, respectively. The study clarified that 5 characteristic symptom patterns exist by cluster analysis of IPSS in male patients with LUTS. Tamsulosin improved various symptoms and QOL in each symptom group. The study reports many male patients with LUTS being satisfied with monotherapy using tamsulosin and suggests the usefulness of α1-blockers as a drug of first choice. Copyright © 2015 Elsevier Inc. All rights reserved.
Where are Low Mass X-ray Binaries Formed?
NASA Astrophysics Data System (ADS)
Kundu, A.; Maccarone, T. J.; Zepf, S. E.
2004-08-01
Chandra images of nearby galaxies reveal large numbers of low mass X-ray binaries (LMXBs). As in the Galaxy, a significant fraction of these are associated with globular clusters. We exploit the LMXB-globular cluster link in order to probe both the physical properties of globular clusters that promote the formation of LMXBs within clusters with specific characteristics, and to study whether the non-cluster field LMXB population was originally formed in clusters and then released into the field. The large population of globular clusters around nearby galaxies and the range of properties such as age, metallicity and host galaxy environment spanned by these objects enables us to identify and probe the link between these characteristics and the formation of LMXBs. We present the results of our study of a large sample of elliptical and S0 galaxies which reveals among other things that bright LMXBs definitively prefer metal-rich cluster hosts and that this relationship is unlikely to be driven by age effects. The ancestry of the non-cluster field LMXBs is a matter of some debate with suggestions that they they might have formed in the field, or created in globular clusters and then subsequently released into the field either by being ejected from clusters by dynamical processes or as remnants of dynamically destroyed clusters. Each of these scenarios has a specific spatial signature that can be tested by our combined optical and X-ray study. Furthermore, these scenarios predict additional statistical variations that may be driven by the specific host galaxy environment. We present a detailed analysis of our sample galaxies and comment on the probability that the field sources were actually formed in clusters.
Chemistry and Physics of Solid Surfaces 5
1984-04-01
associated with dimers and trimers, Type 2 particles form large clusters of 2000-5000 A size in aqueous solution. Luminescence studies carried out with...and rates of energy transfer, real time measurements using ultrashort laser pulses hold great promise. With the possible exception of the stimulated...the dynamic prop- erties of such clusters . The clusters are not stationary entities as origi- nally envisioned. Instead even fairly large aggregates
Genetic diversity and patterns of population structure in Creole goats from the Americas.
Ginja, C; Gama, L T; Martínez, A; Sevane, N; Martin-Burriel, I; Lanari, M R; Revidatti, M A; Aranguren-Méndez, J A; Bedotti, D O; Ribeiro, M N; Sponenberg, P; Aguirre, E L; Alvarez-Franco, L A; Menezes, M P C; Chacón, E; Galarza, A; Gómez-Urviola, N; Martínez-López, O R; Pimenta-Filho, E C; da Rocha, L L; Stemmer, A; Landi, V; Delgado-Bermejo, J V
2017-06-01
Biodiversity studies are more efficient when large numbers of breeds belonging to several countries are involved, as they allow for an in-depth analysis of the within- and between-breed components of genetic diversity. A set of 21 microsatellites was used to investigate the genetic composition of 24 Creole goat breeds (910 animals) from 10 countries to estimate levels of genetic variability, infer population structure and understand genetic relationships among populations across the American continent. Three commercial transboundary breeds were included in the analyses to investigate admixture with Creole goats. Overall, the genetic diversity of Creole populations (mean number of alleles = 5.82 ± 1.14, observed heterozygosity = 0.585 ± 0.074) was moderate and slightly lower than what was detected in other studies with breeds from other regions. The Bayesian clustering analysis without prior information on source populations identified 22 breed clusters. Three groups comprised more than one population, namely from Brazil (Azul and Graúna; Moxotó and Repartida) and Argentina (Long and shorthair Chilluda, Pampeana Colorada and Angora-type goat). Substructure was found in Criolla Paraguaya. When prior information on sample origin was considered, 92% of the individuals were assigned to the source population (threshold q ≥ 0.700). Creole breeds are well-differentiated entities (mean coefficient of genetic differentiation = 0.111 ± 0.048, with the exception of isolated island populations). Dilution from admixture with commercial transboundary breeds appears to be negligible. Significant levels of inbreeding were detected (inbreeding coefficient > 0 in most Creole goat populations, P < 0.05). Our results provide a broad perspective on the extant genetic diversity of Creole goats, however further studies are needed to understand whether the observed geographical patterns of population structure may reflect the mode of goat colonization in the Americas. © 2017 Stichting International Foundation for Animal Genetics.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Congyao; Yu, Qingjuan; Lu, Youjun, E-mail: yuqj@pku.edu.cn
2014-12-01
Observations reveal that the peaks of the X-ray map and the Sunyaev-Zel'dovich (SZ) effect map of some galaxy clusters are offset from each other. In this paper, we perform a set of hydrodynamical simulations of mergers of two galaxy clusters to investigate the spatial offset between the maxima of the X-ray and the SZ surface brightness of the merging clusters. We find that significantly large SZ-X-ray offsets (>100 kpc) can be produced during the major mergers of galaxy clusters (with mass > 1 × 10{sup 14} M {sub ☉}). The significantly large offsets are mainly caused by a 'jump effect'more » that occurs between the primary and secondary pericentric passages of the two merging clusters, during which the X-ray peak may jump to the densest gas region located near the center of the small cluster, but the SZ peak remains near the center of the large one. Our simulations show that merging systems with higher masses and larger initial relative velocities may result in larger offset sizes and longer offset time durations; and only nearly head-on mergers are likely to produce significantly large offsets. We further investigate the statistical distribution of the SZ-X-ray offset sizes and find that (1) the number distribution of the offset sizes is bimodal with one peak located at low offsets ∼0 and the other at large offsets ∼350-450 h {sup –1} kpc, but the objects with intermediate offsets are scarce; and (2) the probabilities of the clusters in the mass range higher than 2 × 10{sup 14} h {sup –1} M {sub ☉} that have offsets larger than 20, 50, 200, 300, and 500 h {sup –1} kpc are 34.0%, 11.1%, 8.0%, 6.5%, and 2.0%, respectively, at z = 0.7. The probability is sensitive to the underlying pairwise velocity distribution and the merger rate of clusters. We suggest that the SZ-X-ray offsets provide a probe to the cosmic velocity fields on the cluster scale and the cluster merger rate, and future observations on the SZ-X-ray offsets for a large number of clusters may put strong constraints on them. Our simulation results suggest that the SZ-X-ray offset in the Bullet Cluster, together with the mass ratio of the two merging clusters, requires a relative velocity larger than 3000 km s{sup –1} at an initial separation 5 Mpc. The cosmic velocity distribution at the high-velocity end is expected to be crucial in determining whether there exists an incompatibility between the existence of the Bullet Cluster and the prediction of a ΛCDM model.« less
Pego-Reigosa, José María; Lois-Iglesias, Ana; Rúa-Figueroa, Íñigo; Galindo, María; Calvo-Alén, Jaime; de Uña-Álvarez, Jacobo; Balboa-Barreiro, Vanessa; Ibáñez Ruan, Jesús; Olivé, Alejandro; Rodríguez-Gómez, Manuel; Fernández Nebro, Antonio; Andrés, Mariano; Erausquin, Celia; Tomero, Eva; Horcada Rubio, Loreto; Uriarte Isacelaya, Esther; Freire, Mercedes; Montilla, Carlos; Sánchez-Atrio, Ana I; Santos-Soler, Gregorio; Zea, Antonio; Díez, Elvira; Narváez, Javier; Blanco-Alonso, Ricardo; Silva-Fernández, Lucía; Ruiz-Lucea, María Esther; Fernández-Castro, Mónica; Hernández-Beriain, José Ángel; Gantes-Mora, Marian; Hernández-Cruz, Blanca; Pérez-Venegas, José; Pecondón-Español, Ángela; Marras Fernández-Cid, Carlos; Ibáñez-Barcelo, Mónica; Bonilla, Gema; Torrente-Segarra, Vicenç; Castellví, Iván; Alegre, Juan José; Calvet, Joan; Marenco de la Fuente, José Luis; Raya, Enrique; Vázquez-Rodríguez, Tomás Ramón; Quevedo-Vila, Víctor; Muñoz-Fernández, Santiago; Otón, Teresa; Rahman, Anisur; López-Longo, Francisco Javier
2016-07-01
To identify patterns (clusters) of damage manifestations within a large cohort of SLE patients and evaluate the potential association of these clusters with a higher risk of mortality. This is a multicentre, descriptive, cross-sectional study of a cohort of 3656 SLE patients from the Spanish Society of Rheumatology Lupus Registry. Organ damage was ascertained using the Systemic Lupus International Collaborating Clinics Damage Index. Using cluster analysis, groups of patients with similar patterns of damage manifestations were identified. Then, overall clusters were compared as well as the subgroup of patients within every cluster with disease duration shorter than 5 years. Three damage clusters were identified. Cluster 1 (80.6% of patients) presented a lower amount of individuals with damage (23.2 vs 100% in clusters 2 and 3, P < 0.001). Cluster 2 (11.4% of patients) was characterized by musculoskeletal damage in all patients. Cluster 3 (8.0% of patients) was the only group with cardiovascular damage, and this was present in all patients. The overall mortality rate of patients in clusters 2 and 3 was higher than that in cluster 1 (P < 0.001 for both comparisons) and in patients with disease duration shorter than 5 years as well. In a large cohort of SLE patients, cardiovascular and musculoskeletal damage manifestations were the two dominant forms of damage to sort patients into clinically meaningful clusters. Both in early and late stages of the disease, there was a significant association of these clusters with an increased risk of mortality. Physicians should pay special attention to the early prevention of damage in these two systems. © The Author 2016. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Preliminary Evidence for a Virial Shock around the Coma Galaxy Cluster
NASA Astrophysics Data System (ADS)
Keshet, Uri; Kushnir, Doron; Loeb, Abraham; Waxman, Eli
2017-08-01
Galaxy clusters, the largest gravitationally bound objects in the universe, are thought to grow by accreting mass from their surroundings through large-scale virial shocks. Due to electron acceleration in such a shock, it should appear as a γ-ray, hard X-ray, and radio ring, elongated toward the large-scale filaments feeding the cluster, coincident with a cutoff in the thermal Sunyaev-Zel’dovich (SZ) signal. However, no such signature was found until now, and the very existence of cluster virial shocks has remained a theory. We find preliminary evidence for a large γ-ray ring of ˜ 5 {Mpc} minor axis around the Coma cluster, elongated toward the large-scale filament connecting Coma and Abell 1367, detected at the nominal 2.7σ confidence level (5.1σ using control signal simulations). The γ-ray ring correlates both with a synchrotron signal and with the SZ cutoff, but not with Galactic tracers. The γ-ray and radio signatures agree with analytic and numerical predictions if the shock deposits ˜ 1 % of the thermal energy in relativistic electrons over a Hubble time and ˜ 1 % in magnetic fields. The implied inverse Compton and synchrotron cumulative emission from similar shocks can contribute significantly to the diffuse extragalactic γ-ray and low-frequency radio backgrounds. Our results, if confirmed, reveal the prolate structure of the hot gas in Coma, the feeding pattern of the cluster, and properties of the surrounding large-scale voids and filaments. The anticipated detection of such shocks around other clusters would provide a powerful new cosmological probe.
Predicting item popularity: Analysing local clustering behaviour of users
NASA Astrophysics Data System (ADS)
Liebig, Jessica; Rao, Asha
2016-01-01
Predicting the popularity of items in rating networks is an interesting but challenging problem. This is especially so when an item has first appeared and has received very few ratings. In this paper, we propose a novel approach to predicting the future popularity of new items in rating networks, defining a new bipartite clustering coefficient to predict the popularity of movies and stories in the MovieLens and Digg networks respectively. We show that the clustering behaviour of the first user who rates a new item gives insight into the future popularity of that item. Our method predicts, with a success rate of over 65% for the MovieLens network and over 50% for the Digg network, the future popularity of an item. This is a major improvement on current results.
Ethical implications of excessive cluster sizes in cluster randomised trials.
Hemming, Karla; Taljaard, Monica; Forbes, Gordon; Eldridge, Sandra M; Weijer, Charles
2018-02-20
The cluster randomised trial (CRT) is commonly used in healthcare research. It is the gold-standard study design for evaluating healthcare policy interventions. A key characteristic of this design is that as more participants are included, in a fixed number of clusters, the increase in achievable power will level off. CRTs with cluster sizes that exceed the point of levelling-off will have excessive numbers of participants, even if they do not achieve nominal levels of power. Excessively large cluster sizes may have ethical implications due to exposing trial participants unnecessarily to the burdens of both participating in the trial and the potential risks of harm associated with the intervention. We explore these issues through the use of two case studies. Where data are routinely collected, available at minimum cost and the intervention poses low risk, the ethical implications of excessively large cluster sizes are likely to be low (case study 1). However, to maximise the social benefit of the study, identification of excessive cluster sizes can allow for prespecified and fully powered secondary analyses. In the second case study, while there is no burden through trial participation (because the outcome data are routinely collected and non-identifiable), the intervention might be considered to pose some indirect risk to patients and risks to the healthcare workers. In this case study it is therefore important that the inclusion of excessively large cluster sizes is justifiable on other grounds (perhaps to show sustainability). In any randomised controlled trial, including evaluations of health policy interventions, it is important to minimise the burdens and risks to participants. Funders, researchers and research ethics committees should be aware of the ethical issues of excessively large cluster sizes in cluster trials. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Constructing storyboards based on hierarchical clustering analysis
NASA Astrophysics Data System (ADS)
Hasebe, Satoshi; Sami, Mustafa M.; Muramatsu, Shogo; Kikuchi, Hisakazu
2005-07-01
There are growing needs for quick preview of video contents for the purpose of improving accessibility of video archives as well as reducing network traffics. In this paper, a storyboard that contains a user-specified number of keyframes is produced from a given video sequence. It is based on hierarchical cluster analysis of feature vectors that are derived from wavelet coefficients of video frames. Consistent use of extracted feature vectors is the key to avoid a repetition of computationally-intensive parsing of the same video sequence. Experimental results suggest that a significant reduction in computational time is gained by this strategy.
Seismic Data Analysis throught Multi-Class Classification.
NASA Astrophysics Data System (ADS)
Anderson, P.; Kappedal, R. D.; Magana-Zook, S. A.
2017-12-01
In this research, we conducted twenty experiments of varying time and frequency bands on 5000seismic signals with the intent of finding a method to classify signals as either an explosion or anearthquake in an automated fashion. We used a multi-class approach by clustering of the data throughvarious techniques. Dimensional reduction was examined through the use of wavelet transforms withthe use of the coiflet mother wavelet and various coefficients to explore possible computational time vsaccuracy dependencies. Three and four classes were generated from the clustering techniques andexamined with the three class approach producing the most accurate and realistic results.
General Dynamics of Topology and Traffic on Weighted Technological Networks
NASA Astrophysics Data System (ADS)
Wang, Wen-Xu; Wang, Bing-Hong; Hu, Bo; Yan, Gang; Ou, Qing
2005-05-01
For most technical networks, the interplay of dynamics, traffic, and topology is assumed crucial to their evolution. In this Letter, we propose a traffic-driven evolution model of weighted technological networks. By introducing a general strength-coupling mechanism under which the traffic and topology mutually interact, the model gives power-law distributions of degree, weight, and strength, as confirmed in many real networks. Particularly, depending on a parameter W that controls the total weight growth of the system, the nontrivial clustering coefficient C, degree assortativity coefficient r, and degree-strength correlation are all consistent with empirical evidence.
NASA Astrophysics Data System (ADS)
Kumar, Ashok; Thakkar, Ajit J.
2017-03-01
Dipole oscillator strength distributions for Br2 and BrCN are constructed from photoabsorption cross-sections combined with constraints provided by the Kuhn-Reiche-Thomas sum rule, the high-energy behavior of the dipole-oscillator-strength density and molar refractivity data when available. The distributions are used to predict dipole sum rules S (k) , mean excitation energies I (k) , and van der Waals C6 coefficients. Coupled-cluster calculations of the static dipole polarizabilities of Br2 and BrCN are reported for comparison with the values of S (- 2) extracted from the distributions.
NASA Astrophysics Data System (ADS)
Heikes, Brian G.; Treadaway, Victoria; McNeill, Ashley S.; Silwal, Indira K. C.; O'Sullivan, Daniel W.
2018-04-01
An ion-neutral chemical kinetic model is described and used to simulate the negative ion chemistry occurring within a mixed-reagent ion chemical ionization mass spectrometer (CIMS). The model objective was the establishment of a theoretical basis to understand ambient pressure (variable sample flow and reagent ion carrier gas flow rates), water vapor, ozone and oxides of nitrogen effects on ion cluster sensitivities for hydrogen peroxide (H2O2), methyl peroxide (CH3OOH), formic acid (HFo) and acetic acid (HAc). The model development started with established atmospheric ion chemistry mechanisms, thermodynamic data and reaction rate coefficients. The chemical mechanism was augmented with additional reactions and their reaction rate coefficients specific to the analytes. Some existing reaction rate coefficients were modified to enable the model to match laboratory and field campaign determinations of ion cluster sensitivities as functions of CIMS sample flow rate and ambient humidity. Relative trends in predicted and observed sensitivities are compared as instrument specific factors preclude a direct calculation of instrument sensitivity as a function of sample pressure and humidity. Predicted sensitivity trends and experimental sensitivity trends suggested the model captured the reagent ion and cluster chemistry and reproduced trends in ion cluster sensitivity with sample flow and humidity observed with a CIMS instrument developed for atmospheric peroxide measurements (PCIMSs). The model was further used to investigate the potential for isobaric compounds as interferences in the measurement of the above species. For ambient O3 mixing ratios more than 50 times those of H2O2, O3-(H2O) was predicted to be a significant isobaric interference to the measurement of H2O2 using O2-(H2O2) at m/z 66. O3 and NO give rise to species and cluster ions, CO3-(H2O) and NO3-(H2O), respectively, which interfere in the measurement of CH3OOH using O2-(CH3OOH) at m/z 80. The CO3-(H2O) interference assumed one of its O atoms was 18O and present in the cluster in proportion to its natural abundance. The model results indicated monitoring water vapor mixing ratio, m/z 78 for CO3-(H2O) and m/z 98 for isotopic CO3-(H2O)2 can be used to determine when CO3-(H2O) interference is significant. Similarly, monitoring water vapor mixing ratio, m/z 62 for NO3- and m/z 98 for NO3-(H2O)2 can be used to determine when NO3-(H2O) interference is significant.
Reanalysis of 24 Nearby Open Clusters using Gaia data
NASA Astrophysics Data System (ADS)
Yen, Steffi X.; Reffert, Sabine; Röser, Siegfried; Schilbach, Elena; Kharchenko, Nina V.; Piskunov, Anatoly E.
2018-04-01
We have developed a fully automated cluster characterization pipeline, which simultaneously determines cluster membership and fits the fundamental cluster parameters: distance, reddening, and age. We present results for 24 established clusters and compare them to literature values. Given the large amount of stellar data for clusters available from Gaia DR2 in 2018, this pipeline will be beneficial to analyzing the parameters of open clusters in our Galaxy.
Azad, Ariful; Ouzounis, Christos A; Kyrpides, Nikos C; Buluç, Aydin
2018-01-01
Abstract Biological networks capture structural or functional properties of relevant entities such as molecules, proteins or genes. Characteristic examples are gene expression networks or protein–protein interaction networks, which hold information about functional affinities or structural similarities. Such networks have been expanding in size due to increasing scale and abundance of biological data. While various clustering algorithms have been proposed to find highly connected regions, Markov Clustering (MCL) has been one of the most successful approaches to cluster sequence similarity or expression networks. Despite its popularity, MCL’s scalability to cluster large datasets still remains a bottleneck due to high running times and memory demands. Here, we present High-performance MCL (HipMCL), a parallel implementation of the original MCL algorithm that can run on distributed-memory computers. We show that HipMCL can efficiently utilize 2000 compute nodes and cluster a network of ∼70 million nodes with ∼68 billion edges in ∼2.4 h. By exploiting distributed-memory environments, HipMCL clusters large-scale networks several orders of magnitude faster than MCL and enables clustering of even bigger networks. HipMCL is based on MPI and OpenMP and is freely available under a modified BSD license. PMID:29315405
Azad, Ariful; Pavlopoulos, Georgios A.; Ouzounis, Christos A.; ...
2018-01-05
Biological networks capture structural or functional properties of relevant entities such as molecules, proteins or genes. Characteristic examples are gene expression networks or protein–protein interaction networks, which hold information about functional affinities or structural similarities. Such networks have been expanding in size due to increasing scale and abundance of biological data. While various clustering algorithms have been proposed to find highly connected regions, Markov Clustering (MCL) has been one of the most successful approaches to cluster sequence similarity or expression networks. Despite its popularity, MCL’s scalability to cluster large datasets still remains a bottleneck due to high running times andmore » memory demands. In this paper, we present High-performance MCL (HipMCL), a parallel implementation of the original MCL algorithm that can run on distributed-memory computers. We show that HipMCL can efficiently utilize 2000 compute nodes and cluster a network of ~70 million nodes with ~68 billion edges in ~2.4 h. By exploiting distributed-memory environments, HipMCL clusters large-scale networks several orders of magnitude faster than MCL and enables clustering of even bigger networks. Finally, HipMCL is based on MPI and OpenMP and is freely available under a modified BSD license.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Azad, Ariful; Pavlopoulos, Georgios A.; Ouzounis, Christos A.
Biological networks capture structural or functional properties of relevant entities such as molecules, proteins or genes. Characteristic examples are gene expression networks or protein–protein interaction networks, which hold information about functional affinities or structural similarities. Such networks have been expanding in size due to increasing scale and abundance of biological data. While various clustering algorithms have been proposed to find highly connected regions, Markov Clustering (MCL) has been one of the most successful approaches to cluster sequence similarity or expression networks. Despite its popularity, MCL’s scalability to cluster large datasets still remains a bottleneck due to high running times andmore » memory demands. In this paper, we present High-performance MCL (HipMCL), a parallel implementation of the original MCL algorithm that can run on distributed-memory computers. We show that HipMCL can efficiently utilize 2000 compute nodes and cluster a network of ~70 million nodes with ~68 billion edges in ~2.4 h. By exploiting distributed-memory environments, HipMCL clusters large-scale networks several orders of magnitude faster than MCL and enables clustering of even bigger networks. Finally, HipMCL is based on MPI and OpenMP and is freely available under a modified BSD license.« less
Graph analysis of cell clusters forming vascular networks
NASA Astrophysics Data System (ADS)
Alves, A. P.; Mesquita, O. N.; Gómez-Gardeñes, J.; Agero, U.
2018-03-01
This manuscript describes the experimental observation of vasculogenesis in chick embryos by means of network analysis. The formation of the vascular network was observed in the area opaca of embryos from 40 to 55 h of development. In the area opaca endothelial cell clusters self-organize as a primitive and approximately regular network of capillaries. The process was observed by bright-field microscopy in control embryos and in embryos treated with Bevacizumab (Avastin), an antibody that inhibits the signalling of the vascular endothelial growth factor (VEGF). The sequence of images of the vascular growth were thresholded, and used to quantify the forming network in control and Avastin-treated embryos. This characterization is made by measuring vessels density, number of cell clusters and the largest cluster density. From the original images, the topology of the vascular network was extracted and characterized by means of the usual network metrics such as: the degree distribution, average clustering coefficient, average short path length and assortativity, among others. This analysis allows to monitor how the largest connected cluster of the vascular network evolves in time and provides with quantitative evidence of the disruptive effects that Avastin has on the tree structure of vascular networks.
Dark matter phenomenology of high-speed galaxy cluster collisions
Mishchenko, Yuriy; Ji, Chueng-Ryong
2017-07-29
Here, we perform a general computational analysis of possible post-collision mass distributions in high-speed galaxy cluster collisions in the presence of self-interacting dark matter. Using this analysis, we show that astrophysically weakly self-interacting dark matter can impart subtle yet measurable features in the mass distributions of colliding galaxy clusters even without significant disruptions to the dark matter halos of the colliding galaxy clusters themselves. Most profound such evidence is found to reside in the tails of dark matter halos’ distributions, in the space between the colliding galaxy clusters. Such features appear in our simulations as shells of scattered dark mattermore » expanding in alignment with the outgoing original galaxy clusters, contributing significant densities to projected mass distributions at large distances from collision centers and large scattering angles of up to 90°. Our simulations indicate that as much as 20% of the total collision’s mass may be deposited into such structures without noticeable disruptions to the main galaxy clusters. Such structures at large scattering angles are forbidden in purely gravitational high-speed galaxy cluster collisions.Convincing identification of such structures in real colliding galaxy clusters would be a clear indication of the self-interacting nature of dark matter. Our findings may offer an explanation for the ring-like dark matter feature recently identified in the long-range reconstructions of the mass distribution of the colliding galaxy cluster CL0024+017.« less
Dark matter phenomenology of high-speed galaxy cluster collisions
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mishchenko, Yuriy; Ji, Chueng-Ryong
Here, we perform a general computational analysis of possible post-collision mass distributions in high-speed galaxy cluster collisions in the presence of self-interacting dark matter. Using this analysis, we show that astrophysically weakly self-interacting dark matter can impart subtle yet measurable features in the mass distributions of colliding galaxy clusters even without significant disruptions to the dark matter halos of the colliding galaxy clusters themselves. Most profound such evidence is found to reside in the tails of dark matter halos’ distributions, in the space between the colliding galaxy clusters. Such features appear in our simulations as shells of scattered dark mattermore » expanding in alignment with the outgoing original galaxy clusters, contributing significant densities to projected mass distributions at large distances from collision centers and large scattering angles of up to 90°. Our simulations indicate that as much as 20% of the total collision’s mass may be deposited into such structures without noticeable disruptions to the main galaxy clusters. Such structures at large scattering angles are forbidden in purely gravitational high-speed galaxy cluster collisions.Convincing identification of such structures in real colliding galaxy clusters would be a clear indication of the self-interacting nature of dark matter. Our findings may offer an explanation for the ring-like dark matter feature recently identified in the long-range reconstructions of the mass distribution of the colliding galaxy cluster CL0024+017.« less
Ebinger, Michael R.; Haroldson, Mark A.; van Manen, Frank T.; Costello, Cecily M.; Bjornlie, Daniel D.; Thompson, Daniel J.; Gunther, Kerry A.; Fortin, Jennifer K.; Teisberg, Justin E.; Pils, Shannon R; White, P J; Cain, Steven L.; Cross, Paul C.
2016-01-01
Global positioning system (GPS) wildlife collars have revolutionized wildlife research. Studies of predation by free-ranging carnivores have particularly benefited from the application of location clustering algorithms to determine when and where predation events occur. These studies have changed our understanding of large carnivore behavior, but the gains have concentrated on obligate carnivores. Facultative carnivores, such as grizzly/brown bears (Ursus arctos), exhibit a variety of behaviors that can lead to the formation of GPS clusters. We combined clustering techniques with field site investigations of grizzly bear GPS locations (n = 732 site investigations; 2004–2011) to produce 174 GPS clusters where documented behavior was partitioned into five classes (large-biomass carcass, small-biomass carcass, old carcass, non-carcass activity, and resting). We used multinomial logistic regression to predict the probability of clusters belonging to each class. Two cross-validation methods—leaving out individual clusters, or leaving out individual bears—showed that correct prediction of bear visitation to large-biomass carcasses was 78–88%, whereas the false-positive rate was 18–24%. As a case study, we applied our predictive model to a GPS data set of 266 bear-years in the Greater Yellowstone Ecosystem (2002–2011) and examined trends in carcass visitation during fall hyperphagia (September–October). We identified 1997 spatial GPS clusters, of which 347 were predicted to be large-biomass carcasses. We used the clustered data to develop a carcass visitation index, which varied annually, but more than doubled during the study period. Our study demonstrates the effectiveness and utility of identifying GPS clusters associated with carcass visitation by a facultative carnivore.
Ebinger, Michael R; Haroldson, Mark A; van Manen, Frank T; Costello, Cecily M; Bjornlie, Daniel D; Thompson, Daniel J; Gunther, Kerry A; Fortin, Jennifer K; Teisberg, Justin E; Pils, Shannon R; White, P J; Cain, Steven L; Cross, Paul C
2016-07-01
Global positioning system (GPS) wildlife collars have revolutionized wildlife research. Studies of predation by free-ranging carnivores have particularly benefited from the application of location clustering algorithms to determine when and where predation events occur. These studies have changed our understanding of large carnivore behavior, but the gains have concentrated on obligate carnivores. Facultative carnivores, such as grizzly/brown bears (Ursus arctos), exhibit a variety of behaviors that can lead to the formation of GPS clusters. We combined clustering techniques with field site investigations of grizzly bear GPS locations (n = 732 site investigations; 2004-2011) to produce 174 GPS clusters where documented behavior was partitioned into five classes (large-biomass carcass, small-biomass carcass, old carcass, non-carcass activity, and resting). We used multinomial logistic regression to predict the probability of clusters belonging to each class. Two cross-validation methods-leaving out individual clusters, or leaving out individual bears-showed that correct prediction of bear visitation to large-biomass carcasses was 78-88 %, whereas the false-positive rate was 18-24 %. As a case study, we applied our predictive model to a GPS data set of 266 bear-years in the Greater Yellowstone Ecosystem (2002-2011) and examined trends in carcass visitation during fall hyperphagia (September-October). We identified 1997 spatial GPS clusters, of which 347 were predicted to be large-biomass carcasses. We used the clustered data to develop a carcass visitation index, which varied annually, but more than doubled during the study period. Our study demonstrates the effectiveness and utility of identifying GPS clusters associated with carcass visitation by a facultative carnivore.
NASA Technical Reports Server (NTRS)
Barnes, J.; Dekel, A.; Efstathiou, G.; Frenk, C. S.
1985-01-01
The cluster correlation function xi sub c(r) is compared with the particle correlation function, xi(r) in cosmological N-body simulations with a wide range of initial conditions. The experiments include scale-free initial conditions, pancake models with a coherence length in the initial density field, and hybrid models. Three N-body techniques and two cluster-finding algorithms are used. In scale-free models with white noise initial conditions, xi sub c and xi are essentially identical. In scale-free models with more power on large scales, it is found that the amplitude of xi sub c increases with cluster richness; in this case the clusters give a biased estimate of the particle correlations. In the pancake and hybrid models (with n = 0 or 1), xi sub c is steeper than xi, but the cluster correlation length exceeds that of the points by less than a factor of 2, independent of cluster richness. Thus the high amplitude of xi sub c found in studies of rich clusters of galaxies is inconsistent with white noise and pancake models and may indicate a primordial fluctuation spectrum with substantial power on large scales.
NASA Astrophysics Data System (ADS)
Ginanjar, Irlandia; Pasaribu, Udjianna S.; Indratno, Sapto W.
2017-03-01
This article presents the application of the principal component analysis (PCA) biplot for the needs of data mining. This article aims to simplify and objectify the methods for objects clustering in PCA biplot. The novelty of this paper is to get a measure that can be used to objectify the objects clustering in PCA biplot. Orthonormal eigenvectors, which are the coefficients of a principal component model representing an association between principal components and initial variables. The existence of the association is a valid ground to objects clustering based on principal axes value, thus if m principal axes used in the PCA, then the objects can be classified into 2m clusters. The inter-city buses are clustered based on maintenance costs data by using two principal axes PCA biplot. The buses are clustered into four groups. The first group is the buses with high maintenance costs, especially for lube, and brake canvass. The second group is the buses with high maintenance costs, especially for tire, and filter. The third group is the buses with low maintenance costs, especially for lube, and brake canvass. The fourth group is buses with low maintenance costs, especially for tire, and filter.
Detection of protein complex from protein-protein interaction network using Markov clustering
NASA Astrophysics Data System (ADS)
Ochieng, P. J.; Kusuma, W. A.; Haryanto, T.
2017-05-01
Detection of complexes, or groups of functionally related proteins, is an important challenge while analysing biological networks. However, existing algorithms to identify protein complexes are insufficient when applied to dense networks of experimentally derived interaction data. Therefore, we introduced a graph clustering method based on Markov clustering algorithm to identify protein complex within highly interconnected protein-protein interaction networks. Protein-protein interaction network was first constructed to develop geometrical network, the network was then partitioned using Markov clustering to detect protein complexes. The interest of the proposed method was illustrated by its application to Human Proteins associated to type II diabetes mellitus. Flow simulation of MCL algorithm was initially performed and topological properties of the resultant network were analysed for detection of the protein complex. The results indicated the proposed method successfully detect an overall of 34 complexes with 11 complexes consisting of overlapping modules and 20 non-overlapping modules. The major complex consisted of 102 proteins and 521 interactions with cluster modularity and density of 0.745 and 0.101 respectively. The comparison analysis revealed MCL out perform AP, MCODE and SCPS algorithms with high clustering coefficient (0.751) network density and modularity index (0.630). This demonstrated MCL was the most reliable and efficient graph clustering algorithm for detection of protein complexes from PPI networks.
Enhanced Scattering of Diffuse Ions on Front of the Earth's Quasi-Parallel Bow Shock: a Case Study
NASA Astrophysics Data System (ADS)
Kis, A.; Matsukiyo, S.; Otsuka, F.; Hada, T.; Lemperger, I.; Dandouras, I. S.; Barta, V.; Facsko, G. I.
2017-12-01
In the analysis we present a case study of three energetic upstream ion events at the Earth's quasi-parallel bow shock based on multi-spacecraft data recorded by Cluster. The CIS-HIA instrument onboard Cluster provides partial energetic ion densities in 4 energy channels between 10 and 32 keV.The difference of the partial ion densities recorded by the individual spacecraft at various distances from the bow shock surface makes possible the determination of the spatial gradient of energetic ions.Using the gradient values we determined the spatial profile of the energetic ion partial densities as a function of distance from the bow shock and we calculated the e-folding distance and the diffusion coefficient for each event and each ion energy range. Results show that in two cases the scattering of diffuse ions takes place in a normal way, as "by the book", and the e-folding distance and diffusion coefficient values are comparable with previous results. On the other hand, in the third case the e-folding distance and the diffusion coefficient values are significantly lower, which suggests that in this case the scattering process -and therefore the diffusive shock acceleration (DSA) mechanism also- is much more efficient. Our analysis provides an explanation for this "enhanced" scattering process recorded in the third case.
Galaxy clusters in local Universe simulations without density constraints: a long uphill struggle
NASA Astrophysics Data System (ADS)
Sorce, Jenny G.
2018-06-01
Galaxy clusters are excellent cosmological probes provided that their formation and evolution within the large scale environment are precisely understood. Therefore studies with simulated galaxy clusters have flourished. However detailed comparisons between simulated and observed clusters and their population - the galaxies - are complicated by the diversity of clusters and their surrounding environment. An original way initiated by Bertschinger as early as 1987, to legitimize the one-to-one comparison exercise down to the details, is to produce simulations constrained to resemble the cluster under study within its large scale environment. Subsequently several methods have emerged to produce simulations that look like the local Universe. This paper highlights one of these methods and its essential steps to get simulations that not only resemble the local Large Scale Structure but also that host the local clusters. It includes a new modeling of the radial peculiar velocity uncertainties to remove the observed correlation between the decreases of the simulated cluster masses and of the amount of data used as constraints with the distance from us. This method has the particularity to use solely radial peculiar velocities as constraints: no additional density constraints are required to get local cluster simulacra. The new resulting simulations host dark matter halos that match the most prominent local clusters such as Coma. Zoom-in simulations of the latter and of a volume larger than the 30h-1 Mpc radius inner sphere become now possible to study local clusters and their effects. Mapping the local Sunyaev-Zel'dovich and Sachs-Wolfe effects can follow.
NASA Astrophysics Data System (ADS)
Elangasinghe, M. A.; Singhal, N.; Dirks, K. N.; Salmond, J. A.; Samarasinghe, S.
2014-09-01
This paper uses artificial neural networks (ANN), combined with k-means clustering, to understand the complex time series of PM10 and PM2.5 concentrations at a coastal location of New Zealand based on data from a single site. Out of available meteorological parameters from the network (wind speed, wind direction, solar radiation, temperature, relative humidity), key factors governing the pattern of the time series concentrations were identified through input sensitivity analysis performed on the trained neural network model. The transport pathways of particulate matter under these key meteorological parameters were further analysed through bivariate concentration polar plots and k-means clustering techniques. The analysis shows that the external sources such as marine aerosols and local sources such as traffic and biomass burning contribute equally to the particulate matter concentrations at the study site. These results are in agreement with the results of receptor modelling by the Auckland Council based on Positive Matrix Factorization (PMF). Our findings also show that contrasting concentration-wind speed relationships exist between marine aerosols and local traffic sources resulting in very noisy and seemingly large random PM10 concentrations. The inclusion of cluster rankings as an input parameter to the ANN model showed a statistically significant (p < 0.005) improvement in the performance of the ANN time series model and also showed better performance in picking up high concentrations. For the presented case study, the correlation coefficient between observed and predicted concentrations improved from 0.77 to 0.79 for PM2.5 and from 0.63 to 0.69 for PM10 and reduced the root mean squared error (RMSE) from 5.00 to 4.74 for PM2.5 and from 6.77 to 6.34 for PM10. The techniques presented here enable the user to obtain an understanding of potential sources and their transport characteristics prior to the implementation of costly chemical analysis techniques or advanced air dispersion models.
International scientific collaboration in HIV and HPV: a network analysis.
Vanni, Tazio; Mesa-Frias, Marco; Sanchez-Garcia, Ruben; Roesler, Rafael; Schwartsmann, Gilberto; Goldani, Marcelo Z; Foss, Anna M
2014-01-01
Research endeavours require the collaborative effort of an increasing number of individuals. International scientific collaborations are particularly important for HIV and HPV co-infection studies, since the burden of disease is rising in developing countries, but most experts and research funds are found in developed countries, where the prevalence of HIV is low. The objective of our study was to investigate patterns of international scientific collaboration in HIV and HPV research using social network analysis. Through a systematic review of the literature, we obtained epidemiological data, as well as data on countries and authors involved in co-infection studies. The collaboration network was analysed in respect to the following: centrality, density, modularity, connected components, distance, clustering and spectral clustering. We observed that for many low- and middle-income countries there were no epidemiological estimates of HPV infection of the cervix among HIV-infected individuals. Most studies found only involved researchers from the same country (64%). Studies derived from international collaborations including high-income countries and either low- or middle-income countries had on average three times larger sample sizes than those including only high-income countries or low-income countries. The high global clustering coefficient (0.9) coupled with a short average distance between researchers (4.34) suggests a "small-world phenomenon." Researchers from high-income countries seem to have higher degree centrality and tend to cluster together in densely connected communities. We found a large well-connected community, which encompasses 70% of researchers, and 49 other small isolated communities. Our findings suggest that in the field of HIV and HPV, there seems to be both room and incentives for researchers to engage in collaborations between countries of different income-level. Through international collaboration resources available to researchers in high-income countries can be efficiently used to enroll more participants in low- and middle-income countries.
The use of hierarchical clustering for the design of optimized monitoring networks
NASA Astrophysics Data System (ADS)
Soares, Joana; Makar, Paul Andrew; Aklilu, Yayne; Akingunola, Ayodeji
2018-05-01
Associativity analysis is a powerful tool to deal with large-scale datasets by clustering the data on the basis of (dis)similarity and can be used to assess the efficacy and design of air quality monitoring networks. We describe here our use of Kolmogorov-Zurbenko filtering and hierarchical clustering of NO2 and SO2 passive and continuous monitoring data to analyse and optimize air quality networks for these species in the province of Alberta, Canada. The methodology applied in this study assesses dissimilarity between monitoring station time series based on two metrics: 1 - R, R being the Pearson correlation coefficient, and the Euclidean distance; we find that both should be used in evaluating monitoring site similarity. We have combined the analytic power of hierarchical clustering with the spatial information provided by deterministic air quality model results, using the gridded time series of model output as potential station locations, as a proxy for assessing monitoring network design and for network optimization. We demonstrate that clustering results depend on the air contaminant analysed, reflecting the difference in the respective emission sources of SO2 and NO2 in the region under study. Our work shows that much of the signal identifying the sources of NO2 and SO2 emissions resides in shorter timescales (hourly to daily) due to short-term variation of concentrations and that longer-term averages in data collection may lose the information needed to identify local sources. However, the methodology identifies stations mainly influenced by seasonality, if larger timescales (weekly to monthly) are considered. We have performed the first dissimilarity analysis based on gridded air quality model output and have shown that the methodology is capable of generating maps of subregions within which a single station will represent the entire subregion, to a given level of dissimilarity. We have also shown that our approach is capable of identifying different sampling methodologies as well as outliers (stations' time series which are markedly different from all others in a given dataset).
Observations of a nearby filament of galaxy clusters with the Sardinia Radio Telescope
NASA Astrophysics Data System (ADS)
Vacca, Valentina; Murgia, M.; Loi, F. Govoni F.; Vazza, F.; Finoguenov, A.; Carretti, E.; Feretti, L.; Giovannini, G.; Concu, R.; Melis, A.; Gheller, C.; Paladino, R.; Poppi, S.; Valente, G.; Bernardi, G.; Boschin, W.; Brienza, M.; Clarke, T. E.; Colafrancesco, S.; Enßlin, T.; Ferrari, C.; de Gasperin, F.; Gastaldello, F.; Girardi, M.; Gregorini, L.; Johnston-Hollitt, M.; Junklewitz, H.; Orrù, E.; Parma, P.; Perley, R.; Taylor, G. B.
2018-05-01
We report the detection of diffuse radio emission which might be connected to a large-scale filament of the cosmic web covering a 8° × 8° area in the sky, likely associated with a z≈0.1 over-density traced by nine massive galaxy clusters. In this work, we present radio observations of this region taken with the Sardinia Radio Telescope. Two of the clusters in the field host a powerful radio halo sustained by violent ongoing mergers and provide direct proof of intra-cluster magnetic fields. In order to investigate the presence of large-scale diffuse radio synchrotron emission in and beyond the galaxy clusters in this complex system, we combined the data taken at 1.4 GHz with the Sardinia Radio Telescope with higher resolution data taken with the NRAO VLA Sky Survey. We found 28 candidate new sources with a size larger and X-ray emission fainter than known diffuse large-scale synchrotron cluster sources for a given radio power. This new population is potentially the tip of the iceberg of a class of diffuse large-scale synchrotron sources associated with the filaments of the cosmic web. In addition, we found in the field a candidate new giant radio galaxy.
Chandra Observations of MS0440.5+0204 & MS0839.9+2938: Cooling Flow Clusters in Formation?
NASA Astrophysics Data System (ADS)
McNamara, Brian
2000-09-01
We propose to observe two redshift z~0.2 clusters, MS0839.9+2938 and MS0440+0204, discovered as bright X-ray sources in the Einstein Medium Sensitivity Survey. The cluster cores are structured in the X-ray and optical bands, and they harbor large cooling flows. Their central cluster galaxies contain luminous nebular emission systems, active star formation, and strong radio sources. Using the Chandra data, we will determine whether the large discrepancies between the X-ray cooling rates and optical star formation rates can be reconciled, and we will test the hypothesis that cooling flows form as cool, dense groups accrete into massive clusters.
Merging Clusters, Cluster Outskirts, and Large Scale Filaments
NASA Astrophysics Data System (ADS)
Randall, Scott; Alvarez, Gabriella; Bulbul, Esra; Jones, Christine; Forman, William; Su, Yuanyuan; Miller, Eric D.; Bourdin, Herve; Scott Randall
2018-01-01
Recent X-ray observations of the outskirts of clusters show that entropy profiles of the intracluster medium (ICM) generally flatten and lie below what is expected from purely gravitational structure formation near the cluster's virial radius. Possible explanations include electron/ion non-equilibrium, accretion shocks that weaken during cluster formation, and the presence of unresolved cool gas clumps. Some of these mechanisms are expected to correlate with large scale structure (LSS), such that the entropy is lower in regions where the ICM interfaces with LSS filaments and, presumably, the warm-hot intergalactic medium (WHIM). Major, binary cluster mergers are expected to take place at the intersection of LSS filaments, with the merger axis initially oriented along a filament. We present results from deep X-ray observations of the virialization regions of binary, early-stage merging clusters, including a possible detection of the dense end of the WHIM along a LSS filament.
Zhang, Wei; Zhang, Xiaolong; Qiang, Yan; Tian, Qi; Tang, Xiaoxian
2017-01-01
The fast and accurate segmentation of lung nodule image sequences is the basis of subsequent processing and diagnostic analyses. However, previous research investigating nodule segmentation algorithms cannot entirely segment cavitary nodules, and the segmentation of juxta-vascular nodules is inaccurate and inefficient. To solve these problems, we propose a new method for the segmentation of lung nodule image sequences based on superpixels and density-based spatial clustering of applications with noise (DBSCAN). First, our method uses three-dimensional computed tomography image features of the average intensity projection combined with multi-scale dot enhancement for preprocessing. Hexagonal clustering and morphological optimized sequential linear iterative clustering (HMSLIC) for sequence image oversegmentation is then proposed to obtain superpixel blocks. The adaptive weight coefficient is then constructed to calculate the distance required between superpixels to achieve precise lung nodules positioning and to obtain the subsequent clustering starting block. Moreover, by fitting the distance and detecting the change in slope, an accurate clustering threshold is obtained. Thereafter, a fast DBSCAN superpixel sequence clustering algorithm, which is optimized by the strategy of only clustering the lung nodules and adaptive threshold, is then used to obtain lung nodule mask sequences. Finally, the lung nodule image sequences are obtained. The experimental results show that our method rapidly, completely and accurately segments various types of lung nodule image sequences. PMID:28880916
Liu, Hui; Song, Yongduan; Xue, Fangzheng; Li, Xiumin
2015-11-01
In this paper, the generation of multi-clustered structure of self-organized neural network with different neuronal firing patterns, i.e., bursting or spiking, has been investigated. The initially all-to-all-connected spiking neural network or bursting neural network can be self-organized into clustered structure through the symmetric spike-timing-dependent plasticity learning for both bursting and spiking neurons. However, the time consumption of this clustering procedure of the burst-based self-organized neural network (BSON) is much shorter than the spike-based self-organized neural network (SSON). Our results show that the BSON network has more obvious small-world properties, i.e., higher clustering coefficient and smaller shortest path length than the SSON network. Also, the results of larger structure entropy and activity entropy of the BSON network demonstrate that this network has higher topological complexity and dynamical diversity, which benefits for enhancing information transmission of neural circuits. Hence, we conclude that the burst firing can significantly enhance the efficiency of clustering procedure and the emergent clustered structure renders the whole network more synchronous and therefore more sensitive to weak input. This result is further confirmed from its improved performance on stochastic resonance. Therefore, we believe that the multi-clustered neural network which self-organized from the bursting dynamics has high efficiency in information processing.
Clustering and flow around a sphere moving into a grain cloud.
Seguin, A; Lefebvre-Lepot, A; Faure, S; Gondret, P
2016-06-01
A bidimensional simulation of a sphere moving at constant velocity into a cloud of smaller spherical grains far from any boundaries and without gravity is presented with a non-smooth contact dynamics method. A dense granular "cluster" zone builds progressively around the moving sphere until a stationary regime appears with a constant upstream cluster size. The key point is that the upstream cluster size increases with the initial solid fraction [Formula: see text] but the cluster packing fraction takes an about constant value independent of [Formula: see text]. Although the upstream cluster size around the moving sphere diverges when [Formula: see text] approaches a critical value, the drag force exerted by the grains on the sphere does not. The detailed analysis of the local strain rate and local stress fields made in the non-parallel granular flow inside the cluster allows us to extract the local invariants of the two tensors: dilation rate, shear rate, pressure and shear stress. Despite different spatial variations of these invariants, the local friction coefficient μ appears to depend only on the local inertial number I as well as the local solid fraction, which means that a local rheology does exist in the present non-parallel flow. The key point is that the spatial variations of I inside the cluster do not depend on the sphere velocity and explore only a small range around the value one.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Hui; Song, Yongduan; Xue, Fangzheng
In this paper, the generation of multi-clustered structure of self-organized neural network with different neuronal firing patterns, i.e., bursting or spiking, has been investigated. The initially all-to-all-connected spiking neural network or bursting neural network can be self-organized into clustered structure through the symmetric spike-timing-dependent plasticity learning for both bursting and spiking neurons. However, the time consumption of this clustering procedure of the burst-based self-organized neural network (BSON) is much shorter than the spike-based self-organized neural network (SSON). Our results show that the BSON network has more obvious small-world properties, i.e., higher clustering coefficient and smaller shortest path length than themore » SSON network. Also, the results of larger structure entropy and activity entropy of the BSON network demonstrate that this network has higher topological complexity and dynamical diversity, which benefits for enhancing information transmission of neural circuits. Hence, we conclude that the burst firing can significantly enhance the efficiency of clustering procedure and the emergent clustered structure renders the whole network more synchronous and therefore more sensitive to weak input. This result is further confirmed from its improved performance on stochastic resonance. Therefore, we believe that the multi-clustered neural network which self-organized from the bursting dynamics has high efficiency in information processing.« less
Gray, Heewon Lee; Burgermaster, Marissa; Tipton, Elizabeth; Contento, Isobel R; Koch, Pamela A; Di Noia, Jennifer
2016-04-01
Sample size and statistical power calculation should consider clustering effects when schools are the unit of randomization in intervention studies. The objective of the current study was to investigate how student outcomes are clustered within schools in an obesity prevention trial. Baseline data from the Food, Health & Choices project were used. Participants were 9- to 13-year-old students enrolled in 20 New York City public schools (n= 1,387). Body mass index (BMI) was calculated based on measures of height and weight, and body fat percentage was measured with a Tanita® body composition analyzer (Model SC-331s). Energy balance-related behaviors were self-reported with a frequency questionnaire. To examine the cluster effects, intraclass correlation coefficients (ICCs) were calculated as school variance over total variance for outcome variables. School-level covariates, percentage students eligible for free and reduced-price lunch, percentage Black or Hispanic, and English language learners were added in the model to examine ICC changes. The ICCs for obesity indicators are: .026 for BMI-percentile, .031 for BMIz-score, .035 for percentage of overweight students, .037 for body fat percentage, and .041 for absolute BMI. The ICC range for the six energy balance-related behaviors are .008 to .044 for fruit and vegetables, .013 to .055 for physical activity, .031 to .052 for recreational screen time, .013 to .091 for sweetened beverages, .033 to .121 for processed packaged snacks, and .020 to .083 for fast food. When school-level covariates were included in the model, ICC changes varied from -95% to 85%. This is the first study reporting ICCs for obesity-related anthropometric and behavioral outcomes among New York City public schools. The results of the study may aid sample size estimation for future school-based cluster randomized controlled trials in similar urban setting and population. Additionally, identifying school-level covariates that can reduce cluster effects is important when analyzing data. © 2015 Society for Public Health Education.
Tracing Large Scale Structure with a Redshift Survey of Rich Clusters of Galaxies
NASA Astrophysics Data System (ADS)
Batuski, D.; Slinglend, K.; Haase, S.; Hill, J. M.
1993-12-01
Rich clusters of galaxies from Abell's catalog show evidence of structure on scales of 100 Mpc and hold promise of confirming the existence of structure in the more immediate universe on scales corresponding to COBE results (i.e., on the order of 10% or more of the horizon size of the universe). However, most Abell clusters do not as yet have measured redshifts (or, in the case of most low redshift clusters, have only one or two galaxies measured), so present knowledge of their three dimensional distribution has quite large uncertainties. The shortage of measured redshifts for these clusters may also mask a problem of projection effects corrupting the membership counts for the clusters, perhaps even to the point of spurious identifications of some of the clusters themselves. Our approach in this effort has been to use the MX multifiber spectrometer to measure redshifts of at least ten galaxies in each of about 80 Abell cluster fields with richness class R>= 1 and mag10 <= 16.8. This work will result in a somewhat deeper, much more complete (and reliable) sample of positions of rich clusters. Our primary use for the sample is for two-point correlation and other studies of the large scale structure traced by these clusters. We are also obtaining enough redshifts per cluster so that a much better sample of reliable cluster velocity dispersions will be available for other studies of cluster properties. To date, we have collected such data for 40 clusters, and for most of them, we have seven or more cluster members with redshifts, allowing for reliable velocity dispersion calculations. Velocity histograms for several interesting cluster fields are presented, along with summary tables of cluster redshift results. Also, with 10 or more redshifts in most of our cluster fields (30({') } square, just about an `Abell diameter' at z ~ 0.1) we have investigated the extent of projection effects within the Abell catalog in an effort to quantify and understand how this may effect the Abell sample.
Detonation of Meta-stable Clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuhl, Allen; Kuhl, Allen L.; Fried, Laurence E.
2008-05-31
We consider the energy accumulation in meta-stable clusters. This energy can be much larger than the typical chemical bond energy (~;;1 ev/atom). For example, polymeric nitrogen can accumulate 4 ev/atom in the N8 (fcc) structure, while helium can accumulate 9 ev/atom in the excited triplet state He2* . They release their energy by cluster fission: N8 -> 4N2 and He2* -> 2He. We study the locus of states in thermodynamic state space for the detonation of such meta-stable clusters. In particular, the equilibrium isentrope, starting at the Chapman-Jouguet state, and expanding down to 1 atmosphere was calculated with the Cheetahmore » code. Large detonation pressures (3 and 16 Mbar), temperatures (12 and 34 kilo-K) and velocities (20 and 43 km/s) are a consequence of the large heats of detonation (6.6 and 50 kilo-cal/g) for nitrogen and helium clusters respectively. If such meta-stable clusters could be synthesized, they offer the potential for large increases in the energy density of materials.« less
Inorganic material profiling using Arn+ cluster: Can we achieve high quality profiles?
NASA Astrophysics Data System (ADS)
Conard, T.; Fleischmann, C.; Havelund, R.; Franquet, A.; Poleunis, C.; Delcorte, A.; Vandervorst, W.
2018-06-01
Retrieving molecular information by sputtering of organic systems has been concretized in the last years due to the introduction of sputtering by large gas clusters which drastically eliminated the compound degradation during the analysis and has led to strong improvements in depth resolution. Rapidly however, a limitation was observed for heterogeneous systems where inorganic layers or structures needed to be profiled concurrently. As opposed to organic material, erosion of the inorganic layer appears very difficult and prone to many artefacts. To shed some light on these problems we investigated a simple system consisting of aluminum delta layer(s) buried in a silicon matrix in order to define the most favorable beam conditions for practical analysis. We show that counterintuitive to the small energy/atom used and unlike monoatomic ion sputtering, the information depth obtained with large cluster ions is typically very large (∼10 nm) and that this can be caused both by a large roughness development at early stages of the sputtering process and by a large mixing zone. As a consequence, a large deformation of the Al intensity profile is observed. Using sample rotation during profiling significantly improves the depth resolution while sample temperature has no significant effect. The determining parameter for high depth resolution still remains the total energy of the cluster instead of the energy per atom in the cluster.
NASA Astrophysics Data System (ADS)
Sebastianelli, Francesco; Xu, Minzhong; Bačić, Zlatko
2008-12-01
We report diffusion Monte Carlo (DMC) calculations of the quantum translation-rotation (T-R) dynamics of one to five para-H2 (p-H2) and ortho-D2 (o-D2) molecules inside the large hexakaidecahedral (51264) cage of the structure II clathrate hydrate, which was taken to be rigid. These calculations provide a quantitative description of the size evolution of the ground-state properties, energetics, and the vibrationally averaged geometries, of small (p-H2)n and (o-D2)n clusters, n=1-5, in nanoconfinement. The zero-point energy (ZPE) of the T-R motions rises steeply with the cluster size, reaching 74% of the potential well depth for the caged (p-H2)4. At low temperatures, the rapid increase of the cluster ZPE as a function of n is the main factor that limits the occupancy of the large cage to at most four H2 or D2 molecules, in agreement with experiments. Our DMC results concerning the vibrationally averaged spatial distribution of four D2 molecules, their mean distance from the cage center, the D2-D2 separation, and the specific orientation and localization of the tetrahedral (D2)4 cluster relative to the framework of the large cage, agree very well with the low-temperature neutron diffraction experiments involving the large cage with the quadruple D2 occupancy.
Sebastianelli, Francesco; Xu, Minzhong; Bacić, Zlatko
2008-12-28
We report diffusion Monte Carlo (DMC) calculations of the quantum translation-rotation (T-R) dynamics of one to five para-H(2) (p-H(2)) and ortho-D(2) (o-D(2)) molecules inside the large hexakaidecahedral (5(12)6(4)) cage of the structure II clathrate hydrate, which was taken to be rigid. These calculations provide a quantitative description of the size evolution of the ground-state properties, energetics, and the vibrationally averaged geometries, of small (p-H(2))(n) and (o-D(2))(n) clusters, n=1-5, in nanoconfinement. The zero-point energy (ZPE) of the T-R motions rises steeply with the cluster size, reaching 74% of the potential well depth for the caged (p-H(2))(4). At low temperatures, the rapid increase of the cluster ZPE as a function of n is the main factor that limits the occupancy of the large cage to at most four H(2) or D(2) molecules, in agreement with experiments. Our DMC results concerning the vibrationally averaged spatial distribution of four D(2) molecules, their mean distance from the cage center, the D(2)-D(2) separation, and the specific orientation and localization of the tetrahedral (D(2))(4) cluster relative to the framework of the large cage, agree very well with the low-temperature neutron diffraction experiments involving the large cage with the quadruple D(2) occupancy.
Gas-liquid nucleation at large metastability: unusual features and a new formalism
NASA Astrophysics Data System (ADS)
Santra, Mantu; Singh, Rakesh S.; Bagchi, Biman
2011-03-01
Nucleation at large metastability is still largely an unsolved problem, even though it is a problem of tremendous current interest, with wide-ranging practical value, from atmospheric research to materials science. It is now well accepted that the classical nucleation theory (CNT) fails to provide a qualitative picture and gives incorrect quantitative values for such quantities as activation-free energy barrier and supersaturation dependence of nucleation rate, especially at large metastability. In this paper, we present an alternative formalism to treat nucleation at large supersaturation by introducing an extended set of order parameters in terms of the kth largest liquid-like clusters, where k = 1 is the largest cluster in the system, k = 2 is the second largest cluster and so on. At low supersaturation, the size of the largest liquid-like cluster acts as a suitable order parameter. At large supersaturation, the free energy barrier for the largest liquid-like cluster disappears. We identify this supersaturation as the one at the onset of kinetic spinodal. The kinetic spinodal is system-size-dependent. Beyond kinetic spinodal many clusters grow simultaneously and competitively and hence the nucleation and growth become collective. In order to describe collective growth, we need to consider the full set of order parameters. We derive an analytic expression for the free energy of formation of the kth largest cluster. The expression predicts that, at large metastability (beyond kinetic spinodal), the barrier of growth for several largest liquid-like clusters disappears, and all these clusters grow simultaneously. The approach to the critical size occurs by barrierless diffusion in the cluster size space. The expression for the rate of barrier crossing predicts weaker supersaturation dependence than what is predicted by CNT at large metastability. Such a crossover behavior has indeed been observed in recent experiments (but eluded an explanation till now). In order to understand the large numerical discrepancy between simulation predictions and experimental results, we carried out a study of the dependence on the range of intermolecular interactions of both the surface tension of an equilibrium planar gas-liquid interface and the free energy barrier of nucleation. Both are found to depend significantly on the range of interaction for the Lennard-Jones potential, both in two and three dimensions. The value of surface tension and also the free energy difference between the gas and the liquid phase increase significantly and converge only when the range of interaction is extended beyond 6-7 molecular diameters. We find, with the full range of interaction potential, that the surface tension shows only a weak dependence on supersaturation, so the reason for the breakdown of CNT (with simulated values of surface tension and free energy gap) cannot be attributed to the supersaturation dependence of surface tension. This remains an unsettled issue at present because of the use of the value of surface tension obtained at coexistence.
Identifying collaborative care teams through electronic medical record utilization patterns.
Chen, You; Lorenzi, Nancy M; Sandberg, Warren S; Wolgast, Kelly; Malin, Bradley A
2017-04-01
The goal of this investigation was to determine whether automated approaches can learn patient-oriented care teams via utilization of an electronic medical record (EMR) system. To perform this investigation, we designed a data-mining framework that relies on a combination of latent topic modeling and network analysis to infer patterns of collaborative teams. We applied the framework to the EMR utilization records of over 10 000 employees and 17 000 inpatients at a large academic medical center during a 4-month window in 2010. Next, we conducted an extrinsic evaluation of the patterns to determine the plausibility of the inferred care teams via surveys with knowledgeable experts. Finally, we conducted an intrinsic evaluation to contextualize each team in terms of collaboration strength (via a cluster coefficient) and clinical credibility (via associations between teams and patient comorbidities). The framework discovered 34 collaborative care teams, 27 (79.4%) of which were confirmed as administratively plausible. Of those, 26 teams depicted strong collaborations, with a cluster coefficient > 0.5. There were 119 diagnostic conditions associated with 34 care teams. Additionally, to provide clarity on how the survey respondents arrived at their determinations, we worked with several oncologists to develop an illustrative example of how a certain team functions in cancer care. Inferred collaborative teams are plausible; translating such patterns into optimized collaborative care will require administrative review and integration with management practices. EMR utilization records can be mined for collaborative care patterns in large complex medical centers. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Yeatts, Dale E; Seckin, Gul; Shen, Yuying; Thompson, Michael; Auden, Dana; Cready, Cynthia M
2018-01-10
The many negative effects of burnout have prompted researchers to better understand the factors contributing to it. The purpose of this paper is to add to this body of knowledge through the study of burnout among direct care workers (DCWs) in nursing homes (NH). Perhaps the factor most often associated with employee burnout is the level of staffing-insufficient staffing results in work overload and eventually employee burnout. A closer look at research findings suggest that there are many other factors also contributing to burnout. These range from those at the organizational level, such as availability of training and resources to individual characteristics such as self-esteem and length of employment. A self-administered survey instrument was completed by 410 DCWs working within 11 NHs in the north Texas region. Regression analyses were performed, adjusting for clustering by NH. Beta coefficients and structure coefficients are reported. Burnout was measured through three dimensions: emotional exhaustion, depersonalization, and personal accomplishment. Organizational, work design, interpersonal, and individual characteristics were found to be associated with one or more dimensions of burnout. The analyses largely support previous research. Organizational variables of significance included the availability of resources to do the work, available training, and fair pay. Work design variables of significance included adequate staffing. The individual characteristic, self-esteem, appeared to have the strongest impact on burnout. Commitment to the organization also had a large impact. While the data do not allow for the testing of causal relationships, the data do suggest that providing adequate staffing, perceived fair pay, sufficient work resources (e.g., towels, gowns), management support, and adequate training may result in less DCW burnout on the job. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Wedge geometry, frictional properties and interseismic coupling of the Java megathrust
NASA Astrophysics Data System (ADS)
Koulali, Achraf; McClusky, Simon; Cummins, Phil; Tregoning, Paul
2018-06-01
The mechanical interaction between rocks at fault zones is a key element for understanding how earthquakes nucleate and propagate. Therefore, estimating frictional properties along fault planes allows us to infer the degree of elastic strain accumulation throughout the seismic cycle. The Java subduction zone is an active plate boundary where high seismic activity has long been documented. However, very little is known about the seismogenic processes of the megathrust, especially its shallowest portion where onshore geodetic networks are insensitive to recover the pattern of elastic strain. Here, we use the geometry of the offshore accretionary prism to infer frictional properties along the Java subduction zone, using Coulomb critical taper theory. We show that large portions of the inner wedge in the eastern part of the Java subduction megathrust are in a critical state, where the wedge is on the verge of failure everywhere. We identify four clusters with an internal coefficient of friction μint of ∼ 0.8 and hydrostatic pore pressure within the wedge. The average effective coefficient of friction ranges between 0.3 and 0.4, reflecting a strong décollement. Our results also show that the aftershock sequence of the 1994 Mw 7.9 earthquake halted adjacent to a critical segment of the wedge, suggesting that critical taper wedge areas in the eastern Java subduction interface may behave as a permanent barrier to large earthquake rupture. In contrast, in western Java topographic slope and slab dip profiles suggest that the wedge is mechanically stable, i.e deformation is restricted to sliding along the décollement, and likely to coincide with a seismogenic portion of the megathrust. We discuss the seismic hazard implications and highlight the importance of considering the segmentation of the Java subduction zone when assessing the seismic hazard of this region.