Stocker, Gernot; Rieder, Dietmar; Trajanoski, Zlatko
2004-03-22
ClusterControl is a web interface to simplify distributing and monitoring bioinformatics applications on Linux cluster systems. We have developed a modular concept that enables integration of command line oriented program into the application framework of ClusterControl. The systems facilitate integration of different applications accessed through one interface and executed on a distributed cluster system. The package is based on freely available technologies like Apache as web server, PHP as server-side scripting language and OpenPBS as queuing system and is available free of charge for academic and non-profit institutions. http://genome.tugraz.at/Software/ClusterControl
Abreu, Rui Mv; Froufe, Hugo Jc; Queiroz, Maria João Rp; Ferreira, Isabel Cfr
2010-10-28
Virtual screening of small molecules using molecular docking has become an important tool in drug discovery. However, large scale virtual screening is time demanding and usually requires dedicated computer clusters. There are a number of software tools that perform virtual screening using AutoDock4 but they require access to dedicated Linux computer clusters. Also no software is available for performing virtual screening with Vina using computer clusters. In this paper we present MOLA, an easy-to-use graphical user interface tool that automates parallel virtual screening using AutoDock4 and/or Vina in bootable non-dedicated computer clusters. MOLA automates several tasks including: ligand preparation, parallel AutoDock4/Vina jobs distribution and result analysis. When the virtual screening project finishes, an open-office spreadsheet file opens with the ligands ranked by binding energy and distance to the active site. All results files can automatically be recorded on an USB-flash drive or on the hard-disk drive using VirtualBox. MOLA works inside a customized Live CD GNU/Linux operating system, developed by us, that bypass the original operating system installed on the computers used in the cluster. This operating system boots from a CD on the master node and then clusters other computers as slave nodes via ethernet connections. MOLA is an ideal virtual screening tool for non-experienced users, with a limited number of multi-platform heterogeneous computers available and no access to dedicated Linux computer clusters. When a virtual screening project finishes, the computers can just be restarted to their original operating system. The originality of MOLA lies on the fact that, any platform-independent computer available can he added to the cluster, without ever using the computer hard-disk drive and without interfering with the installed operating system. With a cluster of 10 processors, and a potential maximum speed-up of 10x, the parallel algorithm of MOLA performed with a speed-up of 8,64× using AutoDock4 and 8,60× using Vina.
Liu, Yan-Lin; Shih, Cheng-Ting; Chang, Yuan-Jen; Chang, Shu-Jun; Wu, Jay
2014-01-01
The rapid development of picture archiving and communication systems (PACSs) thoroughly changes the way of medical informatics communication and management. However, as the scale of a hospital's operations increases, the large amount of digital images transferred in the network inevitably decreases system efficiency. In this study, a server cluster consisting of two server nodes was constructed. Network load balancing (NLB), distributed file system (DFS), and structured query language (SQL) duplication services were installed. A total of 1 to 16 workstations were used to transfer computed radiography (CR), computed tomography (CT), and magnetic resonance (MR) images simultaneously to simulate the clinical situation. The average transmission rate (ATR) was analyzed between the cluster and noncluster servers. In the download scenario, the ATRs of CR, CT, and MR images increased by 44.3%, 56.6%, and 100.9%, respectively, when using the server cluster, whereas the ATRs increased by 23.0%, 39.2%, and 24.9% in the upload scenario. In the mix scenario, the transmission performance increased by 45.2% when using eight computer units. The fault tolerance mechanisms of the server cluster maintained the system availability and image integrity. The server cluster can improve the transmission efficiency while maintaining high reliability and continuous availability in a healthcare environment.
Chang, Shu-Jun; Wu, Jay
2014-01-01
The rapid development of picture archiving and communication systems (PACSs) thoroughly changes the way of medical informatics communication and management. However, as the scale of a hospital's operations increases, the large amount of digital images transferred in the network inevitably decreases system efficiency. In this study, a server cluster consisting of two server nodes was constructed. Network load balancing (NLB), distributed file system (DFS), and structured query language (SQL) duplication services were installed. A total of 1 to 16 workstations were used to transfer computed radiography (CR), computed tomography (CT), and magnetic resonance (MR) images simultaneously to simulate the clinical situation. The average transmission rate (ATR) was analyzed between the cluster and noncluster servers. In the download scenario, the ATRs of CR, CT, and MR images increased by 44.3%, 56.6%, and 100.9%, respectively, when using the server cluster, whereas the ATRs increased by 23.0%, 39.2%, and 24.9% in the upload scenario. In the mix scenario, the transmission performance increased by 45.2% when using eight computer units. The fault tolerance mechanisms of the server cluster maintained the system availability and image integrity. The server cluster can improve the transmission efficiency while maintaining high reliability and continuous availability in a healthcare environment. PMID:24701580
Cluster redshifts in five suspected superclusters
NASA Technical Reports Server (NTRS)
Ciardullo, R.; Ford, H.; Harms, R.
1985-01-01
Redshift surveys for rich superclusters were carried out in five regions of the sky containing surface-density enhancements of Abell clusters. While several superclusters are identified, projection effects dominate each field, and no system contains more than five rich clusters. Two systems are found to be especially interesting. The first, field 0136 10, is shown to contain a superposition of at least four distinct superclusters, with the richest system possessing a small velocity dispersion. The second system, 2206 - 22, though a region of exceedingly high Abell cluster surface density, appears to be a remarkable superposition of 23 rich clusters almost uniformly distributed in redshift space between 0.08 and 0.24. The new redshifts significantly increase the three-dimensional information available for the distance class 5 and 6 Abell clusters and allow the spatial correlation function around rich superclusters to be estimated.
Cluster formation by allelomimesis in real-world complex adaptive systems
NASA Astrophysics Data System (ADS)
Juanico, Dranreb Earl; Monterola, Christopher; Saloma, Caesar
2005-04-01
Animal and human clusters are complex adaptive systems and many organize in cluster sizes s that obey the frequency distribution D(s)∝s-τ . The exponent τ describes the relative abundance of the cluster sizes in a given system. Data analyses reveal that real-world clusters exhibit a broad spectrum of τ values, 0.7 (tuna fish schools) ⩽τ⩽4.61 (T4 bacteriophage gene family sizes). Allelomimesis is proposed as an underlying mechanism for adaptation that explains the observed broad τ spectrum. Allelomimesis is the tendency of an individual to imitate the actions of others and two cluster systems have different τ values when their component agents display unequal degrees of allelomimetic tendencies. Cluster formation by allelomimesis is shown to be of three general types: namely, blind copying, information-use copying, and noncopying. Allelomimetic adaptation also reveals that the most stable cluster size is formed by three strongly allelomimetic individuals. Our finding is consistent with available field data taken from killer whales and marmots.
Jungle Computing: Distributed Supercomputing Beyond Clusters, Grids, and Clouds
NASA Astrophysics Data System (ADS)
Seinstra, Frank J.; Maassen, Jason; van Nieuwpoort, Rob V.; Drost, Niels; van Kessel, Timo; van Werkhoven, Ben; Urbani, Jacopo; Jacobs, Ceriel; Kielmann, Thilo; Bal, Henri E.
In recent years, the application of high-performance and distributed computing in scientific practice has become increasingly wide spread. Among the most widely available platforms to scientists are clusters, grids, and cloud systems. Such infrastructures currently are undergoing revolutionary change due to the integration of many-core technologies, providing orders-of-magnitude speed improvements for selected compute kernels. With high-performance and distributed computing systems thus becoming more heterogeneous and hierarchical, programming complexity is vastly increased. Further complexities arise because urgent desire for scalability and issues including data distribution, software heterogeneity, and ad hoc hardware availability commonly force scientists into simultaneous use of multiple platforms (e.g., clusters, grids, and clouds used concurrently). A true computing jungle.
Sorokin, A; Vancassel, X; Mirabel, P
2005-12-22
A kinetic model to predict nucleation rates in the sulfuric acid-water system is presented. It allows calculating steady-state nucleation rates and the corresponding time lag, using a direct solution of a system of kinetic equations that describe the populations of sub- and near-critical clusters. This kinetic model takes into account cluster-cluster collisions and decay of clusters into smaller clusters. The model results are compared with some predictions obtained with the classical nucleation theory (CNT) and also with available measurement data obtained in smog chambers or flow tubes. It is shown that in the case of slow nucleation processes, the kinetic model and the CNT as used by Shugard et al. [J. Chem. Phys. 75, 5298 (1974)] give the same results. However, in the case of intensive nucleation, a large part of the nucleation flux is due to cluster-cluster collisions and the CNT underestimates the nucleation rates.
Towards Effective Clustering Techniques for the Analysis of Electric Power Grids
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hogan, Emilie A.; Cotilla Sanchez, Jose E.; Halappanavar, Mahantesh
2013-11-30
Clustering is an important data analysis technique with numerous applications in the analysis of electric power grids. Standard clustering techniques are oblivious to the rich structural and dynamic information available for power grids. Therefore, by exploiting the inherent topological and electrical structure in the power grid data, we propose new methods for clustering with applications to model reduction, locational marginal pricing, phasor measurement unit (PMU or synchrophasor) placement, and power system protection. We focus our attention on model reduction for analysis based on time-series information from synchrophasor measurement devices, and spectral techniques for clustering. By comparing different clustering techniques onmore » two instances of realistic power grids we show that the solutions are related and therefore one could leverage that relationship for a computational advantage. Thus, by contrasting different clustering techniques we make a case for exploiting structure inherent in the data with implications for several domains including power systems.« less
Clustervision: Visual Supervision of Unsupervised Clustering.
Kwon, Bum Chul; Eysenbach, Ben; Verma, Janu; Ng, Kenney; De Filippi, Christopher; Stewart, Walter F; Perer, Adam
2018-01-01
Clustering, the process of grouping together similar items into distinct partitions, is a common type of unsupervised machine learning that can be useful for summarizing and aggregating complex multi-dimensional data. However, data can be clustered in many ways, and there exist a large body of algorithms designed to reveal different patterns. While having access to a wide variety of algorithms is helpful, in practice, it is quite difficult for data scientists to choose and parameterize algorithms to get the clustering results relevant for their dataset and analytical tasks. To alleviate this problem, we built Clustervision, a visual analytics tool that helps ensure data scientists find the right clustering among the large amount of techniques and parameters available. Our system clusters data using a variety of clustering techniques and parameters and then ranks clustering results utilizing five quality metrics. In addition, users can guide the system to produce more relevant results by providing task-relevant constraints on the data. Our visual user interface allows users to find high quality clustering results, explore the clusters using several coordinated visualization techniques, and select the cluster result that best suits their task. We demonstrate this novel approach using a case study with a team of researchers in the medical domain and showcase that our system empowers users to choose an effective representation of their complex data.
Ciesielski, Szymon J; Craig, Elizabeth A
2017-02-01
Though toxic in excess, iron is vital for life. Thus, its use in all cells is tightly regulated. Analysis of Saccharomyces cerevisiae, which has been used extensively as a model system, has revealed layers of regulation of cellular iron trafficking and utilization. This regulation is based on the availability of both elemental iron and functionality of the Fe-S cluster biogenesis system. Here, we discuss a possible "first responder" regulatory mechanism centered on the stability of the scaffold protein on which Fe-S clusters are built.
A dynamical study of Galactic globular clusters under different relaxation conditions
NASA Astrophysics Data System (ADS)
Zocchi, A.; Bertin, G.; Varri, A. L.
2012-03-01
Aims: We perform a systematic combined photometric and kinematic analysis of a sample of globular clusters under different relaxation conditions, based on their core relaxation time (as listed in available catalogs), by means of two well-known families of spherical stellar dynamical models. Systems characterized by shorter relaxation time scales are expected to be better described by isotropic King models, while less relaxed systems might be interpreted by means of non-truncated, radially-biased anisotropic f(ν) models, originally designed to represent stellar systems produced by a violent relaxation formation process and applied here for the first time to the study of globular clusters. Methods: The comparison between dynamical models and observations is performed by fitting simultaneously surface brightness and velocity dispersion profiles. For each globular cluster, the best-fit model in each family is identified, along with a full error analysis on the relevant parameters. Detailed structural properties and mass-to-light ratios are also explicitly derived. Results: We find that King models usually offer a good representation of the observed photometric profiles, but often lead to less satisfactory fits to the kinematic profiles, independently of the relaxation condition of the systems. For some less relaxed clusters, f(ν) models provide a good description of both observed profiles. Some derived structural characteristics, such as the total mass or the half-mass radius, turn out to be significantly model-dependent. The analysis confirms that, to answer some important dynamical questions that bear on the formation and evolution of globular clusters, it would be highly desirable to acquire larger numbers of accurate kinematic data-points, well distributed over the cluster field. Appendices are available in electronic form at http://www.aanda.org
GibbsCluster: unsupervised clustering and alignment of peptide sequences.
Andreatta, Massimo; Alvarez, Bruno; Nielsen, Morten
2017-07-03
Receptor interactions with short linear peptide fragments (ligands) are at the base of many biological signaling processes. Conserved and information-rich amino acid patterns, commonly called sequence motifs, shape and regulate these interactions. Because of the properties of a receptor-ligand system or of the assay used to interrogate it, experimental data often contain multiple sequence motifs. GibbsCluster is a powerful tool for unsupervised motif discovery because it can simultaneously cluster and align peptide data. The GibbsCluster 2.0 presented here is an improved version incorporating insertion and deletions accounting for variations in motif length in the peptide input. In basic terms, the program takes as input a set of peptide sequences and clusters them into meaningful groups. It returns the optimal number of clusters it identified, together with the sequence alignment and sequence motif characterizing each cluster. Several parameters are available to customize cluster analysis, including adjustable penalties for small clusters and overlapping groups and a trash cluster to remove outliers. As an example application, we used the server to deconvolute multiple specificities in large-scale peptidome data generated by mass spectrometry. The server is available at http://www.cbs.dtu.dk/services/GibbsCluster-2.0. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Identification of piecewise affine systems based on fuzzy PCA-guided robust clustering technique
NASA Astrophysics Data System (ADS)
Khanmirza, Esmaeel; Nazarahari, Milad; Mousavi, Alireza
2016-12-01
Hybrid systems are a class of dynamical systems whose behaviors are based on the interaction between discrete and continuous dynamical behaviors. Since a general method for the analysis of hybrid systems is not available, some researchers have focused on specific types of hybrid systems. Piecewise affine (PWA) systems are one of the subsets of hybrid systems. The identification of PWA systems includes the estimation of the parameters of affine subsystems and the coefficients of the hyperplanes defining the partition of the state-input domain. In this paper, we have proposed a PWA identification approach based on a modified clustering technique. By using a fuzzy PCA-guided robust k-means clustering algorithm along with neighborhood outlier detection, the two main drawbacks of the well-known clustering algorithms, i.e., the poor initialization and the presence of outliers, are eliminated. Furthermore, this modified clustering technique enables us to determine the number of subsystems without any prior knowledge about system. In addition, applying the structure of the state-input domain, that is, considering the time sequence of input-output pairs, provides a more efficient clustering algorithm, which is the other novelty of this work. Finally, the proposed algorithm has been evaluated by parameter identification of an IGV servo actuator. Simulation together with experiment analysis has proved the effectiveness of the proposed method.
Atomistic cluster alignment method for local order mining in liquids and glasses
NASA Astrophysics Data System (ADS)
Fang, X. W.; Wang, C. Z.; Yao, Y. X.; Ding, Z. J.; Ho, K. M.
2010-11-01
An atomistic cluster alignment method is developed to identify and characterize the local atomic structural order in liquids and glasses. With the “order mining” idea for structurally disordered systems, the method can detect the presence of any type of local order in the system and can quantify the structural similarity between a given set of templates and the aligned clusters in a systematic and unbiased manner. Moreover, population analysis can also be carried out for various types of clusters in the system. The advantages of the method in comparison with other previously developed analysis methods are illustrated by performing the structural analysis for four prototype systems (i.e., pure Al, pure Zr, Zr35Cu65 , and Zr36Ni64 ). The results show that the cluster alignment method can identify various types of short-range orders (SROs) in these systems correctly while some of these SROs are difficult to capture by most of the currently available analysis methods (e.g., Voronoi tessellation method). Such a full three-dimensional atomistic analysis method is generic and can be applied to describe the magnitude and nature of noncrystalline ordering in many disordered systems.
NASA Astrophysics Data System (ADS)
Kumar, Rohit; Puri, Rajeev K.
2018-03-01
Employing the quantum molecular dynamics (QMD) approach for nucleus-nucleus collisions, we test the predictive power of the energy-based clusterization algorithm, i.e., the simulating annealing clusterization algorithm (SACA), to describe the experimental data of charge distribution and various event-by-event correlations among fragments. The calculations are constrained into the Fermi-energy domain and/or mildly excited nuclear matter. Our detailed study spans over different system masses, and system-mass asymmetries of colliding partners show the importance of the energy-based clusterization algorithm for understanding multifragmentation. The present calculations are also compared with the other available calculations, which use one-body models, statistical models, and/or hybrid models.
NASA Astrophysics Data System (ADS)
Wang, X.; Hoag, A.; Huang, K.-H.; Treu, T.; Bradač, M.; Schmidt, K. B.; Brammer, G. B.; Vulcani, B.; Jones, T. A.; Ryan, R. E., Jr.; Amorín, R.; Castellano, M.; Fontana, A.; Merlin, E.; Trenti, M.
2015-09-01
We present a strong and weak lensing reconstruction of the massive cluster Abell 2744, the first cluster for which deep Hubble Frontier Fields (HFF) images and spectroscopy from the Grism Lens-Amplified Survey from Space (GLASS) are available. By performing a targeted search for emission lines in multiply imaged sources using the GLASS spectra, we obtain five high-confidence spectroscopic redshifts and two tentative ones. We confirm one strongly lensed system by detecting the same emission lines in all three multiple images. We also search for additional line emitters blindly and use the full GLASS spectroscopic catalog to test reliability of photometric redshifts for faint line emitters. We see a reasonable agreement between our photometric and spectroscopic redshift measurements, when including nebular emission in photometric redshift estimations. We introduce a stringent procedure to identify only secure multiple image sets based on colors, morphology, and spectroscopy. By combining 7 multiple image systems with secure spectroscopic redshifts (at 5 distinct redshift planes) with 18 multiple image systems with secure photometric redshifts, we reconstruct the gravitational potential of the cluster pixellated on an adaptive grid, using a total of 72 images. The resulting mass map is compared with a stellar mass map obtained from the deep Spitzer Frontier Fields data to study the relative distribution of stars and dark matter in the cluster. We find that the stellar to total mass ratio varies substantially across the cluster field, suggesting that stars do not trace exactly the total mass in this interacting system. The maps of convergence, shear, and magnification are made available in the standard HFF format.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, X.; Schmidt, K. B.; Jones, T. A.
2015-09-20
We present a strong and weak lensing reconstruction of the massive cluster Abell 2744, the first cluster for which deep Hubble Frontier Fields (HFF) images and spectroscopy from the Grism Lens-Amplified Survey from Space (GLASS) are available. By performing a targeted search for emission lines in multiply imaged sources using the GLASS spectra, we obtain five high-confidence spectroscopic redshifts and two tentative ones. We confirm one strongly lensed system by detecting the same emission lines in all three multiple images. We also search for additional line emitters blindly and use the full GLASS spectroscopic catalog to test reliability of photometricmore » redshifts for faint line emitters. We see a reasonable agreement between our photometric and spectroscopic redshift measurements, when including nebular emission in photometric redshift estimations. We introduce a stringent procedure to identify only secure multiple image sets based on colors, morphology, and spectroscopy. By combining 7 multiple image systems with secure spectroscopic redshifts (at 5 distinct redshift planes) with 18 multiple image systems with secure photometric redshifts, we reconstruct the gravitational potential of the cluster pixellated on an adaptive grid, using a total of 72 images. The resulting mass map is compared with a stellar mass map obtained from the deep Spitzer Frontier Fields data to study the relative distribution of stars and dark matter in the cluster. We find that the stellar to total mass ratio varies substantially across the cluster field, suggesting that stars do not trace exactly the total mass in this interacting system. The maps of convergence, shear, and magnification are made available in the standard HFF format.« less
NASA Astrophysics Data System (ADS)
Capuzzo-Dolcetta, Roberto
1993-10-01
Among the possible phenomena inducing evolution of the globular cluster system in an elliptical galaxy, dynamical friction due to field stars and tidal disruption caused by a central nucleus is of crucial importance. The aim of this paper is the study of the evolution of the globular cluster system in a triaxial galaxy in the presence of these phenomena. In particular, the possibility is examined that some galactic nuclei have been formed by frictionally decayed globular clusters moving in a triaxial potential. We find that the initial rapid growth of the nucleus, due mainly to massive clusters on box orbits falling in a short time scale into the galactic center, is later slowed by tidal disruption induced by the nucleus itself on less massive clusters in the way described by Ostriker, Binney, and Saha. The efficiency of dynamical friction is such to carry to the center of the galaxy enough globular cluster mass available to form a compact nucleus, but the actual modes and results of cluster-cluster encounters in the central potential well are complicated phenomena which remains to be investigated. The mass of the resulting nucleus is determined by the mutual feedback of the described processes, together with the initial spatial, velocity, and mass distributions of the globular cluster family. The effect on the system mass function is studied, showing the development of a low- and high-mass turnover even with an initially flat mass function. Moreover, in this paper is discussed the possibility that the globular cluster fall to the galactic center has been a cause of primordial violent galactic activity. An application of the model to M31 is presented.
Universal dynamical properties preclude standard clustering in a large class of biochemical data.
Gomez, Florian; Stoop, Ralph L; Stoop, Ruedi
2014-09-01
Clustering of chemical and biochemical data based on observed features is a central cognitive step in the analysis of chemical substances, in particular in combinatorial chemistry, or of complex biochemical reaction networks. Often, for reasons unknown to the researcher, this step produces disappointing results. Once the sources of the problem are known, improved clustering methods might revitalize the statistical approach of compound and reaction search and analysis. Here, we present a generic mechanism that may be at the origin of many clustering difficulties. The variety of dynamical behaviors that can be exhibited by complex biochemical reactions on variation of the system parameters are fundamental system fingerprints. In parameter space, shrimp-like or swallow-tail structures separate parameter sets that lead to stable periodic dynamical behavior from those leading to irregular behavior. We work out the genericity of this phenomenon and demonstrate novel examples for their occurrence in realistic models of biophysics. Although we elucidate the phenomenon by considering the emergence of periodicity in dependence on system parameters in a low-dimensional parameter space, the conclusions from our simple setting are shown to continue to be valid for features in a higher-dimensional feature space, as long as the feature-generating mechanism is not too extreme and the dimension of this space is not too high compared with the amount of available data. For online versions of super-paramagnetic clustering see http://stoop.ini.uzh.ch/research/clustering. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
X-ray morphological study of the ESZ sample
NASA Astrophysics Data System (ADS)
Lovisari, L.; Forman, W.; Jones, C.; Andrade-Santos, F.; Democles, J.; Pratt, G.; Ettori, S.; Arnaud, M.; Randall, S.; Kraft, R.
2017-10-01
An accurate knowledge of the scaling relations between X-ray observables and cluster mass is a crucial step for studies that aim to constrain cosmological parameters using galaxy clusters. The measure of the dynamical state of the systems offers important information to obtain precise scaling relations and understand their scatter. Unfortunately, characterize the dynamical state of a galaxy cluster requires to access a large set of information in different wavelength which are available only for a few individual systems. An alternative is to compute well defined morphological parameters making use of the relatively cheap X-ray images and profiles. Due to different projection effects none of the methods is good in all the cases and a combination of them is more effective to quantify the level of substructures. I will present the cluster morphologies that we derived for the ESZ sample. I will show their dependence on different cluster properties like total mass, redshift, and luminosity and how they differ from the ones obtained for X-ray selected clusters.
Burns, Randal; Roncal, William Gray; Kleissas, Dean; Lillaney, Kunal; Manavalan, Priya; Perlman, Eric; Berger, Daniel R; Bock, Davi D; Chung, Kwanghun; Grosenick, Logan; Kasthuri, Narayanan; Weiler, Nicholas C; Deisseroth, Karl; Kazhdan, Michael; Lichtman, Jeff; Reid, R Clay; Smith, Stephen J; Szalay, Alexander S; Vogelstein, Joshua T; Vogelstein, R Jacob
2013-01-01
We describe a scalable database cluster for the spatial analysis and annotation of high-throughput brain imaging data, initially for 3-d electron microscopy image stacks, but for time-series and multi-channel data as well. The system was designed primarily for workloads that build connectomes - neural connectivity maps of the brain-using the parallel execution of computer vision algorithms on high-performance compute clusters. These services and open-science data sets are publicly available at openconnecto.me. The system design inherits much from NoSQL scale-out and data-intensive computing architectures. We distribute data to cluster nodes by partitioning a spatial index. We direct I/O to different systems-reads to parallel disk arrays and writes to solid-state storage-to avoid I/O interference and maximize throughput. All programming interfaces are RESTful Web services, which are simple and stateless, improving scalability and usability. We include a performance evaluation of the production system, highlighting the effec-tiveness of spatial data organization.
N-body experiments and missing mass in clusters of galaxies
NASA Technical Reports Server (NTRS)
Smith, H.; Hintzen, P.; Sofia, S.; Oegerle, W.; Scott, J.; Holman, G.
1979-01-01
It is commonly assumed that the distributions of surface density and radial-velocity dispersion in clusters of galaxies are sensitive tracers of the underlying distribution of any unseen mass. N-body experiments have been used to test this assumption. Calculations with equal-mass systems indicate that the effects of the underlying mass distribution cannot be detected by observations of the surface-density or radial-velocity distributions, and the existence of an extended binding mass in all well-studied clusters would be consistent with available observations.
NASA Astrophysics Data System (ADS)
Stone, Maria Babakhanyan
Ultra-diffuse galaxies are a novel type of galaxies discovered first in the Coma cluster. These objects are characterized simultaneously by large sizes and by very low counts of constituent stars. Conflicting theories have been proposed to explain how these large diffuse galaxies could have survived in the harsh environment of clusters. To date, thousands of these new galaxies have been identified in cluster environments. However, further studies are required to understand their relationship to the known giant and dwarf classes of galaxies. The purpose of this study is to compare the trends of inner and outer populations of normal members of the Coma cluster and ultra-diffuse galaxies in color-magnitude space. The present work used several astronomical catalogs to identify the member galaxies based on the coordinates of their positions and to extract available colors and magnitudes. We obtained correlations to convert colors and magnitudes from different systems into the common Sloan Digital Sky Survey system to facilitate the comparative analysis. We showed the quantitative relations describing the color-magnitude trends of galaxies in the core and the outskirts of the cluster. We confirmed that the inner and outer populations of ultra-diffuse galaxies exhibit an offset similar to the normal red sequence galaxies. We presented an initial assessment of stellar population ages and metallicities which correspond to the obtained color offsets. We surveyed the available images of the cluster for outliers, merger candidates, and candidate ultra-diffuse galaxies. We conclude that ultra-diffuse galaxies are an important part of the Coma cluster evolutionary history and future work is needed especially in obtaining spectroscopic data of a larger number of these dim galaxies.
Cloudgene: A graphical execution platform for MapReduce programs on private and public clouds
2012-01-01
Background The MapReduce framework enables a scalable processing and analyzing of large datasets by distributing the computational load on connected computer nodes, referred to as a cluster. In Bioinformatics, MapReduce has already been adopted to various case scenarios such as mapping next generation sequencing data to a reference genome, finding SNPs from short read data or matching strings in genotype files. Nevertheless, tasks like installing and maintaining MapReduce on a cluster system, importing data into its distributed file system or executing MapReduce programs require advanced knowledge in computer science and could thus prevent scientists from usage of currently available and useful software solutions. Results Here we present Cloudgene, a freely available platform to improve the usability of MapReduce programs in Bioinformatics by providing a graphical user interface for the execution, the import and export of data and the reproducibility of workflows on in-house (private clouds) and rented clusters (public clouds). The aim of Cloudgene is to build a standardized graphical execution environment for currently available and future MapReduce programs, which can all be integrated by using its plug-in interface. Since Cloudgene can be executed on private clusters, sensitive datasets can be kept in house at all time and data transfer times are therefore minimized. Conclusions Our results show that MapReduce programs can be integrated into Cloudgene with little effort and without adding any computational overhead to existing programs. This platform gives developers the opportunity to focus on the actual implementation task and provides scientists a platform with the aim to hide the complexity of MapReduce. In addition to MapReduce programs, Cloudgene can also be used to launch predefined systems (e.g. Cloud BioLinux, RStudio) in public clouds. Currently, five different bioinformatic programs using MapReduce and two systems are integrated and have been successfully deployed. Cloudgene is freely available at http://cloudgene.uibk.ac.at. PMID:22888776
Astrophysical properties of star clusters in the Magellanic Clouds homogeneously estimated by ASteCA
NASA Astrophysics Data System (ADS)
Perren, G. I.; Piatti, A. E.; Vázquez, R. A.
2017-06-01
Aims: We seek to produce a homogeneous catalog of astrophysical parameters of 239 resolved star clusters, located in the Small and Large Magellanic Clouds, observed in the Washington photometric system. Methods: The cluster sample was processed with the recently introduced Automated Stellar Cluster Analysis (ASteCA) package, which ensures both an automatized and a fully reproducible treatment, together with a statistically based analysis of their fundamental parameters and associated uncertainties. The fundamental parameters determined for each cluster with this tool, via a color-magnitude diagram (CMD) analysis, are metallicity, age, reddening, distance modulus, and total mass. Results: We generated a homogeneous catalog of structural and fundamental parameters for the studied cluster sample and performed a detailed internal error analysis along with a thorough comparison with values taken from 26 published articles. We studied the distribution of cluster fundamental parameters in both Clouds and obtained their age-metallicity relationships. Conclusions: The ASteCA package can be applied to an unsupervised determination of fundamental cluster parameters, which is a task of increasing relevance as more data becomes available through upcoming surveys. A table with the estimated fundamental parameters for the 239 clusters analyzed is only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/602/A89
How Is Fe-S Cluster Formation Regulated?
Mettert, Erin L; Kiley, Patricia J
2015-01-01
Iron-sulfur (Fe-S) clusters are fundamental to numerous biological processes in most organisms, but these protein cofactors can be prone to damage by various oxidants (e.g., O2, reactive oxygen species, and reactive nitrogen species) and toxic levels of certain metals (e.g., cobalt and copper). Furthermore, their synthesis can also be directly influenced by the level of available iron in the environment. Consequently, the cellular need for Fe-S cluster biogenesis varies with fluctuating growth conditions. To accommodate changes in Fe-S demand, microorganisms employ diverse regulatory strategies to tailor Fe-S cluster biogenesis according to their surroundings. Here, we review the mechanisms that regulate Fe-S cluster formation in bacteria, primarily focusing on control of the Isc and Suf Fe-S cluster biogenesis systems in the model bacterium Escherichia coli.
Burns, Randal; Roncal, William Gray; Kleissas, Dean; Lillaney, Kunal; Manavalan, Priya; Perlman, Eric; Berger, Daniel R.; Bock, Davi D.; Chung, Kwanghun; Grosenick, Logan; Kasthuri, Narayanan; Weiler, Nicholas C.; Deisseroth, Karl; Kazhdan, Michael; Lichtman, Jeff; Reid, R. Clay; Smith, Stephen J.; Szalay, Alexander S.; Vogelstein, Joshua T.; Vogelstein, R. Jacob
2013-01-01
We describe a scalable database cluster for the spatial analysis and annotation of high-throughput brain imaging data, initially for 3-d electron microscopy image stacks, but for time-series and multi-channel data as well. The system was designed primarily for workloads that build connectomes— neural connectivity maps of the brain—using the parallel execution of computer vision algorithms on high-performance compute clusters. These services and open-science data sets are publicly available at openconnecto.me. The system design inherits much from NoSQL scale-out and data-intensive computing architectures. We distribute data to cluster nodes by partitioning a spatial index. We direct I/O to different systems—reads to parallel disk arrays and writes to solid-state storage—to avoid I/O interference and maximize throughput. All programming interfaces are RESTful Web services, which are simple and stateless, improving scalability and usability. We include a performance evaluation of the production system, highlighting the effec-tiveness of spatial data organization. PMID:24401992
Huo, Yan; Dong, Wei; Qian, Jin; Jing, Tao
2017-01-01
In this paper, we address the low efficiency of cluster-based communication for the crossroad scenario in the Vehicular Cyber-Physical System (VCPS), which is due to the overload of the cluster head resulting from a large number of transmission bandwidth requirements. After formulating the issue as a coalition formation game, a coalition-based clustering strategy is proposed, which could converge into a Nash-stable partition to accomplish the clustering formation process. In the proposed strategy, the coalition utility is formulated by the relative velocity, relative position and the bandwidth availability ratio of vehicles among the cluster. Employing the coalition utility, the vehicles are denoted as the nodes that make the decision whether to switch to a new coalition or stay in the current coalition. Based on this, we can make full use of the bandwidth provided by cluster head under the requirement of clustering stability. Nevertheless, there exist selfish nodes during the clustering formation, so as to intend to benefit from networks. This behavior may degrade the communication quality and even destroy the cluster. Thus, we also present a reputation-based incentive and penalty mechanism to stop the selfish nodes from entering clusters. Numerical simulation results show that our strategy, CG-SECC, takes on a better performance for the tradeoff between the stability and efficiency of clustering communication. Besides, a case study demonstrates that the proposed incentive and penalty mechanism can play an important role in discovering and removing malicious nodes. PMID:28264469
NASA Astrophysics Data System (ADS)
Salinas, R.; Alabi, A.; Richtler, T.; Lane, R. R.
2015-05-01
As tracers of star formation, galaxy assembly, and mass distribution, globular clusters have provided important clues to our understanding of early-type galaxies. But their study has been mostly constrained to galaxy groups and clusters where early-type galaxies dominate, leaving the properties of the globular cluster systems (GCSs) of isolated ellipticals as a mostly uncharted territory. We present Gemini-South/GMOS g'i' observations of five isolated elliptical galaxies: NGC 3962, NGC 2865, IC 4889, NGC 2271, and NGC 4240. Photometry of their GCSs reveals clear color bimodality in three of them, but remains inconclusive for the other two. All the studied GCSs are rather poor with a mean specific frequency SN ~ 1.5, independently of the parent galaxy luminosity. Considering information from previous work as well, it is clear that bimodality and especially the presence of a significant, even dominant, population of blue clusters occurs at even the most isolated systems, which casts doubts on a possible accreted origin of metal-poor clusters, as suggested by some models. Additionally, we discuss the possible existence of ultra-compact dwarfs around the isolated elliptical NGC 3962. Based on observations obtained at the Gemini Observatory, which is operated by the Association of Universities for Research in Astronomy, Inc., under a cooperative agreement with the NSF on behalf of the Gemini partnership: the National Science Foundation (United States), the Science and Technology Facilities Council (United Kingdom), the National Research Council (Canada), CONICYT (Chile), the Australian Research Council (Australia), Ministério da Ciência, Tecnologia e Inovação (Brazil) and Ministerio de Ciencia, Tecnología e Innovación Productiva (Argentina).Globular cluster photometry is available in electronic form at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (ftp://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/577/A59Appendices are available in electronic form at http://www.aanda.org
Kinematics and dynamics of the MKW/AWM poor clusters
NASA Technical Reports Server (NTRS)
Beers, Timothy C.; Kriessler, Jeffrey R.; Bird, Christina M.; Huchra, John P.
1995-01-01
We report 472 new redshifts for 416 galaxies in the regions of the 23 poor clusters of galaxies originally identified by Morgan, Kayser, and White (MKW), and Albert, White, and Morgan (AWM). Eighteen of the poor clusters now have 10 or more available redshifts within 1.5/h Mpc of the central galaxy; 11 clusters have at least 20 available redshifts. Based on the 21 clusters for which we have sufficient velocity information, the median velocity scale is 336 km/s, a factor of 2 smaller than found for rich clusters. Several of the poor clusters exhibit complex velocity distributions due to the presence of nearby clumps of galaxies. We check on the velocity of the dominant galaxy in each poor cluster relative to the remaining cluster members. Significantly high relative velocities of the dominant galaxy are found in only 4 of 21 poor clusters, 3 of which we suspect are due to contamination of the parent velocity distribution. Several statistical tests indicate that the D/cD galaxies are at the kinematic centers of the parent poor cluster velocity distributions. Mass-to-light ratios for 13 of the 15 poor clusters for which we have the required data are in the range 50 less than or = M/L(sub B(0)) less than or = 200 solar mass/solar luminosity. The complex nature of the regions surrounding many of the poor clusters suggests that these groupings may represent an early epoch of cluster formation. For example, the poor clusters MKW7 and MKWS are shown to be gravitationally bound and likely to merge to form a richer cluster within the next several Gyrs. Eight of the nine other poor clusters for which simple two-body dynamical models can be carried out are consistent with being bound to other clumps in their vicinity. Additional complex systems with more than two gravitationally bound clumps are observed among the poor clusters.
The effect of host cluster gravitational tidal forces on the internal dynamics of spiral galaxies
NASA Astrophysics Data System (ADS)
Mayer, Alexander
2013-04-01
New empirical observation by Bidin, Carraro, Mendez & Smith finds ``a lack of dark matter in the Solar neighborhood" (2012 ApJ 751, 30). This, and the discovery of a vast polar structure of Milky Way satellites by Pawlowski, Pflamm-Altenburg & Kroupa (2012 MNRAS 423, 1109), conflict with the prevailing interpretation of the measured Galactic rotation curve. Simulating the dynamical effects of host cluster tidal forces on galaxy disks reveals radial migration in a spiral structure and an orbital velocity that accelerates with increasing galactocentric radial coordinate. A virtual ``toy model,'' which is based on an Earth-orbiting system of particles and is physically realizable in principle, is available at GravitySim.net. Given the perturbing gravitational effect of the host cluster on a spiral galaxy disk and that a similar effect does not exist for the Solar System, the two systems represent distinct classes of gravitational dynamical systems. The observed `flat' and accelerating rotation curves of spiral galaxies can be attributed to gravitational interaction with the host cluster; no `dark matter halo' is required to explain the observable.
NASA Astrophysics Data System (ADS)
Wagner-Kaiser, R.; Mackey, Dougal; Sarajedini, Ata; Cohen, Roger E.; Geisler, Doug; Yang, Soung-Chul; Grocholski, Aaron J.; Cummings, Jeffrey D.
2018-03-01
We leverage new high-quality data from Hubble Space Telescope program GO-14164 to explore the variation in horizontal branch morphology among globular clusters in the Large Magellanic Cloud (LMC). Our new observations lead to photometry with a precision commensurate with that available for the Galactic globular cluster population. Our analysis indicates that, once metallicity is accounted for, clusters in the LMC largely share similar horizontal branch morphologies regardless of their location within the system. Furthermore, the LMC clusters possess, on average, slightly redder morphologies than most of the inner halo Galactic population; we find, instead, that their characteristics tend to be more similar to those exhibited by clusters in the outer Galactic halo. Our results are consistent with previous studies, showing a correlation between horizontal branch morphology and age.
Huo, Yan; Dong, Wei; Qian, Jin; Jing, Tao
2017-02-27
In this paper, we address the low efficiency of cluster-based communication for the crossroad scenario in the Vehicular Cyber-Physical System (VCPS), which is due to the overload of the cluster head resulting from a large number of transmission bandwidth requirements. After formulating the issue as a coalition formation game, a coalition-based clustering strategy is proposed, which could converge into a Nash-stable partition to accomplish the clustering formation process. In the proposed strategy, the coalition utility is formulated by the relative velocity, relative position and the bandwidth availability ratio of vehicles among the cluster. Employing the coalition utility, the vehicles are denoted as the nodes that make the decision whether to switch to a new coalition or stay in the current coalition. Based on this, we can make full use of the bandwidth provided by cluster head under the requirement of clustering stability. Nevertheless, there exist selfish nodes duringtheclusteringformation,soastointendtobenefitfromnetworks. Thisbehaviormaydegrade the communication quality and even destroy the cluster. Thus, we also present a reputation-based incentive and penalty mechanism to stop the selfish nodes from entering clusters. Numerical simulation results show that our strategy, CG-SECC, takes on a better performance for the tradeoff between the stability and efficiency of clustering communication. Besides, a case study demonstrates that the proposed incentive and penalty mechanism can play an important role in discovering and removing malicious nodes.
Educational System Efficiency Improvement Using Knowledge Discovery in Databases
ERIC Educational Resources Information Center
Lukaš, Mirko; Leškovic, Darko
2007-01-01
This study describes one of possible way of usage ICT in education system. We basically treated educational system like Business Company and develop appropriate model for clustering of student population. Modern educational systems are forced to extract the most necessary and purposeful information from a large amount of available data. Clustering…
Brownstein, John S; Green, Traci C; Cassidy, Theresa A; Butler, Stephen F
2010-06-01
Understanding the spatial distribution of opioid abuse at the local level may facilitate public health interventions. Using patient-level data from addiction treatment facilities in New Mexico from ASI-MV Connect, we applied geographic information system (GIS) in combination with a spatial scan statistic to generate risk maps of prescription opioid abuse and identify clusters of product- and compound-specific abuse. Prescribed opioid volume data was used to determine whether identified clusters are beyond geographic differences in availability. Data on 24 452 patients residing in New Mexico were collected. Among those patients, 1779 (7.3%) reported abusing any prescription opioid (past 30 days). According to opioid type, 979 patients (4.0%) reported abuse of any hydrocodone, 1007 (4.1%) for any oxycodone, 108 (0.4%) for morphine, 507 (2.1%) for Vicodin or generic equivalent, 390 (1.6%) for OxyContin, and 63 (0.2%) for MS Contin or generic equivalent. Highest rates of abuse were found in the area surrounding Albuquerque with 8.6 patients indicating abuse per 100 interviewed patients. We found clustering of abuse around Albuquerque (P = 0.001; Relative Risk = 1.35, and a radius of 146 km). At the compound level, we found that drug availability was partly responsible for clustering of prescription opioid abuse. After accounting for drug availability, we identified a second foci of Vicodin abuse in the southern rural portion of the state near Las Cruces, NM and El Paso, Texas and bordering Mexico (RR = 2.1; P = 0.001). A better understanding of local risk distribution may have implications for response strategies to future introductions of prescription opioids.
Brownstein, John S.; Green, Traci C.; Cassidy, Theresa A.; Butler, Stephen F.
2010-01-01
Purpose Understanding the spatial distribution of opioid abuse at the local level may facilitate public health interventions. Methods Using patient-level data from addiction treatment facilities in New Mexico from ASI-MV® Connect, we applied geographic information system in combination with a spatial scan statistics to generate risk maps of prescription opioid abuse and identify clusters of product- and compound-specific abuse. Prescribed opioid volume data was used to determine whether identified clusters are beyond geographic differences in availability. Results Data on 24,452 patients residing in New Mexico was collected. Among those patients, 1779 (7.3%) reported abusing any prescription opioid (past 30 days). According to opioid type, 979 patients (4.0%) reported abuse of any hydrocodone, 1007 (4.1%) for any oxycodone, 108 (0.4%) for morphine, 507 (2.1%) for Vicodin® or generic equivalent, 390 (1.6%) for OxyContin®, and 63 (0.2%) for MS Contin® or generic equivalent. Highest rates of abuse were found in the area surrounding Albuquerque with 8.6 patients indicating abuse per 100 interviewed patients. We found clustering of abuse around Albuquerque (P=0.001; Relative Risk=1.35 and a radius of 146 km). At the compound level, we found that drug availability was partly responsible for clustering of prescription opioid abuse. After accounting for drug availability, we identified a second foci of Vicodin® abuse in the southern rural portion of the state near Las Cruces, NM and El Paso, Texas and bordering Mexico (RR=2.1; P=0.001). Conclusions A better understanding of local risk distribution may have implications for response strategies to future introductions of prescription opioids. PMID:20535759
NASA Technical Reports Server (NTRS)
Lightman, A. P.; Grindlay, J. E.
1982-01-01
Globular clusters are thought to be among the oldest objects in the Galaxy, and provide, in this connection, important clues for determining the age and process of formation of the Galaxy. The present investigation is concerned with puzzles relating to the X-ray emission of globular clusters, taking into account questions regarding the location of X-ray emitting clusters (XEGC) unusually near the galactic plane and/or galactic center. An adopted model is discussed for the nature, formation, and lifetime of X-ray sources in globular clusters. An analysis of the available data is conducted in connection with a search for correlations between binary formation time scales, central relaxation times, galactic locations, and X-ray emission. The positive correlation found between distance from galactic center and two-body binary formation time for globular clusters, explanations for this correlation, and the hypothesis that X-ray sources in globular clusters require binary star systems provide a possible explanation of the considered puzzles.
Silva, Gabriela Drummond Marques da; Bartholomay, Patrícia; Cruz, Oswaldo Gonçalves; Garcia, Leila Posenato
2017-10-01
This study aimed to evaluate quality, acceptability and timeliness of the data in the tuberculosis surveillance system in Brazilian micro-regions. An ecological cross-sectional study was carried out, after a qualitative stage for selecting indicators. All 558 Brazilian micro-regions were used as units of analysis. Data available in the National Notifiable Diseases Information System (SINAN), from 2012 to 2014, were used to calculate 14 indicators relating to four attributes: completeness, consistency, timeliness and acceptability. The study made use of cluster analysis to group micro-regions according to acceptability and timeliness. Three clusters were identified among the 473 micro-regions with optimal or regular completeness (70% to 100%) and with over five notifications. Cluster 1 (n = 109) presented mean timeliness of notification and treatment equal to 62.8% and 24.9%, respectively. Cluster 2 (n = 143) had a mean percentage of cases tested for HIV equal to 55.9%. Cluster 3 (n = 221) had the best performing tuberculosis indicators. Results suggest priority areas for improving surveillance of tuberculosis, predominantly in the central-north part of the country. They also point to the need to increase the timeliness of treatment and the percentage of cases tested for HIV.
Wright, Mark H.; Tung, Chih-Wei; Zhao, Keyan; Reynolds, Andy; McCouch, Susan R.; Bustamante, Carlos D.
2010-01-01
Motivation: The development of new high-throughput genotyping products requires a significant investment in testing and training samples to evaluate and optimize the product before it can be used reliably on new samples. One reason for this is current methods for automated calling of genotypes are based on clustering approaches which require a large number of samples to be analyzed simultaneously, or an extensive training dataset to seed clusters. In systems where inbred samples are of primary interest, current clustering approaches perform poorly due to the inability to clearly identify a heterozygote cluster. Results: As part of the development of two custom single nucleotide polymorphism genotyping products for Oryza sativa (domestic rice), we have developed a new genotype calling algorithm called ‘ALCHEMY’ based on statistical modeling of the raw intensity data rather than modelless clustering. A novel feature of the model is the ability to estimate and incorporate inbreeding information on a per sample basis allowing accurate genotyping of both inbred and heterozygous samples even when analyzed simultaneously. Since clustering is not used explicitly, ALCHEMY performs well on small sample sizes with accuracy exceeding 99% with as few as 18 samples. Availability: ALCHEMY is available for both commercial and academic use free of charge and distributed under the GNU General Public License at http://alchemy.sourceforge.net/ Contact: mhw6@cornell.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20926420
Guthke, Reinhard; Möller, Ulrich; Hoffmann, Martin; Thies, Frank; Töpfer, Susanne
2005-04-15
The immune response to bacterial infection represents a complex network of dynamic gene and protein interactions. We present an optimized reverse engineering strategy aimed at a reconstruction of this kind of interaction networks. The proposed approach is based on both microarray data and available biological knowledge. The main kinetics of the immune response were identified by fuzzy clustering of gene expression profiles (time series). The number of clusters was optimized using various evaluation criteria. For each cluster a representative gene with a high fuzzy-membership was chosen in accordance with available physiological knowledge. Then hypothetical network structures were identified by seeking systems of ordinary differential equations, whose simulated kinetics could fit the gene expression profiles of the cluster-representative genes. For the construction of hypothetical network structures singular value decomposition (SVD) based methods and a newly introduced heuristic Network Generation Method here were compared. It turned out that the proposed novel method could find sparser networks and gave better fits to the experimental data. Reinhard.Guthke@hki-jena.de.
Strong Lensing Analysis of the Galaxy Cluster MACS J1319.9+7003 and the Discovery of a Shell Galaxy
NASA Astrophysics Data System (ADS)
Zitrin, Adi
2017-01-01
We present a strong-lensing (SL) analysis of the galaxy cluster MACS J1319.9+7003 (z = 0.33, also known as Abell 1722), as part of our ongoing effort to analyze massive clusters with archival Hubble Space Telescope (HST) imaging. We spectroscopically measured with Keck/Multi-Object Spectrometer For Infra-Red Exploration (MOSFIRE) two galaxies multiply imaged by the cluster. Our analysis reveals a modest lens, with an effective Einstein radius of {θ }e(z=2)=12+/- 1\\prime\\prime , enclosing 2.1+/- 0.3× {10}13 M⊙. We briefly discuss the SL properties of the cluster, using two different modeling techniques (see the text for details), and make the mass models publicly available (ftp://wise-ftp.tau.ac.il/pub/adiz/MACS1319/). Independently, we identified a noteworthy, young shell galaxy (SG) system forming around two likely interacting cluster members, 20″ north of the brightest cluster galaxy. SGs are rare in galaxy clusters, and indeed, a simple estimate reveals that they are only expected in roughly one in several dozen, to several hundred, massive galaxy clusters (the estimate can easily change by an order of magnitude within a reasonable range of characteristic values relevant for the calculation). Taking advantage of our lens model best-fit, mass-to-light scaling relation for cluster members, we infer that the total mass of the SG system is ˜ 1.3× {10}11 {M}⊙ , with a host-to-companion mass ratio of about 10:1. Despite being rare in high density environments, the SG constitutes an example to how stars of cluster galaxies are efficiently redistributed to the intra-cluster medium. Dedicated numerical simulations for the observed shell configuration, perhaps aided by the mass model, might cast interesting light on the interaction history and properties of the two galaxies. An archival HST search in galaxy cluster images can reveal more such systems.
Plasma Physics Calculations on a Parallel Macintosh Cluster
NASA Astrophysics Data System (ADS)
Decyk, Viktor; Dauger, Dean; Kokelaar, Pieter
2000-03-01
We have constructed a parallel cluster consisting of 16 Apple Macintosh G3 computers running the MacOS, and achieved very good performance on numerically intensive, parallel plasma particle-in-cell simulations. A subset of the MPI message-passing library was implemented in Fortran77 and C. This library enabled us to port code, without modification, from other parallel processors to the Macintosh cluster. For large problems where message packets are large and relatively few in number, performance of 50-150 MFlops/node is possible, depending on the problem. This is fast enough that 3D calculations can be routinely done. Unlike Unix-based clusters, no special expertise in operating systems is required to build and run the cluster. Full details are available on our web site: http://exodus.physics.ucla.edu/appleseed/.
Plasma Physics Calculations on a Parallel Macintosh Cluster
NASA Astrophysics Data System (ADS)
Decyk, Viktor K.; Dauger, Dean E.; Kokelaar, Pieter R.
We have constructed a parallel cluster consisting of 16 Apple Macintosh G3 computers running the MacOS, and achieved very good performance on numerically intensive, parallel plasma particle-in-cell simulations. A subset of the MPI message-passing library was implemented in Fortran77 and C. This library enabled us to port code, without modification, from other parallel processors to the Macintosh cluster. For large problems where message packets are large and relatively few in number, performance of 50-150 Mflops/node is possible, depending on the problem. This is fast enough that 3D calculations can be routinely done. Unlike Unix-based clusters, no special expertise in operating systems is required to build and run the cluster. Full details are available on our web site: http://exodus.physics.ucla.edu/appleseed/.
Investigating the role of small vent volcanism during the development of Tharsis Province, Mars
NASA Astrophysics Data System (ADS)
Richardson, J. A.; Bleacher, J. E.; Connor, C.; Connor, L.; Glaze, L. S.
2014-12-01
Clusters of tens to hundreds of small volcanic vents have recently been recognized as a major component of Tharsis Province volcanism. These volcanic fields are formed from distributed-style, possibly monogenetic, volcanism and are composed of low sloped edifices with diameters of tens of kilometers and heights of tens to hundreds of meters. We report a new catalog of these small volcanic vents, now available through the USGS Astrogeology Science Center. This catalog was created with the use of gridded topographic data from the Mars Orbiter Laser Altimeter (MOLA) and images from the Thermal Emission Imaging System (THEMIS) and the High Resolution Stereo Camera (HRSC). We are now investigating isolated clusters of distributed volcanism in Tharsis with this dataset. We hypothesize that these clusters are formed from significant magmatic events that played a large role in the development of Tharsis. Currently, the catalog contains 1075 unique volcanic vents in the Tharsis Province. With the catalog, potentially isolated volcano clusters are identified with vent density estimation. Vent intensity for clusters is found to be 1 vent per 1000 sq km or less. Crater retention rates for one such cluster, Syria Planum, indicates that these distributed volcanic systems might continue as long as 700 Ma, or that monogenetic volcanic systems overprint older systems. Using a modified basal outlining algorithm with MOLA gridded data, shield volumes are found to be between 1-20 cubic km. Current results show distributed-style volcanism occuring in Tharsis orders of magnitude more dispersed than analogous volcano clusers on Earth, while individual edifices are found to be an order of magnitude larger than volcanoes in Earth clusters. Proof of concept results are reported for three identified clusters: Arsia Mons Caldera, Syria Planum, and Southern Pavonis Mons.
Dos Santos, Patricia C; Johnson, Deborah C; Ragle, Brook E; Unciuleac, Mihaela-Carmen; Dean, Dennis R
2007-04-01
The nitrogen-fixing organism Azotobacter vinelandii contains at least two systems that catalyze formation of [Fe-S] clusters. One of these systems is encoded by nif genes, whose products supply [Fe-S] clusters required for maturation of nitrogenase. The other system is encoded by isc genes, whose products are required for maturation of [Fe-S] proteins that participate in general metabolic processes. The two systems are similar in that they include an enzyme for the mobilization of sulfur (NifS or IscS) and an assembly scaffold (NifU or IscU) upon which [Fe-S] clusters are formed. Normal cellular levels of the Nif system, which supplies [Fe-S] clusters for the maturation of nitrogenase, cannot also supply [Fe-S] clusters for the maturation of other cellular [Fe-S] proteins. Conversely, when produced at the normal physiological levels, the Isc system cannot supply [Fe-S] clusters for the maturation of nitrogenase. In the present work we found that such target specificity for IscU can be overcome by elevated production of NifU. We also found that NifU, when expressed at normal levels, is able to partially replace the function of IscU if cells are cultured under low-oxygen-availability conditions. In contrast to the situation with IscU, we could not establish conditions in which the function of IscS could be replaced by NifS. We also found that elevated expression of the Isc components, as a result of deletion of the regulatory iscR gene, improved the capacity for nitrogen-fixing growth of strains deficient in either NifU or NifS.
Vote Stuffing Control in IPTV-based Recommender Systems
NASA Astrophysics Data System (ADS)
Bhatt, Rajen
Vote stuffing is a general problem in the functioning of the content rating-based recommender systems. Currently IPTV viewers browse various contents based on the program ratings. In this paper, we propose a fuzzy clustering-based approach to remove the effects of vote stuffing and consider only the genuine ratings for the programs over multiple genres. The approach requires only one authentic rating, which is generally available from recommendation system administrators or program broadcasters. The entire process is automated using fuzzy c-means clustering. Computational experiments performed over one real-world program rating database shows that the proposed approach is very efficient for controlling vote stuffing.
Jarboe, G R; Gates, R H; McDaniel, C D
1990-01-01
Healthcare providers of multiple option plans may be confronted with special market segmentation problems. This study demonstrates how cluster analysis may be used for discovering distinct patterns of preference for multiple option plans. The availability of metric, as opposed to categorical or ordinal, data provides the ability to use sophisticated analysis techniques which may be superior to frequency distributions and cross-tabulations in revealing preference patterns.
Modularization of biochemical networks based on classification of Petri net t-invariants.
Grafahrend-Belau, Eva; Schreiber, Falk; Heiner, Monika; Sackmann, Andrea; Junker, Björn H; Grunwald, Stefanie; Speer, Astrid; Winder, Katja; Koch, Ina
2008-02-08
Structural analysis of biochemical networks is a growing field in bioinformatics and systems biology. The availability of an increasing amount of biological data from molecular biological networks promises a deeper understanding but confronts researchers with the problem of combinatorial explosion. The amount of qualitative network data is growing much faster than the amount of quantitative data, such as enzyme kinetics. In many cases it is even impossible to measure quantitative data because of limitations of experimental methods, or for ethical reasons. Thus, a huge amount of qualitative data, such as interaction data, is available, but it was not sufficiently used for modeling purposes, until now. New approaches have been developed, but the complexity of data often limits the application of many of the methods. Biochemical Petri nets make it possible to explore static and dynamic qualitative system properties. One Petri net approach is model validation based on the computation of the system's invariant properties, focusing on t-invariants. T-invariants correspond to subnetworks, which describe the basic system behavior.With increasing system complexity, the basic behavior can only be expressed by a huge number of t-invariants. According to our validation criteria for biochemical Petri nets, the necessary verification of the biological meaning, by interpreting each subnetwork (t-invariant) manually, is not possible anymore. Thus, an automated, biologically meaningful classification would be helpful in analyzing t-invariants, and supporting the understanding of the basic behavior of the considered biological system. Here, we introduce a new approach to automatically classify t-invariants to cope with network complexity. We apply clustering techniques such as UPGMA, Complete Linkage, Single Linkage, and Neighbor Joining in combination with different distance measures to get biologically meaningful clusters (t-clusters), which can be interpreted as modules. To find the optimal number of t-clusters to consider for interpretation, the cluster validity measure, Silhouette Width, is applied. We considered two different case studies as examples: a small signal transduction pathway (pheromone response pathway in Saccharomyces cerevisiae) and a medium-sized gene regulatory network (gene regulation of Duchenne muscular dystrophy). We automatically classified the t-invariants into functionally distinct t-clusters, which could be interpreted biologically as functional modules in the network. We found differences in the suitability of the various distance measures as well as the clustering methods. In terms of a biologically meaningful classification of t-invariants, the best results are obtained using the Tanimoto distance measure. Considering clustering methods, the obtained results suggest that UPGMA and Complete Linkage are suitable for clustering t-invariants with respect to the biological interpretability. We propose a new approach for the biological classification of Petri net t-invariants based on cluster analysis. Due to the biologically meaningful data reduction and structuring of network processes, large sets of t-invariants can be evaluated, allowing for model validation of qualitative biochemical Petri nets. This approach can also be applied to elementary mode analysis.
Modularization of biochemical networks based on classification of Petri net t-invariants
Grafahrend-Belau, Eva; Schreiber, Falk; Heiner, Monika; Sackmann, Andrea; Junker, Björn H; Grunwald, Stefanie; Speer, Astrid; Winder, Katja; Koch, Ina
2008-01-01
Background Structural analysis of biochemical networks is a growing field in bioinformatics and systems biology. The availability of an increasing amount of biological data from molecular biological networks promises a deeper understanding but confronts researchers with the problem of combinatorial explosion. The amount of qualitative network data is growing much faster than the amount of quantitative data, such as enzyme kinetics. In many cases it is even impossible to measure quantitative data because of limitations of experimental methods, or for ethical reasons. Thus, a huge amount of qualitative data, such as interaction data, is available, but it was not sufficiently used for modeling purposes, until now. New approaches have been developed, but the complexity of data often limits the application of many of the methods. Biochemical Petri nets make it possible to explore static and dynamic qualitative system properties. One Petri net approach is model validation based on the computation of the system's invariant properties, focusing on t-invariants. T-invariants correspond to subnetworks, which describe the basic system behavior. With increasing system complexity, the basic behavior can only be expressed by a huge number of t-invariants. According to our validation criteria for biochemical Petri nets, the necessary verification of the biological meaning, by interpreting each subnetwork (t-invariant) manually, is not possible anymore. Thus, an automated, biologically meaningful classification would be helpful in analyzing t-invariants, and supporting the understanding of the basic behavior of the considered biological system. Methods Here, we introduce a new approach to automatically classify t-invariants to cope with network complexity. We apply clustering techniques such as UPGMA, Complete Linkage, Single Linkage, and Neighbor Joining in combination with different distance measures to get biologically meaningful clusters (t-clusters), which can be interpreted as modules. To find the optimal number of t-clusters to consider for interpretation, the cluster validity measure, Silhouette Width, is applied. Results We considered two different case studies as examples: a small signal transduction pathway (pheromone response pathway in Saccharomyces cerevisiae) and a medium-sized gene regulatory network (gene regulation of Duchenne muscular dystrophy). We automatically classified the t-invariants into functionally distinct t-clusters, which could be interpreted biologically as functional modules in the network. We found differences in the suitability of the various distance measures as well as the clustering methods. In terms of a biologically meaningful classification of t-invariants, the best results are obtained using the Tanimoto distance measure. Considering clustering methods, the obtained results suggest that UPGMA and Complete Linkage are suitable for clustering t-invariants with respect to the biological interpretability. Conclusion We propose a new approach for the biological classification of Petri net t-invariants based on cluster analysis. Due to the biologically meaningful data reduction and structuring of network processes, large sets of t-invariants can be evaluated, allowing for model validation of qualitative biochemical Petri nets. This approach can also be applied to elementary mode analysis. PMID:18257938
Deriving physical parameters of unresolved star clusters. V. M 31 PHAT star clusters
NASA Astrophysics Data System (ADS)
de Meulenaer, P.; Stonkutė, R.; Vansevičius, V.
2017-06-01
Context. This study is the fifth of a series that investigates the degeneracy and stochasticity problems present in the determination of physical parameters such as age, mass, extinction, and metallicity of partially resolved or unresolved star cluster populations in external galaxies when using HST broad-band photometry. Aims: In this work we aim to derive parameters of star clusters using models with fixed and free metallicity based on the HST WFC3+ACS photometric system. The method is applied to derive parameters of a subsample of 1363 star clusters in the Andromeda galaxy observed with the HST. Methods: Following Paper III, we derive the star cluster parameters using a large grid of stochastic models that are compared to the six observed integrated broad-band WFC3+ACS magnitudes of star clusters. Results: We show that the age, mass, and extinction of the M 31 star clusters, derived assuming fixed solar metallicity, are in agreement with previous studies. We also demonstrate the ability of the WFC3+ACS photometric system to derive metallicity of star clusters older than 1 Gyr. We show that the metallicity derived using broad-band photometry of 36 massive M 31 star clusters is in good agreement with the metallicity derived using spectroscopy. Table 1 is only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/602/A112
ERIC Educational Resources Information Center
Esongo, Njie Martin
2017-01-01
The study takes an in-depth examination of the extent to which the availability of resources relates to the efficiency of the school system within the framework of the implementation of competency-based teaching approaches in Cameroon. The study employed a mix of probability sampling approaches, namely simple, cluster and stratified random…
[Construction of screening system for mutation of negative regulatory genes in Streptomyces].
Zhu, Yu; Feng, Chi; Tan, Huarong; Tian, Yuqing
2013-10-04
We aimed to create a novel report system for screening the mutation of the negative regulatory genes, especially for those repressing the expression of cryptic antibiotics clusters. We used marker-free gene disruption strategy, which combines with the "REDIRECT (Rapid Efficient Directed Recombination Time Saving)" technology and in vivo site-specific recombination by Streptomyces phage phiBT1 integrase, to construct a scbR2/inoA double mutant strain of S. coelicolor M145. This strain was used as the host of the report system. For the construction of the reporter plasmid, the ScbR2 repressed promoter of cpkO from CPK (cryptic polyketide) cluster was used to drive the expression of a promoterless conserved gene inoA of S. coelicolor. Then the reporter plasmid was introduced into the host strain described above to test the availability of inoA as a reporter gene in this system. The scbR2/inoA double mutant strain gave rise to a bald pheno type on MM medium in the absence of inositol, and produced yellow pigmented secondary metabolite by the disruption of scbR2 to release the repression of cpkO, a pathway specific activator gene situated in CPK cluster. After introducing the reporter plasmid into this test stain, the resulting strain recovered the phenotype as wild-type strain, indicating that the promoter of cpkO can drive the expression of inoA in scbR2 mutant and consequently restore the biosynthesis of inositol. Our results indicated that inoA can be used as a novel reporter gene for Streptomyces, especially for detecting the activation of the "silent" promoter. This report system might be available for screening the mutation of the negative regulatory genes for the cryptic secondary metabolic gene clusters.
NASA Astrophysics Data System (ADS)
Marcus, Kelvin
2014-06-01
The U.S Army Research Laboratory (ARL) has built a "Network Science Research Lab" to support research that aims to improve their ability to analyze, predict, design, and govern complex systems that interweave the social/cognitive, information, and communication network genres. Researchers at ARL and the Network Science Collaborative Technology Alliance (NS-CTA), a collaborative research alliance funded by ARL, conducted experimentation to determine if automated network monitoring tools and task-aware agents deployed within an emulated tactical wireless network could potentially increase the retrieval of relevant data from heterogeneous distributed information nodes. ARL and NS-CTA required the capability to perform this experimentation over clusters of heterogeneous nodes with emulated wireless tactical networks where each node could contain different operating systems, application sets, and physical hardware attributes. Researchers utilized the Dynamically Allocated Virtual Clustering Management System (DAVC) to address each of the infrastructure support requirements necessary in conducting their experimentation. The DAVC is an experimentation infrastructure that provides the means to dynamically create, deploy, and manage virtual clusters of heterogeneous nodes within a cloud computing environment based upon resource utilization such as CPU load, available RAM and hard disk space. The DAVC uses 802.1Q Virtual LANs (VLANs) to prevent experimentation crosstalk and to allow for complex private networks. Clusters created by the DAVC system can be utilized for software development, experimentation, and integration with existing hardware and software. The goal of this paper is to explore how ARL and the NS-CTA leveraged the DAVC to create, deploy and manage multiple experimentation clusters to support their experimentation goals.
Launching large computing applications on a disk-less cluster
NASA Astrophysics Data System (ADS)
Schwemmer, Rainer; Caicedo Carvajal, Juan Manuel; Neufeld, Niko
2011-12-01
The LHCb Event Filter Farm system is based on a cluster of the order of 1.500 disk-less Linux nodes. Each node runs one instance of the filtering application per core. The amount of cores in our current production environment is 8 per machine for the old cluster and 12 per machine on extension of the cluster. Each instance has to load about 1.000 shared libraries, weighting 200 MB from several directory locations from a central repository. The repository is currently hosted on a SAN and exported via NFS. The libraries are all available in the local file system cache on every node. Loading a library still causes a huge number of requests to the server though, because the loader will try to probe every available path. Measurements show there are between 100.000-200.000 calls per application instance start up. Multiplied by the numbers of cores in the farm, this translates into a veritable DDoS attack on the servers, which lasts several minutes. Since the application is being restarted frequently, a better solution had to be found.scp Rolling out the software to the nodes is out of the question, because they have no disks and the software in it's entirety is too large to put into a ram disk. To solve this problem we developed a FUSE based file systems which acts as a permanent, controllable cache that keeps the essential files that are necessary in stock.
Galaxy CloudMan: delivering cloud compute clusters.
Afgan, Enis; Baker, Dannon; Coraor, Nate; Chapman, Brad; Nekrutenko, Anton; Taylor, James
2010-12-21
Widespread adoption of high-throughput sequencing has greatly increased the scale and sophistication of computational infrastructure needed to perform genomic research. An alternative to building and maintaining local infrastructure is "cloud computing", which, in principle, offers on demand access to flexible computational infrastructure. However, cloud computing resources are not yet suitable for immediate "as is" use by experimental biologists. We present a cloud resource management system that makes it possible for individual researchers to compose and control an arbitrarily sized compute cluster on Amazon's EC2 cloud infrastructure without any informatics requirements. Within this system, an entire suite of biological tools packaged by the NERC Bio-Linux team (http://nebc.nerc.ac.uk/tools/bio-linux) is available for immediate consumption. The provided solution makes it possible, using only a web browser, to create a completely configured compute cluster ready to perform analysis in less than five minutes. Moreover, we provide an automated method for building custom deployments of cloud resources. This approach promotes reproducibility of results and, if desired, allows individuals and labs to add or customize an otherwise available cloud system to better meet their needs. The expected knowledge and associated effort with deploying a compute cluster in the Amazon EC2 cloud is not trivial. The solution presented in this paper eliminates these barriers, making it possible for researchers to deploy exactly the amount of computing power they need, combined with a wealth of existing analysis software, to handle the ongoing data deluge.
Galaxy CloudMan: delivering cloud compute clusters
2010-01-01
Background Widespread adoption of high-throughput sequencing has greatly increased the scale and sophistication of computational infrastructure needed to perform genomic research. An alternative to building and maintaining local infrastructure is “cloud computing”, which, in principle, offers on demand access to flexible computational infrastructure. However, cloud computing resources are not yet suitable for immediate “as is” use by experimental biologists. Results We present a cloud resource management system that makes it possible for individual researchers to compose and control an arbitrarily sized compute cluster on Amazon’s EC2 cloud infrastructure without any informatics requirements. Within this system, an entire suite of biological tools packaged by the NERC Bio-Linux team (http://nebc.nerc.ac.uk/tools/bio-linux) is available for immediate consumption. The provided solution makes it possible, using only a web browser, to create a completely configured compute cluster ready to perform analysis in less than five minutes. Moreover, we provide an automated method for building custom deployments of cloud resources. This approach promotes reproducibility of results and, if desired, allows individuals and labs to add or customize an otherwise available cloud system to better meet their needs. Conclusions The expected knowledge and associated effort with deploying a compute cluster in the Amazon EC2 cloud is not trivial. The solution presented in this paper eliminates these barriers, making it possible for researchers to deploy exactly the amount of computing power they need, combined with a wealth of existing analysis software, to handle the ongoing data deluge. PMID:21210983
NASA Astrophysics Data System (ADS)
Borgelt, Christian
In clustering we often face the situation that only a subset of the available attributes is relevant for forming clusters, even though this may not be known beforehand. In such cases it is desirable to have a clustering algorithm that automatically weights attributes or even selects a proper subset. In this paper I study such an approach for fuzzy clustering, which is based on the idea to transfer an alternative to the fuzzifier (Klawonn and Höppner, What is fuzzy about fuzzy clustering? Understanding and improving the concept of the fuzzifier, In: Proc. 5th Int. Symp. on Intelligent Data Analysis, 254-264, Springer, Berlin, 2003) to attribute weighting fuzzy clustering (Keller and Klawonn, Int J Uncertain Fuzziness Knowl Based Syst 8:735-746, 2000). In addition, by reformulating Gustafson-Kessel fuzzy clustering, a scheme for weighting and selecting principal axes can be obtained. While in Borgelt (Feature weighting and feature selection in fuzzy clustering, In: Proc. 17th IEEE Int. Conf. on Fuzzy Systems, IEEE Press, Piscataway, NJ, 2008) I already presented such an approach for a global selection of attributes and principal axes, this paper extends it to a cluster-specific selection, thus arriving at a fuzzy subspace clustering algorithm (Parsons, Haque, and Liu, 2004).
Evolutionary models of rotating dense stellar systems: challenges in software and hardware
NASA Astrophysics Data System (ADS)
Fiestas, Jose
2016-02-01
We present evolutionary models of rotating self-gravitating systems (e.g. globular clusters, galaxy cores). These models are characterized by the presence of initial axisymmetry due to rotation. Central black hole seeds are alternatively included in our models, and black hole growth due to consumption of stellar matter is simulated until the central potential dominates the kinematics in the core. Goal is to study the long-term evolution (~ Gyr) of relaxed dense stellar systems, which deviate from spherical symmetry, their morphology and final kinematics. With this purpose, we developed a 2D Fokker-Planck analytical code, which results we confirm by detailed N-Body techniques, applying a high performance code, developed for GPU machines. We compare our models to available observations of galactic rotating globular clusters, and conclude that initial rotation modifies significantly the shape and lifetime of these systems, and can not be neglected in studying the evolution of globular clusters, and the galaxy itself.
UWB Two-Cluster AOA Tracking Prototype System Design
NASA Technical Reports Server (NTRS)
Ngo, Phong H.; Arndt, D.; Phan, C.; Gross, J.; Jianjun; Rafford, Melinda
2006-01-01
This presentation discusses a design effort for a prototype ultra-wideband (UWB) tracking system that is currently under development at NASA Johnson Space Center (JSC). The system is being studied for use in tracking of lunar/Mars rovers during early exploration missions when satellite navigation systems are not available. The UWB technology is exploited to implement the tracking system due to its properties such as fine time resolution, low power spectral density and multipath immunity. A two cluster prototype design using commercially available UWB radios is employed to implement the Angle of Arrival (AOA) tracking methodology in this design effort. In order to increase the tracking range, low noise amplifiers (LNA) and high gain horns are used at the receiving sides. Field tests were conducted jointly with the Science and Crew Operation Utility Testbed (SCOUT) vehicle near the Meteor Crater in Arizona to test the tracking capability for a moving target in an operational environment. These tests demonstrate that the UWB tracking system can co-exist with other on-board radio frequency (RF) communication systems (such as Global Positioning System (GPS), video, voice and telemetry systems), and that a tracking resolution less than 1% of the range can be achieved.
Cluster folding analysis of 20Ne+16O elastic transfer
NASA Astrophysics Data System (ADS)
Hamada, Sh.; Keeley, N.; Kemper, K. W.; Rusek, K.
2018-05-01
The available experimental data for the 20Ne+16O system in the energy range where the effect of α -cluster transfer is well observed are reanalyzed using the cluster folding model. The cluster folding potential, which includes both real and imaginary terms, reproduces the data at forward angles and the inclusion of the 16O(20Ne,16O)20Ne elastic transfer process provides a satisfactory description of the backward angles. The spectroscopic factor for the 20Ne→16O+α overlap was extracted and compared with other values from the literature. The present results suggest that the (20Ne,16O ) reaction might be an alternative means of exploring the α -particle structure of nuclei.
CLUSTERnGO: a user-defined modelling platform for two-stage clustering of time-series data.
Fidaner, Işık Barış; Cankorur-Cetinkaya, Ayca; Dikicioglu, Duygu; Kirdar, Betul; Cemgil, Ali Taylan; Oliver, Stephen G
2016-02-01
Simple bioinformatic tools are frequently used to analyse time-series datasets regardless of their ability to deal with transient phenomena, limiting the meaningful information that may be extracted from them. This situation requires the development and exploitation of tailor-made, easy-to-use and flexible tools designed specifically for the analysis of time-series datasets. We present a novel statistical application called CLUSTERnGO, which uses a model-based clustering algorithm that fulfils this need. This algorithm involves two components of operation. Component 1 constructs a Bayesian non-parametric model (Infinite Mixture of Piecewise Linear Sequences) and Component 2, which applies a novel clustering methodology (Two-Stage Clustering). The software can also assign biological meaning to the identified clusters using an appropriate ontology. It applies multiple hypothesis testing to report the significance of these enrichments. The algorithm has a four-phase pipeline. The application can be executed using either command-line tools or a user-friendly Graphical User Interface. The latter has been developed to address the needs of both specialist and non-specialist users. We use three diverse test cases to demonstrate the flexibility of the proposed strategy. In all cases, CLUSTERnGO not only outperformed existing algorithms in assigning unique GO term enrichments to the identified clusters, but also revealed novel insights regarding the biological systems examined, which were not uncovered in the original publications. The C++ and QT source codes, the GUI applications for Windows, OS X and Linux operating systems and user manual are freely available for download under the GNU GPL v3 license at http://www.cmpe.boun.edu.tr/content/CnG. sgo24@cam.ac.uk Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Zhang, Shu; Taft, Cyrus W; Bentsman, Joseph; Hussey, Aaron; Petrus, Bryan
2012-09-01
Tuning a complex multi-loop PID based control system requires considerable experience. In today's power industry the number of available qualified tuners is dwindling and there is a great need for better tuning tools to maintain and improve the performance of complex multivariable processes. Multi-loop PID tuning is the procedure for the online tuning of a cluster of PID controllers operating in a closed loop with a multivariable process. This paper presents the first application of the simultaneous tuning technique to the multi-input-multi-output (MIMO) PID based nonlinear controller in the power plant control context, with the closed-loop system consisting of a MIMO nonlinear boiler/turbine model and a nonlinear cluster of six PID-type controllers. Although simplified, the dynamics and cross-coupling of the process and the PID cluster are similar to those used in a real power plant. The particular technique selected, iterative feedback tuning (IFT), utilizes the linearized version of the PID cluster for signal conditioning, but the data collection and tuning is carried out on the full nonlinear closed-loop system. Based on the figure of merit for the control system performance, the IFT is shown to deliver performance favorably comparable to that attained through the empirical tuning carried out by an experienced control engineer. Copyright © 2012 ISA. Published by Elsevier Ltd. All rights reserved.
Infrared Multiple Photon Dissociation Spectroscopy Of Metal Cluster-Adducts
NASA Astrophysics Data System (ADS)
Cox, D. M.; Kaldor, A.; Zakin, M. R.
1987-01-01
Recent development of the laser vaporization technique combined with mass-selective detection has made possible new studies of the fundamental chemical and physical properties of unsupported transition metal clusters as a function of the number of constituent atoms. A variety of experimental techniques have been developed in our laboratory to measure ionization threshold energies, magnetic moments, and gas phase reactivity of clusters. However, studies have so far been unable to determine the cluster structure or the chemical state of chemisorbed species on gas phase clusters. The application of infrared multiple photon dissociation IRMPD to obtain the IR absorption properties of metal cluster-adsorbate species in a molecular beam is described here. Specifically using a high power, pulsed CO2 laser as the infrared source, the IRMPD spectrum for methanol chemisorbed on small iron clusters is measured as a function of the number of both iron atoms and methanols in the complex for different methanol isotopes. Both the feasibility and potential utility of IRMPD for characterizing metal cluster-adsorbate interactions are demonstrated. The method is generally applicable to any cluster or cluster-adsorbate system dependent only upon the availability of appropriate high power infrared sources.
Mainshock-Aftershocks Clustering Detection in Volcanic Regions
NASA Astrophysics Data System (ADS)
Garza Giron, R.; Brodsky, E. E.; Prejean, S. G.
2017-12-01
Crustal earthquakes tend to break their general Poissonean process behavior by gathering into two main kinds of seismic bursts: swarms and mainshock-aftershocks sequences. The former is commonly related to volcanic or geothermal processes whereas the latter is a characteristic feature of tectonically driven seismicity. We explore the mainshock-aftershock clustering behavior of different active volcanic regions in Japan and its comparison to non-volcanic regions. We find that aftershock production in volcanoes shows mainshock-aftershocks clustering similar to what is observed in non-volcanic areas. The ratio of volanic areas that cluster in mainshock-aftershocks sequences vs the areas that do not is comparable to the ratio of non-volcanic regions that show clustering vs the ones that do not. Furthermore, the level of production of aftershocks for most volcanic areas where clustering is present seems to be of the same order of magnitude, or slightly higher, as the median of the non-volcanic regions. An interesting example of highly aftershock-productive volcanoes emerges from the 2000 Miyakejima dike intrusion. A big seismic cluster started to build up rapidly in the south-west flank of Miyakejima to later propagate to the north-west towards the Kozushima and Niijima volcanoes. In Miyakejima the seismicity showed a swarm-like signature with a constant earthquake rate, whereas Kozushima and Niijima both had expressions of highly productive mainshock-aftershocks sequences. These findings are surprising given the alternative mechanisms available in volcanic systems for releasing deviatoric strain. We speculate that aftershock behavior might hold a relationship with the rheological properties of the rocks of each system and with the capacity of a system to accumulate or release the internal pressures caused by magmatic or hydrothermal systems.
A Scalable Monitoring for the CMS Filter Farm Based on Elasticsearch
DOE Office of Scientific and Technical Information (OSTI.GOV)
Andre, J.M.; et al.
2015-12-23
A flexible monitoring system has been designed for the CMS File-based Filter Farm making use of modern data mining and analytics components. All the metadata and monitoring information concerning data flow and execution of the HLT are generated locally in the form of small documents using the JSON encoding. These documents are indexed into a hierarchy of elasticsearch (es) clusters along with process and system log information. Elasticsearch is a search server based on Apache Lucene. It provides a distributed, multitenant-capable search and aggregation engine. Since es is schema-free, any new information can be added seamlessly and the unstructured informationmore » can be queried in non-predetermined ways. The leaf es clusters consist of the very same nodes that form the Filter Farm thus providing natural horizontal scaling. A separate central” es cluster is used to collect and index aggregated information. The fine-grained information, all the way to individual processes, remains available in the leaf clusters. The central es cluster provides quasi-real-time high-level monitoring information to any kind of client. Historical data can be retrieved to analyse past problems or correlate them with external information. We discuss the design and performance of this system in the context of the CMS DAQ commissioning for LHC Run 2.« less
Cooperativity in self-limiting equilibrium self-associating systems
NASA Astrophysics Data System (ADS)
Freed, Karl F.
2012-11-01
A wide variety of highly cooperative self-assembly processes in biological and synthetic systems involve the assembly of a large number (m) of units into clusters, with m narrowly peaked about a large size m0 ≫ 1 and with a second peak centered about the m = 1 unassembled monomers. While very specific models have been proposed for the assembly of, for example, viral capsids and core-shell micelles of ß-casein, no available theory describes a thermodynamically general mechanism for this double peaked, highly cooperative equilibrium assembly process. This study provides a general mechanism for these cooperative processes by developing a minimal Flory-Huggins type theory. Beginning from the simplest non-cooperative, free association model in which the equilibrium constant for addition of a monomer to a cluster is independent of cluster size, the new model merely allows more favorable growth for clusters of intermediate sizes. The theory is illustrated by computing the phase diagram for cases of self-assembly on cooling or heating and for the mass distribution of the two phases.
A Science Portal and Archive for Extragalactic Globular Cluster Systems Data
NASA Astrophysics Data System (ADS)
Young, Michael; Rhode, Katherine L.; Gopu, Arvind
2015-01-01
For several years we have been carrying out a wide-field imaging survey of the globular cluster populations of a sample of giant spiral, S0, and elliptical galaxies with distances of ~10-30 Mpc. We use mosaic CCD cameras on the WIYN 3.5-m and Kitt Peak 4-m telescopes to acquire deep BVR imaging of each galaxy and then analyze the data to derive global properties of the globular cluster system. In addition to measuring the total numbers, specific frequencies, spatial distributions, and color distributions for the globular cluster populations, we have produced deep, high-quality images and lists of tens to thousands of globular cluster candidates for the ~40 galaxies included in the survey.With the survey nearing completion, we have been exploring how to efficiently disseminate not only the overall results, but also all of the relevant data products, to the astronomical community. Here we present our solution: a scientific portal and archive for extragalactic globular cluster systems data. With a modern and intuitive web interface built on the same framework as the WIYN One Degree Imager Portal, Pipeline, and Archive (ODI-PPA), our system will provide public access to the survey results and the final stacked mosaic images of the target galaxies. In addition, the astrometric and photometric data for thousands of identified globular cluster candidates, as well as for all point sources detected in each field, will be indexed and searchable. Where available, spectroscopic follow-up data will be paired with the candidates. Advanced imaging tools will enable users to overlay the cluster candidates and other sources on the mosaic images within the web interface, while metadata charting tools will allow users to rapidly and seamlessly plot the survey results for each galaxy and the data for hundreds of thousands of individual sources. Finally, we will appeal to other researchers with similar data products and work toward making our portal a central repository for data related to well-studied giant galaxy globular cluster systems. This work is supported by NSF Faculty Early Career Development (CAREER) award AST-0847109.
NASA Astrophysics Data System (ADS)
Dias, B.; Barbuy, B.; Saviane, I.; Held, E. V.; Da Costa, G. S.; Ortolani, S.; Gullieuszik, M.; Vásquez, S.
2016-05-01
Context. Globular clusters trace the formation and evolution of the Milky Way and surrounding galaxies, and outline their chemical enrichment history. To accomplish these tasks it is important to have large samples of clusters with homogeneous data and analysis to derive kinematics, chemical abundances, ages and locations. Aims: We obtain homogeneous metallicities and α-element enhancement for 51 Galactic bulge, disc, and halo globular clusters that are among the most distant and/or highly reddened in the Galaxy's globular cluster system. We also provide membership selection based on stellar radial velocities and atmospheric parameters. The implications of our results are discussed. Methods: We observed R ~ 2000 spectra in the wavelength interval 456-586 nm for over 800 red giant stars in 51 Galactic globular clusters. We applied full spectrum fitting with the code ETOILE together with libraries of observed and synthetic spectra. We compared the mean abundances of all clusters with previous work and with field stars. We used the relation between mean metallicity and horizontal branch morphology defined by all clusters to select outliers for discussion. Results: [Fe/H], [Mg/Fe], and [α/Fe] were derived in a consistent way for almost one-third of all Galactic globular clusters. We find our metallicities are comparable to those derived from high-resolution data to within σ = 0.08 dex over the interval -2.5< [Fe/H] < 0.0. Furthermore, a comparison of previous metallicity scales with our values yields σ< 0.16 dex. We also find that the distribution of [Mg/Fe] and [α/Fe] with [Fe/H] for the 51 clusters follows the general trend exhibited by field stars. It is the first time that the following clusters have been included in a large sample of homogeneous stellar spectroscopic observations and metallicity derivation: BH 176, Djorg 2, Pal 10, NGC 6426, Lynga 7, and Terzan 8. In particular, only photometric metallicities were available previously for the first three clusters, and the available metallicity for NGC 6426 was based on integrated spectroscopy and photometry. Two other clusters, HP 1 and NGC 6558, are confirmed as candidates for the oldest globular clusters in the Milky Way. Conclusions: Stellar spectroscopy in the visible at R ~ 2000 for a large sample of globular clusters is a robust and efficient way to trace the chemical evolution of the host galaxy and to detect interesting objects for follow-up at higher resolution and with forthcoming giant telescopes. The technique used here can also be applied to globular cluster systems in nearby galaxies with current instruments and to distant galaxies with the advent of ELTs. Based on observations collected at the European Southern Observatory/Paranal, Chile, under programmes 68.B-0482(A), 69.D-0455(A), 71.D-0219(A), 077.D-0775(A), and 089.D-0493(B).Full Tables 1 and A.2 with the derived average parameters for the 758 red giant stars are only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/590/A9
NASA Astrophysics Data System (ADS)
Christou, Michalis; Christoudias, Theodoros; Morillo, Julián; Alvarez, Damian; Merx, Hendrik
2016-09-01
We examine an alternative approach to heterogeneous cluster-computing in the many-core era for Earth system models, using the European Centre for Medium-Range Weather Forecasts Hamburg (ECHAM)/Modular Earth Submodel System (MESSy) Atmospheric Chemistry (EMAC) model as a pilot application on the Dynamical Exascale Entry Platform (DEEP). A set of autonomous coprocessors interconnected together, called Booster, complements a conventional HPC Cluster and increases its computing performance, offering extra flexibility to expose multiple levels of parallelism and achieve better scalability. The EMAC model atmospheric chemistry code (Module Efficiently Calculating the Chemistry of the Atmosphere (MECCA)) was taskified with an offload mechanism implemented using OmpSs directives. The model was ported to the MareNostrum 3 supercomputer to allow testing with Intel Xeon Phi accelerators on a production-size machine. The changes proposed in this paper are expected to contribute to the eventual adoption of Cluster-Booster division and Many Integrated Core (MIC) accelerated architectures in presently available implementations of Earth system models, towards exploiting the potential of a fully Exascale-capable platform.
Fuzzy Kernel k-Medoids algorithm for anomaly detection problems
NASA Astrophysics Data System (ADS)
Rustam, Z.; Talita, A. S.
2017-07-01
Intrusion Detection System (IDS) is an essential part of security systems to strengthen the security of information systems. IDS can be used to detect the abuse by intruders who try to get into the network system in order to access and utilize the available data sources in the system. There are two approaches of IDS, Misuse Detection and Anomaly Detection (behavior-based intrusion detection). Fuzzy clustering-based methods have been widely used to solve Anomaly Detection problems. Other than using fuzzy membership concept to determine the object to a cluster, other approaches as in combining fuzzy and possibilistic membership or feature-weighted based methods are also used. We propose Fuzzy Kernel k-Medoids that combining fuzzy and possibilistic membership as a powerful method to solve anomaly detection problem since on numerical experiment it is able to classify IDS benchmark data into five different classes simultaneously. We classify IDS benchmark data KDDCup'99 data set into five different classes simultaneously with the best performance was achieved by using 30 % of training data with clustering accuracy reached 90.28 percent.
NASA Astrophysics Data System (ADS)
Valasek, Lukas; Glasa, Jan
2017-12-01
Current fire simulation systems are capable to utilize advantages of high-performance computer (HPC) platforms available and to model fires efficiently in parallel. In this paper, efficiency of a corridor fire simulation on a HPC computer cluster is discussed. The parallel MPI version of Fire Dynamics Simulator is used for testing efficiency of selected strategies of allocation of computational resources of the cluster using a greater number of computational cores. Simulation results indicate that if the number of cores used is not equal to a multiple of the total number of cluster node cores there are allocation strategies which provide more efficient calculations.
NASA Astrophysics Data System (ADS)
Lépinoux, J.; Sigli, C.
2018-01-01
In a recent paper, the authors showed how the clusters free energies are constrained by the coagulation probability, and explained various anomalies observed during the precipitation kinetics in concentrated alloys. This coagulation probability appeared to be a too complex function to be accurately predicted knowing only the cluster distribution in Cluster Dynamics (CD). Using atomistic Monte Carlo (MC) simulations, it is shown that during a transformation at constant temperature, after a short transient regime, the transformation occurs at quasi-equilibrium. It is proposed to use MC simulations until the system quasi-equilibrates then to switch to CD which is mean field but not limited by a box size like MC. In this paper, we explain how to take into account the information available before the quasi-equilibrium state to establish guidelines to safely predict the cluster free energies.
The FOSS GIS Workbench on the GFZ Load Sharing Facility compute cluster
NASA Astrophysics Data System (ADS)
Löwe, P.; Klump, J.; Thaler, J.
2012-04-01
Compute clusters can be used as GIS workbenches, their wealth of resources allow us to take on geocomputation tasks which exceed the limitations of smaller systems. To harness these capabilities requires a Geographic Information System (GIS), able to utilize the available cluster configuration/architecture and a sufficient degree of user friendliness to allow for wide application. In this paper we report on the first successful porting of GRASS GIS, the oldest and largest Free Open Source (FOSS) GIS project, onto a compute cluster using Platform Computing's Load Sharing Facility (LSF). In 2008, GRASS6.3 was installed on the GFZ compute cluster, which at that time comprised 32 nodes. The interaction with the GIS was limited to the command line interface, which required further development to encapsulate the GRASS GIS business layer to facilitate its use by users not familiar with GRASS GIS. During the summer of 2011, multiple versions of GRASS GIS (v 6.4, 6.5 and 7.0) were installed on the upgraded GFZ compute cluster, now consisting of 234 nodes with 480 CPUs providing 3084 cores. The GFZ compute cluster currently offers 19 different processing queues with varying hardware capabilities and priorities, allowing for fine-grained scheduling and load balancing. After successful testing of core GIS functionalities, including the graphical user interface, mechanisms were developed to deploy scripted geocomputation tasks onto dedicated processing queues. The mechanisms are based on earlier work by NETELER et al. (2008). A first application of the new GIS functionality was the generation of maps of simulated tsunamis in the Mediterranean Sea for the Tsunami Atlas of the FP-7 TRIDEC Project (www.tridec-online.eu). For this, up to 500 processing nodes were used in parallel. Further trials included the processing of geometrically complex problems, requiring significant amounts of processing time. The GIS cluster successfully completed all these tasks, with processing times lasting up to full 20 CPU days. The deployment of GRASS GIS on a compute cluster allows our users to tackle GIS tasks previously out of reach of single workstations. In addition, this GRASS GIS cluster implementation will be made available to other users at GFZ in the course of 2012. It will thus become a research utility in the sense of "Software as a Service" (SaaS) and can be seen as our first step towards building a GFZ corporate cloud service.
DIMM-SC: a Dirichlet mixture model for clustering droplet-based single cell transcriptomic data.
Sun, Zhe; Wang, Ting; Deng, Ke; Wang, Xiao-Feng; Lafyatis, Robert; Ding, Ying; Hu, Ming; Chen, Wei
2018-01-01
Single cell transcriptome sequencing (scRNA-Seq) has become a revolutionary tool to study cellular and molecular processes at single cell resolution. Among existing technologies, the recently developed droplet-based platform enables efficient parallel processing of thousands of single cells with direct counting of transcript copies using Unique Molecular Identifier (UMI). Despite the technology advances, statistical methods and computational tools are still lacking for analyzing droplet-based scRNA-Seq data. Particularly, model-based approaches for clustering large-scale single cell transcriptomic data are still under-explored. We developed DIMM-SC, a Dirichlet Mixture Model for clustering droplet-based Single Cell transcriptomic data. This approach explicitly models UMI count data from scRNA-Seq experiments and characterizes variations across different cell clusters via a Dirichlet mixture prior. We performed comprehensive simulations to evaluate DIMM-SC and compared it with existing clustering methods such as K-means, CellTree and Seurat. In addition, we analyzed public scRNA-Seq datasets with known cluster labels and in-house scRNA-Seq datasets from a study of systemic sclerosis with prior biological knowledge to benchmark and validate DIMM-SC. Both simulation studies and real data applications demonstrated that overall, DIMM-SC achieves substantially improved clustering accuracy and much lower clustering variability compared to other existing clustering methods. More importantly, as a model-based approach, DIMM-SC is able to quantify the clustering uncertainty for each single cell, facilitating rigorous statistical inference and biological interpretations, which are typically unavailable from existing clustering methods. DIMM-SC has been implemented in a user-friendly R package with a detailed tutorial available on www.pitt.edu/∼wec47/singlecell.html. wei.chen@chp.edu or hum@ccf.org. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
A uniform approach for programming distributed heterogeneous computing systems
Grasso, Ivan; Pellegrini, Simone; Cosenza, Biagio; Fahringer, Thomas
2014-01-01
Large-scale compute clusters of heterogeneous nodes equipped with multi-core CPUs and GPUs are getting increasingly popular in the scientific community. However, such systems require a combination of different programming paradigms making application development very challenging. In this article we introduce libWater, a library-based extension of the OpenCL programming model that simplifies the development of heterogeneous distributed applications. libWater consists of a simple interface, which is a transparent abstraction of the underlying distributed architecture, offering advanced features such as inter-context and inter-node device synchronization. It provides a runtime system which tracks dependency information enforced by event synchronization to dynamically build a DAG of commands, on which we automatically apply two optimizations: collective communication pattern detection and device-host-device copy removal. We assess libWater’s performance in three compute clusters available from the Vienna Scientific Cluster, the Barcelona Supercomputing Center and the University of Innsbruck, demonstrating improved performance and scaling with different test applications and configurations. PMID:25844015
A uniform approach for programming distributed heterogeneous computing systems.
Grasso, Ivan; Pellegrini, Simone; Cosenza, Biagio; Fahringer, Thomas
2014-12-01
Large-scale compute clusters of heterogeneous nodes equipped with multi-core CPUs and GPUs are getting increasingly popular in the scientific community. However, such systems require a combination of different programming paradigms making application development very challenging. In this article we introduce libWater, a library-based extension of the OpenCL programming model that simplifies the development of heterogeneous distributed applications. libWater consists of a simple interface, which is a transparent abstraction of the underlying distributed architecture, offering advanced features such as inter-context and inter-node device synchronization. It provides a runtime system which tracks dependency information enforced by event synchronization to dynamically build a DAG of commands, on which we automatically apply two optimizations: collective communication pattern detection and device-host-device copy removal. We assess libWater's performance in three compute clusters available from the Vienna Scientific Cluster, the Barcelona Supercomputing Center and the University of Innsbruck, demonstrating improved performance and scaling with different test applications and configurations.
NASA Astrophysics Data System (ADS)
Massey, Richard; Kitching, Thomas; Nagai, Daisuke
2011-05-01
The unique properties of dark matter are revealed during collisions between clusters of galaxies, such as the bullet cluster (1E 0657-56) and baby bullet (MACS J0025-12). These systems provide evidence for an additional, invisible mass in the separation between the distributions of their total mass, measured via gravitational lensing, and their ordinary 'baryonic' matter, measured via its X-ray emission. Unfortunately, the information available from these systems is limited by their rarity. Constraints on the properties of dark matter, such as its interaction cross-section, are therefore restricted by uncertainties in the individual systems' impact velocity, impact parameter and orientation with respect to the line of sight. Here we develop a complementary, statistical measurement in which every piece of substructure falling into every massive cluster is treated as a bullet. We define 'bulleticity' as the mean separation between dark matter and ordinary matter, and we measure the signal in hydrodynamical simulations. The phase space of substructure orbits also exhibits symmetries that provide an equivalent control test. Any detection of bulleticity in real data would indicate a difference in the interaction cross-sections of baryonic and dark matter that may rule out hypotheses of non-particulate dark matter that are otherwise able to model individual systems. A subsequent measurement of bulleticity could constrain the dark matter cross-section. Even with conservative estimates, the existing Hubble Space Telescope archive should yield an independent constraint tighter than that from the bullet cluster. This technique is then trivially extendable to and benefits enormously from larger, future surveys.
A Human Activity Recognition System Based on Dynamic Clustering of Skeleton Data.
Manzi, Alessandro; Dario, Paolo; Cavallo, Filippo
2017-05-11
Human activity recognition is an important area in computer vision, with its wide range of applications including ambient assisted living. In this paper, an activity recognition system based on skeleton data extracted from a depth camera is presented. The system makes use of machine learning techniques to classify the actions that are described with a set of a few basic postures. The training phase creates several models related to the number of clustered postures by means of a multiclass Support Vector Machine (SVM), trained with Sequential Minimal Optimization (SMO). The classification phase adopts the X-means algorithm to find the optimal number of clusters dynamically. The contribution of the paper is twofold. The first aim is to perform activity recognition employing features based on a small number of informative postures, extracted independently from each activity instance; secondly, it aims to assess the minimum number of frames needed for an adequate classification. The system is evaluated on two publicly available datasets, the Cornell Activity Dataset (CAD-60) and the Telecommunication Systems Team (TST) Fall detection dataset. The number of clusters needed to model each instance ranges from two to four elements. The proposed approach reaches excellent performances using only about 4 s of input data (~100 frames) and outperforms the state of the art when it uses approximately 500 frames on the CAD-60 dataset. The results are promising for the test in real context.
NASA Astrophysics Data System (ADS)
Erberich, Stephan G.; Hoppe, Martin; Jansen, Christian; Schmidt, Thomas; Thron, Armin; Oberschelp, Walter
2001-08-01
In the last few years more and more University Hospitals as well as private hospitals changed to digital information systems for patient record, diagnostic files and digital images. Not only that patient management becomes easier, it is also very remarkable how clinical research can profit from Picture Archiving and Communication Systems (PACS) and diagnostic databases, especially from image databases. Since images are available on the finger tip, difficulties arise when image data needs to be processed, e.g. segmented, classified or co-registered, which usually demands a lot computational power. Today's clinical environment does support PACS very well, but real image processing is still under-developed. The purpose of this paper is to introduce a parallel cluster of standard distributed systems and its software components and how such a system can be integrated into a hospital environment. To demonstrate the cluster technique we present our clinical experience with the crucial but cost-intensive motion correction of clinical routine and research functional MRI (fMRI) data, as it is processed in our Lab on a daily basis.
Method for Continuous Monitoring of Electrospray Ion Formation
NASA Astrophysics Data System (ADS)
Metzler, Guille; Crathern, Susan; Bachmann, Lorin; Fernández-Metzler, Carmen; King, Richard
2017-10-01
A method for continuously monitoring the performance of electrospray ionization without the addition of hardware or chemistry to the system is demonstrated. In the method, which we refer to as SprayDx, cluster ions with solvent vapor natively formed by electrospray are followed throughout the collection of liquid chromatography-selected reaction monitoring data. The cluster ion extracted ion chromatograms report on the consistency of the ion formation and detection system. The data collected by the SprayDx method resemble the data collected for postcolumn infusion of analyte. The response of the cluster ions monitored reports on changes in the physical parameters of the ion source such as voltage and gas flow. SprayDx is also observed to report on ion suppression in a fashion very similar to a postcolumn infusion of analyte. We anticipate the method finding utility as a continuous readout on the performance of electrospray and other atmospheric pressure ionization processes. [Figure not available: see fulltext.
Dynamic and Contextual Information in HMM Modeling for Handwritten Word Recognition.
Bianne-Bernard, Anne-Laure; Menasri, Farès; Al-Hajj Mohamad, Rami; Mokbel, Chafic; Kermorvant, Christopher; Likforman-Sulem, Laurence
2011-10-01
This study aims at building an efficient word recognition system resulting from the combination of three handwriting recognizers. The main component of this combined system is an HMM-based recognizer which considers dynamic and contextual information for a better modeling of writing units. For modeling the contextual units, a state-tying process based on decision tree clustering is introduced. Decision trees are built according to a set of expert-based questions on how characters are written. Questions are divided into global questions, yielding larger clusters, and precise questions, yielding smaller ones. Such clustering enables us to reduce the total number of models and Gaussians densities by 10. We then apply this modeling to the recognition of handwritten words. Experiments are conducted on three publicly available databases based on Latin or Arabic languages: Rimes, IAM, and OpenHart. The results obtained show that contextual information embedded with dynamic modeling significantly improves recognition.
O'Hagan, Steve; Kell, Douglas B
2018-01-01
Armed with the digital availability of two natural products libraries, amounting to some 195 885 molecular entities, we ask the question of how we can best sample from them to maximize their "representativeness" in smaller and more usable libraries of 96, 384, 1152, and 1920 molecules. The term "representativeness" is intended to include diversity, but for numerical reasons (and the likelihood of being able to perform a QSAR) it is necessary to focus on areas of chemical space that are more highly populated. Encoding chemical structures as fingerprints using the RDKit "patterned" algorithm, we first assess the granularity of the natural products space using a simple clustering algorithm, showing that there are major regions of "denseness" but also a great many very sparsely populated areas. We then apply a "hybrid" hierarchical K-means clustering algorithm to the data to produce more statistically robust clusters from which representative and appropriate numbers of samples may be chosen. There is necessarily again a trade-off between cluster size and cluster number, but within these constraints, libraries containing 384 or 1152 molecules can be found that come from clusters that represent some 18 and 30% of the whole chemical space, with cluster sizes of, respectively, 50 and 27 or above, just about sufficient to perform a QSAR. By using the online availability of molecules via the Molport system (www.molport.com), we are also able to construct (and, for the first time, provide the contents of) a small virtual library of available molecules that provided effective coverage of the chemical space described. Consistent with this, the average molecular similarities of the contents of the libraries developed is considerably smaller than is that of the original libraries. The suggested libraries may have use in molecular or phenotypic screening, including for determining possible transporter substrates. © 2017 The Authors. Biotechnology Journal Published by Wiley-VCH Verlag GmbH & Co. KGaA.
UWB Tracking System Design for Lunar/Mars Exploration
NASA Technical Reports Server (NTRS)
Ni, Jianjun; Arndt, Dickey; Ngo, Phong; Phan, Chau; Gross, Julia
2006-01-01
This paper describes a design effort for a prototype ultra-wideband (UWB) tracking system that is currently under development at NASA Johnson Space Center (JSC). The system is being studied for use in tracking of lunar/Mars rovers during early exploration missions when satellite navigation systems are not available. The UWB technology is exploited to implement the tracking system due to its properties such as high data rate, fine time resolution, low power spectral density, and multipath immunity. A two-cluster prototype design using commercially available UWB products is proposed to implement the Angle Of Arrival (AOA) tracking methodology in this research effort. An AOA technique using the Time Difference Of Arrival (TDOA) information is utilized for location estimation in the prototype system, not only to exploit the precise time resolution possible with UWB signals, but also to eliminate the need for synchronization between the transmitter and the receiver. After the UWB radio at each cluster is used to obtain the TDOA estimates from the UWB signal sent from the target, the TDOA data is converted to AOA data to find the angle of arrival, assuming this is a far field application. Since the distance between two clusters is known, the target position is computed by a simple triangulation. Simulations show that the average tracking error at a range of 610 meters is 2.7595 meters, less than 0.5% of the tracking range. Outdoor tests to track the SCOUT vehicle (The Science Crew Operations and Utility Testbed) near the Meteor Crater, Flagstaff, Arizona were performed on September 12-13, 2005. The tracking performance was obtained with less than 1% tracking error at ranges up to 2000 feet. No RF interference with on-board GPS, video, voice and telemetry systems was detected. Outdoor tests demonstrated the UWB tracking capability.
Truncation of the Binary Distribution Function in Globular Cluster Formation
NASA Astrophysics Data System (ADS)
Vesperini, E.; Chernoff, David F.
1996-02-01
We investigate a population of primordial binaries during the initial stage of evolution of a star cluster. For our calculations we assume that equal-mass stars form rapidly in a tidally truncated gas cloud, that ˜10% of the stars are in binaries, and that the resulting star cluster undergoes an epoch of violent relaxation. We study the collisional interaction of the binaries and single stars, in particular, the ionization of the binaries and the energy exchange between binaries and single stars. We find that for large N systems (N > 1000), even the most violent beginning leaves the binary distribution function largely intact. Hence, the binding energy originally tied up in the cloud's protostellar pairs is preserved during the relaxation process, and the binaries are available to interact at later times within the virialized cluster.
Mena, Carlos; Sepúlveda, Cesar; Fuentes, Eduardo; Ormazábal, Yony; Palomo, Iván
2018-05-07
Cardiovascular diseases (CVDs) are the primary cause of death and disability in de world, and the detection of populations at risk as well as localization of vulnerable areas is essential for adequate epidemiological management. Techniques developed for spatial analysis, among them geographical information systems and spatial statistics, such as cluster detection and spatial correlation, are useful for the study of the distribution of the CVDs. These techniques, enabling recognition of events at different geographical levels of study (e.g., rural, deprived neighbourhoods, etc.), make it possible to relate CVDs to factors present in the immediate environment. The systemic literature presented here shows that this group of diseases is clustered with regard to incidence, mortality and hospitalization as well as obesity, smoking, increased glycated haemoglobin levels, hypertension physical activity and age. In addition, acquired variables such as income, residency (rural or urban) and education, contribute to CVD clustering. Both local cluster detection and spatial regression techniques give statistical weight to the findings providing valuable information that can influence response mechanisms in the health services by indicating locations in need of intervention and assignment of available resources.
Description and typology of intensive Chios dairy sheep farms in Greece.
Gelasakis, A I; Valergakis, G E; Arsenos, G; Banos, G
2012-06-01
The aim was to assess the intensified dairy sheep farming systems of the Chios breed in Greece, establishing a typology that may properly describe and characterize them. The study included the total of the 66 farms of the Chios sheep breeders' cooperative Macedonia. Data were collected using a structured direct questionnaire for in-depth interviews, including questions properly selected to obtain a general description of farm characteristics and overall management practices. A multivariate statistical analysis was used on the data to obtain the most appropriate typology. Initially, principal component analysis was used to produce uncorrelated variables (principal components), which would be used for the consecutive cluster analysis. The number of clusters was decided using hierarchical cluster analysis, whereas, the farms were allocated in 4 clusters using k-means cluster analysis. The identified clusters were described and afterward compared using one-way ANOVA or a chi-squared test. The main differences were evident on land availability and use, facility and equipment availability and type, expansion rates, and application of preventive flock health programs. In general, cluster 1 included newly established, intensive, well-equipped, specialized farms and cluster 2 included well-established farms with balanced sheep and feed/crop production. In cluster 3 were assigned small flock farms focusing more on arable crops than on sheep farming with a tendency to evolve toward cluster 2, whereas cluster 4 included farms representing a rather conservative form of Chios sheep breeding with low/intermediate inputs and choosing not to focus on feed/crop production. In the studied set of farms, 4 different farmer attitudes were evident: 1) farming disrupts sheep breeding; feed should be purchased and economies of scale will decrease costs (mainly cluster 1), 2) only exercise/pasture land is necessary; at least part of the feed (pasture) must be home-grown to decrease costs (clusters 1 and 4), 3) providing pasture to sheep is essential; on-farm feed production decreases costs (mainly cluster 3), and 4) large-scale farming (feed production and cash crops) does not disrupt sheep breeding; all feed must be produced on-farm to decrease costs (mainly cluster 3). Conducting a profitability analysis among different clusters, exploring and discovering the most beneficial levels of intensified management and capital investment should now be considered. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
2n-emission from 205Pb* nucleus using clusterization approach at Ebeam˜14-20 MeV
NASA Astrophysics Data System (ADS)
Kaur, Amandeep; Sandhu, Kiran; Sharma, Manoj Kumar
2016-05-01
The dynamics involved in n-induced reaction with 204Pb target is analyzed and the decay of the composite system 205Pb* is governed within the collective clusterization approach of the Dynamical Cluster-decay Model (DCM). The experimental data for 2n-evaporation channel is available for neutron energy range of 14-20 MeV and is addressed by optimizing the only parameter of the model, the neck-length parameter (ΔR). The calculations are done by taking the quadrupole (β2) deformations of the decaying fragments and the calculated 2n-emission cross-sections find nice agreement with available data. An effort is made to study the role of level density parameter in the decay of hot-rotating nucleus, and the mass dependence in level density parameter is exercised for the first time in DCM based calculations. It is to be noted that the effect of deformation, temperature and angular momentum etc. is studied to extract better description of the dynamics involved.
Aerosol Plume Detection Algorithm Based on Image Segmentation of Scanning Atmospheric Lidar Data
Weekley, R. Andrew; Goodrich, R. Kent; Cornman, Larry B.
2016-04-06
An image-processing algorithm has been developed to identify aerosol plumes in scanning lidar backscatter data. The images in this case consist of lidar data in a polar coordinate system. Each full lidar scan is taken as a fixed image in time, and sequences of such scans are considered functions of time. The data are analyzed in both the original backscatter polar coordinate system and a lagged coordinate system. The lagged coordinate system is a scatterplot of two datasets, such as subregions taken from the same lidar scan (spatial delay), or two sequential scans in time (time delay). The lagged coordinatemore » system processing allows for finding and classifying clusters of data. The classification step is important in determining which clusters are valid aerosol plumes and which are from artifacts such as noise, hard targets, or background fields. These cluster classification techniques have skill since both local and global properties are used. Furthermore, more information is available since both the original data and the lag data are used. Performance statistics are presented for a limited set of data processed by the algorithm, where results from the algorithm were compared to subjective truth data identified by a human.« less
Cluster optical coding: from biochips to counterfeit security
NASA Astrophysics Data System (ADS)
Haglmueller, Jakob; Alguel, Yilmaz; Mayer, Christian; Matyushin, Viacheslav; Bauer, Georg; Pittner, Fritz; Leitner, Alfred; Aussenegg, Franz R.; Schalkhammer, Thomas G.
2004-07-01
Spatially tuned resonant nano-clusters allow high local field enhancement when exited by electromagnetic radiation. A number of phenomena had been described and subsequently applied to novel nano- and bionano-devices. Decisive for these types of devices and sensors is the precise nanometric assembly, coupling the local field surrounding a cluster to allow resonance with other elements interacting with this field. In particular, the distance cluster-mirror or cluster-fluorophore gives rise to a variety of enhancement phenomena. High throughput transducers using metal cluster resonance technology are based on surface-enhancement of metal cluster light absorption (SEA). The optical property for the analytical application of metal cluster films is the so-called anomalous absorption. At a well defined nanometric distance of a cluster to a mirror the reflected electromagnetic field has the same phase at the position of the absorbing cluster as the incident fields. This feedback mechanism strongly enhances the effective cluster absorption coefficient. The system is characterised by a narrow reflection minimum. Based on this SEA-phenomenon (licensed to and further developed and optimized by NovemberAG, Germany Erlangen) a number of commercial products have been constructed. Brandsealing(R) uses the patented SEA cluster technology to produce optical codings. Cluster SEA thin film systems show a characteristic color-flip effect and are extremely mechanically and thermally robust. This is the basis for its application as an unique security feature. The specific spectroscopic properties as e.g. narrow band multi-resonance of the cluster layers allow the authentication of the optical code which can be easily achieved with a mobile hand-held reader developed by november AG and Siemens AG. Thus, these features are machine-readable which makes them superior to comparable technologies. Cluster labels are available in two formats: as a label for tamper-proof product packaging, and as a direct label, where label and logo are permanently applied directly and unremovable to the product surface. Together with Infineon Technologies and HUECK FOLIEN, the SEA technology is currently developed as a direct label for e.g. SmartCards.
Data-Driven Packet Loss Estimation for Node Healthy Sensing in Decentralized Cluster.
Fan, Hangyu; Wang, Huandong; Li, Yong
2018-01-23
Decentralized clustering of modern information technology is widely adopted in various fields these years. One of the main reason is the features of high availability and the failure-tolerance which can prevent the entire system form broking down by a failure of a single point. Recently, toolkits such as Akka are used by the public commonly to easily build such kind of cluster. However, clusters of such kind that use Gossip as their membership managing protocol and use link failure detecting mechanism to detect link failures cannot deal with the scenario that a node stochastically drops packets and corrupts the member status of the cluster. In this paper, we formulate the problem to be evaluating the link quality and finding a max clique (NP-Complete) in the connectivity graph. We then proposed an algorithm that consists of two models driven by data from application layer to respectively solving these two problems. Through simulations with statistical data and a real-world product, we demonstrate that our algorithm has a good performance.
NASA Technical Reports Server (NTRS)
Yedavalli, R. K.
1992-01-01
The aspect of controller design for improving the ride quality of aircraft in terms of damping ratio and natural frequency specifications on the short period dynamics is addressed. The controller is designed to be robust with respect to uncertainties in the real parameters of the control design model such as uncertainties in the dimensional stability derivatives, imperfections in actuator/sensor locations and possibly variations in flight conditions, etc. The design is based on a new robust root clustering theory developed by the author by extending the nominal root clustering theory of Gutman and Jury to perturbed matrices. The proposed methodology allows to get an explicit relationship between the parameters of the root clustering region and the uncertainty radius of the parameter space. The current literature available for robust stability becomes a special case of this unified theory. The bounds derived on the parameter perturbation for robust root clustering are then used in selecting the robust controller.
NASA Astrophysics Data System (ADS)
Ruiz-Díaz, P.; Muñoz-Navia, M.; Dorantes-Dávila, J.
2018-03-01
Charge-doping together with 3 d-4 d alloying emerges as promising mechanisms for tailoring the magnetic properties of low-dimensional systems. Here, throughout ab initio calculations, we present a systematic overview regarding the impact of both electron(hole) charge-doping and chemical composition on the magnetocrystalline anisotropy (MA) of CoPt core-shell alloy clusters. By taking medium-sized Co n Pt m ( N = n + m = 85) octahedral-like alloy nanoparticles for some illustrative core-sizes as examples, we found enhanced MA energies and large induced spin(orbital) moments in Pt-rich clusters. Moreover, depending on the Pt-core-size, both in-plane and off-plane directions of magnetization are observed. In general, the MA of these binary compounds further stabilizes upon charge-doping. In addition, in the clusters with small MA, the doping promotes magnetization switching. Insights into the microscopical origins of the MA behavior are associated to changes in the electronic structure of the clusters. [Figure not available: see fulltext.
Dubbs, Nicole L; Bazzoli, Gloria J; Shortell, Stephen M; Kralovec, Peter D
2004-02-01
To (a) assess how the original cluster categories of hospital-led health networks and systems have changed over time; (b) identify any new patterns of cluster configurations; and (c) demonstrate how additional data can be used to refine and enhance the taxonomy measures. DATA SOURCES; 1994 and 1998 American Hospital Association (AHA) Annual Survey of Hospitals. As in the original taxonomy, separate cluster solutions are identified for health networks and health systems by applying three strategic/structural dimensions (differentiation, integration, and centralization) to three components of the health service/product continuum (hospital services, physician arrangements, and provider-based insurance activities). Factor, cluster, and discriminant analyses are used to analyze the 1998 data. Descriptive and comparative methods are used to analyze the updated 1998 taxonomy relative to the original 1994 version. The 1998 cluster categories are similar to the original taxonomy, however, they reveal some new organizational configurations. For the health networks, centralization of product/service lines is occurring more selectively than in the past. For the health systems, participation has grown in and dispersed across a more diverse set of decentralized organizational forms. For both networks and systems, the definition of centralization has changed over time. In its updated form, the taxonomy continues to provide policymakers and practitioners with a descriptive and contextual framework against which to assess organizational programs and policies. There is a need to continue to revisit the taxonomy from time to time because of the persistent evolution of the U.S. health care industry and the consequent shifting of organizational configurations in this arena. There is also value in continuing to move the taxonomy in the direction of refinement/expansion as new opportunities become available.
Washington photometry of 14 intermediate-age to old star clusters in the Small Magellanic Cloud
NASA Astrophysics Data System (ADS)
Piatti, Andrés E.; Clariá, Juan J.; Bica, Eduardo; Geisler, Doug; Ahumada, Andrea V.; Girardi, Léo
2011-10-01
We present CCD photometry in the Washington system C, T1 and T2 passbands down to T1˜ 23 in the fields of L3, L28, HW 66, L100, HW 79, IC 1708, L106, L108, L109, NGC 643, L112, HW 84, HW 85 and HW 86, 14 Small Magellanic Cloud (SMC) clusters, most of them poorly studied objects. We measured T1 magnitudes and C-T1 and T1-T2 colours for a total of 213 516 stars spread throughout cluster areas of 14.7 × 14.7 arcmin2 each. We carried out an in-depth analysis of the field star contamination of the colour-magnitude diagrams (CMDs) and statistically cleaned the cluster CMDs. Based on the best fits of isochrones computed by the Padova group to the (T1, C-T1) CMDs, as well as from the δ(T1) index and the standard giant branch procedure, we derived ages and metallicities for the cluster sample. With the exception of IC 1708, a relatively metal-poor Hyades-age cluster, the remaining 13 objects are between intermediate and old age (from 1.0 to 6.3 Gyr), their [Fe/H] values ranging from -1.4 to -0.7 dex. By combining these results with others available in the literature, we compiled a sample of 43 well-known SMC clusters older than 1 Gyr, with which we produced a revised age distribution. We found that the present clusters' age distribution reveals two primary excesses of clusters at t˜ 2 and 5 Gyr, which engraves the SMC with clear signs of enhanced formation episodes at both ages. In addition, we found that from the birth of the SMC cluster system until approximately the first 4 Gyr of its lifetime, the cluster formation resembles that of a constant formation rate scenario.
DOE Office of Scientific and Technical Information (OSTI.GOV)
U. Burghaus
2012-07-05
The prior project consisted of two main project lines. First, characterization of novel nanomaterials for hydrodesulfurization (HDS) applications. Second, studying more traditional model systems for HDS such as vapor-deposited silica-supported Mo and MoSx clusters. In the first subproject, we studied WS2 and MoS2 fullerene-like nanoparticles as well as WS2 nanotubes. Thiophene (C4H4S) was used as the probe molecule. Interestingly, metallic and sulfur-like adsorption sites could be identified on the silica-supported fullerene-particles system. Similar structures are seen for the traditional system (vapor-deposited clusters). Thus, this may be a kinetics fingerprint feature of modern HDS model systems. In addition, kinetics data allowedmore » characterization of the different adsorption sites for thiophene on and inside WS2 nanotube bundles. The latter is a unique feature of nanotubes that has not been reported before for any inorganic nanotube system; however, examples are known for carbon nanotubes, including prior work of the PI. Although HDS has been studied for decades, utilizing nanotubes as nanosized HDS reactors has never been tried before, as far as we know. This is of interest from a fundamental perspective. Unfortunately, the HDS activity of the nanocatalysts at ultra-high vacuum (UHV) conditions was close to the detection limit of our techniques. Therefore, we propose to run experiments at ambient pressure on related nanopowder samples as part of the renewal application utilizing a now-available GC (gas chromatograph) setup. In addition, Ni and Co doped nanocatalyts are proposed for study. These dopants will boost the catalytic activity. In the second subproject of the prior grant, we studied HDS-related chemistry on more traditional supported cluster catalysts. Mo clusters supported by physical vapor deposition (PVD) on silica have been characterized. Two reaction pathways are evident when adsorbing thiophene on Mo and MoSx clusters: molecular adsorption and dissociation. PVD Mo clusters turned out to be very reactive toward thiophene bond activation. Sulfur and carbon residuals form, which poison the catalyst and sulfide the Mo clusters. Sulfided silica-supported MoSx samples are not reactive toward thiophene bond activation. In addition to S and C deposits, H2, H2S, and small organic molecules were detected in the gas phase. Catalyst reactivation procedures, including O2 and atomic hydrogen treatments, have been tested. Cluster size effects have been seen: thiophene adsorbs molecularly with larger binding energies on smaller clusters. However, larger clusters have smaller activation energy for C4H4S bond activation than smaller clusters. The latter is consistent with early catalysis studies. Kinetics and dynamics parameters have been determined quantitatively. We spent a significant amount of time on upgrades of our equipment. A 2nd-hand refurbished X-ray photoelectron spectrometer (XPS) has been integrated into the existing molecular beam scattering system and is already operational (supported by the DoE supplemental grant available in October 2009). We also added a time of flight (TOF) system to the beam scattering apparatus and improved on the accessible impact energy range (new nozzle heater and gas mixing manifold) for the beam scattering experiments. In addition, a GC-based powder atmospheric flow reactor for studies on powder samples is now operational. Furthermore, a 2nd UHV kinetics system has been upgraded as well. In summary, mostly single crystal systems have so far been considered in basic science studies about HDS. Industrial catalysts, however, can be better approximated with the supported cluster systems that we studied in this project. Furthermore, an entirely new class of HDS systems, namely fullerene-like particles and inorganic nanotubes, has been included. Studying new materials and systems has the potential to impact science and technology. The systems investigated are closely related to energy and environmental-related surface science/catalysis. This prior project, conducted at NDSU by a small team, resulted in a total of 14 printed publications,1-5, 7-12, 14, 19, 20eight months before the end of the funding period. In addition, collaborators at national laboratories and abroad were part of the projects, as proposed. More specifically, projects about HDS on MoS2 and WS2 inorganic fullerene-like nanoparticles,1, 5 inorganic WS2 nanotubes,2 Mo and MoS2 vapor-deposited nanoclusters,3 modeling,19 reviews/book chapter,7, 11 and side projects8-10 have been conducted, as proposed, acknowledging solely (exception ref.7) funding from this grant. A list of publications and coworkers is given in sect. 6.« less
Lesho, Emil; Clifford, Robert; Onmus-Leone, Fatma; Appalla, Lakshmi; Snesrud, Erik; Kwak, Yoon; Ong, Ana; Maybank, Rosslyn; Waterman, Paige; Rohrbeck, Patricia; Julius, Michael; Roth, Amanda; Martinez, Joshua; Nielsen, Lindsey; Steele, Eric; McGann, Patrick; Hinkle, Mary
2016-01-01
Objective We sought to: 1) provide an overview of the genomic epidemiology of an extensive collection of carbapenemase-producing bacteria (CPB) collected in the U.S. Department of Defense health system; 2) increase awareness of the public availability of the sequences, isolates, and customized antimicrobial resistance database of that system; and 3) illustrate challenges and offer mitigations for implementing next generation sequencing (NGS) across large health systems. Design Prospective surveillance and system-wide implementation of NGS. Setting 288-hospital healthcare network. Methods All phenotypically carbapenem resistant bacteria underwent CarbaNP® testing and PCR, followed by NGS. Commercial (Newbler and Geneious), on-line (ResFinder), and open-source software (Btrim, FLASh, Bowtie2, an Samtools) were used for assembly, SNP detection and clustering. Laboratory capacity, throughput, and response time were assessed. Results From 2009 through 2015, 27,000 multidrug-resistant Gram-negative isolates were submitted. 225 contained carbapenemase-encoding genes (most commonly blaKPC, blaNDM, and blaOXA23). These were found in 15 species from 146 inpatients in 19 facilities. Genetically related CPB were found in more than one hospital. Other clusters or outbreaks were not clonal and involved genetically related plasmids, while some involved several unrelated plasmids. Relatedness depended on the clustering algorithm used. Transmission patterns of plasmids and other mobile genetic elements could not be determined without ultra-long read, single-molecule real-time sequencing. 80% of carbapenem-resistant phenotypes retained susceptibility to aminoglycosides, and 70% retained susceptibility to fluoroquinolones. However, among the CPB-confirmed genotypes, fewer than 25% retained susceptibility to aminoglycosides or fluoroquinolones. Conclusion Although NGS is increasingly acclaimed to revolutionize clinical practice, resource-constrained environments, large or geographically dispersed healthcare networks, and military or government-funded public health laboratories are likely to encounter constraints and challenges as they implement NGS across their health systems. These include lack of standardized definitions and quality control metrics, limitations of short-read sequencing, insufficient bandwidth, and the current limited availability of very expensive and scarcely available sequencing platforms. Possible solutions and mitigations are also proposed. PMID:27196272
Lesho, Emil; Clifford, Robert; Onmus-Leone, Fatma; Appalla, Lakshmi; Snesrud, Erik; Kwak, Yoon; Ong, Ana; Maybank, Rosslyn; Waterman, Paige; Rohrbeck, Patricia; Julius, Michael; Roth, Amanda; Martinez, Joshua; Nielsen, Lindsey; Steele, Eric; McGann, Patrick; Hinkle, Mary
2016-01-01
We sought to: 1) provide an overview of the genomic epidemiology of an extensive collection of carbapenemase-producing bacteria (CPB) collected in the U.S. Department of Defense health system; 2) increase awareness of the public availability of the sequences, isolates, and customized antimicrobial resistance database of that system; and 3) illustrate challenges and offer mitigations for implementing next generation sequencing (NGS) across large health systems. Prospective surveillance and system-wide implementation of NGS. 288-hospital healthcare network. All phenotypically carbapenem resistant bacteria underwent CarbaNP® testing and PCR, followed by NGS. Commercial (Newbler and Geneious), on-line (ResFinder), and open-source software (Btrim, FLASh, Bowtie2, an Samtools) were used for assembly, SNP detection and clustering. Laboratory capacity, throughput, and response time were assessed. From 2009 through 2015, 27,000 multidrug-resistant Gram-negative isolates were submitted. 225 contained carbapenemase-encoding genes (most commonly blaKPC, blaNDM, and blaOXA23). These were found in 15 species from 146 inpatients in 19 facilities. Genetically related CPB were found in more than one hospital. Other clusters or outbreaks were not clonal and involved genetically related plasmids, while some involved several unrelated plasmids. Relatedness depended on the clustering algorithm used. Transmission patterns of plasmids and other mobile genetic elements could not be determined without ultra-long read, single-molecule real-time sequencing. 80% of carbapenem-resistant phenotypes retained susceptibility to aminoglycosides, and 70% retained susceptibility to fluoroquinolones. However, among the CPB-confirmed genotypes, fewer than 25% retained susceptibility to aminoglycosides or fluoroquinolones. Although NGS is increasingly acclaimed to revolutionize clinical practice, resource-constrained environments, large or geographically dispersed healthcare networks, and military or government-funded public health laboratories are likely to encounter constraints and challenges as they implement NGS across their health systems. These include lack of standardized definitions and quality control metrics, limitations of short-read sequencing, insufficient bandwidth, and the current limited availability of very expensive and scarcely available sequencing platforms. Possible solutions and mitigations are also proposed.
Accelerating epistasis analysis in human genetics with consumer graphics hardware.
Sinnott-Armstrong, Nicholas A; Greene, Casey S; Cancare, Fabio; Moore, Jason H
2009-07-24
Human geneticists are now capable of measuring more than one million DNA sequence variations from across the human genome. The new challenge is to develop computationally feasible methods capable of analyzing these data for associations with common human disease, particularly in the context of epistasis. Epistasis describes the situation where multiple genes interact in a complex non-linear manner to determine an individual's disease risk and is thought to be ubiquitous for common diseases. Multifactor Dimensionality Reduction (MDR) is an algorithm capable of detecting epistasis. An exhaustive analysis with MDR is often computationally expensive, particularly for high order interactions. This challenge has previously been met with parallel computation and expensive hardware. The option we examine here exploits commodity hardware designed for computer graphics. In modern computers Graphics Processing Units (GPUs) have more memory bandwidth and computational capability than Central Processing Units (CPUs) and are well suited to this problem. Advances in the video game industry have led to an economy of scale creating a situation where these powerful components are readily available at very low cost. Here we implement and evaluate the performance of the MDR algorithm on GPUs. Of primary interest are the time required for an epistasis analysis and the price to performance ratio of available solutions. We found that using MDR on GPUs consistently increased performance per machine over both a feature rich Java software package and a C++ cluster implementation. The performance of a GPU workstation running a GPU implementation reduces computation time by a factor of 160 compared to an 8-core workstation running the Java implementation on CPUs. This GPU workstation performs similarly to 150 cores running an optimized C++ implementation on a Beowulf cluster. Furthermore this GPU system provides extremely cost effective performance while leaving the CPU available for other tasks. The GPU workstation containing three GPUs costs $2000 while obtaining similar performance on a Beowulf cluster requires 150 CPU cores which, including the added infrastructure and support cost of the cluster system, cost approximately $82,500. Graphics hardware based computing provides a cost effective means to perform genetic analysis of epistasis using MDR on large datasets without the infrastructure of a computing cluster.
Monitoring by Use of Clusters of Sensor-Data Vectors
NASA Technical Reports Server (NTRS)
Iverson, David L.
2007-01-01
The inductive monitoring system (IMS) is a system of computer hardware and software for automated monitoring of the performance, operational condition, physical integrity, and other aspects of the health of a complex engineering system (e.g., an industrial process line or a spacecraft). The input to the IMS consists of streams of digitized readings from sensors in the monitored system. The IMS determines the type and amount of any deviation of the monitored system from a nominal or normal ( healthy ) condition on the basis of a comparison between (1) vectors constructed from the incoming sensor data and (2) corresponding vectors in a database of nominal or normal behavior. The term inductive reflects the use of a process reminiscent of traditional mathematical induction to learn about normal operation and build the nominal-condition database. The IMS offers two major advantages over prior computational monitoring systems: The computational burden of the IMS is significantly smaller, and there is no need for abnormal-condition sensor data for training the IMS to recognize abnormal conditions. The figure schematically depicts the relationships among the computational processes effected by the IMS. Training sensor data are gathered during normal operation of the monitored system, detailed computational simulation of operation of the monitored system, or both. The training data are formed into vectors that are used to generate the database. The vectors in the database are clustered into regions that represent normal or nominal operation. Once the database has been generated, the IMS compares the vectors of incoming sensor data with vectors representative of the clusters. The monitored system is deemed to be operating normally or abnormally, depending on whether the vector of incoming sensor data is or is not, respectively, sufficiently close to one of the clusters. For this purpose, a distance between two vectors is calculated by a suitable metric (e.g., Euclidean distance) and "sufficiently close" signifies lying at a distance less than a specified threshold value. It must be emphasized that although the IMS is intended to detect off-nominal or abnormal performance or health, it is not necessarily capable of performing a thorough or detailed diagnosis. Limited diagnostic information may be available under some circumstances. For example, the distance of a vector of incoming sensor data from the nearest cluster could serve as an indication of the severity of a malfunction. The identity of the nearest cluster may be a clue as to the identity of the malfunctioning component or subsystem. It is possible to decrease the IMS computation time by use of a combination of cluster-indexing and -retrieval methods. For example, in one method, the distances between each cluster and two or more reference vectors can be used for the purpose of indexing and retrieval. The clusters are sorted into a list according to these distance values, typically in ascending order of distance. When a set of input data arrives and is to be tested, the data are first arranged as an ordered set (that is, a vector). The distances from the input vector to the reference points are computed. The search of clusters from the list can then be limited to those clusters lying within a certain distance range from the input vector; the computation time is reduced by not searching the clusters at a greater distance.
Babouee, B.; Frei, R.; Schultheiss, E.; Widmer, A. F.; Goldenberger, D.
2011-01-01
The emergence of methicillin-resistant Staphylococcus aureus (MRSA) has become an increasing problem worldwide in recent decades. Molecular typing methods have been developed to identify clonality of strains and monitor spread of MRSA. We compared a new commercially available DiversiLab (DL) repetitive element PCR system with spa typing, spa clonal cluster analysis, and pulsed-field gel electrophoresis (PFGE) in terms of discriminatory power and concordance. A collection of 106 well-defined MRSA strains from our hospital was analyzed, isolated between 1994 and 2006. In addition, we analyzed 6 USA300 strains collected in our institution. DL typing separated the 106 MRSA isolates in 10 distinct clusters and 8 singleton patterns. Clustering analysis into spa clonal complexes resulted in 3 clusters: spa-CC 067/548, spa-CC 008, and spa-CC 012. The discriminatory powers (Simpson's index of diversity) were 0.982, 0.950, 0.846, and 0.757 for PFGE, spa typing, DL typing, and spa clonal clustering, respectively. DL typing and spa clonal clustering showed the highest concordance, calculated by adjusted Rand's coefficients. The 6 USA300 isolates grouped homogeneously into distinct PFGE and DL clusters, and all belonged to spa type t008 and spa-CC 008. Among the three methods, DL proved to be rapid and easy to perform. DL typing qualifies for initial screening during outbreak investigation. However, compared to PFGE and spa typing, DL typing has limited discriminatory power and therefore should be complemented by more discriminative methods in isolates that share identical DL patterns. PMID:21307215
A multimembership catalogue for 1876 open clusters using UCAC4 data
NASA Astrophysics Data System (ADS)
Sampedro, L.; Dias, W. S.; Alfaro, E. J.; Monteiro, H.; Molino, A.
2017-10-01
The main objective of this work is to determine the cluster members of 1876 open clusters, using positions and proper motions of the astrometric fourth United States Naval Observatory (USNO) CCD Astrograph Catalog (UCAC4). For this purpose, we apply three different methods, all based on a Bayesian approach, but with different formulations: a purely parametric method, another completely non-parametric algorithm and a third, recently developed by Sampedro & Alfaro, using both formulations at different steps of the whole process. The first and second statistical moments of the members' phase-space subspace, obtained after applying the three methods, are compared for every cluster. Although, on average, the three methods yield similar results, there are also specific differences between them, as well as for some particular clusters. The comparison with other published catalogues shows good agreement. We have also estimated, for the first time, the mean proper motion for a sample of 18 clusters. The results are organized in a single catalogue formed by two main files, one with the most relevant information for each cluster, partially including that in UCAC4, and the other showing the individual membership probabilities for each star in the cluster area. The final catalogue, with an interface design that enables an easy interaction with the user, is available in electronic format at the Stellar Systems Group (SSG-IAA) web site (http://ssg.iaa.es/en/content/sampedro-cluster-catalog).
NASA Astrophysics Data System (ADS)
Nastasi, A.; Fassbender, R.; Böhringer, H.; Šuhada, R.; Rosati, P.; Pierini, D.; Verdugo, M.; Santos, J. S.; Schwope, A. D.; de Hoon, A.; Kohnert, J.; Lamer, G.; Mühlegger, M.; Quintana, H.
2011-08-01
We report the discovery of a galaxy cluster at z = 1.490 originally selected as an extended X-ray source in the XMM-Newton Distant Cluster Project. Further observations carried out with the VLT-FORS2 spectrograph allowed the spectroscopic confirmation of seven secure cluster members, providing a median system redshift of z = 1.490 ± 0.009. The color-magnitude diagram of XMMU J0338.8+0021 reveals the presence of a well-populated red sequence with z - H ≈ 3, albeit with an apparent significant scatter in color. Since we do not detect indications of any strong star formation activity in these objects, the color spread could represent the different stellar ages of the member galaxies. In addition, we found the brightest cluster galaxy in a very active dynamical state, with an interacting, merging companion located at a physical projected distance of d ≈ 20 kpc. From the X-ray luminosity, we estimate a cluster mass of M200 ~ 1.2 × 1014 M⊙. The data appear to be consistent with a scenario in which XMMU J0338.8+0021 is a young system, possibly caught in a moment of active ongoing mass assembly. Based on observations under programme ID 084.A-0844 collected at the European Organisation for Astronomical Research in the Southern Hemisphere, Chile, and observations collected at the Centro Astronómico Hispano Alemán at Calar Alto, operated jointly by the Max-Planck Institut für Astronomie and the Instituto de Astrofísica de Andalucía (CSIC).Tables 1, 2 and Figs. 3-6 are available in electronic form at http://www.aanda.org
NASA Astrophysics Data System (ADS)
Bonacic-Koutecky, Vlasta; Burda, Jaroslav; Mitric, Roland; Ge, Maofa; Zampella, Giuseppe; Fantucci, Piercarlo
2002-08-01
Bimetallic silver-gold clusters offer an excellent opportunity to study changes in metallic versus "ionic" properties involving charge transfer as a function of the size and the composition, particularly when compared to pure silver and gold clusters. We have determined structures, ionization potentials, and vertical detachment energies for neutral and charged bimetallic AgmAun 3[less-than-or-equal](m+n)[less-than-or-equal]5 clusters. Calculated VDE values compare well with available experimental data. In the stable structures of these clusters Au atoms assume positions which favor the charge transfer from Ag atoms. Heteronuclear bonding is usually preferred to homonuclear bonding in clusters with equal numbers of hetero atoms. In fact, stable structures of neutral Ag2Au2, Ag3Au3, and Ag4Au4 clusters are characterized by the maximum number of hetero bonds and peripheral positions of Au atoms. Bimetallic tetramer as well as hexamer are planar and have common structural properties with corresponding one-component systems, while Ag4Au4 and Ag8 have 3D forms in contrast to Au8 which assumes planar structure. At the density functional level of theory we have shown that this is due to participation of d electrons in bonding of pure Aun clusters while s electrons dominate bonding in pure Agm as well as in bimetallic clusters. In fact, Aun clusters remain planar for larger sizes than Agm and AgnAun clusters. Segregation between two components in bimetallic systems is not favorable, as shown in the example of Ag5Au5 cluster. We have found that the structures of bimetallic clusters with 20 atoms Ag10Au10 and Ag12Au8 are characterized by negatively charged Au subunits embedded in Ag environment. In the latter case, the shape of Au8 is related to a pentagonal bipyramid capped by one atom and contains three exposed negatively charged Au atoms. They might be suitable for activating reactions relevant to catalysis. According to our findings the charge transfer in bimetallic clusters is responsible for formation of negatively charged gold subunits which are expected to be reactive, a situation similar to that of gold clusters supported on metal oxides.
Peña, Alejandro; Del Carratore, Francesco; Cummings, Matthew; Takano, Eriko; Breitling, Rainer
2017-12-18
The rapid increase of publicly available microbial genome sequences has highlighted the presence of hundreds of thousands of biosynthetic gene clusters (BGCs) encoding valuable secondary metabolites. The experimental characterization of new BGCs is extremely laborious and struggles to keep pace with the in silico identification of potential BGCs. Therefore, the prioritisation of promising candidates among computationally predicted BGCs represents a pressing need. Here, we propose an output ordering and prioritisation system (OOPS) which helps sorting identified BGCs by a wide variety of custom-weighted biological and biochemical criteria in a flexible and user-friendly interface. OOPS facilitates a judicious prioritisation of BGCs using G+C content, coding sequence length, gene number, cluster self-similarity and codon bias parameters, as well as enabling the user to rank BGCs based upon BGC type, novelty, and taxonomic distribution. Effective prioritisation of BGCs will help to reduce experimental attrition rates and improve the breadth of bioactive metabolites characterized.
Yu, Chun-Yang; Yang, Zhong-Zhi
2011-03-31
Hydrogen peroxide (HP) clusters (H(2)O(2))(n) (n = 1-6) and liquid-state HP have been systemically investigated by the newly constructed ABEEM/MM fluctuating charge model. Because of the explicit description of charge distribution and special treatment of the hydrogen-bond interaction region, the ABEEM/MM potential model gives reasonable properties of HP clusters, including geometries, interaction energies, and dipole moments, when comparing with the present ab initio results. Meanwhile, the average dipole moment, static dielectric constant, heats of vaporization, radial distribution function, and diffusion constant for the dynamic properties of liquid HP at 273 K and 1 atm are fairly consistent with the available experimental data. To the best of our knowledge, this is the first theoretical investigation of condensed HP. The properties of HP monomer are studied in detail involving the structure, torsion potentials, molecular orbital analysis, charge distribution, dipole moment, and vibrational frequency.
Effect of video server topology on contingency capacity requirements
NASA Astrophysics Data System (ADS)
Kienzle, Martin G.; Dan, Asit; Sitaram, Dinkar; Tetzlaff, William H.
1996-03-01
Video servers need to assign a fixed set of resources to each video stream in order to guarantee on-time delivery of the video data. If a server has insufficient resources to guarantee the delivery, it must reject the stream request rather than slowing down all existing streams. Large scale video servers are being built as clusters of smaller components, so as to be economical, scalable, and highly available. This paper uses a blocking model developed for telephone systems to evaluate video server cluster topologies. The goal is to achieve high utilization of the components and low per-stream cost combined with low blocking probability and high user satisfaction. The analysis shows substantial economies of scale achieved by larger server images. Simple distributed server architectures can result in partitioning of resources with low achievable resource utilization. By comparing achievable resource utilization of partitioned and monolithic servers, we quantify the cost of partitioning. Next, we present an architecture for a distributed server system that avoids resource partitioning and results in highly efficient server clusters. Finally, we show how, in these server clusters, further optimizations can be achieved through caching and batching of video streams.
The Hubble Space Telescope Medium Deep Survey Cluster Sample: Methodology and Data
NASA Astrophysics Data System (ADS)
Ostrander, E. J.; Nichol, R. C.; Ratnatunga, K. U.; Griffiths, R. E.
1998-12-01
We present a new, objectively selected, sample of galaxy overdensities detected in the Hubble Space Telescope Medium Deep Survey (MDS). These clusters/groups were found using an automated procedure that involved searching for statistically significant galaxy overdensities. The contrast of the clusters against the field galaxy population is increased when morphological data are used to search around bulge-dominated galaxies. In total, we present 92 overdensities above a probability threshold of 99.5%. We show, via extensive Monte Carlo simulations, that at least 60% of these overdensities are likely to be real clusters and groups and not random line-of-sight superpositions of galaxies. For each overdensity in the MDS cluster sample, we provide a richness and the average of the bulge-to-total ratio of galaxies within each system. This MDS cluster sample potentially contains some of the most distant clusters/groups ever detected, with about 25% of the overdensities having estimated redshifts z > ~0.9. We have made this sample publicly available to facilitate spectroscopic confirmation of these clusters and help more detailed studies of cluster and galaxy evolution. We also report the serendipitous discovery of a new cluster close on the sky to the rich optical cluster Cl l0016+16 at z = 0.546. This new overdensity, HST 001831+16208, may be coincident with both an X-ray source and a radio source. HST 001831+16208 is the third cluster/group discovered near to Cl 0016+16 and appears to strengthen the claims of Connolly et al. of superclustering at high redshift.
Li, Ruijuan; Erpelding, John E
2016-10-01
The diploid cotton species Gossypium arboreum possesses many favorable agronomic traits such as drought tolerance and disease resistance, which can be utilized in the development of improved upland cotton cultivars. The USDA National Plant Germplasm System maintains more than 1600 G. arboreum accessions. Little information is available on the genetic diversity of the collection thereby limiting the utilization of this cotton species. The genetic diversity and population structure of the G. arboreum germplasm collection were assessed by genotyping-by-sequencing of 375 accessions. Using genome-wide single nucleotide polymorphism sequence data, two major clusters were inferred with 302 accessions in Cluster 1, 64 accessions in Cluster 2, and nine accessions unassigned due to their nearly equal membership to each cluster. These two clusters were further evaluated independently resulting in the identification of two sub-clusters for the 302 Cluster 1 accessions and three sub-clusters for the 64 Cluster 2 accessions. Low to moderate genetic diversity between clusters and sub-clusters were observed indicating a narrow genetic base. Cluster 2 accessions were more genetically diverse and the majority of the accessions in this cluster were landraces. In contrast, Cluster 1 is composed of varieties or breeding lines more recently added to the collection. The majority of the accessions had kinship values ranging from 0.6 to 0.8. Eight pairs of accessions were identified as potential redundancies due to their high kinship relatedness. The genetic diversity and genotype data from this study are essential to enhance germplasm utilization to identify genetically diverse accessions for the detection of quantitative trait loci associated with important traits that would benefit upland cotton improvement.
Open source clustering software.
de Hoon, M J L; Imoto, S; Nolan, J; Miyano, S
2004-06-12
We have implemented k-means clustering, hierarchical clustering and self-organizing maps in a single multipurpose open-source library of C routines, callable from other C and C++ programs. Using this library, we have created an improved version of Michael Eisen's well-known Cluster program for Windows, Mac OS X and Linux/Unix. In addition, we generated a Python and a Perl interface to the C Clustering Library, thereby combining the flexibility of a scripting language with the speed of C. The C Clustering Library and the corresponding Python C extension module Pycluster were released under the Python License, while the Perl module Algorithm::Cluster was released under the Artistic License. The GUI code Cluster 3.0 for Windows, Macintosh and Linux/Unix, as well as the corresponding command-line program, were released under the same license as the original Cluster code. The complete source code is available at http://bonsai.ims.u-tokyo.ac.jp/mdehoon/software/cluster. Alternatively, Algorithm::Cluster can be downloaded from CPAN, while Pycluster is also available as part of the Biopython distribution.
Systematic Identification of Genes Required for Expression of Androgen Receptor Splice Variants
2016-08-01
engineering tool has been developed from bacterial Clustered Regularly Interspaced Short Palindromic Repeats ( CRISPR )/ CRISPR ‐Associated System (Cas...regulation of AR splice variant through CRISPR /Cas screening system. 15. SUBJECT TERMS CRISPR /Cas, Androgen receptor, castration resistance, biomarker 16...control (non-targeting) gRNAs available from Addgene (http://www.addgene.org/ CRISPR /libraries/). Generation of AR3 reporter: We used molecular cloning
Analytical network process based optimum cluster head selection in wireless sensor network.
Farman, Haleem; Javed, Huma; Jan, Bilal; Ahmad, Jamil; Ali, Shaukat; Khalil, Falak Naz; Khan, Murad
2017-01-01
Wireless Sensor Networks (WSNs) are becoming ubiquitous in everyday life due to their applications in weather forecasting, surveillance, implantable sensors for health monitoring and other plethora of applications. WSN is equipped with hundreds and thousands of small sensor nodes. As the size of a sensor node decreases, critical issues such as limited energy, computation time and limited memory become even more highlighted. In such a case, network lifetime mainly depends on efficient use of available resources. Organizing nearby nodes into clusters make it convenient to efficiently manage each cluster as well as the overall network. In this paper, we extend our previous work of grid-based hybrid network deployment approach, in which merge and split technique has been proposed to construct network topology. Constructing topology through our proposed technique, in this paper we have used analytical network process (ANP) model for cluster head selection in WSN. Five distinct parameters: distance from nodes (DistNode), residual energy level (REL), distance from centroid (DistCent), number of times the node has been selected as cluster head (TCH) and merged node (MN) are considered for CH selection. The problem of CH selection based on these parameters is tackled as a multi criteria decision system, for which ANP method is used for optimum cluster head selection. Main contribution of this work is to check the applicability of ANP model for cluster head selection in WSN. In addition, sensitivity analysis is carried out to check the stability of alternatives (available candidate nodes) and their ranking for different scenarios. The simulation results show that the proposed method outperforms existing energy efficient clustering protocols in terms of optimum CH selection and minimizing CH reselection process that results in extending overall network lifetime. This paper analyzes that ANP method used for CH selection with better understanding of the dependencies of different components involved in the evaluation process.
Analytical network process based optimum cluster head selection in wireless sensor network
Javed, Huma; Jan, Bilal; Ahmad, Jamil; Ali, Shaukat; Khalil, Falak Naz; Khan, Murad
2017-01-01
Wireless Sensor Networks (WSNs) are becoming ubiquitous in everyday life due to their applications in weather forecasting, surveillance, implantable sensors for health monitoring and other plethora of applications. WSN is equipped with hundreds and thousands of small sensor nodes. As the size of a sensor node decreases, critical issues such as limited energy, computation time and limited memory become even more highlighted. In such a case, network lifetime mainly depends on efficient use of available resources. Organizing nearby nodes into clusters make it convenient to efficiently manage each cluster as well as the overall network. In this paper, we extend our previous work of grid-based hybrid network deployment approach, in which merge and split technique has been proposed to construct network topology. Constructing topology through our proposed technique, in this paper we have used analytical network process (ANP) model for cluster head selection in WSN. Five distinct parameters: distance from nodes (DistNode), residual energy level (REL), distance from centroid (DistCent), number of times the node has been selected as cluster head (TCH) and merged node (MN) are considered for CH selection. The problem of CH selection based on these parameters is tackled as a multi criteria decision system, for which ANP method is used for optimum cluster head selection. Main contribution of this work is to check the applicability of ANP model for cluster head selection in WSN. In addition, sensitivity analysis is carried out to check the stability of alternatives (available candidate nodes) and their ranking for different scenarios. The simulation results show that the proposed method outperforms existing energy efficient clustering protocols in terms of optimum CH selection and minimizing CH reselection process that results in extending overall network lifetime. This paper analyzes that ANP method used for CH selection with better understanding of the dependencies of different components involved in the evaluation process. PMID:28719616
Beating the tyranny of scale with a private cloud configured for Big Data
NASA Astrophysics Data System (ADS)
Lawrence, Bryan; Bennett, Victoria; Churchill, Jonathan; Juckes, Martin; Kershaw, Philip; Pepler, Sam; Pritchard, Matt; Stephens, Ag
2015-04-01
The Joint Analysis System, JASMIN, consists of a five significant hardware components: a batch computing cluster, a hypervisor cluster, bulk disk storage, high performance disk storage, and access to a tape robot. Each of the computing clusters consists of a heterogeneous set of servers, supporting a range of possible data analysis tasks - and a unique network environment makes it relatively trivial to migrate servers between the two clusters. The high performance disk storage will include the world's largest (publicly visible) deployment of the Panasas parallel disk system. Initially deployed in April 2012, JASMIN has already undergone two major upgrades, culminating in a system which by April 2015, will have in excess of 16 PB of disk and 4000 cores. Layered on the basic hardware are a range of services, ranging from managed services, such as the curated archives of the Centre for Environmental Data Archival or the data analysis environment for the National Centres for Atmospheric Science and Earth Observation, to a generic Infrastructure as a Service (IaaS) offering for the UK environmental science community. Here we present examples of some of the big data workloads being supported in this environment - ranging from data management tasks, such as checksumming 3 PB of data held in over one hundred million files, to science tasks, such as re-processing satellite observations with new algorithms, or calculating new diagnostics on petascale climate simulation outputs. We will demonstrate how the provision of a cloud environment closely coupled to a batch computing environment, all sharing the same high performance disk system allows massively parallel processing without the necessity to shuffle data excessively - even as it supports many different virtual communities, each with guaranteed performance. We will discuss the advantages of having a heterogeneous range of servers with available memory from tens of GB at the low end to (currently) two TB at the high end. There are some limitations of the JASMIN environment, the high performance disk environment is not fully available in the IaaS environment, and a planned ability to burst compute heavy jobs into the public cloud is not yet fully available. There are load balancing and performance issues that need to be understood. We will conclude with projections for future usage, and our plans to meet those requirements.
2n-emission from {sup 205}Pb* nucleus using clusterization approach at E{sub beam}∼14-20 MeV
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaur, Amandeep, E-mail: adeepkaur89@gmail.com; Sandhu, Kiran; Sharma, Manoj Kumar, E-mail: msharma@thapar.edu
2016-05-06
The dynamics involved in n-induced reaction with {sup 204}Pb target is analyzed and the decay of the composite system {sup 205}Pb* is governed within the collective clusterization approach of the Dynamical Cluster-decay Model (DCM). The experimental data for 2n-evaporation channel is available for neutron energy range of 14-20 MeV and is addressed by optimizing the only parameter of the model, the neck-length parameter (ΔR). The calculations are done by taking the quadrupole (β{sub 2}) deformations of the decaying fragments and the calculated 2n-emission cross-sections find nice agreement with available data. An effort is made to study the role of levelmore » density parameter in the decay of hot-rotating nucleus, and the mass dependence in level density parameter is exercised for the first time in DCM based calculations. It is to be noted that the effect of deformation, temperature and angular momentum etc. is studied to extract better description of the dynamics involved.« less
Structure and substructure analysis of DAFT/FADA galaxy clusters in the [0.4-0.9] redshift range
NASA Astrophysics Data System (ADS)
Guennou, L.; Adami, C.; Durret, F.; Lima Neto, G. B.; Ulmer, M. P.; Clowe, D.; LeBrun, V.; Martinet, N.; Allam, S.; Annis, J.; Basa, S.; Benoist, C.; Biviano, A.; Cappi, A.; Cypriano, E. S.; Gavazzi, R.; Halliday, C.; Ilbert, O.; Jullo, E.; Just, D.; Limousin, M.; Márquez, I.; Mazure, A.; Murphy, K. J.; Plana, H.; Rostagni, F.; Russeil, D.; Schirmer, M.; Slezak, E.; Tucker, D.; Zaritsky, D.; Ziegler, B.
2014-01-01
Context. The DAFT/FADA survey is based on the study of ~90 rich (masses found in the literature >2 × 1014 M⊙) and moderately distant clusters (redshifts 0.4 < z < 0.9), all with HST imaging data available. This survey has two main objectives: to constrain dark energy (DE) using weak lensing tomography on galaxy clusters and to build a database (deep multi-band imaging allowing photometric redshift estimates, spectroscopic data, X-ray data) of rich distant clusters to study their properties. Aims: We analyse the structures of all the clusters in the DAFT/FADA survey for which XMM-Newton and/or a sufficient number of galaxy redshifts in the cluster range are available, with the aim of detecting substructures and evidence for merging events. These properties are discussed in the framework of standard cold dark matter (ΛCDM) cosmology. Methods: In X-rays, we analysed the XMM-Newton data available, fit a β-model, and subtracted it to identify residuals. We used Chandra data, when available, to identify point sources. In the optical, we applied a Serna & Gerbal (SG) analysis to clusters with at least 15 spectroscopic galaxy redshifts available in the cluster range. We discuss the substructure detection efficiencies of both methods. Results: XMM-Newton data were available for 32 clusters, for which we derive the X-ray luminosity and a global X-ray temperature for 25 of them. For 23 clusters we were able to fit the X-ray emissivity with a β-model and subtract it to detect substructures in the X-ray gas. A dynamical analysis based on the SG method was applied to the clusters having at least 15 spectroscopic galaxy redshifts in the cluster range: 18 X-ray clusters and 11 clusters with no X-ray data. The choice of a minimum number of 15 redshifts implies that only major substructures will be detected. Ten substructures were detected both in X-rays and by the SG method. Most of the substructures detected both in X-rays and with the SG method are probably at their first cluster pericentre approach and are relatively recent infalls. We also find hints of a decreasing X-ray gas density profile core radius with redshift. Conclusions: The percentage of mass included in substructures was found to be roughly constant with redshift values of 5-15%, in agreement both with the general CDM framework and with the results of numerical simulations. Galaxies in substructures show the same general behaviour as regular cluster galaxies; however, in substructures, there is a deficiency of both late type and old stellar population galaxies. Late type galaxies with recent bursts of star formation seem to be missing in the substructures close to the bottom of the host cluster potential well. However, our sample would need to be increased to allow a more robust analysis. Tables 1, 2, 4 and Appendices A-C are available in electronic form at http://www.aanda.org
The Evolution of Globular Cluster Systems In Early-Type Galaxies
NASA Astrophysics Data System (ADS)
Grillmair, Carl
1999-07-01
We will measure structural parameters {core radii and concentrations} of globular clusters in three early-type galaxies using deep, four-point dithered observations. We have chosen globular cluster systems which have young, medium-age and old cluster populations, as indicated by cluster colors and luminosities. Our primary goal is to test the hypothesis that globular cluster luminosity functions evolve towards a ``universal'' form. Previous observations have shown that young cluster systems have exponential luminosity functions rather than the characteristic log-normal luminosity function of old cluster systems. We will test to see whether such young system exhibits a wider range of structural parameters than an old systems, and whether and at what rate plausible disruption mechanisms will cause the luminosity function to evolve towards a log-normal form. A simple observational comparison of structural parameters between different age cluster populations and between diff er ent sub-populations within the same galaxy will also provide clues concerning both the formation and destruction mechanisms of star clusters, the distinction between open and globular clusters, and the advisability of using globular cluster luminosity functions as distance indicators.
NASA Technical Reports Server (NTRS)
Sanz, J.; Pischel, K.; Hubler, D.
1992-01-01
An application for parallel computation on a combined cluster of powerful workstations and supercomputers was developed. A Parallel Virtual Machine (PVM) is used as message passage language on a macro-tasking parallelization of the Aerodynamic Inverse Design and Analysis for a Full Engine computer code. The heterogeneous nature of the cluster is perfectly handled by the controlling host machine. Communication is established via Ethernet with the TCP/IP protocol over an open network. A reasonable overhead is imposed for internode communication, rendering an efficient utilization of the engaged processors. Perhaps one of the most interesting features of the system is its versatile nature, that permits the usage of the computational resources available that are experiencing less use at a given point in time.
Automatic Semantic Orientation of Adjectives for Indonesian Language Using PMI-IR and Clustering
NASA Astrophysics Data System (ADS)
Riyanti, Dewi; Arif Bijaksana, M.; Adiwijaya
2018-03-01
We present our work in the area of sentiment analysis for Indonesian language. We focus on bulding automatic semantic orientation using available resources in Indonesian. In this research we used Indonesian corpus that contains 9 million words from kompas.txt and tempo.txt that manually tagged and annotated with of part-of-speech tagset. And then we construct a dataset by taking all the adjectives from the corpus, removing the adjective with no orientation. The set contained 923 adjective words. This systems will include several steps such as text pre-processing and clustering. The text pre-processing aims to increase the accuracy. And finally clustering method will classify each word to related sentiment which is positive or negative. With improvements to the text preprocessing, can be achieved 72% of accuracy.
Star Formation in Merging Clusters of Galaxies
NASA Astrophysics Data System (ADS)
Mansheim, Alison Seiler
This thesis straddles two areas of cosmology, each of which are active, rich and plagued by controversy in their own right: merging clusters and the environmental dependence of galaxy evolution. While the greater context of this thesis is major cluster mergers, our individual subjects are galaxies, and we apply techniques traditionally used to study the differential evolution of galaxies with environment. The body of this thesis is drawn from two papers: Mansheim et al. 2016a and Mansheim et al. 2016b, one on each system. Both projects benefited from exquisite data sets assembled as part of the Merging Cluster Collaboration (MC2), and Observations of Redshift Evolution in Large Scale Environments (ORELSE) survey, allowing us to scrutinize the evolutionary states of galaxy populations in multiple lights. Multi-band optical and near-infrared imaging was available for both systems, allowing us to calculate photometric redshifts for completeness corrections, colors (red vs. blue) and stellar masses to view the ensemble properties of the populations in and around each merger. High-resolution spectroscopy was also available for both systems, allowing us to confirm cluster members by measuring spectroscopic redshifts, which are unparalleled in accuracy, and gauge star formation rates and histories by measuring the strengths of certain spectral features. We had the luxury of HST imaging for Musket Ball, allowing us to use galaxy morphology as an additional diagnostic. For Cl J0910, 24 mum imaging allowed us to defeat a most pernicious source of uncertainty. Details on the acquisition and reduction of multi-wavelength data for each system are found within each respective chapter. It is important to note that the research presented in Chapter 3 is based on a letter which had significant space restrictions, so much of the observational details are outsourced to papers written by ORELSE collaboration members. Below is a free-standing summary of each project, drawn from the abstracts of each paper. The Chapter 1 contains an introduction to the topic and motivation to fill a vacuum in knowledge using our hypothesis. Chapter 4, following the meat of the thesis in Chapters 2 and 3, gives closure and looks to the future. In Chapter 2, we investigate star formation in DLSCL J0916.2+2953, a dissociative merger of two clusters at z = 0.53 that has progressed 1.1 +1.3-0.4 Gyr since first pass-through. We attempt to reveal the effects a collision may have had on the evolution of the cluster galaxies by tracing their star formation history. We probe current and recent activity to identify a possible star formation event at the time of the merger using EW(Hdelta), EW(OII) and Dn(4000) measured from the composite spectra of 64 cluster and 153 coeval field galaxies. We supplement Keck DEIMOS spectra with DLS and HST imaging to determine the color, stellar mass, and morphology of each galaxy and conduct a comprehensive study of the populations in this complex structure. Spectral results indicate the average cluster and cluster red sequence galaxies experienced no enhanced star formation relative to the surrounding field during the merger, ruling out a predominantly merger-quenched population. We find that the average blue galaxy in the North cluster is currently active and in the South cluster is currently post-starburst having undergone a recent star formation event. While the North activity could be latent or long-term merger effects, a young blue stellar population and irregular geometry suggest the cluster was still forming prior the collision. While the South activity coincides with the time of the merger, the blue early-type population could be a result of secular cluster processes. The evidence suggests that the dearth or surfeit of activity is indiscernible from normal cluster galaxy evolution. In Chapter 3, we examine the effects of an impending cluster merger on galaxies in the large scale structure (LSS) RX Cl J0910 at z =1.105. Using multi-wavelength data, including 102 spectral members drawn from the ORELSE survey and precise photometric redshifts, we calculate extinction-corrected star formation rates and map the specific star formation rate density of the LSS galaxies. These analyses along with an investigation of the color-magnitude properties of LSS galaxies indicate lower levels of star formation activity in the region between the merging clusters relative to the outskirts of the system. We suggest gravitational tidal forces due to the potential of merging halos may be the physical mechanisms responsible for the observed suppression of star formation in galaxies caught between the merging clusters. (Abstract shortened by ProQuest.).
System and method for merging clusters of wireless nodes in a wireless network
Budampati, Ramakrishna S [Maple Grove, MN; Gonia, Patrick S [Maplewood, MN; Kolavennu, Soumitri N [Blaine, MN; Mahasenan, Arun V [Kerala, IN
2012-05-29
A system includes a first cluster having multiple first wireless nodes. One first node is configured to act as a first cluster master, and other first nodes are configured to receive time synchronization information provided by the first cluster master. The system also includes a second cluster having one or more second wireless nodes. One second node is configured to act as a second cluster master, and any other second nodes configured to receive time synchronization information provided by the second cluster master. The system further includes a manager configured to merge the clusters into a combined cluster. One of the nodes is configured to act as a single cluster master for the combined cluster, and the other nodes are configured to receive time synchronization information provided by the single cluster master.
NASA Astrophysics Data System (ADS)
Dotson, Jessie; Barentsen, Geert; Cody, Ann Marie
2018-01-01
The K2 survey has expanded the Kepler legacy by using the repurposed spacecraft to observe over 20 star clusters. The sample includes open and globular clusters at all ages, including very young (1-10 Myr, e.g. Taurus, Upper Sco, NGC 6530), moderately young (0.1-1 Gyr, e.g. M35, M44, Pleiades, Hyades), middle-aged (e.g. M67, Ruprecht 147, NGC 2158), and old globular clusters (e.g. M9, M19, Terzan 5). K2 observations of stellar clusters are exploring the rotation period-mass relationship to significantly lower masses than was previously possible, shedding light on the angular momentum budget and its dependence on mass and circumstellar disk properties, and illuminating the role of multiplicity in stellar angular momentum. Exoplanets discovered by K2 in stellar clusters provides planetary systems ripe for modeling given the extensive information available about their ages and environment. I will review the star clusters sampled by K2 across 16 fields so far, highlighting several characteristics, caveats, and unexplored uses of the public data set along the way. With fuel expected to run out in 2018, I will discuss the closing Campaigns, highlight the final target selection opportunities, and explain the data archive and TESS-compatible software tools the K2 mission intends to leave behind for posterity.
Deep, wide-field, multi-band imaging of z approximately equal to 0.4 clusters and their environs
NASA Technical Reports Server (NTRS)
Silva, David R.; Pierce, Michael J.
1993-01-01
The existence of an excess population of blue galaxies in the cores of distant, rich clusters of galaxies, commonly referred to as the 'Butcher-Oemler' effect is now well established. Spectroscopy of clusters at z = 0.2-0.4 has confirmed that the luminous blue populations comprise as much as 20 percent of these clusters. This fraction is much higher that the 2 percent blue fraction found for nearby rich clusters, such as Coma, indicating that rapid galaxy evolution has occurred on a relatively short time scale. Spectroscopy has also shown that the 'blue' galaxies can basically be divided into three classes: 'starburst' galaxies with large (O II) equivalent widths, 'post-starburst' E+A galaxies (i.e. galaxies with strong Balmer lines shortward of 4000A but elliptical-like colors, and normal spiral/irregulars. Unfortunately, it is difficult to obtain enough spectra of individual galaxies in these intermediate redshift clusters to say anything statistically meaningful. Thus, limited information is available about the relative numbers of these three classes of 'blue' galaxies and the associated E/SO population in these intermediate redshift clusters. More statistically meaningful results can be derived from deep imaging of these clusters. However, the best published data to date (e.g. MacLaren et al. 1988; Dressler & Gunn 1992) are limited to the cluster cores and do not sample the galaxy luminosity functions very deeply at the bluest wavelengths. Furthermore, only limited spectro-energy distribution data is available below 4000A in the observed cluster rest frame providing limited sensitivity to 'recent' star formation activity. To improve this situation, we are currently obtaining deep, wide-field UBRI images of all known rich clusters at z approx. equals 0.4. Our main objective is to obtain the necessary color information to distinguish between the E+SO, 'E+A', and spiral/irregular galaxy populations throughout the cluster/supercluster complex. At this redshift, UBRI correspond to rest-frame 2500A/UVR bandpasses. The rest-frame UVR system provides a powerful 'blue' galaxy discriminate given the expected color distribution. Moreover, since 'hot' stars peak near 2500A, that bandpass is a powerful probe of recent star formation activity in all classes of galaxies. In particular, it is sensitive to ellipticals with 'UV excess' populations (MacLaren et al. 1988).
On the symmetries of the 12C nucleus
NASA Astrophysics Data System (ADS)
Cseh, J.; Trencsényi, R.
The consequences of some symmetries of the three-alpha system are discussed. In particular, the recent description of the low-energy spectrum of the 12C nucleus in terms of the algebraic cluster model (ACM) is compared to that of the multichannel dynamical symmetry (MUSY), which is the intersection of the shell and cluster models. The previous one applies interactions of a D3h geometric symmetry [D. J. Marin-Lambarri et al., Phys. Rev. Lett. 113 (2014) 012502], while the latter one has a U(3) dynamical symmetry. The available data is in line with both descriptions.
Binary and Millisecond Pulsars.
Lorimer, Duncan R
2005-01-01
We review the main properties, demographics and applications of binary and millisecond radio pulsars. Our knowledge of these exciting objects has greatly increased in recent years, mainly due to successful surveys which have brought the known pulsar population to over 1700. There are now 80 binary and millisecond pulsars associated with the disk of our Galaxy, and a further 103 pulsars in 24 of the Galactic globular clusters. Recent highlights have been the discovery of the first ever double pulsar system and a recent flurry of discoveries in globular clusters, in particular Terzan 5. Supplementary material is available for this article at 10.12942/lrr-2005-7.
Kohonen, Pekka; Benfenati, Emilio; Bower, David; Ceder, Rebecca; Crump, Michael; Cross, Kevin; Grafström, Roland C; Healy, Lyn; Helma, Christoph; Jeliazkova, Nina; Jeliazkov, Vedrin; Maggioni, Silvia; Miller, Scott; Myatt, Glenn; Rautenberg, Michael; Stacey, Glyn; Willighagen, Egon; Wiseman, Jeff; Hardy, Barry
2013-01-01
The aim of the SEURAT-1 (Safety Evaluation Ultimately Replacing Animal Testing-1) research cluster, comprised of seven EU FP7 Health projects co-financed by Cosmetics Europe, is to generate a proof-of-concept to show how the latest technologies, systems toxicology and toxicogenomics can be combined to deliver a test replacement for repeated dose systemic toxicity testing on animals. The SEURAT-1 strategy is to adopt a mode-of-action framework to describe repeated dose toxicity, combining in vitro and in silico methods to derive predictions of in vivo toxicity responses. ToxBank is the cross-cluster infrastructure project whose activities include the development of a data warehouse to provide a web-accessible shared repository of research data and protocols, a physical compounds repository, reference or "gold compounds" for use across the cluster (available via wiki.toxbank.net), and a reference resource for biomaterials. Core technologies used in the data warehouse include the ISA-Tab universal data exchange format, REpresentational State Transfer (REST) web services, the W3C Resource Description Framework (RDF) and the OpenTox standards. We describe the design of the data warehouse based on cluster requirements, the implementation based on open standards, and finally the underlying concepts and initial results of a data analysis utilizing public data related to the gold compounds. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Gorzalczany, Marian B; Rudzinski, Filip
2017-06-07
This paper presents a generalization of self-organizing maps with 1-D neighborhoods (neuron chains) that can be effectively applied to complex cluster analysis problems. The essence of the generalization consists in introducing mechanisms that allow the neuron chain--during learning--to disconnect into subchains, to reconnect some of the subchains again, and to dynamically regulate the overall number of neurons in the system. These features enable the network--working in a fully unsupervised way (i.e., using unlabeled data without a predefined number of clusters)--to automatically generate collections of multiprototypes that are able to represent a broad range of clusters in data sets. First, the operation of the proposed approach is illustrated on some synthetic data sets. Then, this technique is tested using several real-life, complex, and multidimensional benchmark data sets available from the University of California at Irvine (UCI) Machine Learning repository and the Knowledge Extraction based on Evolutionary Learning data set repository. A sensitivity analysis of our approach to changes in control parameters and a comparative analysis with an alternative approach are also performed.
Data-Driven Packet Loss Estimation for Node Healthy Sensing in Decentralized Cluster
Fan, Hangyu; Wang, Huandong; Li, Yong
2018-01-01
Decentralized clustering of modern information technology is widely adopted in various fields these years. One of the main reason is the features of high availability and the failure-tolerance which can prevent the entire system form broking down by a failure of a single point. Recently, toolkits such as Akka are used by the public commonly to easily build such kind of cluster. However, clusters of such kind that use Gossip as their membership managing protocol and use link failure detecting mechanism to detect link failures cannot deal with the scenario that a node stochastically drops packets and corrupts the member status of the cluster. In this paper, we formulate the problem to be evaluating the link quality and finding a max clique (NP-Complete) in the connectivity graph. We then proposed an algorithm that consists of two models driven by data from application layer to respectively solving these two problems. Through simulations with statistical data and a real-world product, we demonstrate that our algorithm has a good performance. PMID:29360792
Brown, David K; Penkler, David L; Musyoka, Thommas M; Bishop, Özlem Tastan
2015-01-01
Complex computational pipelines are becoming a staple of modern scientific research. Often these pipelines are resource intensive and require days of computing time. In such cases, it makes sense to run them over high performance computing (HPC) clusters where they can take advantage of the aggregated resources of many powerful computers. In addition to this, researchers often want to integrate their workflows into their own web servers. In these cases, software is needed to manage the submission of jobs from the web interface to the cluster and then return the results once the job has finished executing. We have developed the Job Management System (JMS), a workflow management system and web interface for high performance computing (HPC). JMS provides users with a user-friendly web interface for creating complex workflows with multiple stages. It integrates this workflow functionality with the resource manager, a tool that is used to control and manage batch jobs on HPC clusters. As such, JMS combines workflow management functionality with cluster administration functionality. In addition, JMS provides developer tools including a code editor and the ability to version tools and scripts. JMS can be used by researchers from any field to build and run complex computational pipelines and provides functionality to include these pipelines in external interfaces. JMS is currently being used to house a number of bioinformatics pipelines at the Research Unit in Bioinformatics (RUBi) at Rhodes University. JMS is an open-source project and is freely available at https://github.com/RUBi-ZA/JMS.
Brown, David K.; Penkler, David L.; Musyoka, Thommas M.; Bishop, Özlem Tastan
2015-01-01
Complex computational pipelines are becoming a staple of modern scientific research. Often these pipelines are resource intensive and require days of computing time. In such cases, it makes sense to run them over high performance computing (HPC) clusters where they can take advantage of the aggregated resources of many powerful computers. In addition to this, researchers often want to integrate their workflows into their own web servers. In these cases, software is needed to manage the submission of jobs from the web interface to the cluster and then return the results once the job has finished executing. We have developed the Job Management System (JMS), a workflow management system and web interface for high performance computing (HPC). JMS provides users with a user-friendly web interface for creating complex workflows with multiple stages. It integrates this workflow functionality with the resource manager, a tool that is used to control and manage batch jobs on HPC clusters. As such, JMS combines workflow management functionality with cluster administration functionality. In addition, JMS provides developer tools including a code editor and the ability to version tools and scripts. JMS can be used by researchers from any field to build and run complex computational pipelines and provides functionality to include these pipelines in external interfaces. JMS is currently being used to house a number of bioinformatics pipelines at the Research Unit in Bioinformatics (RUBi) at Rhodes University. JMS is an open-source project and is freely available at https://github.com/RUBi-ZA/JMS. PMID:26280450
Spatial clustering of pixels of a multispectral image
Conger, James Lynn
2014-08-19
A method and system for clustering the pixels of a multispectral image is provided. A clustering system computes a maximum spectral similarity score for each pixel that indicates the similarity between that pixel and the most similar neighboring. To determine the maximum similarity score for a pixel, the clustering system generates a similarity score between that pixel and each of its neighboring pixels and then selects the similarity score that represents the highest similarity as the maximum similarity score. The clustering system may apply a filtering criterion based on the maximum similarity score so that pixels with similarity scores below a minimum threshold are not clustered. The clustering system changes the current pixel values of the pixels in a cluster based on an averaging of the original pixel values of the pixels in the cluster.
Abdolzadeh, Ahmad; Wang, Xing; Veneklaas, Erik J.; Lambers, Hans
2010-01-01
Background and Aims In some lupin species, phosphate deficiency induces cluster-root formation, which enhances P uptake by increasing root surface area and, more importantly, the release of root exudates which enhances P availability. Methods Three species of Lupinus, L. albus, L. atlanticus and L. micranthus, with inherently different relative growth rates were cultivated under hydroponics in a greenhouse at four phosphate concentrations (1, 10, 50 and 150 µm) to compare the role of internal P in regulating cluster-root formation. Key Results The highest growth rate was observed in L. atlanticus, followed by L. albus and L. micranthus. At 1 µm P, cluster-root formation was markedly induced in all three species. The highest P uptake and accumulation was observed in L. micranthus, followed by L. atlanticus and then L. albus. Inhibition of cluster-root formation was severe at 10 µm P in L. atlanticus, but occurred stepwise with increasing P concentration in the root medium in L. albus. Conclusions In L. atlanticus and L. albus cluster-root formation was suppressed by P treatments above 10 µm, indicating a P-inducible regulating system for cluster-root formation, as expected. By contrast, production of cluster roots in L. micranthus, in spite of a high internal P concentration, indicated a lower sensitivity to P status, which allowed P-toxicity symptoms to develop. PMID:20037142
Cohesiveness and hydrodynamic properties of young drinking water biofilms.
Abe, Yumiko; Skali-Lami, Salaheddine; Block, Jean-Claude; Francius, Grégory
2012-03-15
Drinking water biofilms are complex microbial systems mainly composed of clusters of different size and age. Atomic force microscopy (AFM) measurements were performed on 4, 8 and 12 weeks old biofilms in order to quantify the mechanical detachment shear stress of the clusters, to estimate the biofilm entanglement rate ξ. This AFM approach showed that the removal of the clusters occurred generally for mechanical shear stress of about 100 kPa only for clusters volumes greater than 200 μm3. This value appears 1000 times higher than hydrodynamic shear stress technically available meaning that the cleaning of pipe surfaces by water flushing remains always incomplete. To predict hydrodynamic detachment of biofilm clusters, a theoretical model has been developed regarding the averaging of elastic and viscous stresses in the cluster and by including the entanglement rate ξ. The results highlighted a slight increase of the detachment shear stress with age and also the dependence between the posting of clusters and their volume. Indeed, the experimental values of ξ allow predicting biofilm hydrodynamic detachment with same order of magnitude than was what reported in the literature. The apparent discrepancy between the mechanical and the hydrodynamic detachment is mainly due to the fact that AFM mechanical experiments are related to the clusters local properties whereas hydrodynamic measurements reflected the global properties of the whole biofilm. Copyright © 2011 Elsevier Ltd. All rights reserved.
Ergatis: a web interface and scalable software system for bioinformatics workflows
Orvis, Joshua; Crabtree, Jonathan; Galens, Kevin; Gussman, Aaron; Inman, Jason M.; Lee, Eduardo; Nampally, Sreenath; Riley, David; Sundaram, Jaideep P.; Felix, Victor; Whitty, Brett; Mahurkar, Anup; Wortman, Jennifer; White, Owen; Angiuoli, Samuel V.
2010-01-01
Motivation: The growth of sequence data has been accompanied by an increasing need to analyze data on distributed computer clusters. The use of these systems for routine analysis requires scalable and robust software for data management of large datasets. Software is also needed to simplify data management and make large-scale bioinformatics analysis accessible and reproducible to a wide class of target users. Results: We have developed a workflow management system named Ergatis that enables users to build, execute and monitor pipelines for computational analysis of genomics data. Ergatis contains preconfigured components and template pipelines for a number of common bioinformatics tasks such as prokaryotic genome annotation and genome comparisons. Outputs from many of these components can be loaded into a Chado relational database. Ergatis was designed to be accessible to a broad class of users and provides a user friendly, web-based interface. Ergatis supports high-throughput batch processing on distributed compute clusters and has been used for data management in a number of genome annotation and comparative genomics projects. Availability: Ergatis is an open-source project and is freely available at http://ergatis.sourceforge.net Contact: jorvis@users.sourceforge.net PMID:20413634
uvbyβ photometry of early type open cluster and field stars
NASA Astrophysics Data System (ADS)
Handler, G.
2011-04-01
Context. The β Cephei stars and slowly pulsating B (SPB) stars are massive main sequence variables. The strength of their pulsational driving strongly depends on the opacity of iron-group elements. As many of those stars naturally occur in young open clusters, whose metallicities can be determined in several fundamental ways, it is logical to study the incidence of pulsation in several young open clusters. Aims: To provide the foundation for such an investigation, Strömgren-Crawford uvbyβ photometry of open cluster target stars was carried out to determine effective temperatures, luminosities, and therefore cluster memberships. Methods: In the course of three observing runs, uvbyβ photometry for 168 target stars was acquired and transformed into the standard system by measurements of 117 standard stars. The list of target stars also included some known cluster and field β Cephei stars, as well as β Cephei and SPB candidates that are targets of the asteroseismic part of the Kepler satellite mission. Results: The uvbyβ photometric results are presented. The data are shown to be on the standard system, and the properties of the target stars are discussed: 140 of these are indeed OB stars, a total of 101 targets lie within the β Cephei and/or SPB star instability strips, and each investigated cluster contains such potential pulsators. Conclusions: These measurements will be taken advantage of in a number of subsequent publications. Based on measurements obtained at McDonald Observatory of the University of Texas at Austin.Tables 3-6 are only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/528/A148
Clustering execution in a processing system to increase power savings
Bose, Pradip; Buyuktosunoglu, Alper; Jacobson, Hans M.; Vega, Augusto J.
2018-03-20
Embodiments relate to clustering execution in a processing system. An aspect includes accessing a control flow graph that defines a data dependency and an execution sequence of a plurality of tasks of an application that executes on a plurality of system components. The execution sequence of the tasks in the control flow graph is modified as a clustered control flow graph that clusters active and idle phases of a system component while maintaining the data dependency. The clustered control flow graph is sent to an operating system, where the operating system utilizes the clustered control flow graph for scheduling the tasks.
Ding, Jiarui; Shah, Sohrab; Condon, Anne
2016-01-01
Motivation: Many biological data processing problems can be formalized as clustering problems to partition data points into sensible and biologically interpretable groups. Results: This article introduces densityCut, a novel density-based clustering algorithm, which is both time- and space-efficient and proceeds as follows: densityCut first roughly estimates the densities of data points from a K-nearest neighbour graph and then refines the densities via a random walk. A cluster consists of points falling into the basin of attraction of an estimated mode of the underlining density function. A post-processing step merges clusters and generates a hierarchical cluster tree. The number of clusters is selected from the most stable clustering in the hierarchical cluster tree. Experimental results on ten synthetic benchmark datasets and two microarray gene expression datasets demonstrate that densityCut performs better than state-of-the-art algorithms for clustering biological datasets. For applications, we focus on the recent cancer mutation clustering and single cell data analyses, namely to cluster variant allele frequencies of somatic mutations to reveal clonal architectures of individual tumours, to cluster single-cell gene expression data to uncover cell population compositions, and to cluster single-cell mass cytometry data to detect communities of cells of the same functional states or types. densityCut performs better than competing algorithms and is scalable to large datasets. Availability and Implementation: Data and the densityCut R package is available from https://bitbucket.org/jerry00/densitycut_dev. Contact: condon@cs.ubc.ca or sshah@bccrc.ca or jiaruid@cs.ubc.ca Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153661
The gamma-ray pulsar population of globular clusters: implications for the GeV excess
NASA Astrophysics Data System (ADS)
Hooper, Dan; Linden, Tim
2016-08-01
It has been suggested that the GeV excess, observed from the region surrounding the Galactic Center, might originate from a population of millisecond pulsars that formed in globular clusters. With this in mind, we employ the publicly available Fermi data to study the gamma-ray emission from 157 globular clusters, identifying a statistically significant signal from 25 of these sources (ten of which are not found in existing gamma-ray catalogs). We combine these observations with the predicted pulsar formation rate based on the stellar encounter rate of each globular cluster to constrain the gamma-ray luminosity function of millisecond pulsars in the Milky Way's globular cluster system. We find that this pulsar population exhibits a luminosity function that is quite similar to those millisecond pulsars observed in the field of the Milky Way (i.e. the thick disk). After pulsars are expelled from a globular cluster, however, they continue to lose rotational kinetic energy and become less luminous, causing their luminosity function to depart from the steady-state distribution. Using this luminosity function and a model for the globular cluster disruption rate, we show that millisecond pulsars born in globular clusters can account for only a few percent or less of the observed GeV excess. Among other challenges, scenarios in which the entire GeV excess is generated from such pulsars are in conflict with the observed mass of the Milky Way's Central Stellar Cluster.
The gamma-ray pulsar population of globular clusters: implications for the GeV excess
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hooper, Dan; Linden, Tim, E-mail: dhooper@fnal.gov, E-mail: linden.70@osu.edu
It has been suggested that the GeV excess, observed from the region surrounding the Galactic Center, might originate from a population of millisecond pulsars that formed in globular clusters. With this in mind, we employ the publicly available Fermi data to study the gamma-ray emission from 157 globular clusters, identifying a statistically significant signal from 25 of these sources (ten of which are not found in existing gamma-ray catalogs). We combine these observations with the predicted pulsar formation rate based on the stellar encounter rate of each globular cluster to constrain the gamma-ray luminosity function of millisecond pulsars in themore » Milky Way's globular cluster system. We find that this pulsar population exhibits a luminosity function that is quite similar to those millisecond pulsars observed in the field of the Milky Way (i.e. the thick disk). After pulsars are expelled from a globular cluster, however, they continue to lose rotational kinetic energy and become less luminous, causing their luminosity function to depart from the steady-state distribution. Using this luminosity function and a model for the globular cluster disruption rate, we show that millisecond pulsars born in globular clusters can account for only a few percent or less of the observed GeV excess. Among other challenges, scenarios in which the entire GeV excess is generated from such pulsars are in conflict with the observed mass of the Milky Way's Central Stellar Cluster.« less
The gamma-ray pulsar population of globular clusters: Implications for the GeV excess
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hooper, Dan; Linden, Tim
In this study, it has been suggested that the GeV excess, observed from the region surrounding the Galactic Center, might originate from a population of millisecond pulsars that formed in globular clusters. With this in mind, we employ the publicly available Fermi data to study the gamma-ray emission from 157 globular clusters, identifying a statistically significant signal from 25 of these sources (ten of which are not found in existing gamma-ray catalogs). We combine these observations with the predicted pulsar formation rate based on the stellar encounter rate of each globular cluster to constrain the gamma-ray luminosity function of millisecondmore » pulsars in the Milky Way's globular cluster system. We find that this pulsar population exhibits a luminosity function that is quite similar to those millisecond pulsars observed in the field of the Milky Way (i.e. the thick disk). After pulsars are expelled from a globular cluster, however, they continue to lose rotational kinetic energy and become less luminous, causing their luminosity function to depart from the steady-state distribution. Using this luminosity function and a model for the globular cluster disruption rate, we show that millisecond pulsars born in globular clusters can account for only a few percent or less of the observed GeV excess. Among other challenges, scenarios in which the entire GeV excess is generated from such pulsars are in conflict with the observed mass of the Milky Way's Central Stellar Cluster.« less
The gamma-ray pulsar population of globular clusters: Implications for the GeV excess
Hooper, Dan; Linden, Tim
2016-08-09
In this study, it has been suggested that the GeV excess, observed from the region surrounding the Galactic Center, might originate from a population of millisecond pulsars that formed in globular clusters. With this in mind, we employ the publicly available Fermi data to study the gamma-ray emission from 157 globular clusters, identifying a statistically significant signal from 25 of these sources (ten of which are not found in existing gamma-ray catalogs). We combine these observations with the predicted pulsar formation rate based on the stellar encounter rate of each globular cluster to constrain the gamma-ray luminosity function of millisecondmore » pulsars in the Milky Way's globular cluster system. We find that this pulsar population exhibits a luminosity function that is quite similar to those millisecond pulsars observed in the field of the Milky Way (i.e. the thick disk). After pulsars are expelled from a globular cluster, however, they continue to lose rotational kinetic energy and become less luminous, causing their luminosity function to depart from the steady-state distribution. Using this luminosity function and a model for the globular cluster disruption rate, we show that millisecond pulsars born in globular clusters can account for only a few percent or less of the observed GeV excess. Among other challenges, scenarios in which the entire GeV excess is generated from such pulsars are in conflict with the observed mass of the Milky Way's Central Stellar Cluster.« less
pong: fast analysis and visualization of latent clusters in population genetic data.
Behr, Aaron A; Liu, Katherine Z; Liu-Fang, Gracie; Nakka, Priyanka; Ramachandran, Sohini
2016-09-15
A series of methods in population genetics use multilocus genotype data to assign individuals membership in latent clusters. These methods belong to a broad class of mixed-membership models, such as latent Dirichlet allocation used to analyze text corpora. Inference from mixed-membership models can produce different output matrices when repeatedly applied to the same inputs, and the number of latent clusters is a parameter that is often varied in the analysis pipeline. For these reasons, quantifying, visualizing, and annotating the output from mixed-membership models are bottlenecks for investigators across multiple disciplines from ecology to text data mining. We introduce pong, a network-graphical approach for analyzing and visualizing membership in latent clusters with a native interactive D3.js visualization. pong leverages efficient algorithms for solving the Assignment Problem to dramatically reduce runtime while increasing accuracy compared with other methods that process output from mixed-membership models. We apply pong to 225 705 unlinked genome-wide single-nucleotide variants from 2426 unrelated individuals in the 1000 Genomes Project, and identify previously overlooked aspects of global human population structure. We show that pong outpaces current solutions by more than an order of magnitude in runtime while providing a customizable and interactive visualization of population structure that is more accurate than those produced by current tools. pong is freely available and can be installed using the Python package management system pip. pong's source code is available at https://github.com/abehr/pong aaron_behr@alumni.brown.edu or sramachandran@brown.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
Cluster and Sporadic Cases of Herbaspirillum Species Infections in Patients With Cancer
Chemaly, Roy F.; Dantes, Raymund; Shah, Dimpy P.; Shah, Pankil K.; Pascoe, Neil; Ariza-Heredia, Ella; Perego, Cheryl; Nguyen, Duc B.; Nguyen, Kim; Modarai, Farhad; Moulton-Meissner, Heather; Noble-Wang, Judith; Tarrand, Jeffrey J.; LiPuma, John J.; Guh, Alice Y.; MacCannell, Tara; Raad, Issam; Mulanovich, Victor
2015-01-01
Background. Herbaspirillum species are gram-negative Betaproteobacteria that inhabit the rhizosphere. We investigated a potential cluster of hospital-based Herbaspirillum species infections. Methods. Cases were defined as Herbaspirillum species isolated from a patient in our comprehensive cancer center between 1 January 2006 and 15 October 2013. Case finding was performed by reviewing isolates initially identified as Burkholderia cepacia susceptible to all antibiotics tested, and 16S ribosomal DNA sequencing of available isolates to confirm their identity. Pulsed-field gel electrophoresis (PFGE) was performed to test genetic relatedness. Facility observations, infection prevention assessments, and environmental sampling were performed to investigate potential sources of Herbaspirillum species. Results. Eight cases of Herbaspirillum species were identified. Isolates from the first 5 clustered cases were initially misidentified as B. cepacia, and available isolates from 4 of these cases were indistinguishable. The 3 subsequent cases were identified by prospective surveillance and had different PFGE patterns. All but 1 case-patient had bloodstream infections, and 6 presented with sepsis. Underlying diagnoses included solid tumors (3), leukemia (3), lymphoma (1), and aplastic anemia (1). Herbaspirillum species infections were hospital-onset in 5 patients and community-onset in 3. All symptomatic patients were treated with intravenous antibiotics, and their infections resolved. No environmental source or common mechanism of acquisition was identified. Conclusions. This is the first report of a hospital-based cluster of Herbaspirillum species infections. Herbaspirillum species are capable of causing bacteremia and sepsis in immunocompromised patients. Herbaspirillum species can be misidentified as Burkholderia cepacia by commercially available microbial identification systems. PMID:25216687
Hardware accelerator design for change detection in smart camera
NASA Astrophysics Data System (ADS)
Singh, Sanjay; Dunga, Srinivasa Murali; Saini, Ravi; Mandal, A. S.; Shekhar, Chandra; Chaudhury, Santanu; Vohra, Anil
2011-10-01
Smart Cameras are important components in Human Computer Interaction. In any remote surveillance scenario, smart cameras have to take intelligent decisions to select frames of significant changes to minimize communication and processing overhead. Among many of the algorithms for change detection, one based on clustering based scheme was proposed for smart camera systems. However, such an algorithm could achieve low frame rate far from real-time requirements on a general purpose processors (like PowerPC) available on FPGAs. This paper proposes the hardware accelerator capable of detecting real time changes in a scene, which uses clustering based change detection scheme. The system is designed and simulated using VHDL and implemented on Xilinx XUP Virtex-IIPro FPGA board. Resulted frame rate is 30 frames per second for QVGA resolution in gray scale.
Cluster-Glass Phase in Pyrochlore X Y Antiferromagnets with Quenched Disorder
NASA Astrophysics Data System (ADS)
Andrade, Eric C.; Hoyos, José A.; Rachel, Stephan; Vojta, Matthias
2018-03-01
We study the impact of quenched disorder (random exchange couplings or site dilution) on easy-plane pyrochlore antiferromagnets. In the clean system, order by disorder selects a magnetically ordered state from a classically degenerate manifold. In the presence of randomness, however, different orders can be chosen locally depending on details of the disorder configuration. Using a combination of analytical considerations and classical Monte Carlo simulations, we argue that any long-range-ordered magnetic state is destroyed beyond a critical level of randomness where the system breaks into magnetic domains due to random exchange anisotropies, becoming, therefore, a glass of spin clusters, in accordance with the available experimental data. These random anisotropies originate from off-diagonal exchange couplings in the microscopic Hamiltonian, establishing their relevance to other magnets with strong spin-orbit coupling.
McKenna, J.E.
2003-01-01
The biosphere is filled with complex living patterns and important questions about biodiversity and community and ecosystem ecology are concerned with structure and function of multispecies systems that are responsible for those patterns. Cluster analysis identifies discrete groups within multivariate data and is an effective method of coping with these complexities, but often suffers from subjective identification of groups. The bootstrap testing method greatly improves objective significance determination for cluster analysis. The BOOTCLUS program makes cluster analysis that reliably identifies real patterns within a data set more accessible and easier to use than previously available programs. A variety of analysis options and rapid re-analysis provide a means to quickly evaluate several aspects of a data set. Interpretation is influenced by sampling design and a priori designation of samples into replicate groups, and ultimately relies on the researcher's knowledge of the organisms and their environment. However, the BOOTCLUS program provides reliable, objectively determined groupings of multivariate data.
Southern Clusters for Standardizing CCD Photometry
NASA Astrophysics Data System (ADS)
Moon, T. T.
2017-06-01
Standardizing photometric measurements typically involves undertaking all-sky photometry. This can be laborious and time-consuming and, for CCD photometry, particularly challenging. Transforming photometry to a standard system is, however, a crucial step when routinely measuring variable stars, as it allows photoelectric measurements from different observers to be combined. For observers in the northern hemisphere, standardized UBVRI values of stars in open clusters such as M67 and NGC 7790 have been established, greatly facilitating quick and accurate transformation of CCD measurements. Recently the AAVSO added the cluster NGC 3532 for southern hemisphere observers to similarly standardize their photometry. The availability of NGC 3532 standards was announced on the AAVSO Variable Star Observing, Photometry forum on 27 October 2016. Published photometry, along with some new measurements by the author, provide a means of checking these NGC 3532 standards which were determined through the AAVSO's Bright Star Monitor (BSM) program (see: https://www.aavso.org/aavsonet-epoch-photometry-database). New measurements of selected stars in the open clusters M25 and NGC 6067 are also included.
Federal Register 2010, 2011, 2012, 2013, 2014
2010-04-07
... economically dynamic regional innovation cluster focused on energy efficient buildings technologies and systems... DEPARTMENT OF ENERGY Energy Efficient Building Systems Regional Innovation Cluster Initiative... February 8, 2010, titled the Energy Efficient Building Systems Regional Innovation Cluster Initiative. A...
Yin, Shi; Bernstein, Elliot R
2017-10-05
Iron sulfur cluster anions (FeS) m - (m = 2-8) are studied by photoelectron spectroscopy (PES) at 3.492 eV (355 nm) and 4.661 eV (266 nm) photon energies, and by density functional theory (DFT) calculations. The most probable structures and ground state spin multiplicities for (FeS) m - (m = 2-8) clusters are tentatively assigned through a comparison of their theoretical and experiment first vertical detachment energy (VDE) values. Many spin states lie within 0.5 eV of the ground spin state for the larger (FeS) m - (m ≥ 4) clusters. Theoretical VDEs of these low lying spin states are in good agreement with the experimental VDE values. Therefore, multiple spin states of each of these iron sulfur cluster anions probably coexist under the current experimental conditions. Such available multiple spin states must be considered when evaluating the properties and behavior of these iron sulfur clusters in real chemical and biological systems. The experimental first VDEs of (FeS) m - (m = 1-8) clusters are observed to change with the cluster size (number m). The first VDE trends noted can be related to the different properties of the highest singly occupied molecular orbitals (NBO, HSOMOs) of each cluster anion. The changing nature of the NBO/HSOMO of these (FeS) m - (m = 1-8) clusters from a p orbital on S, to a d orbital on Fe, and to an Fe-Fe bonding orbital is probably responsible for the observed increasing trend for their first VDEs with respect to m.
Clustering execution in a processing system to increase power savings
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bose, Pradip; Buyuktosunoglu, Alper; Jacobson, Hans M.
Embodiments relate to clustering execution in a processing system. An aspect includes accessing a control flow graph that defines a data dependency and an execution sequence of a plurality of tasks of an application that executes on a plurality of system components. The execution sequence of the tasks in the control flow graph is modified as a clustered control flow graph that clusters active and idle phases of a system component while maintaining the data dependency. The clustered control flow graph is sent to an operating system, where the operating system utilizes the clustered control flow graph for scheduling themore » tasks.« less
Inputs to Vocational-Technical Education from Occupational Research.
ERIC Educational Resources Information Center
Christal, Raymond E.
The Air Force has developed a data bank which classifies job descriptions into occupational clusters. A retrieval system, also recently developed computes and publishes a consolidated description of the work being performed by any group which can be defined in terms of the background information. Other retrieval programs are available which enable…
Steganalysis feature improvement using expectation maximization
NASA Astrophysics Data System (ADS)
Rodriguez, Benjamin M.; Peterson, Gilbert L.; Agaian, Sos S.
2007-04-01
Images and data files provide an excellent opportunity for concealing illegal or clandestine material. Currently, there are over 250 different tools which embed data into an image without causing noticeable changes to the image. From a forensics perspective, when a system is confiscated or an image of a system is generated the investigator needs a tool that can scan and accurately identify files suspected of containing malicious information. The identification process is termed the steganalysis problem which focuses on both blind identification, in which only normal images are available for training, and multi-class identification, in which both the clean and stego images at several embedding rates are available for training. In this paper an investigation of a clustering and classification technique (Expectation Maximization with mixture models) is used to determine if a digital image contains hidden information. The steganalysis problem is for both anomaly detection and multi-class detection. The various clusters represent clean images and stego images with between 1% and 10% embedding percentage. Based on the results it is concluded that the EM classification technique is highly suitable for both blind detection and the multi-class problem.
Reanalysis of 24 Nearby Open Clusters using Gaia data
NASA Astrophysics Data System (ADS)
Yen, Steffi X.; Reffert, Sabine; Röser, Siegfried; Schilbach, Elena; Kharchenko, Nina V.; Piskunov, Anatoly E.
2018-04-01
We have developed a fully automated cluster characterization pipeline, which simultaneously determines cluster membership and fits the fundamental cluster parameters: distance, reddening, and age. We present results for 24 established clusters and compare them to literature values. Given the large amount of stellar data for clusters available from Gaia DR2 in 2018, this pipeline will be beneficial to analyzing the parameters of open clusters in our Galaxy.
A comparison of queueing, cluster and distributed computing systems
NASA Technical Reports Server (NTRS)
Kaplan, Joseph A.; Nelson, Michael L.
1993-01-01
Using workstation clusters for distributed computing has become popular with the proliferation of inexpensive, powerful workstations. Workstation clusters offer both a cost effective alternative to batch processing and an easy entry into parallel computing. However, a number of workstations on a network does not constitute a cluster. Cluster management software is necessary to harness the collective computing power. A variety of cluster management and queuing systems are compared: Distributed Queueing Systems (DQS), Condor, Load Leveler, Load Balancer, Load Sharing Facility (LSF - formerly Utopia), Distributed Job Manager (DJM), Computing in Distributed Networked Environments (CODINE), and NQS/Exec. The systems differ in their design philosophy and implementation. Based on published reports on the different systems and conversations with the system's developers and vendors, a comparison of the systems are made on the integral issues of clustered computing.
CORM: An R Package Implementing the Clustering of Regression Models Method for Gene Clustering
Shi, Jiejun; Qin, Li-Xuan
2014-01-01
We report a new R package implementing the clustering of regression models (CORM) method for clustering genes using gene expression data and provide data examples illustrating each clustering function in the package. The CORM package is freely available at CRAN from http://cran.r-project.org. PMID:25452684
fluff: exploratory analysis and visualization of high-throughput sequencing data
Georgiou, Georgios
2016-01-01
Summary. In this article we describe fluff, a software package that allows for simple exploration, clustering and visualization of high-throughput sequencing data mapped to a reference genome. The package contains three command-line tools to generate publication-quality figures in an uncomplicated manner using sensible defaults. Genome-wide data can be aggregated, clustered and visualized in a heatmap, according to different clustering methods. This includes a predefined setting to identify dynamic clusters between different conditions or developmental stages. Alternatively, clustered data can be visualized in a bandplot. Finally, fluff includes a tool to generate genomic profiles. As command-line tools, the fluff programs can easily be integrated into standard analysis pipelines. The installation is straightforward and documentation is available at http://fluff.readthedocs.org. Availability. fluff is implemented in Python and runs on Linux. The source code is freely available for download at https://github.com/simonvh/fluff. PMID:27547532
Kamali, Tahereh; Stashuk, Daniel
2016-10-01
Robust and accurate segmentation of brain white matter (WM) fiber bundles assists in diagnosing and assessing progression or remission of neuropsychiatric diseases such as schizophrenia, autism and depression. Supervised segmentation methods are infeasible in most applications since generating gold standards is too costly. Hence, there is a growing interest in designing unsupervised methods. However, most conventional unsupervised methods require the number of clusters be known in advance which is not possible in most applications. The purpose of this study is to design an unsupervised segmentation algorithm for brain white matter fiber bundles which can automatically segment fiber bundles using intrinsic diffusion tensor imaging data information without considering any prior information or assumption about data distributions. Here, a new density based clustering algorithm called neighborhood distance entropy consistency (NDEC), is proposed which discovers natural clusters within data by simultaneously utilizing both local and global density information. The performance of NDEC is compared with other state of the art clustering algorithms including chameleon, spectral clustering, DBSCAN and k-means using Johns Hopkins University publicly available diffusion tensor imaging data. The performance of NDEC and other employed clustering algorithms were evaluated using dice ratio as an external evaluation criteria and density based clustering validation (DBCV) index as an internal evaluation metric. Across all employed clustering algorithms, NDEC obtained the highest average dice ratio (0.94) and DBCV value (0.71). NDEC can find clusters with arbitrary shapes and densities and consequently can be used for WM fiber bundle segmentation where there is no distinct boundary between various bundles. NDEC may also be used as an effective tool in other pattern recognition and medical diagnostic systems in which discovering natural clusters within data is a necessity. Copyright © 2016 Elsevier B.V. All rights reserved.
Close Encounters of the Stellar Kind
NASA Astrophysics Data System (ADS)
2003-07-01
NASA's Chandra X-ray Observatory has confirmed that close encounters between stars form X-ray emitting, double-star systems in dense globular star clusters. These X-ray binaries have a different birth process than their cousins outside globular clusters, and should have a profound influence on the cluster's evolution. A team of scientists led by David Pooley of the Massachusetts Institute of Technology in Cambridge took advantage of Chandra's unique ability to precisely locate and resolve individual sources to determine the number of X-ray sources in 12 globular clusters in our Galaxy. Most of the sources are binary systems containing a collapsed star such as a neutron star or a white dwarf star that is pulling matter off a normal, Sun-like companion star. "We found that the number of X-ray binaries is closely correlated with the rate of encounters between stars in the clusters," said Pooley. "Our conclusion is that the binaries are formed as a consequence of these encounters. It is a case of nurture not nature." A similar study led by Craig Heinke of the Harvard-Smithsonian Center for Astrophysics in Cambridge, Mass. confirmed this conclusion, and showed that roughly 10 percent of these X-ray binary systems contain neutron stars. Most of these neutron stars are usually quiet, spending less than 10% of their time actively feeding from their companion. NGC 7099 NGC 7099 A globular cluster is a spherical collection of hundreds of thousands or even millions of stars buzzing around each other in a gravitationally-bound stellar beehive that is about a hundred light years in diameter. The stars in a globular cluster are often only about a tenth of a light year apart. For comparison, the nearest star to the Sun, Proxima Centauri, is 4.2 light years away. With so many stars moving so close together, interactions between stars occur frequently in globular clusters. The stars, while rarely colliding, do get close enough to form binary star systems or cause binary stars to exchange partners in intricate dances. The data suggest that X-ray binary systems are formed in dense clusters known as globular clusters about once a day somewhere in the universe. Observations by NASA's Uhuru X-ray satellite in the 1970's showed that globular clusters seemed to contain a disproportionately large number of X-ray binary sources compared to the Galaxy as a whole. Normally only one in a billion stars is a member of an X-ray binary system containing a neutron star, whereas in globular clusters, the fraction is more like one in a million. The present research confirms earlier suggestions that the chance of forming an X-ray binary system is dramatically increased by the congestion in a globular cluster. Under these conditions two processes, known as three-star exchange collisions, and tidal captures, can lead to a thousandfold increase in the number of X-ray sources in globular clusters. 47 Tucanae 47 Tucanae In an exchange collision, a lone neutron star encounters a pair of ordinary stars. The intense gravity of the neutron star can induce the most massive ordinary star to "change partners," and pair up with the neutron star while ejecting the lighter star. A neutron star could also make a grazing collision with a single normal star, and the intense gravity of the neutron star could distort the gravity of the normal star in the process. The energy lost in the distortion, could prevent the normal star from escaping from the neutron star, leading to what is called tidal capture. "In addition to solving a long-standing mystery, Chandra data offer an opportunity for a deeper understanding of globular cluster evolution," said Heinke. "For example, the energy released in the formation of close binary systems could keep the central parts of the cluster from collapsing to form a massive black hole." NASA's Marshall Space Flight Center, Huntsville, Ala., manages the Chandra program for the Office of Space Science, NASA Headquarters, Washington. Northrop Grumman of Redondo Beach, Calif., formerly TRW, Inc., was the prime development contractor for the observatory. The Smithsonian Astrophysical Observatory controls science and flight operations from the Chandra X-ray Center in Cambridge, Mass. The image and additional information are available at: http://chandra.harvard.edu and http://chandra.nasa.gov
Density-functional theory applied to d- and f-electron systems
NASA Astrophysics Data System (ADS)
Wu, Xueyuan
Density functional theory (DFT) has been applied to study the electronic and geometric structures of prototype d- and f-electron systems. For the d-electron system, all electron DFT with gradient corrections to the exchange and correlation functionals has been used to investigate the properties of small neutral and cationic vanadium clusters. Results are in good agreement with available experimental and other theoretical data. For the f-electron system, a hybrid DFT, namely, B3LYP (Becke's 3-parameter hybrid functional using the correlation functional of Lee, Yang and Parr) with relativistic effective core potentials and cluster models has been applied to investigate the nature of chemical bonding of both the bulk and the surfaces of plutonium monoxide and dioxide. Using periodic models, the electronic and geometric structures of PuO2 and its (110) surface, as well as water adsorption on this surface have also been investigated using DFT in both local density approximation (LDA) and generalized gradient approximation (GGA) formalisms.
Cluster dynamics and cluster size distributions in systems of self-propelled particles
NASA Astrophysics Data System (ADS)
Peruani, F.; Schimansky-Geier, L.; Bär, M.
2010-12-01
Systems of self-propelled particles (SPP) interacting by a velocity alignment mechanism in the presence of noise exhibit rich clustering dynamics. Often, clusters are responsible for the distribution of (local) information in these systems. Here, we investigate the properties of individual clusters in SPP systems, in particular the asymmetric spreading behavior of clusters with respect to their direction of motion. In addition, we formulate a Smoluchowski-type kinetic model to describe the evolution of the cluster size distribution (CSD). This model predicts the emergence of steady-state CSDs in SPP systems. We test our theoretical predictions in simulations of SPP with nematic interactions and find that our simple kinetic model reproduces qualitatively the transition to aggregation observed in simulations.
MOCCA-SURVEY Database I: Galactic Globular Clusters Harbouring a Black Hole Subsystem
NASA Astrophysics Data System (ADS)
Askar, Abbas; Sedda, Manuel Arca; Giersz, Mirek
2018-05-01
There have been increasing theoretical speculations and observational indications that certain globular clusters (GCs) could contain a sizeable population of stellar mass black holes (BHs). In this paper, we shortlist at least 29 Galactic GCs that could be hosting a subsystem of BHs (BHS). In a companion paper, we analysed results from a wide array of GC models (simulated with the MOCCA code for cluster simulations) that retained few tens to several hundreds of BHs at 12 Gyr and showed that the properties of the BHS in those GCs correlate with the GC's observable properties. Building on those results, we use available observational properties of 140 Galactic GCs to identify 29 GCs that could potentially be harbouring up to a few hundreds of BHs. Utilizing observational properties and theoretical scaling relations, we estimate the density, size and mass of the BHS in these GCs. We also calculate the total number of BHs and the fraction of BHs contained in a binary system for our shortlisted Galactic GCs. Additionally, we mention other Galactic GCs that could also contain significant number of single BHs or BHs in binary systems.
Li, Jun; Tai, Cui; Deng, Zixin; Zhong, Weihong; He, Yongqun; Ou, Hong-Yu
2017-01-10
VRprofile is a Web server that facilitates rapid investigation of virulence and antibiotic resistance genes, as well as extends these trait transfer-related genetic contexts, in newly sequenced pathogenic bacterial genomes. The used backend database MobilomeDB was firstly built on sets of known gene cluster loci of bacterial type III/IV/VI/VII secretion systems and mobile genetic elements, including integrative and conjugative elements, prophages, class I integrons, IS elements and pathogenicity/antibiotic resistance islands. VRprofile is thus able to co-localize the homologs of these conserved gene clusters using HMMer or BLASTp searches. With the integration of the homologous gene cluster search module with a sequence composition module, VRprofile has exhibited better performance for island-like region predictions than the other widely used methods. In addition, VRprofile also provides an integrated Web interface for aligning and visualizing identified gene clusters with MobilomeDB-archived gene clusters, or a variety set of bacterial genomes. VRprofile might contribute to meet the increasing demands of re-annotations of bacterial variable regions, and aid in the real-time definitions of disease-relevant gene clusters in pathogenic bacteria of interest. VRprofile is freely available at http://bioinfo-mml.sjtu.edu.cn/VRprofile. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Signatures of α clustering in ultrarelativistic collisions with light nuclei
NASA Astrophysics Data System (ADS)
Rybczyński, Maciej; Piotrowska, Milena; Broniowski, Wojciech
2018-03-01
We explore possible observable signatures of α clustering of light nuclei in ultrarelativistic nuclear collisions involving Be,97, 12C, and 16O. The clustering leads to specific spatial correlations of the nucleon distributions in the ground state, which are manifest in the earliest stage of the ultrahigh energy reaction. The formed initial state of the fireball is sensitive to these correlations, and the effect influences, after the collective evolution of the system, the hadron production in the final stage. Specifically, we study effects on the harmonic flow in collisions of light clustered nuclei with a heavy target (208Pb), showing that measures of the elliptic flow are sensitive to clusterization in Be,97, whereas triangular flow is sensitive to clusterization in 12C and 16O. Specific predictions are made for model collisions at energies available at the CERN Super Proton Synchrotron. In another exploratory development we also examine proton-beryllium collisions, where the 3 /2- ground state of Be,97 nuclei is polarized by an external magnetic field. Clusterization leads to multiplicity distributions of participant nucleons which depend on the orientation of the polarization with respect to the collision axis, as well as on the magnetic number of the state. The obtained effects on multiplicities reach a factor of a few for collisions with a large number of participant nucleons.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dowell, Jessica L.; Rhode, Katherine L.; Bridges, Terry J.
2014-06-01
We have obtained radial velocity measurements for 51 new globular clusters around the Sombrero galaxy. These measurements were obtained using spectroscopic observations from the AAOmega spectrograph on the Anglo-Australian Telescope and the Hydra spectrograph at WIYN. Combining our own past measurements and velocity measurements obtained from the literature, we have constructed a large database of radial velocities that contains a total of 360 confirmed globular clusters. Previous studies' analyses of the kinematics and mass profile of the Sombrero globular cluster system have been constrained to the inner ∼9' (∼24 kpc or ∼5R{sub e} ), but our new measurements have increasedmore » the radial coverage of the data, allowing us to determine the kinematic properties of M104 out to ∼15' (∼41 kpc or ∼9R{sub e} ). We use our set of radial velocities to study the GC system kinematics and to determine the mass profile and V-band mass-to-light profile of the galaxy. We find that M/L{sub V} increases from 4.5 at the center to a value of 20.9 at 41 kpc (∼9R{sub e} or 15'), which implies that the dark matter halo extends to the edge of our available data set. We compare our mass profile at 20 kpc (∼4R{sub e} or ∼7.'4) to the mass computed from X-ray data and find good agreement. We also use our data to look for rotation in the globular cluster system as a whole, as well as in the red and blue subpopulations. We find no evidence for significant rotation in any of these samples.« less
NASA Astrophysics Data System (ADS)
Chen, Xiuhong; Huang, Xianglei; Jiao, Chaoyi; Flanner, Mark G.; Raeker, Todd; Palen, Brock
2017-01-01
The suites of numerical models used for simulating climate of our planet are usually run on dedicated high-performance computing (HPC) resources. This study investigates an alternative to the usual approach, i.e. carrying out climate model simulations on commercially available cloud computing environment. We test the performance and reliability of running the CESM (Community Earth System Model), a flagship climate model in the United States developed by the National Center for Atmospheric Research (NCAR), on Amazon Web Service (AWS) EC2, the cloud computing environment by Amazon.com, Inc. StarCluster is used to create virtual computing cluster on the AWS EC2 for the CESM simulations. The wall-clock time for one year of CESM simulation on the AWS EC2 virtual cluster is comparable to the time spent for the same simulation on a local dedicated high-performance computing cluster with InfiniBand connections. The CESM simulation can be efficiently scaled with the number of CPU cores on the AWS EC2 virtual cluster environment up to 64 cores. For the standard configuration of the CESM at a spatial resolution of 1.9° latitude by 2.5° longitude, increasing the number of cores from 16 to 64 reduces the wall-clock running time by more than 50% and the scaling is nearly linear. Beyond 64 cores, the communication latency starts to outweigh the benefit of distributed computing and the parallel speedup becomes nearly unchanged.
Patel, Dhilon S; Park, Soohyung; Wu, Emilia L; Yeom, Min Sun; Widmalm, Göran; Klauda, Jeffery B; Im, Wonpil
2016-11-01
Gangliosides are a class of glycosphingolipids (GSLs) with amphiphilic character that are found at the outer leaflet of the cell membranes, where their ability to organize into special domains makes them vital cell membrane components. However, a molecular understanding of GSL-rich membranes in terms of their clustered organization, stability, and dynamics is still elusive. To gain molecular insight into the organization and dynamics of GSL-rich membranes, we performed all-atom molecular-dynamics simulations of bicomponent ganglioside GM1 in 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC) phospholipid bilayers with varying concentrations of GM1 (10%, 20%, and 30%). Overall, the simulations show very good agreement with available experimental data, including x-ray electron density profiles along the membrane normal, NMR carbohydrate proton-proton distances, and x-ray crystal structures. This validates the quality of our model systems for investigating GM1 clustering through an ordered-lipid-cluster analysis. The increase in GM1 concentration induces tighter lipid packing, driven mainly by inter-GM1 carbohydrate-carbohydrate interactions, leading to a greater preference for the positive curvature of GM1-containing membranes and larger cluster sizes of ordered-lipid clusters (with a composite of GM1 and POPC). These clusters tend to segregate and form a large percolated cluster at a 30% GM1 concentration at 293 K. At a higher temperature of 330 K, however, the segregation is not maintained. Copyright © 2016 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Theofilatos, Konstantinos; Pavlopoulou, Niki; Papasavvas, Christoforos; Likothanassis, Spiros; Dimitrakopoulos, Christos; Georgopoulos, Efstratios; Moschopoulos, Charalampos; Mavroudi, Seferina
2015-03-01
Proteins are considered to be the most important individual components of biological systems and they combine to form physical protein complexes which are responsible for certain molecular functions. Despite the large availability of protein-protein interaction (PPI) information, not much information is available about protein complexes. Experimental methods are limited in terms of time, efficiency, cost and performance constraints. Existing computational methods have provided encouraging preliminary results, but they phase certain disadvantages as they require parameter tuning, some of them cannot handle weighted PPI data and others do not allow a protein to participate in more than one protein complex. In the present paper, we propose a new fully unsupervised methodology for predicting protein complexes from weighted PPI graphs. The proposed methodology is called evolutionary enhanced Markov clustering (EE-MC) and it is a hybrid combination of an adaptive evolutionary algorithm and a state-of-the-art clustering algorithm named enhanced Markov clustering. EE-MC was compared with state-of-the-art methodologies when applied to datasets from the human and the yeast Saccharomyces cerevisiae organisms. Using public available datasets, EE-MC outperformed existing methodologies (in some datasets the separation metric was increased by 10-20%). Moreover, when applied to new human datasets its performance was encouraging in the prediction of protein complexes which consist of proteins with high functional similarity. In specific, 5737 protein complexes were predicted and 72.58% of them are enriched for at least one gene ontology (GO) function term. EE-MC is by design able to overcome intrinsic limitations of existing methodologies such as their inability to handle weighted PPI networks, their constraint to assign every protein in exactly one cluster and the difficulties they face concerning the parameter tuning. This fact was experimentally validated and moreover, new potentially true human protein complexes were suggested as candidates for further validation using experimental techniques. Copyright © 2015 Elsevier B.V. All rights reserved.
Spatial ecology of refuge selection by an herbivore under risk of predation
Wilson, Tammy L.; Rayburn, Andrew P.; Edwards, Thomas C.
2012-01-01
Prey species use structures such as burrows to minimize predation risk. The spatial arrangement of these resources can have important implications for individual and population fitness. For example, there is evidence that clustered resources can benefit individuals by reducing predation risk and increasing foraging opportunity concurrently, which leads to higher population density. However, the scale of clustering that is important in these processes has been ignored during theoretical and empirical development of resource models. Ecological understanding of refuge exploitation by prey can be improved by spatial analysis of refuge use and availability that incorporates the effect of scale. We measured the spatial distribution of pygmy rabbit (Brachylagus idahoensis) refugia (burrows) through censuses in four 6-ha sites. Point pattern analyses were used to evaluate burrow selection by comparing the spatial distribution of used and available burrows. The presence of food resources and additional overstory cover resources was further examined using logistic regression. Burrows were spatially clustered at scales up to approximately 25 m, and then regularly spaced at distances beyond ~40 m. Pygmy rabbit exploitation of burrows did not match availability. Burrows used by pygmy rabbits were likely to be located in areas with high overall burrow density (resource clusters) and high overstory cover, which together minimized predation risk. However, in some cases we observed an interaction between either overstory cover (safety) or understory cover (forage) and burrow density. The interactions show that pygmy rabbits will use burrows in areas with low relative burrow density (high relative predation risk) if understory food resources are high. This points to a potential trade-off whereby rabbits must sacrifice some safety afforded by additional nearby burrows to obtain ample forage resources. Observed patterns of clustered burrows and non-random burrow use improve understanding of the importance of spatial distribution of refugia for burrowing herbivores. The analyses used allowed for the estimation of the spatial scale where subtle trade-offs between predation avoidance and foraging opportunity are likely to occur in a natural system.
Hough transform for clustered microcalcifications detection in full-field digital mammograms
NASA Astrophysics Data System (ADS)
Fanizzi, A.; Basile, T. M. A.; Losurdo, L.; Amoroso, N.; Bellotti, R.; Bottigli, U.; Dentamaro, R.; Didonna, V.; Fausto, A.; Massafra, R.; Moschetta, M.; Tamborra, P.; Tangaro, S.; La Forgia, D.
2017-09-01
Many screening programs use mammography as principal diagnostic tool for detecting breast cancer at a very early stage. Despite the efficacy of the mammograms in highlighting breast diseases, the detection of some lesions is still doubtless for radiologists. In particular, the extremely minute and elongated salt-like particles of microcalcifications are sometimes no larger than 0.1 mm and represent approximately half of all cancer detected by means of mammograms. Hence the need for automatic tools able to support radiologists in their work. Here, we propose a computer assisted diagnostic tool to support radiologists in identifying microcalcifications in full (native) digital mammographic images. The proposed CAD system consists of a pre-processing step, that improves contrast and reduces noise by applying Sobel edge detection algorithm and Gaussian filter, followed by a microcalcification detection step performed by exploiting the circular Hough transform. The procedure performance was tested on 200 images coming from the Breast Cancer Digital Repository (BCDR), a publicly available database. The automatically detected clusters of microcalcifications were evaluated by skilled radiologists which asses the validity of the correctly identified regions of interest as well as the system error in case of missed clustered microcalcifications. The system performance was evaluated in terms of Sensitivity and False Positives per images (FPi) rate resulting comparable to the state-of-art approaches. The proposed model was able to accurately predict the microcalcification clusters obtaining performances (sensibility = 91.78% and FPi rate = 3.99) which favorably compare to other state-of-the-art approaches.
Building CHAOS: An Operating System for Livermore Linux Clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Garlick, J E; Dunlap, C M
2003-02-21
The Livermore Computing (LC) Linux Integration and Development Project (the Linux Project) produces and supports the Clustered High Availability Operating System (CHAOS), a cluster operating environment based on Red Hat Linux. Each CHAOS release begins with a set of requirements and ends with a formally tested, packaged, and documented release suitable for use on LC's production Linux clusters. One characteristic of CHAOS is that component software packages come from different sources under varying degrees of project control. Some are developed by the Linux Project, some are developed by other LC projects, some are external open source projects, and some aremore » commercial software packages. A challenge to the Linux Project is to adhere to release schedules and testing disciplines in a diverse, highly decentralized development environment. Communication channels are maintained for externally developed packages in order to obtain support, influence development decisions, and coordinate/understand release schedules. The Linux Project embraces open source by releasing locally developed packages under open source license, by collaborating with open source projects where mutually beneficial, and by preferring open source over proprietary software. Project members generally use open source development tools. The Linux Project requires system administrators and developers to work together to resolve problems that arise in production. This tight coupling of production and development is a key strategy for making a product that directly addresses LC's production requirements. It is another challenge to balance support and development activities in such a way that one does not overwhelm the other.« less
Evers, Stefan; Rapoport, Alan
2017-04-01
Background Oxygen is recommended for the treatment of acute cluster headache attacks. However, it is not available worldwide. Methods The International Headache Society performed a survey among its national member societies on the availability and the restrictions for oxygen in the treatment of cluster headache. Results Oxygen is reimbursed in 50% of all countries responding ( n = 22). There are additional restrictions in the reimbursement of the facial mask and with respect to age. Conclusion Oxygen for the treatment of cluster headache attack is not reimbursed worldwide. Headache societies should pressure national/public health authorities to reimburse oxygen for cluster headache in all countries.
High Performance Computing Based Parallel HIearchical Modal Association Clustering (HPAR HMAC)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Patlolla, Dilip R; Surendran Nair, Sujithkumar; Graves, Daniel A.
For many applications, clustering is a crucial step in order to gain insight into the makeup of a dataset. The best approach to a given problem often depends on a variety of factors, such as the size of the dataset, time restrictions, and soft clustering requirements. The HMAC algorithm seeks to combine the strengths of 2 particular clustering approaches: model-based and linkage-based clustering. One particular weakness of HMAC is its computational complexity. HMAC is not practical for mega-scale data clustering. For high-definition imagery, a user would have to wait months or years for a result; for a 16-megapixel image, themore » estimated runtime skyrockets to over a decade! To improve the execution time of HMAC, it is reasonable to consider an multi-core implementation that utilizes available system resources. An existing imple-mentation (Ray and Cheng 2014) divides the dataset into N partitions - one for each thread prior to executing the HMAC algorithm. This implementation benefits from 2 types of optimization: parallelization and divide-and-conquer. By running each partition in parallel, the program is able to accelerate computation by utilizing more system resources. Although the parallel implementation provides considerable improvement over the serial HMAC, it still suffers from poor computational complexity, O(N2). Once the maximum number of cores on a system is exhausted, the program exhibits slower behavior. We now consider a modification to HMAC that involves a recursive partitioning scheme. Our modification aims to exploit divide-and-conquer benefits seen by the parallel HMAC implementation. At each level in the recursion tree, partitions are divided into 2 sub-partitions until a threshold size is reached. When the partition can no longer be divided without falling below threshold size, the base HMAC algorithm is applied. This results in a significant speedup over the parallel HMAC.« less
Thermodynamics of confined gallium clusters.
Chandrachud, Prachi
2015-11-11
We report the results of ab initio molecular dynamics simulations of Ga13 and Ga17 clusters confined inside carbon nanotubes with different diameters. The cluster-tube interaction is simulated by the Lennard-Jones (LJ) potential. We discuss the geometries, the nature of the bonding and the thermodynamics under confinement. The geometries as well as the isomer spectra of both the clusters are significantly affected. The degree of confinement decides the dimensionality of the clusters. We observe that a number of low-energy isomers appear under moderate confinement while some isomers seen in the free space disappear. Our finite-temperature simulations bring out interesting aspects, namely that the heat capacity curve is flat, even though the ground state is symmetric. Such a flat nature indicates that the phase change is continuous. This effect is due to the restricted phase space available to the system. These observations are supported by the mean square displacement of individual atoms, which are significantly smaller than in free space. The nature of the bonding is found to be approximately jellium-like. Finally we note the relevance of the work to the problem of single file diffusion for the case of the highest confinement.
Key Management Scheme Based on Route Planning of Mobile Sink in Wireless Sensor Networks.
Zhang, Ying; Liang, Jixing; Zheng, Bingxin; Jiang, Shengming; Chen, Wei
2016-01-29
In many wireless sensor network application scenarios the key management scheme with a Mobile Sink (MS) should be fully investigated. This paper proposes a key management scheme based on dynamic clustering and optimal-routing choice of MS. The concept of Traveling Salesman Problem with Neighbor areas (TSPN) in dynamic clustering for data exchange is proposed, and the selection probability is used in MS route planning. The proposed scheme extends static key management to dynamic key management by considering the dynamic clustering and mobility of MSs, which can effectively balance the total energy consumption during the activities. Considering the different resources available to the member nodes and sink node, the session key between cluster head and MS is established by modified an ECC encryption with Diffie-Hellman key exchange (ECDH) algorithm and the session key between member node and cluster head is built with a binary symmetric polynomial. By analyzing the security of data storage, data transfer and the mechanism of dynamic key management, the proposed scheme has more advantages to help improve the resilience of the key management system of the network on the premise of satisfying higher connectivity and storage efficiency.
Searching for galaxy clusters in the Kilo-Degree Survey
NASA Astrophysics Data System (ADS)
Radovich, M.; Puddu, E.; Bellagamba, F.; Roncarelli, M.; Moscardini, L.; Bardelli, S.; Grado, A.; Getman, F.; Maturi, M.; Huang, Z.; Napolitano, N.; McFarland, J.; Valentijn, E.; Bilicki, M.
2017-02-01
Aims: In this paper, we present the tools used to search for galaxy clusters in the Kilo Degree Survey (KiDS), and our first results. Methods: The cluster detection is based on an implementation of the optimal filtering technique that enables us to identify clusters as over-densities in the distribution of galaxies using their positions on the sky, magnitudes, and photometric redshifts. The contamination and completeness of the cluster catalog are derived using mock catalogs based on the data themselves. The optimal signal to noise threshold for the cluster detection is obtained by randomizing the galaxy positions and selecting the value that produces a contamination of less than 20%. Starting from a subset of clusters detected with high significance at low redshifts, we shift them to higher redshifts to estimate the completeness as a function of redshift: the average completeness is 85%. An estimate of the mass of the clusters is derived using the richness as a proxy. Results: We obtained 1858 candidate clusters with redshift 0
Binary and Millisecond Pulsars.
Lorimer, Duncan R
2008-01-01
We review the main properties, demographics and applications of binary and millisecond radio pulsars. Our knowledge of these exciting objects has greatly increased in recent years, mainly due to successful surveys which have brought the known pulsar population to over 1800. There are now 83 binary and millisecond pulsars associated with the disk of our Galaxy, and a further 140 pulsars in 26 of the Galactic globular clusters. Recent highlights include the discovery of the young relativistic binary system PSR J1906+0746, a rejuvination in globular cluster pulsar research including growing numbers of pulsars with masses in excess of 1.5 M ⊙ , a precise measurement of relativistic spin precession in the double pulsar system and a Galactic millisecond pulsar in an eccentric ( e = 0.44) orbit around an unevolved companion. Supplementary material is available for this article at 10.12942/lrr-2008-8.
Substructures in DAFT/FADA survey clusters based on XMM and optical data
NASA Astrophysics Data System (ADS)
Durret, F.; DAFT/FADA Team
2014-07-01
The DAFT/FADA survey was initiated to perform weak lensing tomography on a sample of 90 massive clusters in the redshift range [0.4,0.9] with HST imaging available. The complementary deep multiband imaging constitutes a high quality imaging data base for these clusters. In X-rays, we have analysed the XMM-Newton and/or Chandra data available for 32 clusters, and for 23 clusters we fit the X-ray emissivity with a beta-model and subtract it to search for substructures in the X-ray gas. This study was coupled with a dynamical analysis for the 18 clusters with at least 15 spectroscopic galaxy redshifts in the cluster range, based on a Serna & Gerbal (SG) analysis. We detected ten substructures in eight clusters by both methods (X-rays and SG). The percentage of mass included in substructures is found to be roughly constant with redshift, with values of 5-15%. Most of the substructures detected both in X-rays and with the SG method are found to be relatively recent infalls, probably at their first cluster pericenter approach.
Mean-cluster approach indicates cell sorting time scales are determined by collective dynamics
NASA Astrophysics Data System (ADS)
Beatrici, Carine P.; de Almeida, Rita M. C.; Brunnet, Leonardo G.
2017-03-01
Cell migration is essential to cell segregation, playing a central role in tissue formation, wound healing, and tumor evolution. Considering random mixtures of two cell types, it is still not clear which cell characteristics define clustering time scales. The mass of diffusing clusters merging with one another is expected to grow as td /d +2 when the diffusion constant scales with the inverse of the cluster mass. Cell segregation experiments deviate from that behavior. Explanations for that could arise from specific microscopic mechanisms or from collective effects, typical of active matter. Here we consider a power law connecting diffusion constant and cluster mass to propose an analytic approach to model cell segregation where we explicitly take into account finite-size corrections. The results are compared with active matter model simulations and experiments available in the literature. To investigate the role played by different mechanisms we considered different hypotheses describing cell-cell interaction: differential adhesion hypothesis and different velocities hypothesis. We find that the simulations yield normal diffusion for long time intervals. Analytic and simulation results show that (i) cluster evolution clearly tends to a scaling regime, disrupted only at finite-size limits; (ii) cluster diffusion is greatly enhanced by cell collective behavior, such that for high enough tendency to follow the neighbors, cluster diffusion may become independent of cluster size; (iii) the scaling exponent for cluster growth depends only on the mass-diffusion relation, not on the detailed local segregation mechanism. These results apply for active matter systems in general and, in particular, the mechanisms found underlying the increase in cell sorting speed certainly have deep implications in biological evolution as a selection mechanism.
STAR CLUSTER FORMATION AND DESTRUCTION IN THE MERGING GALAXY NGC 3256
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mulia, A. J.; Chandar, R.; Whitmore, B. C.
2016-07-20
We use the Advanced Camera for Surveys on the Hubble Space Telescope to study the rich population of young massive star clusters in the main body of NGC 3256, a merging pair of galaxies with a high star formation rate (SFR) and SFR per unit area (Σ{sub SFR}). These clusters have luminosity and mass functions that follow power laws, dN / dL ∝ L{sup α} with α = 2.23 ± 0.07, and dN / dM ∝ M{sup β} with β = 1.86 ± 0.34 for τ < 10 Myr clusters, similar to those found in more quiescent galaxies. The agemore » distribution can be described by dN / dτ ∝ τ{sup γ}, with γ ≈ 0.67 ± 0.08 for clusters younger than about a few hundred million years, with no obvious dependence on cluster mass. This is consistent with a picture where ∼80% of the clusters are disrupted each decade in time. We investigate the claim that galaxies with high Σ{sub SFR} form clusters more efficiently than quiescent systems by determining the fraction of stars in bound clusters (Γ) and the CMF/SFR statistic (CMF is the cluster mass function) for NGC 3256 and comparing the results with those for other galaxies. We find that the CMF/SFR statistic for NGC 3256 agrees well with that found for galaxies with Σ{sub SFR} and SFRs that are lower by 1–3 orders of magnitude, but that estimates for Γ are only robust when the same sets of assumptions are applied. Currently, Γ values available in the literature have used different sets of assumptions, making it more difficult to compare the results between galaxies.« less
Star Cluster Formation and Destruction in the Merging Galaxy NGC 3256
NASA Astrophysics Data System (ADS)
Mulia, A. J.; Chandar, R.; Whitmore, B. C.
2016-07-01
We use the Advanced Camera for Surveys on the Hubble Space Telescope to study the rich population of young massive star clusters in the main body of NGC 3256, a merging pair of galaxies with a high star formation rate (SFR) and SFR per unit area (ΣSFR). These clusters have luminosity and mass functions that follow power laws, dN/dL ∝ L α with α = -2.23 ± 0.07, and dN/dM ∝ M β with β = -1.86 ± 0.34 for τ < 10 Myr clusters, similar to those found in more quiescent galaxies. The age distribution can be described by dN/dτ ∝ τ γ , with γ ≈ -0.67 ± 0.08 for clusters younger than about a few hundred million years, with no obvious dependence on cluster mass. This is consistent with a picture where ˜80% of the clusters are disrupted each decade in time. We investigate the claim that galaxies with high ΣSFR form clusters more efficiently than quiescent systems by determining the fraction of stars in bound clusters (Γ) and the CMF/SFR statistic (CMF is the cluster mass function) for NGC 3256 and comparing the results with those for other galaxies. We find that the CMF/SFR statistic for NGC 3256 agrees well with that found for galaxies with ΣSFR and SFRs that are lower by 1-3 orders of magnitude, but that estimates for Γ are only robust when the same sets of assumptions are applied. Currently, Γ values available in the literature have used different sets of assumptions, making it more difficult to compare the results between galaxies.
Esserman, Denise; Allore, Heather G.; Travison, Thomas G.
2016-01-01
Cluster-randomized clinical trials (CRT) are trials in which the unit of randomization is not a participant but a group (e.g. healthcare systems or community centers). They are suitable when the intervention applies naturally to the cluster (e.g. healthcare policy); when lack of independence among participants may occur (e.g. nursing home hygiene); or when it is most ethical to apply an intervention to all within a group (e.g. school-level immunization). Because participants in the same cluster receive the same intervention, CRT may approximate clinical practice, and may produce generalizable findings. However, when not properly designed or interpreted, CRT may induce biased results. CRT designs have features that add complexity to statistical estimation and inference. Chief among these is the cluster-level correlation in response measurements induced by the randomization. A critical consideration is the experimental unit of inference; often it is desirable to consider intervention effects at the level of the individual rather than the cluster. Finally, given that the number of clusters available may be limited, simple forms of randomization may not achieve balance between intervention and control arms at either the cluster- or participant-level. In non-clustered clinical trials, balance of key factors may be easier to achieve because the sample can be homogenous by exclusion of participants with multiple chronic conditions (MCC). CRTs, which are often pragmatic, may eschew such restrictions. Failure to account for imbalance may induce bias and reducing validity. This article focuses on the complexities of randomization in the design of CRTs, such as the inclusion of patients with MCC, and imbalances in covariate factors across clusters. PMID:27478520
NASA Astrophysics Data System (ADS)
Zepf, Stephen E.; Stern, Daniel; Maccarone, Thomas J.; Kundu, Arunav; Kamionkowski, Marc; Rhode, Katherine L.; Salzer, John J.; Ciardullo, Robin; Gronwall, Caryl
2008-08-01
We present Keck LRIS spectroscopy of the black hole-hosting globular cluster RZ 2109 in the Virgo elliptical galaxy NGC 4472. We find that this object has extraordinarily broad [O III] λ5007 and [O III] λ4959 emission lines, with velocity widths of approximately 2000 km s-1. This result has significant implications for the nature of this accreting black hole system and the mass of the globular cluster black hole. We show that the broad [O III] λ5007 emission must arise from material driven at high velocity from the black hole system. This is because the volume available near the black hole is too small by many orders of magnitude to have enough [O III]-emitting atoms to account for the observed L([O III] λ5007) at high velocities, even if this volume is filled with oxygen at the critical density for [O III] λ5007. The Balmer emission is also weak, indicating the observed [O III] is not due to shocks. We therefore conclude that the [O III] λλ4959, 5007 is produced by photoionization of material driven across the cluster. The only known way to drive significant material at high velocity is for a system accreting mass near or above its Eddington limit, which indicates a stellar-mass black hole. Since it is dynamically implausible to form an accreting stellar-mass black hole system in a globular cluster with an intermediate-mass black hole (IMBH), it appears this massive globular cluster does not have an IMBH. We discuss further tests of this conclusion, and its implications for the MBH - Mstellar and MBH - σ relations. Based on observations made at the W. M. Keck Observatory, which is operated as a scientific partnership among the California Institute of Technology, the University of California, and the National Aeronautics and Space Administration. The Observatory was made possible by the generous financial support of the W. M. Keck Foundation.
Impacts of storm chronology on the morphological changes of the Formby beach and dune system, UK
NASA Astrophysics Data System (ADS)
Dissanayake, P.; Brown, J.; Karunarathna, H.
2015-07-01
Impacts of storm chronology within a storm cluster on beach/dune erosion are investigated by applying the state-of-the-art numerical model XBeach to the Sefton coast, northwest England. Six temporal storm clusters of different storm chronologies were formulated using three storms observed during the 2013/2014 winter. The storm power values of these three events nearly halve from the first to second event and from the second to third event. Cross-shore profile evolution was simulated in response to the tide, surge and wave forcing during these storms. The model was first calibrated against the available post-storm survey profiles. Cumulative impacts of beach/dune erosion during each storm cluster were simulated by using the post-storm profile of an event as the pre-storm profile for each subsequent event. For the largest event the water levels caused noticeable retreat of the dune toe due to the high water elevation. For the other events the greatest evolution occurs over the bar formations (erosion) and within the corresponding troughs (deposition) of the upper-beach profile. The sequence of events impacting the size of this ridge-runnel feature is important as it consequently changes the resilience of the system to the most extreme event that causes dune retreat. The highest erosion during each single storm event was always observed when that storm initialised the storm cluster. The most severe storm always resulted in the most erosion during each cluster, no matter when it occurred within the chronology, although the erosion volume due to this storm was reduced when it was not the primary event. The greatest cumulative cluster erosion occurred with increasing storm severity; however, the variability in cumulative cluster impact over a beach/dune cross section due to storm chronology is minimal. Initial storm impact can act to enhance or reduce the system resilience to subsequent impact, but overall the cumulative impact is controlled by the magnitude and number of the storms. This model application provides inter-survey information about morphological response to repeated storm impact. This will inform local managers of the potential beach response and dune vulnerability to variable storm configurations.
Impacts of storm chronology on the morphological changes of the Formby beach and dune system, UK
NASA Astrophysics Data System (ADS)
Dissanayake, P.; Brown, J.; Karunarathna, H.
2015-04-01
Impacts of storm chronology within a storm cluster on beach/dune erosion are investigated by applying the state-of-the-art numerical model XBeach to the Sefton coast, northwest England. Six temporal storm clusters of different storm chronologies were formulated using three storms observed during the 2013/14 winter. The storm power values of these three events nearly halve from the first to second event and from the second to third event. Cross-shore profile evolution was simulated in response to the tide, surge and wave forcing during these storms. The model was first calibrated against the available post-storm survey profiles. Cumulative impacts of beach/dune erosion during each storm cluster were simulated by using the post-storm profile of an event as the pre-storm profile for each subsequent event. For the largest event the water levels caused noticeable retreat of the dune toe due to the high water elevation. For the other events the greatest evolution occurs over the bar formations (erosion) and within the corresponding troughs (deposition) of the upper beach profile. The sequence of events impacting the size of this ridge-runnel feature is important as it consequently changes the resilience of the system to the most extreme event that causes dune retreat. The highest erosion during each single storm event was always observed when that storm initialised the storm cluster. The most severe storm always resulted in the most erosion during each cluster, no matter when it occurred within the chronology, although the erosion volume due to this storm was reduced when it was not the primary event. The greatest cumulative cluster erosion occurred with increasing storm severity; however, the variability in cumulative cluster impact over a beach/dune cross-section due to storm chronology is minimal. Initial storm impact can act to enhance or reduce the system resilience to subsequent impact, but overall the cumulative impact is controlled by the magnitude and number of the storms. This model application provides inter-survey information about morphological response to repeated storm impact. This will inform local managers of the potential beach response and dune vulnerability to variable storm configurations.
Galaxy evolution in the densest environments: HST imaging
NASA Astrophysics Data System (ADS)
Jorgensen, Inger
2013-10-01
We propose to process in a consistent fashion all available HST/ACS and WFC3 imaging of seven rich clusters of galaxies at z=1.2-1.6. The clusters are part of our larger project aimed at constraining models for galaxy evolution in dense environments from observations of stellar populations in rich z=1.2-2 galaxy clusters. The main objective is to establish the star formation {SF} history and structural evolution over this epoch during which large changes in SF rates and galaxy structure are expected to take place in cluster galaxies.The observational data required to meet our main objective are deep HST imaging and high S/N spectroscopy of individual cluster members. The HST imaging already exists for the seven rich clusters at z=1.2-1.6 included in this archive proposal. However, the data have not been consistently processed to derive colors, magnitudes, sizes and morphological parameters for all potential cluster members bright enough to be suitable for spectroscopic observations with 8-m class telescopes. We propose to carry out this processing and make all derived parameters publicly available. We will use the parameters derived from the HST imaging to {1} study the structural evolution of the galaxies, {2} select clusters and galaxies for spectroscopic observations, and {3} use the photometry and spectroscopy together for a unified analysis aimed at the SF history and structural changes. The analysis will also utilize data from the Gemini/HST Cluster Galaxy Project, which covers rich clusters at z=0.2-1.0 and for which we have similar HST imaging and high S/N spectroscopy available.
NASA Astrophysics Data System (ADS)
Perez, Adrianna; Moreno, Jorge; Naiman, Jill; Ramirez-Ruiz, Enrico; Hopkins, Philip F.
2017-01-01
In this work, we analyze the environments surrounding star clusters of simulated merging galaxies. Our framework employs Feedback In Realistic Environments (FIRE) model (Hopkins et al., 2014). The FIRE project is a high resolution cosmological simulation that resolves star forming regions and incorporates stellar feedback in a physically realistic way. The project focuses on analyzing the properties of the star clusters formed in merging galaxies. The locations of these star clusters are identified with astrodendro.py, a publicly available dendrogram algorithm. Once star cluster properties are extracted, they will be used to create a sub-grid (smaller than the resolution scale of FIRE) of gas confinement in these clusters. Then, we can examine how the star clusters interact with these available gas reservoirs (either by accreting this mass or blowing it out via feedback), which will determine many properties of the cluster (star formation history, compact object accretion, etc). These simulations will further our understanding of star formation within stellar clusters during galaxy evolution. In the future, we aim to enhance sub-grid prescriptions for feedback specific to processes within star clusters; such as, interaction with stellar winds and gas accretion onto black holes and neutron stars.
Poole, William; Leinonen, Kalle; Shmulevich, Ilya
2017-01-01
Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C. PMID:28170390
Poole, William; Leinonen, Kalle; Shmulevich, Ilya; Knijnenburg, Theo A; Bernard, Brady
2017-02-01
Cancer researchers have long recognized that somatic mutations are not uniformly distributed within genes. However, most approaches for identifying cancer mutations focus on either the entire-gene or single amino-acid level. We have bridged these two methodologies with a multiscale mutation clustering algorithm that identifies variable length mutation clusters in cancer genes. We ran our algorithm on 539 genes using the combined mutation data in 23 cancer types from The Cancer Genome Atlas (TCGA) and identified 1295 mutation clusters. The resulting mutation clusters cover a wide range of scales and often overlap with many kinds of protein features including structured domains, phosphorylation sites, and known single nucleotide variants. We statistically associated these multiscale clusters with gene expression and drug response data to illuminate the functional and clinical consequences of mutations in our clusters. Interestingly, we find multiple clusters within individual genes that have differential functional associations: these include PTEN, FUBP1, and CDH1. This methodology has potential implications in identifying protein regions for drug targets, understanding the biological underpinnings of cancer, and personalizing cancer treatments. Toward this end, we have made the mutation clusters and the clustering algorithm available to the public. Clusters and pathway associations can be interactively browsed at m2c.systemsbiology.net. The multiscale mutation clustering algorithm is available at https://github.com/IlyaLab/M2C.
Dynamically allocated virtual clustering management system
NASA Astrophysics Data System (ADS)
Marcus, Kelvin; Cannata, Jess
2013-05-01
The U.S Army Research Laboratory (ARL) has built a "Wireless Emulation Lab" to support research in wireless mobile networks. In our current experimentation environment, our researchers need the capability to run clusters of heterogeneous nodes to model emulated wireless tactical networks where each node could contain a different operating system, application set, and physical hardware. To complicate matters, most experiments require the researcher to have root privileges. Our previous solution of using a single shared cluster of statically deployed virtual machines did not sufficiently separate each user's experiment due to undesirable network crosstalk, thus only one experiment could be run at a time. In addition, the cluster did not make efficient use of our servers and physical networks. To address these concerns, we created the Dynamically Allocated Virtual Clustering management system (DAVC). This system leverages existing open-source software to create private clusters of nodes that are either virtual or physical machines. These clusters can be utilized for software development, experimentation, and integration with existing hardware and software. The system uses the Grid Engine job scheduler to efficiently allocate virtual machines to idle systems and networks. The system deploys stateless nodes via network booting. The system uses 802.1Q Virtual LANs (VLANs) to prevent experimentation crosstalk and to allow for complex, private networks eliminating the need to map each virtual machine to a specific switch port. The system monitors the health of the clusters and the underlying physical servers and it maintains cluster usage statistics for historical trends. Users can start private clusters of heterogeneous nodes with root privileges for the duration of the experiment. Users also control when to shutdown their clusters.
Oak Ridge Institutional Cluster Autotune Test Drive Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jibonananda, Sanyal; New, Joshua Ryan
2014-02-01
The Oak Ridge Institutional Cluster (OIC) provides general purpose computational resources for the ORNL staff to run computation heavy jobs that are larger than desktop applications but do not quite require the scale and power of the Oak Ridge Leadership Computing Facility (OLCF). This report details the efforts made and conclusions derived in performing a short test drive of the cluster resources on Phase 5 of the OIC. EnergyPlus was used in the analysis as a candidate user program and the overall software environment was evaluated against anticipated challenges experienced with resources such as the shared memory-Nautilus (JICS) and Titanmore » (OLCF). The OIC performed within reason and was found to be acceptable in the context of running EnergyPlus simulations. The number of cores per node and the availability of scratch space per node allow non-traditional desktop focused applications to leverage parallel ensemble execution. Although only individual runs of EnergyPlus were executed, the software environment on the OIC appeared suitable to run ensemble simulations with some modifications to the Autotune workflow. From a standpoint of general usability, the system supports common Linux libraries, compilers, standard job scheduling software (Torque/Moab), and the OpenMPI library (the only MPI library) for MPI communications. The file system is a Panasas file system which literature indicates to be an efficient file system.« less
New science from the phase space of old stellar systems
NASA Astrophysics Data System (ADS)
Varri, Anna Lisa; Breen, Philip G.; Heggie, Douglas C.; Tiongco, Maria; Vesperini, Enrico
2017-06-01
Our traditional interpretative picture of the internal dynamics of globular clusters has been recently revolutionized by a series of discoveries about their chemical, structural, and kinematic properties. The empirical evidence that their velocity space is much more complex than usually expected encourages us to use them as refreshingly novel phase space laboratories for some long-forgotten aspects of collisional gravitational dynamics. Such a realization, coupled with the discovery that the stars in clusters were not all born at once in a single population, makes them new, challenging chemodynamical puzzles.Thanks to the proper motions of thousands of stars that will be available from the Gaia mission, we are about to enter a new ''golden age'' for the study of the dynamics of this class of stellar systems, as the full phase space of several Galactic globular clusters will be soon unlocked for the first time. In this context, I will present the highlights of a more realistic dynamical paradigm for these intriguing stellar systems, with emphasis on the role of angular momentum, velocity anisotropy and external tidal field. Such a fundamental understanding of the emerging phase space complexity of globulars will allow us to address many open questions about their rich dynamical evolution, their elusive stellar populations and putative black holes, and their role within the history of our Galaxy.
NASA Astrophysics Data System (ADS)
Galleti, S.; Bellazzini, M.; Buzzoni, A.; Federici, L.; Fusi Pecci, F.
2009-12-01
Aims. We present a new homogeneous set of metallicity estimates based on Lick indices for the old globular clusters of the M 31 galaxy. The final aim is to add homogeneous spectroscopic metallicities to as many entries as possible of the Revised Bologna Catalog of M 31 clusters, by reporting Lick index measurements from any source (literature, new observations, etc.) on the same scale. Methods: New empirical relations of [Fe/H] as a function of [MgFe] and Mg2 indices are based on the well-studied galactic globular clusters, complemented with theoretical model predictions for -0.2≤ [Fe/H]≤ +0.5. Lick indices for M 31 clusters from various literature sources (225 clusters) and from new observations by our team (71 clusters) have been transformed into the Trager et al. system, yielding new metallicity estimates for 245 globular clusters of M 31. Results: Our values are in good agreement with recent estimates based on detailed spectral fitting and with those obtained from color magnitude diagrams of clusters imaged with the Hubble Space Telescope. The typical uncertainty on individual estimates is ≃±0.25 dex, as resulted from the comparison with metallicities derived from color magnitude diagrams of individual clusters. Conclusions: The metallicity distribution of M 31 globular cluster is briefly discussed and compared with that of the Milky Way. Simple parametric statistical tests suggest that the distribution is probably not unimodal. The strong correlation between metallicity and kinematics found in previous studies is confirmed. The most metal-rich GCs tend to be packed into the center of the system and to cluster tightly around the galactic rotation curve defined by the HI disk, while the velocity dispersion about the curve increases with decreasing metallicity. However, also the clusters with [Fe/H]<-1.0 display a clear rotation pattern, at odds with their Milky Way counterparts. Based on observations made at La Palma, at the Spanish Observatorio del Roque de los Muchachos of the IAC, with the William Herschel Telescope of the Isaac Newton Group and with the Italian Telescopio Nazionale Galileo (TNG) operated by the Fundación Galileo Galilei of INAF. Also based on observations made with the G.B. Cassini Telescope at Loiano (Italy), operated by the Osservatorio Astronomico di Bologna (INAF). Appendices are only available in electronic form at http://www.aanda.org
BioMake: a GNU make-compatible utility for declarative workflow management.
Holmes, Ian H; Mungall, Christopher J
2017-11-01
The Unix 'make' program is widely used in bioinformatics pipelines, but suffers from problems that limit its application to large analysis datasets. These include reliance on file modification times to determine whether a target is stale, lack of support for parallel execution on clusters, and restricted flexibility to extend the underlying logic program. We present BioMake, a make-like utility that is compatible with most features of GNU Make and adds support for popular cluster-based job-queue engines, MD5 signatures as an alternative to timestamps, and logic programming extensions in Prolog. BioMake is available for MacOSX and Linux systems from https://github.com/evoldoers/biomake under the BSD3 license. The only dependency is SWI-Prolog (version 7), available from http://www.swi-prolog.org/. ihholmes + biomake@gmail.com or cmungall + biomake@gmail.com. Feature table comparing BioMake to similar tools. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
New insights into designing metallacarborane based room temperature hydrogen storage media.
Bora, Pankaj Lochan; Singh, Abhishek K
2013-10-28
Metallacarboranes are promising towards realizing room temperature hydrogen storage media because of the presence of both transition metal and carbon atoms. In metallacarborane clusters, the transition metal adsorbs hydrogen molecules and carbon can link these clusters to form metal organic framework, which can serve as a complete storage medium. Using first principles density functional calculations, we chalk out the underlying principles of designing an efficient metallacarborane based hydrogen storage media. The storage capacity of hydrogen depends upon the number of available transition metal d-orbitals, number of carbons, and dopant atoms in the cluster. These factors control the amount of charge transfer from metal to the cluster, thereby affecting the number of adsorbed hydrogen molecules. This correlation between the charge transfer and storage capacity is general in nature, and can be applied to designing efficient hydrogen storage systems. Following this strategy, a search for the best metallacarborane was carried out in which Sc based monocarborane was found to be the most promising H2 sorbent material with a 9 wt.% of reversible storage at ambient pressure and temperature.
Lin, Mouhong; Huang, Haoliang; Liu, Zuotao; Liu, Yingju; Ge, Junbin; Fang, Yueping
2013-12-10
Magnetic nanoparticle clusters (MNCs) are a class of secondary structural materials that comprise chemically defined nanoparticles assembled into clusters of defined size. Herein, MNCs are fabricated through a one-pot solvothermal reaction featuring self-limiting assembly of building blocks and the controlled reorganization process. Such growth-dissolution-regrowth fabrication mechanism overcomes some limitations of conventional solvothermal fabrication methods with regard to restricted available feature size and structural complexity, which can be extended to other oxides (as long as one can be chelated by EDTA-2Na). Based on this method, the nanoparticle size of MNCs is tuned between 6.8 and 31.2 nm at a fixed cluster diameter of 120 nm, wherein the critical size for superparamagnetic-ferromagnetic transition is estimated from 13.5 to 15.7 nm. Control over the nature and secondary structure of MNCs gives an excellent model system to understand the nanoparticle size-dependent magnetic properties of MNCs. MNCs have potential applications in many different areas, while this work evaluates their cytotoxicity and Pb(2+) adsorption capacity as initial application study.
New insights into designing metallacarborane based room temperature hydrogen storage media
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bora, Pankaj Lochan; Singh, Abhishek K.
Metallacarboranes are promising towards realizing room temperature hydrogen storage media because of the presence of both transition metal and carbon atoms. In metallacarborane clusters, the transition metal adsorbs hydrogen molecules and carbon can link these clusters to form metal organic framework, which can serve as a complete storage medium. Using first principles density functional calculations, we chalk out the underlying principles of designing an efficient metallacarborane based hydrogen storage media. The storage capacity of hydrogen depends upon the number of available transition metal d-orbitals, number of carbons, and dopant atoms in the cluster. These factors control the amount of chargemore » transfer from metal to the cluster, thereby affecting the number of adsorbed hydrogen molecules. This correlation between the charge transfer and storage capacity is general in nature, and can be applied to designing efficient hydrogen storage systems. Following this strategy, a search for the best metallacarborane was carried out in which Sc based monocarborane was found to be the most promising H{sub 2} sorbent material with a 9 wt.% of reversible storage at ambient pressure and temperature.« less
Matthew S. Bumgardner; Gary W. Graham; P. Charles Goebel; Robert L. Romig
2011-01-01
The Amish-based furniture manufacturing cluster in and around Holmes County, OH, is home to some 400 shops and has become an important regional driver of demand for hardwood products. The cluster has expanded even as the broader domestic furniture industry has declined. Clustering dynamics are seen as important to the success, but little information has been available...
Federal Register 2010, 2011, 2012, 2013, 2014
2010-02-19
... DEPARTMENT OF ENERGY Energy Efficient Building Systems Regional Innovation Cluster Initiative... Energy Efficient Building Systems Regional Innovation Cluster Initiative. A single proposal submitted by... systems design. The DOE funded Energy Efficient Building Systems Design Hub (the ``Hub'') will serve as a...
Lens models and magnification maps of the six Hubble Frontier Fields clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Johnson, Traci L.; Sharon, Keren; Bayliss, Matthew B.
2014-12-10
We present strong-lensing models as well as mass and magnification maps for the cores of the six Hubble Space Telescope (HST) Frontier Fields galaxy clusters. Our parametric lens models are constrained by the locations and redshifts of multiple image systems of lensed background galaxies. We use a combination of photometric redshifts and spectroscopic redshifts of the lensed background sources obtained by us (for A2744 and AS1063), collected from the literature, or kindly provided by the lensing community. Using our results, we (1) compare the derived mass distribution of each cluster to its light distribution, (2) quantify the cumulative magnification powermore » of the HST Frontier Fields clusters, (3) describe how our models can be used to estimate the magnification and image multiplicity of lensed background sources at all redshifts and at any position within the cluster cores, and (4) discuss systematic effects and caveats resulting from our modeling methods. We specifically investigate the effect of the use of spectroscopic and photometric redshift constraints on the uncertainties of the resulting models. We find that the photometric redshift estimates of lensed galaxies are generally in excellent agreement with spectroscopic redshifts, where available. However, the flexibility associated with relaxed redshift priors may cause the complexity of large-scale structure that is needed to account for the lensing signal to be underestimated. Our findings thus underline the importance of spectroscopic arc redshifts, or tight photometric redshift constraints, for high precision lens models. All products from our best-fit lens models (magnification, convergence, shear, deflection field) and model simulations for estimating errors are made available via the Mikulski Archive for Space Telescopes.« less
Boriollo, Marcelo Fabiano Gomes; Dias, Ricardo Antunes; Fiorini, João Evangelista; Oliveira, Nelma de Mello Silva; Spolidório, Denise Madalena Palomari; de Souza, Henrique Marques Barbosa; Figueira, Antonio Vargas de Oliveira; Pizzirani-Kleiner, Aline Aparecida
2010-09-01
Various molecular systems are available for epidemiological, genetic, evolutionary, taxonomic and systematic studies of innumerable fungal infections, especially those caused by the opportunistic pathogen C. albicans. A total of 75 independent oral isolates were selected in order to compare Multilocus Enzyme Electrophoresis (MLEE), Electrophoretic Karyotyping (EK) and Microsatellite Markers (Simple Sequence Repeats - SSRs), in their abilities to differentiate and group C. albicans isolates (discriminatory power), and also, to evaluate the concordance and similarity of the groups of strains determined by cluster analysis for each fingerprinting method. Isoenzyme typing was performed using eleven enzyme systems: Adh, Sdh, M1p, Mdh, Idh, Gdh, G6pdh, Asd, Cat, Po, and Lap (data previously published). The EK method consisted of chromosomal DNA separation by pulsed-field gel electrophoresis using a CHEF system. The microsatellite markers were investigated by PCR using three polymorphic loci: EF3, CDC3, and HIS3. Dendrograms were generated by the SAHN method and UPGMA algorithm based on similarity matrices (S(SM)). The discriminatory power of the three methods was over 95%, however a paired analysis among them showed a parity of 19.7-22.4% in the identification of strains. Weak correlation was also observed among the genetic similarity matrices (S(SM)(MLEE)xS(SM)(EK)xS(SM)(SSRs)). Clustering analyses showed a mean of 9+/-12.4 isolates per cluster (3.8+/-8 isolates/taxon) for MLEE, 6.2+/-4.9 isolates per cluster (4+/-4.5 isolates/taxon) for SSRs, and 4.1+/-2.3 isolates per cluster (2.6+/-2.3 isolates/taxon) for EK. A total of 45 (13%), 39 (11.2%), 5 (1.4%) and 3 (0.9%) clusters pairs from 347 showed similarity (S(J)) of 0.1-10%, 10.1-20%, 20.1-30% and 30.1-40%, respectively. Clinical and molecular epidemiological correlation involving the opportunistic pathogen C. albicans may be attributed dependently of each method of genotyping (i.e., MLEE, EK, and SSRs) supplemented with similarity and grouping analysis. Therefore, the use of genotyping systems that give results which offer minimum disparity, or the combination of the results of these systems, can provide greater security and consistency in the determination of strains and their genetic relationships. (c) 2010 Elsevier B.V. All rights reserved.
The enigma of the open cluster M29 (NGC 6913) solved
DOE Office of Scientific and Technical Information (OSTI.GOV)
Straižys, V.; Milašius, K.; Černis, K.
2014-11-01
Determining the distance to the open cluster M29 (NGC 6913) has proven difficult, with distances determined by various authors differing by a factor of two or more. To solve this problem, we have initiated a new photometric investigation of the cluster in the Vilnius seven-color photometric system, supplementing it with available data in the BV and JHK {sub s} photometric systems and spectra of the nine brightest stars of spectral classes O and B. Photometric spectral classes and luminosities of 260 stars in a 15' × 15' area down to V = 19 mag are used to investigate the interstellarmore » extinction run with distance and to estimate the distance of the Great Cygnus Rift, ∼ 800 pc. The interstellar reddening law in the optical and near-infrared regions is found to be close to normal, with the ratio of extinction to color excess R{sub BV} = 2.87. The extinction A{sub V} of cluster members is between 2.5 and 3.8 mag, with a mean value of 2.97 mag, or E {sub B–V} = 1.03. The average distance of eight stars of spectral types O9-B2 is 1.54 ± 0.15 kpc. Two stars from the seven brightest stars are field stars: HDE 229238 is a background B0.5 supergiant and HD 194378 is a foreground F star. In the intrinsic color-magnitude diagram, seven fainter stars of spectral classes B3-B8 are identified as possible members of the cluster. The 15 selected members of the cluster of spectral classes O9-B8 plotted on the log L/L {sub ☉} versus log T {sub eff} diagram, together with the isochrones from the Padova database, give the age of the cluster as 5 ± 1 Myr.« less
NASA Astrophysics Data System (ADS)
Wagner, Judson
Today's technology driven global economy has put pressure on the American education system to produce more students who are prepared for careers in Science, Technology, Engineering, and Math (STEM). Adding to this pressure is the demand for a more diverse workforce that can stimulate the development of new ideas and innovation. This in turn requires more female and under represented minority groups to pursue future careers in STEM. Though STEM careers include many of the highest paid professionals, school systems are dealing with exceptionally high numbers of students, especially female and under represented minorities, who begin but do not persist to STEM degree completion. Using the Expectancy-Value Theory (EVT) framework that attributes student motivation to a combination of intrinsic, utility, and attainment values, this study analyzed readily available survey data to gauge students' career related values. These values were indirectly investigated through a longitudinal approach, spanning five years, on the predictive nature of 8 th grade survey-derived recommendations for students to pursue a future in a particular career cluster. Using logistic regression analysis, it was determined that this 8 th grade data, particularly in STEM, provides significantly high probabilities of a 12th grader's average grade, SAT-Math score, the math and science elective courses they take, and most importantly, interest in the same career cluster.
Phase space theory of evaporation in neon clusters: the role of quantum effects.
Calvo, F; Parneix, P
2009-12-31
Unimolecular evaporation of neon clusters containing between 14 and 148 atoms is theoretically investigated in the framework of phase space theory. Quantum effects are incorporated in the vibrational densities of states, which include both zero-point and anharmonic contributions, and in the possible tunneling through the centrifugal barrier. The evaporation rates, kinetic energy released, and product angular momentum are calculated as a function of excess energy or temperature in the parent cluster and compared to the classical results. Quantum fluctuations are found to generally increase both the kinetic energy released and the angular momentum of the product, but the effects on the rate constants depend nontrivially on the excess energy. These results are interpreted as due to the very few vibrational states available in the product cluster when described quantum mechanically. Because delocalization also leads to much narrower thermal energy distributions, the variations of evaporation observables as a function of canonical temperature appear much less marked than in the microcanonical ensemble. While quantum effects tend to smooth the caloric curve in the product cluster, the melting phase change clearly keeps a signature on these observables. The microcanonical temperature extracted from fitting the kinetic energy released distribution using an improved Arrhenius form further suggests a backbending in the quantum Ne(13) cluster that is absent in the classical system. Finally, in contrast to delocalization effects, quantum tunneling through the centrifugal barrier does not play any appreciable role on the evaporation kinetics of these rather heavy clusters.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Manz, Boryana N.; Jackson, Bryan L.; Petit, Rebecca S.
2011-05-31
T cells react to extremely small numbers of activating agonist peptides. Spatial organization of T-cell receptors (TCR) and their peptide-major histocompatibility complex (pMHC) ligands into microclusters is correlated with T-cell activation. In this study, we have designed an experimental strategy that enables control over the number of agonist peptides per TCR cluster, without altering the total number engaged by the cell. Supported membranes, partitioned with grids of barriers to lateral mobility, provide an effective way of limiting the total number of pMHC ligands that may be assembled within a single TCR cluster. Observations directly reveal that restriction of pMHC contentmore » within individual TCR clusters can decrease T-cell sensitivity for triggering initial calcium flux at fixed total pMHC density. Further analysis suggests that triggering thresholds are determined by the number of activating ligands available to individual TCR clusters, not by the total number encountered by the cell. Results from a series of experiments in which the overall agonist density and the maximum number of agonist per TCR cluster are independently varied in primary T cells indicate that the most probable minimal triggering unit for calcium signaling is at least four pMHC in a single cluster for this system. In conclusion, this threshold is unchanged by inclusion of coagonist pMHC, but costimulation of CD28 by CD80 can modulate the threshold lower.« less
Understanding the evolution of luminescent gold quantum clusters in protein templates.
Chaudhari, Kamalesh; Xavier, Paulrajpillai Lourdu; Pradeep, Thalappil
2011-11-22
We show that the time-dependent biomineralization of Au(3+) by native lactoferrin (NLf) and bovine serum albumin (BSA) resulting in near-infrared (NIR) luminescent gold quantum clusters (QCs) occurs through a protein-bound Au(1+) intermediate and subsequent emergence of free protein. The evolution was probed by diverse tools, principally, using matrix-assisted laser desorption ionization mass spectrometry (MALDI MS), X-ray photoelectron spectroscopy (XPS), and photoluminescence spectroscopy (PL). The importance of alkaline pH in the formation of clusters was probed. At neutral pH, a Au(1+)-protein complex was formed (starting from Au(3+)) with the binding of 13-14 gold atoms per protein. When the pH was increased above 12, these bound gold ions were further reduced to Au(0) and nucleation and growth of cluster commenced, which was corroborated by the beginning of emission; at this point, the number of gold atoms per protein was ~25, suggesting the formation of Au(25). During the cluster evolution, at certain time intervals, for specific molar ratios of gold and protein, occurrence of free protein was noticed in the mass spectra, suggesting a mixture of products and gold ion redistribution. By providing gold ions at specific time of the reaction, monodispersed clusters with enhanced luminescence could be obtained, and the available quantity of free protein could be utilized efficiently. Monodispersed clusters would be useful in obtaining single crystals of protein-protected noble metal quantum clusters where homogeneity of the system is of primary concern. © 2011 American Chemical Society
DOE Office of Scientific and Technical Information (OSTI.GOV)
Heidelberg, S T; Fitzgerald, K J; Richmond, G H
2006-01-24
There has been substantial development of the Lustre parallel filesystem prior to the configuration described below for this milestone. The initial Lustre filesystems that were deployed were directly connected to the cluster interconnect, i.e. Quadrics Elan3. That is, the clients (OSSes) and Meta-data Servers (MDS) were all directly connected to the cluster's internal high speed interconnect. This configuration serves a single cluster very well, but does not provide sharing of the filesystem among clusters. LLNL funded the development of high-efficiency ''portals router'' code by CFS (the company that develops Lustre) to enable us to move the Lustre servers to amore » GigE-connected network configuration, thus making it possible to connect to the servers from several clusters. With portals routing available, here is what changes: (1) another storage-only cluster is deployed to front the Lustre storage devices (these become the Lustre OSSes and MDS), (2) this ''Lustre cluster'' is attached via GigE connections to a large GigE switch/router cloud, (3) a small number of compute-cluster nodes are designated as ''gateway'' or ''portal router'' nodes, and (4) the portals router nodes are GigE-connected to the switch/router cloud. The Lustre configuration is then changed to reflect the new network paths. A typical example of this is a compute cluster and a related visualization cluster: the compute cluster produces the data (writes it to the Lustre filesystem), and the visualization cluster consumes some of the data (reads it from the Lustre filesystem). This process can be expanded by aggregating several collections of Lustre backend storage resources into one or more ''centralized'' Lustre filesystems, and then arranging to have several ''client'' clusters mount these centralized filesystems. The ''client clusters'' can be any combination of compute, visualization, archiving, or other types of cluster. This milestone demonstrates the operation and performance of a scaled-down version of such a large, centralized, shared Lustre filesystem concept.« less
Distant Galaxy Clusters Hosting Extreme Central Galaxies
NASA Astrophysics Data System (ADS)
McDonald, Michael
2014-09-01
The recently-discovered Phoenix cluster harbors the most star-forming central cluster galaxy of any cluster in the known Universe, by nearly a factor of 10. This extreme system appears to be fulfilling early cooling flow predictions, although the lack of similar systems makes any interpretation difficult. In an attempt to find other "Phoenix-like" clusters, we have cross-correlated archival all-sky surveys (in which Phoenix was detected) and isolated 4 similarly-extreme systems which are also coincident in position and redshift with an overdensity of red galaxies. We propose here to obtain Chandra observations of these extreme, Phoenix-like systems, in order to confirm them as relaxed, rapidly-cooling galaxy clusters.
Sun, Yueqi; Luo, Xi; Li, Huabin
2014-01-01
Background Although allergen specific immunotherapy (SIT) represents the only immune- modifying and curative option available for patients with allergic rhinitis (AR), the optimal schedule for specific subcutaneous immunotherapy (SCIT) is still unknown. The objective of this study is to systematically assess the efficacy and safety of cluster SCIT for patients with AR. Methods By searching PubMed, EMBASE and the Cochrane clinical trials database from 1980 through May 10th, 2013, we collected and analyzed the randomized controlled trials (RCTs) of cluster SCIT to assess its efficacy and safety. Results Eight trials involving 567 participants were included in this systematic review. Our meta-analysis showed that cluster SCIT have similar effect in reduction of both rhinitis symptoms and the requirement for anti-allergic medication compared with conventional SCIT, but when comparing cluster SCIT with placebo, no statistic significance were found in reduction of symptom scores or medication scores. Some caution is required in this interpretation as there was significant heterogeneity between studies. Data relating to Rhinoconjunctivitis Quality of Life Questionnaire (RQLQ) in 3 included studies were analyzed, which consistently point to the efficacy of cluster SCIT in improving quality of life compared to placebo. To assess the safety of cluster SCIT, meta-analysis showed that no differences existed in the incidence of either local adverse reaction or systemic adverse reaction between the cluster group and control group. Conclusion Based on the current limited evidence, we still could not conclude affirmatively that cluster SCIT was a safe and efficacious option for the treatment of AR patients. Further large-scale, well-designed RCTs on this topic are still needed. PMID:24489740
NASA Technical Reports Server (NTRS)
Bonamente, Massimiliano; Joy, Marshall K.; Carlstrom, John E.; LaRoque, Samuel J.
2004-01-01
X-ray and Sunyaev-Zeldovich Effect data ca,n be combined to determine the distance to galaxy clusters. High-resolution X-ray data are now available from the Chandra Observatory, which provides both spatial and spectral information, and interferometric radio measurements of the Sunyam-Zeldovich Effect are available from the BIMA and 0VR.O arrays. We introduce a Monte Carlo Markov chain procedure for the joint analysis of X-ray and Sunyaev-Zeldovich Effect data. The advantages of this method are the high computational efficiency and the ability to measure the full probability distribution of all parameters of interest, such as the spatial and spectral properties of the cluster gas and the cluster distance. We apply this technique to the Chandra X-ray data and the OVRO radio data for the galaxy cluster Abell 611. Comparisons with traditional likelihood-ratio methods reveal the robustness of the method. This method will be used in a follow-up paper to determine the distance of a large sample of galaxy clusters for which high-resolution Chandra X-ray and BIMA/OVRO radio data are available.
ERIC Educational Resources Information Center
Georgia Univ., Athens. Div. of Vocational Education.
This booklet lists tasks and functions the student in the transportation cluster should be able to do upon entering an employment situation or a postsecondary school. (Listings are also available for the areas of allied health occupations/practical nursing and cosmetology.) Tasks are coded to correspond to those on the Student Performance Record,…
Scoping Report: AI-Driven Wargame Replicator
2010-12-01
Evans, 2003 Without training involving external input Responsive to verbal instructions Clark & Karmiloff-Smith, 1993 Associative Rule-based Sloman...Applied to Clustering. Online available on January 21, 2010, at http://laboratorios.fi.uba.ar/lsi/rgm/ comunicaciones /c-AGsclustering-ORLANDO96.pdf...systems for associative recall and recognition. Psychological Review, 91:281-294. [214] Polk, T.A. and Newell, A. (1995). Deduction as verbal reasoning
MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems.
González-Domínguez, Jorge; Liu, Yongchao; Touriño, Juan; Schmidt, Bertil
2016-12-15
MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-scale input datasets. In this work we present MSAProbs-MPI, a distributed-memory parallel version of the multithreaded MSAProbs tool that is able to reduce runtimes by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on a cluster with 32 nodes (each containing two Intel Haswell processors) shows reductions in execution time of over one order of magnitude for typical input datasets. Furthermore, MSAProbs-MPI using eight nodes is faster than the GPU-accelerated QuickProbs running on a Tesla K20. Another strong point is that MSAProbs-MPI can deal with large datasets for which MSAProbs and QuickProbs might fail due to time and memory constraints, respectively. Source code in C ++ and MPI running on Linux systems as well as a reference manual are available at http://msaprobs.sourceforge.net CONTACT: jgonzalezd@udc.esSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
XRootD popularity on hadoop clusters
NASA Astrophysics Data System (ADS)
Meoni, Marco; Boccali, Tommaso; Magini, Nicolò; Menichetti, Luca; Giordano, Domenico;
2017-10-01
Performance data and metadata of the computing operations at the CMS experiment are collected through a distributed monitoring infrastructure, currently relying on a traditional Oracle database system. This paper shows how to harness Big Data architectures in order to improve the throughput and the efficiency of such monitoring. A large set of operational data - user activities, job submissions, resources, file transfers, site efficiencies, software releases, network traffic, machine logs - is being injected into a readily available Hadoop cluster, via several data streamers. The collected metadata is further organized running fast arbitrary queries; this offers the ability to test several Map&Reduce-based frameworks and measure the system speed-up when compared to the original database infrastructure. By leveraging a quality Hadoop data store and enabling an analytics framework on top, it is possible to design a mining platform to predict dataset popularity and discover patterns and correlations.
Reconstruction of cluster masses using particle based lensing
NASA Astrophysics Data System (ADS)
Deb, Sanghamitra
Clusters of galaxies are among the richest astrophysical data systems, but to truly understand these systems, we need a detailed study of the relationship between observables and the underlying cluster dark matter distribution. Gravitational lensing is the most direct probe of dark matter, but many mass reconstruction techniques assume that cluster light traces mass, or combine different lensing signals in an ad hoc way. In this talk, we will describe "Particle Based Lensing" (PBL), a new method for cluster mass reconstruction, that avoids many of the pitfalls of previous techniques. PBL optimally combines lensing information of varying signal-to-noise, and makes no assumptions about the relationship between mass and light. We will describe mass reconstructions in three very different, but very illuminating cluster systems: the "Bullet Cluster" (lE 0657-56), A901/902 and A1689. The "Bullet Cluster" is a system of merging clusters made famous by the first unambiguous lensing detection of dark matter. A901/902 is a multi-cluster system with four peaks, and provides an ideal laboratory for studying cluster interaction. We are particularly interested in measuring and correlating the dark matter clump ellipticities. A1689 is one of the richest clusters known, and has significant substructure at the core. It is also my first exercise in optimally combining weak and strong gravitational lensing in a cluster reconstruction. We find that the dark matter distribution is significantly clumpier than indicated by X-ray maps of the gas. We conclude by discussing various potential applications of PBL to existing and future data.
Pentsak, E. O.; Kashin, A. S.; Polynski, M. V.; Kvashnina, K. O.; Glatzel, P.
2015-01-01
Gaining insight into Pd/C catalytic systems aimed at locating reactive centers on carbon surfaces, revealing their properties and estimating the number of reactive centers presents a challenging problem. In the present study state-of-the-art experimental techniques involving ultra high resolution SEM/STEM microscopy (1 Å resolution), high brilliance X-ray absorption spectroscopy and theoretical calculations on truly nanoscale systems were utilized to reveal the role of carbon centers in the formation and nature of Pd/C catalytic materials. Generation of Pd clusters in solution from the easily available Pd2dba3 precursor and the unique reactivity of the Pd clusters opened an excellent opportunity to develop an efficient procedure for the imaging of a carbon surface. Defect sites and reactivity centers of a carbon surface were mapped in three-dimensional space with high resolution and excellent contrast using a user-friendly nanoscale imaging procedure. The proposed imaging approach takes advantage of the specific interactions of reactive carbon centers with Pd clusters, which allows spatial information about chemical reactivity across the Pd/C system to be obtained using a microscopy technique. Mapping the reactivity centers with Pd markers provided unique information about the reactivity of the graphene layers and showed that >2000 reactive centers can be located per 1 μm2 of the surface area of the carbon material. A computational study at a PBE-D3-GPW level differentiated the relative affinity of the Pd2 species to the reactive centers of graphene. These findings emphasized the spatial complexity of the carbon material at the nanoscale and indicated the importance of the surface defect nature, which exhibited substantial gradients and variations across the surface area. The findings show the crucial role of the structure of the carbon support, which governs the formation of Pd/C systems and their catalytic activity. PMID:29511504
Optimal Partitioning of a Data Set Based on the "p"-Median Model
ERIC Educational Resources Information Center
Brusco, Michael J.; Kohn, Hans-Friedrich
2008-01-01
Although the "K"-means algorithm for minimizing the within-cluster sums of squared deviations from cluster centroids is perhaps the most common method for applied cluster analyses, a variety of other criteria are available. The "p"-median model is an especially well-studied clustering problem that requires the selection of "p" objects to serve as…
NASA Astrophysics Data System (ADS)
Neilson, Hunter L.
The Reactivity and Structure of Size Selected VxOy Clusters on a TiO2 (110) Surface of Variable Oxidation State by Hunter L Neilson The selective oxidative dehydrogenation of methanol by vanadium oxide/TiO2 model systems has received a great deal of interest in the surface science community. Previous studies using temperature programmed desorption and reaction (TPD/R) to probe the oxidation of methanol to formaldehyde by vanadia/TiO2 model catalysts have shown that the activity of these systems vary considerably based on the way in which the model system is prepared with formaldehyde desorption temperatures observed anywhere from room temperature to 660 K. The principle reason for this variation is that the preparation of sub-monolayer films of vanadia on TiO2 produces clusters with a multitude of VxOy structures and a mixture of vanadium oxidation states. As a result the stoichiometry of the active vanadium oxide catalyst as well as the oxidation state of vanadium in the active catalyst remain unknown. To better understand this system, our group has probed the reactivity and structure of size-selected Vx, VOy and VxOy clusters on a reduced TiO2 (110) support in ultra-high vacuum (UHV) via TPD/R and scanning tunneling microscopy (STM). Ex situ preparation of these clusters in the gas phase prior to deposition has allowed us to systematically vary the stoichiometry of the vanadia clusters; a layer of control not available via the usual routes to vanadium oxide. The most active catalysts are shown to have (VO3)n stoichiometry in agreement with the theoretical models of the Metiu group. We have shown that both the activity and selectivity of V2O6 and V3O9 cluster catalysts depend sensitively on the oxidation state of the TiO2 (110) support. For example, V2O6 on a reduced surface is selective for the oxidation of methanol to formaldehyde while the selectivity shifts to favor methyl formate as the surface becomes increasingly oxidized. STM studies show that the structure of size-selected V2O6 clusters, upon adsorption to the surface, varies considerably with the oxidation state of the support, in good agreement with our reactivity studies. V 3O9 was shown to catalyze the oxidation of methanol to both formaldehyde and methyl formate on a reduced surface while STM suggests that, unlike V2O6, these clusters are prone to decomposition upon adsorption to the surface. Furthermore, TPD/R of size selected V 2O5 and V2O7 on TiO2 suggests that altering the stoichiometry of the (VO3)n clusters by a single oxygen atom significantly inhibits the activity of these catalysts.
Comparative Microbial Modules Resource: Generation and Visualization of Multi-species Biclusters
Bate, Ashley; Eichenberger, Patrick; Bonneau, Richard
2011-01-01
The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures – results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation. PMID:22144874
Comparative microbial modules resource: generation and visualization of multi-species biclusters.
Kacmarczyk, Thadeous; Waltman, Peter; Bate, Ashley; Eichenberger, Patrick; Bonneau, Richard
2011-12-01
The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures - results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation. © 2011 Kacmarczyk et al.
Extracting the Textual and Temporal Structure of Supercomputing Logs
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jain, S; Singh, I; Chandra, A
2009-05-26
Supercomputers are prone to frequent faults that adversely affect their performance, reliability and functionality. System logs collected on these systems are a valuable resource of information about their operational status and health. However, their massive size, complexity, and lack of standard format makes it difficult to automatically extract information that can be used to improve system management. In this work we propose a novel method to succinctly represent the contents of supercomputing logs, by using textual clustering to automatically find the syntactic structures of log messages. This information is used to automatically classify messages into semantic groups via an onlinemore » clustering algorithm. Further, we describe a methodology for using the temporal proximity between groups of log messages to identify correlated events in the system. We apply our proposed methods to two large, publicly available supercomputing logs and show that our technique features nearly perfect accuracy for online log-classification and extracts meaningful structural and temporal message patterns that can be used to improve the accuracy of other log analysis techniques.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thanjavur, Karun; Willis, Jon; Crampton, David, E-mail: karun@uvic.c
2009-11-20
We have developed a new method, K2, optimized for the detection of galaxy clusters in multicolor images. Based on the Red Sequence approach, K2 detects clusters using simultaneous enhancements in both colors and position. The detection significance is robustly determined through extensive Monte Carlo simulations and through comparison with available cluster catalogs based on two different optical methods, and also on X-ray data. K2 also provides quantitative estimates of the candidate clusters' richness and photometric redshifts. Initially, K2 was applied to the two color (gri) 161 deg{sup 2} images of the Canada-France-Hawaii Telescope Legacy Survey Wide (CFHTLS-W) data. Our simulationsmore » show that the false detection rate for these data, at our selected threshold, is only approx1%, and that the cluster catalogs are approx80% complete up to a redshift of z = 0.6 for Fornax-like and richer clusters and to z approx 0.3 for poorer clusters. Based on the g-, r-, and i-band photometric catalogs of the Terapix T05 release, 35 clusters/deg{sup 2} are detected, with 1-2 Fornax-like or richer clusters every 2 deg{sup 2}. Catalogs containing data for 6144 galaxy clusters have been prepared, of which 239 are rich clusters. These clusters, especially the latter, are being searched for gravitational lenses-one of our chief motivations for cluster detection in CFHTLS. The K2 method can be easily extended to use additional color information and thus improve overall cluster detection to higher redshifts. The complete set of K2 cluster catalogs, along with the supplementary catalogs for the member galaxies, are available on request from the authors.« less
NASA Technical Reports Server (NTRS)
Iverson, David L. (Inventor)
2008-01-01
The present invention relates to an Inductive Monitoring System (IMS), its software implementations, hardware embodiments and applications. Training data is received, typically nominal system data acquired from sensors in normally operating systems or from detailed system simulations. The training data is formed into vectors that are used to generate a knowledge database having clusters of nominal operating regions therein. IMS monitors a system's performance or health by comparing cluster parameters in the knowledge database with incoming sensor data from a monitored-system formed into vectors. Nominal performance is concluded when a monitored-system vector is determined to lie within a nominal operating region cluster or lies sufficiently close to a such a cluster as determined by a threshold value and a distance metric. Some embodiments of IMS include cluster indexing and retrieval methods that increase the execution speed of IMS.
Couvin, David; Bernheim, Aude; Toffano-Nioche, Claire; Touchon, Marie; Michalik, Juraj; Néron, Bertrand; C Rocha, Eduardo P; Vergnaud, Gilles; Gautheret, Daniel; Pourcel, Christine
2018-05-22
CRISPR (clustered regularly interspaced short palindromic repeats) arrays and their associated (Cas) proteins confer bacteria and archaea adaptive immunity against exogenous mobile genetic elements, such as phages or plasmids. CRISPRCasFinder allows the identification of both CRISPR arrays and Cas proteins. The program includes: (i) an improved CRISPR array detection tool facilitating expert validation based on a rating system, (ii) prediction of CRISPR orientation and (iii) a Cas protein detection and typing tool updated to match the latest classification scheme of these systems. CRISPRCasFinder can either be used online or as a standalone tool compatible with Linux operating system. All third-party software packages employed by the program are freely available. CRISPRCasFinder is available at https://crisprcas.i2bc.paris-saclay.fr.
Cascading failure in scale-free networks with tunable clustering
NASA Astrophysics Data System (ADS)
Zhang, Xue-Jun; Gu, Bo; Guan, Xiang-Min; Zhu, Yan-Bo; Lv, Ren-Li
2016-02-01
Cascading failure is ubiquitous in many networked infrastructure systems, such as power grids, Internet and air transportation systems. In this paper, we extend the cascading failure model to a scale-free network with tunable clustering and focus on the effect of clustering coefficient on system robustness. It is found that the network robustness undergoes a nonmonotonic transition with the increment of clustering coefficient: both highly and lowly clustered networks are fragile under the intentional attack, and the network with moderate clustering coefficient can better resist the spread of cascading. We then provide an extensive explanation for this constructive phenomenon via the microscopic point of view and quantitative analysis. Our work can be useful to the design and optimization of infrastructure systems.
C60 fullerene localization and membrane interactions in RAW 264.7 immortalized mouse macrophages
NASA Astrophysics Data System (ADS)
Russ, K. A.; Elvati, P.; Parsonage, T. L.; Dews, A.; Jarvis, J. A.; Ray, M.; Schneider, B.; Smith, P. J. S.; Williamson, P. T. F.; Violi, A.; Philbert, M. A.
2016-02-01
There continues to be a significant increase in the number and complexity of hydrophobic nanomaterials that are engineered for a variety of commercial purposes making human exposure a significant health concern. This study uses a combination of biophysical, biochemical and computational methods to probe potential mechanisms for uptake of C60 nanoparticles into various compartments of living immune cells. Cultures of RAW 264.7 immortalized murine macrophage were used as a canonical model of immune-competent cells that are likely to provide the first line of defense following inhalation. Modes of entry studied were endocytosis/pinocytosis and passive permeation of cellular membranes. The evidence suggests marginal uptake of C60 clusters is achieved through endocytosis/pinocytosis, and that passive diffusion into membranes provides a significant source of biologically-available nanomaterial. Computational modeling of both a single molecule and a small cluster of fullerenes predicts that low concentrations of fullerenes enter the membrane individually and produce limited perturbation; however, at higher concentrations the clusters in the membrane causes deformation of the membrane. These findings are bolstered by nuclear magnetic resonance (NMR) of model membranes that reveal deformation of the cell membrane upon exposure to high concentrations of fullerenes. The atomistic and NMR models fail to explain escape of the particle out of biological membranes, but are limited to idealized systems that do not completely recapitulate the complexity of cell membranes. The surprising contribution of passive modes of cellular entry provides new avenues for toxicological research that go beyond the pharmacological inhibition of bulk transport systems such as pinocytosis.There continues to be a significant increase in the number and complexity of hydrophobic nanomaterials that are engineered for a variety of commercial purposes making human exposure a significant health concern. This study uses a combination of biophysical, biochemical and computational methods to probe potential mechanisms for uptake of C60 nanoparticles into various compartments of living immune cells. Cultures of RAW 264.7 immortalized murine macrophage were used as a canonical model of immune-competent cells that are likely to provide the first line of defense following inhalation. Modes of entry studied were endocytosis/pinocytosis and passive permeation of cellular membranes. The evidence suggests marginal uptake of C60 clusters is achieved through endocytosis/pinocytosis, and that passive diffusion into membranes provides a significant source of biologically-available nanomaterial. Computational modeling of both a single molecule and a small cluster of fullerenes predicts that low concentrations of fullerenes enter the membrane individually and produce limited perturbation; however, at higher concentrations the clusters in the membrane causes deformation of the membrane. These findings are bolstered by nuclear magnetic resonance (NMR) of model membranes that reveal deformation of the cell membrane upon exposure to high concentrations of fullerenes. The atomistic and NMR models fail to explain escape of the particle out of biological membranes, but are limited to idealized systems that do not completely recapitulate the complexity of cell membranes. The surprising contribution of passive modes of cellular entry provides new avenues for toxicological research that go beyond the pharmacological inhibition of bulk transport systems such as pinocytosis. Electronic supplementary information (ESI) available. See DOI: 10.1039/c5nr07003a
Planck 2013 results. XXXII. The updated Planck catalogue of Sunyaev-Zeldovich sources
NASA Astrophysics Data System (ADS)
Planck Collaboration; Ade, P. A. R.; Aghanim, N.; Armitage-Caplan, C.; Arnaud, M.; Ashdown, M.; Atrio-Barandela, F.; Aumont, J.; Aussel, H.; Baccigalupi, C.; Banday, A. J.; Barreiro, R. B.; Barrena, R.; Bartelmann, M.; Bartlett, J. G.; Battaner, E.; Benabed, K.; Benoît, A.; Benoit-Lévy, A.; Bernard, J.-P.; Bersanelli, M.; Bielewicz, P.; Bikmaev, I.; Bobin, J.; Bock, J. J.; Böhringer, H.; Bonaldi, A.; Bond, J. R.; Borrill, J.; Bouchet, F. R.; Bridges, M.; Bucher, M.; Burenin, R.; Burigana, C.; Butler, R. C.; Cardoso, J.-F.; Carvalho, P.; Catalano, A.; Challinor, A.; Chamballu, A.; Chary, R.-R.; Chen, X.; Chiang, H. C.; Chiang, L.-Y.; Chon, G.; Christensen, P. R.; Churazov, E.; Church, S.; Clements, D. L.; Colombi, S.; Colombo, L. P. L.; Comis, B.; Couchot, F.; Coulais, A.; Crill, B. P.; Curto, A.; Cuttaia, F.; Da Silva, A.; Dahle, H.; Danese, L.; Davies, R. D.; Davis, R. J.; de Bernardis, P.; de Rosa, A.; de Zotti, G.; Delabrouille, J.; Delouis, J.-M.; Démoclès, J.; Désert, F.-X.; Dickinson, C.; Diego, J. M.; Dolag, K.; Dole, H.; Donzelli, S.; Doré, O.; Douspis, M.; Dupac, X.; Efstathiou, G.; Enßlin, T. A.; Eriksen, H. K.; Feroz, F.; Ferragamo, A.; Finelli, F.; Flores-Cacho, I.; Forni, O.; Frailis, M.; Franceschi, E.; Fromenteau, S.; Galeotta, S.; Ganga, K.; Génova-Santos, R. T.; Giard, M.; Giardino, G.; Gilfanov, M.; Giraud-Héraud, Y.; González-Nuevo, J.; Górski, K. M.; Grainge, K. J. B.; Gratton, S.; Gregorio, A.; Groeneboom, N., E.; Gruppuso, A.; Hansen, F. K.; Hanson, D.; Harrison, D.; Hempel, A.; Henrot-Versillé, S.; Hernández-Monteagudo, C.; Herranz, D.; Hildebrandt, S. R.; Hivon, E.; Hobson, M.; Holmes, W. A.; Hornstrup, A.; Hovest, W.; Huffenberger, K. M.; Hurier, G.; Hurley-Walker, N.; Jaffe, A. H.; Jaffe, T. R.; Jones, W. C.; Juvela, M.; Keihänen, E.; Keskitalo, R.; Khamitov, I.; Kisner, T. S.; Kneissl, R.; Knoche, J.; Knox, L.; Kunz, M.; Kurki-Suonio, H.; Lagache, G.; Lähteenmäki, A.; Lamarre, J.-M.; Lasenby, A.; Laureijs, R. J.; Lawrence, C. R.; Leahy, J. P.; Leonardi, R.; León-Tavares, J.; Lesgourgues, J.; Li, C.; Liddle, A.; Liguori, M.; Lilje, P. B.; Linden-Vørnle, M.; López-Caniego, M.; Lubin, P. M.; Macías-Pérez, J. F.; MacTavish, C. J.; Maffei, B.; Maino, D.; Mandolesi, N.; Maris, M.; Marshall, D. J.; Martin, P. G.; Martínez-González, E.; Masi, S.; Massardi, M.; Matarrese, S.; Matthai, F.; Mazzotta, P.; Mei, S.; Meinhold, P. R.; Melchiorri, A.; Melin, J.-B.; Mendes, L.; Mennella, A.; Migliaccio, M.; Mikkelsen, K.; Mitra, S.; Miville-Deschênes, M.-A.; Moneti, A.; Montier, L.; Morgante, G.; Mortlock, D.; Munshi, D.; Murphy, J. A.; Naselsky, P.; Nastasi, A.; Nati, F.; Natoli, P.; Nesvadba, N. P. H.; Netterfield, C. B.; Nørgaard-Nielsen, H. U.; Noviello, F.; Novikov, D.; Novikov, I.; O'Dwyer, I. J.; Olamaie, M.; Osborne, S.; Oxborrow, C. A.; Paci, F.; Pagano, L.; Pajot, F.; Paoletti, D.; Pasian, F.; Patanchon, G.; Pearson, T. J.; Perdereau, O.; Perotto, L.; Perrott, Y. C.; Perrotta, F.; Piacentini, F.; Piat, M.; Pierpaoli, E.; Pietrobon, D.; Plaszczynski, S.; Pointecouteau, E.; Polenta, G.; Ponthieu, N.; Popa, L.; Poutanen, T.; Pratt, G. W.; Prézeau, G.; Prunet, S.; Puget, J.-L.; Rachen, J. P.; Reach, W. T.; Rebolo, R.; Reinecke, M.; Remazeilles, M.; Renault, C.; Ricciardi, S.; Riller, T.; Ristorcelli, I.; Rocha, G.; Rosset, C.; Roudier, G.; Rowan-Robinson, M.; Rubiño-Martín, J. A.; Rumsey, C.; Rusholme, B.; Sandri, M.; Santos, D.; Saunders, R. D. E.; Savini, G.; Schammel, M. P.; Scott, D.; Seiffert, M. D.; Shellard, E. P. S.; Shimwell, T. W.; Spencer, L. D.; Starck, J.-L.; Stolyarov, V.; Stompor, R.; Streblyanska, A.; Sudiwala, R.; Sunyaev, R.; Sureau, F.; Sutton, D.; Suur-Uski, A.-S.; Sygnet, J.-F.; Tauber, J. A.; Tavagnacco, D.; Terenzi, L.; Toffolatti, L.; Tomasi, M.; Tramonte, D.; Tristram, M.; Tucci, M.; Tuovinen, J.; Türler, M.; Umana, G.; Valenziano, L.; Valiviita, J.; Van Tent, B.; Vibert, L.; Vielva, P.; Villa, F.; Vittorio, N.; Wade, L. A.; Wandelt, B. D.; White, M.; White, S. D. M.; Yvon, D.; Zacchei, A.; Zonca, A.
2015-09-01
We update the all-sky Planck catalogue of 1227 clusters and cluster candidates (PSZ1) published in March 2013, derived from detections of the Sunyaev-Zeldovich (SZ) effect using the first 15.5 months of Planck satellite observations. As an addendum, we deliver an updated version of the PSZ1 catalogue, reporting the further confirmation of 86 Planck-discovered clusters. In total, the PSZ1 now contains 947 confirmed clusters, of which 214 were confirmed as newly discovered clusters through follow-up observations undertaken by the Planck Collaboration. The updated PSZ1 contains redshifts for 913 systems, of which 736 (~ 80.6%) are spectroscopic, and associated mass estimates derived from the Yz mass proxy. We also provide a new SZ quality flag for the remaining 280 candidates. This flag was derived from a novel artificial neural-network classification of the SZ signal. Based on this assessment, the purity of the updated PSZ1 catalogue is estimated to be 94%. In this release, we provide the full updated catalogue and an additional readme file with further information on the Planck SZ detections. The catalogue is only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (ftp://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/581/A14
The signatures of the parental cluster on field planetary systems
NASA Astrophysics Data System (ADS)
Cai, Maxwell Xu; Portegies Zwart, Simon; van Elteren, Arjen
2018-03-01
Due to the high stellar densities in young clusters, planetary systems formed in these environments are likely to have experienced perturbations from encounters with other stars. We carry out direct N-body simulations of multiplanet systems in star clusters to study the combined effects of stellar encounters and internal planetary dynamics. These planetary systems eventually become part of the Galactic field population as the parental cluster dissolves, which is where most presently known exoplanets are observed. We show that perturbations induced by stellar encounters lead to distinct signatures in the field planetary systems, most prominently, the excited orbital inclinations and eccentricities. Planetary systems that form within the cluster's half-mass radius are more prone to such perturbations. The orbital elements are most strongly excited in the outermost orbit, but the effect propagates to the entire planetary system through secular evolution. Planet ejections may occur long after a stellar encounter. The surviving planets in these reduced systems tend to have, on average, higher inclinations and larger eccentricities compared to systems that were perturbed less strongly. As soon as the parental star cluster dissolves, external perturbations stop affecting the escaped planetary systems, and further evolution proceeds on a relaxation time-scale. The outer regions of these ejected planetary systems tend to relax so slowly that their state carries the memory of their last strong encounter in the star cluster. Regardless of the stellar density, we observe a robust anticorrelation between multiplicity and mean inclination/eccentricity. We speculate that the `Kepler dichotomy' observed in field planetary systems is a natural consequence of their early evolution in the parental cluster.
Merging history of three bimodal clusters
NASA Astrophysics Data System (ADS)
Maurogordato, S.; Sauvageot, J. L.; Bourdin, H.; Cappi, A.; Benoist, C.; Ferrari, C.; Mars, G.; Houairi, K.
2011-01-01
We present a combined X-ray and optical analysis of three bimodal galaxy clusters selected as merging candidates at z ~ 0.1. These targets are part of MUSIC (MUlti-Wavelength Sample of Interacting Clusters), which is a general project designed to study the physics of merging clusters by means of multi-wavelength observations. Observations include spectro-imaging with XMM-Newton EPIC camera, multi-object spectroscopy (260 new redshifts), and wide-field imaging at the ESO 3.6 m and 2.2 m telescopes. We build a global picture of these clusters using X-ray luminosity and temperature maps together with galaxy density and velocity distributions. Idealized numerical simulations were used to constrain the merging scenario for each system. We show that A2933 is very likely an equal-mass advanced pre-merger ~200 Myr before the core collapse, while A2440 and A2384 are post-merger systems (~450 Myr and ~1.5 Gyr after core collapse, respectively). In the case of A2384, we detect a spectacular filament of galaxies and gas spreading over more than 1 h-1 Mpc, which we infer to have been stripped during the previous collision. The analysis of the MUSIC sample allows us to outline some general properties of merging clusters: a strong luminosity segregation of galaxies in recent post-mergers; the existence of preferential axes - corresponding to the merging directions - along which the BCGs and structures on various scales are aligned; the concomitance, in most major merger cases, of secondary merging or accretion events, with groups infalling onto the main cluster, and in some cases the evidence of previous merging episodes in one of the main components. These results are in good agreement with the hierarchical scenario of structure formation, in which clusters are expected to form by successive merging events, and matter is accreted along large-scale filaments. Based on data obtained with the European Southern Observatory, Chile (programs 072.A-0595, 075.A-0264, and 079.A-0425).Tables 5-7 are only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/525/A79
TethysCluster: A comprehensive approach for harnessing cloud resources for hydrologic modeling
NASA Astrophysics Data System (ADS)
Nelson, J.; Jones, N.; Ames, D. P.
2015-12-01
Advances in water resources modeling are improving the information that can be supplied to support decisions affecting the safety and sustainability of society. However, as water resources models become more sophisticated and data-intensive they require more computational power to run. Purchasing and maintaining the computing facilities needed to support certain modeling tasks has been cost-prohibitive for many organizations. With the advent of the cloud, the computing resources needed to address this challenge are now available and cost-effective, yet there still remains a significant technical barrier to leverage these resources. This barrier inhibits many decision makers and even trained engineers from taking advantage of the best science and tools available. Here we present the Python tools TethysCluster and CondorPy, that have been developed to lower the barrier to model computation in the cloud by providing (1) programmatic access to dynamically scalable computing resources, (2) a batch scheduling system to queue and dispatch the jobs to the computing resources, (3) data management for job inputs and outputs, and (4) the ability to dynamically create, submit, and monitor computing jobs. These Python tools leverage the open source, computing-resource management, and job management software, HTCondor, to offer a flexible and scalable distributed-computing environment. While TethysCluster and CondorPy can be used independently to provision computing resources and perform large modeling tasks, they have also been integrated into Tethys Platform, a development platform for water resources web apps, to enable computing support for modeling workflows and decision-support systems deployed as web apps.
Ichikawa, Kazuki; Morishita, Shinichi
2014-01-01
K-means clustering has been widely used to gain insight into biological systems from large-scale life science data. To quantify the similarities among biological data sets, Pearson correlation distance and standardized Euclidean distance are used most frequently; however, optimization methods have been largely unexplored. These two distance measurements are equivalent in the sense that they yield the same k-means clustering result for identical sets of k initial centroids. Thus, an efficient algorithm used for one is applicable to the other. Several optimization methods are available for the Euclidean distance and can be used for processing the standardized Euclidean distance; however, they are not customized for this context. We instead approached the problem by studying the properties of the Pearson correlation distance, and we invented a simple but powerful heuristic method for markedly pruning unnecessary computation while retaining the final solution. Tests using real biological data sets with 50-60K vectors of dimensions 10-2001 (~400 MB in size) demonstrated marked reduction in computation time for k = 10-500 in comparison with other state-of-the-art pruning methods such as Elkan's and Hamerly's algorithms. The BoostKCP software is available at http://mlab.cb.k.u-tokyo.ac.jp/~ichikawa/boostKCP/.
Noninvasive neuromodulation in migraine and cluster headache.
Starling, Amaal
2018-06-01
The purpose of this narrative review is to provide an overview of the currently available noninvasive neuromodulation devices for the treatment of migraine and cluster headache. Over the last decade, several noninvasive devices have undergone development and clinical trials to evaluate efficacy and safety. Based on this body of work, single-pulse transcranial magnetic stimulation, transcutaneous supraorbital neurostimulation, and noninvasive vagal nerve stimulation devices have been cleared by the United States Food and Drug Administration and are available for clinical use for the treatment of primary headache disorders. Overall, these novel noninvasive devices appear to be safe, well tolerated, and have demonstrated promising results in clinical trials in both migraine and cluster headache. This narrative review will provide a summary and update of the proposed mechanisms of action, evidence, safety, and future directions of various currently available modalities of noninvasive neuromodulation for the treatment of migraine and cluster headache.
NASA Astrophysics Data System (ADS)
Savage, J. C.; Simpson, R. W.
2013-09-01
The deformation across the Sierra Nevada Block, the Walker Lane Belt, and the Central Nevada Seismic Belt (CNSB) between 38.5°N and 40.5°N has been analyzed by clustering GPS velocities to identify coherent blocks. Cluster analysis determines the number of clusters required and assigns the GPS stations to the proper clusters. The clusters are shown on a fault map by symbols located at the positions of the GPS stations, each symbol representing the cluster to which the velocity of that GPS station belongs. Fault systems that separate the clusters are readily identified on such a map. Four significant clusters are identified. Those clusters are strips separated by (from west to east) the Mohawk Valley-Genoa fault system, the Pyramid Lake-Wassuk fault system, and the Central Nevada Seismic Belt. The strain rates within the westernmost three clusters approximate simple right-lateral shear (~13 nstrain/a) across vertical planes roughly parallel to the cluster boundaries. Clustering does not recognize the longitudinal segmentation of the Walker Lane Belt into domains dominated by either northwesterly trending, right-lateral faults or northeasterly trending, left-lateral faults.
Savage, James C.; Simpson, Robert W.
2013-01-01
The deformation across the Sierra Nevada Block, the Walker Lane Belt, and the Central Nevada Seismic Belt (CNSB) between 38.5°N and 40.5°N has been analyzed by clustering GPS velocities to identify coherent blocks. Cluster analysis determines the number of clusters required and assigns the GPS stations to the proper clusters. The clusters are shown on a fault map by symbols located at the positions of the GPS stations, each symbol representing the cluster to which the velocity of that GPS station belongs. Fault systems that separate the clusters are readily identified on such a map. Four significant clusters are identified. Those clusters are strips separated by (from west to east) the Mohawk Valley-Genoa fault system, the Pyramid Lake-Wassuk fault system, and the Central Nevada Seismic Belt. The strain rates within the westernmost three clusters approximate simple right-lateral shear (~13 nstrain/a) across vertical planes roughly parallel to the cluster boundaries. Clustering does not recognize the longitudinal segmentation of the Walker Lane Belt into domains dominated by either northwesterly trending, right-lateral faults or northeasterly trending, left-lateral faults.
Dolidze-35: Results for a Possible Open Cluster
NASA Astrophysics Data System (ADS)
Gulledge, Deborah J.; Borges, Richard A.; Juelfs, Elizabeth; Allyn Smith, J.; Olive, Mary E.; McDonald, Christopher P.; Williams, Sarah M.; Cohen, Eden M.; Gawel, Jason D.; McCole, Bambi A.; Robertson, Jacob M.; Wilson, Tyler; Young, William J.; Buckner, Spencer L.; Allen, Nic R.; Head, H. Hope
2016-01-01
Dolidze-35 is an under-observed northern hemisphere open cluster. It is noted in WEBDA as "No data available for this cluster". As such, we chose this cluster as an undergraduate class project to investigate its existence. We present SDSS-ugriz magnitudes for the possible cluster and cross these with existing JHK data obtained from 2MASS. Selection of possible members is aided by the proper motion study of Krone-Martins (2010).
Zodiacal Exoplanets in Time: Searching for Young Stars in K2
NASA Astrophysics Data System (ADS)
Morris, Nathan Ryan; Mann, Andrew; Rizzuto, Aaron
2018-01-01
Observations of planetary systems around young stars provide insight into the early stages of planetary system formation. Nearby young open clusters such as the Hyades, Pleiades, and Praesepe provide important benchmarks for the properties of stellar systems in general. These clusters are all known to be less than 1 Gyr old, making them ideal targets for a survey of young planetary systems. Few transiting planets have been detected around clusters stars, however, so this alone is too small of a sample. K2, the revived Kepler mission, has provided a vast number of light curves for young stars in clusters and elsewhere in the K2 field. This provides us with the opportunity to extend the sample of young systems to field stars while calibrating with cluster stars. We compute rotational periods from starspot patterns for ~36,000 K2 targets and use gyrochronological relationships derived from cluster stars to determine their ages. From there, we have begun searching for planets around young stars outside the clusters with the ultimate goal of shedding light on how planets and planetary systems evolve in their early, most formative years.
The Newly-Discovered Outer Halo Globular Cluster System of M31
NASA Astrophysics Data System (ADS)
Mackey, D.; Huxor, A.; Ferguson, A.
2012-08-01
In this contribution we describe the discovery of a large number of globular clusters in the outer halo of M31 from the Pan-Andromeda Archaeological Survey (PAndAS). New globular clusters have also been found in the outskirts of M33, and NGC 147 and 185. Many of the remote M31 clusters are observed to preferentially project onto tidal debris streams in the stellar halo, suggesting that much of the outer M31 globular cluster system has been assembled via the accretion of satellite galaxies. We briefly discuss the global properties of the M31 halo globular cluster system.
Effects of Non-Clustering of Refugees.
ERIC Educational Resources Information Center
Conick, John E.
1983-01-01
Describes the approach to resettlement for recently arrived refugees implemented within the state of South Carolina. Suggests that non-clustering of refugees leads to quick acculturation if there is wide community support, but that certain services are more readily available when refugees are clustered. (GC)
Greenhouse tomato limited cluster production systems: crop management practices affect yield
NASA Technical Reports Server (NTRS)
Logendra, L. S.; Gianfagna, T. J.; Specca, D. R.; Janes, H. W.
2001-01-01
Limited-cluster production systems may be a useful strategy to increase crop production and profitability for the greenhouse tomato (Lycopersicon esculentum Mill). In this study, using an ebb-and-flood hydroponics system, we modified plant architecture and spacing and determined the effects on fruit yield and harvest index at two light levels. Single-cluster plants pruned to allow two leaves above the cluster had 25% higher fruit yields than did plants pruned directly above the cluster; this was due to an increase in fruit weight, not fruit number. Both fruit yield and harvest index were greater for all single-cluster plants at the higher light level because of increases in both fruit weight and fruit number. Fruit yield for two-cluster plants was 30% to 40% higher than for single-cluster plants, and there was little difference in the dates or length of the harvest period. Fruit yield for three-cluster plants was not significantly different from that of two-cluster plants; moreover, the harvest period was delayed by 5 days. Plant density (5.5, 7.4, 9.2 plants/m2) affected fruit yield/plant, but not fruit yield/unit area. Given the higher costs for materials and labor associated with higher plant densities, a two-cluster crop at 5.5 plants/m2 with two leaves above the cluster was the best of the production system strategies tested.
Clustering of GPS velocities in the Mojave Block, southeastern California
Savage, James C.; Simpson, Robert W.
2013-01-01
We find subdivisions within the Mojave Block using cluster analysis to identify groupings in the velocities observed at GPS stations there. The clusters are represented on a fault map by symbols located at the positions of the GPS stations, each symbol representing the cluster to which the velocity of that GPS station belongs. Fault systems that separate the clusters are readily identified on such a map. The most significant representation as judged by the gap test involves 4 clusters within the Mojave Block. The fault systems bounding the clusters from east to west are 1) the faults defining the eastern boundary of the Northeast Mojave Domain extended southward to connect to the Hector Mine rupture, 2) the Calico-Paradise fault system, 3) the Landers-Blackwater fault system, and 4) the Helendale-Lockhart fault system. This division of the Mojave Block is very similar to that proposed by Meade and Hager. However, no cluster boundary coincides with the Garlock Fault, the northern boundary of the Mojave Block. Rather, the clusters appear to continue without interruption from the Mojave Block north into the southern Walker Lane Belt, similar to the continuity across the Garlock Fault of the shear zone along the Blackwater-Little Lake fault system observed by Peltzer et al. Mapped traces of individual faults in the Mojave Block terminate within the block and do not continue across the Garlock Fault [Dokka and Travis, ].
Plant adaptations to severely phosphorus-impoverished soils.
Lambers, Hans; Martinoia, Enrico; Renton, Michael
2015-06-01
Mycorrhizas play a pivotal role in phosphorus (P) acquisition of plant roots, by enhancing the soil volume that can be explored. Non-mycorrhizal plant species typically occur either in relatively fertile soil or on soil with a very low P availability, where there is insufficient P in the soil solution for mycorrhizal hyphae to be effective. Soils with a very low P availability are either old and severely weathered or relatively young with high concentrations of oxides and hydroxides of aluminium and iron that sorb P. In such soils, cluster roots and other specialised roots that release P-mobilising carboxylates are more effective than mycorrhizas. Cluster roots are ephemeral structures that release carboxylates in an exudative burst. The carboxylates mobilise sparingly-available sources of soil P. The relative investment of biomass in cluster roots and the amount of carboxylates that are released during the exudative burst differ between species on severely weathered soils with a low total P concentration and species on young soils with high total P concentrations but low P availability. Taking a modelling approach, we explore how the optimal cluster-root strategy depends on soil characteristics, thus offering insights for plant breeders interested in developing crop plants with optimal cluster-root strategies. Copyright © 2015 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Pathak, Arup Kumar
2014-06-01
Analytical expressions have been derived for the vertical detachment energy (VDE) for hydrated sulphate (SO2 -4) and oxalate (C2O2 -4) dianions that can be used to calculate the same over a wide range of cluster sizes including the bulk from the knowledge of VDE for a finite number of stable clusters. The calculated bulk detachment energies are found to be very good in agreement (within 5%) with the available experimental results for both the systems. It is observed that two or more water molecules will be essential for the stability of sulphate and oxalate dianions against spontaneous electron loss and this is consistent with the experiment. We have, for the first time, provided a scheme to calculate the radius of the solvent berg for sulphate and oxalate dianions. The calculated conductivity values for the sulphate and oxalate dianions using Stokes-Einstein relation and the radius of solvent berg are found to be very good in agreement (within 4%) with the available experimental results.
MOCASSIN-prot: a multi-objective clustering approach for protein similarity networks.
Keel, Brittney N; Deng, Bo; Moriyama, Etsuko N
2018-04-15
Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary history of proteins is hence best modeled through networks that incorporate information both from the sequence divergence and the domain content. Here, a game-theoretic approach proposed for protein network construction is adapted into the framework of multi-objective optimization, and extended to incorporate clustering refinement procedure. The new method, MOCASSIN-prot, was applied to cluster multi-domain proteins from ten genomes. The performance of MOCASSIN-prot was compared against two protein clustering methods, Markov clustering (TRIBE-MCL) and spectral clustering (SCPS). We showed that compared to these two methods, MOCASSIN-prot, which uses both domain composition and quantitative sequence similarity information, generates fewer false positives. It achieves more functionally coherent protein clusters and better differentiates protein families. MOCASSIN-prot, implemented in Perl and Matlab, is freely available at http://bioinfolab.unl.edu/emlab/MOCASSINprot. emoriyama2@unl.edu. Supplementary data are available at Bioinformatics online.
The Mass Function of Abell Clusters
NASA Astrophysics Data System (ADS)
Chen, J.; Huchra, J. P.; McNamara, B. R.; Mader, J.
1998-12-01
The velocity dispersion and mass functions for rich clusters of galaxies provide important constraints on models of the formation of Large-Scale Structure (e.g., Frenk et al. 1990). However, prior estimates of the velocity dispersion or mass function for galaxy clusters have been based on either very small samples of clusters (Bahcall and Cen 1993; Zabludoff et al. 1994) or large but incomplete samples (e.g., the Girardi et al. (1998) determination from a sample of clusters with more than 30 measured galaxy redshifts). In contrast, we approach the problem by constructing a volume-limited sample of Abell clusters. We collected individual galaxy redshifts for our sample from two major galaxy velocity databases, the NASA Extragalactic Database, NED, maintained at IPAC, and ZCAT, maintained at SAO. We assembled a database with velocity information for possible cluster members and then selected cluster members based on both spatial and velocity data. Cluster velocity dispersions and masses were calculated following the procedures of Danese, De Zotti, and di Tullio (1980) and Heisler, Tremaine, and Bahcall (1985), respectively. The final velocity dispersion and mass functions were analyzed in order to constrain cosmological parameters by comparison to the results of N-body simulations. Our data for the cluster sample as a whole and for the individual clusters (spatial maps and velocity histograms) in our sample is available on-line at http://cfa-www.harvard.edu/ huchra/clusters. This website will be updated as more data becomes available in the master redshift compilations, and will be expanded to include more clusters and large groups of galaxies.
Presentation on systems cluster research
NASA Technical Reports Server (NTRS)
Morgenthaler, George W.
1989-01-01
This viewgraph presentation presents an overview of systems cluster research performed by the Center for Space Construction. The goals of the research are to develop concepts, insights, and models for space construction and to develop systems engineering/analysis curricula for training future aerospace engineers. The following topics are covered: CSC systems analysis/systems engineering (SIMCON) model, CSC systems cluster schedule, system life-cycle, model optimization techniques, publications, cooperative efforts, and sponsored research.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Toloba, Elisa; Guhathakurta, Puragra; Li, Biao
2016-05-01
We analyze the kinematics of six Virgo cluster dwarf early-type galaxies (dEs) from their globular cluster (GC) systems. We present new Keck/DEIMOS spectroscopy for three of them and re-analyze the data found in the literature for the remaining three. We use two independent methods to estimate the rotation amplitude ( V {sub rot}) and velocity dispersion ( σ {sub GC}) of the GC systems and evaluate their statistical significance by simulating non-rotating GC systems with the same number of GC satellites and velocity uncertainties. Our measured kinematics agree with the published values for the three galaxies from the literature and,more » in all cases, some rotation is measured. However, our simulations show that the null hypothesis of being non-rotating GC systems cannot be ruled out. In the case of VCC 1861, the measured V {sub rot} and the simulations indicate that it is not rotating. In the case of VCC 1528, the null hypothesis can be marginally ruled out, and thus it might be rotating although further confirmation is needed. In our analysis, we find that, in general, the measured V {sub rot} tends to be overestimated and the measured σ {sub GC} tends to be underestimated by amounts that depend on the intrinsic V {sub rot}/ σ {sub GC}, the number of observed GCs ( N {sub GC}), and the velocity uncertainties. The bias is negligible when N {sub GC} ≳ 20. In those cases where a large N {sub GC} is not available, it is imperative to obtain data with small velocity uncertainties. For instance, errors of ≤2 km s{sup −1} lead to V {sub rot} < 10 km s{sup −1} for a system that is intrinsically not rotating.« less
Integral field spectroscopy with GEMINI: Extragalactic star cluster in NGC1275
NASA Astrophysics Data System (ADS)
Trancho, Gelys; Miller, Bryan; García-Lorenzo, Begoña; Sánchez, Sebastián F.
2006-01-01
Studies of globular cluster systems play a critical role in our understanding of galaxy formation. Imaging with the Hubble Space Telescope has revealed that young star clusters are formed copiously in galaxy mergers, strengthening theories in which giant elliptical galaxies are formed by the merger of spirals [e.g. Whitmore, B.C., Schweizer, F., Leitherer, C., Borne, K., Robert, C., 1993. Astronomical Journal. 106, 1354; Miller, B.W., Whitmore, B.C., Schweizer, F., Fall, S.M., 1997. Astronomical Journal. 114, 2381; Zepf, S.E., Ashman, K.M., English, J., Freeman, K.C., Sharples, R.M., 1999. Astronomical Journal. 118, 752; Ashman, K.M., Zepf, S.E., 1992. Astrophysical Journal. 384, 50]. However, the formation and evolution of globular cluster systems is still not well understood. Ages and metallicities of the clusters are uncertain either because of degeneracy in the broad-band colors or due to variable reddening. Also, the luminosity function of the young clusters, which depends critically on the metallicities and ages of the clusters, appears to be single power-laws while the luminosity function of old clusters has a well-defined break. Either there is significant dynamical evolution of the cluster systems or metallicity affects the mass function of forming clusters. Spectroscopy of these clusters are needed to improve the metallicity and age measurements and to study the kinematics of young cluster systems. Therefore, we have obtained GMOS IFU data of 4 clusters in NGC1275. We will present preliminary results like metallicities, ages, and velocities of the star clusters from IFU spectroscopy.
Chiara, Matteo; Horner, David S; Spada, Alberto
2013-01-01
De novo transcriptome characterization from Next Generation Sequencing data has become an important approach in the study of non-model plants. Despite notable advances in the assembly of short reads, the clustering of transcripts into unigene-like (locus-specific) clusters remains a somewhat neglected subject. Indeed, closely related paralogous transcripts are often merged into single clusters by current approaches. Here, a novel heuristic method for locus-specific clustering is compared to that implemented in the de novo assembler Oases, using the same initial transcript collections, derived from Arabidopsis thaliana and the developmental model Streptocarpus rexii. We show that the proposed approach improves cluster specificity in the A. thaliana dataset for which the reference genome is available. Furthermore, for the S. rexii data our filtered transcript collection matches a larger number of distinct annotated loci in reference genomes than the Oases set, while containing a reduced overall number of loci. A detailed discussion of advantages and limitations of our approach in processing de novo transcriptome reconstructions is presented. The proposed method should be widely applicable to other organisms, irrespective of the transcript assembly method employed. The S. rexii transcriptome is available as a sophisticated and augmented publicly available online database.
Bible, Joe; Beck, James D.; Datta, Somnath
2016-01-01
Summary Ignorance of the mechanisms responsible for the availability of information presents an unusual problem for analysts. It is often the case that the availability of information is dependent on the outcome. In the analysis of cluster data we say that a condition for informative cluster size (ICS) exists when the inference drawn from analysis of hypothetical balanced data varies from that of inference drawn on observed data. Much work has been done in order to address the analysis of clustered data with informative cluster size; examples include Inverse Probability Weighting (IPW), Cluster Weighted Generalized Estimating Equations (CWGEE), and Doubly Weighted Generalized Estimating Equations (DWGEE). When cluster size changes with time, i.e., the data set possess temporally varying cluster sizes (TVCS), these methods may produce biased inference for the underlying marginal distribution of interest. We propose a new marginalization that may be appropriate for addressing clustered longitudinal data with TVCS. The principal motivation for our present work is to analyze the periodontal data collected by Beck et al. (1997, Journal of Periodontal Research 6, 497–505). Longitudinal periodontal data often exhibits both ICS and TVCS as the number of teeth possessed by participants at the onset of study is not constant and teeth as well as individuals may be displaced throughout the study. PMID:26682911
DOE Office of Scientific and Technical Information (OSTI.GOV)
Juarez, Aaron J.; Stassun, Keivan G.; Cargile, Phillip A.
2014-11-10
The lithium depletion boundary (LDB) is a robust method for accurately determining the ages of young clusters, but most pre-main-sequence models used to derive LDB ages do not include the effects of magnetic activity on stellar properties. In light of this, we present results from our spectroscopic study of the very-low-mass members of the southern open cluster Blanco 1 using the Gemini-North Telescope, program IDs: GN-2009B-Q-53 and GN-2010B-Q-96. We obtained Gemini Multi-Object Spectrograph spectra at intermediate resolution for cluster candidate members with I ≈ 13-20 mag. From our sample of 43 spectra, we find 14 probable cluster members by consideringmore » proximity to the cluster sequence in an I/I – K {sub s} color-magnitude diagram, agreement with the cluster's systemic radial velocity, and magnetic activity as a youth indicator. We systematically analyze the Hα and Li features and update the LDB age of Blanco 1 to be 126{sub −14}{sup +13} Myr. Our new LDB age for Blanco 1 shows remarkable coevality with the benchmark Pleiades open cluster. Using available empirical activity corrections, we investigate the effects of magnetic activity on the LDB age of Blanco 1. Accounting for activity, we infer a corrected LDB age of 114{sub −10}{sup +9} Myr. This work demonstrates the importance of accounting for magnetic activity on LDB inferred stellar ages, suggesting the need to reinvestigate previous LDB age determinations.« less
Holden, Richard J; Kulanthaivel, Anand; Purkayastha, Saptarshi; Goggins, Kathryn M; Kripalani, Sunil
2017-12-01
Personas are a canonical user-centered design method increasingly used in health informatics research. Personas-empirically-derived user archetypes-can be used by eHealth designers to gain a robust understanding of their target end users such as patients. To develop biopsychosocial personas of older patients with heart failure using quantitative analysis of survey data. Data were collected using standardized surveys and medical record abstraction from 32 older adults with heart failure recently hospitalized for acute heart failure exacerbation. Hierarchical cluster analysis was performed on a final dataset of n=30. Nonparametric analyses were used to identify differences between clusters on 30 clustering variables and seven outcome variables. Six clusters were produced, ranging in size from two to eight patients per cluster. Clusters differed significantly on these biopsychosocial domains and subdomains: demographics (age, sex); medical status (comorbid diabetes); functional status (exhaustion, household work ability, hygiene care ability, physical ability); psychological status (depression, health literacy, numeracy); technology (Internet availability); healthcare system (visit by home healthcare, trust in providers); social context (informal caregiver support, cohabitation, marital status); and economic context (employment status). Tabular and narrative persona descriptions provide an easy reference guide for informatics designers. Personas development using approaches such as clustering of structured survey data is an important tool for health informatics professionals. We describe insights from our study of patients with heart failure, then recommend a generic ten-step personas development process. Methods strengths and limitations of the study and of personas development generally are discussed. Copyright © 2017 Elsevier B.V. All rights reserved.
Scientific Cluster Deployment and Recovery - Using puppet to simplify cluster management
NASA Astrophysics Data System (ADS)
Hendrix, Val; Benjamin, Doug; Yao, Yushu
2012-12-01
Deployment, maintenance and recovery of a scientific cluster, which has complex, specialized services, can be a time consuming task requiring the assistance of Linux system administrators, network engineers as well as domain experts. Universities and small institutions that have a part-time FTE with limited time for and knowledge of the administration of such clusters can be strained by such maintenance tasks. This current work is the result of an effort to maintain a data analysis cluster (DAC) with minimal effort by a local system administrator. The realized benefit is the scientist, who is the local system administrator, is able to focus on the data analysis instead of the intricacies of managing a cluster. Our work provides a cluster deployment and recovery process (CDRP) based on the puppet configuration engine allowing a part-time FTE to easily deploy and recover entire clusters with minimal effort. Puppet is a configuration management system (CMS) used widely in computing centers for the automatic management of resources. Domain experts use Puppet's declarative language to define reusable modules for service configuration and deployment. Our CDRP has three actors: domain experts, a cluster designer and a cluster manager. The domain experts first write the puppet modules for the cluster services. A cluster designer would then define a cluster. This includes the creation of cluster roles, mapping the services to those roles and determining the relationships between the services. Finally, a cluster manager would acquire the resources (machines, networking), enter the cluster input parameters (hostnames, IP addresses) and automatically generate deployment scripts used by puppet to configure it to act as a designated role. In the event of a machine failure, the originally generated deployment scripts along with puppet can be used to easily reconfigure a new machine. The cluster definition produced in our CDRP is an integral part of automating cluster deployment in a cloud environment. Our future cloud efforts will further build on this work.
Autonomic Cluster Management System (ACMS): A Demonstration of Autonomic Principles at Work
NASA Technical Reports Server (NTRS)
Baldassari, James D.; Kopec, Christopher L.; Leshay, Eric S.; Truszkowski, Walt; Finkel, David
2005-01-01
Cluster computing, whereby a large number of simple processors or nodes are combined together to apparently function as a single powerful computer, has emerged as a research area in its own right. The approach offers a relatively inexpensive means of achieving significant computational capabilities for high-performance computing applications, while simultaneously affording the ability to. increase that capability simply by adding more (inexpensive) processors. However, the task of manually managing and con.guring a cluster quickly becomes impossible as the cluster grows in size. Autonomic computing is a relatively new approach to managing complex systems that can potentially solve many of the problems inherent in cluster management. We describe the development of a prototype Automatic Cluster Management System (ACMS) that exploits autonomic properties in automating cluster management.
NASA Astrophysics Data System (ADS)
Martinez Aviles, G.; Johnston-Hollitt, M.; Ferrari, C.; Venturi, T.; Democles, J.; Dallacasa, D.; Cassano, R.; Brunetti, G.; Giacintucci, S.; Pratt, G. W.; Arnaud, M.; Aghanim, N.; Brown, S.; Douspis, M.; Hurier, J.; Intema, H. T.; Langer, M.; Macario, G.; Pointecouteau, E.
2018-04-01
Aim. A fraction of galaxy clusters host diffuse radio sources whose origins are investigated through multi-wavelength studies of cluster samples. We investigate the presence of diffuse radio emission in a sample of seven galaxy clusters in the largely unexplored intermediate redshift range (0.3 < z < 0.44). Methods: In search of diffuse emission, deep radio imaging of the clusters are presented from wide band (1.1-3.1 GHz), full resolution ( 5 arcsec) observations with the Australia Telescope Compact Array (ATCA). The visibilities were also imaged at lower resolution after point source modelling and subtraction and after a taper was applied to achieve better sensitivity to low surface brightness diffuse radio emission. In case of non-detection of diffuse sources, we set upper limits for the radio power of injected diffuse radio sources in the field of our observations. Furthermore, we discuss the dynamical state of the observed clusters based on an X-ray morphological analysis with XMM-Newton. Results: We detect a giant radio halo in PSZ2 G284.97-23.69 (z = 0.39) and a possible diffuse source in the nearly relaxed cluster PSZ2 G262.73-40.92 (z = 0.421). Our sample contains three highly disturbed massive clusters without clear traces of diffuse emission at the observed frequencies. We were able to inject modelled radio haloes with low values of total flux density to set upper detection limits; however, with our high-frequency observations we cannot exclude the presence of RH in these systems because of the sensitivity of our observations in combination with the high z of the observed clusters. The reduced images are only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (http://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/611/A94
Document Clustering Approach for Meta Search Engine
NASA Astrophysics Data System (ADS)
Kumar, Naresh, Dr.
2017-08-01
The size of WWW is growing exponentially with ever change in technology. This results in huge amount of information with long list of URLs. Manually it is not possible to visit each page individually. So, if the page ranking algorithms are used properly then user search space can be restricted up to some pages of searched results. But available literatures show that no single search system can provide qualitative results from all the domains. This paper provides solution to this problem by introducing a new meta search engine that determine the relevancy of query corresponding to web page and cluster the results accordingly. The proposed approach reduces the user efforts, improves the quality of results and performance of the meta search engine.
Development of Metal Cluster-Based Energetic Materials at NSWC-IHD
2011-01-01
reactivity of NixAly + clusters with nitromethane was investigated using a gas-phase molecular beam system. Results indicate that nitromethane is highly...clusters make up the subunit of a molecular metal-based energetic material. The reactivity of NixAly+ clusters with nitromethane was investigated using...a gas-phase molecular beam system. Results indicate that nitromethane is highly reactive toward the NixAly+ clusters and suggests it would not make
Kubas, Adam; Noak, Johannes
2017-01-01
Absorption and multiwavelength resonance Raman spectroscopy are widely used to investigate the electronic structure of transition metal centers in coordination compounds and extended solid systems. In combination with computational methodologies that have predictive accuracy, they define powerful protocols to study the spectroscopic response of catalytic materials. In this work, we study the absorption and resonance Raman spectra of the M1 MoVOx catalyst. The spectra were calculated by time-dependent density functional theory (TD-DFT) in conjunction with the independent mode displaced harmonic oscillator model (IMDHO), which allows for detailed bandshape predictions. For this purpose cluster models with up to 9 Mo and V metallic centers are considered to represent the bulk structure of MoVOx. Capping hydrogens were used to achieve valence saturation at the edges of the cluster models. The construction of model structures was based on a thorough bonding analysis which involved conventional DFT and local coupled cluster (DLPNO-CCSD(T)) methods. Furthermore the relationship of cluster topology to the computed spectral features is discussed in detail. It is shown that due to the local nature of the involved electronic transitions, band assignment protocols developed for molecular systems can be applied to describe the calculated spectral features of the cluster models as well. The present study serves as a reference for future applications of combined experimental and computational protocols in the field of solid-state heterogeneous catalysis. PMID:28989667
NASA Astrophysics Data System (ADS)
Stutz, Amelia M.
2018-02-01
We characterize the stellar and gas volume density, potential, and gravitational field profiles in the central ∼0.5 pc of the Orion Nebula Cluster (ONC), the nearest embedded star cluster (or rather, protocluster) hosting massive star formation available for detailed observational scrutiny. We find that the stellar volume density is well characterized by a Plummer profile ρstars(r) = 5755 M⊙ pc- 3 (1 + (r/a)2)- 5/2, where a = 0.36 pc. The gas density follows a cylindrical power law ρgas(R) = 25.9 M⊙ pc- 3 (R/pc)- 1.775. The stellar density profile dominates over the gas density profile inside r ∼ 1 pc. The gravitational field is gas-dominated at all radii, but the contribution to the total field by the stars is nearly equal to that of the gas at r ∼ a. This fact alone demonstrates that the protocluster cannot be considered a gas-free system or a virialized system dominated by its own gravity. The stellar protocluster core is dynamically young, with an age of ∼2-3 Myr, a 1D velocity dispersion of σobs = 2.6 km s-1, and a crossing time of ∼0.55 Myr. This time-scale is almost identical to the gas filament oscillation time-scale estimated recently by Stutz & Gould. This provides strong evidence that the protocluster structure is regulated by the gas filament. The protocluster structure may be set by tidal forces due to the oscillating filamentary gas potential. Such forces could naturally suppress low density stellar structures on scales ≳ a. The analysis presented here leads to a new suggestion that clusters form by an analogue of the 'slingshot mechanism' previously proposed for stars.
Katz, S.; Noth, J.; Shafaat, H. S.; Happe, T.; Hildebrandt, P.
2016-01-01
[FeFe] hydrogenases are biocatalytic model systems for the exploitation and investigation of catalytic hydrogen evolution. Here, we used vibrational spectroscopic techniques to characterize, in detail, redox transformations of the [FeFe] and [4Fe4S] sub-sites of the catalytic centre (H-cluster) in a monomeric [FeFe] hydrogenase. Through the application of low-temperature resonance Raman spectroscopy, we discovered a novel metastable intermediate that is characterized by an oxidized [FeIFeII] centre and a reduced [4Fe4S]1+ cluster. Based on this unusual configuration, this species is assigned to the first, deprotonated H-cluster intermediate of the [FeFe] hydrogenase catalytic cycle. Providing insights into the sequence of initial reaction steps, the identification of this species represents a key finding towards the mechanistic understanding of biological hydrogen evolution. PMID:28451119
Observation of correlated electronic decay in expanding clusters triggered by near-infrared fields
Schütte, B.; Arbeiter, M.; Fennel, T.; Jabbari, G.; Kuleff, A.I.; Vrakking, M.J.J.; Rouzée, A.
2015-01-01
When an excited atom is embedded into an environment, novel relaxation pathways can emerge that are absent for isolated atoms. A well-known example is interatomic Coulombic decay, where an excited atom relaxes by transferring its excess energy to another atom in the environment, leading to its ionization. Such processes have been observed in clusters ionized by extreme-ultraviolet and X-ray lasers. Here, we report on a correlated electronic decay process that occurs following nanoplasma formation and Rydberg atom generation in the ionization of clusters by intense, non-resonant infrared laser fields. Relaxation of the Rydberg states and transfer of the available electronic energy to adjacent electrons in Rydberg states or quasifree electrons in the expanding nanoplasma leaves a distinct signature in the electron kinetic energy spectrum. These so far unobserved electron-correlation-driven energy transfer processes may play a significant role in the response of any nano-scale system to intense laser light. PMID:26469997
MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters.
Gonzalez-Dominguez, Jorge; Martin, Maria J
2017-10-10
In this work we present MPIGeneNet, a parallel tool that applies Pearson's correlation and Random Matrix Theory to construct gene co-expression networks. It is based on the state-of-the-art sequential tool RMTGeneNet, which provides networks with high robustness and sensitivity at the expenses of relatively long runtimes for large scale input datasets. MPIGeneNet returns the same results as RMTGeneNet but improves the memory management, reduces the I/O cost, and accelerates the two most computationally demanding steps of co-expression network construction by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on two different systems using three typical input datasets shows that MPIGeneNet is significantly faster than RMTGeneNet. As an example, our tool is up to 175.41 times faster on a cluster with eight nodes, each one containing two 12-core Intel Haswell processors. Source code of MPIGeneNet, as well as a reference manual, are available at https://sourceforge.net/projects/mpigenenet/.
Lipper, Colin H; Paddock, Mark L; Onuchic, José N; Mittler, Ron; Nechushtai, Rachel; Jennings, Patricia A
2015-01-01
Iron-sulfur cluster biogenesis is executed by distinct protein assembly systems. Mammals have two systems, the mitochondrial Fe-S cluster assembly system (ISC) and the cytosolic assembly system (CIA), that are connected by an unknown mechanism. The human members of the NEET family of 2Fe-2S proteins, nutrient-deprivation autophagy factor-1 (NAF-1) and mitoNEET (mNT), are located at the interface between the mitochondria and the cytosol. These proteins have been implicated in cancer cell proliferation, and they can transfer their 2Fe-2S clusters to a standard apo-acceptor protein. Here we report the first physiological 2Fe-2S cluster acceptor for both NEET proteins as human Anamorsin (also known as cytokine induced apoptosis inhibitor-1; CIAPIN-1). Anamorsin is an electron transfer protein containing two iron-sulfur cluster-binding sites that is required for cytosolic Fe-S cluster assembly. We show, using UV-Vis spectroscopy, that both NAF-1 and mNT can transfer their 2Fe-2S clusters to apo-Anamorsin with second order rate constants similar to those of other known human 2Fe-2S transfer proteins. A direct protein-protein interaction of the NEET proteins with apo-Anamorsin was detected using biolayer interferometry. Furthermore, electrospray mass spectrometry of holo-Anamorsin prepared by cluster transfer shows that it receives both of its 2Fe-2S clusters from the NEETs. We propose that mNT and NAF-1 can provide parallel routes connecting the mitochondrial ISC system and the CIA. 2Fe-2S clusters assembled in the mitochondria are received by NEET proteins and when needed transferred to Anamorsin, activating the CIA.
A web portal for hydrodynamical, cosmological simulations
NASA Astrophysics Data System (ADS)
Ragagnin, A.; Dolag, K.; Biffi, V.; Cadolle Bel, M.; Hammer, N. J.; Krukau, A.; Petkova, M.; Steinborn, D.
2017-07-01
This article describes a data centre hosting a web portal for accessing and sharing the output of large, cosmological, hydro-dynamical simulations with a broad scientific community. It also allows users to receive related scientific data products by directly processing the raw simulation data on a remote computing cluster. The data centre has a multi-layer structure: a web portal, a job control layer, a computing cluster and a HPC storage system. The outer layer enables users to choose an object from the simulations. Objects can be selected by visually inspecting 2D maps of the simulation data, by performing highly compounded and elaborated queries or graphically by plotting arbitrary combinations of properties. The user can run analysis tools on a chosen object. These services allow users to run analysis tools on the raw simulation data. The job control layer is responsible for handling and performing the analysis jobs, which are executed on a computing cluster. The innermost layer is formed by a HPC storage system which hosts the large, raw simulation data. The following services are available for the users: (I) CLUSTERINSPECT visualizes properties of member galaxies of a selected galaxy cluster; (II) SIMCUT returns the raw data of a sub-volume around a selected object from a simulation, containing all the original, hydro-dynamical quantities; (III) SMAC creates idealized 2D maps of various, physical quantities and observables of a selected object; (IV) PHOX generates virtual X-ray observations with specifications of various current and upcoming instruments.
Biogenesis of [Fe-S] cluster in Firmicutes: an unexploited field of investigation.
Riboldi, Gustavo Pelicioli; de Mattos, Eduardo Preusser; Frazzon, Jeverson
2013-09-01
Iron-sulfur clusters (ISC) ([Fe-S]) are evolutionarily ancient and ubiquitous inorganic prosthetic groups present in almost all living organisms, whose biosynthetic assembly is dependent on complex protein machineries. [Fe-S] clusters are involved in biologically important processes, ranging from electron transfer catalysis to transcriptional regulatory roles. Three different systems involved in [Fe-S] cluster assembly have already been characterized in Proteobacteria, namely, the nitrogen fixation system, the ISC system and the sulfur assimilation system. Although they are well described in various microorganisms, these machineries are poorly characterized in members of the Firmicutes phylum, to which several groups of pathogenic bacteria belong. Recently, several research groups have made efforts to elucidate the biogenesis of [Fe-S] clusters at the molecular level in Firmicutes, and many important characteristics have been described. Considering the pivotal role of [Fe-S] clusters in a number of biological processes, the review presented here focuses on the description of the biosynthetic machineries for [Fe-S] cluster biogenesis in prokaryotes, followed by a discussion on recent results observed for Firmicutes [Fe-S] cluster assembly.
Artim-Esen, Bahar; Çene, Erhan; Şahinkaya, Yasemin; Ertan, Semra; Pehlivan, Özlem; Kamali, Sevil; Gül, Ahmet; Öcal, Lale; Aral, Orhan; Inanç, Murat
2014-07-01
Associations between autoantibodies and clinical features have been described in systemic lupus erythematosus (SLE). Herein, we aimed to define autoantibody clusters and their clinical correlations in a large cohort of patients with SLE. We analyzed 852 patients with SLE who attended our clinic. Seven autoantibodies were selected for cluster analysis: anti-DNA, anti-Sm, anti-RNP, anticardiolipin (aCL) immunoglobulin (Ig)G or IgM, lupus anticoagulant (LAC), anti-Ro, and anti-La. Two-step clustering and Kaplan-Meier survival analyses were used. Five clusters were identified. A cluster consisted of patients with only anti-dsDNA antibodies, a cluster of anti-Sm and anti-RNP, a cluster of aCL IgG/M and LAC, and a cluster of anti-Ro and anti-La antibodies. Analysis revealed 1 more cluster that consisted of patients who did not belong to any of the clusters formed by antibodies chosen for cluster analysis. Sm/RNP cluster had significantly higher incidence of pulmonary hypertension and Raynaud phenomenon. DsDNA cluster had the highest incidence of renal involvement. In the aCL/LAC cluster, there were significantly more patients with neuropsychiatric involvement, antiphospholipid syndrome, autoimmune hemolytic anemia, and thrombocytopenia. According to the Systemic Lupus International Collaborating Clinics damage index, the highest frequency of damage was in the aCL/LAC cluster. Comparison of 10 and 20 years survival showed reduced survival in the aCL/LAC cluster. This study supports the existence of autoantibody clusters with distinct clinical features in SLE and shows that forming clinical subsets according to autoantibody clusters may be useful in predicting the outcome of the disease. Autoantibody clusters in SLE may exhibit differences according to the clinical setting or population.
Clustering of GPS velocities in the Mojave Block, southeastern California
NASA Astrophysics Data System (ADS)
Savage, J. C.; Simpson, R. W.
2013-04-01
find subdivisions within the Mojave Block using cluster analysis to identify groupings in the velocities observed at GPS stations there. The clusters are represented on a fault map by symbols located at the positions of the GPS stations, each symbol representing the cluster to which the velocity of that GPS station belongs. Fault systems that separate the clusters are readily identified on such a map. The most significant representation as judged by the gap test involves 4 clusters within the Mojave Block. The fault systems bounding the clusters from east to west are 1) the faults defining the eastern boundary of the Northeast Mojave Domain extended southward to connect to the Hector Mine rupture, 2) the Calico-Paradise fault system, 3) the Landers-Blackwater fault system, and 4) the Helendale-Lockhart fault system. This division of the Mojave Block is very similar to that proposed by Meade and Hager []. However, no cluster boundary coincides with the Garlock Fault, the northern boundary of the Mojave Block. Rather, the clusters appear to continue without interruption from the Mojave Block north into the southern Walker Lane Belt, similar to the continuity across the Garlock Fault of the shear zone along the Blackwater-Little Lake fault system observed by Peltzer et al. []. Mapped traces of individual faults in the Mojave Block terminate within the block and do not continue across the Garlock Fault [Dokka and Travis, ].
The bacterial species definition in the genomic era
Konstantinidis, Konstantinos T; Ramette, Alban; Tiedje, James M
2006-01-01
The bacterial species definition, despite its eminent practical significance for identification, diagnosis, quarantine and diversity surveys, remains a very difficult issue to advance. Genomics now offers novel insights into intra-species diversity and the potential for emergence of a more soundly based system. Although we share the excitement, we argue that it is premature for a universal change to the definition because current knowledge is based on too few phylogenetic groups and too few samples of natural populations. Our analysis of five important bacterial groups suggests, however, that more stringent standards for species may be justifiable when a solid understanding of gene content and ecological distinctiveness becomes available. Our analysis also reveals what is actually encompassed in a species according to the current standards, in terms of whole-genome sequence and gene-content diversity, and shows that this does not correspond to coherent clusters for the environmental Burkholderia and Shewanella genera examined. In contrast, the obligatory pathogens, which have a very restricted ecological niche, do exhibit clusters. Therefore, the idea of biologically meaningful clusters of diversity that applies to most eukaryotes may not be universally applicable in the microbial world, or if such clusters exist, they may be found at different levels of distinction. PMID:17062412
OCCAM: a flexible, multi-purpose and extendable HPC cluster
NASA Astrophysics Data System (ADS)
Aldinucci, M.; Bagnasco, S.; Lusso, S.; Pasteris, P.; Rabellino, S.; Vallero, S.
2017-10-01
The Open Computing Cluster for Advanced data Manipulation (OCCAM) is a multipurpose flexible HPC cluster designed and operated by a collaboration between the University of Torino and the Sezione di Torino of the Istituto Nazionale di Fisica Nucleare. It is aimed at providing a flexible, reconfigurable and extendable infrastructure to cater to a wide range of different scientific computing use cases, including ones from solid-state chemistry, high-energy physics, computer science, big data analytics, computational biology, genomics and many others. Furthermore, it will serve as a platform for R&D activities on computational technologies themselves, with topics ranging from GPU acceleration to Cloud Computing technologies. A heterogeneous and reconfigurable system like this poses a number of challenges related to the frequency at which heterogeneous hardware resources might change their availability and shareability status, which in turn affect methods and means to allocate, manage, optimize, bill, monitor VMs, containers, virtual farms, jobs, interactive bare-metal sessions, etc. This work describes some of the use cases that prompted the design and construction of the HPC cluster, its architecture and resource provisioning model, along with a first characterization of its performance by some synthetic benchmark tools and a few realistic use-case tests.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Parisi, M. C.; Clariá, J. J.; Marcionni, N.
2015-05-15
We obtained spectra of red giants in 15 Small Magellanic Cloud (SMC) clusters in the region of the Ca ii lines with FORS2 on the Very Large Telescope. We determined the mean metallicity and radial velocity with mean errors of 0.05 dex and 2.6 km s{sup −1}, respectively, from a mean of 6.5 members per cluster. One cluster (B113) was too young for a reliable metallicity determination and was excluded from the sample. We combined the sample studied here with 15 clusters previously studied by us using the same technique, and with 7 clusters whose metallicities determined by other authorsmore » are on a scale similar to ours. This compilation of 36 clusters is the largest SMC cluster sample currently available with accurate and homogeneously determined metallicities. We found a high probability that the metallicity distribution is bimodal, with potential peaks at −1.1 and −0.8 dex. Our data show no strong evidence of a metallicity gradient in the SMC clusters, somewhat at odds with recent evidence from Ca ii triplet spectra of a large sample of field stars. This may be revealing possible differences in the chemical history of clusters and field stars. Our clusters show a significant dispersion of metallicities, whatever age is considered, which could be reflecting the lack of a unique age–metallicity relation in this galaxy. None of the chemical evolution models currently available in the literature satisfactorily represents the global chemical enrichment processes of SMC clusters.« less
Cosmic Pressure Fronts Mapped by Chandra
NASA Astrophysics Data System (ADS)
2000-03-01
A colossal cosmic "weather system" produced by the collision of two giant clusters of galaxies has been imaged by NASA's Chandra X-ray Observatory. For the first time, the pressure fronts in the system can be traced in detail, and they show a bright, but relatively cool 50 million degree Celsius central region embedded in large elongated cloud of 70 million degree Celsius gas, all of which is roiling in a faint "atmosphere"of 100 million degree Celsius gas. "We can compare this to an intergalactic cold front," said Maxim Markevitch of the Harvard-Smithsonian Center for Astrophysics, Cambridge, Mass. and leader of the international team involved in the analysis of the observations. "A major difference is that in this case, cold means 70 million degree Celsius." The gas clouds are in the core of a galaxy cluster known as Abell 2142. The cluster is six million light years across and contains hundreds of galaxies and enough gas to make a thousand more. It is one of the most massive objects in the universe. Galaxy clusters grow to vast sizes as smaller clusters are pulled inward under the influence of gravity. They collide and merge over the course of billions of years, releasing tremendous amounts of energy that heats the cluster gas to 100 million degrees Celsius. The Chandra data provides the first detailed look at the late stages of this merger process. Previously, scientists had used the German-US Roentgensatellite to produce a broad brush picture of the cluster. The elongated shape of the bright cloud suggested that two clouds were in the process of coalescing into one, but the details remained unclear. Chandra is able to measure variations of temperature, density, and pressure with unprecedented resolution. "Now we can begin to understand the physics of these mergers, which are among the most energetic events in the universe," said Markevitch. "The pressure and density maps of the cluster show a sharp boundary that can only exist in the moving environment of a merger." With this information scientists can make a comparison with computer simulations of cosmic mergers. This comparison, which is in the early stages, shows that this merger has progressed to an advanced stage. Strong shock waves predicted by the theory for the initial collision of clusters are not observed. It appears likely that these sub-clusters have collided two or three times in a billion years or more, and have nearly completed their merger. The observations were made on August 20, 1999 using the Advanced CCD Imaging Spectrometer (ACIS). The team involved scientists from Harvard-Smithsonian; the Massachusetts Institute of Technology, Cambridge; NASA's Marshall Space Flight Center, Huntsville, Ala.; the University of Hawaii, Honolulu; the University of Birmingham, U.K.; the University of Wollongong, Australia; the Space Research Organization Netherlands; the University of Rome, Italy; and the Russian Academy of Sciences. The results will be published in an upcoming issue of the Astrophysical Journal. The ACIS instrument was built for NASA by the Massachusetts Institute of Technology, Cambridge, and Pennsylvania State University, University Park. NASA's Marshall Space Flight Center in Huntsville, Ala., manages the Chandra program. TRW, Inc., Redondo Beach, Calif., is the prime contractor for the spacecraft. The Smithsonian's Chandra X-ray Center controls science and flight operations from Cambridge, Mass. For images connected to this release, and to follow Chandra's progress, visit the Chandra site at: http://chandra.harvard.edu/photo/2000/a2142/index.html AND http://chandra.nasa.gov High resolution digital versions of the X-ray image (JPG, 300 dpi TIFF) are available at the Internet sites listed above. This image will be available on NASA Video File which airs at noon, 3:00 p.m., 6:00 p.m., 9:00 p.m. and midnight Eastern Time. NASA Television is available on GE-2, transponder 9C at 85 degrees West longitude, with vertical polarization. Frequency is on 3880.0 megahertz, with audio on 6.8 megahertz.
STAR CLUSTERS IN M33: UPDATED UBVRI PHOTOMETRY, AGES, METALLICITIES, AND MASSES
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fan, Zhou; De Grijs, Richard, E-mail: zfan@bao.ac.cn, E-mail: grijs@pku.edu.cn
2014-04-01
The photometric characterization of M33 star clusters is far from complete. In this paper, we present homogeneous UBVRI photometry of 708 star clusters and cluster candidates in M33 based on archival images from the Local Group Galaxies Survey, which covers 0.8 deg{sup 2} along the galaxy's major axis. Our photometry includes 387, 563, 616, 580, and 478 objects in the UBVRI bands, respectively, of which 276, 405, 430, 457, and 363 do not have previously published UBVRI photometry. Our photometry is consistent with previous measurements (where available) in all filters. We adopted Sloan Digital Sky Survey ugriz photometry for complementarymore » purposes, as well as Two Micron All Sky Survey near-infrared JHK photometry where available. We fitted the spectral-energy distributions of 671 star clusters and candidates to derive their ages, metallicities, and masses based on the updated PARSEC simple stellar populations synthesis models. The results of our χ{sup 2} minimization routines show that only 205 of the 671 clusters (31%) are older than 2 Gyr, which represents a much smaller fraction of the cluster population than that in M31 (56%), suggesting that M33 is dominated by young star clusters (<1 Gyr). We investigate the mass distributions of the star clusters—both open and globular clusters—in M33, M31, the Milky Way, and the Large Magellanic Cloud. Their mean values are log (M {sub cl}/M {sub ☉}) = 4.25, 5.43, 2.72, and 4.18, respectively. The fraction of open to globular clusters is highest in the Milky Way and lowest in M31. Our comparisons of the cluster ages, masses, and metallicities show that our results are basically in agreement with previous studies (where objects in common are available); differences can be traced back to differences in the models adopted, the fitting methods used, and stochastic sampling effects.« less
Eradication of Transboundary Animal Diseases: Can the Rinderpest Success Story be Repeated?
Thomson, G R; Penrith, M-L
2017-04-01
A matrix system was developed to aid in the evaluation of the technical amenability to eradication, through mass vaccination, of transboundary animal diseases (TADs). The system involved evaluation of three basic criteria - disease management efficiency, surveillance and epidemiological factors - each in turn comprised of a number of elements (17 in all). On that basis, 25 TADs that have occurred or do occur in southern Africa and for which vaccines are available, in addition to rinderpest (incorporated as a yardstick because it has been eradicated worldwide), were ranked. Cluster analysis was also applied using the same criteria to the 26 diseases, creating division into three groups. One cluster contained only diseases transmitted by arthropods (e.g. African horse sickness and Rift Valley fever) and considered difficult to eradicate because technologies for managing parasitic arthropods on a large scale are unavailable, while a second cluster contained diseases that have been widely considered to be eradicable [rinderpest, canine rabies, the Eurasian serotypes of foot and mouth disease virus (O, A, C & Asia 1) and peste des petits ruminants] as well classical swine fever, Newcastle disease and lumpy skin disease. The third cluster contained all the other TADs evaluated with the implication that these constitute TADs that would be more difficult to eradicate. However, it is acknowledged that the scores assigned in the course of this study may be biased. The point is that the system proposed offers an objective method for assessment of the technical eradicability of TADs; the rankings and groupings derived during this study are less important than the provision of a systematic approach for further development and evaluation. © 2015 Blackwell Verlag GmbH.
Dymond, Caren C; Field, Robert D; Roswintiarti, Orbita; Guswanto
2005-04-01
Vegetation fires have become an increasing problem in tropical environments as a consequence of socioeconomic pressures and subsequent land-use change. In response, fire management systems are being developed. This study set out to determine the relationships between two aspects of the fire problems in western Indonesia and Malaysia, and two components of the Canadian Forest Fire Weather Index System. The study resulted in a new method for calibrating components of fire danger rating systems based on satellite fire detection (hotspot) data. Once the climate was accounted for, a problematic number of fires were related to high levels of the Fine Fuel Moisture Code. The relationship between climate, Fine Fuel Moisture Code, and hotspot occurrence was used to calibrate Fire Occurrence Potential classes where low accounted for 3% of the fires from 1994 to 2000, moderate accounted for 25%, high 26%, and extreme 38%. Further problems arise when there are large clusters of fires burning that may consume valuable land or produce local smoke pollution. Once the climate was taken into account, the hotspot load (number and size of clusters of hotspots) was related to the Fire Weather Index. The relationship between climate, Fire Weather Index, and hotspot load was used to calibrate Fire Load Potential classes. Low Fire Load Potential conditions (75% of an average year) corresponded with 24% of the hotspot clusters, which had an average size of 30% of the largest cluster. In contrast, extreme Fire Load Potential conditions (1% of an average year) corresponded with 30% of the hotspot clusters, which had an average size of 58% of the maximum. Both Fire Occurrence Potential and Fire Load Potential calibrations were successfully validated with data from 2001. This study showed that when ground measurements are not available, fire statistics derived from satellite fire detection archives can be reliably used for calibration. More importantly, as a result of this work, Malaysia and Indonesia have two new sources of information to initiate fire prevention and suppression activities.
Wee, Bryan A; Woolfit, Megan; Beatson, Scott A; Petty, Nicola K
2013-01-01
Legionella encodes multiple classes of Type IV Secretion Systems (T4SSs), including the Dot/Icm protein secretion system that is essential for intracellular multiplication in amoebal and human hosts. Other T4SSs not essential for virulence are thought to facilitate the acquisition of niche-specific adaptation genes including the numerous effector genes that are a hallmark of this genus. Previously, we identified two novel gene clusters in the draft genome of Legionella pneumophila strain 130b that encode homologues of a subtype of T4SS, the genomic island-associated T4SS (GI-T4SS), usually associated with integrative and conjugative elements (ICE). In this study, we performed genomic analyses of 14 homologous GI-T4SS clusters found in eight publicly available Legionella genomes and show that this cluster is unusually well conserved in a region of high plasticity. Phylogenetic analyses show that Legionella GI-T4SSs are substantially divergent from other members of this subtype of T4SS and represent a novel clade of GI-T4SSs only found in this genus. The GI-T4SS was found to be under purifying selection, suggesting it is functional and may play an important role in the evolution and adaptation of Legionella. Like other GI-T4SSs, the Legionella clusters are also associated with ICEs, but lack the typical integration and replication modules of related ICEs. The absence of complete replication and DNA pre-processing modules, together with the presence of Legionella-specific regulatory elements, suggest the Legionella GI-T4SS-associated ICE is unique and may employ novel mechanisms of regulation, maintenance and excision. The Legionella GI-T4SS cluster was found to be associated with several cargo genes, including numerous antibiotic resistance and virulence factors, which may confer a fitness benefit to the organism. The in-silico characterisation of this new T4SS furthers our understanding of the diversity of secretion systems involved in the frequent horizontal gene transfers that allow Legionella to adapt to and exploit diverse environmental niches.
Wee, Bryan A.; Woolfit, Megan; Beatson, Scott A.; Petty, Nicola K.
2013-01-01
Legionella encodes multiple classes of Type IV Secretion Systems (T4SSs), including the Dot/Icm protein secretion system that is essential for intracellular multiplication in amoebal and human hosts. Other T4SSs not essential for virulence are thought to facilitate the acquisition of niche-specific adaptation genes including the numerous effector genes that are a hallmark of this genus. Previously, we identified two novel gene clusters in the draft genome of Legionella pneumophila strain 130b that encode homologues of a subtype of T4SS, the genomic island-associated T4SS (GI-T4SS), usually associated with integrative and conjugative elements (ICE). In this study, we performed genomic analyses of 14 homologous GI-T4SS clusters found in eight publicly available Legionella genomes and show that this cluster is unusually well conserved in a region of high plasticity. Phylogenetic analyses show that Legionella GI-T4SSs are substantially divergent from other members of this subtype of T4SS and represent a novel clade of GI-T4SSs only found in this genus. The GI-T4SS was found to be under purifying selection, suggesting it is functional and may play an important role in the evolution and adaptation of Legionella. Like other GI-T4SSs, the Legionella clusters are also associated with ICEs, but lack the typical integration and replication modules of related ICEs. The absence of complete replication and DNA pre-processing modules, together with the presence of Legionella-specific regulatory elements, suggest the Legionella GI-T4SS-associated ICE is unique and may employ novel mechanisms of regulation, maintenance and excision. The Legionella GI-T4SS cluster was found to be associated with several cargo genes, including numerous antibiotic resistance and virulence factors, which may confer a fitness benefit to the organism. The in-silico characterisation of this new T4SS furthers our understanding of the diversity of secretion systems involved in the frequent horizontal gene transfers that allow Legionella to adapt to and exploit diverse environmental niches. PMID:24358157
clusterProfiler: an R package for comparing biological themes among gene clusters.
Yu, Guangchuang; Wang, Li-Gen; Han, Yanyan; He, Qing-Yu
2012-05-01
Increasing quantitative data generated from transcriptomics and proteomics require integrative strategies for analysis. Here, we present an R package, clusterProfiler that automates the process of biological-term classification and the enrichment analysis of gene clusters. The analysis module and visualization module were combined into a reusable workflow. Currently, clusterProfiler supports three species, including humans, mice, and yeast. Methods provided in this package can be easily extended to other species and ontologies. The clusterProfiler package is released under Artistic-2.0 License within Bioconductor project. The source code and vignette are freely available at http://bioconductor.org/packages/release/bioc/html/clusterProfiler.html.
Double Cluster Heads Model for Secure and Accurate Data Fusion in Wireless Sensor Networks
Fu, Jun-Song; Liu, Yun
2015-01-01
Secure and accurate data fusion is an important issue in wireless sensor networks (WSNs) and has been extensively researched in the literature. In this paper, by combining clustering techniques, reputation and trust systems, and data fusion algorithms, we propose a novel cluster-based data fusion model called Double Cluster Heads Model (DCHM) for secure and accurate data fusion in WSNs. Different from traditional clustering models in WSNs, two cluster heads are selected after clustering for each cluster based on the reputation and trust system and they perform data fusion independently of each other. Then, the results are sent to the base station where the dissimilarity coefficient is computed. If the dissimilarity coefficient of the two data fusion results exceeds the threshold preset by the users, the cluster heads will be added to blacklist, and the cluster heads must be reelected by the sensor nodes in a cluster. Meanwhile, feedback is sent from the base station to the reputation and trust system, which can help us to identify and delete the compromised sensor nodes in time. Through a series of extensive simulations, we found that the DCHM performed very well in data fusion security and accuracy. PMID:25608211
Scaling of cluster growth for coagulating active particles
NASA Astrophysics Data System (ADS)
Cremer, Peet; Löwen, Hartmut
2014-02-01
Cluster growth in a coagulating system of active particles (such as microswimmers in a solvent) is studied by theory and simulation. In contrast to passive systems, the net velocity of a cluster can have various scalings dependent on the propulsion mechanism and alignment of individual particles. Additionally, the persistence length of the cluster trajectory typically increases with size. As a consequence, a growing cluster collects neighboring particles in a very efficient way and thus amplifies its growth further. This results in unusual large growth exponents for the scaling of the cluster size with time and, for certain conditions, even leads to "explosive" cluster growth where the cluster becomes macroscopic in a finite amount of time.
Xu, Lina; O'Hare, Gregory M P; Collier, Rem
2017-07-05
Wireless Sensor Networks (WSNs) are typically composed of thousands of sensors powered by limited energy resources. Clustering techniques were introduced to prolong network longevity offering the promise of green computing. However, most existing work fails to consider the network coverage when evaluating the lifetime of a network. We believe that balancing the energy consumption in per unit area rather than on each single sensor can provide better-balanced power usage throughout the network. Our former work-Balanced Energy-Efficiency (BEE) and its Multihop version BEEM can not only extend the network longevity, but also maintain the network coverage. Following WSNs, Internet of Things (IoT) technology has been proposed with higher degree of diversities in terms of communication abilities and user scenarios, supporting a large range of real world applications. The IoT devices are embedded with multiple communication interfaces, normally referred as Multiple-In and Multiple-Out (MIMO) in 5G networks. The applications running on those devices can generate various types of data. Every interface has its own characteristics, which may be preferred and beneficial in some specific user scenarios. With MIMO becoming more available on the IoT devices, an advanced clustering solution for highly dynamic IoT systems is missing and also pressingly demanded in order to cater for differing user applications. In this paper, we present a smart clustering algorithm (Smart-BEEM) based on our former work BEE(M) to accomplish energy efficient and Quality of user Experience (QoE) supported communication in cluster based IoT networks. It is a user behaviour and context aware approach, aiming to facilitate IoT devices to choose beneficial communication interfaces and cluster headers for data transmission. Experimental results have proved that Smart-BEEM can further improve the performance of BEE and BEEM for coverage sensitive longevity.
O’Hare, Gregory M. P.; Collier, Rem
2017-01-01
Wireless Sensor Networks (WSNs) are typically composed of thousands of sensors powered by limited energy resources. Clustering techniques were introduced to prolong network longevity offering the promise of green computing. However, most existing work fails to consider the network coverage when evaluating the lifetime of a network. We believe that balancing the energy consumption in per unit area rather than on each single sensor can provide better-balanced power usage throughout the network. Our former work—Balanced Energy-Efficiency (BEE) and its Multihop version BEEM can not only extend the network longevity, but also maintain the network coverage. Following WSNs, Internet of Things (IoT) technology has been proposed with higher degree of diversities in terms of communication abilities and user scenarios, supporting a large range of real world applications. The IoT devices are embedded with multiple communication interfaces, normally referred as Multiple-In and Multiple-Out (MIMO) in 5G networks. The applications running on those devices can generate various types of data. Every interface has its own characteristics, which may be preferred and beneficial in some specific user scenarios. With MIMO becoming more available on the IoT devices, an advanced clustering solution for highly dynamic IoT systems is missing and also pressingly demanded in order to cater for differing user applications. In this paper, we present a smart clustering algorithm (Smart-BEEM) based on our former work BEE(M) to accomplish energy efficient and Quality of user Experience (QoE) supported communication in cluster based IoT networks. It is a user behaviour and context aware approach, aiming to facilitate IoT devices to choose beneficial communication interfaces and cluster headers for data transmission. Experimental results have proved that Smart-BEEM can further improve the performance of BEE and BEEM for coverage sensitive longevity. PMID:28678164
Xing, Jian; Burkom, Howard; Moniz, Linda; Edgerton, James; Leuze, Michael; Tokars, Jerome
2009-01-01
Background The Centers for Disease Control and Prevention's (CDC's) BioSense system provides near-real time situational awareness for public health monitoring through analysis of electronic health data. Determination of anomalous spatial and temporal disease clusters is a crucial part of the daily disease monitoring task. Our study focused on finding useful anomalies at manageable alert rates according to available BioSense data history. Methods The study dataset included more than 3 years of daily counts of military outpatient clinic visits for respiratory and rash syndrome groupings. We applied four spatial estimation methods in implementations of space-time scan statistics cross-checked in Matlab and C. We compared the utility of these methods according to the resultant background cluster rate (a false alarm surrogate) and sensitivity to injected cluster signals. The comparison runs used a spatial resolution based on the facility zip code in the patient record and a finer resolution based on the residence zip code. Results Simple estimation methods that account for day-of-week (DOW) data patterns yielded a clear advantage both in background cluster rate and in signal sensitivity. A 28-day baseline gave the most robust results for this estimation; the preferred baseline is long enough to remove daily fluctuations but short enough to reflect recent disease trends and data representation. Background cluster rates were lower for the rash syndrome counts than for the respiratory counts, likely because of seasonality and the large scale of the respiratory counts. Conclusion The spatial estimation method should be chosen according to characteristics of the selected data streams. In this dataset with strong day-of-week effects, the overall best detection performance was achieved using subregion averages over a 28-day baseline stratified by weekday or weekend/holiday behavior. Changing the estimation method for particular scenarios involving different spatial resolution or other syndromes can yield further improvement. PMID:19615075
Hadjithomas, Michalis; Chen, I-Min Amy; Chu, Ken; Ratner, Anna; Palaniappan, Krishna; Szeto, Ernest; Huang, Jinghua; Reddy, T B K; Cimermančič, Peter; Fischbach, Michael A; Ivanova, Natalia N; Markowitz, Victor M; Kyrpides, Nikos C; Pati, Amrita
2015-07-14
In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of "big" genomic data for discovering small molecules. IMG-ABC relies on IMG's comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve as the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC's focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in Alphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG's extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to expand, with the goal of becoming an essential component of any bioinformatic exploration of the secondary metabolism world. Copyright © 2015 Hadjithomas et al.
Hsu, Arthur L; Tang, Sen-Lin; Halgamuge, Saman K
2003-11-01
Current Self-Organizing Maps (SOMs) approaches to gene expression pattern clustering require the user to predefine the number of clusters likely to be expected. Hierarchical clustering methods used in this area do not provide unique partitioning of data. We describe an unsupervised dynamic hierarchical self-organizing approach, which suggests an appropriate number of clusters, to perform class discovery and marker gene identification in microarray data. In the process of class discovery, the proposed algorithm identifies corresponding sets of predictor genes that best distinguish one class from other classes. The approach integrates merits of hierarchical clustering with robustness against noise known from self-organizing approaches. The proposed algorithm applied to DNA microarray data sets of two types of cancers has demonstrated its ability to produce the most suitable number of clusters. Further, the corresponding marker genes identified through the unsupervised algorithm also have a strong biological relationship to the specific cancer class. The algorithm tested on leukemia microarray data, which contains three leukemia types, was able to determine three major and one minor cluster. Prediction models built for the four clusters indicate that the prediction strength for the smaller cluster is generally low, therefore labelled as uncertain cluster. Further analysis shows that the uncertain cluster can be subdivided further, and the subdivisions are related to two of the original clusters. Another test performed using colon cancer microarray data has automatically derived two clusters, which is consistent with the number of classes in data (cancerous and normal). JAVA software of dynamic SOM tree algorithm is available upon request for academic use. A comparison of rectangular and hexagonal topologies for GSOM is available from http://www.mame.mu.oz.au/mechatronics/journalinfo/Hsu2003supp.pdf
Lattice animals in diffusion limited binary colloidal system
NASA Astrophysics Data System (ADS)
Shireen, Zakiya; Babu, Sujin B.
2017-08-01
In a soft matter system, controlling the structure of the amorphous materials has been a key challenge. In this work, we have modeled irreversible diffusion limited cluster aggregation of binary colloids, which serves as a model for chemical gels. Irreversible aggregation of binary colloidal particles leads to the formation of a percolating cluster of one species or both species which are also called bigels. Before the formation of the percolating cluster, the system forms a self-similar structure defined by a fractal dimension. For a one component system when the volume fraction is very small, the clusters are far apart from each other and the system has a fractal dimension of 1.8. Contrary to this, we will show that for the binary system, we observe the presence of lattice animals which has a fractal dimension of 2 irrespective of the volume fraction. When the clusters start inter-penetrating, we observe a fractal dimension of 2.5, which is the same as in the case of the one component system. We were also able to predict the formation of bigels using a simple inequality relation. We have also shown that the growth of clusters follows the kinetic equations introduced by Smoluchowski for diffusion limited cluster aggregation. We will also show that the chemical distance of a cluster in the flocculation regime will follow the same scaling law as predicted for the lattice animals. Further, we will also show that irreversible binary aggregation comes under the universality class of the percolation theory.
The second virial coefficient of system ((nitrogen-water))
NASA Astrophysics Data System (ADS)
Podmurnaya, O. A.
2004-01-01
The virial coefficient data of various components of atmosphere are interesting because permit to evaluate a deviation from ideal gas model. These data may be useful while investigating the clusters generation and determination their contribution in absorption. The second cross virial coefficient Baw for system ((nitrogen water)) has been calculated form +9°C to +50°C using the last experimental data about water vapor mole fraction. The reliability of this coefficient has been tested by analysing of errors sources and by comparing the results with other available experimental data.
Hydration of a Large Anionic Charge Distribution - Naphthalene-Water Cluster Anions
NASA Astrophysics Data System (ADS)
Weber, J. Mathias; Adams, Christopher L.
2010-06-01
We report the infrared spectra of anionic clusters of naphthalene with up to three water molecules. Comparison of the experimental infrared spectra with theoretically predicted spectra from quantum chemistry calculations allow conclusions regarding the structures of the clusters under study. The first water molecule forms two hydrogen bonds with the π electron system of the naphthalene moiety. Subsequent water ligands interact with both the naphthalene and the other water ligands to form hydrogen bonded networks, similar to other hydrated anion clusters. Naphthalene-water anion clusters illustrate how water interacts with negative charge delocalized over a large π electron system. The clusters are interesting model systems that are discussed in the context of wetting of graphene surfaces and polyaromatic hydrocarbons.
Dynamical evolution of globular-cluster systems in clusters of galaxies
DOE Office of Scientific and Technical Information (OSTI.GOV)
Muzzio, J.C.
1987-04-01
The dynamical processes that affect globular-cluster systems in clusters of galaxies are analyzed. Two-body and impulsive approximations are utilized to study dynamical friction, drag force, tidal stripping, tidal radii, globular-cluster swapping, tidal accretion, and galactic cannibalism. The evolution of galaxies and the collision of galaxies are simulated numerically; the steps involved in the simulation are described. The simulated data are compared with observations. Consideration is given to the number of galaxies, halo extension, location of the galaxies, distribution of the missing mass, nonequilibrium initial conditions, mass dependence, massive central galaxies, globular-cluster distribution, and lost globular clusters. 116 references.
Austin, Peter C
2010-04-22
Multilevel logistic regression models are increasingly being used to analyze clustered data in medical, public health, epidemiological, and educational research. Procedures for estimating the parameters of such models are available in many statistical software packages. There is currently little evidence on the minimum number of clusters necessary to reliably fit multilevel regression models. We conducted a Monte Carlo study to compare the performance of different statistical software procedures for estimating multilevel logistic regression models when the number of clusters was low. We examined procedures available in BUGS, HLM, R, SAS, and Stata. We found that there were qualitative differences in the performance of different software procedures for estimating multilevel logistic models when the number of clusters was low. Among the likelihood-based procedures, estimation methods based on adaptive Gauss-Hermite approximations to the likelihood (glmer in R and xtlogit in Stata) or adaptive Gaussian quadrature (Proc NLMIXED in SAS) tended to have superior performance for estimating variance components when the number of clusters was small, compared to software procedures based on penalized quasi-likelihood. However, only Bayesian estimation with BUGS allowed for accurate estimation of variance components when there were fewer than 10 clusters. For all statistical software procedures, estimation of variance components tended to be poor when there were only five subjects per cluster, regardless of the number of clusters.
Artificial gravity Mars spaceship
NASA Technical Reports Server (NTRS)
Clark, Benton C.
1989-01-01
Experience gained in the study of artificial gravity for a manned trip to Mars is reviewed, and a snowflake-configured interplanetary vehicle cluster of habitat modules, descent vehicles, and propulsion systems is presented. An evolutionary design is described which permits sequential upgrading from five to nine crew members, an increase of landers from one to as many a three per mission, and an orderly, phased incorporation of advanced technologies as they become available.
Chen, Yi-Bu; Chattopadhyay, Ansuman; Bergen, Phillip; Gadd, Cynthia; Tannery, Nancy
2007-01-01
To bridge the gap between the rising information needs of biological and medical researchers and the rapidly growing number of online bioinformatics resources, we have created the Online Bioinformatics Resources Collection (OBRC) at the Health Sciences Library System (HSLS) at the University of Pittsburgh. The OBRC, containing 1542 major online bioinformatics databases and software tools, was constructed using the HSLS content management system built on the Zope Web application server. To enhance the output of search results, we further implemented the Vivísimo Clustering Engine, which automatically organizes the search results into categories created dynamically based on the textual information of the retrieved records. As the largest online collection of its kind and the only one with advanced search results clustering, OBRC is aimed at becoming a one-stop guided information gateway to the major bioinformatics databases and software tools on the Web. OBRC is available at the University of Pittsburgh's HSLS Web site (http://www.hsls.pitt.edu/guides/genetics/obrc).
Analysis of ground-motion simulation big data
NASA Astrophysics Data System (ADS)
Maeda, T.; Fujiwara, H.
2016-12-01
We developed a parallel distributed processing system which applies a big data analysis to the large-scale ground motion simulation data. The system uses ground-motion index values and earthquake scenario parameters as input. We used peak ground velocity value and velocity response spectra as the ground-motion index. The ground-motion index values are calculated from our simulation data. We used simulated long-period ground motion waveforms at about 80,000 meshes calculated by a three dimensional finite difference method based on 369 earthquake scenarios of a great earthquake in the Nankai Trough. These scenarios were constructed by considering the uncertainty of source model parameters such as source area, rupture starting point, asperity location, rupture velocity, fmax and slip function. We used these parameters as the earthquake scenario parameter. The system firstly carries out the clustering of the earthquake scenario in each mesh by the k-means method. The number of clusters is determined in advance using a hierarchical clustering by the Ward's method. The scenario clustering results are converted to the 1-D feature vector. The dimension of the feature vector is the number of scenario combination. If two scenarios belong to the same cluster the component of the feature vector is 1, and otherwise the component is 0. The feature vector shows a `response' of mesh to the assumed earthquake scenario group. Next, the system performs the clustering of the mesh by k-means method using the feature vector of each mesh previously obtained. Here the number of clusters is arbitrarily given. The clustering of scenarios and meshes are performed by parallel distributed processing with Hadoop and Spark, respectively. In this study, we divided the meshes into 20 clusters. The meshes in each cluster are geometrically concentrated. Thus this system can extract regions, in which the meshes have similar `response', as clusters. For each cluster, it is possible to determine particular scenario parameters which characterize the cluster. In other word, by utilizing this system, we can obtain critical scenario parameters of the ground-motion simulation for each evaluation point objectively. This research was supported by CREST, JST.
VizieR Online Data Catalog: XCS-DR1 Cluster Catalogue (Mehrtens+, 2012)
NASA Astrophysics Data System (ADS)
Mehrtens, N.; Romer, A. K.; Hilton, M.; Lloyd-Davies, E. J.; Miller, C. J.; Stanford, S. A.; Hosmer, M.; Hoyle, B.; Collins, C. A.; Liddle, A. R.; Viana, P. T. P.; Nichol, R. C.; Stott, J. P.; Dubois, E. N.; Kay, S. T.; Sahlen, M.; Young, O.; Short, C. J.; Christodoulou, L.; Watson, W. A.; Davidson, M.; Harrison, C. D.; Baruah, L.; Smith, M.; Burke, C.; Mayers, J. A.; Deadman, P.-J.; Rooney, P. J.; Edmondson, E. M.; West, M.; Campbell, H. C.; Edge, A. C.; Mann, R. G.; Sabirli, K.; Wake, D.; Benoist, C.; da Costa, L.; Maia, M. A. G.; Ogando, R.
2013-04-01
The XMM Cluster Survey (XCS) is a serendipitous search for galaxy clusters using all publicly available data in the XMM-Newton Science Archive. Its main aims are to measure cosmological parameters and trace the evolution of X-ray scaling relations. In this paper we present the first data release from the XMM Cluster Survey (XCS-DR1). This consists of 503 optically confirmed, serendipitously detected, X-ray clusters. Of these clusters, 256 are new to the literature and 357 are new X-ray discoveries. We present 463 clusters with a redshift estimate (0.06
Onda, Kyle; Crocker, Jonny; Kayser, Georgia Lyn; Bartram, Jamie
2013-01-01
The fields of global health and international development commonly cluster countries by geography and income to target resources and describe progress. For any given sector of interest, a range of relevant indicators can serve as a more appropriate basis for classification. We create a new typology of country clusters specific to the water and sanitation (WatSan) sector based on similarities across multiple WatSan-related indicators. After a literature review and consultation with experts in the WatSan sector, nine indicators were selected. Indicator selection was based on relevance to and suggested influence on national water and sanitation service delivery, and to maximize data availability across as many countries as possible. A hierarchical clustering method and a gap statistic analysis were used to group countries into a natural number of relevant clusters. Two stages of clustering resulted in five clusters, representing 156 countries or 6.75 billion people. The five clusters were not well explained by income or geography, and were unique from existing country clusters used in international development. Analysis of these five clusters revealed that they were more compact and well separated than United Nations and World Bank country clusters. This analysis and resulting country typology suggest that previous geography- or income-based country groupings can be improved upon for applications in the WatSan sector by utilizing globally available WatSan-related indicators. Potential applications include guiding and discussing research, informing policy, improving resource targeting, describing sector progress, and identifying critical knowledge gaps in the WatSan sector. PMID:24054545
Nepusz, Tamás; Sasidharan, Rajkumar; Paccanaro, Alberto
2010-03-09
An important problem in genomics is the automatic inference of groups of homologous proteins from pairwise sequence similarities. Several approaches have been proposed for this task which are "local" in the sense that they assign a protein to a cluster based only on the distances between that protein and the other proteins in the set. It was shown recently that global methods such as spectral clustering have better performance on a wide variety of datasets. However, currently available implementations of spectral clustering methods mostly consist of a few loosely coupled Matlab scripts that assume a fair amount of familiarity with Matlab programming and hence they are inaccessible for large parts of the research community. SCPS (Spectral Clustering of Protein Sequences) is an efficient and user-friendly implementation of a spectral method for inferring protein families. The method uses only pairwise sequence similarities, and is therefore practical when only sequence information is available. SCPS was tested on difficult sets of proteins whose relationships were extracted from the SCOP database, and its results were extensively compared with those obtained using other popular protein clustering algorithms such as TribeMCL, hierarchical clustering and connected component analysis. We show that SCPS is able to identify many of the family/superfamily relationships correctly and that the quality of the obtained clusters as indicated by their F-scores is consistently better than all the other methods we compared it with. We also demonstrate the scalability of SCPS by clustering the entire SCOP database (14,183 sequences) and the complete genome of the yeast Saccharomyces cerevisiae (6,690 sequences). Besides the spectral method, SCPS also implements connected component analysis and hierarchical clustering, it integrates TribeMCL, it provides different cluster quality tools, it can extract human-readable protein descriptions using GI numbers from NCBI, it interfaces with external tools such as BLAST and Cytoscape, and it can produce publication-quality graphical representations of the clusters obtained, thus constituting a comprehensive and effective tool for practical research in computational biology. Source code and precompiled executables for Windows, Linux and Mac OS X are freely available at http://www.paccanarolab.org/software/scps.
Scalable cluster administration - Chiba City I approach and lessons learned.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Navarro, J. P.; Evard, R.; Nurmi, D.
2002-07-01
Systems administrators of large clusters often need to perform the same administrative activity hundreds or thousands of times. Often such activities are time-consuming, especially the tasks of installing and maintaining software. By combining network services such as DHCP, TFTP, FTP, HTTP, and NFS with remote hardware control, cluster administrators can automate all administrative tasks. Scalable cluster administration addresses the following challenge: What systems design techniques can cluster builders use to automate cluster administration on very large clusters? We describe the approach used in the Mathematics and Computer Science Division of Argonne National Laboratory on Chiba City I, a 314-node Linuxmore » cluster; and we analyze the scalability, flexibility, and reliability benefits and limitations from that approach.« less
The properties of the disk system of globular clusters
NASA Technical Reports Server (NTRS)
Armandroff, Taft E.
1989-01-01
A large refined data sample is used to study the properties and origin of the disk system of globular clusters. A scale height for the disk cluster system of 800-1500 pc is found which is consistent with scale-height determinations for samples of field stars identified with the Galactic thick disk. A rotational velocity of 193 + or - 29 km/s and a line-of-sight velocity dispersion of 59 + or - 14 km/s have been found for the metal-rich clusters.
Image quality guided approach for adaptive modelling of biometric intra-class variations
NASA Astrophysics Data System (ADS)
Abboud, Ali J.; Jassim, Sabah A.
2010-04-01
The high intra-class variability of acquired biometric data can be attributed to several factors such as quality of acquisition sensor (e.g. thermal), environmental (e.g. lighting), behavioural (e.g. change face pose). Such large fuzziness of biometric data can cause a big difference between an acquired and stored biometric data that will eventually lead to reduced performance. Many systems store multiple templates in order to account for such variations in the biometric data during enrolment stage. The number and typicality of these templates are the most important factors that affect system performance than other factors. In this paper, a novel offline approach is proposed for systematic modelling of intra-class variability and typicality in biometric data by regularly selecting new templates from a set of available biometric images. Our proposed technique is a two stage algorithm whereby in the first stage image samples are clustered in terms of their image quality profile vectors, rather than their biometric feature vectors, and in the second stage a per cluster template is selected from a small number of samples in each clusters to create an ultimate template sets. These experiments have been conducted on five face image databases and their results will demonstrate the effectiveness of proposed quality guided approach.
A single population of red globular clusters around the massive compact galaxy NGC 1277
NASA Astrophysics Data System (ADS)
Beasley, Michael A.; Trujillo, Ignacio; Leaman, Ryan; Montes, Mireia
2018-03-01
Massive galaxies are thought to form in two phases: an initial collapse of gas and giant burst of central star formation, followed by the later accretion of material that builds up their stellar and dark-matter haloes. The systems of globular clusters within such galaxies are believed to form in a similar manner. The initial central burst forms metal-rich (spectrally red) clusters, whereas more metal-poor (spectrally blue) clusters are brought in by the later accretion of less-massive satellites. This formation process is thought to result in the multimodal optical colour distributions that are seen in the globular cluster systems of massive galaxies. Here we report optical observations of the massive relic-galaxy candidate NGC 1277—a nearby, un-evolved example of a high-redshift ‘red nugget’ galaxy. We find that the optical colour distribution of the cluster system of NGC 1277 is unimodal and entirely red. This finding is in strong contrast to other galaxies of similar and larger stellar mass, the cluster systems of which always exhibit (and are generally dominated by) blue clusters. We argue that the colour distribution of the cluster system of NGC 1277 indicates that the galaxy has undergone little (if any) mass accretion after its initial collapse, and use simulations of possible merger histories to show that the stellar mass due to accretion is probably at most ten per cent of the total stellar mass of the galaxy. These results confirm that NGC 1277 is a genuine relic galaxy and demonstrate that blue clusters constitute an accreted population in present-day massive galaxies.
A single population of red globular clusters around the massive compact galaxy NGC 1277.
Beasley, Michael A; Trujillo, Ignacio; Leaman, Ryan; Montes, Mireia
2018-03-22
Massive galaxies are thought to form in two phases: an initial collapse of gas and giant burst of central star formation, followed by the later accretion of material that builds up their stellar and dark-matter haloes. The systems of globular clusters within such galaxies are believed to form in a similar manner. The initial central burst forms metal-rich (spectrally red) clusters, whereas more metal-poor (spectrally blue) clusters are brought in by the later accretion of less-massive satellites. This formation process is thought to result in the multimodal optical colour distributions that are seen in the globular cluster systems of massive galaxies. Here we report optical observations of the massive relic-galaxy candidate NGC 1277-a nearby, un-evolved example of a high-redshift 'red nugget' galaxy. We find that the optical colour distribution of the cluster system of NGC 1277 is unimodal and entirely red. This finding is in strong contrast to other galaxies of similar and larger stellar mass, the cluster systems of which always exhibit (and are generally dominated by) blue clusters. We argue that the colour distribution of the cluster system of NGC 1277 indicates that the galaxy has undergone little (if any) mass accretion after its initial collapse, and use simulations of possible merger histories to show that the stellar mass due to accretion is probably at most ten per cent of the total stellar mass of the galaxy. These results confirm that NGC 1277 is a genuine relic galaxy and demonstrate that blue clusters constitute an accreted population in present-day massive galaxies.
Model validation of simple-graph representations of metabolism
Holme, Petter
2009-01-01
The large-scale properties of chemical reaction systems, such as metabolism, can be studied with graph-based methods. To do this, one needs to reduce the information, lists of chemical reactions, available in databases. Even for the simplest type of graph representation, this reduction can be done in several ways. We investigate different simple network representations by testing how well they encode information about one biologically important network structure—network modularity (the propensity for edges to be clustered into dense groups that are sparsely connected between each other). To achieve this goal, we design a model of reaction systems where network modularity can be controlled and measure how well the reduction to simple graphs captures the modular structure of the model reaction system. We find that the network types that best capture the modular structure of the reaction system are substrate–product networks (where substrates are linked to products of a reaction) and substance networks (with edges between all substances participating in a reaction). Furthermore, we argue that the proposed model for reaction systems with tunable clustering is a general framework for studies of how reaction systems are affected by modularity. To this end, we investigate statistical properties of the model and find, among other things, that it recreates correlations between degree and mass of the molecules. PMID:19158012
X-Ray Morphological Analysis of the Planck ESZ Clusters
NASA Astrophysics Data System (ADS)
Lovisari, Lorenzo; Forman, William R.; Jones, Christine; Ettori, Stefano; Andrade-Santos, Felipe; Arnaud, Monique; Démoclès, Jessica; Pratt, Gabriel W.; Randall, Scott; Kraft, Ralph
2017-09-01
X-ray observations show that galaxy clusters have a very large range of morphologies. The most disturbed systems, which are good to study how clusters form and grow and to test physical models, may potentially complicate cosmological studies because the cluster mass determination becomes more challenging. Thus, we need to understand the cluster properties of our samples to reduce possible biases. This is complicated by the fact that different experiments may detect different cluster populations. For example, Sunyaev-Zeldovich (SZ) selected cluster samples have been found to include a greater fraction of disturbed systems than X-ray selected samples. In this paper we determine eight morphological parameters for the Planck Early Sunyaev-Zeldovich (ESZ) objects observed with XMM-Newton. We found that two parameters, concentration and centroid shift, are the best to distinguish between relaxed and disturbed systems. For each parameter we provide the values that allow selecting the most relaxed or most disturbed objects from a sample. We found that there is no mass dependence on the cluster dynamical state. By comparing our results with what was obtained with REXCESS clusters, we also confirm that the ESZ clusters indeed tend to be more disturbed, as found by previous studies.
X-Ray Morphological Analysis of the Planck ESZ Clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lovisari, Lorenzo; Forman, William R.; Jones, Christine
2017-09-01
X-ray observations show that galaxy clusters have a very large range of morphologies. The most disturbed systems, which are good to study how clusters form and grow and to test physical models, may potentially complicate cosmological studies because the cluster mass determination becomes more challenging. Thus, we need to understand the cluster properties of our samples to reduce possible biases. This is complicated by the fact that different experiments may detect different cluster populations. For example, Sunyaev–Zeldovich (SZ) selected cluster samples have been found to include a greater fraction of disturbed systems than X-ray selected samples. In this paper wemore » determine eight morphological parameters for the Planck Early Sunyaev–Zeldovich (ESZ) objects observed with XMM-Newton . We found that two parameters, concentration and centroid shift, are the best to distinguish between relaxed and disturbed systems. For each parameter we provide the values that allow selecting the most relaxed or most disturbed objects from a sample. We found that there is no mass dependence on the cluster dynamical state. By comparing our results with what was obtained with REXCESS clusters, we also confirm that the ESZ clusters indeed tend to be more disturbed, as found by previous studies.« less
Mo, Yun; Zhang, Zhongzhao; Meng, Weixiao; Ma, Lin; Wang, Yao
2014-01-01
Indoor positioning systems based on the fingerprint method are widely used due to the large number of existing devices with a wide range of coverage. However, extensive positioning regions with a massive fingerprint database may cause high computational complexity and error margins, therefore clustering methods are widely applied as a solution. However, traditional clustering methods in positioning systems can only measure the similarity of the Received Signal Strength without being concerned with the continuity of physical coordinates. Besides, outage of access points could result in asymmetric matching problems which severely affect the fine positioning procedure. To solve these issues, in this paper we propose a positioning system based on the Spatial Division Clustering (SDC) method for clustering the fingerprint dataset subject to physical distance constraints. With the Genetic Algorithm and Support Vector Machine techniques, SDC can achieve higher coarse positioning accuracy than traditional clustering algorithms. In terms of fine localization, based on the Kernel Principal Component Analysis method, the proposed positioning system outperforms its counterparts based on other feature extraction methods in low dimensionality. Apart from balancing online matching computational burden, the new positioning system exhibits advantageous performance on radio map clustering, and also shows better robustness and adaptability in the asymmetric matching problem aspect. PMID:24451470
Alternatives to Multilevel Modeling for the Analysis of Clustered Data
ERIC Educational Resources Information Center
Huang, Francis L.
2016-01-01
Multilevel modeling has grown in use over the years as a way to deal with the nonindependent nature of observations found in clustered data. However, other alternatives to multilevel modeling are available that can account for observations nested within clusters, including the use of Taylor series linearization for variance estimation, the design…
Dispersed or Clustered Housing for Adults with Intellectual Disability: A Systematic Review
ERIC Educational Resources Information Center
Mansell, Jim; Beadle-Brown, Julie
2009-01-01
Background: The purpose of this review was to evaluate the available research on the quality and costs of dispersed community-based housing when compared with clustered housing. Methods: Searches against specified criteria yielded 19 papers based on 10 studies presenting data comparing dispersed housing with some kind of clustered housing (village…
Improving Performance for Gifted Students in a Cluster Grouping Model
ERIC Educational Resources Information Center
Brulles, Dina; Saunders, Rachel; Cohn, Sanford J.
2010-01-01
Although experts in gifted education widely promote cluster grouping gifted students, little empirical evidence is available to attest to its effectiveness. This study is an example of comparative action research in the form of a quantitative case study that focused on the mandated cluster grouping practices for gifted students in an urban…
Optimal colour quality of LED clusters based on memory colours.
Smet, Kevin; Ryckaert, Wouter R; Pointer, Michael R; Deconinck, Geert; Hanselaer, Peter
2011-03-28
The spectral power distributions of tri- and tetrachromatic clusters of Light-Emitting-Diodes, composed of simulated and commercially available LEDs, were optimized with a genetic algorithm to maximize the luminous efficacy of radiation and the colour quality as assessed by the memory colour quality metric developed by the authors. The trade-off of the colour quality as assessed by the memory colour metric and the luminous efficacy of radiation was investigated by calculating the Pareto optimal front using the NSGA-II genetic algorithm. Optimal peak wavelengths and spectral widths of the LEDs were derived, and over half of them were found to be close to Thornton's prime colours. The Pareto optimal fronts of real LED clusters were always found to be smaller than those of the simulated clusters. The effect of binning on designing a real LED cluster was investigated and was found to be quite large. Finally, a real LED cluster of commercially available AlGaInP, InGaN and phosphor white LEDs was optimized to obtain a higher score on memory colour quality scale than its corresponding CIE reference illuminant.
Open star clusters and Galactic structure
NASA Astrophysics Data System (ADS)
Joshi, Yogesh C.
2018-04-01
In order to understand the Galactic structure, we perform a statistical analysis of the distribution of various cluster parameters based on an almost complete sample of Galactic open clusters yet available. The geometrical and physical characteristics of a large number of open clusters given in the MWSC catalogue are used to study the spatial distribution of clusters in the Galaxy and determine the scale height, solar offset, local mass density and distribution of reddening material in the solar neighbourhood. We also explored the mass-radius and mass-age relations in the Galactic open star clusters. We find that the estimated parameters of the Galactic disk are largely influenced by the choice of cluster sample.
Mothe-Diniz Asteroid Dynamical Families V1.0
NASA Astrophysics Data System (ADS)
Mothe-Diniz, T.; Roig, F.; Carvano, J. M.
2006-03-01
This dataset contains an updated compilation of asteroid families and clusters, resulting from the application of the Hierarchical Clustering Method (HCM) on a set of around 120,000 asteroids with available proper elements. Whenever available, the classification in the Bus taxonomy is provided for family members, based on spectra from the SMASS, SMASS2 and S3OS2 spectroscopic surveys.
Formation of black hole x-ray binaries in globular clusters
NASA Astrophysics Data System (ADS)
Kremer, Kyle; Chatterjee, Sourav; Rodriguez, Carl; Rasio, Frederic
2018-01-01
We explore the formation of mass-transferring binary systems containing black holes within globular clusters. We show that it is possible to form mass-transferring binaries with main sequence, giant, and white dwarf companions with a variety of orbital parameters in globular clusters spanning a large range in present-day properties. We show that the presence of mass-transferring black hole systems has little correlation with the total number of black holes within the cluster at any time. In addition to mass-transferring binaries retained within their host clusters at late times, we also examine the black hole and neutron star binaries that are ejected from their host clusters. These ejected systems may contribute to the low-mass x-ray binary population in the galactic field.
Biases on Initial Mass Function Determinations. II. Real Multiple Systems and Chance Superpositions
NASA Astrophysics Data System (ADS)
Maíz Apellániz, J.
2008-04-01
When calculating stellar initial mass functions (IMFs) for young clusters, one has to take into account that (1) most massive stars are born in multiple systems, (2) most IMFs are derived from data that cannot resolve such systems, and (3) multiple chance superpositions between members are expected to happen if the cluster is too distant. In this article I use numerical experiments to model the consequences of those phenomena on the observed color-magnitude diagrams and the IMFs derived from them. Real multiple systems affect the observed or apparent massive-star MF slope little but can create a significant population of apparently ultramassive stars. Chance superpositions produce only small biases when the number of superimposed stars is low but, once a certain number threshold is reached, they can affect both the observed slope and the apparent stellar upper mass limit. I apply these experiments to two well known massive young clusters in the Local Group, NGC 3603 and R136. In both cases I show that the observed population of stars with masses above 120 M⊙ can be explained by the effects of unresolved objects, mostly real multiple systems for NGC 3603 and a combination of real and chance-alignment multiple systems for R136. Therefore, the case for the reality of a stellar upper mass limit at solar or near-solar metallicities is strengthened, with a possible value even lower than 150 M⊙. An IMF slope somewhat flatter than Salpeter or Kroupa with γ between -1.6 and -2.0 is derived for the central region of NGC 3603, with a significant contribution to the uncertainty arising from the imprecise knowledge of the distance to the cluster. The IMF at the very center of R136 cannot be measured with the currently available data but the situation could change with new HST observations. This article is partially based on observations made with the NASA/ESA Hubble Space Telescope (HST), some of them associated with GO program 10602 and the rest gathered from the archive, obtained at the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-26555.
Cost/Performance Ratio Achieved by Using a Commodity-Based Cluster
NASA Technical Reports Server (NTRS)
Lopez, Isaac
2001-01-01
Researchers at the NASA Glenn Research Center acquired a commodity cluster based on Intel Corporation processors to compare its performance with a traditional UNIX cluster in the execution of aeropropulsion applications. Since the cost differential of the clusters was significant, a cost/performance ratio was calculated. After executing a propulsion application on both clusters, the researchers demonstrated a 9.4 cost/performance ratio in favor of the Intel-based cluster. These researchers utilize the Aeroshark cluster as one of the primary testbeds for developing NPSS parallel application codes and system software. The Aero-shark cluster provides 64 Intel Pentium II 400-MHz processors, housed in 32 nodes. Recently, APNASA - a code developed by a Government/industry team for the design and analysis of turbomachinery systems was used for a simulation on Glenn's Aeroshark cluster.
Testing the Archivas Cluster (Arc) for Ozone Monitoring Instrument (OMI) Scientific Data Storage
NASA Technical Reports Server (NTRS)
Tilmes, Curt
2005-01-01
The Ozone Monitoring Instrument (OMI) launched on NASA's Aura Spacecraft, the third of the major platforms of the EOS program on July 15,2004. In addition to the long term archive and distribution of the data from OM1 through the Goddard Earth Science Distributed Active Archive Center (GESDAAC), we are evaluating other archive mechanisms that can archive the data in a more immediately available method where it can be used for futher data production and analysis. In 2004, Archivas, Inc. was selected by NASA s Small Business Innovative Research (SBIR) program for the development of their Archivas Cluster (ArC) product. Arc is an online disk based system utilizing self-management and automation on a Linux cluster. Its goal is to produce a low cost solution coupled with the ease of management. The OM1 project is an application partner of the SBIR program, and has deployed a small cluster (5TB) based on the beta Archwas software. We performed extensive testing of the unit using production OM1 data since launch. In 2005, Archivas, Inc. was funded in SBIR Phase II for further development, which will include testing scalability with the deployment of a larger (35TB) cluster at Goddard. We plan to include Arc in the OM1 Team Leader Computing Facility (TLCF) hosting OM1 data for direct access and analysis by the OMI Science Team. This presentation will include a brief technical description of the Archivas Cluster, a summary of the SBIR Phase I beta testing results, and an overview of the OMI ground data processing architecture including its interaction with the Phase II Archivas Cluster and hosting of OMI data for the scientists.
Solving the scalability issue in quantum-based refinement: Q|R#1.
Zheng, Min; Moriarty, Nigel W; Xu, Yanting; Reimers, Jeffrey R; Afonine, Pavel V; Waller, Mark P
2017-12-01
Accurately refining biomacromolecules using a quantum-chemical method is challenging because the cost of a quantum-chemical calculation scales approximately as n m , where n is the number of atoms and m (≥3) is based on the quantum method of choice. This fundamental problem means that quantum-chemical calculations become intractable when the size of the system requires more computational resources than are available. In the development of the software package called Q|R, this issue is referred to as Q|R#1. A divide-and-conquer approach has been developed that fragments the atomic model into small manageable pieces in order to solve Q|R#1. Firstly, the atomic model of a crystal structure is analyzed to detect noncovalent interactions between residues, and the results of the analysis are represented as an interaction graph. Secondly, a graph-clustering algorithm is used to partition the interaction graph into a set of clusters in such a way as to minimize disruption to the noncovalent interaction network. Thirdly, the environment surrounding each individual cluster is analyzed and any residue that is interacting with a particular cluster is assigned to the buffer region of that particular cluster. A fragment is defined as a cluster plus its buffer region. The gradients for all atoms from each of the fragments are computed, and only the gradients from each cluster are combined to create the total gradients. A quantum-based refinement is carried out using the total gradients as chemical restraints. In order to validate this interaction graph-based fragmentation approach in Q|R, the entire atomic model of an amyloid cross-β spine crystal structure (PDB entry 2oNA) was refined.
The role of FeS clusters for molybdenum cofactor biosynthesis and molybdoenzymes in bacteria
Yokoyama, Kenichi; Leimkühler, Silke
2016-01-01
Molybdenum is the only second row transition metal essential for biological systems, which is biologically available as molybdate ion. In eukarya, bacteria and archaea, molybdenum is bound to either to a tricyclic pyranopterin, thereby forming the molybdenum cofactor (Moco), or in some bacteria to the FeS cluster based iron-molybdenum cofactor (FeMoco), which forms the active site of nitrogenase. To date more than 50 Moco-containing enzymes have been purified and biochemically or structurally characterized. The physiological role of molybdenum in these enzymes is fundamental to organisms, since the reactions include the catalysis of key steps in carbon, nitrogen and sulfur metabolism. The catalyzed reactions are in most cases oxo-transfer reactions or the hydroxylation of carbon centers. The biosynthesis of Moco has been intensively studied, in addition to its insertion into molybdoenzymes. In particular, a link between the biosynthesis and maturation of molybdoenzymes and the biosynthesis and distribution of FeS clusters has been identified in the last years: 1) The synthesis of the first intermediate in Moco biosynthesis requires an FeS-cluster containing protein, 2) The sulfurtransferase for the dithiolene group in Moco is common also for the synthesis of FeS clusters, thiamin and thiolated tRNAs, 3) the modification of the active site with a sulfur atom additionally involves a sulfurtransferase, 4) most molybdoenzymes in bacteria require FeS clusters as additional redox active cofactors. In this review we will focus on the biosynthesis of the molybdenum cofactor in bacteria, its modification and insertion into molybdoenzymes, with an emphasis to its link to FeS cluster biosynthesis and sulfur transfer. PMID:25268953
Roelen, Bernard A J; de Graaff, Wim; Forlani, Sylvie; Deschamps, Jacqueline
2002-11-01
The molecular mechanism underlying the 3' to 5' polarity of induction of mouse Hox genes is still elusive. While relief from a cluster-encompassing repression was shown to lead to all Hoxd genes being expressed like the 3'most of them, Hoxd1 (Kondo and Duboule, 1999), the molecular basis of initial activation of this 3'most gene, is not understood yet. We show that, already before primitive streak formation, prior to initial expression of the first Hox gene, a dramatic transcriptional stimulation of the 3'most genes, Hoxb1 and Hoxb2, is observed upon a short pulse of exogenous retinoic acid (RA), whereas it is not in the case for more 5', cluster-internal, RA-responsive Hoxb genes. In contrast, the RA-responding Hoxb1lacZ transgene that faithfully mimics the endogenous gene (Marshall et al., 1994) did not exhibit the sensitivity of Hoxb1 to precocious activation. We conclude that polarity in initial activation of Hoxb genes reflects a greater availability of 3'Hox genes for transcription, suggesting a pre-existing (susceptibility to) opening of the chromatin structure at the 3' extremity of the cluster. We discuss the data in the context of prevailing models involving differential chromatin opening in the directionality of clustered Hox gene transcription, and regarding the importance of the cluster context for correct timing of initial Hox gene expression.Interestingly, Cdx1 manifested the same early transcriptional availability as Hoxb1. Copyright 2002 Elsevier Science Ireland Ltd.
ERIC Educational Resources Information Center
Bouchet, Francois; Harley, Jason M.; Trevors, Gregory J.; Azevedo, Roger
2013-01-01
In this paper, we present the results obtained using a clustering algorithm (Expectation-Maximization) on data collected from 106 college students learning about the circulatory system with MetaTutor, an agent-based Intelligent Tutoring System (ITS) designed to foster self-regulated learning (SRL). The three extracted clusters were validated and…
The origin of and conditions for clustering in fluids with competing interactions
NASA Astrophysics Data System (ADS)
Jadrich, Ryan; Bollinger, Jonathan; Truskett, Thomas
2015-03-01
Fluids with competing short-range attractions and long-range repulsions exhibit a rich phase behavior characterized by intermediate range order (IRO), as quantified via the static structure factor. This phase behavior includes cluster formation depending upon density-controlled packing effects and the magnitude and range of the attractive and repulsive interactions. Such model systems mimic (to zeroth order) screened, charge-stabilized, aqueous colloidal dispersions of, e.g., proteins. We employ molecular dynamics simulations and integral equation theory to elucidate a more fundamental microscopic explanation for IRO-driven clustering. A simple criterion is identified that indicates when dynamic, amorphous clustering emerges in a polydisperse system, namely when the Ornstein-Zernike thermal correlation length in the system exceeds the repulsive potential tail range. Remarkably, this criterion also appears tightly correlated to crystalline cluster formation in a monodisperse system. Our new gauge is compared to another phenomenological condition for clustering which is when the IRO peak magnitude exceeds ~ 2.7. Ramifications of crystalline versus amorphous clustering are discussed and potential ways of using our new measure in experiment are put forward.
NASA Astrophysics Data System (ADS)
Sitek, M.; Szymański, M. K.; Udalski, A.; Skowron, D. M.; Kostrzewa-Rutkowska, Z.; Skowron, J.; Karczmarek, P.; Cieślar, M.; Wyrzykowski, Ł.; Kozłowski, S.; Pietrukowicz, P.; Soszyński, I.; Mróz, P.; Pawlak, M.; Poleski, R.; Ulaczyk, K.
2017-12-01
The Magellanic System (MS) encompasses the nearest neighbors of the Milky Way, the Large (LMC) and Small (SMC) Magellanic Clouds, and the Magellanic Bridge (MBR). This system contains a diverse sample of star clusters. Their parameters, such as the spatial distribution, chemical composition and age distribution yield important information about the formation scenario of the whole Magellanic System. Using deep photometric maps compiled in the fourth phase of the Optical Gravitational Lensing Experiment (OGLE-IV) we present the most complete catalog of star clusters in the Magellanic System ever constructed from homogeneous, long time-scale photometric data. In this second paper of the series, we show the collection of star clusters found in the area of about 360 square degrees in the MBR and in the outer regions of the SMC. Our sample contains 198 visually identified star cluster candidates, 75 of which were not listed in any of the previously published catalogs. The new discoveries are mainly young small open clusters or clusters similar to associations.
Towards the use of computationally inserted lesions for mammographic CAD assessment
NASA Astrophysics Data System (ADS)
Ghanian, Zahra; Pezeshk, Aria; Petrick, Nicholas; Sahiner, Berkman
2018-03-01
Computer-aided detection (CADe) devices used for breast cancer detection on mammograms are typically first developed and assessed for a specific "original" acquisition system, e.g., a specific image detector. When CADe developers are ready to apply their CADe device to a new mammographic acquisition system, they typically assess the CADe device with images acquired using the new system. Collecting large repositories of clinical images containing verified cancer locations and acquired by the new image acquisition system is costly and time consuming. Our goal is to develop a methodology to reduce the clinical data burden in the assessment of a CADe device for use with a different image acquisition system. We are developing an image blending technique that allows users to seamlessly insert lesions imaged using an original acquisition system into normal images or regions acquired with a new system. In this study, we investigated the insertion of microcalcification clusters imaged using an original acquisition system into normal images acquired with that same system utilizing our previously-developed image blending technique. We first performed a reader study to assess whether experienced observers could distinguish between computationally inserted and native clusters. For this purpose, we applied our insertion technique to clinical cases taken from the University of South Florida Digital Database for Screening Mammography (DDSM) and the Breast Cancer Digital Repository (BCDR). Regions of interest containing microcalcification clusters from one breast of a patient were inserted into the contralateral breast of the same patient. The reader study included 55 native clusters and their 55 inserted counterparts. Analysis of the reader ratings using receiver operating characteristic (ROC) methodology indicated that inserted clusters cannot be reliably distinguished from native clusters (area under the ROC curve, AUC=0.58±0.04). Furthermore, CADe sensitivity was evaluated on mammograms with native and inserted microcalcification clusters using a commercial CADe system. For this purpose, we used full field digital mammograms (FFDMs) from 68 clinical cases, acquired at the University of Michigan Health System. The average sensitivities for native and inserted clusters were equal, 85.3% (58/68). These results demonstrate the feasibility of using the inserted microcalcification clusters for assessing mammographic CAD devices.
NASA Technical Reports Server (NTRS)
Kempler, Steve; Mathews, Tiffany
2016-01-01
The continuum of ever-evolving data management systems affords great opportunities to the enhancement of knowledge and facilitation of science research. To take advantage of these opportunities, it is essential to understand and develop methods that enable data relationships to be examined and the information to be manipulated. This presentation describes the efforts of the Earth Science Information Partners (ESIP) Federation Earth Science Data Analytics (ESDA) Cluster to understand, define, and facilitate the implementation of ESDA to advance science research. As a result of the void of Earth science data analytics publication material, the cluster has defined ESDA along with 10 goals to set the framework for a common understanding of tools and techniques that are available and still needed to support ESDA.
Photometric binary stars in Praesepe and the search for globular cluster binaries
NASA Technical Reports Server (NTRS)
Bolte, Michael
1991-01-01
A radial velocity study of the stars which are located on a second sequence above the single-star zero-age main sequence at a given color in the color-magnitude diagram of the open cluster Praesepe, (NGC 2632) shows that 10, and possibly 11, of 17 are binary systems. Of the binary systems, five have full amplitudes for their velocity variations that are greater than 50 km/s. To the extent that they can be applied to globular clusters, these results suggests that (1) observations of 'second-sequence' stars in globular clusters would be an efficient way of finding main-sequence binary systems in globulars, and (2) current instrumentation on large telescopes is sufficient for establishing unambiguously the existence of main-sequence binary systems in nearby globular clusters.
Toda Systems, Cluster Characters, and Spectral Networks
NASA Astrophysics Data System (ADS)
Williams, Harold
2016-11-01
We show that the Hamiltonians of the open relativistic Toda system are elements of the generic basis of a cluster algebra, and in particular are cluster characters of nonrigid representations of a quiver with potential. Using cluster coordinates defined via spectral networks, we identify the phase space of this system with the wild character variety related to the periodic nonrelativistic Toda system by the wild nonabelian Hodge correspondence. We show that this identification takes the relativistic Toda Hamiltonians to traces of holonomies around a simple closed curve. In particular, this provides nontrivial examples of cluster coordinates on SL n -character varieties for n > 2 where canonical functions associated to simple closed curves can be computed in terms of quivers with potential, extending known results in the SL 2 case.
Gas and galaxies in filaments between clusters of galaxies. The study of A399-A401
NASA Astrophysics Data System (ADS)
Bonjean, V.; Aghanim, N.; Salomé, P.; Douspis, M.; Beelen, A.
2018-01-01
We have performed a multi-wavelength analysis of two galaxy cluster systems selected with the thermal Sunyaev-Zel'dovich (tSZ) effect and composed of cluster pairs and an inter-cluster filament. We have focused on one pair of particular interest: A399-A401 at redshift z 0.073 seperated by 3 Mpc. We have also performed the first analysis of one lower-significance newly associated pair: A21-PSZ2 G114.09-34.34 at z 0.094, separated by 4.2 Mpc. We have characterised the intra-cluster gas using the tSZ signal from Planck and, when possible, the galaxy optical and infrared (IR) properties based on two photometric redshift catalogues: 2MPZ and WISExSCOS. From the tSZ data, we measured the gas pressure in the clusters and in the inter-cluster filaments. In the case of A399-A401, the results are in perfect agreement with previous studies and, using the temperature measured from the X-rays, we further estimate the gas density in the filament and find n0 = (4.3 ± 0.7) × 10-4 cm-3. The optical and IR colour-colour and colour-magnitude analyses of the galaxies selected in the cluster system, together with their star formation rate, show no segregation between galaxy populations, both in the clusters and in the filament of A399-A401. Galaxies are all passive, early type, and red and dead. The gas and galaxy properties of this system suggest that the whole system formed at the same time and corresponds to a pre-merger, with a cosmic filament gas heated by the collapse. For the other cluster system, the tSZ analysis was performed and the pressure in the clusters and in the inter-cluster filament was constrained. However, the limited or nonexistent optical and IR data prevent us from concluding on the presence of an actual cosmic filament or from proposing a scenario.
Probing the History of Galaxy Clusters with Metallicity and Entropy Measurements
NASA Astrophysics Data System (ADS)
Elkholy, Tamer Yohanna
Galaxy clusters are the largest gravitationally bound objects found today in our Universe. The gas they contain, the intra-cluster medium (ICM), is heated to temperatures in the approximate range of 1 to 10 keV, and thus emits X-ray radiation. Studying the ICM through the spatial and spectral analysis of its emission returns the richest information about both the overall cosmological context which governs the formation of clusters, as well as the physical processes occurring within. The aim of this thesis is to learn about the history of the physical processes that drive the evolution of galaxy clusters, through careful, spatially resolved measurements of their metallicity and entropy content. A sample of 45 nearby clusters observed with Chandra is analyzed to produce radial density, temperature, entropy and metallicity profiles. The entropy profiles are computed to larger radial extents than in previous Chandra analyses. The results of this analysis are made available to the scientific community in an electronic database. Comparing metallicity and entropy in the outskirts of clusters, we find no signature on the entropy profiles of the ensemble of supernovae that produced the observed metals. In the centers of clusters, we find that the metallicities of high-mass clusters are much less dispersed than those of low-mass clusters. A comparison of metallicity with the regularity of the X-ray emission morphology suggests that metallicities in low-mass clusters are more susceptible to increase from violent events such as mergers. We also find that the variation in the stellar-to-gas mass ratio as a function of cluster mass can explain the variation of central metallicity with cluster mass, only if we assume that there is a constant level of metallicity for clusters of all masses, above which the observed galaxies add more metals in proportion to their mass. (Copies available exclusively from MIT Libraries, libraries.mit.edu/docs - docs mit.edu)
NASA Astrophysics Data System (ADS)
Wang, Zhao; Yang, Shan; Wang, Shuguang; Shen, Yan
2017-10-01
The assessment of the dynamic urban structure has been affected by lack of timely and accurate spatial information for a long period, which has hindered the measurements of structural continuity at the macroscale. Defense meteorological satellite program's operational linescan system (DMSP/OLS) nighttime light (NTL) data provide an ideal source for urban information detection with a long-time span, short-time interval, and wide coverage. In this study, we extracted the physical boundaries of urban clusters from corrected NTL images and quantitatively analyzed the structure of the urban cluster system based on rank-size distribution, spatial metrics, and Mann-Kendall trend test. Two levels of urban cluster systems in the Yangtze River Delta region (YRDR) were examined. We found that (1) in the entire YRDR, the urban cluster system showed a periodic process, with a significant trend of even distribution before 2007 but an unequal growth pattern after 2007, and (2) at the metropolitan level, vast disparities exist in four metropolitan areas for the fluctuations of Pareto's exponent, the speed of cluster expansion, and the dominance of core cluster. The results suggest that the extracted urban cluster information from NTL data effectively reflect the evolving nature of regional urbanization, which in turn can aid in the planning of cities and help achieve more sustainable regional development.
Photometric properties of stars clusters with young or mixed age stellar populations
NASA Astrophysics Data System (ADS)
Mollá, M.; García-Vargas, M. L.; Martín-Manjón, M. L.
2013-05-01
The main goal of this work is to present and discuss the synthetic photometrical properties of stellar clusters resulting from the PopStar code. Colors in Johnson and SDSS systems, Hα and Hβ luminosities and equivalent widths, and ionizing region size, have been computed for a wide range of metallicities Z = 0.0001, 0.0004, 0.004,0.008,0.02 and 0.05, and ages, from 0.1 Myr to 20 Gyr in Mollá, Garc{í}a-Vargas, & Bressan (2009, MNRAS, 398, 451). Emission lines are shown in Mart{í}n-Manj{ó}n et al. (2010, MNRAS, 403, 2012). Now we calculate colors with the emission lines contribution to the broad band color, so colors include stellar and nebular components, plus the emission lines following the evolution of the cluster and the region geometry in a consistent way. We compare the Single Stellar Populations contaminated and uncontaminated colors (in both Johnson and SDSS systems) and show the importance of emission lines contribution when photometry is used as a tool to characterize stellar populations. With these models we may determine the physical properties of young ionizing clusters when only photometrical observations are available and these correspond to the isolated star forming regions, subtracted the contribution of the underlying population In most cases, however, the ionizing population is usually embedded in a large and complex system, and the observed photometrical properties are the result of the combination of both the young star-forming burst and the host-underlying older population. The second objective of our work is therefore to provide a grid of models for nearby galaxies able to interpret mixed regions where the separation of young and old population is not possible or reliable enough. We obtain a set of PopStar Spectral Energy Distributions (available at PopStar site and also in VO) and derived colors for mixed populations where an underlying host population is combined in different mass ratios with a recent, metal-rich ionizing burst. These colors, together with other photometrical parameters, like Hα radius of the ionized region, and Balmer lines equivalent width and luminosity allow to infer the physical properties of star-forming regions without any spectroscopic information. For details and a complete set of tables and figures see Mollá, García-Vargas, & Martín-Manjón (2012, MNRAS, submitted).
Makarova, Kira S; Sorokin, Alexander V; Novichkov, Pavel S; Wolf, Yuri I; Koonin, Eugene V
2007-11-27
An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on such a classification was implemented in Clusters of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. One of the practical strategies involves construction of refined COGs for phylogenetically compact subsets of genomes. New Archaeal Clusters of Orthologous Genes (arCOGs) were constructed for 41 archaeal genomes (13 Crenarchaeota, 27 Euryarchaeota and one Nanoarchaeon) using an improved procedure that employs a similarity tree between smaller, group-specific clusters, semi-automatically partitions orthology domains in multidomain proteins, and uses profile searches for identification of remote orthologs. The annotation of arCOGs is a consensus between three assignments based on the COGs, the CDD database, and the annotations of homologs in the NR database. The 7538 arCOGs, on average, cover approximately 88% of the genes in a genome compared to a approximately 76% coverage in COGs. The finer granularity of ortholog identification in the arCOGs is apparent from the fact that 4538 arCOGs correspond to 2362 COGs; approximately 40% of the arCOGs are new. The archaeal gene core (protein-coding genes found in all 41 genome) consists of 166 arCOGs. The arCOGs were used to reconstruct gene loss and gene gain events during archaeal evolution and gene sets of ancestral forms. The Last Archaeal Common Ancestor (LACA) is conservatively estimated to possess 996 genes compared to 1245 and 1335 genes for the last common ancestors of Crenarchaeota and Euryarchaeota, respectively. It is inferred that LACA was a chemoautotrophic hyperthermophile that, in addition to the core archaeal functions, encoded more idiosyncratic systems, e.g., the CASS systems of antivirus defense and some toxin-antitoxin systems. The arCOGs provide a convenient, flexible framework for functional annotation of archaeal genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archaeal hyperthermophiles. ArCOGs and related information are available at: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.
Globular cluster chemistry in fast-rotating dwarf stars belonging to intermediate-age open clusters
NASA Astrophysics Data System (ADS)
Pancino, Elena
2018-06-01
The peculiar chemistry observed in multiple populations of Galactic globular clusters is not generally found in other systems such as dwarf galaxies and open clusters, and no model can currently fully explain it. Exploring the boundaries of the multiple-population phenomenon and the variation of its extent in the space of cluster mass, age, metallicity, and compactness has proven to be a fruitful line of investigation. In the framework of a larger project to search for multiple populations in open clusters that is based on literature and survey data, I found peculiar chemical abundance patterns in a sample of intermediate-age open clusters with publicly available data. More specifically, fast-rotating dwarf stars (v sin i ≥ 50 km s-1) that belong to four clusters (Pleiades, Ursa Major, Come Berenices, and Hyades) display a bimodality in either [Na/Fe] or [O/Fe], or both, with the low-Na and high-O peak more populated than the high-Na and low-O peak. Additionally, two clusters show a Na-O anti-correlation in the fast-rotating stars, and one cluster shows a large [Mg/Fe] variation in stars with high [Na/Fe], reaching the extreme Mg depletion observed in NGC 2808. Even considering that the sample sizes are small, these patterns call for attention in the light of a possible connection with the multiple population phenomenon of globular clusters. The specific chemistry observed in these fast-rotating dwarf stars is thought to be produced by a complex interplay of different diffusion and mixing mechanisms, such as rotational mixing and mass loss, which in turn are influenced by metallicity, binarity, mass, age, variability, and so on. However, with the sample in hand, it was not possible to identify which stellar parameters cause the observed Na and O bimodality and Na-O anti-correlation. This suggests that other stellar properties might be important in addition to stellar rotation. Stellar binarity might influence the rotational properties and enhance rotational mixing and mass loss of stars in a dense environment like that of clusters (especially globulars). In conclusion, rotation and binarity appear as a promising research avenue for better understanding multiple stellar populations in globular clusters; this is certainly worth exploring further.
NASA Astrophysics Data System (ADS)
Kato, Takeyoshi; Minagata, Atsushi; Suzuoki, Yasuo
This paper discusses the influence of mass installation of a home co-generation system (H-CGS) using a polymer electrolyte fuel cell (PEFC) on the voltage profile of power distribution system in residential area. The influence of H-CGS is compared with that of photovoltaic power generation systems (PV systems). The operation pattern of H-CGS is assumed based on the electricity and hot-water demand observed in 10 households for a year. The main results are as follows. With the clustered H-CGS, the voltage of each bus is higher by about 1-3% compared with the conventional system without any distributed generators. Because H-CGS tends to increase the output during the early evening, H-CGS contributes to recover the voltage drop during the early evening, resulting in smaller voltage variation of distribution system throughout a day. Because of small rated power output about 1kW, the influence on voltage profile by the clustered H-CGS is smaller than that by the clustered PV systems. The highest voltage during the day time is not so high as compared with the distribution system with the clustered PV systems, even if the reverse power flow from H-CGS is allowed.
Two- and three-cluster decays of light nuclei within a hyperspherical harmonics approach
NASA Astrophysics Data System (ADS)
Vasilevsky, V. S.; Lashko, Yu. A.; Filippov, G. F.
2018-06-01
We consider a set of three-cluster systems (4He, 7Li, 7Be, 8Be, 10Be) within a microscopic model which involves hyperspherical harmonics to represent intercluster motion. We selected three-cluster systems which have at least one binary channel. Our aim is to study whether hyperspherical harmonics are able, and under what conditions, to describe two-body channel(s) (nondemocratic motion) or if they are suitable for describing the three-cluster continuum only (democratic motion). It is demonstrated that a rather restricted number of hyperspherical harmonics allows us to describe bound states and scattering states in the two-body continuum for a three-cluster system.
Construction and application of Red5 cluster based on OpenStack
NASA Astrophysics Data System (ADS)
Wang, Jiaqing; Song, Jianxin
2017-08-01
With the application and development of cloud computing technology in various fields, the resource utilization rate of the data center has been improved obviously, and the system based on cloud computing platform has also improved the expansibility and stability. In the traditional way, Red5 cluster resource utilization is low and the system stability is poor. This paper uses cloud computing to efficiently calculate the resource allocation ability, and builds a Red5 server cluster based on OpenStack. Multimedia applications can be published to the Red5 cloud server cluster. The system achieves the flexible construction of computing resources, but also greatly improves the stability of the cluster and service efficiency.
Ballante, Flavio; Marshall, Garland R
2016-01-25
Molecular docking is a widely used technique in drug design to predict the binding pose of a candidate compound in a defined therapeutic target. Numerous docking protocols are available, each characterized by different search methods and scoring functions, thus providing variable predictive capability on a same ligand-protein system. To validate a docking protocol, it is necessary to determine a priori the ability to reproduce the experimental binding pose (i.e., by determining the docking accuracy (DA)) in order to select the most appropriate docking procedure and thus estimate the rate of success in docking novel compounds. As common docking programs use generally different root-mean-square deviation (RMSD) formulas, scoring functions, and format results, it is both difficult and time-consuming to consistently determine and compare their predictive capabilities in order to identify the best protocol to use for the target of interest and to extrapolate the binding poses (i.e., best-docked (BD), best-cluster (BC), and best-fit (BF) poses) when applying a given docking program over thousands/millions of molecules during virtual screening. To reduce this difficulty, two new procedures called Clusterizer and DockAccessor have been developed and implemented for use with some common and "free-for-academics" programs such as AutoDock4, AutoDock4(Zn), AutoDock Vina, DOCK, MpSDockZn, PLANTS, and Surflex-Dock to automatically extrapolate BD, BC, and BF poses as well as to perform consistent cluster and DA analyses. Clusterizer and DockAccessor (code available over the Internet) represent two novel tools to collect computationally determined poses and detect the most predictive docking approach. Herein an application to human lysine deacetylase (hKDAC) inhibitors is illustrated.
Dynamics of cD clusters of galaxies. II: Analysis of seven Abell clusters
NASA Technical Reports Server (NTRS)
Oegerle, William R.; Hill, John M.
1994-01-01
We have investigated the dynamics of the seven Abell clusters A193, A399, A401, A1795, A1809, A2063, and A2124, based on redshift data reported previously by us (Hill & Oegerle, (1993)). These papers present the initial results of a survey of cD cluster kinematics, with an emphasis on studying the nature of peculiar velocity cD galaxies and their parent clusters. In the current sample, we find no evidence for significant peculiar cD velocities, with respect to the global velocity distribution. However, the cD in A2063 has a significant (3 sigma) peculiar velocity with respect to galaxies in the inner 1.5 Mpc/h, which is likely due to the merger of a subcluster with A2063. We also find significant evidence for subclustering in A1795, and a marginally peculiar cD velocity with respect to galaxies within approximately 200 kpc/h of the cD. The available x-ray, optical, and galaxy redshift data strongly suggest that a subcluster has merged with A1795. We propose that the subclusters which merged with A1795 and A2063 were relatively small, with shallow potential wells, so that the cooling flows in these clusters were not disrupted. Two-body gravitational models of the A399/401 and A2063/MKW3S systems indicate that A399/401 is a bound pair with a total virial mass of approximately 4 x 10(exp 15) solar mass/h, while A2063 and MKW3S are very unlikely to be bound.
Griss, Johannes; Perez-Riverol, Yasset; Lewis, Steve; Tabb, David L.; Dianes, José A.; del-Toro, Noemi; Rurik, Marc; Walzer, Mathias W.; Kohlbacher, Oliver; Hermjakob, Henning; Wang, Rui; Vizcaíno, Juan Antonio
2016-01-01
Mass spectrometry (MS) is the main technology used in proteomics approaches. However, on average 75% of spectra analysed in an MS experiment remain unidentified. We propose to use spectrum clustering at a large-scale to shed a light on these unidentified spectra. PRoteomics IDEntifications database (PRIDE) Archive is one of the largest MS proteomics public data repositories worldwide. By clustering all tandem MS spectra publicly available in PRIDE Archive, coming from hundreds of datasets, we were able to consistently characterize three distinct groups of spectra: 1) incorrectly identified spectra, 2) spectra correctly identified but below the set scoring threshold, and 3) truly unidentified spectra. Using a multitude of complementary analysis approaches, we were able to identify less than 20% of the consistently unidentified spectra. The complete spectrum clustering results are available through the new version of the PRIDE Cluster resource (http://www.ebi.ac.uk/pride/cluster). This resource is intended, among other aims, to encourage and simplify further investigation into these unidentified spectra. PMID:27493588
Griss, Johannes; Perez-Riverol, Yasset; Lewis, Steve; Tabb, David L; Dianes, José A; Del-Toro, Noemi; Rurik, Marc; Walzer, Mathias W; Kohlbacher, Oliver; Hermjakob, Henning; Wang, Rui; Vizcaíno, Juan Antonio
2016-08-01
Mass spectrometry (MS) is the main technology used in proteomics approaches. However, on average 75% of spectra analysed in an MS experiment remain unidentified. We propose to use spectrum clustering at a large-scale to shed a light on these unidentified spectra. PRoteomics IDEntifications database (PRIDE) Archive is one of the largest MS proteomics public data repositories worldwide. By clustering all tandem MS spectra publicly available in PRIDE Archive, coming from hundreds of datasets, we were able to consistently characterize three distinct groups of spectra: 1) incorrectly identified spectra, 2) spectra correctly identified but below the set scoring threshold, and 3) truly unidentified spectra. Using a multitude of complementary analysis approaches, we were able to identify less than 20% of the consistently unidentified spectra. The complete spectrum clustering results are available through the new version of the PRIDE Cluster resource (http://www.ebi.ac.uk/pride/cluster). This resource is intended, among other aims, to encourage and simplify further investigation into these unidentified spectra.
A Computational Cluster for Multiscale Simulations of Ionic Liquids
2008-09-16
AND SUBTITLE DURIP: A Computational Cluster for Multiscale Simulations of Ionic Liquids 5a. CONTRACT NUMBER 5b. GRANT NUMBER FA955007-1-0512 5c...AVAILABILITY STATEMENT ZO\\5oc\\\\%1>^ 13. SUPPLEMENTARY NOTES 14. ABSTRACT The focus of this project was to acquire and use computer cluster nodes...by ANSI Std. Z39.18 Adobe Professional 7.0 Comprehensive Final Report: Gregory A. Voth, PI Contract/Grant Title: DURIP: A Computational Cluster for
Dinamical properties of globular clusters: Primordial or evolutional?
NASA Astrophysics Data System (ADS)
Surdin, V. G.
1995-04-01
Some observable relations between globular cluster parameters appear as a result of dynamical evolution of the cluster system. These relations are inapplicable to the studies of the globular cluster origin
Onda, Kyle; Crocker, Jonny; Kayser, Georgia Lyn; Bartram, Jamie
2014-03-01
The fields of global health and international development commonly cluster countries by geography and income to target resources and describe progress. For any given sector of interest, a range of relevant indicators can serve as a more appropriate basis for classification. We create a new typology of country clusters specific to the water and sanitation (WatSan) sector based on similarities across multiple WatSan-related indicators. After a literature review and consultation with experts in the WatSan sector, nine indicators were selected. Indicator selection was based on relevance to and suggested influence on national water and sanitation service delivery, and to maximize data availability across as many countries as possible. A hierarchical clustering method and a gap statistic analysis were used to group countries into a natural number of relevant clusters. Two stages of clustering resulted in five clusters, representing 156 countries or 6.75 billion people. The five clusters were not well explained by income or geography, and were distinct from existing country clusters used in international development. Analysis of these five clusters revealed that they were more compact and well separated than United Nations and World Bank country clusters. This analysis and resulting country typology suggest that previous geography- or income-based country groupings can be improved upon for applications in the WatSan sector by utilizing globally available WatSan-related indicators. Potential applications include guiding and discussing research, informing policy, improving resource targeting, describing sector progress, and identifying critical knowledge gaps in the WatSan sector. Copyright © 2013 Elsevier GmbH. All rights reserved.
Discovering Massive z > 1 Galaxy Clusters with Spitzer and SPTpol
NASA Astrophysics Data System (ADS)
Bleem, Lindsey; Brodwin, Mark; Ashby, Matthew; Stalder, Brian; Klein, Matthias; Gladders, Michael; Stanford, Spencer; Canning, Rebecca
2018-05-01
We propose to obtain Spitzer/IRAC imaging of 50 high-redshift galaxy cluster candidates derived from two new completed SZ cluster surveys by the South Pole Telescope. Clusters from the deep SPTpol 500-square-deg main survey will extend high-redshift SZ cluster science to lower masses (median M500 2x10^14Msun) while systems drawn from the wider 2500-sq-deg SPTpol Extended Cluster Survey are some of the rarest most massive high-z clusters in the observable universe. The proposed small 10 h program will enable (1) confirmation of these candidates as high-redshift clusters, (2) measurements of the cluster redshifts (sigma_z/(1+z) 0.03), and (3) estimates of the stellar masses of the brightest cluster members. These observations will yield exciting and timely targets for the James Webb Space Telescope--and, combined with lower-z systems--will both extend cluster tests of dark energy to z>1 as well as enable studies of galaxy evolution in the richest environments for a mass-limited cluster sample from 0
Exact combinatorial approach to finite coagulating systems
NASA Astrophysics Data System (ADS)
Fronczak, Agata; Chmiel, Anna; Fronczak, Piotr
2018-02-01
This paper outlines an exact combinatorial approach to finite coagulating systems. In this approach, cluster sizes and time are discrete and the binary aggregation alone governs the time evolution of the systems. By considering the growth histories of all possible clusters, an exact expression is derived for the probability of a coagulating system with an arbitrary kernel being found in a given cluster configuration when monodisperse initial conditions are applied. Then this probability is used to calculate the time-dependent distribution for the number of clusters of a given size, the average number of such clusters, and that average's standard deviation. The correctness of our general expressions is proved based on the (analytical and numerical) results obtained for systems with the constant kernel. In addition, the results obtained are compared with the results arising from the solutions to the mean-field Smoluchowski coagulation equation, indicating its weak points. The paper closes with a brief discussion on the extensibility to other systems of the approach presented herein, emphasizing the issue of arbitrary initial conditions.
Kinematics of the Doped Quantum Vortices in Superfluid Helium Droplets
NASA Astrophysics Data System (ADS)
Bernando, Charles; Vilesov, Andrey F.
2018-05-01
Recent observation of quantum vortices in superfluid 4He droplets measuring a few hundreds of nanometers in diameter involved decoration of vortex cores by clusters containing large numbers of Xe atoms, which served as X-ray contrast agents. Here, we report on the study of the kinematics of the combined vortex-cluster system in a cylinder and in a sphere. Equilibrium states, characterized by total angular momentum, L, were found by minimizing the total energy, E, which sums from the kinetic energy of the liquid due to the vortex and due to orbiting Xe clusters, as well as solvation energy of the cluster in the droplet. Calculations show that, at small mass of the cluster, the equilibrium displacement of the system from the rotation axis is close to that for the bare vortex. However, upon decrease in L beyond certain critical value, which is larger for heavier clusters, the displacement bifurcates toward the surface region, where the motion of the system is governed by the clusters. In addition, at even smaller L, bare orbiting clusters become energetically favorable, opening the possibility for the vortex to detach from the cluster and to annihilate at the droplet's surface.
Multi-agent grid system Agent-GRID with dynamic load balancing of cluster nodes
NASA Astrophysics Data System (ADS)
Satymbekov, M. N.; Pak, I. T.; Naizabayeva, L.; Nurzhanov, Ch. A.
2017-12-01
In this study the work presents the system designed for automated load balancing of the contributor by analysing the load of compute nodes and the subsequent migration of virtual machines from loaded nodes to less loaded ones. This system increases the performance of cluster nodes and helps in the timely processing of data. A grid system balances the work of cluster nodes the relevance of the system is the award of multi-agent balancing for the solution of such problems.
Pyeon, Hye-Rim; Nah, Hee-Ju; Kang, Seung-Hoon; Choi, Si-Sun; Kim, Eung-Soo
2017-05-31
Heterologous expression of biosynthetic gene clusters of natural microbial products has become an essential strategy for titer improvement and pathway engineering of various potentially-valuable natural products. A Streptomyces artificial chromosomal conjugation vector, pSBAC, was previously successfully applied for precise cloning and tandem integration of a large polyketide tautomycetin (TMC) biosynthetic gene cluster (Nah et al. in Microb Cell Fact 14(1):1, 2015), implying that this strategy could be employed to develop a custom overexpression scheme of natural product pathway clusters present in actinomycetes. To validate the pSBAC system as a generally-applicable heterologous overexpression system for a large-sized polyketide biosynthetic gene cluster in Streptomyces, another model polyketide compound, the pikromycin biosynthetic gene cluster, was preciously cloned and heterologously expressed using the pSBAC system. A unique HindIII restriction site was precisely inserted at one of the border regions of the pikromycin biosynthetic gene cluster within the chromosome of Streptomyces venezuelae, followed by site-specific recombination of pSBAC into the flanking region of the pikromycin gene cluster. Unlike the previous cloning process, one HindIII site integration step was skipped through pSBAC modification. pPik001, a pSBAC containing the pikromycin biosynthetic gene cluster, was directly introduced into two heterologous hosts, Streptomyces lividans and Streptomyces coelicolor, resulting in the production of 10-deoxymethynolide, a major pikromycin derivative. When two entire pikromycin biosynthetic gene clusters were tandemly introduced into the S. lividans chromosome, overproduction of 10-deoxymethynolide and the presence of pikromycin, which was previously not detected, were both confirmed. Moreover, comparative qRT-PCR results confirmed that the transcription of pikromycin biosynthetic genes was significantly upregulated in S. lividans containing tandem clusters of pikromycin biosynthetic gene clusters. The 60 kb pikromycin biosynthetic gene cluster was isolated in a single integration pSBAC vector. Introduction of the pikromycin biosynthetic gene cluster into the pikromycin non-producing strains resulted in higher pikromycin production. The utility of the pSBAC system as a precise cloning tool for large-sized biosynthetic gene clusters was verified through heterologous expression of the pikromycin biosynthetic gene cluster. Moreover, this pSBAC-driven heterologous expression strategy was confirmed to be an ideal approach for production of low and inconsistent natural products such as pikromycin in S. venezuelae, implying that this strategy could be employed for development of a custom overexpression scheme of natural product biosynthetic gene clusters in actinomycetes.
New trial wave function for the nuclear cluster structure of nuclei
NASA Astrophysics Data System (ADS)
Zhou, Bo
2018-04-01
A new trial wave function is proposed for nuclear cluster physics, in which an exact solution to the long-standing center-of-mass problem is given. In the new approach, the widths of the single-nucleon Gaussian wave packets and the widths of the relative Gaussian wave functions describing correlations of nucleons or clusters are treated as variables in the explicit intrinsic wave function of the nuclear system. As an example, this new wave function was applied to study the typical {^{20}Ne} (α+{{^{16}}O}) cluster system. By removing exactly the spurious center-of-mass effect in a very simple way, the energy curve of {^{20}Ne} was obtained by variational calculations with the width of the α cluster, the width of the {{^{16}}O} cluster, and the size parameter of the nucleus. These are considered the three crucial variational variables in describing the {^{20}Ne} (α+{{^{16}}O}) cluster system. This shows that the new wave function can be a very interesting new tool for studying many-body and cluster effects in nuclear physics.
Primordial black holes in globular clusters
NASA Technical Reports Server (NTRS)
Sigurdsson, Steinn; Hernquist, Lars
1993-01-01
It has recently been recognized that significant numbers of medium-mass back holes (of order 10 solar masses) should form in globular clusters during the early stages of their evolution. Here we explore the dynamical and observational consequences of the presence of such a primordial black-hole population in a globular cluster. The holes initially segregate to the cluster cores, where they form binary and multiple black-hole systems. The subsequent dynamical evolution of the black-hole population ejects most of the holes on a relatively short timescale: a typical cluster will retain between zero and four black holes in its core, and possibly a few black holes in its halo. The presence of binary, triple, and quadruple black-hole systems in cluster cores will disrupt main-sequence and giant stellar binaries; this may account for the observed anomalies in the distribution of binaries in globular clusters. Furthermore, tidal interactions between a multiple black-hole system and a red giant star can remove much of the red giant's stellar envelope, which may explain the puzzling absence of larger red giants in the cores of some very dense clusters.
Large-Angular-Scale Clustering as a Clue to the Source of UHECRs
NASA Astrophysics Data System (ADS)
Berlind, Andreas A.; Farrar, Glennys R.
We explore what can be learned about the sources of UHECRs from their large-angular-scale clustering (referred to as their "bias" by the cosmology community). Exploiting the clustering on large scales has the advantage over small-scale correlations of being insensitive to uncertainties in source direction from magnetic smearing or measurement error. In a Cold Dark Matter cosmology, the amplitude of large-scale clustering depends on the mass of the system, with more massive systems such as galaxy clusters clustering more strongly than less massive systems such as ordinary galaxies or AGN. Therefore, studying the large-scale clustering of UHECRs can help determine a mass scale for their sources, given the assumption that their redshift depth is as expected from the GZK cutoff. We investigate the constraining power of a given UHECR sample as a function of its cutoff energy and number of events. We show that current and future samples should be able to distinguish between the cases of their sources being galaxy clusters, ordinary galaxies, or sources that are uncorrelated with the large-scale structure of the universe.
Examination of evidence for collinear cluster tri-partition
NASA Astrophysics Data System (ADS)
Pyatkov, Yu. V.; Kamanin, D. V.; Alexandrov, A. A.; Alexandrova, I. A.; Goryainova, Z. I.; Malaza, V.; Mkaza, N.; Kuznetsova, E. A.; Strekalovsky, A. O.; Strekalovsky, O. V.; Zhuchko, V. E.
2017-12-01
Background: In a series of experiments at different time-of-flight spectrometers of heavy ions we have observed manifestations of a new at least ternary decay channel of low excited heavy nuclei. Due to specific features of the effect, it was called collinear cluster tri-partition (CCT). The obtained experimental results have initiated a number of theoretical articles dedicated to different aspects of the CCT. Special attention was paid to kinematics constraints and stability of collinearity. Purpose: To compare theoretical predictions with our experimental data, only partially published so far. To develop the model of one of the most populated CCT modes that gives rise to the so-called "Ni-bump." Method: The fission events under analysis form regular two-dimensional linear structures in the mass correlation distributions of the fission fragments. The structures were revealed both at a highly statistically reliable level but on the background substrate, and at the low statistics in almost noiseless distribution. The structures are bounded by the known magic fragments and were reproduced at different spectrometers. All this provides high reliability of our experimental findings. The model of the CCT proposed here is based on theoretical results, published recently, and the detailed analysis of all available experimental data. Results: Under our model, the CCT mode giving rise to the Ni bump occurs as a two-stage breakup of the initial three body chain like the nuclear configuration with an elongated central cluster. After the first scission at the touching point with one of the side clusters, the predominantly heavier one, the deformation energy of the central cluster allows the emission of up to four neutrons flying apart isotropically. The heavy side cluster and a dinuclear system, consisting of the light side cluster and the central one, relaxed to a less elongated shape, are accelerated in the mutual Coulomb field. The "tip" of the dinuclear system at the moment of its rupture faces the heavy fragment or the opposite direction due to a single turn of the system around its center of gravity. Conclusions: Additional experimental information regarding the energies of the CCT partners and the proposed model of the process respond to criticisms concerning the kinematic constraints and the stability of collinearity in the CCT. The octupole deformed system formed after the first scission is oriented along the fission axis, and its rupture occurs predominantly after the full acceleration. Noncollinear true ternary fission and far asymmetric binary fission, observed earlier, appear to be the special cases of the decay of the prescission configuration leading to the CCT. Detection of the Ni-7268 fission fragments with a kinetic energy E <25 MeV at the mass-separator Lohengrin is proposed for an independent experimental verification of the CCT.
Towards an Autonomic Cluster Management System (ACMS) with Reflex Autonomicity
NASA Technical Reports Server (NTRS)
Truszkowski, Walt; Hinchey, Mike; Sterritt, Roy
2005-01-01
Cluster computing, whereby a large number of simple processors or nodes are combined together to apparently function as a single powerful computer, has emerged as a research area in its own right. The approach offers a relatively inexpensive means of providing a fault-tolerant environment and achieving significant computational capabilities for high-performance computing applications. However, the task of manually managing and configuring a cluster quickly becomes daunting as the cluster grows in size. Autonomic computing, with its vision to provide self-management, can potentially solve many of the problems inherent in cluster management. We describe the development of a prototype Autonomic Cluster Management System (ACMS) that exploits autonomic properties in automating cluster management and its evolution to include reflex reactions via pulse monitoring.
Organizational Modes of Severe Wind-producing Convective Systems over North China
NASA Astrophysics Data System (ADS)
Yang, Xinlin; Sun, Jianhua
2018-05-01
Severe weather reports and composite radar reflectivity data from 2010-14 over North China were used to analyze the distribution of severe convective wind (SCW) events and their organizational modes of radar reflectivity. The six organizational modes for SCW events (and their proportions) were cluster cells (35.4%), squall lines (18.4%), nonlinear-shaped systems (17.8%), broken lines (11.6%), individual cells (1.2%), and bow echoes (0.5%). The peak month for both squall lines and broken lines was June, whereas it was July for the other four modes. The highest numbers of SCW events were over the mountains, which were generally associated with disorganized systems of cluster cells. In contrast, SCW associated with linear systems occurred mainly over the plains, where stations recorded an average of less than one SCW event per year. Regions with a high frequency of SCW associated with nonlinear-shaped systems also experienced many SCW events associated with squall lines. Values of convective available potential energy, precipitable water, 0-3-km shear, and 0-6-km shear, were demonstrably larger over the plains than over the mountains, which had an evident effect on the organizational modes of SCW events. Therefore, topography may be an important factor in the organizational modes for SCW events over North China.
An Experimental Seismic Data and Parameter Exchange System for Tsunami Warning Systems
NASA Astrophysics Data System (ADS)
Hoffmann, T. L.; Hanka, W.; Saul, J.; Weber, B.; Becker, J.; Heinloo, A.; Hoffmann, M.
2009-12-01
For several years GFZ Potsdam is operating a global earthquake monitoring system. Since the beginning of 2008, this system is also used as an experimental seismic background data center for two different regional Tsunami Warning Systems (TWS), the IOTWS (Indian Ocean) and the interim NEAMTWS (NE Atlantic and Mediterranean). The SeisComP3 (SC3) software, developed within the GITEWS (German Indian Ocean Tsunami Early Warning System) project, capable to acquire, archive and process real-time data feeds, was extended for export and import of individual processing results within the two clusters of connected SC3 systems. Therefore not only real-time waveform data are routed to the attached warning centers through GFZ but also processing results. While the current experimental NEAMTWS cluster consists of SC3 systems in six designated national warning centers in Europe, the IOTWS cluster presently includes seven centers, with another three likely to join in 2009/10. For NEAMTWS purposes, the GFZ virtual real-time seismic network (GEOFON Extended Virtual Network -GEVN) in Europe was substantially extended by adding many stations from Western European countries optimizing the station distribution. In parallel to the data collection over the Internet, a GFZ VSAT hub for secured data collection of the EuroMED GEOFON and NEAMTWS backbone network stations became operational and first data links were established through this backbone. For the Southeast Asia region, a VSAT hub has been established in Jakarta already in 2006, with some other partner networks connecting to this backbone via the Internet. Since its establishment, the experimental system has had the opportunity to prove its performance in a number of relevant earthquakes. Reliable solutions derived from a minimum of 25 stations were very promising in terms of speed. For important events, automatic alerts were released and disseminated by emails and SMS. Manually verified solutions are added as soon as they become available. The results are also promising in terms of accuracy since epicenter coordinates, depth and magnitude estimates were sufficiently accurate from the very beginning, and usually do not differ substantially from the final solutions. In summary, automatic seismic event processing has shown to work well as a first step for starting a Tsunami Warning process. However, for the secured assessment of the tsunami potential of a given event, 24/7-manned regional TWCs are mandatory for reliable manual verification of the automatic seismic results. At this time, GFZ itself provides manual verification only when staff is available, not on a 24/7 basis, while the actual national tsunami warning centers have all a reliable 24/7 service.
A knowledge-driven approach to cluster validity assessment.
Bolshakova, Nadia; Azuaje, Francisco; Cunningham, Pádraig
2005-05-15
This paper presents an approach to assessing cluster validity based on similarity knowledge extracted from the Gene Ontology. The program is freely available for non-profit use on request from the authors.
High-Performance, Multi-Node File Copies and Checksums for Clustered File Systems
NASA Technical Reports Server (NTRS)
Kolano, Paul Z.; Ciotti, Robert B.
2012-01-01
Modern parallel file systems achieve high performance using a variety of techniques, such as striping files across multiple disks to increase aggregate I/O bandwidth and spreading disks across multiple servers to increase aggregate interconnect bandwidth. To achieve peak performance from such systems, it is typically necessary to utilize multiple concurrent readers/writers from multiple systems to overcome various singlesystem limitations, such as number of processors and network bandwidth. The standard cp and md5sum tools of GNU coreutils found on every modern Unix/Linux system, however, utilize a single execution thread on a single CPU core of a single system, and hence cannot take full advantage of the increased performance of clustered file systems. Mcp and msum are drop-in replacements for the standard cp and md5sum programs that utilize multiple types of parallelism and other optimizations to achieve maximum copy and checksum performance on clustered file systems. Multi-threading is used to ensure that nodes are kept as busy as possible. Read/write parallelism allows individual operations of a single copy to be overlapped using asynchronous I/O. Multinode cooperation allows different nodes to take part in the same copy/checksum. Split-file processing allows multiple threads to operate concurrently on the same file. Finally, hash trees allow inherently serial checksums to be performed in parallel. Mcp and msum provide significant performance improvements over standard cp and md5sum using multiple types of parallelism and other optimizations. The total speed-ups from all improvements are significant. Mcp improves cp performance over 27x, msum improves md5sum performance almost 19x, and the combination of mcp and msum improves verified copies via cp and md5sum by almost 22x. These improvements come in the form of drop-in replacements for cp and md5sum, so are easily used and are available for download as open source software at http://mutil.sourceforge.net.
ICAP - An Interactive Cluster Analysis Procedure for analyzing remotely sensed data
NASA Technical Reports Server (NTRS)
Wharton, S. W.; Turner, B. J.
1981-01-01
An Interactive Cluster Analysis Procedure (ICAP) was developed to derive classifier training statistics from remotely sensed data. ICAP differs from conventional clustering algorithms by allowing the analyst to optimize the cluster configuration by inspection, rather than by manipulating process parameters. Control of the clustering process alternates between the algorithm, which creates new centroids and forms clusters, and the analyst, who can evaluate and elect to modify the cluster structure. Clusters can be deleted, or lumped together pairwise, or new centroids can be added. A summary of the cluster statistics can be requested to facilitate cluster manipulation. The principal advantage of this approach is that it allows prior information (when available) to be used directly in the analysis, since the analyst interacts with ICAP in a straightforward manner, using basic terms with which he is more likely to be familiar. Results from testing ICAP showed that an informed use of ICAP can improve classification, as compared to an existing cluster analysis procedure.
Semi-supervised clustering methods.
Bair, Eric
2013-01-01
Cluster analysis methods seek to partition a data set into homogeneous subgroups. It is useful in a wide variety of applications, including document processing and modern genetics. Conventional clustering methods are unsupervised, meaning that there is no outcome variable nor is anything known about the relationship between the observations in the data set. In many situations, however, information about the clusters is available in addition to the values of the features. For example, the cluster labels of some observations may be known, or certain observations may be known to belong to the same cluster. In other cases, one may wish to identify clusters that are associated with a particular outcome variable. This review describes several clustering algorithms (known as "semi-supervised clustering" methods) that can be applied in these situations. The majority of these methods are modifications of the popular k-means clustering method, and several of them will be described in detail. A brief description of some other semi-supervised clustering algorithms is also provided.
The HectoMAP Cluster Survey. I. redMaPPer Clusters
NASA Astrophysics Data System (ADS)
Sohn, Jubee; Geller, Margaret J.; Rines, Kenneth J.; Hwang, Ho Seong; Utsumi, Yousuke; Diaferio, Antonaldo
2018-04-01
We use the dense HectoMAP redshift survey to explore the properties of 104 redMaPPer cluster candidates. The redMaPPer systems in HectoMAP cover the full range of richness and redshift (0.08 < z < 0.60). Fifteen of the systems included in the Subaru/Hyper Suprime-Cam public data release are bona fide clusters. The median number of spectroscopic members per cluster is ∼20. We include redshifts of 3547 member candidates listed in the redMaPPer catalog whether they are cluster members or not. We evaluate the redMaPPer membership probability spectroscopically. The purity (number of real systems) in redMaPPer exceeds 90% even at the lowest richness. Three massive galaxy clusters (M ∼ 2 × 1013 M ⊙) associated with X-ray emission in the HectoMAP region are not included in the public redMaPPer catalog with λ rich > 20, because they lie outside the cuts for this catalog.
The HectoMAP Cluster Survey. I. redMaPPer Clusters
Sohn, Jubee; Geller, Margaret J.; Rines, Kenneth J.; ...
2018-04-05
We use the dense HectoMAP redshift survey to explore the properties of 104 redMaPPer cluster candidates. The redMaPPer systems in HectoMAP cover the full range of richness and redshift (0.08 < z < 0.60). Fifteen of the systems included in the Subaru/Hyper Suprime-Cam public data release are bona fide clusters. The median number of spectroscopic members per cluster is ~20. We include redshifts of 3547 member candidates listed in the redMaPPer catalog whether they are cluster members or not. Here, we evaluate the redMaPPer membership probability spectroscopically. The purity (number of real systems) in redMaPPer exceeds 90% even at themore » lowest richness. Three massive galaxy clusters (M ~ 2 × 10 13 M ⊙) associated with X-ray emission in the HectoMAP region are not included in the public redMaPPer catalog with λ rich > 20, because they lie outside the cuts for this catalog.« less
The HectoMAP Cluster Survey. I. redMaPPer Clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sohn, Jubee; Geller, Margaret J.; Rines, Kenneth J.
We use the dense HectoMAP redshift survey to explore the properties of 104 redMaPPer cluster candidates. The redMaPPer systems in HectoMAP cover the full range of richness and redshift (0.08 < z < 0.60). Fifteen of the systems included in the Subaru/Hyper Suprime-Cam public data release are bona fide clusters. The median number of spectroscopic members per cluster is ~20. We include redshifts of 3547 member candidates listed in the redMaPPer catalog whether they are cluster members or not. Here, we evaluate the redMaPPer membership probability spectroscopically. The purity (number of real systems) in redMaPPer exceeds 90% even at themore » lowest richness. Three massive galaxy clusters (M ~ 2 × 10 13 M ⊙) associated with X-ray emission in the HectoMAP region are not included in the public redMaPPer catalog with λ rich > 20, because they lie outside the cuts for this catalog.« less
Clustering in Cell Cycle Dynamics with General Response/Signaling Feedback
Young, Todd R.; Fernandez, Bastien; Buckalew, Richard; Moses, Gregory; Boczko, Erik M.
2011-01-01
Motivated by experimental and theoretical work on autonomous oscillations in yeast, we analyze ordinary differential equations models of large populations of cells with cell-cycle dependent feedback. We assume a particular type of feedback that we call Responsive/Signaling (RS), but do not specify a functional form of the feedback. We study the dynamics and emergent behaviour of solutions, particularly temporal clustering and stability of clustered solutions. We establish the existence of certain periodic clustered solutions as well as “uniform” solutions and add to the evidence that cell-cycle dependent feedback robustly leads to cell-cycle clustering. We highlight the fundamental differences in dynamics between systems with negative and positive feedback. For positive feedback systems the most important mechanism seems to be the stability of individual isolated clusters. On the other hand we find that in negative feedback systems, clusters must interact with each other to reinforce coherence. We conclude from various details of the mathematical analysis that negative feedback is most consistent with observations in yeast experiments. PMID:22001733
Agent-based method for distributed clustering of textual information
Potok, Thomas E [Oak Ridge, TN; Reed, Joel W [Knoxville, TN; Elmore, Mark T [Oak Ridge, TN; Treadwell, Jim N [Louisville, TN
2010-09-28
A computer method and system for storing, retrieving and displaying information has a multiplexing agent (20) that calculates a new document vector (25) for a new document (21) to be added to the system and transmits the new document vector (25) to master cluster agents (22) and cluster agents (23) for evaluation. These agents (22, 23) perform the evaluation and return values upstream to the multiplexing agent (20) based on the similarity of the document to documents stored under their control. The multiplexing agent (20) then sends the document (21) and the document vector (25) to the master cluster agent (22), which then forwards it to a cluster agent (23) or creates a new cluster agent (23) to manage the document (21). The system also searches for stored documents according to a search query having at least one term and identifying the documents found in the search, and displays the documents in a clustering display (80) of similarity so as to indicate similarity of the documents to each other.
Novel approaches to pin cluster synchronization on complex dynamical networks in Lur'e forms
NASA Astrophysics Data System (ADS)
Tang, Ze; Park, Ju H.; Feng, Jianwen
2018-04-01
This paper investigates the cluster synchronization of complex dynamical networks consisted of identical or nonidentical Lur'e systems. Due to the special topology structure of the complex networks and the existence of stochastic perturbations, a kind of randomly occurring pinning controller is designed which not only synchronizes all Lur'e systems in the same cluster but also decreases the negative influence among different clusters. Firstly, based on an extended integral inequality, the convex combination theorem and S-procedure, the conditions for cluster synchronization of identical Lur'e networks are derived in a convex domain. Secondly, randomly occurring adaptive pinning controllers with two independent Bernoulli stochastic variables are designed and then sufficient conditions are obtained for the cluster synchronization on complex networks consisted of nonidentical Lur'e systems. In addition, suitable control gains for successful cluster synchronization of nonidentical Lur'e networks are acquired by designing some adaptive updating laws. Finally, we present two numerical examples to demonstrate the validity of the control scheme and the theoretical analysis.
Nagaoka, Tomoaki; Watanabe, Soichi
2012-01-01
Electromagnetic simulation with anatomically realistic computational human model using the finite-difference time domain (FDTD) method has recently been performed in a number of fields in biomedical engineering. To improve the method's calculation speed and realize large-scale computing with the computational human model, we adapt three-dimensional FDTD code to a multi-GPU cluster environment with Compute Unified Device Architecture and Message Passing Interface. Our multi-GPU cluster system consists of three nodes. The seven GPU boards (NVIDIA Tesla C2070) are mounted on each node. We examined the performance of the FDTD calculation on multi-GPU cluster environment. We confirmed that the FDTD calculation on the multi-GPU clusters is faster than that on a multi-GPU (a single workstation), and we also found that the GPU cluster system calculate faster than a vector supercomputer. In addition, our GPU cluster system allowed us to perform the large-scale FDTD calculation because were able to use GPU memory of over 100 GB.
Cell cycle dynamics in a response/signalling feedback system with a gap.
Gong, Xue; Buckalew, Richard; Young, Todd; Boczko, Erik
2014-01-01
We consider a dynamical model of cell cycles of n cells in a culture in which cells in one specific phase (S for signalling) of the cell cycle produce chemical agents that influence the growth/cell cycle progression of cells in another phase (R for responsive). In the case that the feedback is negative, it is known that subpopulations of cells tend to become clustered in the cell cycle; while for a positive feedback, all the cells tend to become synchronized. In this paper, we suppose that there is a gap between the two phases. The gap can be thought of as modelling the physical reality of a time delay in the production and action of the signalling agents. We completely analyse the dynamics of this system when the cells are arranged into two cell cycle clusters. We also consider the stability of certain important periodic solutions in which clusters of cells have a cyclic arrangement and there are just enough clusters to allow interactions between them. We find that the inclusion of a small gap does not greatly alter the global dynamics of the system; there are still large open sets of parameters for which clustered solutions are stable. Thus, we add to the evidence that clustering can be a robust phenomenon in biological systems. However, the gap does effect the system by enhancing the stability of the stable clustered solutions. We explain this phenomenon in terms of contraction rates (Floquet exponents) in various invariant subspaces of the system. We conclude that in systems for which these models are reasonable, a delay in signalling is advantageous to the emergence of clustering.
Solid state and aqueous behavior of uranyl peroxide cage clusters
NASA Astrophysics Data System (ADS)
Pellegrini, Kristi Lynn
Uranyl peroxide cage clusters include a large family of more than 50 published clusters of a variety of sizes, which can incorporate various ligands including pyrophosphate and oxalate. Previous studies have reported that uranyl clusters can be used as a method to separate uranium from a solid matrix, with potential applications in reprocessing of irradiated nuclear fuel. Because of the potential applications of these novel structures in an advanced nuclear fuel cycle and their likely presence in areas of contamination, it is important to understand their behavior in both solid state and aqueous systems, including complex environments where other ions are present. In this thesis, I examine the aqueous behavior of U24Pp 12, as well as aqueous cluster systems with added mono-, di-, and trivalent cations. The resulting solutions were analyzed using dynamic light scattering and ultra-small angle X-ray scattering to evaluate the species in solution. Precipitates of these systems were analyzed using powder X-ray diffraction, X-ray fluorescence spectrometry, and Raman spectroscopy. The results of these analyses demonstrate the importance of cation size, charge, and concentration of added cations on the aqueous behavior of uranium macroions. Specifically, aggregates of various sizes and shapes form rapidly upon addition of cations, and in some cases these aggregates appear to precipitate into an X-ray amorphous material that still contains U24Pp12 clusters. In addition, I probe aggregation of U24Pp12 and U60, another uranyl peroxide cage cluster, in mixed solvent water-alcohol systems. The aggregation of uranyl clusters in water-alcohol systems is a result of hydrogen bonding with polar organic molecules and the reduction of the dielectric constant of the system. Studies of aggregation of uranyl clusters also allow for comparison between the newer uranyl polyoxometalate family and century-old transition metal polyoxometalates. To complement the solution studies of uranyl cage clusters, solid state analyses of U24Pp12 are presented, including single crystal X-ray diffraction and preliminary single crystal neutron diffraction. Solid state analyses are used to probe the complicated bonding environments between U24Pp12 and crystallized counterions, giving further insight into the importance of cluster protonation and counterions in uranyl cluster systems. The combination of solid state and solution techniques provides information about the complicated nature of uranyl peroxide nanoclusters, and insight towards future applications of clusters in the advanced nuclear fuel cycle and the environment.
Modulated Modularity Clustering as an Exploratory Tool for Functional Genomic Inference
Stone, Eric A.; Ayroles, Julien F.
2009-01-01
In recent years, the advent of high-throughput assays, coupled with their diminishing cost, has facilitated a systems approach to biology. As a consequence, massive amounts of data are currently being generated, requiring efficient methodology aimed at the reduction of scale. Whole-genome transcriptional profiling is a standard component of systems-level analyses, and to reduce scale and improve inference clustering genes is common. Since clustering is often the first step toward generating hypotheses, cluster quality is critical. Conversely, because the validation of cluster-driven hypotheses is indirect, it is critical that quality clusters not be obtained by subjective means. In this paper, we present a new objective-based clustering method and demonstrate that it yields high-quality results. Our method, modulated modularity clustering (MMC), seeks community structure in graphical data. MMC modulates the connection strengths of edges in a weighted graph to maximize an objective function (called modularity) that quantifies community structure. The result of this maximization is a clustering through which tightly-connected groups of vertices emerge. Our application is to systems genetics, and we quantitatively compare MMC both to the hierarchical clustering method most commonly employed and to three popular spectral clustering approaches. We further validate MMC through analyses of human and Drosophila melanogaster expression data, demonstrating that the clusters we obtain are biologically meaningful. We show MMC to be effective and suitable to applications of large scale. In light of these features, we advocate MMC as a standard tool for exploration and hypothesis generation. PMID:19424432
Barker, Daniel; D'Este, Catherine; Campbell, Michael J; McElduff, Patrick
2017-03-09
Stepped wedge cluster randomised trials frequently involve a relatively small number of clusters. The most common frameworks used to analyse data from these types of trials are generalised estimating equations and generalised linear mixed models. A topic of much research into these methods has been their application to cluster randomised trial data and, in particular, the number of clusters required to make reasonable inferences about the intervention effect. However, for stepped wedge trials, which have been claimed by many researchers to have a statistical power advantage over the parallel cluster randomised trial, the minimum number of clusters required has not been investigated. We conducted a simulation study where we considered the most commonly used methods suggested in the literature to analyse cross-sectional stepped wedge cluster randomised trial data. We compared the per cent bias, the type I error rate and power of these methods in a stepped wedge trial setting with a binary outcome, where there are few clusters available and when the appropriate adjustment for a time trend is made, which by design may be confounding the intervention effect. We found that the generalised linear mixed modelling approach is the most consistent when few clusters are available. We also found that none of the common analysis methods for stepped wedge trials were both unbiased and maintained a 5% type I error rate when there were only three clusters. Of the commonly used analysis approaches, we recommend the generalised linear mixed model for small stepped wedge trials with binary outcomes. We also suggest that in a stepped wedge design with three steps, at least two clusters be randomised at each step, to ensure that the intervention effect estimator maintains the nominal 5% significance level and is also reasonably unbiased.
Clustering by reordering of similarity and Laplacian matrices: Application to galaxy clusters
NASA Astrophysics Data System (ADS)
Mahmoud, E.; Shoukry, A.; Takey, A.
2018-04-01
Similarity metrics, kernels and similarity-based algorithms have gained much attention due to their increasing applications in information retrieval, data mining, pattern recognition and machine learning. Similarity Graphs are often adopted as the underlying representation of similarity matrices and are at the origin of known clustering algorithms such as spectral clustering. Similarity matrices offer the advantage of working in object-object (two-dimensional) space where visualization of clusters similarities is available instead of object-features (multi-dimensional) space. In this paper, sparse ɛ-similarity graphs are constructed and decomposed into strong components using appropriate methods such as Dulmage-Mendelsohn permutation (DMperm) and/or Reverse Cuthill-McKee (RCM) algorithms. The obtained strong components correspond to groups (clusters) in the input (feature) space. Parameter ɛi is estimated locally, at each data point i from a corresponding narrow range of the number of nearest neighbors. Although more advanced clustering techniques are available, our method has the advantages of simplicity, better complexity and direct visualization of the clusters similarities in a two-dimensional space. Also, no prior information about the number of clusters is needed. We conducted our experiments on two and three dimensional, low and high-sized synthetic datasets as well as on an astronomical real-dataset. The results are verified graphically and analyzed using gap statistics over a range of neighbors to verify the robustness of the algorithm and the stability of the results. Combining the proposed algorithm with gap statistics provides a promising tool for solving clustering problems. An astronomical application is conducted for confirming the existence of 45 galaxy clusters around the X-ray positions of galaxy clusters in the redshift range [0.1..0.8]. We re-estimate the photometric redshifts of the identified galaxy clusters and obtain acceptable values compared to published spectroscopic redshifts with a 0.029 standard deviation of their differences.
Improving clustering with metabolic pathway data.
Milone, Diego H; Stegmayer, Georgina; López, Mariana; Kamenetzky, Laura; Carrari, Fernando
2014-04-10
It is a common practice in bioinformatics to validate each group returned by a clustering algorithm through manual analysis, according to a-priori biological knowledge. This procedure helps finding functionally related patterns to propose hypotheses for their behavior and the biological processes involved. Therefore, this knowledge is used only as a second step, after data are just clustered according to their expression patterns. Thus, it could be very useful to be able to improve the clustering of biological data by incorporating prior knowledge into the cluster formation itself, in order to enhance the biological value of the clusters. A novel training algorithm for clustering is presented, which evaluates the biological internal connections of the data points while the clusters are being formed. Within this training algorithm, the calculation of distances among data points and neurons centroids includes a new term based on information from well-known metabolic pathways. The standard self-organizing map (SOM) training versus the biologically-inspired SOM (bSOM) training were tested with two real data sets of transcripts and metabolites from Solanum lycopersicum and Arabidopsis thaliana species. Classical data mining validation measures were used to evaluate the clustering solutions obtained by both algorithms. Moreover, a new measure that takes into account the biological connectivity of the clusters was applied. The results of bSOM show important improvements in the convergence and performance for the proposed clustering method in comparison to standard SOM training, in particular, from the application point of view. Analyses of the clusters obtained with bSOM indicate that including biological information during training can certainly increase the biological value of the clusters found with the proposed method. It is worth to highlight that this fact has effectively improved the results, which can simplify their further analysis.The algorithm is available as a web-demo at http://fich.unl.edu.ar/sinc/web-demo/bsom-lite/. The source code and the data sets supporting the results of this article are available at http://sourceforge.net/projects/sourcesinc/files/bsom.
Adjusting the fairshare policy to prevent computing power loss
NASA Astrophysics Data System (ADS)
Dal Pra, Stefano
2017-10-01
On a typical WLCG site providing batch access to computing resources according to a fairshare policy, the idle time lapse after a job ends and before a new one begins on a given slot is negligible if compared to the duration of typical jobs. The overall amount of these intervals over a time window increases with the size of the cluster and the inverse of job duration and can be considered equivalent to an average number of unavailable slots over that time window. This value has been investigated for the Tier-1 at CNAF, and observed to occasionally grow and reach up to more than the 10% of the about 20,000 available computing slots. Analysis reveals that this happens when a sustained rate of short jobs is submitted to the cluster and dispatched by the batch system. Because of how the default fairshare policy works, it increases the dynamic priority of those users mostly submitting short jobs, since they are not accumulating runtime, and will dispatch more of their jobs at the next round, thus worsening the situation until the submission flow ends. To address this problem the default behaviour of the fairshare have been altered by adding a correcting term to the default formula for the dynamic priority. The LSF batch system, currently adopted at CNAF, provides a way to define its value by invoking a C function, which returns it for each user in the cluster. The correcting term works by rounding up to a minimum defined runtime the most recently done jobs. Doing so, each short job looks almost like a regular one and the dynamic priority value settles to a proper value. The net effect is a reduction of the dispatching rate of short jobs and, consequently, the average number of available slots greatly improves. Furthermore, a potential starvation problem, actually observed at least once is also prevented. After describing short jobs and reporting about their impact on the cluster, possible workarounds are discussed and the selected solution is motivated. Details on the most critical aspects of the implementation are explained and the observed results are presented.
NASA Astrophysics Data System (ADS)
Cottaar, M.; Hénault-Brunet, V.
2014-02-01
Orbital motions from binary stars can broaden the observed line-of-sight velocity distribution of a stellar system and artificially inflate the measured line-of-sight velocity dispersion, which can in turn lead to erroneous conclusions about the dynamical state of the system. Recently, a maximum-likelihood procedure was proposed to recover the intrinsic velocity dispersion of a resolved star cluster from a single epoch of radial velocity data of individual stars, which was achieved by simultaneously fitting the intrinsic velocity distribution of the single stars and the centers of mass of the binaries along with the velocity shifts caused by binary orbital motions. Assuming well-characterized binary properties, this procedure can accurately reproduce intrinsic velocity dispersions below 1 km s-1 for solar-type stars. Here we investigate the systematic offsets induced when the binary properties are uncertain and we show that two epochs of radial velocity data with an appropriate baseline can help to mitigate these systematic effects. We first test the method described above using Monte Carlo simulations, taking into account the large uncertainties in the binary properties of OB stars. We then apply it to radial velocity data in the young massive cluster R136 for which the intrinsic velocity dispersion of O-type stars is known from an intensive multi-epoch approach. For typical velocity dispersions of young massive clusters (≳4 km s-1) and with a single epoch of data, we demonstrate that the method can just about distinguish between a cluster in virial equilibrium and an unbound cluster. This is due to the higher spectroscopic binary fraction and more loosely constrained distributions of orbital parameters of OB stars compared to solar-type stars. By extending the maximum-likelihood method to multi-epoch data, we show that the accuracy on the fitted velocity dispersion can be improved by only a few percent by using only two epochs of radial velocities. This procedure offers a promising method of accurately measuring the intrinsic stellar velocity dispersion in other systems for which the binary properties are poorly constrained, for example, young clusters and associations whose luminosity is dominated by OB stars. Appendix A is available in electronic form at http://www.aanda.org
Free energy of singular sticky-sphere clusters.
Kallus, Yoav; Holmes-Cerfon, Miranda
2017-02-01
Networks of particles connected by springs model many condensed-matter systems, from colloids interacting with a short-range potential and complex fluids near jamming, to self-assembled lattices and various metamaterials. Under small thermal fluctuations the vibrational entropy of a ground state is given by the harmonic approximation if it has no zero-frequency vibrational modes, yet such singular modes are at the epicenter of many interesting behaviors in the systems above. We consider a system of N spherical particles, and directly account for the singularities that arise in the sticky limit where the pairwise interaction is strong and short ranged. Although the contribution to the partition function from singular clusters diverges in the limit, its asymptotic value can be calculated and depends on only two parameters, characterizing the depth and range of the potential. The result holds for systems that are second-order rigid, a geometric characterization that describes all known ground-state (rigid) sticky clusters. To illustrate the applications of our theory we address the question of emergence: how does crystalline order arise in large systems when it is strongly disfavored in small ones? We calculate the partition functions of all known rigid clusters up to N≤21 and show the cluster landscape is dominated by hyperstatic clusters (those with more than 3N-6 contacts); singular and isostatic clusters are far less frequent, despite their extra vibrational and configurational entropies. Since the most hyperstatic clusters are close to fragments of a close-packed lattice, this underlies the emergence of order in sticky-sphere systems, even those as small as N=10.
Free energy of singular sticky-sphere clusters
NASA Astrophysics Data System (ADS)
Kallus, Yoav; Holmes-Cerfon, Miranda
2017-02-01
Networks of particles connected by springs model many condensed-matter systems, from colloids interacting with a short-range potential and complex fluids near jamming, to self-assembled lattices and various metamaterials. Under small thermal fluctuations the vibrational entropy of a ground state is given by the harmonic approximation if it has no zero-frequency vibrational modes, yet such singular modes are at the epicenter of many interesting behaviors in the systems above. We consider a system of N spherical particles, and directly account for the singularities that arise in the sticky limit where the pairwise interaction is strong and short ranged. Although the contribution to the partition function from singular clusters diverges in the limit, its asymptotic value can be calculated and depends on only two parameters, characterizing the depth and range of the potential. The result holds for systems that are second-order rigid, a geometric characterization that describes all known ground-state (rigid) sticky clusters. To illustrate the applications of our theory we address the question of emergence: how does crystalline order arise in large systems when it is strongly disfavored in small ones? We calculate the partition functions of all known rigid clusters up to N ≤21 and show the cluster landscape is dominated by hyperstatic clusters (those with more than 3 N -6 contacts); singular and isostatic clusters are far less frequent, despite their extra vibrational and configurational entropies. Since the most hyperstatic clusters are close to fragments of a close-packed lattice, this underlies the emergence of order in sticky-sphere systems, even those as small as N =10 .
StarBooster Demonstrator Cluster Configuration Analysis/Verification Program
NASA Technical Reports Server (NTRS)
DeTurris, Dianne J.
2003-01-01
In order to study the flight dynamics of the cluster configuration of two first stage boosters and upper-stage, flight-testing of subsonic sub-scale models has been undertaken using two glideback boosters launched on a center upper-stage. Three high power rockets clustered together were built and flown to demonstrate vertical launch, separation and horizontal recovery of the boosters. Although the boosters fly to conventional aircraft landing, the centerstage comes down separately under its own parachute. The goal of the project has been to collect data during separation and flight for comparison with a six degree of freedom simulation. The configuration for the delta wing canard boosters comes from a design by Starcraft Boosters, Inc. The subscale rockets were constructed of foam covered in carbon or fiberglass and were launched with commercially available solid rocket motors. The first set of boosters built were 3-ft tall with a 4-ft tall centerstage, and two additional sets of boosters were made that were each over 5-ft tall with a 7.5 ft centerstage. The rocket cluster is launched vertically, then after motor bum out the boosters are separated and flown to a horizontal landing under radio-control. An on-board data acquisition system recorded data during both the launch and glide phases of flight.
Modelling conflicts with cluster dynamics in networks
NASA Astrophysics Data System (ADS)
Tadić, Bosiljka; Rodgers, G. J.
2010-12-01
We introduce cluster dynamical models of conflicts in which only the largest cluster can be involved in an action. This mimics the situations in which an attack is planned by a central body, and the largest attack force is used. We study the model in its annealed random graph version, on a fixed network, and on a network evolving through the actions. The sizes of actions are distributed with a power-law tail, however, the exponent is non-universal and depends on the frequency of actions and sparseness of the available connections between units. Allowing the network reconstruction over time in a self-organized manner, e.g., by adding the links based on previous liaisons between units, we find that the power-law exponent depends on the evolution time of the network. Its lower limit is given by the universal value 5/2, derived analytically for the case of random fragmentation processes. In the temporal patterns behind the size of actions we find long-range correlations in the time series of the number of clusters and the non-trivial distribution of time that a unit waits between two actions. In the case of an evolving network the distribution develops a power-law tail, indicating that through repeated actions, the system develops an internal structure with a hierarchy of units.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Caruso, Francesco; Bellacicca, Andrea; Milani, Paolo, E-mail: pmilani@mi.infn.it
We report the rapid prototyping of passive electrical components (resistors and capacitors) on plain paper by an additive and parallel technology consisting of supersonic cluster beam deposition (SCBD) coupled with shadow mask printing. Cluster-assembled films have a growth mechanism substantially different from that of atom-assembled ones providing the possibility of a fine tuning of their electrical conduction properties around the percolative conduction threshold. Exploiting the precise control on cluster beam intensity and shape typical of SCBD, we produced, in a one-step process, batches of resistors with resistance values spanning a range of two orders of magnitude. Parallel plate capacitors withmore » paper as the dielectric medium were also produced with capacitance in the range of tens of picofarads. Compared to standard deposition technologies, SCBD allows for a very efficient use of raw materials and the rapid production of components with different shape and dimensions while controlling independently the electrical characteristics. Discrete electrical components produced by SCBD are very robust against deformation and bending, and they can be easily assembled to build circuits with desired characteristics. The availability of large batches of these components enables the rapid and cheap prototyping and integration of electrical components on paper as building blocks of more complex systems.« less
Zhao, Run -Ning; Chen, Rui; Yuan, Yan -Hong; ...
2017-08-10
Here, the stable equilibrium geometries, relative stabilities, and electronic and magnetic characteristics of Re n (n = 2–16) clusters were investigated by density functional theory method. The calculated fragmentation energies and second-order differences of energies exhibited interestingly that the stabilities of Re n (n = 2–16) clusters show a dramatic odd-even alternative behavior of the cluster size n: with the even-numbered Ren clusters being obviously more stable than their neighboring odd-numbered Re n clusters (beside n = 11). Simultaneously, the calculated HOMO-LUMO gaps of Re n (n = 6–16) display an oscillatory feature at large-sized Ren clusters. From the calculatedmore » magnetic moments and growth behaviors of Rhenium clusters, the magnetic Re 6 unit can be seen as the building block for the novel magnetic cluster-assembled nanomaterial. Such calculated results are in good agreement with the available experimental measurements.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhao, Run -Ning; Chen, Rui; Yuan, Yan -Hong
Here, the stable equilibrium geometries, relative stabilities, and electronic and magnetic characteristics of Re n (n = 2–16) clusters were investigated by density functional theory method. The calculated fragmentation energies and second-order differences of energies exhibited interestingly that the stabilities of Re n (n = 2–16) clusters show a dramatic odd-even alternative behavior of the cluster size n: with the even-numbered Ren clusters being obviously more stable than their neighboring odd-numbered Re n clusters (beside n = 11). Simultaneously, the calculated HOMO-LUMO gaps of Re n (n = 6–16) display an oscillatory feature at large-sized Ren clusters. From the calculatedmore » magnetic moments and growth behaviors of Rhenium clusters, the magnetic Re 6 unit can be seen as the building block for the novel magnetic cluster-assembled nanomaterial. Such calculated results are in good agreement with the available experimental measurements.« less
Cluster Computing for Embedded/Real-Time Systems
NASA Technical Reports Server (NTRS)
Katz, D.; Kepner, J.
1999-01-01
Embedded and real-time systems, like other computing systems, seek to maximize computing power for a given price, and thus can significantly benefit from the advancing capabilities of cluster computing.
Loewenstein, Yaniv; Portugaly, Elon; Fromer, Menachem; Linial, Michal
2008-01-01
Motivation: UPGMA (average linking) is probably the most popular algorithm for hierarchical data clustering, especially in computational biology. However, UPGMA requires the entire dissimilarity matrix in memory. Due to this prohibitive requirement, UPGMA is not scalable to very large datasets. Application: We present a novel class of memory-constrained UPGMA (MC-UPGMA) algorithms. Given any practical memory size constraint, this framework guarantees the correct clustering solution without explicitly requiring all dissimilarities in memory. The algorithms are general and are applicable to any dataset. We present a data-dependent characterization of hardness and clustering efficiency. The presented concepts are applicable to any agglomerative clustering formulation. Results: We apply our algorithm to the entire collection of protein sequences, to automatically build a comprehensive evolutionary-driven hierarchy of proteins from sequence alone. The newly created tree captures protein families better than state-of-the-art large-scale methods such as CluSTr, ProtoNet4 or single-linkage clustering. We demonstrate that leveraging the entire mass embodied in all sequence similarities allows to significantly improve on current protein family clusterings which are unable to directly tackle the sheer mass of this data. Furthermore, we argue that non-metric constraints are an inherent complexity of the sequence space and should not be overlooked. The robustness of UPGMA allows significant improvement, especially for multidomain proteins, and for large or divergent families. Availability: A comprehensive tree built from all UniProt sequence similarities, together with navigation and classification tools will be made available as part of the ProtoNet service. A C++ implementation of the algorithm is available on request. Contact: lonshy@cs.huji.ac.il PMID:18586742
Analysis of candidates for interacting galaxy clusters. I. A1204 and A2029/A2033
NASA Astrophysics Data System (ADS)
Gonzalez, Elizabeth Johana; de los Rios, Martín; Oio, Gabriel A.; Lang, Daniel Hernández; Tagliaferro, Tania Aguirre; Domínguez R., Mariano J.; Castellón, José Luis Nilo; Cuevas L., Héctor; Valotto, Carlos A.
2018-04-01
Context. Merging galaxy clusters allow for the study of different mass components, dark and baryonic, separately. Also, their occurrence enables to test the ΛCDM scenario, which can be used to put constraints on the self-interacting cross-section of the dark-matter particle. Aim. It is necessary to perform a homogeneous analysis of these systems. Hence, based on a recently presented sample of candidates for interacting galaxy clusters, we present the analysis of two of these cataloged systems. Methods: In this work, the first of a series devoted to characterizing galaxy clusters in merger processes, we perform a weak lensing analysis of clusters A1204 and A2029/A2033 to derive the total masses of each identified interacting structure together with a dynamical study based on a two-body model. We also describe the gas and the mass distributions in the field through a lensing and an X-ray analysis. This is the first of a series of works which will analyze these type of system in order to characterize them. Results: Neither merging cluster candidate shows evidence of having had a recent merger event. Nevertheless, there is dynamical evidence that these systems could be interacting or could interact in the future. Conclusions: It is necessary to include more constraints in order to improve the methodology of classifying merging galaxy clusters. Characterization of these clusters is important in order to properly understand the nature of these systems and their connection with dynamical studies.
A Self-Organizing Spatial Clustering Approach to Support Large-Scale Network RTK Systems.
Shen, Lili; Guo, Jiming; Wang, Lei
2018-06-06
The network real-time kinematic (RTK) technique can provide centimeter-level real time positioning solutions and play a key role in geo-spatial infrastructure. With ever-increasing popularity, network RTK systems will face issues in the support of large numbers of concurrent users. In the past, high-precision positioning services were oriented towards professionals and only supported a few concurrent users. Currently, precise positioning provides a spatial foundation for artificial intelligence (AI), and countless smart devices (autonomous cars, unmanned aerial-vehicles (UAVs), robotic equipment, etc.) require precise positioning services. Therefore, the development of approaches to support large-scale network RTK systems is urgent. In this study, we proposed a self-organizing spatial clustering (SOSC) approach which automatically clusters online users to reduce the computational load on the network RTK system server side. The experimental results indicate that both the SOSC algorithm and the grid algorithm can reduce the computational load efficiently, while the SOSC algorithm gives a more elastic and adaptive clustering solution with different datasets. The SOSC algorithm determines the cluster number and the mean distance to cluster center (MDTCC) according to the data set, while the grid approaches are all predefined. The side-effects of clustering algorithms on the user side are analyzed with real global navigation satellite system (GNSS) data sets. The experimental results indicate that 10 km can be safely used as the cluster radius threshold for the SOSC algorithm without significantly reducing the positioning precision and reliability on the user side.
Lee, Tai-Fen; Du, Shin-Hei; Teng, Shih-Hua; Liao, Chun-Hsing; Sheng, Wang-Hui; Teng, Lee-Jene
2014-01-01
We evaluated whether the Bruker Biotyper matrix-associated laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) system provides accurate species-level identifications of 147 isolates of aerobically growing Gram-positive rods (GPRs). The bacterial isolates included Nocardia (n = 74), Listeria (n = 39), Kocuria (n = 15), Rhodococcus (n = 10), Gordonia (n = 7), and Tsukamurella (n = 2) species, which had all been identified by conventional methods, molecular methods, or both. In total, 89.7% of Listeria monocytogenes, 80% of Rhodococcus species, 26.7% of Kocuria species, and 14.9% of Nocardia species (n = 11, all N. nova and N. otitidiscaviarum) were correctly identified to the species level (score values, ≥2.0). A clustering analysis of spectra generated by the Bruker Biotyper identified six clusters of Nocardia species, i.e., cluster 1 (N. cyriacigeorgica), cluster 2 (N. brasiliensis), cluster 3 (N. farcinica), cluster 4 (N. puris), cluster 5 (N. asiatica), and cluster 6 (N. beijingensis), based on the six peaks generated by ClinProTools with the genetic algorithm, i.e., m/z 2,774.477 (cluster 1), m/z 5,389.792 (cluster 2), m/z 6,505.720 (cluster 3), m/z 5,428.795 (cluster 4), m/z 6,525.326 (cluster 5), and m/z 16,085.216 (cluster 6). Two clusters of L. monocytogenes spectra were also found according to the five peaks, i.e., m/z 5,594.85, m/z 6,184.39, and m/z 11,187.31, for cluster 1 (serotype 1/2a) and m/z 5,601.21 and m/z 11,199.33 for cluster 2 (serotypes 1/2b and 4b). The Bruker Biotyper system was unable to accurately identify Nocardia (except for N. nova and N. otitidiscaviarum), Tsukamurella, or Gordonia species. Continuous expansion of the MALDI-TOF MS databases to include more GPRs is necessary. PMID:24759706
Elastic K-means using posterior probability.
Zheng, Aihua; Jiang, Bo; Li, Yan; Zhang, Xuehan; Ding, Chris
2017-01-01
The widely used K-means clustering is a hard clustering algorithm. Here we propose a Elastic K-means clustering model (EKM) using posterior probability with soft capability where each data point can belong to multiple clusters fractionally and show the benefit of proposed Elastic K-means. Furthermore, in many applications, besides vector attributes information, pairwise relations (graph information) are also available. Thus we integrate EKM with Normalized Cut graph clustering into a single clustering formulation. Finally, we provide several useful matrix inequalities which are useful for matrix formulations of learning models. Based on these results, we prove the correctness and the convergence of EKM algorithms. Experimental results on six benchmark datasets demonstrate the effectiveness of proposed EKM and its integrated model.
X-ray aspects of the DAFT/FADA clusters
NASA Astrophysics Data System (ADS)
Guennou, L.; Durret, F.; Lima Neto, G. B.; Adami, C.
2012-12-01
We have undertaken the DAFT/FADA survey with the aim of applying constraints on dark energy based on weak lensing tomography as well as obtaining homogeneous and high quality data for a sample of 91 massive clusters in the redshift range [0.4,0.9] for which there are HST archive data. We have analysed the XMM-Newton data available for 42 of these clusters to derive their X-ray temperatures and luminosities and search for substructures. This study was coupled with a dynamical analysis for the 26 clusters having at least 30 spectroscopic galaxy redshifts in the cluster range. We present preliminary results on the coupled X-ray and dynamical analyses of these clusters.
VanGelder, L. E.; Kosswattaarachchi, A. M.; Forrestel, P. L.
2018-01-01
Non-aqueous redox flow batteries have emerged as promising systems for large-capacity, reversible energy storage, capable of meeting the variable demands of the electrical grid. Here, we investigate the potential for a series of Lindqvist polyoxovanadate-alkoxide (POV-alkoxide) clusters, [V6O7(OR)12] (R = CH3, C2H5), to serve as the electroactive species for a symmetric, non-aqueous redox flow battery. We demonstrate that the physical and electrochemical properties of these POV-alkoxides make them suitable for applications in redox flow batteries, as well as the ability for ligand modification at the bridging alkoxide moieties to yield significant improvements in cluster stability during charge–discharge cycling. Indeed, the metal–oxide core remains intact upon deep charge–discharge cycling, enabling extremely high coulombic efficiencies (∼97%) with minimal overpotential losses (∼0.3 V). Furthermore, the bulky POV-alkoxide demonstrates significant resistance to deleterious crossover, which will lead to improved lifetime and efficiency in a redox flow battery. PMID:29675217
A new open cluster binary system in the Milky Way
NASA Astrophysics Data System (ADS)
Piatti, A. E.; Clariá, J. J.; Ahumada, A. V.
2011-10-01
We have obtained CCD UBVI_{KC} photometry for the open clusters (OCs) Hogg 12 and NGC 3590. Based on photometric and morphological criteria, as well as on the stellar density in the region, our evidence is sufficient to consider them a new open cluster binary system candidate.
Clustering cancer gene expression data by projective clustering ensemble
Yu, Xianxue; Yu, Guoxian
2017-01-01
Gene expression data analysis has paramount implications for gene treatments, cancer diagnosis and other domains. Clustering is an important and promising tool to analyze gene expression data. Gene expression data is often characterized by a large amount of genes but with limited samples, thus various projective clustering techniques and ensemble techniques have been suggested to combat with these challenges. However, it is rather challenging to synergy these two kinds of techniques together to avoid the curse of dimensionality problem and to boost the performance of gene expression data clustering. In this paper, we employ a projective clustering ensemble (PCE) to integrate the advantages of projective clustering and ensemble clustering, and to avoid the dilemma of combining multiple projective clusterings. Our experimental results on publicly available cancer gene expression data show PCE can improve the quality of clustering gene expression data by at least 4.5% (on average) than other related techniques, including dimensionality reduction based single clustering and ensemble approaches. The empirical study demonstrates that, to further boost the performance of clustering cancer gene expression data, it is necessary and promising to synergy projective clustering with ensemble clustering. PCE can serve as an effective alternative technique for clustering gene expression data. PMID:28234920
Massive Binaries in the R 136 Cluster
NASA Astrophysics Data System (ADS)
Morrell, N. I.; Massey, P.; Degioia-Eastwood, K.; Penny, L. R.; Gies, D. R.; Tsitkin, Y.; Darnell, E.
2008-08-01
As part of a large project aimed to the discovery and follow up of massive eclipsing systems in young clusters and stellar associations, we have obtained V-band CCD imaging of the R136 cluster in 30 Doradus, and high resolution spectroscopy of several among the variable stars we found there. Here we summarize our preliminary analysis of light and radial velocity variations for 4 massive multiple systems in the R136 cluster.
ERIC Educational Resources Information Center
Spybrook, Jessaca; Hedges, Larry; Borenstein, Michael
2014-01-01
Research designs in which clusters are the unit of randomization are quite common in the social sciences. Given the multilevel nature of these studies, the power analyses for these studies are more complex than in a simple individually randomized trial. Tools are now available to help researchers conduct power analyses for cluster randomized…
Baun, Christian
2016-01-01
Clusters usually consist of servers, workstations or personal computers as nodes. But especially for academic purposes like student projects or scientific projects, the cost for purchase and operation can be a challenge. Single board computers cannot compete with the performance or energy-efficiency of higher-value systems, but they are an option to build inexpensive cluster systems. Because of the compact design and modest energy consumption, it is possible to build clusters of single board computers in a way that they are mobile and can be easily transported by the users. This paper describes the construction of such a cluster, useful applications and the performance of the single nodes. Furthermore, the clusters' performance and energy-efficiency is analyzed by executing the High Performance Linpack benchmark with a different number of nodes and different proportion of the systems total main memory utilized.
Clustering and correlates of screen-time and eating behaviours among young adolescents.
Pearson, Natalie; Griffiths, Paula; Biddle, Stuart Jh; Johnston, Julie P; McGeorge, Sonia; Haycraft, Emma
2017-05-31
Screen-time and eating behaviours are associated in adolescents, but few studies have examined the clustering of these health behaviours in this age group. The identification of clustered health behaviours, and influences on adolescents' clustered health behaviours, at the time when they are most likely to become habitual, is important for intervention design. The purpose of this study was to assess the prevalence and clustering of health behaviours in adolescents, and examine the sociodemographic, individual, behavioural, and home social and physical environmental correlates of clustered health behaviours. Adolescents aged 11-12 years (n = 527, 48% boys) completed a questionnaire during class-time which assessed screen-time (ST), fruit and vegetable (FV), and energy-dense (ED) snack consumption using a Food Frequency Questionnaire. Health behaviours were categorised into high and low frequencies based on recommendations for FV and ST and median splits for ED snacks. Adolescents reported on their habits, self-efficacy, eating at the television (TV), eating and watching TV together with parents, restrictive parenting practices, and the availability and accessibility of foods within the home. Behavioural clustering was assessed using an observed over expected ratio (O/E). Correlates of clustered behaviours were examined using multivariate multinomial logistic regression. Approximately 70% reported having two or three health risk behaviours. Overall, O/E ratios were close to 1, which indicates clustering. The three risk behaviour combination of low FV, high ED, and high ST occurred more frequently than expected (O/E ratio = 1.06 95% CI 1.01, 1.15. Individual, behavioural, and social and physical home environmental correlates were differentially associated with behavioural clusters. Correlates consistently associated with clusters included eating ED snacks while watching TV, eating at the TV with parents, and the availability and accessibility of ED snack foods within the home. There is a high prevalence of screen time and unhealthy eating, and screen time is coupled with unhealthy dietary behaviours. Strategies and policies are required that simultaneously address reductions in screen time and changes to habitual dietary patterns, such as TV snacking and snack availability and accessibility. These may require a combination of individual, social and environmental changes alongside conscious and more automatic (nudging) strategies.
Logistics Enterprise Evaluation Model Based On Fuzzy Clustering Analysis
NASA Astrophysics Data System (ADS)
Fu, Pei-hua; Yin, Hong-bo
In this thesis, we introduced an evaluation model based on fuzzy cluster algorithm of logistics enterprises. First of all,we present the evaluation index system which contains basic information, management level, technical strength, transport capacity,informatization level, market competition and customer service. We decided the index weight according to the grades, and evaluated integrate ability of the logistics enterprises using fuzzy cluster analysis method. In this thesis, we introduced the system evaluation module and cluster analysis module in detail and described how we achieved these two modules. At last, we gave the result of the system.
2013-01-01
Background Various diet- and activity-related parenting practices are positive determinants of child dietary and activity behaviour, including home availability, parental modelling and parental policies. There is evidence that parenting practices cluster within the dietary domain and within the activity domain. This study explores whether diet- and activity-related parenting practices cluster across the dietary and activity domain. Also examined is whether the clusters are related to child and parental background characteristics. Finally, to indicate the relevance of the clusters in influencing child dietary and activity behaviour, we examined whether clusters of parenting practices are related to these behaviours. Methods Data were used from 1480 parent–child dyads participating in the Dutch IVO Nutrition and Physical Activity Child cohorT (INPACT). Parents of children aged 8–11 years completed questionnaires at home assessing their diet- and activity-related parenting practices, child and parental background characteristics, and child dietary and activity behaviours. Principal component analysis (PCA) was used to identify clusters of parenting practices. Backward regression analysis was used to examine the relationship between child and parental background characteristics with cluster scores, and partial correlations to examine associations between cluster scores and child dietary and activity behaviours. Results PCA revealed five clusters of parenting practices: 1) high visibility and accessibility of screens and unhealthy food, 2) diet- and activity-related rules, 3) low availability of unhealthy food, 4) diet- and activity-related positive modelling, and 5) positive modelling on sports and fruit. Low parental education was associated with unhealthy cluster 1, while high(er) education was associated with healthy clusters 2, 3 and 5. Separate clusters were related to both child dietary and activity behaviour in the hypothesized directions: healthy clusters were positively related to obesity-reducing behaviours and negatively to obesity-inducing behaviours. Conclusion Parenting practices cluster across the dietary and activity domain. Parental education can be seen as an indicator of a broader parental context in which clusters of parenting practices operate. Separate clusters are related to both child dietary and activity behaviour. Interventions that focus on clusters of parenting practices to assist parents (especially low-educated parents) in changing their child’s dietary and activity behaviour seems justified. PMID:23531232
Gene diversity in some Muslim populations of North India.
Aarzoo, S Shabana; Afzal, Mohammad
2005-06-01
North Indian Muslim populations have historical, linguistic, and socioreligious significance to the Indian subcontinent. Although sociocultural and political dimensions of their demography are well documented, no detailed genetic structure of the populations is available. We have undertaken a survey of the gene frequencies of the ABO, Rh, PTC taste ability, sickling, and G6PD systems for different endogamous groups: Sheikh, Syed, Pathan, Ansari, Saifi, and Hindu Bania. All the groups at most loci showed statistically nonsignificant differences, except for ABO and PTC traits, for which interpopulational differences were seen. Heterozygosity ranged from 0.048 to 0.617 among the Sheikh, 0.149 to 0.599 among the Pathan, 0.105 to 0.585 among the Ansari, 0.25 to 0.869 among the Syed, 0.107 to 0.565 among the Saifi, and 0.100 to 0.492 among the Hindu Bania. The average D(ST) and G(ST) values for the five marker loci were 0.0625 +/- 0.098 and 0.1072 +/- 0.041, respectively. A dendrogram was constructed using the UPGMA clustering method. Our results revealed that the Pathan and the Sheikh form one cluster, the Syed and the Hindu Bania form another cluster, and the two clusters join together (the so-called higher caste); also, the Saifi and the Ansari form a separate cluster (lower caste). The results of the genetic distance analysis are useful for understanding the pattern of genetic relationships between different endogamous groups of Muslims.
Molecular and epidemiological study of enterovirus D68 in Taiwan.
Huang, Yuan-Pin; Lin, Tsuey-Li; Lin, Ting-Han; Wu, Ho-Sheng
2017-08-01
As an immunofluorescence assay for enterovirus D68 (EV-D68) is not available in the enteroviruses surveillance network in Taiwan, EV-D68 may be the actual pathogen of untypeable enterovirus-suspected isolates. The untypeable isolates collected from 2007 through 2014 were identified by nucleic acid amplification-based methods and sequencing of the VP1 region to analyze the phylogeny and epidemiology of EV-D68 in Taiwan. Twenty-nine EV-D68 isolates were sequenced, including 15 Cluster 3 and 14 Cluster 1 viruses. Approximately 41% of the patients were children under 5 years of age and their infections peaked in August. The ratio of male to female patients was 1.5 and 3.67 for Cluster 3 and Cluster 1, respectively. Fever and respiratory symptoms were commonly reported in EV-D68-infected patients. The results of phylogenetic analyses showed that EV-D68 isolates between 2007 and 2014 belonged to different clusters and existed for years, indicating that endemic circulation of EV-D68 existed in Taiwan. This study showed that EV-D68 has been endemic in Taiwan for some years despite a small number of positive cases. The continuous monitoring and efforts towards the improvement of diagnostic techniques are required to complete the surveillance system. This study provided the genetic and epidemiological information which could contribute to understanding the etiology and epidemiology of EV-D68. Copyright © 2015. Published by Elsevier B.V.
State estimation and prediction using clustered particle filters.
Lee, Yoonsang; Majda, Andrew J
2016-12-20
Particle filtering is an essential tool to improve uncertain model predictions by incorporating noisy observational data from complex systems including non-Gaussian features. A class of particle filters, clustered particle filters, is introduced for high-dimensional nonlinear systems, which uses relatively few particles compared with the standard particle filter. The clustered particle filter captures non-Gaussian features of the true signal, which are typical in complex nonlinear dynamical systems such as geophysical systems. The method is also robust in the difficult regime of high-quality sparse and infrequent observations. The key features of the clustered particle filtering are coarse-grained localization through the clustering of the state variables and particle adjustment to stabilize the method; each observation affects only neighbor state variables through clustering and particles are adjusted to prevent particle collapse due to high-quality observations. The clustered particle filter is tested for the 40-dimensional Lorenz 96 model with several dynamical regimes including strongly non-Gaussian statistics. The clustered particle filter shows robust skill in both achieving accurate filter results and capturing non-Gaussian statistics of the true signal. It is further extended to multiscale data assimilation, which provides the large-scale estimation by combining a cheap reduced-order forecast model and mixed observations of the large- and small-scale variables. This approach enables the use of a larger number of particles due to the computational savings in the forecast model. The multiscale clustered particle filter is tested for one-dimensional dispersive wave turbulence using a forecast model with model errors.
State estimation and prediction using clustered particle filters
Lee, Yoonsang; Majda, Andrew J.
2016-01-01
Particle filtering is an essential tool to improve uncertain model predictions by incorporating noisy observational data from complex systems including non-Gaussian features. A class of particle filters, clustered particle filters, is introduced for high-dimensional nonlinear systems, which uses relatively few particles compared with the standard particle filter. The clustered particle filter captures non-Gaussian features of the true signal, which are typical in complex nonlinear dynamical systems such as geophysical systems. The method is also robust in the difficult regime of high-quality sparse and infrequent observations. The key features of the clustered particle filtering are coarse-grained localization through the clustering of the state variables and particle adjustment to stabilize the method; each observation affects only neighbor state variables through clustering and particles are adjusted to prevent particle collapse due to high-quality observations. The clustered particle filter is tested for the 40-dimensional Lorenz 96 model with several dynamical regimes including strongly non-Gaussian statistics. The clustered particle filter shows robust skill in both achieving accurate filter results and capturing non-Gaussian statistics of the true signal. It is further extended to multiscale data assimilation, which provides the large-scale estimation by combining a cheap reduced-order forecast model and mixed observations of the large- and small-scale variables. This approach enables the use of a larger number of particles due to the computational savings in the forecast model. The multiscale clustered particle filter is tested for one-dimensional dispersive wave turbulence using a forecast model with model errors. PMID:27930332
RR Lyrae in Sagittarius Dwarf Globular Clusters (Poster abstract)
NASA Astrophysics Data System (ADS)
Pritzl, B. J.; Gehrman, T. J.; Bell, E.; Salinas, R.; Smith, H. A.; Catelan, M.
2016-12-01
(Abstract only) The Milky Way Galaxy was built up in part by the cannibalization of smaller dwarf galaxies. Some of them likely contained globular clusters. The Sagittarius dwarf galaxy provides a unique opportunity to study a system of globular clusters that originated outside the Milky Way. We have investigated the RR Lyrae populations in two Sagittarius globular clusters, Arp 2 and Terzan 8. The RR Lyrae are used to study the properties of the clusters and to compare this system to Milky Way globular clusters. We will discuss whether or not dwarf galaxies similar to the Sagittarius dwarf galaxy could have played a role in the formation of the Milky Way Galaxy.
Observation of dynamic equilibrium cluster phase in nanoparticle-polymer system
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kumar, Sugam, E-mail: sugam@barc.gov.in; Mehan, S.; Aswal, V. K.
2016-05-23
Small-angle neutron scattering (SANS) and dynamic light scattering (DLS) have been used to investigate the existence of a cluster phase in a nanoparticle-polymer system. The nanoparticle-polymer system shows an interesting reentrant phase behavior where the charge stabilized silica nanoparticles undergo particle clustering and back to individual nanoparticles as a function of polymer concentration. This kind of phase behavior is believed to be directed by opposing attractive and repulsive interactions present in the system. The phase behavior shows two narrow regions of polymer concentration immediately before and after the two-phase formation indicating the possibility of the existence of some equilibrium clusters.more » DLS results show a much higher size of particles than individuals in these two regions which remains unchanged even after dilution. The SANS data show the evolution of attraction with increased volume fraction of the particles supporting the dynamic nature of these clusters.« less
Computer aided detection of clusters of microcalcifications on full field digital mammograms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ge Jun; Sahiner, Berkman; Hadjiiski, Lubomir M.
2006-08-15
We are developing a computer-aided detection (CAD) system to identify microcalcification clusters (MCCs) automatically on full field digital mammograms (FFDMs). The CAD system includes six stages: preprocessing; image enhancement; segmentation of microcalcification candidates; false positive (FP) reduction for individual microcalcifications; regional clustering; and FP reduction for clustered microcalcifications. At the stage of FP reduction for individual microcalcifications, a truncated sum-of-squares error function was used to improve the efficiency and robustness of the training of an artificial neural network in our CAD system for FFDMs. At the stage of FP reduction for clustered microcalcifications, morphological features and features derived from themore » artificial neural network outputs were extracted from each cluster. Stepwise linear discriminant analysis (LDA) was used to select the features. An LDA classifier was then used to differentiate clustered microcalcifications from FPs. A data set of 96 cases with 192 images was collected at the University of Michigan. This data set contained 96 MCCs, of which 28 clusters were proven by biopsy to be malignant and 68 were proven to be benign. The data set was separated into two independent data sets for training and testing of the CAD system in a cross-validation scheme. When one data set was used to train and validate the convolution neural network (CNN) in our CAD system, the other data set was used to evaluate the detection performance. With the use of a truncated error metric, the training of CNN could be accelerated and the classification performance was improved. The CNN in combination with an LDA classifier could substantially reduce FPs with a small tradeoff in sensitivity. By using the free-response receiver operating characteristic methodology, it was found that our CAD system can achieve a cluster-based sensitivity of 70, 80, and 90 % at 0.21, 0.61, and 1.49 FPs/image, respectively. For case-based performance evaluation, a sensitivity of 70, 80, and 90 % can be achieved at 0.07, 0.17, and 0.65 FPs/image, respectively. We also used a data set of 216 mammograms negative for clustered microcalcifications to further estimate the FP rate of our CAD system. The corresponding FP rates were 0.15, 0.31, and 0.86 FPs/image for cluster-based detection when negative mammograms were used for estimation of FP rates.« less
OGLE Collection of Star Clusters. New Objects in the Outskirts of the Large Magellanic Cloud
NASA Astrophysics Data System (ADS)
Sitek, M.; Szymański, M. K.; Skowron, D. M.; Udalski, A.; Kostrzewa-Rutkowska, Z.; Skowron, J.; Karczmarek, P.; Cieślar, M.; Wyrzykowski, Ł.; Kozłowski, S.; Pietrukowicz, P.; Soszyński, I.; Mróz, P.; Pawlak, M.; Poleski, R.; Ulaczyk, K.
2016-09-01
The Magellanic System (MS), consisting of the Large Magellanic Cloud (LMC), the Small Magellanic Cloud (SMC) and the Magellanic Bridge (MBR), contains diverse sample of star clusters. Their spatial distribution, ages and chemical abundances may provide important information about the history of formation of the whole System. We use deep photometric maps derived from the images collected during the fourth phase of the Optical Gravitational Lensing Experiment (OGLE-IV) to construct the most complete catalog of star clusters in the Large Magellanic Cloud using the homogeneous photometric data. In this paper we present the collection of star clusters found in the area of about 225 square degrees in the outer regions of the LMC. Our sample contains 679 visually identified star cluster candidates, 226 of which were not listed in any of the previously published catalogs. The new clusters are mainly young small open clusters or clusters similar to associations.
VizieR Online Data Catalog: Globular and open clusters observed by SDSS/SEGUE (Morrison+, 2016)
NASA Astrophysics Data System (ADS)
Morrison, H. L.; Ma, Z.; Clem, J. L.; An, D.; Connor, T.; Schechtman-Rook, A.; Casagrande, L.; Rockosi, C.; Yanny, B.; Harding, P.; Beers, T. C.; Johnson, J. A.; Schneider, D. P.
2018-03-01
The SEGUE project observed a number of globular and open clusters for calibration purposes. For calibration of the red giants, we selected the globular clusters M92, M13 and M71 (spanning metallicities from -2.4 to -0.8) and the open clusters Be 29, NGC 7789 and NGC 6791, whose [Fe/H] values range from -0.4 to +0.4. In all but one case, the clusters are within the SDSS footprint and so ugriz photometry is available for the cluster stars. The SDSS cluster images were analyzed using DAOPHOT (Stetson 1987PASP...99..191S) by An et al. (2008ApJS..179..326A) because the SDSS photometric pipeline was not designed to handle crowded fields. (8 data files).
Cunningham, Evan; Jacka, Brendan; DeBeck, Kora; Applegate, Tanya A; Harrigan, P. Richard; Krajden, Mel; Marshall, Brandon DL; Montaner, Julio; Lima, Viviane Dias; Olmstead, Andrea; Milloy, M-J; Wood, Evan; Grebely, Jason
2015-01-01
Background Among prospective cohorts of people who inject drugs (PWID), phylogenetic clustering of HCV infection has been observed. However, the majority of studies have included older PWID, representing distant transmission events. The aim of this study was to investigate phylogenetic clustering of HCV infection among a cohort of street-involved youth. Methods Data were derived from a prospective cohort of street-involved youth aged 14–26 recruited between 2005 and 2012 in Vancouver, Canada (At Risk Youth Study, ARYS). HCV RNA testing and sequencing (Core-E2) were performed on HCV positive participants. Phylogenetic trees were inferred using maximum likelihood methods and clusters were identified using ClusterPicker (Core-E2 without HVR1, 90% bootstrap threshold, 0.05 genetic distance threshold). Results Among 945 individuals enrolled in ARYS, 16% (n=149, 100% recent injectors) were HCV antibody positive at baseline interview (n=86) or seroconverted during follow-up (n=63). Among HCV antibody positive participants with available samples (n=131), 75% (n=98) had detectable HCV RNA and 66% (n=65, mean age 23, 58% with recent methamphetamine injection, 31% female, 3% HIV+) had available Core-E2 sequences. Of those with Core-E2 sequence, 14% (n=9) were in a cluster (one cluster of three) or pair (two pairs), with all reporting recent methamphetamine injection. Recent methamphetamine injection was associated with membership in a cluster or pair (P=0.009). Conclusion In this study of street-involved youth with HCV infection and recent injecting, 14% demonstrated phylogenetic clustering. Phylogenetic clustering was associated with recent methamphetamine injection, suggesting that methamphetamine drug injection may play an important role in networks of HCV transmission. PMID:25977204
Recent advances in the Suf Fe-S cluster biogenesis pathway: Beyond the Proteobacteria
Outten, F. Wayne
2014-01-01
Fe-S clusters play critical roles in cellular function throughout all three kingdoms of life. Consequently, Fe-S cluster biogenesis systems are present in most organisms. The Suf (sulfur formation) system is the most ancient of the three characterized Fe-S cluster biogenesis pathways, which also include the Isc and Nif systems. Much of the first work on the Suf system took place in Gram-negative Proteobacteria used as model organisms. These early studies led to a wealth of biochemical, genetic, and physiological information on Suf function. From those studies we have learned that SufB functions as an Fe-S scaffold in conjunction with SufC (and in some cases SufD). SufS and SufE together mobilize sulfur for cluster assembly and SufA traffics the complete Fe-S cluster from SufB to target apo-proteins. However, recent progress on the Suf system in other organisms has opened up new avenues of research and new hypotheses about Suf function. This review focuses primarily on the most recent discoveries about the Suf pathway and where those new models may lead the field. PMID:25447545
Semi-supervised clustering methods
Bair, Eric
2013-01-01
Cluster analysis methods seek to partition a data set into homogeneous subgroups. It is useful in a wide variety of applications, including document processing and modern genetics. Conventional clustering methods are unsupervised, meaning that there is no outcome variable nor is anything known about the relationship between the observations in the data set. In many situations, however, information about the clusters is available in addition to the values of the features. For example, the cluster labels of some observations may be known, or certain observations may be known to belong to the same cluster. In other cases, one may wish to identify clusters that are associated with a particular outcome variable. This review describes several clustering algorithms (known as “semi-supervised clustering” methods) that can be applied in these situations. The majority of these methods are modifications of the popular k-means clustering method, and several of them will be described in detail. A brief description of some other semi-supervised clustering algorithms is also provided. PMID:24729830
X-ray and optical substructures of the DAFT/FADA survey clusters
NASA Astrophysics Data System (ADS)
Guennou, L.; Durret, F.; Adami, C.; Lima Neto, G. B.
2013-04-01
We have undertaken the DAFT/FADA survey with the double aim of setting constraints on dark energy based on weak lensing tomography and of obtaining homogeneous and high quality data for a sample of 91 massive clusters in the redshift range 0.4-0.9 for which there were HST archive data. We have analysed the XMM-Newton data available for 42 of these clusters to derive their X-ray temperatures and luminosities and search for substructures. Out of these, a spatial analysis was possible for 30 clusters, but only 23 had deep enough X-ray data for a really robust analysis. This study was coupled with a dynamical analysis for the 26 clusters having at least 30 spectroscopic galaxy redshifts in the cluster range. Altogether, the X-ray sample of 23 clusters and the optical sample of 26 clusters have 14 clusters in common. We present preliminary results on the coupled X-ray and dynamical analyses of these 14 clusters.
Cluster-lensing: A Python Package for Galaxy Clusters and Miscentering
NASA Astrophysics Data System (ADS)
Ford, Jes; VanderPlas, Jake
2016-12-01
We describe a new open source package for calculating properties of galaxy clusters, including Navarro, Frenk, and White halo profiles with and without the effects of cluster miscentering. This pure-Python package, cluster-lensing, provides well-documented and easy-to-use classes and functions for calculating cluster scaling relations, including mass-richness and mass-concentration relations from the literature, as well as the surface mass density {{Σ }}(R) and differential surface mass density {{Δ }}{{Σ }}(R) profiles, probed by weak lensing magnification and shear. Galaxy cluster miscentering is especially a concern for stacked weak lensing shear studies of galaxy clusters, where offsets between the assumed and the true underlying matter distribution can lead to a significant bias in the mass estimates if not accounted for. This software has been developed and released in a public GitHub repository, and is licensed under the permissive MIT license. The cluster-lensing package is archived on Zenodo. Full documentation, source code, and installation instructions are available at http://jesford.github.io/cluster-lensing/.
m-BIRCH: an online clustering approach for computer vision applications
NASA Astrophysics Data System (ADS)
Madan, Siddharth K.; Dana, Kristin J.
2015-03-01
We adapt a classic online clustering algorithm called Balanced Iterative Reducing and Clustering using Hierarchies (BIRCH), to incrementally cluster large datasets of features commonly used in multimedia and computer vision. We call the adapted version modified-BIRCH (m-BIRCH). The algorithm uses only a fraction of the dataset memory to perform clustering, and updates the clustering decisions when new data comes in. Modifications made in m-BIRCH enable data driven parameter selection and effectively handle varying density regions in the feature space. Data driven parameter selection automatically controls the level of coarseness of the data summarization. Effective handling of varying density regions is necessary to well represent the different density regions in data summarization. We use m-BIRCH to cluster 840K color SIFT descriptors, and 60K outlier corrupted grayscale patches. We use the algorithm to cluster datasets consisting of challenging non-convex clustering patterns. Our implementation of the algorithm provides an useful clustering tool and is made publicly available.
Clean Water State Revolving Fund (CWSRF): Decentralized Wastewater Treatment
Decentralized wastewater treatment is an onsite or clustered system used to collect, treat, and disperse or reclaim wastewater from a small community or service area (e.g., septic systems, cluster systems, lagoons).
Cosmology from galaxy clusters as observed by Planck
NASA Astrophysics Data System (ADS)
Pierpaoli, Elena
We propose to use current all-sky data on galaxy clusters in the radio/infrared bands in order to constrain cosmology. This will be achieved performing parameter estimation with number counts and power spectra for galaxy clusters detected by Planck through their Sunyaev—Zeldovich signature. The ultimate goal of this proposal is to use clusters as tracers of matter density in order to provide information about fundamental properties of our Universe, such as the law of gravity on large scale, early Universe phenomena, structure formation and the nature of dark matter and dark energy. We will leverage on the availability of a larger and deeper cluster catalog from the latest Planck data release in order to include, for the first time, the cluster power spectrum in the cosmological parameter determination analysis. Furthermore, we will extend clusters' analysis to cosmological models not yet investigated by the Planck collaboration. These aims require a diverse set of activities, ranging from the characterization of the clusters' selection function, the choice of the cosmological cluster sample to be used for parameter estimation, the construction of mock samples in the various cosmological models with correct correlation properties in order to produce reliable selection functions and noise covariance matrices, and finally the construction of the appropriate likelihood for number counts and power spectra. We plan to make the final code available to the community and compatible with the most widely used cosmological parameter estimation code. This research makes use of data from the NASA satellites Planck and, less directly, Chandra, in order to constrain cosmology; and therefore perfectly fits the NASA objectives and the specifications of this solicitation.
Exploring the Dynamics of Exoplanetary Systems in a Young Stellar Cluster
NASA Astrophysics Data System (ADS)
Thornton, Jonathan Daniel; Glaser, Joseph Paul; Wall, Joshua Edward
2018-01-01
I describe a dynamical simulation of planetary systems in a young star cluster. One rather arbitrary aspect of cluster simulations is the choice of initial conditions. These are typically chosen from some standard model, such as Plummer or King, or from a “fractal” distribution to try to model young clumpy systems. Here I adopt the approach of realizing an initial cluster model directly from a detailed magnetohydrodynamical model of cluster formation from a 1000-solar-mass interstellar gas cloud, with magnetic fields and radiative and wind feedback from massive stars included self-consistently. The N-body simulation of the stars and planets starts once star formation is largely over and feedback has cleared much of the gas from the region where the newborn stars reside. It continues until the cluster dissolves in the galactic field. Of particular interest is what would happen to the free-floating planets created in the gas cloud simulation. Are they captured by a star or are they ejected from the cluster? This method of building a dynamical cluster simulation directly from the results of a cluster formation model allows us to better understand the evolution of young star clusters and enriches our understanding of extrasolar planet development in them. These simulations were performed within the AMUSE simulation framework, and combine N-body, multiples and background potential code.
Liu, Zhe; Geng, Yong; Zhang, Pan; Dong, Huijuan; Liu, Zuoxi
2014-09-01
In China, local governments of many areas prefer to give priority to the development of heavy industrial clusters in pursuit of high value of gross domestic production (GDP) growth to get political achievements, which usually results in higher costs from ecological degradation and environmental pollution. Therefore, effective methods and reasonable evaluation system are urgently needed to evaluate the overall efficiency of industrial clusters. Emergy methods links economic and ecological systems together, which can evaluate the contribution of ecological products and services as well as the load placed on environmental systems. This method has been successfully applied in many case studies of ecosystem but seldom in industrial clusters. This study applied the methodology of emergy analysis to perform the efficiency of industrial clusters through a series of emergy-based indices as well as the proposed indicators. A case study of Shenyang Economic Technological Development Area (SETDA) was investigated to show the emergy method's practical potential to evaluate industrial clusters to inform environmental policy making. The results of our study showed that the industrial cluster of electric equipment and electronic manufacturing produced the most economic value and had the highest efficiency of energy utilization among the four industrial clusters. However, the sustainability index of the industrial cluster of food and beverage processing was better than the other industrial clusters.
Pego-Reigosa, José María; Lois-Iglesias, Ana; Rúa-Figueroa, Íñigo; Galindo, María; Calvo-Alén, Jaime; de Uña-Álvarez, Jacobo; Balboa-Barreiro, Vanessa; Ibáñez Ruan, Jesús; Olivé, Alejandro; Rodríguez-Gómez, Manuel; Fernández Nebro, Antonio; Andrés, Mariano; Erausquin, Celia; Tomero, Eva; Horcada Rubio, Loreto; Uriarte Isacelaya, Esther; Freire, Mercedes; Montilla, Carlos; Sánchez-Atrio, Ana I; Santos-Soler, Gregorio; Zea, Antonio; Díez, Elvira; Narváez, Javier; Blanco-Alonso, Ricardo; Silva-Fernández, Lucía; Ruiz-Lucea, María Esther; Fernández-Castro, Mónica; Hernández-Beriain, José Ángel; Gantes-Mora, Marian; Hernández-Cruz, Blanca; Pérez-Venegas, José; Pecondón-Español, Ángela; Marras Fernández-Cid, Carlos; Ibáñez-Barcelo, Mónica; Bonilla, Gema; Torrente-Segarra, Vicenç; Castellví, Iván; Alegre, Juan José; Calvet, Joan; Marenco de la Fuente, José Luis; Raya, Enrique; Vázquez-Rodríguez, Tomás Ramón; Quevedo-Vila, Víctor; Muñoz-Fernández, Santiago; Otón, Teresa; Rahman, Anisur; López-Longo, Francisco Javier
2016-07-01
To identify patterns (clusters) of damage manifestations within a large cohort of SLE patients and evaluate the potential association of these clusters with a higher risk of mortality. This is a multicentre, descriptive, cross-sectional study of a cohort of 3656 SLE patients from the Spanish Society of Rheumatology Lupus Registry. Organ damage was ascertained using the Systemic Lupus International Collaborating Clinics Damage Index. Using cluster analysis, groups of patients with similar patterns of damage manifestations were identified. Then, overall clusters were compared as well as the subgroup of patients within every cluster with disease duration shorter than 5 years. Three damage clusters were identified. Cluster 1 (80.6% of patients) presented a lower amount of individuals with damage (23.2 vs 100% in clusters 2 and 3, P < 0.001). Cluster 2 (11.4% of patients) was characterized by musculoskeletal damage in all patients. Cluster 3 (8.0% of patients) was the only group with cardiovascular damage, and this was present in all patients. The overall mortality rate of patients in clusters 2 and 3 was higher than that in cluster 1 (P < 0.001 for both comparisons) and in patients with disease duration shorter than 5 years as well. In a large cohort of SLE patients, cardiovascular and musculoskeletal damage manifestations were the two dominant forms of damage to sort patients into clinically meaningful clusters. Both in early and late stages of the disease, there was a significant association of these clusters with an increased risk of mortality. Physicians should pay special attention to the early prevention of damage in these two systems. © The Author 2016. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Huang, Chen; Muñoz-García, Ana Belén; Pavone, Michele
2016-12-28
Density-functional embedding theory provides a general way to perform multi-physics quantum mechanics simulations of large-scale materials by dividing the total system's electron density into a cluster's density and its environment's density. It is then possible to compute the accurate local electronic structures and energetics of the embedded cluster with high-level methods, meanwhile retaining a low-level description of the environment. The prerequisite step in the density-functional embedding theory is the cluster definition. In covalent systems, cutting across the covalent bonds that connect the cluster and its environment leads to dangling bonds (unpaired electrons). These represent a major obstacle for the application of density-functional embedding theory to study extended covalent systems. In this work, we developed a simple scheme to define the cluster in covalent systems. Instead of cutting covalent bonds, we directly split the boundary atoms for maintaining the valency of the cluster. With this new covalent embedding scheme, we compute the dehydrogenation energies of several different molecules, as well as the binding energy of a cobalt atom on graphene. Well localized cluster densities are observed, which can facilitate the use of localized basis sets in high-level calculations. The results are found to converge faster with the embedding method than the other multi-physics approach ONIOM. This work paves the way to perform the density-functional embedding simulations of heterogeneous systems in which different types of chemical bonds are present.
NAS Requirements Checklist for Job Queuing/Scheduling Software
NASA Technical Reports Server (NTRS)
Jones, James Patton
1996-01-01
The increasing reliability of parallel systems and clusters of computers has resulted in these systems becoming more attractive for true production workloads. Today, the primary obstacle to production use of clusters of computers is the lack of a functional and robust Job Management System for parallel applications. This document provides a checklist of NAS requirements for job queuing and scheduling in order to make most efficient use of parallel systems and clusters for parallel applications. Future requirements are also identified to assist software vendors with design planning.
First assembly times and equilibration in stochastic coagulation-fragmentation
DOE Office of Scientific and Technical Information (OSTI.GOV)
D’Orsogna, Maria R.; Department of Mathematics, CSUN, Los Angeles, California 91330-8313; Lei, Qi
2015-07-07
We develop a fully stochastic theory for coagulation and fragmentation (CF) in a finite system with a maximum cluster size constraint. The process is modeled using a high-dimensional master equation for the probabilities of cluster configurations. For certain realizations of total mass and maximum cluster sizes, we find exact analytical results for the expected equilibrium cluster distributions. If coagulation is fast relative to fragmentation and if the total system mass is indivisible by the mass of the largest allowed cluster, we find a mean cluster-size distribution that is strikingly broader than that predicted by the corresponding mass-action equations. Combinations ofmore » total mass and maximum cluster size under which equilibration is accelerated, eluding late-stage coarsening, are also delineated. Finally, we compute the mean time it takes particles to first assemble into a maximum-sized cluster. Through careful state-space enumeration, the scaling of mean assembly times is derived for all combinations of total mass and maximum cluster size. We find that CF accelerates assembly relative to monomer kinetic only in special cases. All of our results hold in the infinite system limit and can be only derived from a high-dimensional discrete stochastic model, highlighting how classical mass-action models of self-assembly can fail.« less
Text Mining in Biomedical Domain with Emphasis on Document Clustering.
Renganathan, Vinaitheerthan
2017-07-01
With the exponential increase in the number of articles published every year in the biomedical domain, there is a need to build automated systems to extract unknown information from the articles published. Text mining techniques enable the extraction of unknown knowledge from unstructured documents. This paper reviews text mining processes in detail and the software tools available to carry out text mining. It also reviews the roles and applications of text mining in the biomedical domain. Text mining processes, such as search and retrieval of documents, pre-processing of documents, natural language processing, methods for text clustering, and methods for text classification are described in detail. Text mining techniques can facilitate the mining of vast amounts of knowledge on a given topic from published biomedical research articles and draw meaningful conclusions that are not possible otherwise.
Williams, Christopher J; Thomas, Rhys H; Pickersgill, Trevor P; Lyons, Marion; Lowe, Gwen; Stiff, Rhianwen E; Moore, Catherine; Jones, Rachel; Howe, Robin; Brunt, Huw; Ashman, Anna; Mason, Brendan W
2016-01-01
We report a cluster of atypical Guillain-Barré syndrome in 10 adults temporally related to a cluster of four children with acute flaccid paralysis, over a 3-month period in South Wales, United Kingdom. All adult cases were male, aged between 24 and 77 years. Seven had prominent facial diplegia at onset. Available electrophysiological studies showed axonal involvement in five adults. Seven reported various forms of respiratory disease before onset of neurological symptoms. The ages of children ranged from one to 13 years, three of the four were two years old or younger. Enterovirus testing is available for three children; two had evidence of enterovirus D68 infection in stool or respiratory samples. We describe the clinical features, epidemiology and state of current investigations for these unusual clusters of illness.
Abramyan, Tigran M; Snyder, James A; Thyparambil, Aby A; Stuart, Steven J; Latour, Robert A
2016-08-05
Clustering methods have been widely used to group together similar conformational states from molecular simulations of biomolecules in solution. For applications such as the interaction of a protein with a surface, the orientation of the protein relative to the surface is also an important clustering parameter because of its potential effect on adsorbed-state bioactivity. This study presents cluster analysis methods that are specifically designed for systems where both molecular orientation and conformation are important, and the methods are demonstrated using test cases of adsorbed proteins for validation. Additionally, because cluster analysis can be a very subjective process, an objective procedure for identifying both the optimal number of clusters and the best clustering algorithm to be applied to analyze a given dataset is presented. The method is demonstrated for several agglomerative hierarchical clustering algorithms used in conjunction with three cluster validation techniques. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
The Nature of LSB galaxies revealed by their Globular Clusters
NASA Astrophysics Data System (ADS)
Kissler-Patig, Markus
2005-07-01
Low Surface Brightness {LSB} galaxies encompass many of the extremes in galaxy properties. Their understanding is essential to complete our picture of galaxy formation and evolution. Due to their historical under-representation on galaxy surveys, their importance to many areas of astronomy has only recently began to be realized. Globular clusters are superb tracers of the formation histories of galaxies and have been extensively used as such in high surface brightness galaxies. We propose to investigate the nature of massive LSB galaxies by studying their globular cluster systems. No globular cluster study has been reported for LSB galaxies to date. Yet, both the presence or absence of globular clusters set very strong constraints on the conditions prevailing during LSB galaxy formation and evolution. Both in dwarf and giant high surface brightness {HSB} galaxies, globular clusters are known to form as a constant fraction of baryonic mass. Their presence/absence immediately indicates similarities or discrepancies in the formation and evolution conditions of LSB and HSB galaxies. In particular, the presence/absence of metal-poor halo globular clusters infers similarities/differences in the halo formation and assembly processes of LSB vs. HSB galaxies, while the presence/absence of metal-rich globular clusters can be used to derive the occurrence and frequency of violent events {such as mergers} in the LSB galaxy assembly history. Two band imaging with ACS will allow us to identify the globular clusters {just resolved at the selected distance} and to determine their metallicity {potentially their rough age}. The composition of the systems will be compared to the extensive census built up on HSB galaxies. Our representative sample of six LSB galaxies {cz < 2700 km/s} are selected such, that a large system of globular clusters is expected. Globular clusters will constrain phases of LSB galaxy formation and evolution that can currently not be probed by other means. HST/ACS imaging is the only facility capable of studying the globular cluster systems of LSB galaxies given their distance and relative scarcity.
Hogg 12 and NGC 3590: A New Open Cluster Binary System Candidate
NASA Astrophysics Data System (ADS)
Piatti, Andrés E.; Clariá, Juan J.; Ahumada, Andrea V.
2010-05-01
We have obtained CCD UBVIKC photometry down to V ˜ 22.0 for the open clusters Hogg 12 and NGC 3590 and the fields surrounding them. Based on photometric and morphological criteria, as well as on the stellar density in the region, our evidence is sufficient to confirm that Hogg 12 is a genuine open cluster. NGC 3590 was used as a control cluster. The color-magnitude diagrams of Hogg 12, cleaned from field star contamination, reveal that this is a solar metal content cluster, affected by E(B - V) = 0.40 ± 0.05, located at a heliocentric distance d = 2.0 ± 0.5 kpc, and of an age similar to that of NGC 3590 (t = 30 Myr). Both clusters are surprisingly small objects whose radii are barely ˜1 pc, andthey are separated in the sky by scarcely 3.6 pc. These facts, added to their similar ages, reddenings, and metallicities, allow us to consider them a new open cluster binary system candidate. Of the ˜180 open cluster binary systems estimated to exist in the Galaxy, of which 27 are actually well known, Hogg 12 and NGC 3590 appear to be one of the two closest pairs.
VizieR Online Data Catalog: The Seven Sisters DANCe. I. Pleiades (Bouy+, 2015)
NASA Astrophysics Data System (ADS)
Bouy, H.; Bertin, E.; Sarro, L. M.; Barrado, D.; Moraux, E.; Bouvier, J.; Cuillandre, J.-C.; Berihuete, A.; Olivares, J.; Beletsky, Y.
2015-02-01
Position, proper motion, multi-wavelength ugrizYJHK photometry and membership probability to the Pleiades cluster for 1972245 sources. Present-day system bolometric luminosity and mass-functions of the Pleiades cluster. Empirical sequence of the Pleiades cluster in ugrizYJHK and BT,VT,JHK photometric systems. (7 data files).
Cartwright, Martin; Hirani, Shashivadan P; Rixon, Lorna; Beynon, Michelle; Doll, Helen; Bower, Peter; Bardsley, Martin; Steventon, Adam; Knapp, Martin; Henderson, Catherine; Rogers, Anne; Sanders, Caroline; Fitzpatrick, Ray; Barlow, James; Newman, Stanton P
2013-02-26
To assess the effect of second generation, home based telehealth on health related quality of life, anxiety, and depressive symptoms over 12 months in patients with long term conditions. A study of patient reported outcomes (the Whole Systems Demonstrator telehealth questionnaire study; baseline n=1573) was nested in a pragmatic, cluster randomised trial of telehealth (the Whole Systems Demonstrator telehealth trial, n=3230). General practice was the unit of randomisation, and telehealth was compared with usual care. Data were collected at baseline, four months (short term), and 12 months (long term). Primary intention to treat analyses tested treatment effectiveness; multilevel models controlled for clustering by general practice and a range of covariates. Analyses were conducted for 759 participants who completed questionnaire measures at all three time points (complete case cohort) and 1201 who completed the baseline assessment plus at least one other assessment (available case cohort). Secondary per protocol analyses tested treatment efficacy and included 633 and 1108 participants in the complete case and available case cohorts, respectively. Provision of primary and secondary care via general practices, specialist nurses, and hospital clinics in three diverse regions of England (Cornwall, Kent, and Newham), with established integrated health and social care systems. Patients with chronic obstructive pulmonary disease (COPD), diabetes, or heart failure recruited between May 2008 and December 2009. Generic, health related quality of life (assessed by physical and mental health component scores of the SF-12, and the EQ-5D), anxiety (assessed by the six item Brief State-Trait Anxiety Inventory), and depressive symptoms (assessed by the 10 item Centre for Epidemiological Studies Depression Scale). In the intention to treat analyses, differences between treatment groups were small and non-significant for all outcomes in the complete case (0.480 ≤ P ≤ 0.904) or available case (0.181 ≤ P ≤ 0.905) cohorts. The magnitude of differences between trial arms did not reach the trial defined, minimal clinically important difference (0.3 standardised mean difference) for any outcome in either cohort at four or 12 months. Per protocol analyses replicated the primary analyses; the main effect of trial arm (telehealth v usual care) was non-significant for any outcome (complete case cohort 0.273 ≤ P ≤ 0.761; available case cohort 0.145 ≤ P ≤ 0.696). Second generation, home based telehealth as implemented in the Whole Systems Demonstrator Evaluation was not effective or efficacious compared with usual care only. Telehealth did not improve quality of life or psychological outcomes for patients with chronic obstructive pulmonary disease, diabetes, or heart failure over 12 months. The findings suggest that concerns about potentially deleterious effect of telehealth are unfounded for most patients. ISRCTN43002091.
Ecotype diversification of an abundant Roseobacter lineage.
Sun, Ying; Zhang, Yao; Hollibaugh, James T; Luo, Haiwei
2017-04-01
The Roseobacter DC5-80-3 cluster (also known as the RCA clade) is among the most abundant bacterial lineages in temperate and polar oceans. Previous studies revealed two phylotypes within this cluster that are distinctly distributed in the Antarctic and other ocean provinces. Here, we report a nearly complete genome co-assembly of three closely related single cells co-occurring in the Antarctic, and compare it to the available genomes of the other phylotype from ocean regions where iron is more accessible but phosphorus and nitrogen are less. The Antarctic phylotype exclusively contains an operon structure consisting of a dicitrate transporter fecBCDE and an upstream regulator likely for iron uptake, whereas the other phylotype consistently carry a high-affinity phosphate pst transporter and the phoB-phoR regulatory system, a high-affinity ammonium amtB transporter, urea and taurine utilization systems. Moreover, the Antarctic phylotype uses proteorhodopsin to acquire light, whereas the other uses bacteriochlorophyll-a and the sulfur-oxidizing sox cluster for energy acquisition. This is potentially an iron-saving strategy for the Antarctic phylotype because only the latter two pathways have iron-requiring cytochromes. Therefore, the two DC5-80-3 phylotypes, while diverging by only 1.1% in their 16S rRNA genes, have evolved systematic differences in metabolism to support their distinct ecologies. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
Is antibody clustering predictive of clinical subsets and damage in systemic lupus erythematosus?
To, C H; Petri, M
2005-12-01
To examine autoantibody clusters and their associations with clinical features and organ damage accrual in patients with systemic lupus erythematosus (SLE). The study group comprised 1,357 consecutive patients with SLE who were recruited to participate in a prospective longitudinal cohort study. In the cohort, 92.6% of the patients were women, the mean +/- SD age of the patients was 41.3 +/- 12.7 years, 55.9% were Caucasian, 39.1% were African American, and 5% were Asian. Seven autoantibodies (anti-double-stranded DNA [anti-dsDNA], anti-Sm, anti-Ro, anti-La, anti-RNP, lupus anticoagulant (LAC), and anticardiolipin antibody [aCL]) were selected for cluster analysis using the K-means cluster analysis procedure. Three distinct autoantibody clusters were identified: cluster 1 (anti-Sm and anti-RNP), cluster 2 (anti-dsDNA, anti-Ro, and anti-La), and cluster 3 (anti-dsDNA, LAC, and aCL). Patients in cluster 1 (n = 451), when compared with patients in clusters 2 (n = 470) and 3 (n = 436), had the lowest incidence of proteinuria (39.7%), anemia (52.8%), lymphopenia (33.9%), and thrombocytopenia (13.7%). The incidence of nephrotic syndrome and leukopenia was also lower in cluster 1 than in cluster 2. Cluster 2 had the highest female-to-male ratio (22:1) and the greatest proportion of Asian patients. Among the 3 clusters, cluster 2 had significantly more patients presenting with secondary Sjögren's syndrome (15.7%). Cluster 3, when compared with the other 2 clusters, consisted of more Caucasian and fewer African American patients and was characterized by the highest incidence of arterial thrombosis (17.4%), venous thrombosis (25.7%), and livedo reticularis (31.4%). By using the Systemic Lupus International Collaborating Clinics/American College of Rheumatology Damage Index, the greatest frequency of nephrotic syndrome (8.9%) was observed in patients in cluster 2, whereas cluster 3 patients had the highest percentage of damage due to cerebrovascular accident (12.8%) and venous thrombosis (7.8%). Osteoporotic fracture (11.9%) was also more common in cluster 3 than in cluster 2. Autoantibody clustering is a valuable tool to differentiate between various subsets of SLE, allowing prediction of subsequent clinical course and organ damage.
Clinical Study of the 3D-Master Color System among the Spanish Population.
Gómez-Polo, Cristina; Gómez-Polo, Miguel; Martínez Vázquez de Parga, Juan Antonio; Celemín-Viñuela, Alicia
2017-01-12
To study whether the shades of the 3D-Master System were grouped and represented in the chromatic space according to the three-color coordinates of value, chroma, and hue. Maxillary central incisor color was measured on tooth surfaces through the Easyshade Compact spectrophotometer using 1361 participants aged between 16 and 89. The natural (not bleached teeth) color of the middle thirds was registered in the 3D-Master System nomenclature and in the CIELCh system. Principal component analysis and cluster analysis were applied. 75 colors of the 3D-Master System were found. The statistical analysis revealed the existence of 5 cluster groups. The centroid, the average of the 75 samples, in relation to lightness (L*) was 74.64, 22.87 for chroma (C*), and 88.85 for hue (h*). All of the clusters, except cluster 3, showed significant statistical differences with the centroid for the three-color coordinates (p <0.001). The results of this study indicated that 75 shades in the 3D-Master System were grouped into 5 clusters following coordinates L*, C*, and h* resulting from the dental spectrophotometer Vita Easyshade compact. The shades that composed each cluster did not belong to the same lightness color dimension groups. There was no special uniform chromatic distribution among the colors of the 3D-Master System. © 2017 by the American College of Prosthodontists.
Automated clustering-based workload characterization
NASA Technical Reports Server (NTRS)
Pentakalos, Odysseas I.; Menasce, Daniel A.; Yesha, Yelena
1996-01-01
The demands placed on the mass storage systems at various federal agencies and national laboratories are continuously increasing in intensity. This forces system managers to constantly monitor the system, evaluate the demand placed on it, and tune it appropriately using either heuristics based on experience or analytic models. Performance models require an accurate workload characterization. This can be a laborious and time consuming process. It became evident from our experience that a tool is necessary to automate the workload characterization process. This paper presents the design and discusses the implementation of a tool for workload characterization of mass storage systems. The main features of the tool discussed here are: (1)Automatic support for peak-period determination. Histograms of system activity are generated and presented to the user for peak-period determination; (2) Automatic clustering analysis. The data collected from the mass storage system logs is clustered using clustering algorithms and tightness measures to limit the number of generated clusters; (3) Reporting of varied file statistics. The tool computes several statistics on file sizes such as average, standard deviation, minimum, maximum, frequency, as well as average transfer time. These statistics are given on a per cluster basis; (4) Portability. The tool can easily be used to characterize the workload in mass storage systems of different vendors. The user needs to specify through a simple log description language how the a specific log should be interpreted. The rest of this paper is organized as follows. Section two presents basic concepts in workload characterization as they apply to mass storage systems. Section three describes clustering algorithms and tightness measures. The following section presents the architecture of the tool. Section five presents some results of workload characterization using the tool.Finally, section six presents some concluding remarks.
Hadjithomas, Michalis; Chen, I-Min Amy; Chu, Ken; ...
2015-07-14
In the discovery of secondary metabolites, analysis of sequence data is a promising exploration path that remains largely underutilized due to the lack of computational platforms that enable such a systematic approach on a large scale. In this work, we present IMG-ABC (https://img.jgi.doe.gov/abc), an atlas of biosynthetic gene clusters within the Integrated Microbial Genomes (IMG) system, which is aimed at harnessing the power of “big” genomic data for discovering small molecules. IMG-ABC relies on IMG’s comprehensive integrated structural and functional genomic data for the analysis of biosynthetic gene clusters (BCs) and associated secondary metabolites (SMs). SMs and BCs serve asmore » the two main classes of objects in IMG-ABC, each with a rich collection of attributes. A unique feature of IMG-ABC is the incorporation of both experimentally validated and computationally predicted BCs in genomes as well as metagenomes, thus identifying BCs in uncultured populations and rare taxa. We demonstrate the strength of IMG-ABC’s focused integrated analysis tools in enabling the exploration of microbial secondary metabolism on a global scale, through the discovery of phenazine-producing clusters for the first time in lphaproteobacteria. IMG-ABC strives to fill the long-existent void of resources for computational exploration of the secondary metabolism universe; its underlying scalable framework enables traversal of uncovered phylogenetic and chemical structure space, serving as a doorway to a new era in the discovery of novel molecules. IMG-ABC is the largest publicly available database of predicted and experimental biosynthetic gene clusters and the secondary metabolites they produce. The system also includes powerful search and analysis tools that are integrated with IMG’s extensive genomic/metagenomic data and analysis tool kits. As new research on biosynthetic gene clusters and secondary metabolites is published and more genomes are sequenced, IMG-ABC will continue to expand, with the goal of becoming an essential component of any bioinformatic exploration of the secondary metabolism world.« less
Castillo, Paula; Pietrantonio, Patricia V.
2013-01-01
In the red imported fire ant, Solenopsis invicta Buren, the neuronal and molecular mechanisms related to worker division of labor are poorly understood. Workers from different subcastes (major, medium and minors) perform different tasks, which are loosely associated with their size. We hypothesized that the short neuropeptide F (sNPF) signaling system (NPY-like) could be involved in mechanisms of worker division of labor and sensing or responding to colony nutritional requirements. Thus, we investigated the expression of the short neuropeptide F receptor (sNPFR) in the brain and subesophageal ganglion (SEG) of workers from colonies with and without brood. Across worker subcastes a total of 9 clusters of immunoreactive sNPFR cells were localized in the brain and the subesophageal ganglion (SEG); some of these cells were similar to those observed previously in the queen. Worker brain sNPFR cell clusters were found in the protocerebrum near mushroom bodies, in the central complex and in the lateral horn. Other sNPFR immunoreactive cells were found at the edge of the antennal lobes. Across subcastes, we observed both a constant and a differential pattern of sNPFR clusters, with a higher number of sNPFR cells found in minor than in major workers. Those sNPFR cells detected in all worker subcastes appear to be involved in olfaction or SEG functions. The differential expression of clusters in subcastes suggests that sNPFR signaling is involved in regulating behaviors associated with specific subcastes and thus, division of labor. Some sNPFR cells appear to be involved in nutrient sensing and/or brood care, feeding behavior and locomotion. In colonies without brood, workers showed a lower cluster number, and an overall reduced sNPFR signal. Our results suggest the sNPF signaling system is a candidate for the neurobiological control of worker division of labor and sensing brood presence, perhaps correlating with protein requirements and availability. PMID:24376775
Chemotaxis and flagellar genes of Chromobacterium violaceum.
Pereira, Maristela; Parente, Juliana Alves; Bataus, Luiz Artur Mendes; Cardoso, Divina das Dores de Paula; Soares, Renata Bastos Ascenço; Soares, Célia Maria de Almeida
2004-03-31
The availability of the complete genome of the Gram-negative beta-proteobacterium Chromobacterium violaceum has increasingly impacted our understanding of this microorganism. This review focuses on the genomic organization and structural analysis of the deduced proteins of the chemosensory adaptation system of C. violaceum. C. violaceum has multiple homologues of most chemotaxis genes, organized mostly in clusters in the bacterial genome. We found at least 67 genes, distributed in 10 gene clusters, involved in the chemotaxis of C. violaceum. A close examination of the chemoreceptors methyl-accepting chemotaxis proteins (MCPs), and the deduced sequences of the members of the two-component signaling system revealed canonical motifs, described as essential for the function of the deduced proteins. The chemoreceptors found in C. violaceum include the complete repertoire of such genes described in bacteria, designated as tsr, tar, trg, and tap; 41 MCP loci were found in the C. violaceum genome. Also, the C. violaceum genome includes a large repertoire of the proteins of the chemosensory transducer system. Multiple homologues of bacterial chemotaxis genes, including CheA, CheB, CheD, CheR, CheV, CheY, CheZ, and CheW, were found in the C. violaceum genome.
Clustered carbohydrates as a target for natural killer cells: a model system.
Kovalenko, Elena I; Abakushina, Elena; Telford, William; Kapoor, Veena; Korchagina, Elena; Khaidukov, Sergei; Molotkovskaya, Irina; Sapozhnikov, Alexander; Vlaskin, Pavel; Bovin, Nicolai
2007-03-01
Membrane-associated oligosaccharides are known to take part in interactions between natural killer (NK) cells and their targets and modulate NK cell activity. A model system was therefore developed using synthetic glycoconjugates as tools to modify the carbohydrate pattern on NK target cell surfaces. NK cells were then assessed for function in response to synthetic glycoconjugates, using both cytolysis-associated caspase 6 activation measured by flow cytometry and IFN-gamma production. Lipophilic neoglycoconjugates were synthesized to provide their easy incorporation into the target cell membranes and to make carbohydrate residues available for cell-cell interactions. While incorporation was successful based on fluorescence monitoring, glycoconjugate incorporation did not evoke artifactual changes in surface antigen expression, and had no negative effect on cell viability. Glycoconjugates contained Le(x), sulfated Le(x), and Le(y) sharing the common structure motif trisaccharide Le(x) were revealed to enhance cytotoxicity mediated specifically by CD16 +CD56+NK cells. The glycoconjugate effects were dependent on saccharide presentation in a polymeric form. Only polymeric, or clustered, but not monomeric glycoconjugates resulted in alteration of cytotoxicity in our system, suggesting that appropriate presentation is critical for carbohydrate recognition and subsequent biological effects.
Cluster structures influenced by interaction with a surface.
Witt, Christopher; Dieterich, Johannes M; Hartke, Bernd
2018-05-30
Clusters on surfaces are vitally important for nanotechnological applications. Clearly, cluster-surface interactions heavily influence the preferred cluster structures, compared to clusters in vacuum. Nevertheless, systematic explorations and an in-depth understanding of these interactions and how they determine the cluster structures are still lacking. Here we present an extension of our well-established non-deterministic global optimization package OGOLEM from isolated clusters to clusters on surfaces. Applying this approach to intentionally simple Lennard-Jones test systems, we produce a first systematic exploration that relates changes in cluster-surface interactions to resulting changes in adsorbed cluster structures.
Alexander, Nathan; Woetzel, Nils; Meiler, Jens
2011-02-01
Clustering algorithms are used as data analysis tools in a wide variety of applications in Biology. Clustering has become especially important in protein structure prediction and virtual high throughput screening methods. In protein structure prediction, clustering is used to structure the conformational space of thousands of protein models. In virtual high throughput screening, databases with millions of drug-like molecules are organized by structural similarity, e.g. common scaffolds. The tree-like dendrogram structure obtained from hierarchical clustering can provide a qualitative overview of the results, which is important for focusing detailed analysis. However, in practice it is difficult to relate specific components of the dendrogram directly back to the objects of which it is comprised and to display all desired information within the two dimensions of the dendrogram. The current work presents a hierarchical agglomerative clustering method termed bcl::Cluster. bcl::Cluster utilizes the Pymol Molecular Graphics System to graphically depict dendrograms in three dimensions. This allows simultaneous display of relevant biological molecules as well as additional information about the clusters and the members comprising them.
Clustering of tethered satellite system simulation data by an adaptive neuro-fuzzy algorithm
NASA Technical Reports Server (NTRS)
Mitra, Sunanda; Pemmaraju, Surya
1992-01-01
Recent developments in neuro-fuzzy systems indicate that the concepts of adaptive pattern recognition, when used to identify appropriate control actions corresponding to clusters of patterns representing system states in dynamic nonlinear control systems, may result in innovative designs. A modular, unsupervised neural network architecture, in which fuzzy learning rules have been embedded is used for on-line identification of similar states. The architecture and control rules involved in Adaptive Fuzzy Leader Clustering (AFLC) allow this system to be incorporated in control systems for identification of system states corresponding to specific control actions. We have used this algorithm to cluster the simulation data of Tethered Satellite System (TSS) to estimate the range of delta voltages necessary to maintain the desired length rate of the tether. The AFLC algorithm is capable of on-line estimation of the appropriate control voltages from the corresponding length error and length rate error without a priori knowledge of their membership functions and familarity with the behavior of the Tethered Satellite System.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Boksenberg, Alec; Sargent, Wallace L. W., E-mail: boksy@ast.cam.ac.uk
2015-05-15
Using Voigt-profile-fitting procedures on Keck High Resolution Spectrograph spectra of nine QSOs, we identify 1099 C IV absorber components clumped in 201 systems outside the Lyman forest over 1.6 ≲ z ≲ 4.4. With associated Si IV, C II, Si II and N V where available, we investigate the bulk statistical and ionization properties of the components and systems and find no significant change in redshift for C IV and Si IV while C II, Si II and N V change substantially. The C IV components exhibit strong clustering, but no clustering is detected for systems on scales from 150 kmmore » s{sup –1} out to 50,000 km s{sup –1}. We conclude that the clustering is due entirely to the peculiar velocities of gas present in the circumgalactic media of galaxies. Using specific combinations of ionic ratios, we compare our observations with model ionization predictions for absorbers exposed to the metagalactic ionizing radiation background augmented by proximity radiation from their associated galaxies and find that the generally accepted means of radiative escape by transparent channels from the internal star-forming sites is spectrally not viable for our stronger absorbers. We develop an active scenario based on runaway stars with resulting changes in the efflux of radiation that naturally enable the needed spectral convergence, and in turn provide empirical indicators of morphological evolution in the associated galaxies. Together with a coexisting population of relatively compact galaxies indicated by the weaker absorbers in our sample, the collective escape of radiation is sufficient to maintain the intergalactic medium ionized over the full range 1.9 < z ≲ 4.4.« less
Inductive Approaches to Improving Diagnosis and Design for Diagnosability
NASA Technical Reports Server (NTRS)
Fisher, Douglas H. (Principal Investigator)
1995-01-01
The first research area under this grant addresses the problem of classifying time series according to their morphological features in the time domain. A supervised learning system called CALCHAS, which induces a classification procedure for signatures from preclassified examples, was developed. For each of several signature classes, the system infers a model that captures the class's morphological features using Bayesian model induction and the minimum message length approach to assign priors. After induction, a time series (signature) is classified in one of the classes when there is enough evidence to support that decision. Time series with sufficiently novel features, belonging to classes not present in the training set, are recognized as such. A second area of research assumes two sources of information about a system: a model or domain theory that encodes aspects of the system under study and data from actual system operations over time. A model, when it exists, represents strong prior expectations about how a system will perform. Our work with a diagnostic model of the RCS (Reaction Control System) of the Space Shuttle motivated the development of SIG, a system which combines information from a model (or domain theory) and data. As it tracks RCS behavior, the model computes quantitative and qualitative values. Induction is then performed over the data represented by both the 'raw' features and the model-computed high-level features. Finally, work on clustering for operating mode discovery motivated some important extensions to the clustering strategy we had used. One modification appends an iterative optimization technique onto the clustering system; this optimization strategy appears to be novel in the clustering literature. A second modification improves the noise tolerance of the clustering system. In particular, we adapt resampling-based pruning strategies used by supervised learning systems to the task of simplifying hierarchical clusterings, thus making post-clustering analysis easier.
Optical setup for two-colour experiments at the low density matter beamline of FERMI
NASA Astrophysics Data System (ADS)
Finetti, Paola; Demidovich, Alexander; Plekan, Oksana; Di Fraia, Michele; Cucini, Riccardo; Callegari, Carlo; Cinquegrana, Paolo; Sigalotti, Paolo; Ivanov, Rosen; Danailov, Miltcho B.; Fava, Claudio; De Ninno, Giovanni; Coreno, Marcello; Grazioli, Cesare; Feifel, Raimund; Squibb, Richard J.; Mazza, Tommaso; Meyer, Michael; Prince, Kevin C.
2017-11-01
The low density matter beamline of the free electron laser facility FERMI is dedicated to the study of atomic, molecular and cluster systems, and here we describe the optical setup available for two-colour experiments. Samples can be exposed to ultrashort pulses from a Ti:Sapphire source (fundamental, or second or third harmonic), and ultrashort light pulses of FERMI in the EUV/soft x-ray region with a well-defined temporal delay, and negligible jitter (<10 fs) compared to the pulse durations (40-100 fs). Detection schemes available include electron, ion and optical spectroscopy. The majority of experiments using this apparatus are pump-and-probe, where either wavelength can be pump or probe, but the system is also useful for other techniques, such as multi-photon spectroscopy, cross-correlation measurements and alignment of molecules in space.
A Multidisciplinary Investigation of a Polycythemia Vera Cancer Cluster of Unknown Origin
Seaman, Vincent; Dearwent, Steve M; Gable, Debra; Lewis, Brian; Metcalf, Susan; Orloff, Ken; Tierney, Bruce; Zhu, Jane; Logue, James; Marchetto, David; Ostroff, Stephen; Hoffman, Ronald; Xu, Mingjiang; Carey, David; Erlich, Porat; Gerhard, Glenn; Roda, Paul; Iannuzzo, Joseph; Lewis, Robert; Mellow, John; Mulvihill, Linda; Myles, Zachary; Wu, Manxia; Frank, Arthur; Gross-Davis, Carol Ann; Klotz, Judith; Lynch, Adam; Weissfeld, Joel; Weinberg, Rona; Cole, Henry
2010-01-01
Cancer cluster investigations rarely receive significant public health resource allocations due to numerous inherent challenges and the limited success of past efforts. In 2008, a cluster of polycythemia vera, a rare blood cancer with unknown etiology, was identified in northeast Pennsylvania. A multidisciplinary group of federal and state agencies, academic institutions, and local healthcare providers subsequently developed a multifaceted research portfolio designed to better understand the cause of the cluster. This research agenda represents a unique and important opportunity to demonstrate that cancer cluster investigations can produce desirable public health and scientific outcomes when necessary resources are available. PMID:20617023
A conserved gene cluster as a putative functional unit in insect innate immunity.
Somogyi, Kálmán; Sipos, Botond; Pénzes, Zsolt; Andó, István
2010-11-05
The Nimrod gene superfamily is an important component of the innate immune response. The majority of its member genes are located in close proximity within the Drosophila melanogaster genome and they lie in a larger conserved cluster ("Nimrod cluster"), made up of non-related groups (families, superfamilies) of genes. This cluster has been a part of the Arthropod genomes for about 300-350 million years. The available data suggest that the Nimrod cluster is a functional module of the insect innate immune response. Copyright © 2010 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Some remarks on extragalactic globular clusters
NASA Astrophysics Data System (ADS)
Richtler, Tom
2006-03-01
I comment (in a review fashion) on a few selected topics in the field of extragalactic globular clusters with strong emphasis on recent work. The topics are: bimodality in the colour distribution of cluster systems, young massive clusters, and the brightest old clusters. Globular cluster research, per- haps more than ever, has lead to important (at least to astronomers) progress and problems in galaxy structure and formation.
Climate Ocean Modeling on a Beowulf Class System
NASA Technical Reports Server (NTRS)
Cheng, B. N.; Chao, Y.; Wang, P.; Bondarenko, M.
2000-01-01
With the growing power and shrinking cost of personal computers. the availability of fast ethernet interconnections, and public domain software packages, it is now possible to combine them to build desktop parallel computers (named Beowulf or PC clusters) at a fraction of what it would cost to buy systems of comparable power front supercomputer companies. This led as to build and assemble our own sys tem. specifically for climate ocean modeling. In this article, we present our experience with such a system, discuss its network performance, and provide some performance comparison data with both HP SPP2000 and Cray T3E for an ocean Model used in present-day oceanographic research.
PuReD-MCL: a graph-based PubMed document clustering methodology.
Theodosiou, T; Darzentas, N; Angelis, L; Ouzounis, C A
2008-09-01
Biomedical literature is the principal repository of biomedical knowledge, with PubMed being the most complete database collecting, organizing and analyzing such textual knowledge. There are numerous efforts that attempt to exploit this information by using text mining and machine learning techniques. We developed a novel approach, called PuReD-MCL (Pubmed Related Documents-MCL), which is based on the graph clustering algorithm MCL and relevant resources from PubMed. PuReD-MCL avoids using natural language processing (NLP) techniques directly; instead, it takes advantage of existing resources, available from PubMed. PuReD-MCL then clusters documents efficiently using the MCL graph clustering algorithm, which is based on graph flow simulation. This process allows users to analyse the results by highlighting important clues, and finally to visualize the clusters and all relevant information using an interactive graph layout algorithm, for instance BioLayout Express 3D. The methodology was applied to two different datasets, previously used for the validation of the document clustering tool TextQuest. The first dataset involves the organisms Escherichia coli and yeast, whereas the second is related to Drosophila development. PuReD-MCL successfully reproduces the annotated results obtained from TextQuest, while at the same time provides additional insights into the clusters and the corresponding documents. Source code in perl and R are available from http://tartara.csd.auth.gr/~theodos/
Crooks, Daniel R.; Ghosh, Manik C.; Haller, Ronald G.; Tong, Wing-Hang
2010-01-01
Mammalian ferrochelatase, the terminal enzyme in the heme biosynthetic pathway, possesses an iron-sulfur [2Fe-2S] cluster that does not participate in catalysis. We investigated ferrochelatase expression in iron-deficient erythropoietic tissues of mice lacking iron regulatory protein 2, in iron-deficient murine erythroleukemia cells, and in human patients with ISCU myopathy. Ferrochelatase activity and protein levels were dramatically decreased in Irp2−/− spleens, whereas ferrochelatase mRNA levels were increased, demonstrating posttranscriptional regulation of ferrochelatase in vivo. Translation of ferrochelatase mRNA was unchanged in iron-depleted murine erythroleukemia cells, and the stability of mature ferrochelatase protein was also unaffected. However, the stability of newly formed ferrochelatase protein was dramatically decreased during iron deficiency. Ferrochelatase was also severely depleted in muscle biopsies and cultured myoblasts from patients with ISCU myopathy, a disease caused by deficiency of a scaffold protein required for Fe-S cluster assembly. Together, these data suggest that decreased Fe-S cluster availability because of cellular iron depletion or impaired Fe-S cluster assembly causes reduced maturation and stabilization of apo-ferrochelatase, providing a direct link between Fe-S biogenesis and completion of heme biosynthesis. We propose that decreased heme biosynthesis resulting from impaired Fe-S cluster assembly can contribute to the pathogenesis of diseases caused by defective Fe-S cluster biogenesis. PMID:19965627
Time-Series Monitoring of Open Star Clusters
NASA Astrophysics Data System (ADS)
Hojaev, A. S.; Semakov, D. G.
2006-08-01
Star clusters especially a compact ones (with diameter of few to ten arcmin) are suitable targets to search of light variability for orchestera of stars by means of ordinary Casegrain telescope plus CCD system. A special patroling with short time-fixed exposures and mmag accuracy could be used also to study of stellar oscillation for group of stars simultaneously. The last can be carried out both separately from one site and within international campaigns. Detection and study of optical variability of X-ray sources including X-ray binaries with compact objects might be as a result of a long-term monitoring of such clusters as well. We present the program of open star clusters monitoring with Zeiss 1 meter RCC telescope of Maidanak observatory has been recently automated. In combination with quite good seeing at this observatory (see, e.g., Sarazin, M. 1999, URL http://www.eso.org/gen-fac/pubs/astclim/) the automatic telescope equipped with large-format (2KX2K) CCD camera AP-10 available will allow to collect homogenious time-series for analysis. We already started this program in 2001 and had a set of patrol observations with Zeiss 0.6 meter telescope and AP-10 camera in 2003. 7 compact open clusters in the Milky Way (NGC 7801, King1, King 13, King18, King20, Berkeley 55, IC 4996) have been monitored for stellar variability and some results of photometry will be presented. A few interesting variables were discovered and dozens were suspected for variability to the moment in these clusters for the first time. We have made steps to join the Whole-Earth Telescope effort in its future campaigns.
Characterization of the velocity anisotropy of accreted globular clusters
NASA Astrophysics Data System (ADS)
Bianchini, P.; Sills, A.; Miholics, M.
2017-10-01
Galactic globular clusters (GCs) are believed to have formed in situ in the Galaxy as well as in dwarf galaxies later accreted on to the Milky Way. However, to date, there is no unambiguous signature to distinguish accreted GCs. Using specifically designed N-body simulations of GCs evolving in a variety of time-dependent tidal fields (describing the potential of a dwarf galaxy-Milky Way merger), we analyse the effects imprinted on the internal kinematics of an accreted GC. In particular, we look at the evolution of the velocity anisotropy. Our simulations show that at early phases, the velocity anisotropy is determined by the tidal field of the dwarf galaxy and subsequently the clusters will adapt to the new tidal environment, losing any signature of their original environment in a few relaxation times. At 10 Gyr, GCs exhibit a variety of velocity anisotropy profiles, namely, isotropic velocity distribution in the inner regions and either isotropy or radial/tangential anisotropy in the intermediate and outer regions. Independent of an accreted origin, the velocity anisotropy primarily depends on the strength of the tidal field cumulatively experienced by a cluster. Tangentially anisotropic clusters correspond to systems that have experienced stronger tidal fields and are characterized by higher tidal filling factor, r50/rj ≳ 0.17, higher mass-loss ≳ 60 per cent and relaxation times trel ≲ 109 Gyr. Interestingly, we demonstrate that the presence of tidal tails can significantly contaminate the measurements of velocity anisotropy when a cluster is observed in projection. Our characterization of the velocity anisotropy profiles in different tidal environments provides a theoretical benchmark for the interpretation of the unprecedented amount of three-dimensional kinematic data progressively available for Galactic GCs.
R package to estimate intracluster correlation coefficient with confidence interval for binary data.
Chakraborty, Hrishikesh; Hossain, Akhtar
2018-03-01
The Intracluster Correlation Coefficient (ICC) is a major parameter of interest in cluster randomized trials that measures the degree to which responses within the same cluster are correlated. There are several types of ICC estimators and its confidence intervals (CI) suggested in the literature for binary data. Studies have compared relative weaknesses and advantages of ICC estimators as well as its CI for binary data and suggested situations where one is advantageous in practical research. The commonly used statistical computing systems currently facilitate estimation of only a very few variants of ICC and its CI. To address the limitations of current statistical packages, we developed an R package, ICCbin, to facilitate estimating ICC and its CI for binary responses using different methods. The ICCbin package is designed to provide estimates of ICC in 16 different ways including analysis of variance methods, moments based estimation, direct probabilistic methods, correlation based estimation, and resampling method. CI of ICC is estimated using 5 different methods. It also generates cluster binary data using exchangeable correlation structure. ICCbin package provides two functions for users. The function rcbin() generates cluster binary data and the function iccbin() estimates ICC and it's CI. The users can choose appropriate ICC and its CI estimate from the wide selection of estimates from the outputs. The R package ICCbin presents very flexible and easy to use ways to generate cluster binary data and to estimate ICC and it's CI for binary response using different methods. The package ICCbin is freely available for use with R from the CRAN repository (https://cran.r-project.org/package=ICCbin). We believe that this package can be a very useful tool for researchers to design cluster randomized trials with binary outcome. Copyright © 2017 Elsevier B.V. All rights reserved.
Chou, A; Burke, J
1999-05-01
DNA sequence clustering has become a valuable method in support of gene discovery and gene expression analysis. Our interest lies in leveraging the sequence diversity within clusters of expressed sequence tags (ESTs) to model gene structure for the study of gene variants that arise from, among other things, alternative mRNA splicing, polymorphism, and divergence after gene duplication, fusion, and translocation events. In previous work, CRAW was developed to discover gene variants from assembled clusters of ESTs. Most importantly, novel gene features (the differing units between gene variants, for example alternative exons, polymorphisms, transposable elements, etc.) that are specialized to tissue, disease, population, or developmental states can be identified when these tools collate DNA source information with gene variant discrimination. While the goal is complete automation of novel feature and gene variant detection, current methods are far from perfect and hence the development of effective tools for visualization and exploratory data analysis are of paramount importance in the process of sifting through candidate genes and validating targets. We present CRAWview, a Java based visualization extension to CRAW. Features that vary between gene forms are displayed using an automatically generated color coded index. The reporting format of CRAWview gives a brief, high level summary report to display overlap and divergence within clusters of sequences as well as the ability to 'drill down' and see detailed information concerning regions of interest. Additionally, the alignment viewing and editing capabilities of CRAWview make it possible to interactively correct frame-shifts and otherwise edit cluster assemblies. We have implemented CRAWview as a Java application across windows NT/95 and UNIX platforms. A beta version of CRAWview will be freely available to academic users from Pangea Systems (http://www.pangeasystems.com). Contact :
NASA Astrophysics Data System (ADS)
Richardson, Jacob; Connor, Charles; Malservisi, Rocco; Bleacher, Jacob; Connor, Laura
2014-05-01
Clusters of tens to thousands of small volcanoes (diameters generally <30 km) are common features on the surface of Mars, Venus, and the Earth. These clusters may be described as distributed-style volcanism. Better characterizing the magmatic plumbing system of these clusters can constrain magma ascent processes as well as the regional magma production budget and heat flux beneath each cluster. Unfortunately, directly observing the plumbing systems of volcano clusters on Mars and Venus eludes our current geologic abilities. Because erosion exposes such systems at the Earth's surface, a better understanding of magmatic processes and migration can be achieved via field analysis. The terrestrial plumbing system of an eroded volcanic field may be a valuable planetary analog for Venus and Mars clusters. The magmatic plumbing system of a Pliocene-aged monogenetic volcanic field, emplaced at 0.8 km depth, is currently exposed as a sill and dike swarm in the San Rafael Desert of Central Utah, USA. The mafic bodies in this region intruded into Mesozoic sedimentary units and now make up the most erosion resistant units as sills, dikes, and plug-like conduits. Light Detection and Ranging (LiDAR) can identify volcanic units (sills, dikes, and conduits) at high resolution, both geomorphologically and with near infrared return intensity values. Two Terrestrial LiDAR Surveys and an Airborne LiDAR Survey have been carried out over the San Rafael volcanic swarm, producing a three dimensional point cloud over approximately 36 sq. km. From the point clouds of these surveys, 1-meter DEMs are produced and volcanic intrusions have been mapped. Here we present reconstructions of the volcanic instrusions of the San Rafael Swarm. We create this reconstruction by extrapolating mapped intrustions from the LiDAR surveys into a 3D space around the current surface. We compare the estimated intrusive volume to the estimated conduit density and estimates of extrusive volume at volcano clusters of similar density. The extrapolated reconstruction and conduit mapping provide a first-order estimate of the final intrustive/extrusive volume ratio for the now eroded volcanic field. Earth, Venus and Mars clusters are compared using Kernel Density Estimation (KDE) , which objectively compares cluster area, complexity, and vent density per sq. km. We show that Martian clusters are less dense than Venus clusters, which in turn are less dense than those on Earth. KDE and previous models of intrusive morphology for Mars and Venus are here used to calibrate the San Rafael plumbing system model to clusters on the two planets. The results from the calibrated Mars and Venus plumbing system models can be compared to previous estimates of magma budget and intrusive/extrusive ratios on Venus and Mars.
Elastic K-means using posterior probability
Zheng, Aihua; Jiang, Bo; Li, Yan; Zhang, Xuehan; Ding, Chris
2017-01-01
The widely used K-means clustering is a hard clustering algorithm. Here we propose a Elastic K-means clustering model (EKM) using posterior probability with soft capability where each data point can belong to multiple clusters fractionally and show the benefit of proposed Elastic K-means. Furthermore, in many applications, besides vector attributes information, pairwise relations (graph information) are also available. Thus we integrate EKM with Normalized Cut graph clustering into a single clustering formulation. Finally, we provide several useful matrix inequalities which are useful for matrix formulations of learning models. Based on these results, we prove the correctness and the convergence of EKM algorithms. Experimental results on six benchmark datasets demonstrate the effectiveness of proposed EKM and its integrated model. PMID:29240756
NASA Astrophysics Data System (ADS)
Guhathakurta, Puragra; Toloba, Elisa; Peng, Eric W.; Li, Biao; Gwyn, Stephen; Ferrarese, Laura; Cote, Patrick; Chu, Jason; Sparkman, Lea; Chen, Stephanie; Yagati, Samyukta; Muller, Meredith; Next Generation Virgo Survey Collaboration
2015-01-01
We present results from an ongoing study of globular cluster (GC) satellites of low-luminosity dwarf elliptical (dE) galaxies in the Virgo cluster. Our 21 dE targets and candidate GC satellites around them in the apparent magnitude range g ~ 20-24 were selected from the Next Generation Virgo Survey (NGVS) and followed up with medium-resolution Keck/DEIMOS spectroscopy (resolving power: R ~ 2000; wavelength coverage: 4800-9500 Angstrom). In addition, the remaining space available on the nine DEIMOS multi-slit masks were populated with "filler" targets in the form of distant Milky Way halo star candidates in a comparable apparent magnitude range. A combination of radial velocity information (measured from the Keck/DEIMOS spectra), color-color information (from four-band NGVS photometry), and sky position information was used to sort the sample into the following categories: (1) GC satellites of dEs, (2) other non-satellite GCs in the Virgo cluster (we dub them "orphan" GCs), (3) foreground Milky Way stars that are members of the Sagittarius stream, the Virgo overdensity, or the field halo population, and (4) distant background galaxies. We stack the GC satellite population across all 21 host dEs and carry out dynamical modeling of the stacked sample in order to constrain the average mass of dark matter halos that these dEs are embedded in. We study rotation in the system of GC satellites of dEs in the handful of more populated systems in our sample - i.e., those that contain 10 or more GC satellites per dE. A companion AAS poster presented at this meeting (Chu, J. et al. 2015) presents chemical composition and age constraints for these GC satellites relative to the nuclei of the host dEs based on absorption line strengths in co-added spectra. The orphan GCs are likely to be intergalactic GCs within the Virgo cluster (or, equivalently, GCs in the remote outer envelope of the cluster's central galaxy, the giant elliptical M87).This project is funded in part by the National Science Foundation. Some of this research was conducted by high-school students working under the auspices of the Science Internship Program at the University of California Santa Cruz.
Dynamical Classifications of the Kuiper Belt
NASA Astrophysics Data System (ADS)
Maggard, Steven; Ragozzine, Darin
2018-04-01
The Minor Planet Center (MPC) contains a plethora of observational data on thousands of Kuiper Belt Objects (KBOs). Understanding their orbital properties refines our understanding of the formation of the solar system. My analysis pipeline, BUNSHIN, uses Bayesian methods to take the MPC observations and generate 30 statistically weighted orbital clones for each KBO that are propagated backwards along their orbits until the beginning of the solar system. These orbital integrations are saved as REBOUND SimulationArchive files (Rein & Tamayo 2017) which we will make publicly available, allowing many others to perform statistically-robust dynamical classification or complex dynamical investigations of outer solar system small bodies.This database has been used to expand the known collisional family members of the dwarf planet Haumea. Detailed orbital integrations are required to determine the dynamical distances between family members, in the form of "Delta v" as measured from conserved proper orbital elements (Ragozzine & Brown 2007). Our preliminary results have already ~tripled the number of known Haumea family members, allowing us to show that the Haumea family can be identified purely through dynamical clustering.We will discuss the methods associated with BUNSHIN and the database it generates, the refinement of the updated Haumea family, a brief search for other possible clusterings in the outer solar system, and the potential of our research to aid other dynamicists.
Advanced electronics for the CTF MEG system.
McCubbin, J; Vrba, J; Spear, P; McKenzie, D; Willis, R; Loewen, R; Robinson, S E; Fife, A A
2004-11-30
Development of the CTF MEG system has been advanced with the introduction of a computer processing cluster between the data acquisition electronics and the host computer. The advent of fast processors, memory, and network interfaces has made this innovation feasible for large data streams at high sampling rates. We have implemented tasks including anti-alias filter, sample rate decimation, higher gradient balancing, crosstalk correction, and optional filters with a cluster consisting of 4 dual Intel Xeon processors operating on up to 275 channel MEG systems at 12 kHz sample rate. The architecture is expandable with additional processors to implement advanced processing tasks which may include e.g., continuous head localization/motion correction, optional display filters, coherence calculations, or real time synthetic channels (via beamformer). We also describe an electronics configuration upgrade to provide operator console access to the peripheral interface features such as analog signal and trigger I/O. This allows remote location of the acoustically noisy electronics cabinet and fitting of the cabinet with doors for improved EMI shielding. Finally, we present the latest performance results available for the CTF 275 channel MEG system including an unshielded SEF (median nerve electrical stimulation) measurement enhanced by application of an adaptive beamformer technique (SAM) which allows recognition of the nominal 20-ms response in the unaveraged signal.
Chahal, Harsimranjit K.; Outten, F. Wayne
2012-01-01
Iron-sulfur (Fe-S) clusters are inorganic cofactors required for a variety of biological processes. In vivo biogenesis of Fe-S clusters proceeds via complex pathways involving multiple protein complexes. In the Suf Fe-S cluster biogenesis system, SufB may be a scaffold for nascent Fe-S cluster assembly whereas SufA is proposed to act as either a scaffold or an Fe-S cluster carrier from the scaffold to target apo-proteins. However, SufB can form multiple stable complexes with other Suf proteins, such as SufB2C2 and SufBC2D and the specific functions of these complexes in Fe-S cluster assembly are not clear. Here we compare the ability of the SufB2C2 and SufBC2D complexes as well as SufA to promote in vitro maturation of the [2Fe-2S] ferredoxin (Fdx). We found that SufB2C2 was most proficient as a scaffold for de novo assembly of holo-Fdx using sulfide and iron as freely available building blocks while SufA was best at direct transfer of a pre-formed Fe-S cluster to Fdx. Furthermore, cluster transfer from [4Fe-4S] SufB2C2 or SufBC2D to Fdx will proceed through a SufA intermediate to Fdx is SufA is present. Finally, addition of ATP repressed cluster transfer from [4Fe-4S] SufB2C2 to Fdx and from SufBC2D to [2Fe-2S] SufA or Fdx. These studies indicate that SufB2C2 can serve as a terminal scaffold to load the SufA Fe-S cluster carrier for in vitro maturation of [2Fe-2S] enzymes like Fdx. This work is the first to systematically compare the cluster transfer rates of a scaffold (SufB) to the transfer rates of a carrier (SufA) under the same conditions to the same target enzyme and is also the first to reconstitute the full transfer pathway (from scaffold to carrier to target enzyme) in a single reaction. PMID:23018275
NASA Astrophysics Data System (ADS)
Capone, V.; Esposito, R.; Pardi, S.; Taurino, F.; Tortone, G.
2012-12-01
Over the last few years we have seen an increasing number of services and applications needed to manage and maintain cloud computing facilities. This is particularly true for computing in high energy physics, which often requires complex configurations and distributed infrastructures. In this scenario a cost effective rationalization and consolidation strategy is the key to success in terms of scalability and reliability. In this work we describe an IaaS (Infrastructure as a Service) cloud computing system, with high availability and redundancy features, which is currently in production at INFN-Naples and ATLAS Tier-2 data centre. The main goal we intended to achieve was a simplified method to manage our computing resources and deliver reliable user services, reusing existing hardware without incurring heavy costs. A combined usage of virtualization and clustering technologies allowed us to consolidate our services on a small number of physical machines, reducing electric power costs. As a result of our efforts we developed a complete solution for data and computing centres that can be easily replicated using commodity hardware. Our architecture consists of 2 main subsystems: a clustered storage solution, built on top of disk servers running GlusterFS file system, and a virtual machines execution environment. GlusterFS is a network file system able to perform parallel writes on multiple disk servers, providing this way live replication of data. High availability is also achieved via a network configuration using redundant switches and multiple paths between hypervisor hosts and disk servers. We also developed a set of management scripts to easily perform basic system administration tasks such as automatic deployment of new virtual machines, adaptive scheduling of virtual machines on hypervisor hosts, live migration and automated restart in case of hypervisor failures.
Stochastic theory of log-periodic patterns
NASA Astrophysics Data System (ADS)
Canessa, Enrique
2000-12-01
We introduce an analytical model based on birth-death clustering processes to help in understanding the empirical log-periodic corrections to power law scaling and the finite-time singularity as reported in several domains including rupture, earthquakes, world population and financial systems. In our stochastic theory log-periodicities are a consequence of transient clusters induced by an entropy-like term that may reflect the amount of co-operative information carried by the state of a large system of different species. The clustering completion rates for the system are assumed to be given by a simple linear death process. The singularity at t0 is derived in terms of birth-death clustering coefficients.
Mehryary, Farrokh; Kaewphan, Suwisa; Hakala, Kai; Ginter, Filip
2016-01-01
Biomedical event extraction is one of the key tasks in biomedical text mining, supporting various applications such as database curation and hypothesis generation. Several systems, some of which have been applied at a large scale, have been introduced to solve this task. Past studies have shown that the identification of the phrases describing biological processes, also known as trigger detection, is a crucial part of event extraction, and notable overall performance gains can be obtained by solely focusing on this sub-task. In this paper we propose a novel approach for filtering falsely identified triggers from large-scale event databases, thus improving the quality of knowledge extraction. Our method relies on state-of-the-art word embeddings, event statistics gathered from the whole biomedical literature, and both supervised and unsupervised machine learning techniques. We focus on EVEX, an event database covering the whole PubMed and PubMed Central Open Access literature containing more than 40 million extracted events. The top most frequent EVEX trigger words are hierarchically clustered, and the resulting cluster tree is pruned to identify words that can never act as triggers regardless of their context. For rarely occurring trigger words we introduce a supervised approach trained on the combination of trigger word classification produced by the unsupervised clustering method and manual annotation. The method is evaluated on the official test set of BioNLP Shared Task on Event Extraction. The evaluation shows that the method can be used to improve the performance of the state-of-the-art event extraction systems. This successful effort also translates into removing 1,338,075 of potentially incorrect events from EVEX, thus greatly improving the quality of the data. The method is not solely bound to the EVEX resource and can be thus used to improve the quality of any event extraction system or database. The data and source code for this work are available at: http://bionlp-www.utu.fi/trigger-clustering/.
Yu, Shann S; Scherer, Randy L; Ortega, Ryan A; Bell, Charleson S; O'Neil, Conlin P; Hubbell, Jeffrey A; Giorgio, Todd D
2011-02-27
Drug and contrast agent delivery systems that achieve controlled release in the presence of enzymatic activity are becoming increasingly important, as enzymatic activity is a hallmark of a wide array of diseases, including cancer and atherosclerosis. Here, we have synthesized clusters of ultrasmall superparamagnetic iron oxides (USPIOs) that sense enzymatic activity for applications in magnetic resonance imaging (MRI). To achieve this goal, we utilize amphiphilic poly(propylene sulfide)-bl-poly(ethylene glycol) (PPS-b-PEG) copolymers, which are known to have excellent properties for smart delivery of drug and siRNA. Monodisperse PPS polymers were synthesized by anionic ring opening polymerization of propylene sulfide, and were sequentially reacted with commercially available heterobifunctional PEG reagents and then ssDNA sequences to fashion biofunctional PPS-bl-PEG copolymers. They were then combined with hydrophobic 12 nm USPIO cores in the thin-film hydration method to produce ssDNA-displaying USPIO micelles. Micelle populations displaying complementary ssDNA sequences were mixed to induce crosslinking of the USPIO micelles. By design, these crosslinking sequences contained an EcoRV cleavage site. Treatment of the clusters with EcoRV results in a loss of R2 negative contrast in the system. Further, the USPIO clusters demonstrate temperature sensitivity as evidenced by their reversible dispersion at ~75°C and re-clustering following return to room temperature. This work demonstrates proof of concept of an enzymatically-actuatable and thermoresponsive system for dynamic biosensing applications. The platform exhibits controlled release of nanoparticles leading to changes in magnetic relaxation, enabling detection of enzymatic activity. Further, the presented functionalization scheme extends the scope of potential applications for PPS-b-PEG. Combined with previous findings using this polymer platform that demonstrate controlled drug release in oxidative environments, smart theranostic applications combining drug delivery with imaging of platform localization are within reach. The modular design of these USPIO nanoclusters enables future development of platforms for imaging and drug delivery targeted towards proteolytic activity in tumors and in advanced atherosclerotic plaques.
Zhang, X; Patel, L A; Beckwith, O; Schneider, R; Weeden, C J; Kindt, J T
2017-11-14
Micelle cluster distributions from molecular dynamics simulations of a solvent-free coarse-grained model of sodium octyl sulfate (SOS) were analyzed using an improved method to extract equilibrium association constants from small-system simulations containing one or two micelle clusters at equilibrium with free surfactants and counterions. The statistical-thermodynamic and mathematical foundations of this partition-enabled analysis of cluster histograms (PEACH) approach are presented. A dramatic reduction in computational time for analysis was achieved through a strategy similar to the selector variable method to circumvent the need for exhaustive enumeration of the possible partitions of surfactants and counterions into clusters. Using statistics from a set of small-system (up to 60 SOS molecules) simulations as input, equilibrium association constants for micelle clusters were obtained as a function of both number of surfactants and number of associated counterions through a global fitting procedure. The resulting free energies were able to accurately predict micelle size and charge distributions in a large (560 molecule) system. The evolution of micelle size and charge with SOS concentration as predicted by the PEACH-derived free energies and by a phenomenological four-parameter model fit, along with the sensitivity of these predictions to variations in cluster definitions, are analyzed and discussed.
Diao, K; Farmani, R; Fu, G; Astaraie-Imani, M; Ward, S; Butler, D
2014-01-01
Large water distribution systems (WDSs) are networks with both topological and behavioural complexity. Thereby, it is usually difficult to identify the key features of the properties of the system, and subsequently all the critical components within the system for a given purpose of design or control. One way is, however, to more explicitly visualize the network structure and interactions between components by dividing a WDS into a number of clusters (subsystems). Accordingly, this paper introduces a clustering strategy that decomposes WDSs into clusters with stronger internal connections than external connections. The detected cluster layout is very similar to the community structure of the served urban area. As WDSs may expand along with urban development in a community-by-community manner, the correspondingly formed distribution clusters may reveal some crucial configurations of WDSs. For verification, the method is applied to identify all the critical links during firefighting for the vulnerability analysis of a real-world WDS. Moreover, both the most critical pipes and clusters are addressed, given the consequences of pipe failure. Compared with the enumeration method, the method used in this study identifies the same group of the most critical components, and provides similar criticality prioritizations of them in a more computationally efficient time.
Evaluating effective pair and multisite interactions for Ni-Mo system
NASA Astrophysics Data System (ADS)
Banerjee, Rumu H.; Arya, A.; Banerjee, S.
2018-04-01
Cluster expansion (CE) method was used to calculate the energies of various Ni-Mo phases. The clusters comprising of few nearest neighbours can describe any phase of Ni-Mo system by suitable choice of effective pair and multisite interaction parameters (ECI). The ECIs were evaluated in present study by fitting the ground state energies obtained by first principle calculations. The ECIs evaluated for Ni-Mo system were mostly pair clusters followed by triplets and quadruplet clusters with cluster diameters in the range 2.54 - 10.20 Å. The ECI values diminished for multi-body (triplets and quadruplets) clusters as compared to 2-point or pair clusters indicating a good convergence of CE model. With these ECIs the predicted energies of all the Ni-Mo structures across the Mo concentration range 0-100 at% were obtained. The quantitative error in the energies calculated by CE approach and first principle is very small (< 0.026 meV/atom). The appreciable values of 2-point ECIs upto 4th nearest neighbour reveal that two body interactions are dominant in the case of Ni-Mo system. These ECIs are compared with the reported values of compositional dependent effective pair interactions evaluated by first principle as well as by Monte Carlo method.
Spatio-Temporal Analysis of Smear-Positive Tuberculosis in the Sidama Zone, Southern Ethiopia
Dangisso, Mesay Hailu; Datiko, Daniel Gemechu; Lindtjørn, Bernt
2015-01-01
Background Tuberculosis (TB) is a disease of public health concern, with a varying distribution across settings depending on socio-economic status, HIV burden, availability and performance of the health system. Ethiopia is a country with a high burden of TB, with regional variations in TB case notification rates (CNRs). However, TB program reports are often compiled and reported at higher administrative units that do not show the burden at lower units, so there is limited information about the spatial distribution of the disease. We therefore aim to assess the spatial distribution and presence of the spatio-temporal clustering of the disease in different geographic settings over 10 years in the Sidama Zone in southern Ethiopia. Methods A retrospective space–time and spatial analysis were carried out at the kebele level (the lowest administrative unit within a district) to identify spatial and space-time clusters of smear-positive pulmonary TB (PTB). Scan statistics, Global Moran’s I, and Getis and Ordi (Gi*) statistics were all used to help analyze the spatial distribution and clusters of the disease across settings. Results A total of 22,545 smear-positive PTB cases notified over 10 years were used for spatial analysis. In a purely spatial analysis, we identified the most likely cluster of smear-positive PTB in 192 kebeles in eight districts (RR= 2, p<0.001), with 12,155 observed and 8,668 expected cases. The Gi* statistic also identified the clusters in the same areas, and the spatial clusters showed stability in most areas in each year during the study period. The space-time analysis also detected the most likely cluster in 193 kebeles in the same eight districts (RR= 1.92, p<0.001), with 7,584 observed and 4,738 expected cases in 2003-2012. Conclusion The study found variations in CNRs and significant spatio-temporal clusters of smear-positive PTB in the Sidama Zone. The findings can be used to guide TB control programs to devise effective TB control strategies for the geographic areas characterized by the highest CNRs. Further studies are required to understand the factors associated with clustering based on individual level locations and investigation of cases. PMID:26030162
VizieR Online Data Catalog: Catalogue of variable stars in open clusters (Zejda+, 2012)
NASA Astrophysics Data System (ADS)
Zejda, M.; Paunzen, E.; Baumann, B.; Mikulasek, Z.; Liska, J.
2012-08-01
The catalogue of variable stars in open clusters were prepared by cross-matching of Variable Stars Index (http://www.aavso.org/vsx) version Apr 29, 2012 (available online, Cat. B/vsx) against the version 3.1. catalogue of open clusters DAML02 (Dias et al. 2002A&A...389..871D, Cat. B/ocl) available on the website http://www.astro.iag.usp.br/~wilton. The open clusters were divided into two categories according to their size, where the limiting diameter was 60 arcmin. The list of all suspected variables and variable stars located within the fields of open clusters up to two times of given cluster radius were generated (Table 1). 8938 and 9127 variable stars are given in 461 "smaller" and 74 "larger" clusters, respectively. All found variable stars were matched against the PPMXL catalog of positions and proper motions within the ICRS (Roeser et al., 2010AJ....139.2440R, Cat. I/317). Proper motion data were included in our catalogue. Unfortunately, a homogeneous data set of mean cluster proper motions has not been available until now. Therefore we used the following sources (sorted alphabetically) to compile a new catalogue: Baumgardt et al. (2000, Cat. J/A+AS/146/251): based on the Hipparcos catalogue Beshenov & Loktin (2004A&AT...23..103B): based on the Tycho-2 catalogue Dias et al. (2001, Cat. J/A+A/376/441, 2002A&A...389..871D, Cat. B/ocl): based on the Tycho-2 catalogue Dias et al. (2006, Cat. J/A+A/446/949): based on the UCAC2 catalog (Zacharias et al., 2004AJ....127.3043Z, Cat. I/289) Frinchaboy & Majewski (2008, Cat. J/AJ/136/118): based on the Tycho-2 catalogue Kharchenko et al. (2005, J/A+A/438/1163): based on the ASCC2.5 catalogue (Kharchenko, 2001KFNT...17..409K, Cat. I/280) Krone-Martins et al. (2010, Cat. J/A+A/516/A3): based on the Bordeaux PM2000 proper motion catalogue (Ducourant et al., 2006A&A...448.1235D, Cat. I/300) Robichon et al. (1999, Cat. J/A+A/345/471): based on the Hipparcos catalogue van Leeuwen (2009A&A...497..209V): based on the new Hipparcos catalogue. In total, a catalogue of proper motions for 879 open clusters (Table 2), from which 436 have more than one available measurement, was compiled. (3 data files).
Identification of uncommon objects in containers
Bremer, Peer-Timo; Kim, Hyojin; Thiagarajan, Jayaraman J.
2017-09-12
A system for identifying in an image an object that is commonly found in a collection of images and for identifying a portion of an image that represents an object based on a consensus analysis of segmentations of the image. The system collects images of containers that contain objects for generating a collection of common objects within the containers. To process the images, the system generates a segmentation of each image. The image analysis system may also generate multiple segmentations for each image by introducing variations in the selection of voxels to be merged into a segment. The system then generates clusters of the segments based on similarity among the segments. Each cluster represents a common object found in the containers. Once the clustering is complete, the system may be used to identify common objects in images of new containers based on similarity between segments of images and the clusters.
Pinto, U; Maheshwari, B L; Ollerton, R L
2013-06-01
The Hawkesbury-Nepean River (HNR) system in South-Eastern Australia is the main source of water supply for the Sydney Metropolitan area and is one of the more complex river systems due to the influence of urbanisation and other activities in the peri-urban landscape through which it flows. The long-term monitoring of river water quality is likely to suffer from data gaps due to funding cuts, changes in priority and related reasons. Nevertheless, we need to assess river health based on the available information. In this study, we demonstrated how the Factor Analysis (FA), Hierarchical Agglomerative Cluster Analysis (HACA) and Trend Analysis (TA) can be applied to evaluate long-term historic data sets. Six water quality parameters, viz., temperature, chlorophyll-a, dissolved oxygen, oxides of nitrogen, suspended solids and reactive silicates, measured at weekly intervals between 1985 and 2008 at 12 monitoring stations located along the 300 km length of the HNR system were evaluated to understand the human and natural influences on the river system in a peri-urban landscape. The application of FA extracted three latent factors which explained more than 70 % of the total variance of the data and related to the 'bio-geographical', 'natural' and 'nutrient pollutant' dimensions of the HNR system. The bio-geographical and nutrient pollution factors more likely related to the direct influence of changes and activities of peri-urban natures and accounted for approximately 50 % of variability in water quality. The application of HACA indicated two major clusters representing clean and polluted zones of the river. On the spatial scale, one cluster was represented by the upper and lower sections of the river (clean zone) and accounted for approximately 158 km of the river. The other cluster was represented by the middle section (polluted zone) with a length of approximately 98 km. Trend Analysis indicated how the point sources influence river water quality on spatio-temporal scales, taking into account the various effects of nutrient and other pollutant loads from sewerage effluents, agriculture and other point and non-point sources along the river and major tributaries of the HNR. Over the past 26 years, water temperature has significantly increased while suspended solids have significantly decreased (p < 0.05). The analysis of water quality data through FA, HACA and TA helped to characterise the key sections and cluster the key water quality variables of the HNR system. The insights gained from this study have the potential to improve the effectiveness of river health-monitoring programs in terms of cost, time and effort, particularly in a peri-urban context.
Structure and substructure analysis of DAFT/FADA galaxy clusters in the [0.4–0.9] redshift range
DOE Office of Scientific and Technical Information (OSTI.GOV)
Guennou, L.; et al.
2014-01-17
Context. The DAFT/FADA survey is based on the study of ~90 rich(masses found in the literature >2 x 10^14 M_⊙)and moderately distant clusters (redshifts 0.4 < z < 0.9), all withHST imaging data available. This survey has two main objectives: to constrain dark energy(DE) using weak lensing tomography on galaxy clusters and to build a database (deepmulti-band imaging allowing photometric redshift estimates, spectroscopic data, X-raydata) of rich distant clusters to study their properties.
Gene context analysis in the Integrated Microbial Genomes (IMG) data management system.
Mavromatis, Konstantinos; Chu, Ken; Ivanova, Natalia; Hooper, Sean D; Markowitz, Victor M; Kyrpides, Nikos C
2009-11-24
Computational methods for determining the function of genes in newly sequenced genomes have been traditionally based on sequence similarity to genes whose function has been identified experimentally. Function prediction methods can be extended using gene context analysis approaches such as examining the conservation of chromosomal gene clusters, gene fusion events and co-occurrence profiles across genomes. Context analysis is based on the observation that functionally related genes are often having similar gene context and relies on the identification of such events across phylogenetically diverse collection of genomes. We have used the data management system of the Integrated Microbial Genomes (IMG) as the framework to implement and explore the power of gene context analysis methods because it provides one of the largest available genome integrations. Visualization and search tools to facilitate gene context analysis have been developed and applied across all publicly available archaeal and bacterial genomes in IMG. These computations are now maintained as part of IMG's regular genome content update cycle. IMG is available at: http://img.jgi.doe.gov.
Local pulmonary structure classification for computer-aided nodule detection
NASA Astrophysics Data System (ADS)
Bahlmann, Claus; Li, Xianlin; Okada, Kazunori
2006-03-01
We propose a new method of classifying the local structure types, such as nodules, vessels, and junctions, in thoracic CT scans. This classification is important in the context of computer aided detection (CAD) of lung nodules. The proposed method can be used as a post-process component of any lung CAD system. In such a scenario, the classification results provide an effective means of removing false positives caused by vessels and junctions thus improving overall performance. As main advantage, the proposed solution transforms the complex problem of classifying various 3D topological structures into much simpler 2D data clustering problem, to which more generic and flexible solutions are available in literature, and which is better suited for visualization. Given a nodule candidate, first, our solution robustly fits an anisotropic Gaussian to the data. The resulting Gaussian center and spread parameters are used to affine-normalize the data domain so as to warp the fitted anisotropic ellipsoid into a fixed-size isotropic sphere. We propose an automatic method to extract a 3D spherical manifold, containing the appropriate bounding surface of the target structure. Scale selection is performed by a data driven entropy minimization approach. The manifold is analyzed for high intensity clusters, corresponding to protruding structures. Techniques involve EMclustering with automatic mode number estimation, directional statistics, and hierarchical clustering with a modified Bhattacharyya distance. The estimated number of high intensity clusters explicitly determines the type of pulmonary structures: nodule (0), attached nodule (1), vessel (2), junction (>3). We show accurate classification results for selected examples in thoracic CT scans. This local procedure is more flexible and efficient than current state of the art and will help to improve the accuracy of general lung CAD systems.
NASA Astrophysics Data System (ADS)
Hoag, A.; Huang, K.-H.; Treu, T.; Bradač, M.; Schmidt, K. B.; Wang, X.; Brammer, G. B.; Broussard, A.; Amorin, R.; Castellano, M.; Fontana, A.; Merlin, E.; Schrabback, T.; Trenti, M.; Vulcani, B.
2016-11-01
We present a model using both strong and weak gravitational lensing of the galaxy cluster MACS J0416.1-2403, constrained using spectroscopy from the Grism Lens-Amplified Survey from Space (GLASS) and Hubble Frontier Fields (HFF) imaging data. We search for emission lines in known multiply imaged sources in the GLASS spectra, obtaining secure spectroscopic redshifts of 30 multiple images belonging to 15 distinct source galaxies. The GLASS spectra provide the first spectroscopic measurements for five of the source galaxies. The weak lensing signal is acquired from 884 galaxies in the F606W HFF image. By combining the weak lensing constraints with 15 multiple image systems with spectroscopic redshifts and nine multiple image systems with photometric redshifts, we reconstruct the gravitational potential of the cluster on an adaptive grid. The resulting map of total mass density is compared with a map of stellar mass density obtained from the deep Spitzer Frontier Fields imaging data to study the relative distribution of stellar and total mass in the cluster. We find that the projected stellar mass to total mass ratio, f ⋆, varies considerably with the stellar surface mass density. The mean projected stellar mass to total mass ratio is < {f}\\star > =0.009+/- 0.003 (stat.), but with a systematic error as large as 0.004-0.005, dominated by the choice of the initial mass function. We find agreement with several recent measurements of f ⋆ in massive cluster environments. The lensing maps of convergence, shear, and magnification are made available to the broader community in the standard HFF format.
QCS : a system for querying, clustering, and summarizing documents.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dunlavy, Daniel M.
2006-08-01
Information retrieval systems consist of many complicated components. Research and development of such systems is often hampered by the difficulty in evaluating how each particular component would behave across multiple systems. We present a novel hybrid information retrieval system--the Query, Cluster, Summarize (QCS) system--which is portable, modular, and permits experimentation with different instantiations of each of the constituent text analysis components. Most importantly, the combination of the three types of components in the QCS design improves retrievals by providing users more focused information organized by topic. We demonstrate the improved performance by a series of experiments using standard test setsmore » from the Document Understanding Conferences (DUC) along with the best known automatic metric for summarization system evaluation, ROUGE. Although the DUC data and evaluations were originally designed to test multidocument summarization, we developed a framework to extend it to the task of evaluation for each of the three components: query, clustering, and summarization. Under this framework, we then demonstrate that the QCS system (end-to-end) achieves performance as good as or better than the best summarization engines. Given a query, QCS retrieves relevant documents, separates the retrieved documents into topic clusters, and creates a single summary for each cluster. In the current implementation, Latent Semantic Indexing is used for retrieval, generalized spherical k-means is used for the document clustering, and a method coupling sentence ''trimming'', and a hidden Markov model, followed by a pivoted QR decomposition, is used to create a single extract summary for each cluster. The user interface is designed to provide access to detailed information in a compact and useful format. Our system demonstrates the feasibility of assembling an effective IR system from existing software libraries, the usefulness of the modularity of the design, and the value of this particular combination of modules.« less
QCS: a system for querying, clustering and summarizing documents.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dunlavy, Daniel M.; Schlesinger, Judith D.; O'Leary, Dianne P.
2006-10-01
Information retrieval systems consist of many complicated components. Research and development of such systems is often hampered by the difficulty in evaluating how each particular component would behave across multiple systems. We present a novel hybrid information retrieval system--the Query, Cluster, Summarize (QCS) system--which is portable, modular, and permits experimentation with different instantiations of each of the constituent text analysis components. Most importantly, the combination of the three types of components in the QCS design improves retrievals by providing users more focused information organized by topic. We demonstrate the improved performance by a series of experiments using standard test setsmore » from the Document Understanding Conferences (DUC) along with the best known automatic metric for summarization system evaluation, ROUGE. Although the DUC data and evaluations were originally designed to test multidocument summarization, we developed a framework to extend it to the task of evaluation for each of the three components: query, clustering, and summarization. Under this framework, we then demonstrate that the QCS system (end-to-end) achieves performance as good as or better than the best summarization engines. Given a query, QCS retrieves relevant documents, separates the retrieved documents into topic clusters, and creates a single summary for each cluster. In the current implementation, Latent Semantic Indexing is used for retrieval, generalized spherical k-means is used for the document clustering, and a method coupling sentence 'trimming', and a hidden Markov model, followed by a pivoted QR decomposition, is used to create a single extract summary for each cluster. The user interface is designed to provide access to detailed information in a compact and useful format. Our system demonstrates the feasibility of assembling an effective IR system from existing software libraries, the usefulness of the modularity of the design, and the value of this particular combination of modules.« less
UV-light-driven prebiotic synthesis of iron-sulfur clusters
NASA Astrophysics Data System (ADS)
Bonfio, Claudia; Valer, Luca; Scintilla, Simone; Shah, Sachin; Evans, David J.; Jin, Lin; Szostak, Jack W.; Sasselov, Dimitar D.; Sutherland, John D.; Mansy, Sheref S.
2017-12-01
Iron-sulfur clusters are ancient cofactors that play a fundamental role in metabolism and may have impacted the prebiotic chemistry that led to life. However, it is unclear whether iron-sulfur clusters could have been synthesized on prebiotic Earth. Dissolved iron on early Earth was predominantly in the reduced ferrous state, but ferrous ions alone cannot form polynuclear iron-sulfur clusters. Similarly, free sulfide may not have been readily available. Here we show that UV light drives the synthesis of [2Fe-2S] and [4Fe-4S] clusters through the photooxidation of ferrous ions and the photolysis of organic thiols. Iron-sulfur clusters coordinate to and are stabilized by a wide range of cysteine-containing peptides and the assembly of iron-sulfur cluster-peptide complexes can take place within model protocells in a process that parallels extant pathways. Our experiments suggest that iron-sulfur clusters may have formed easily on early Earth, facilitating the emergence of an iron-sulfur-cluster-dependent metabolism.
Magnetic switching in Crx (x = 2-8) and its oxide cluster series
NASA Astrophysics Data System (ADS)
Shah, Esha V.; Roy, Debesh R.
2018-04-01
First principle studies on the magnetic ground state structure, noncollinearity, binding energy and various electronic properties of a series of Crx (x = 2-8) clusters are performed. In order to investigate the effect of ionization and oxidation on the clusters, the anionic (Crx-) and oxidized (CrxO2) analogues of those clusters are also studied in detail. To calculate adiabatic electron affinity of CrxO2 clusters, additionally CrxO2- analogues are also included in the present work. An interesting even (non-magnetic) - odd (magnetic) feature in the considered cluster series has been noticed. The similar behavior is also reflected from their electronic properties as even (less reactive) - odd (more reactive). The most of the neutral and ionized chromium clusters, viz., Crx and Crx- are found to be noncollinear in their ground states, whereas oxidation stabilized those clusters into the collinear spin alignments. The bond distances of Cr clusters are found to be close with available experimental studies.
Distribution and diversity of bacterial secretion systems across metagenomic datasets.
Barret, Matthieu; Egan, Frank; O'Gara, Fergal
2013-02-01
Bacteria can manipulate their surrounding environment through the secretion of proteins into other living organisms and into the extracellular milieu. In Gram stain negative bacteria this process is mediated by different types of secretion systems from type I through type VI secretion system (T1SS-T6SS). In this study the prevalence of these secretion systems in 312 publicly available microbiomes derived from a wide range of ecosystems was investigated by a gene-centric approach. Our analysis demonstrates that some secretion systems are over-represented in some specific samples. In addition, some T3SS and T6SS phylogenetic clusters were specifically enriched in particular ecological niches, which could indicate specific bacterial adaptation to these environments. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.
A self-organizing neural network for job scheduling in distributed systems
NASA Astrophysics Data System (ADS)
Newman, Harvey B.; Legrand, Iosif C.
2001-08-01
The aim of this work is to describe a possible approach for the optimization of the job scheduling in large distributed systems, based on a self-organizing Neural Network. This dynamic scheduling system should be seen as adaptive middle layer software, aware of current available resources and making the scheduling decisions using the "past experience." It aims to optimize job specific parameters as well as the resource utilization. The scheduling system is able to dynamically learn and cluster information in a large dimensional parameter space and at the same time to explore new regions in the parameters space. This self-organizing scheduling system may offer a possible solution to provide an effective use of resources for the off-line data processing jobs for future HEP experiments.
Hsueh, Po-Ren; Lee, Tai-Fen; Du, Shin-Hei; Teng, Shih-Hua; Liao, Chun-Hsing; Sheng, Wang-Hui; Teng, Lee-Jene
2014-07-01
We evaluated whether the Bruker Biotyper matrix-associated laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) system provides accurate species-level identifications of 147 isolates of aerobically growing Gram-positive rods (GPRs). The bacterial isolates included Nocardia (n = 74), Listeria (n = 39), Kocuria (n = 15), Rhodococcus (n = 10), Gordonia (n = 7), and Tsukamurella (n = 2) species, which had all been identified by conventional methods, molecular methods, or both. In total, 89.7% of Listeria monocytogenes, 80% of Rhodococcus species, 26.7% of Kocuria species, and 14.9% of Nocardia species (n = 11, all N. nova and N. otitidiscaviarum) were correctly identified to the species level (score values, ≥ 2.0). A clustering analysis of spectra generated by the Bruker Biotyper identified six clusters of Nocardia species, i.e., cluster 1 (N. cyriacigeorgica), cluster 2 (N. brasiliensis), cluster 3 (N. farcinica), cluster 4 (N. puris), cluster 5 (N. asiatica), and cluster 6 (N. beijingensis), based on the six peaks generated by ClinProTools with the genetic algorithm, i.e., m/z 2,774.477 (cluster 1), m/z 5,389.792 (cluster 2), m/z 6,505.720 (cluster 3), m/z 5,428.795 (cluster 4), m/z 6,525.326 (cluster 5), and m/z 16,085.216 (cluster 6). Two clusters of L. monocytogenes spectra were also found according to the five peaks, i.e., m/z 5,594.85, m/z 6,184.39, and m/z 11,187.31, for cluster 1 (serotype 1/2a) and m/z 5,601.21 and m/z 11,199.33 for cluster 2 (serotypes 1/2b and 4b). The Bruker Biotyper system was unable to accurately identify Nocardia (except for N. nova and N. otitidiscaviarum), Tsukamurella, or Gordonia species. Continuous expansion of the MALDI-TOF MS databases to include more GPRs is necessary. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Accreting Black Hole Binaries in Globular Clusters
NASA Astrophysics Data System (ADS)
Kremer, Kyle; Chatterjee, Sourav; Rodriguez, Carl L.; Rasio, Frederic A.
2018-01-01
We explore the formation of mass-transferring binary systems containing black holes (BHs) within globular clusters (GC). We show that it is possible to form mass-transferring BH binaries with main sequence, giant, and white dwarf companions with a variety of orbital parameters in GCs spanning a large range in present-day properties. All mass-transferring BH binaries found in our models at late times are dynamically created. The BHs in these systems experienced a median of ∼30 dynamical encounters within the cluster before and after acquiring the donor. Furthermore, we show that the presence of mass-transferring BH systems has little correlation with the total number of BHs within the cluster at any time. This is because the net rate of formation of BH–non-BH binaries in a cluster is largely independent of the total number of retained BHs. Our results suggest that the detection of a mass-transferring BH binary in a GC does not necessarily indicate that the host cluster contains a large BH population.
Brighter galaxy bias: underestimating the velocity dispersions of galaxy clusters
NASA Astrophysics Data System (ADS)
Old, L.; Gray, M. E.; Pearce, F. R.
2013-09-01
We study the systematic bias introduced when selecting the spectroscopic redshifts of brighter cluster galaxies to estimate the velocity dispersion of galaxy clusters from both simulated and observational galaxy catalogues. We select clusters with Ngal ≥ 50 at five low-redshift snapshots from the publicly available De Lucia & Blaziot semi-analytic model galaxy catalogue. Clusters are also selected from the Tempel Sloan Digital Sky Survey Data Release 8 groups and clusters catalogue across the redshift range 0.021 ≤ z ≤ 0.098. We employ various selection techniques to explore whether the velocity dispersion bias is simply due to a lack of dynamical information or is the result of an underlying physical process occurring in the cluster, for example, dynamical friction experienced by the brighter cluster members. The velocity dispersions of the parent dark matter (DM) haloes are compared to the galaxy cluster dispersions and the stacked distribution of DM particle velocities is examined alongside the corresponding galaxy velocity distribution. We find a clear bias between the halo and the semi-analytic galaxy cluster velocity dispersion on the order of σgal/σDM ˜ 0.87-0.95 and a distinct difference in the stacked galaxy and DM particle velocities distribution. We identify a systematic underestimation of the velocity dispersions when imposing increasing absolute I-band magnitude limits. This underestimation is enhanced when using only the brighter cluster members for dynamical analysis on the order of 5-35 per cent, indicating that dynamical friction is a serious source of bias when using galaxy velocities as tracers of the underlying gravitational potential. In contrast to the literature we find that the resulting bias is not only halo mass dependent but also that the nature of the dependence changes according to the galaxy selection strategy. We make a recommendation that, in the realistic case of limited availability of spectral observations, a strictly magnitude-limited sample should be avoided to ensure an unbiased estimate of the velocity dispersion.
NASA Astrophysics Data System (ADS)
Šuhada, R.; Fassbender, R.; Nastasi, A.; Böhringer, H.; de Hoon, A.; Pierini, D.; Santos, J. S.; Rosati, P.; Mühlegger, M.; Quintana, H.; Schwope, A. D.; Lamer, G.; Kohnert, J.; Pratt, G. W.
2011-06-01
Context. Multi-wavelength surveys for clusters of galaxies are opening a window on the elusive high-redshift (z > 1) cluster population. Well controlled statistical samples of distant clusters will enable us to answer questions about their cosmological context, early assembly phases and the thermodynamical evolution of the intracluster medium. Aims: We report on the detection of two z > 1 systems, XMMU J0302.2-0001 and XMMU J1532.2-0836, as part of the XMM-Newton Distant Cluster Project (XDCP) sample. We investigate the nature of the sources, measure their spectroscopic redshift and determine their basic physical parameters. Methods: The results of the present paper are based on the analysis of XMM-Newton archival data, optical/near-infrared imaging and deep optical follow-up spectroscopy of the clusters. Results: We confirm the X-ray source XMMU J0302.2-0001 as a gravitationally bound, bona fide cluster of galaxies at spectroscopic redshift z = 1.185. We estimate its M500 mass to (1.6 ± 0.3) × 1014 M⊙ from its measured X-ray luminosity. This ranks the cluster among intermediate mass system. In the case of XMMU J1532.2-0836 we find the X-ray detection to be coincident with a dynamically bound system of galaxies at z = 1.358. Optical spectroscopy reveals the presence of a central active galactic nucleus, which can be a dominant source of the detected X-ray emission from this system. We provide upper limits of X-ray parameters for the system and discuss cluster identification challenges in the high-redshift low-mass cluster regime. A third, intermediate redshift (z = 0.647) cluster, XMMU J0302.1-0000, is serendipitously detected in the same field as XMMU J0302.2-0001. We provide its analysis as well. Based on observations obtained with ESO Telescopes at the Paranal Observatory under program ID 080.A-0659 and 081.A-0312, observations collected at the Centro Astrnómico Hispano Alemán (CAHA) at Calar Alto, Spain operated jointly by the Max-Planck Institut für Astronomie and the Instituto de Astrofísica de Andalucía (CSIC). X-ray observations were obtained by XMM-Newton.
Ku, Wai Lim; Girvan, Michelle; Ott, Edward
2015-12-01
In this paper, we study dynamical systems in which a large number N of identical Landau-Stuart oscillators are globally coupled via a mean-field. Previously, it has been observed that this type of system can exhibit a variety of different dynamical behaviors. These behaviors include time periodic cluster states in which each oscillator is in one of a small number of groups for which all oscillators in each group have the same state which is different from group to group, as well as a behavior in which all oscillators have different states and the macroscopic dynamics of the mean field is chaotic. We argue that this second type of behavior is "extensive" in the sense that the chaotic attractor in the full phase space of the system has a fractal dimension that scales linearly with N and that the number of positive Lyapunov exponents of the attractor also scales linearly with N. An important focus of this paper is the transition between cluster states and extensive chaos as the system is subjected to slow adiabatic parameter change. We observe discontinuous transitions between the cluster states (which correspond to low dimensional dynamics) and the extensively chaotic states. Furthermore, examining the cluster state, as the system approaches the discontinuous transition to extensive chaos, we find that the oscillator population distribution between the clusters continually evolves so that the cluster state is always marginally stable. This behavior is used to reveal the mechanism of the discontinuous transition. We also apply the Kaplan-Yorke formula to study the fractal structure of the extensively chaotic attractors.
NASA Astrophysics Data System (ADS)
Ku, Wai Lim; Girvan, Michelle; Ott, Edward
2015-12-01
In this paper, we study dynamical systems in which a large number N of identical Landau-Stuart oscillators are globally coupled via a mean-field. Previously, it has been observed that this type of system can exhibit a variety of different dynamical behaviors. These behaviors include time periodic cluster states in which each oscillator is in one of a small number of groups for which all oscillators in each group have the same state which is different from group to group, as well as a behavior in which all oscillators have different states and the macroscopic dynamics of the mean field is chaotic. We argue that this second type of behavior is "extensive" in the sense that the chaotic attractor in the full phase space of the system has a fractal dimension that scales linearly with N and that the number of positive Lyapunov exponents of the attractor also scales linearly with N. An important focus of this paper is the transition between cluster states and extensive chaos as the system is subjected to slow adiabatic parameter change. We observe discontinuous transitions between the cluster states (which correspond to low dimensional dynamics) and the extensively chaotic states. Furthermore, examining the cluster state, as the system approaches the discontinuous transition to extensive chaos, we find that the oscillator population distribution between the clusters continually evolves so that the cluster state is always marginally stable. This behavior is used to reveal the mechanism of the discontinuous transition. We also apply the Kaplan-Yorke formula to study the fractal structure of the extensively chaotic attractors.
Quantitative radiomic profiling of glioblastoma represents transcriptomic expression.
Kong, Doo-Sik; Kim, Junhyung; Ryu, Gyuha; You, Hye-Jin; Sung, Joon Kyung; Han, Yong Hee; Shin, Hye-Mi; Lee, In-Hee; Kim, Sung-Tae; Park, Chul-Kee; Choi, Seung Hong; Choi, Jeong Won; Seol, Ho Jun; Lee, Jung-Il; Nam, Do-Hyun
2018-01-19
Quantitative imaging biomarkers have increasingly emerged in the field of research utilizing available imaging modalities. We aimed to identify good surrogate radiomic features that can represent genetic changes of tumors, thereby establishing noninvasive means for predicting treatment outcome. From May 2012 to June 2014, we retrospectively identified 65 patients with treatment-naïve glioblastoma with available clinical information from the Samsung Medical Center data registry. Preoperative MR imaging data were obtained for all 65 patients with primary glioblastoma. A total of 82 imaging features including first-order statistics, volume, and size features, were semi-automatically extracted from structural and physiologic images such as apparent diffusion coefficient and perfusion images. Using commercially available software, NordicICE, we performed quantitative imaging analysis and collected the dataset composed of radiophenotypic parameters. Unsupervised clustering methods revealed that the radiophenotypic dataset was composed of three clusters. Each cluster represented a distinct molecular classification of glioblastoma; classical type, proneural and neural types, and mesenchymal type. These clusters also reflected differential clinical outcomes. We found that extracted imaging signatures does not represent copy number variation and somatic mutation. Quantitative radiomic features provide a potential evidence to predict molecular phenotype and treatment outcome. Radiomic profiles represents transcriptomic phenotypes more well.
The Formation of Cluster Populations Through Direct Galaxy Collisions
NASA Astrophysics Data System (ADS)
Peterson, Bradley W.; Smith, Beverly J.; Struck, Curtis
2016-01-01
Much progress has been made on the question of how globular clusters form. In particular, the study of extragalactic populations of young, high-mass clusters ("super star clusters") has revealed a class of objects can evolve into globular clusters. The process by which these clusters form, and how many survive long enough to become globular clusters, is not wholly understood. Here, we use new data on the colliding galaxy system Arp 261 to investigate the possibility that young, massive clusters form in greater numbers during direct galaxy collisions, compared to less direct tidal collisions.
Composition, morphology, and growth of clusters in a gas of particles with random interactions
NASA Astrophysics Data System (ADS)
Azizi, Itay; Rabin, Yitzhak
2018-03-01
We use Langevin dynamics simulations to study the growth kinetics and the steady-state properties of condensed clusters in a dilute two-dimensional system of particles that are all different (APD) in the sense that each particle is characterized by a randomly chosen interaction parameter. The growth exponents, the transition temperatures, and the steady-state properties of the clusters and of the surrounding gas phase are obtained and compared with those of one-component systems. We investigate the fractionation phenomenon, i.e., how particles of different identities are distributed between the coexisting mother (gas) and daughter (clusters) phases. We study the local organization of particles inside clusters, according to their identity—neighbourhood identity ordering (NIO)—and compare the results with those of previous studies of NIO in dense APD systems.
An Integrated workflow for phenazine biosynthetic gene cluster discovery and characterization
USDA-ARS?s Scientific Manuscript database
Increasing availability of new genomes and putative biosynthetic gene clusters (BGCs) has extended the opportunity to access novel chemical diversity for agriculture, medicine, environmental and industrial purposes. However, functional characterization of BGCs through heterologous expression is limi...
Chiang-Ni, Chuan; Zheng, Po-Xing; Wang, Shu-Ying; Tsai, Pei-Jane; Chuang, Woei-Jer; Lin, Yee-Shin; Liu, Ching-Chuan; Wu, Jiunn-Jong
2016-01-01
emm typing is the most widely used molecular typing method for the human pathogen Streptococcus pyogenes (group A streptococcus [GAS]). emm typing is based on a small variable region of the emm gene; however, the emm cluster typing system defines GAS types according to the nearly complete sequence of the emm gene. Therefore, emm cluster typing is considered to provide more information regarding the functional and structural properties of M proteins in different emm types of GAS. In the present study, 677 isolates collected between 1994 and 2008 in a hospital in southern Taiwan were analyzed by the emm cluster typing system. emm clusters A-C4, E1, E6, and A-C3 were the most prevalent emm cluster types and accounted for 67.4% of total isolates. emm clusters A-C4 and E1 were associated with noninvasive diseases, whereas E6 was significantly associated with both invasive and noninvasive manifestations. In addition, emm clusters D4, E2, and E3 were significantly associated with invasive manifestations. Furthermore, we found that the functional properties of M protein, including low fibrinogen-binding and high IgG-binding activities, were correlated significantly with invasive manifestations. In summary, the present study provides updated epidemiological information on GAS emm cluster types in southern Taiwan. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pavanello, Michele; Van Voorhis, Troy; Visscher, Lucas
2013-02-07
Quantum-mechanical methods that are both computationally fast and accurate are not yet available for electronic excitations having charge transfer character. In this work, we present a significant step forward towards this goal for those charge transfer excitations that take place between non-covalently bound molecules. In particular, we present a method that scales linearly with the number of non-covalently bound molecules in the system and is based on a two-pronged approach: The molecular electronic structure of broken-symmetry charge-localized states is obtained with the frozen density embedding formulation of subsystem density-functional theory; subsequently, in a post-SCF calculation, the full-electron Hamiltonian and overlapmore » matrix elements among the charge-localized states are evaluated with an algorithm which takes full advantage of the subsystem DFT density partitioning technique. The method is benchmarked against coupled-cluster calculations and achieves chemical accuracy for the systems considered for intermolecular separations ranging from hydrogen-bond distances to tens of Angstroms. Numerical examples are provided for molecular clusters comprised of up to 56 non-covalently bound molecules.« less
Pavanello, Michele; Van Voorhis, Troy; Visscher, Lucas; Neugebauer, Johannes
2013-02-07
Quantum-mechanical methods that are both computationally fast and accurate are not yet available for electronic excitations having charge transfer character. In this work, we present a significant step forward towards this goal for those charge transfer excitations that take place between non-covalently bound molecules. In particular, we present a method that scales linearly with the number of non-covalently bound molecules in the system and is based on a two-pronged approach: The molecular electronic structure of broken-symmetry charge-localized states is obtained with the frozen density embedding formulation of subsystem density-functional theory; subsequently, in a post-SCF calculation, the full-electron Hamiltonian and overlap matrix elements among the charge-localized states are evaluated with an algorithm which takes full advantage of the subsystem DFT density partitioning technique. The method is benchmarked against coupled-cluster calculations and achieves chemical accuracy for the systems considered for intermolecular separations ranging from hydrogen-bond distances to tens of Ångstroms. Numerical examples are provided for molecular clusters comprised of up to 56 non-covalently bound molecules.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pathak, Himadri, E-mail: hmdrpthk@gmail.com; Sasmal, Sudip, E-mail: sudipsasmal.chem@gmail.com; Vaval, Nayana
2016-08-21
The open-shell reference relativistic equation-of-motion coupled-cluster method within its four-component description is successfully implemented with the consideration of single- and double- excitation approximations using the Dirac-Coulomb Hamiltonian. At the first attempt, the implemented method is employed to calculate ionization potential value of heavy atomic (Ag, Cs, Au, Fr, and Lr) and molecular (HgH and PbF) systems, where the effect of relativity does really matter to obtain highly accurate results. Not only the relativistic effect but also the effect of electron correlation is crucial in these heavy atomic and molecular systems. To justify the fact, we have taken two further approximationsmore » in the four-component relativistic equation-of-motion framework to quantify how the effect of electron correlation plays a role in the calculated values at different levels of theory. All these calculated results are compared with the available experimental data as well as with other theoretically calculated values to judge the extent of accuracy obtained in our calculations.« less
NASA Astrophysics Data System (ADS)
Shekar, B. H.; Bhat, S. S.
2017-05-01
Locating the boundary parameters of pupil and iris and segmenting the noise free iris portion are the most challenging phases of an automated iris recognition system. In this paper, we have presented person authentication frame work which uses particle swarm optimization (PSO) to locate iris region and circular hough transform (CHT) to device the boundary parameters. To undermine the effect of the noise presented in the segmented iris region we have divided the candidate region into N patches and used Fuzzy c-means clustering (FCM) to classify the patches into best iris region and not so best iris region (noisy region) based on the probability density function of each patch. Weighted mean Hammimng distance is adopted to find the dissimilarity score between the two candidate irises. We have used Log-Gabor, Riesz and Taylor's series expansion (TSE) filters and combinations of these three for iris feature extraction. To justify the feasibility of the proposed method, we experimented on the three publicly available data sets IITD, MMU v-2 and CASIA v-4 distance.
Bolstad, Heather M; Botelho, Danielle J; Wood, Matthew J
2010-10-01
Fe-S cluster biogenesis is of interest to many fields, including bioenergetics and gene regulation. The CSD system is one of three Fe-S cluster biogenesis systems in E. coli and is comprised of the cysteine desulfurase CsdA, the sulfur acceptor protein CsdE, and the E1-like protein CsdL. The biological role, biochemical mechanism, and protein targets of the system remain uncharacterized. Here we present that the active site CsdE C61 has a lowered pK(a) value of 6.5, which is nearly identical to that of C51 in the homologous SufE protein and which is likely critical for its function. We observed that CsdE forms disulfide bonds with multiple proteins and identified the proteins that copurify with CsdE. The identification of Fe-S proteins and both putative and established Fe-S cluster assembly (ErpA, glutaredoxin-3, glutaredoxin-4) and sulfur trafficking (CsdL, YchN) proteins supports the two-pathway model, in which the CSD system is hypothesized to synthesize both Fe-S clusters and other sulfur-containing cofactors. We suggest that the identified Fe-S cluster assembly proteins may be the scaffold and/or shuttle proteins for the CSD system. By comparison with previous analysis of SufE, we demonstrate that there is some overlap in the CsdE and SufE interactomes.
Cluster: A New Application for Spatial Analysis of Pixelated Data for Epiphytotics.
Nelson, Scot C; Corcoja, Iulian; Pethybridge, Sarah J
2017-12-01
Spatial analysis of epiphytotics is essential to develop and test hypotheses about pathogen ecology, disease dynamics, and to optimize plant disease management strategies. Data collection for spatial analysis requires substantial investment in time to depict patterns in various frames and hierarchies. We developed a new approach for spatial analysis of pixelated data in digital imagery and incorporated the method in a stand-alone desktop application called Cluster. The user isolates target entities (clusters) by designating up to 24 pixel colors as nontargets and moves a threshold slider to visualize the targets. The app calculates the percent area occupied by targeted pixels, identifies the centroids of targeted clusters, and computes the relative compass angle of orientation for each cluster. Users can deselect anomalous clusters manually and/or automatically by specifying a size threshold value to exclude smaller targets from the analysis. Up to 1,000 stochastic simulations randomly place the centroids of each cluster in ranked order of size (largest to smallest) within each matrix while preserving their calculated angles of orientation for the long axes. A two-tailed probability t test compares the mean inter-cluster distances for the observed versus the values derived from randomly simulated maps. This is the basis for statistical testing of the null hypothesis that the clusters are randomly distributed within the frame of interest. These frames can assume any shape, from natural (e.g., leaf) to arbitrary (e.g., a rectangular or polygonal field). Cluster summarizes normalized attributes of clusters, including pixel number, axis length, axis width, compass orientation, and the length/width ratio, available to the user as a downloadable spreadsheet. Each simulated map may be saved as an image and inspected. Provided examples demonstrate the utility of Cluster to analyze patterns at various spatial scales in plant pathology and ecology and highlight the limitations, trade-offs, and considerations for the sensitivities of variables and the biological interpretations of results. The Cluster app is available as a free download for Apple computers at iTunes, with a link to a user guide website.
R144: a very massive binary likely ejected from R136 through a binary-binary encounter
NASA Astrophysics Data System (ADS)
Oh, Seungkyung; Kroupa, Pavel; Banerjee, Sambaran
2014-02-01
R144 is a recently confirmed very massive, spectroscopic binary which appears isolated from the core of the massive young star cluster R136. The dynamical ejection hypothesis as an origin for its location is claimed improbable by Sana et al. due to its binary nature and high mass. We demonstrate here by means of direct N-body calculations that a very massive binary system can be readily dynamically ejected from an R136-like cluster, through a close encounter with a very massive system. One out of four N-body cluster models produces a dynamically ejected very massive binary system with a mass comparable to R144. The system has a system mass of ≈355 M⊙ and is located at 36.8 pc from the centre of its parent cluster, moving away from the cluster with a velocity of 57 km s-1 at 2 Myr as a result of a binary-binary interaction. This implies that R144 could have been ejected from R136 through a strong encounter with another massive binary or single star. In addition, we discuss all massive binaries and single stars which are ejected dynamically from their parent cluster in the N-body models.
Long-Period Planets in Open Clusters and the Evolution of Planetary Systems
NASA Astrophysics Data System (ADS)
Quinn, Samuel N.; White, Russel; Latham, David W.; Stefanik, Robert
2018-01-01
Recent discoveries of giant planets in open clusters confirm that they do form and migrate in relatively dense stellar groups, though overall occurrence rates are not yet well constrained because the small sample of giant planets discovered thus far predominantly have short periods. Moreover, planet formation rates and the architectures of planetary systems in clusters may vary significantly -- e.g., due to intercluster differences in the chemical properties that regulate the growth of planetary embryos or in the stellar space density and binary populations, which can influence the dynamical evolution of planetary systems. Constraints on the population of long-period Jovian planets -- those representing the reservoir from which many hot Jupiters likely form, and which are most vulnerable to intracluster dynamical interactions -- can help quantify how the birth environment affects formation and evolution, particularly through comparison of populations possessing a range of ages and chemical and dynamical properties. From our ongoing RV survey of open clusters, we present the discovery of several long-period planets and candidate substellar companions in the Praesepe, Coma Berenices, and Hyades open clusters. From these discoveries, we improve estimates of giant planet occurrence rates in clusters, and we note that high eccentricities in several of these systems support the prediction that the birth environment helps shape planetary system architectures.
Recent advances in the Suf Fe-S cluster biogenesis pathway: Beyond the Proteobacteria.
Outten, F Wayne
2015-06-01
Fe-S clusters play critical roles in cellular function throughout all three kingdoms of life. Consequently, Fe-S cluster biogenesis systems are present in most organisms. The Suf (sulfur formation) system is the most ancient of the three characterized Fe-S cluster biogenesis pathways, which also include the Isc and Nif systems. Much of the first work on the Suf system took place in Gram-negative Proteobacteria used as model organisms. These early studies led to a wealth of biochemical, genetic, and physiological information on Suf function. From those studies we have learned that SufB functions as an Fe-S scaffold in conjunction with SufC (and in some cases SufD). SufS and SufE together mobilize sulfur for cluster assembly and SufA traffics the complete Fe-S cluster from SufB to target apo-proteins. However, recent progress on the Suf system in other organisms has opened up new avenues of research and new hypotheses about Suf function. This review focuses primarily on the most recent discoveries about the Suf pathway and where those new models may lead the field. This article is part of a Special Issue entitled: Fe/S proteins: Analysis, structure, function, biogenesis and diseases. Copyright © 2014 Elsevier B.V. All rights reserved.
An effective trust-based recommendation method using a novel graph clustering algorithm
NASA Astrophysics Data System (ADS)
Moradi, Parham; Ahmadian, Sajad; Akhlaghian, Fardin
2015-10-01
Recommender systems are programs that aim to provide personalized recommendations to users for specific items (e.g. music, books) in online sharing communities or on e-commerce sites. Collaborative filtering methods are important and widely accepted types of recommender systems that generate recommendations based on the ratings of like-minded users. On the other hand, these systems confront several inherent issues such as data sparsity and cold start problems, caused by fewer ratings against the unknowns that need to be predicted. Incorporating trust information into the collaborative filtering systems is an attractive approach to resolve these problems. In this paper, we present a model-based collaborative filtering method by applying a novel graph clustering algorithm and also considering trust statements. In the proposed method first of all, the problem space is represented as a graph and then a sparsest subgraph finding algorithm is applied on the graph to find the initial cluster centers. Then, the proposed graph clustering algorithm is performed to obtain the appropriate users/items clusters. Finally, the identified clusters are used as a set of neighbors to recommend unseen items to the current active user. Experimental results based on three real-world datasets demonstrate that the proposed method outperforms several state-of-the-art recommender system methods.
Distributed shared memory for roaming large volumes.
Castanié, Laurent; Mion, Christophe; Cavin, Xavier; Lévy, Bruno
2006-01-01
We present a cluster-based volume rendering system for roaming very large volumes. This system allows to move a gigabyte-sized probe inside a total volume of several tens or hundreds of gigabytes in real-time. While the size of the probe is limited by the total amount of texture memory on the cluster, the size of the total data set has no theoretical limit. The cluster is used as a distributed graphics processing unit that both aggregates graphics power and graphics memory. A hardware-accelerated volume renderer runs in parallel on the cluster nodes and the final image compositing is implemented using a pipelined sort-last rendering algorithm. Meanwhile, volume bricking and volume paging allow efficient data caching. On each rendering node, a distributed hierarchical cache system implements a global software-based distributed shared memory on the cluster. In case of a cache miss, this system first checks page residency on the other cluster nodes instead of directly accessing local disks. Using two Gigabit Ethernet network interfaces per node, we accelerate data fetching by a factor of 4 compared to directly accessing local disks. The system also implements asynchronous disk access and texture loading, which makes it possible to overlap data loading, volume slicing and rendering for optimal volume roaming.
Fast max-margin clustering for unsupervised word sense disambiguation in biomedical texts
Duan, Weisi; Song, Min; Yates, Alexander
2009-01-01
Background We aim to solve the problem of determining word senses for ambiguous biomedical terms with minimal human effort. Methods We build a fully automated system for Word Sense Disambiguation by designing a system that does not require manually-constructed external resources or manually-labeled training examples except for a single ambiguous word. The system uses a novel and efficient graph-based algorithm to cluster words into groups that have the same meaning. Our algorithm follows the principle of finding a maximum margin between clusters, determining a split of the data that maximizes the minimum distance between pairs of data points belonging to two different clusters. Results On a test set of 21 ambiguous keywords from PubMed abstracts, our system has an average accuracy of 78%, outperforming a state-of-the-art unsupervised system by 2% and a baseline technique by 23%. On a standard data set from the National Library of Medicine, our system outperforms the baseline by 6% and comes within 5% of the accuracy of a supervised system. Conclusion Our system is a novel, state-of-the-art technique for efficiently finding word sense clusters, and does not require training data or human effort for each new word to be disambiguated. PMID:19344480
Performance Analysis of Cluster Formation in Wireless Sensor Networks.
Montiel, Edgar Romo; Rivero-Angeles, Mario E; Rubino, Gerardo; Molina-Lozano, Heron; Menchaca-Mendez, Rolando; Menchaca-Mendez, Ricardo
2017-12-13
Clustered-based wireless sensor networks have been extensively used in the literature in order to achieve considerable energy consumption reductions. However, two aspects of such systems have been largely overlooked. Namely, the transmission probability used during the cluster formation phase and the way in which cluster heads are selected. Both of these issues have an important impact on the performance of the system. For the former, it is common to consider that sensor nodes in a clustered-based Wireless Sensor Network (WSN) use a fixed transmission probability to send control data in order to build the clusters. However, due to the highly variable conditions experienced by these networks, a fixed transmission probability may lead to extra energy consumption. In view of this, three different transmission probability strategies are studied: optimal, fixed and adaptive. In this context, we also investigate cluster head selection schemes, specifically, we consider two intelligent schemes based on the fuzzy C-means and k-medoids algorithms and a random selection with no intelligence. We show that the use of intelligent schemes greatly improves the performance of the system, but their use entails higher complexity and selection delay. The main performance metrics considered in this work are energy consumption, successful transmission probability and cluster formation latency. As an additional feature of this work, we study the effect of errors in the wireless channel and the impact on the performance of the system under the different transmission probability schemes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tai, Lin-Ru; Chou, Chang-Wei; Lee, I-Fang
In this study, we used a multiple copy (EGFP){sub 3} reporter system to establish a numeric nuclear index system to assess the degree of nuclear import. The system was first validated by a FRAP assay, and then was applied to evaluate the essential and multifaceted nature of basic amino acid clusters during the nuclear import of ribosomal protein L7. The results indicate that the sequence context of the basic cluster determines the degree of nuclear import, and that the number of basic residues in the cluster is irrelevant; rather the position of the pertinent basic residues is crucial. Moreover, itmore » also found that the type of carrier protein used by basic cluster has a great impact on the degree of nuclear import. In case of L7, importin β2 or importin β3 are preferentially used by clusters with a high import efficiency, notwithstanding that other importins are also used by clusters with a weaker level of nuclear import. Such a preferential usage of multiple basic clusters and importins to gain nuclear entry would seem to be a common practice among ribosomal proteins in order to ensure their full participation in high rate ribosome synthesis. - Highlights: ► We introduce a numeric index system that represents the degree of nuclear import. ► The rate of nuclear import is dictated by the sequence context of the basic cluster. ► Importin β2 and β3 were mainly responsible for the N4 mediated nuclear import.« less
Performance Analysis of Cluster Formation in Wireless Sensor Networks
Montiel, Edgar Romo; Rivero-Angeles, Mario E.; Rubino, Gerardo; Molina-Lozano, Heron; Menchaca-Mendez, Rolando; Menchaca-Mendez, Ricardo
2017-01-01
Clustered-based wireless sensor networks have been extensively used in the literature in order to achieve considerable energy consumption reductions. However, two aspects of such systems have been largely overlooked. Namely, the transmission probability used during the cluster formation phase and the way in which cluster heads are selected. Both of these issues have an important impact on the performance of the system. For the former, it is common to consider that sensor nodes in a clustered-based Wireless Sensor Network (WSN) use a fixed transmission probability to send control data in order to build the clusters. However, due to the highly variable conditions experienced by these networks, a fixed transmission probability may lead to extra energy consumption. In view of this, three different transmission probability strategies are studied: optimal, fixed and adaptive. In this context, we also investigate cluster head selection schemes, specifically, we consider two intelligent schemes based on the fuzzy C-means and k-medoids algorithms and a random selection with no intelligence. We show that the use of intelligent schemes greatly improves the performance of the system, but their use entails higher complexity and selection delay. The main performance metrics considered in this work are energy consumption, successful transmission probability and cluster formation latency. As an additional feature of this work, we study the effect of errors in the wireless channel and the impact on the performance of the system under the different transmission probability schemes. PMID:29236065
DISCOVERY OF A DISSOCIATIVE GALAXY CLUSTER MERGER WITH LARGE PHYSICAL SEPARATION
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dawson, William A.; Wittman, David; Jee, M. James
2012-03-10
We present DLSCL J0916.2+2951 (z = 0.53), a newly discovered major cluster merger in which the collisional cluster gas has become dissociated from the collisionless galaxies and dark matter (DM). We identified the cluster using optical and weak-lensing observations as part of the Deep Lens Survey. Our follow-up observations with Keck, Subaru, Hubble Space Telescope, and Chandra show that the cluster is a dissociative merger and constrain the DM self-interaction cross-section {sigma}{sub DM} m{sup -1}{sub DM} {approx}< 7 cm{sup 2} g{sup -1}. The system is observed at least 0.7 {+-} 0.2 Gyr since first pass-through, thus providing a picture ofmore » cluster mergers 2-5 times further progressed than similar systems observed to date. This improved temporal leverage has implications for our understanding of merging clusters and their impact on galaxy evolution.« less