NASA Astrophysics Data System (ADS)
Byun, Hye Suk; El-Naggar, Mohamed Y.; Kalia, Rajiv K.; Nakano, Aiichiro; Vashishta, Priya
2017-10-01
Kinetic Monte Carlo (KMC) simulations are used to study long-time dynamics of a wide variety of systems. Unfortunately, the conventional KMC algorithm is not scalable to larger systems, since its time scale is inversely proportional to the simulated system size. A promising approach to resolving this issue is the synchronous parallel KMC (SPKMC) algorithm, which makes the time scale size-independent. This paper introduces a formal derivation of the SPKMC algorithm based on local transition-state and time-dependent Hartree approximations, as well as its scalable parallel implementation based on a dual linked-list cell method. The resulting algorithm has achieved a weak-scaling parallel efficiency of 0.935 on 1024 Intel Xeon processors for simulating biological electron transfer dynamics in a 4.2 billion-heme system, as well as decent strong-scaling parallel efficiency. The parallel code has been used to simulate a lattice of cytochrome complexes on a bacterial-membrane nanowire, and it is broadly applicable to other problems such as computational synthesis of new materials.
Metascalable molecular dynamics simulation of nano-mechano-chemistry
NASA Astrophysics Data System (ADS)
Shimojo, F.; Kalia, R. K.; Nakano, A.; Nomura, K.; Vashishta, P.
2008-07-01
We have developed a metascalable (or 'design once, scale on new architectures') parallel application-development framework for first-principles based simulations of nano-mechano-chemical processes on emerging petaflops architectures based on spatiotemporal data locality principles. The framework consists of (1) an embedded divide-and-conquer (EDC) algorithmic framework based on spatial locality to design linear-scaling algorithms, (2) a space-time-ensemble parallel (STEP) approach based on temporal locality to predict long-time dynamics, and (3) a tunable hierarchical cellular decomposition (HCD) parallelization framework to map these scalable algorithms onto hardware. The EDC-STEP-HCD framework exposes and expresses maximal concurrency and data locality, thereby achieving parallel efficiency as high as 0.99 for 1.59-billion-atom reactive force field molecular dynamics (MD) and 17.7-million-atom (1.56 trillion electronic degrees of freedom) quantum mechanical (QM) MD in the framework of the density functional theory (DFT) on adaptive multigrids, in addition to 201-billion-atom nonreactive MD, on 196 608 IBM BlueGene/L processors. We have also used the framework for automated execution of adaptive hybrid DFT/MD simulation on a grid of six supercomputers in the US and Japan, in which the number of processors changed dynamically on demand and tasks were migrated according to unexpected faults. The paper presents the application of the framework to the study of nanoenergetic materials: (1) combustion of an Al/Fe2O3 thermite and (2) shock initiation and reactive nanojets at a void in an energetic crystal.
Shibuta, Yasushi; Sakane, Shinji; Miyoshi, Eisuke; Okita, Shin; Takaki, Tomohiro; Ohno, Munekazu
2017-04-05
Can completely homogeneous nucleation occur? Large scale molecular dynamics simulations performed on a graphics-processing-unit rich supercomputer can shed light on this long-standing issue. Here, a billion-atom molecular dynamics simulation of homogeneous nucleation from an undercooled iron melt reveals that some satellite-like small grains surrounding previously formed large grains exist in the middle of the nucleation process, which are not distributed uniformly. At the same time, grains with a twin boundary are formed by heterogeneous nucleation from the surface of the previously formed grains. The local heterogeneity in the distribution of grains is caused by the local accumulation of the icosahedral structure in the undercooled melt near the previously formed grains. This insight is mainly attributable to the multi-graphics processing unit parallel computation combined with the rapid progress in high-performance computational environments.Nucleation is a fundamental physical process, however it is a long-standing issue whether completely homogeneous nucleation can occur. Here the authors reveal, via a billion-atom molecular dynamics simulation, that local heterogeneity exists during homogeneous nucleation in an undercooled iron melt.
GAPD: a GPU-accelerated atom-based polychromatic diffraction simulation code.
E, J C; Wang, L; Chen, S; Zhang, Y Y; Luo, S N
2018-03-01
GAPD, a graphics-processing-unit (GPU)-accelerated atom-based polychromatic diffraction simulation code for direct, kinematics-based, simulations of X-ray/electron diffraction of large-scale atomic systems with mono-/polychromatic beams and arbitrary plane detector geometries, is presented. This code implements GPU parallel computation via both real- and reciprocal-space decompositions. With GAPD, direct simulations are performed of the reciprocal lattice node of ultralarge systems (∼5 billion atoms) and diffraction patterns of single-crystal and polycrystalline configurations with mono- and polychromatic X-ray beams (including synchrotron undulator sources), and validation, benchmark and application cases are presented.
Vashishta, Priya; Kalia, Rajiv K; Nakano, Aiichiro
2006-03-02
We have developed a first-principles-based hierarchical simulation framework, which seamlessly integrates (1) a quantum mechanical description based on the density functional theory (DFT), (2) multilevel molecular dynamics (MD) simulations based on a reactive force field (ReaxFF) that describes chemical reactions and polarization, a nonreactive force field that employs dynamic atomic charges, and an effective force field (EFF), and (3) an atomistically informed continuum model to reach macroscopic length scales. For scalable hierarchical simulations, we have developed parallel linear-scaling algorithms for (1) DFT calculation based on a divide-and-conquer algorithm on adaptive multigrids, (2) chemically reactive MD based on a fast ReaxFF (F-ReaxFF) algorithm, and (3) EFF-MD based on a space-time multiresolution MD (MRMD) algorithm. On 1920 Intel Itanium2 processors, we have demonstrated 1.4 million atom (0.12 trillion grid points) DFT, 0.56 billion atom F-ReaxFF, and 18.9 billion atom MRMD calculations, with parallel efficiency as high as 0.953. Through the use of these algorithms, multimillion atom MD simulations have been performed to study the oxidation of an aluminum nanoparticle. Structural and dynamic correlations in the oxide region are calculated as well as the evolution of charges, surface oxide thickness, diffusivities of atoms, and local stresses. In the microcanonical ensemble, the oxidizing reaction becomes explosive in both molecular and atomic oxygen environments, due to the enormous energy release associated with Al-O bonding. In the canonical ensemble, an amorphous oxide layer of a thickness of approximately 40 angstroms is formed after 466 ps, in good agreement with experiments. Simulations have been performed to study nanoindentation on crystalline, amorphous, and nanocrystalline silicon nitride and silicon carbide. Simulation on nanocrystalline silicon carbide reveals unusual deformation mechanisms in brittle nanophase materials, due to coexistence of brittle grains and soft amorphous-like grain boundary phases. Simulations predict a crossover from intergranular continuous deformation to intragrain discrete deformation at a critical indentation depth.
A Metascalable Computing Framework for Large Spatiotemporal-Scale Atomistic Simulations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nomura, K; Seymour, R; Wang, W
2009-02-17
A metascalable (or 'design once, scale on new architectures') parallel computing framework has been developed for large spatiotemporal-scale atomistic simulations of materials based on spatiotemporal data locality principles, which is expected to scale on emerging multipetaflops architectures. The framework consists of: (1) an embedded divide-and-conquer (EDC) algorithmic framework based on spatial locality to design linear-scaling algorithms for high complexity problems; (2) a space-time-ensemble parallel (STEP) approach based on temporal locality to predict long-time dynamics, while introducing multiple parallelization axes; and (3) a tunable hierarchical cellular decomposition (HCD) parallelization framework to map these O(N) algorithms onto a multicore cluster based onmore » hybrid implementation combining message passing and critical section-free multithreading. The EDC-STEP-HCD framework exposes maximal concurrency and data locality, thereby achieving: (1) inter-node parallel efficiency well over 0.95 for 218 billion-atom molecular-dynamics and 1.68 trillion electronic-degrees-of-freedom quantum-mechanical simulations on 212,992 IBM BlueGene/L processors (superscalability); (2) high intra-node, multithreading parallel efficiency (nanoscalability); and (3) nearly perfect time/ensemble parallel efficiency (eon-scalability). The spatiotemporal scale covered by MD simulation on a sustained petaflops computer per day (i.e. petaflops {center_dot} day of computing) is estimated as NT = 2.14 (e.g. N = 2.14 million atoms for T = 1 microseconds).« less
SKIRT: Hybrid parallelization of radiative transfer simulations
NASA Astrophysics Data System (ADS)
Verstocken, S.; Van De Putte, D.; Camps, P.; Baes, M.
2017-07-01
We describe the design, implementation and performance of the new hybrid parallelization scheme in our Monte Carlo radiative transfer code SKIRT, which has been used extensively for modelling the continuum radiation of dusty astrophysical systems including late-type galaxies and dusty tori. The hybrid scheme combines distributed memory parallelization, using the standard Message Passing Interface (MPI) to communicate between processes, and shared memory parallelization, providing multiple execution threads within each process to avoid duplication of data structures. The synchronization between multiple threads is accomplished through atomic operations without high-level locking (also called lock-free programming). This improves the scaling behaviour of the code and substantially simplifies the implementation of the hybrid scheme. The result is an extremely flexible solution that adjusts to the number of available nodes, processors and memory, and consequently performs well on a wide variety of computing architectures.
SPEEDES - A multiple-synchronization environment for parallel discrete-event simulation
NASA Technical Reports Server (NTRS)
Steinman, Jeff S.
1992-01-01
Synchronous Parallel Environment for Emulation and Discrete-Event Simulation (SPEEDES) is a unified parallel simulation environment. It supports multiple-synchronization protocols without requiring users to recompile their code. When a SPEEDES simulation runs on one node, all the extra parallel overhead is removed automatically at run time. When the same executable runs in parallel, the user preselects the synchronization algorithm from a list of options. SPEEDES currently runs on UNIX networks and on the California Institute of Technology/Jet Propulsion Laboratory Mark III Hypercube. SPEEDES also supports interactive simulations. Featured in the SPEEDES environment is a new parallel synchronization approach called Breathing Time Buckets. This algorithm uses some of the conservative techniques found in Time Bucket synchronization, along with the optimism that characterizes the Time Warp approach. A mathematical model derived from first principles predicts the performance of Breathing Time Buckets. Along with the Breathing Time Buckets algorithm, this paper discusses the rules for processing events in SPEEDES, describes the implementation of various other synchronization protocols supported by SPEEDES, describes some new ones for the future, discusses interactive simulations, and then gives some performance results.
Synchronization Of Parallel Discrete Event Simulations
NASA Technical Reports Server (NTRS)
Steinman, Jeffrey S.
1992-01-01
Adaptive, parallel, discrete-event-simulation-synchronization algorithm, Breathing Time Buckets, developed in Synchronous Parallel Environment for Emulation and Discrete Event Simulation (SPEEDES) operating system. Algorithm allows parallel simulations to process events optimistically in fluctuating time cycles that naturally adapt while simulation in progress. Combines best of optimistic and conservative synchronization strategies while avoiding major disadvantages. Algorithm processes events optimistically in time cycles adapting while simulation in progress. Well suited for modeling communication networks, for large-scale war games, for simulated flights of aircraft, for simulations of computer equipment, for mathematical modeling, for interactive engineering simulations, and for depictions of flows of information.
Solving Integer Programs from Dependence and Synchronization Problems
1993-03-01
DEFF.NSNE Solving Integer Programs from Dependence and Synchronization Problems Jaspal Subhlok March 1993 CMU-CS-93-130 School of Computer ScienceT IC...method Is an exact and efficient way of solving integer programming problems arising in dependence and synchronization analysis of parallel programs...7/;- p Keywords: Exact dependence tesing, integer programming. parallelilzng compilers, parallel program analysis, synchronization analysis Solving
PTTI 2030 - Time Transfer and Applications in 2030
2010-01-01
today’s society is paramount. Every day billions of people worldwide depend on some level of time synchronization , and timing laboratories require...applications as an inexpensive way to disseminate GPS-acquired time and frequency among groups, or as a backup method of time synchronization in the...strictly for timing use would be very expensive, perhaps prohibitive. ADVANTAGES OF AN IEEE-1588-ENABLED POWER GRID Time synchronization in
Scalable and portable visualization of large atomistic datasets
NASA Astrophysics Data System (ADS)
Sharma, Ashish; Kalia, Rajiv K.; Nakano, Aiichiro; Vashishta, Priya
2004-10-01
A scalable and portable code named Atomsviewer has been developed to interactively visualize a large atomistic dataset consisting of up to a billion atoms. The code uses a hierarchical view frustum-culling algorithm based on the octree data structure to efficiently remove atoms outside of the user's field-of-view. Probabilistic and depth-based occlusion-culling algorithms then select atoms, which have a high probability of being visible. Finally a multiresolution algorithm is used to render the selected subset of visible atoms at varying levels of detail. Atomsviewer is written in C++ and OpenGL, and it has been tested on a number of architectures including Windows, Macintosh, and SGI. Atomsviewer has been used to visualize tens of millions of atoms on a standard desktop computer and, in its parallel version, up to a billion atoms. Program summaryTitle of program: Atomsviewer Catalogue identifier: ADUM Program summary URL:http://cpc.cs.qub.ac.uk/summaries/ADUM Program obtainable from: CPC Program Library, Queen's University of Belfast, N. Ireland Computer for which the program is designed and others on which it has been tested: 2.4 GHz Pentium 4/Xeon processor, professional graphics card; Apple G4 (867 MHz)/G5, professional graphics card Operating systems under which the program has been tested: Windows 2000/XP, Mac OS 10.2/10.3, SGI IRIX 6.5 Programming languages used: C++, C and OpenGL Memory required to execute with typical data: 1 gigabyte of RAM High speed storage required: 60 gigabytes No. of lines in the distributed program including test data, etc.: 550 241 No. of bytes in the distributed program including test data, etc.: 6 258 245 Number of bits in a word: Arbitrary Number of processors used: 1 Has the code been vectorized or parallelized: No Distribution format: tar gzip file Nature of physical problem: Scientific visualization of atomic systems Method of solution: Rendering of atoms using computer graphic techniques, culling algorithms for data minimization, and levels-of-detail for minimal rendering Restrictions on the complexity of the problem: None Typical running time: The program is interactive in its execution Unusual features of the program: None References: The conceptual foundation and subsequent implementation of the algorithms are found in [A. Sharma, A. Nakano, R.K. Kalia, P. Vashishta, S. Kodiyalam, P. Miller, W. Zhao, X.L. Liu, T.J. Campbell, A. Haas, Presence—Teleoperators and Virtual Environments 12 (1) (2003)].
Sensitivity optimization of Bell-Bloom magnetometers by manipulation of atomic spin synchronization
NASA Astrophysics Data System (ADS)
Ranjbaran, M.; Tehranchi, M. M.; Hamidi, S. M.; Khalkhali, S. M. H.
2018-05-01
Many efforts have been devoted to the developments of atomic magnetometers for achieving the high sensitivity required in biomagnetic applications. To reach the high sensitivity, many types of atomic magnetometers have been introduced for optimization of the creation and relaxation rates of atomic spin polarization. In this paper, regards to sensitivity optimization techniques in the Mx configuration, we have proposed a novelty approach for synchronization of the spin precession in the Bell-Bloom magnetometers. We have utilized the phenomenological Bloch equations to simulate the spin dynamics when modulation of pumping light and radio frequency magnetic field were both used for atomic spin synchronization. Our results showed that the synchronization process, improved the magnetometer sensitivity respect to the classical configurations.
On the Synchronization of EEG Spindle Waves
NASA Astrophysics Data System (ADS)
Long, Wen; Zhang, ChengFu; Zhao, SiLan; Shi, RuiHong
2000-06-01
Based on recently sleeping cellular substrates, a network model synaptically coupled by N three-cell circuits is provided. Simulation results show that: (i) the dynamic behavior of every circuit is chaotic; (ii) the synchronization of the network is incomplete; (iii) the incomplete synchronization can integrate burst firings of cortical cells into waxing-and-wanning EEG spindle waves. These results enlighten us that this kind of incomplete synchronization may integrate microscopic, electrical activities of neurons in billions into macroscopic, functional states in human brain. In addition, the effects of coupling strength, connectional mode and noise to the synchronization are discussed.
Optimistic barrier synchronization
NASA Technical Reports Server (NTRS)
Nicol, David M.
1992-01-01
Barrier synchronization is fundamental operation in parallel computation. In many contexts, at the point a processor enters a barrier it knows that it has already processed all the work required of it prior to synchronization. The alternative case, when a processor cannot enter a barrier with the assurance that it has already performed all the necessary pre-synchronization computation, is treated. The problem arises when the number of pre-sychronization messages to be received by a processor is unkown, for example, in a parallel discrete simulation or any other computation that is largely driven by an unpredictable exchange of messages. We describe an optimistic O(log sup 2 P) barrier algorithm for such problems, study its performance on a large-scale parallel system, and consider extensions to general associative reductions as well as associative parallel prefix computations.
Parallel-aware, dedicated job co-scheduling within/across symmetric multiprocessing nodes
Jones, Terry R.; Watson, Pythagoras C.; Tuel, William; Brenner, Larry; ,Caffrey, Patrick; Fier, Jeffrey
2010-10-05
In a parallel computing environment comprising a network of SMP nodes each having at least one processor, a parallel-aware co-scheduling method and system for improving the performance and scalability of a dedicated parallel job having synchronizing collective operations. The method and system uses a global co-scheduler and an operating system kernel dispatcher adapted to coordinate interfering system and daemon activities on a node and across nodes to promote intra-node and inter-node overlap of said interfering system and daemon activities as well as intra-node and inter-node overlap of said synchronizing collective operations. In this manner, the impact of random short-lived interruptions, such as timer-decrement processing and periodic daemon activity, on synchronizing collective operations is minimized on large processor-count SPMD bulk-synchronous programming styles.
Multibillion-atom Molecular Dynamics Simulations of Plasticity, Spall, and Ejecta
NASA Astrophysics Data System (ADS)
Germann, Timothy C.
2007-06-01
Modern supercomputing platforms, such as the IBM BlueGene/L at Lawrence Livermore National Laboratory and the Roadrunner hybrid supercomputer being built at Los Alamos National Laboratory, are enabling large-scale classical molecular dynamics simulations of phenomena that were unthinkable just a few years ago. Using either the embedded atom method (EAM) description of simple (close-packed) metals, or modified EAM (MEAM) models of more complex solids and alloys with mixed covalent and metallic character, simulations containing billions to trillions of atoms are now practical, reaching volumes in excess of a cubic micron. In order to obtain any new physical insights, however, it is equally important that the analysis of such systems be tractable. This is in fact possible, in large part due to our highly efficient parallel visualization code, which enables the rendering of atomic spheres, Eulerian cells, and other geometric objects in a matter of minutes, even for tens of thousands of processors and billions of atoms. After briefly describing the BlueGene/L and Roadrunner architectures, and the code optimization strategies that were employed, results obtained thus far on BlueGene/L will be reviewed, including: (1) shock compression and release of a defective EAM Cu sample, illustrating the plastic deformation accompanying void collapse as well as the subsequent void growth and linkup upon release; (2) solid-solid martensitic phase transition in shock-compressed MEAM Ga; and (3) Rayleigh-Taylor fluid instability modeled using large-scale direct simulation Monte Carlo (DSMC) simulations. I will also describe our initial experiences utilizing Cell Broadband Engine processors (developed for the Sony PlayStation 3), and planned simulation studies of ejecta and spall failure in polycrystalline metals that will be carried out when the full Petaflop Opteron/Cell Roadrunner supercomputer is assembled in mid-2008.
The Global Positioning System: a high-tech success story
NASA Astrophysics Data System (ADS)
Ashby, Neil
2002-03-01
The Global Positioning System (GPS) consists of 24 or more satellites in twelve-hour orbits, each carrying atomic clocks and transmitting synchronized time and position information. The satellite system is supported by time referencing and processing centers, and data collection stations around the world. The signals make possible accurate navigation anywhere in the vicinity of Earth. There is probably no other large engineering system that relies on a broader range of applications of fundamental modern physics, such as special and general relativity, and atomic physics. Atomic clocks only a few inches on a side have been developed to an almost incredible stage of reliability and stability. Modern circuit fabrication techniques produce GPS receivers on a chip at cost comparable to that of handheld cell phones. Widespread availability and low cost in the civilian sector has led to a host of interesting applications. The economic impact of GPS is in the billions of dollars annually and is increasing. A comparable system, currently with only a few satellites, is the Soviet GLONASS. Europeans are developing another competitor, GALILEO, and have plans to place Hydrogen masers in space. These systems are changing the way we determine where we are and are revolutionizing many fields of scientific research.
Parallel discrete-event simulation of FCFS stochastic queueing networks
NASA Technical Reports Server (NTRS)
Nicol, David M.
1988-01-01
Physical systems are inherently parallel. Intuition suggests that simulations of these systems may be amenable to parallel execution. The parallel execution of a discrete-event simulation requires careful synchronization of processes in order to ensure the execution's correctness; this synchronization can degrade performance. Largely negative results were recently reported in a study which used a well-known synchronization method on queueing network simulations. Discussed here is a synchronization method (appointments), which has proven itself to be effective on simulations of FCFS queueing networks. The key concept behind appointments is the provision of lookahead. Lookahead is a prediction on a processor's future behavior, based on an analysis of the processor's simulation state. It is shown how lookahead can be computed for FCFS queueing network simulations, give performance data that demonstrates the method's effectiveness under moderate to heavy loads, and discuss performance tradeoffs between the quality of lookahead, and the cost of computing lookahead.
Quantum synchronization of many coupled atoms for an ultranarrow linewidth laser
NASA Astrophysics Data System (ADS)
He, Peiru; Xu, Minghui; Tieri, David; Zhu, Bihui; Rey, Ana Maria; Hazzard, Kaden; Holland, Murray
2014-05-01
We theoretically investigate the effect of quantum synchronization on many coupled two-level atoms acting as high quality oscillators. We show that quantum synchronization - the spontaneous alignment of the phase (of the two-level superposition) between different atoms - provides a potential approach to produce robust atomic coherences and coherent light with ultranarrow linewidth and extreme phase stability. The atoms may be coupled either through their direct dipole-dipole interactions or, as in a superradiant laser, through an optical cavity. We develop a variety of analytic and computational approaches for this problem, including exact open quantum system methods for small systems, semiclassical theories, and approaches that make use of the permutation symmetry of identically coupled ensembles. We investigate the first and second order coherence properties of both the optical and atomic degrees of freedom. We study synchronization in both the steady-state, as well as during the dynamically applied pulse sequences of Rabi and Ramsey interferometry. This work was supported by the DARPA QuASAR program, the NSF, and NIST.
Parallel integrated frame synchronizer chip
NASA Technical Reports Server (NTRS)
Solomon, Jeffrey Michael (Inventor); Ghuman, Parminder Singh (Inventor); Bennett, Toby Dennis (Inventor)
2000-01-01
A parallel integrated frame synchronizer which implements a sequential pipeline process wherein serial data in the form of telemetry data or weather satellite data enters the synchronizer by means of a front-end subsystem and passes to a parallel correlator subsystem or a weather satellite data processing subsystem. When in a CCSDS mode, data from the parallel correlator subsystem passes through a window subsystem, then to a data alignment subsystem and then to a bit transition density (BTD)/cyclical redundancy check (CRC) decoding subsystem. Data from the BTD/CRC decoding subsystem or data from the weather satellite data processing subsystem is then fed to an output subsystem where it is output from a data output port.
Report to the High Order Language Working Group (HOLWG)
1977-01-14
as running, runnable, suspended or dormant, may be synchronized by semaphore variables, may be schedaled using clock and duration data types and mpy...Recursive and non-recursive routines G6. Parallel processes, synchronization , critical regions G7. User defined parameterized exception handling G8...typed and lacks extensibility, parallel processing, synchronization and real-time features. Overall Evaluation IBM strongly recommended PL/I as a
Quantum Atomic Clock Synchronization: An Entangled Concept of Nonlocal Simultaneity
NASA Technical Reports Server (NTRS)
Abrams, D.; Dowling, J.; Williams, C.; Jozsa, R.
2000-01-01
We demonstrate that two spatially separated parties (Alice and Bob) can utilize shared prior quantum entanglement, as well as a classical information channel, to establish a synchronized pair of atomic clocks.
Synchronization of a self-sustained cold-atom oscillator
NASA Astrophysics Data System (ADS)
Heimonen, H.; Kwek, L. C.; Kaiser, R.; Labeyrie, G.
2018-04-01
Nonlinear oscillations and synchronization phenomena are ubiquitous in nature. We study the synchronization of self-oscillating magneto-optically trapped cold atoms to a weak external driving. The oscillations arise from a dynamical instability due the competition between the screened magneto-optical trapping force and the interatomic repulsion due to multiple scattering of light. A weak modulation of the trapping force allows the oscillations of the cloud to synchronize to the driving. The synchronization frequency range increases with the forcing amplitude. The corresponding Arnold tongue is experimentally measured and compared to theoretical predictions. Phase locking between the oscillator and drive is also observed.
Robust Synchronization Models for Presentation System Using SMIL-Driven Approach
ERIC Educational Resources Information Center
Asnawi, Rustam; Ahmad, Wan Fatimah Wan; Rambli, Dayang Rohaya Awang
2013-01-01
Current common Presentation System (PS) models are slide based oriented and lack synchronization analysis either with temporal or spatial constraints. Such models, in fact, tend to lead to synchronization problems, particularly on parallel synchronization with spatial constraints between multimedia element presentations. However, parallel…
Graphics applications utilizing parallel processing
NASA Technical Reports Server (NTRS)
Rice, John R.
1990-01-01
The results are presented of research conducted to develop a parallel graphic application algorithm to depict the numerical solution of the 1-D wave equation, the vibrating string. The research was conducted on a Flexible Flex/32 multiprocessor and a Sequent Balance 21000 multiprocessor. The wave equation is implemented using the finite difference method. The synchronization issues that arose from the parallel implementation and the strategies used to alleviate the effects of the synchronization overhead are discussed.
Synchronous optical pumping of quantum revival beats for atomic magnetometry
DOE Office of Scientific and Technical Information (OSTI.GOV)
Seltzer, S. J.; Meares, P. J.; Romalis, M. V.
2007-05-15
We observe quantum beats with periodic revivals due to nonlinear spacing of Zeeman levels in the ground state of potassium atoms, and demonstrate their synchronous optical pumping by double modulation of the pumping light at the Larmor frequency and the revival frequency. We show that synchronous pumping increases the degree of spin polarization by a factor of 4. As a practical example, we explore the application of this double-modulation technique to atomic magnetometers operating in the geomagnetic field range, and find that it can increase the sensitivity and reduce magnetic-field-orientation-dependent measurement errors endemic to alkali-metal magnetometers.
NASA Technical Reports Server (NTRS)
Sohn, Andrew; Biswas, Rupak
1996-01-01
Solving the hard Satisfiability Problem is time consuming even for modest-sized problem instances. Solving the Random L-SAT Problem is especially difficult due to the ratio of clauses to variables. This report presents a parallel synchronous simulated annealing method for solving the Random L-SAT Problem on a large-scale distributed-memory multiprocessor. In particular, we use a parallel synchronous simulated annealing procedure, called Generalized Speculative Computation, which guarantees the same decision sequence as sequential simulated annealing. To demonstrate the performance of the parallel method, we have selected problem instances varying in size from 100-variables/425-clauses to 5000-variables/21,250-clauses. Experimental results on the AP1000 multiprocessor indicate that our approach can satisfy 99.9 percent of the clauses while giving almost a 70-fold speedup on 500 processors.
Memory-based frame synchronizer. [for digital communication systems
NASA Technical Reports Server (NTRS)
Stattel, R. J.; Niswander, J. K. (Inventor)
1981-01-01
A frame synchronizer for use in digital communications systems wherein data formats can be easily and dynamically changed is described. The use of memory array elements provide increased flexibility in format selection and sync word selection in addition to real time reconfiguration ability. The frame synchronizer comprises a serial-to-parallel converter which converts a serial input data stream to a constantly changing parallel data output. This parallel data output is supplied to programmable sync word recognizers each consisting of a multiplexer and a random access memory (RAM). The multiplexer is connected to both the parallel data output and an address bus which may be connected to a microprocessor or computer for purposes of programming the sync word recognizer. The RAM is used as an associative memory or decorder and is programmed to identify a specific sync word. Additional programmable RAMs are used as counter decoders to define word bit length, frame word length, and paragraph frame length.
Paralex: An Environment for Parallel Programming in Distributed Systems
1991-12-07
distributed systems is coni- parable to assembly language programming for traditional sequential systems - the user must resort to low-level primitives ...to accomplish data encoding/decoding, communication, remote exe- cution, synchronization , failure detection and recovery. It is our belief that... synchronization . Finally, composing parallel programs by interconnecting se- quential computations allows automatic support for heterogeneity and fault tolerance
Quantum Synchronization of three-level atoms
NASA Astrophysics Data System (ADS)
He, Peiru; Rey, Ana Maria; Holland, Murray
2015-05-01
Recent studies show that quantum synchronization, the spontaneous alignment of the quantum phase between different oscillators, can be used to build superradiant lasers with ultranarrow linewidth. We theoretically investigate the effect of quantum synchronization on many coupled three-level atoms where there are richer phase diagrams than the standard two-level system. This three-level model allows two-color ultranarrow coherent light to be produced where more than one phase must be simultaneously synchronized. Of particular interest, we study the V-type geometry that is relevant to current 87 Sr experiments in JILA. As well as the synchronization phenomenon, we explore other quantum effects such as photon correlations and squeezing. This work is supported by the DARPA QuASAR program, the NSF, and NIST.
Compositions of doped, co-doped and tri-doped semiconductor materials
Lynn, Kelvin [Pullman, WA; Jones, Kelly [Colfax, WA; Ciampi, Guido [Watertown, MA
2011-12-06
Semiconductor materials suitable for being used in radiation detectors are disclosed. A particular example of the semiconductor materials includes tellurium, cadmium, and zinc. Tellurium is in molar excess of cadmium and zinc. The example also includes aluminum having a concentration of about 10 to about 20,000 atomic parts per billion and erbium having a concentration of at least 10,000 atomic parts per billion.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chrisochoides, N.; Sukup, F.
In this paper we present a parallel implementation of the Bowyer-Watson (BW) algorithm using the task-parallel programming model. The BW algorithm constitutes an ideal mesh refinement strategy for implementing a large class of unstructured mesh generation techniques on both sequential and parallel computers, by preventing the need for global mesh refinement. Its implementation on distributed memory multicomputes using the traditional data-parallel model has been proven very inefficient due to excessive synchronization needed among processors. In this paper we demonstrate that with the task-parallel model we can tolerate synchronization costs inherent to data-parallel methods by exploring concurrency in the processor level.more » Our preliminary performance data indicate that the task- parallel approach: (i) is almost four times faster than the existing data-parallel methods, (ii) scales linearly, and (iii) introduces minimum overheads compared to the {open_quotes}best{close_quotes} sequential implementation of the BW algorithm.« less
Global synchronization of parallel processors using clock pulse width modulation
Chen, Dong; Ellavsky, Matthew R.; Franke, Ross L.; Gara, Alan; Gooding, Thomas M.; Haring, Rudolf A.; Jeanson, Mark J.; Kopcsay, Gerard V.; Liebsch, Thomas A.; Littrell, Daniel; Ohmacht, Martin; Reed, Don D.; Schenck, Brandon E.; Swetz, Richard A.
2013-04-02
A circuit generates a global clock signal with a pulse width modification to synchronize processors in a parallel computing system. The circuit may include a hardware module and a clock splitter. The hardware module may generate a clock signal and performs a pulse width modification on the clock signal. The pulse width modification changes a pulse width within a clock period in the clock signal. The clock splitter may distribute the pulse width modified clock signal to a plurality of processors in the parallel computing system.
NASA Technical Reports Server (NTRS)
Nicol, David; Fujimoto, Richard
1992-01-01
This paper surveys topics that presently define the state of the art in parallel simulation. Included in the tutorial are discussions on new protocols, mathematical performance analysis, time parallelism, hardware support for parallel simulation, load balancing algorithms, and dynamic memory management for optimistic synchronization.
TIME SIGNALS, * SYNCHRONIZATION (ELECTRONICS)), NETWORKS, FREQUENCY, STANDARDS, RADIO SIGNALS, ERRORS, VERY LOW FREQUENCY, PROPAGATION, ACCURACY, ATOMIC CLOCKS, CESIUM, RADIO STATIONS, NAVAL SHORE FACILITIES
Aurora Synchronization Improvement
1991-06-01
AURORA SYNCHRONIZATION IMPROVEMENT D. M. Weidenheimer, N. R. Pereira, and D. C. Judy* Berkeley Research Associates, Inc., PO Box 852, Springfield...Recently, synchronization of the four pulse-forming lines (PFLs) has been significantly improved over the original de- sign. The four parallel PFLs are...now synchronized to within 10 ns over 60% of the shots. This paper describes the current switching scheme, reports the current timing statistics, and
The cost of conservative synchronization in parallel discrete event simulations
NASA Technical Reports Server (NTRS)
Nicol, David M.
1990-01-01
The performance of a synchronous conservative parallel discrete-event simulation protocol is analyzed. The class of simulation models considered is oriented around a physical domain and possesses a limited ability to predict future behavior. A stochastic model is used to show that as the volume of simulation activity in the model increases relative to a fixed architecture, the complexity of the average per-event overhead due to synchronization, event list manipulation, lookahead calculations, and processor idle time approach the complexity of the average per-event overhead of a serial simulation. The method is therefore within a constant factor of optimal. The analysis demonstrates that on large problems--those for which parallel processing is ideally suited--there is often enough parallel workload so that processors are not usually idle. The viability of the method is also demonstrated empirically, showing how good performance is achieved on large problems using a thirty-two node Intel iPSC/2 distributed memory multiprocessor.
Linux Kernel Co-Scheduling and Bulk Synchronous Parallelism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jones, Terry R
2012-01-01
This paper describes a kernel scheduling algorithm that is based on coscheduling principles and that is intended for parallel applications running on 1000 cores or more. Experimental results for a Linux implementation on a Cray XT5 machine are presented. The results indicate that Linux is a suitable operating system for this new scheduling scheme, and that this design provides a dramatic improvement in scaling performance for synchronizing collective operations at scale.
Semiconductive materials and associated uses thereof
Lynn, Kelvin [Pullman, WA; Jones, Kelly [Colfax, WA; Ciampi, Guido [Waltham, MA
2011-11-01
High rate radiation detectors are disclosed herein. The detectors include a detector material disposed inside the container, the detector material containing cadmium, tellurium, and zinc, a first dopant containing at least one of aluminum, chlorine, and indium, and a second dopant containing a rare earth metal. The first dopant has a concentration of about 500 to about 20,000 atomic parts per billion, and the second dopant has a concentration of about 200 to about 20,000 atomic parts per billion.
Semiconductive materials and associated uses thereof
Lynn, Kelvin; Jones, Kelly; Ciampi, Guido
2012-10-09
High rate radiation detectors are disclosed herein. The detectors include a detector material disposed inside the container, the detector material containing cadmium, tellurium, and zinc, a first dopant containing at least one of aluminum, chlorine, and indium, and a second dopant containing a rare earth metal. The first dopant has a concentration of about 500 to about 20,000 atomic parts per billion, and the second dopant has a concentration of about 200 to about 20,000 atomic parts per billion.
2007-01-01
as a function of the particle velocity that drives the shock [7]. The MD and experimen- tal data agree very well. Furthermore, the simulation shows...topological anomalies in multimillion - node chemical bond networks in materials [48]. At the Col- laboratory for Advanced Computing and Simulations ...to-billion atom simulations of chemical reactions Aiichiro Nakano a,*, Rajiv K. Kalia a, Ken-ichi Nomura a, Ashish Sharma a, Priya Vashishta a, Fuyuki
A relativistic analysis of clock synchronization
NASA Technical Reports Server (NTRS)
Thomas, J. B.
1974-01-01
The relativistic conversion between coordinate time and atomic time is reformulated to allow simpler time calculations relating analysis in solar-system barycentric coordinates (using coordinate time) with earth-fixed observations (measuring earth-bound proper time or atomic time.) After an interpretation of terms, this simplified formulation, which has a rate accuracy of about 10 to the minus 15th power, is used to explain the conventions required in the synchronization of a world wide clock network and to analyze two synchronization techniques-portable clocks and radio interferometry. Finally, pertinent experiment tests of relativity are briefly discussed in terms of the reformulated time conversion.
The tectonics of anorthosite massifs
NASA Technical Reports Server (NTRS)
Seyfert, C. K.
1981-01-01
Anorthosite massifs developed approximately 1.4 to 1.5 billion years ago along an arch which developed parallel to a zone of continental separation as a block which included North America, Europe, and probably Asia separated from a block which included parts of South America, Africa, India, and Australia. Anorthosite massifs also developed at the same time along a belt which runs through the continents which comprise Gondwanaland (South America), Africa, India, Australia, and Antarctica. This was a zone of continental separation which subsequently became a zone of continental collision about 1.2 billion years ago. The northern anorthosite belt also parallels an orogenic belt which was active between 1.8 and 1.7 billion years ago. Heat generated during this mountain building period helped in the formation of the anorthosites.
NASA Astrophysics Data System (ADS)
Muraviev, A. V.; Smolski, V. O.; Loparo, Z. E.; Vodopyanov, K. L.
2018-04-01
Mid-infrared spectroscopy offers supreme sensitivity for the detection of trace gases, solids and liquids based on tell-tale vibrational bands specific to this spectral region. Here, we present a new platform for mid-infrared dual-comb Fourier-transform spectroscopy based on a pair of ultra-broadband subharmonic optical parametric oscillators pumped by two phase-locked thulium-fibre combs. Our system provides fast (7 ms for a single interferogram), moving-parts-free, simultaneous acquisition of 350,000 spectral data points, spaced by a 115 MHz intermodal interval over the 3.1-5.5 µm spectral range. Parallel detection of 22 trace molecular species in a gas mixture, including isotopologues containing isotopes such as 13C, 18O, 17O, 15N, 34S, 33S and deuterium, with part-per-billion sensitivity and sub-Doppler resolution is demonstrated. The technique also features absolute optical frequency referencing to an atomic clock, a high degree of mutual coherence between the two mid-infrared combs with a relative comb-tooth linewidth of 25 mHz, coherent averaging and feasibility for kilohertz-scale spectral resolution.
Note: optical receiver system for 152-channel magnetoencephalography.
Kim, Jin-Mok; Kwon, Hyukchan; Yu, Kwon-kyu; Lee, Yong-Ho; Kim, Kiwoong
2014-11-01
An optical receiver system composing 13 serial data restore/synchronizer modules and a single module combiner converted optical 32-bit serial data into 32-bit synchronous parallel data for a computer to acquire 152-channel magnetoencephalography (MEG) signals. A serial data restore/synchronizer module identified 32-bit channel-voltage bits from 48-bit streaming serial data, and then consecutively reproduced 13 times of 32-bit serial data, acting in a synchronous clock. After selecting a single among 13 reproduced data in each module, a module combiner converted it into 32-bit parallel data, which were carried to 32-port digital input board in a computer. When the receiver system together with optical transmitters were applied to 152-channel superconducting quantum interference device sensors, this MEG system maintained a field noise level of 3 fT/√Hz @ 100 Hz at a sample rate of 1 kSample/s per channel.
Linux Kernel Co-Scheduling For Bulk Synchronous Parallel Applications
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jones, Terry R
2011-01-01
This paper describes a kernel scheduling algorithm that is based on co-scheduling principles and that is intended for parallel applications running on 1000 cores or more where inter-node scalability is key. Experimental results for a Linux implementation on a Cray XT5 machine are presented.1 The results indicate that Linux is a suitable operating system for this new scheduling scheme, and that this design provides a dramatic improvement in scaling performance for synchronizing collective operations at scale.
NASA Astrophysics Data System (ADS)
Grzybowski, H.; Mosdorf, R.
2016-09-01
The temperature fluctuations occurring in flow boiling in parallel minichannels with diameter of 1 mm have been experimentally investigated and analysed. The wall temperature was recorded at each minichannel outlet by thermocouple with 0.08 mm diameter probe. The time series where recorded during dynamic two-phase flow instabilities which are accompanied by chaotic temperature fluctuations. Time series were denoised using wavelet decomposition and were analysed using cross recurrence plots (CRP) which enables the study of two time series synchronization.
Using Abstraction in Explicity Parallel Programs.
1991-07-01
However, we only rely on sequential consistency of memory operations. includ- ing reads. writes and any synchronization primitives provided by the...explicit synchronization primitives . This demonstrates the practical power of sequentially consistent memory, as opposed to weaker models of memory that...a small set of synchronization primitives , all pro- cedures have non-waiting specifications. This is in contrast to richer process-oriented
Code of Federal Regulations, 2011 CFR
2011-01-01
... that amount of radioactive material which disintegrates at the rate of 37 billion atoms per second... material which disintegrates at the rate of 37 thousand atoms per second; Millicurie means that amount of radioactive material which disintegrates at the rate of 37 million atoms per second; Particle accelerator...
Self-organization in cold atomic gases: a synchronization perspective.
Tesio, E; Robb, G R M; Oppo, G-L; Gomes, P M; Ackemann, T; Labeyrie, G; Kaiser, R; Firth, W J
2014-10-28
We study non-equilibrium spatial self-organization in cold atomic gases, where long-range spatial order spontaneously emerges from fluctuations in the plane transverse to the propagation axis of a single optical beam. The self-organization process can be interpreted as a synchronization transition in a fully connected network of fictitious oscillators, and described in terms of the Kuramoto model. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Quantum Synchronization of Two Ensembles of Atoms
NASA Astrophysics Data System (ADS)
Xu, Minghui; Tieri, David; Fine, Effie; Thompson, James; Holland, Murray
2014-05-01
We present a system that exhibits quantum synchronization as a modern analogue of the Huygens experiment which is implemented using state-of-the-art neutral atom lattice clocks of the highest precision. In particular, we study the correlated phase dynamics of two mesoscopic ensembles of atoms through their collective coupling to an optical cavity. We find a dynamical quantum phase transition induced by pump noise and cavity output-coupling. The spectral properties of the superradiant light emitted from the cavity show that at a critical pump rate the system undergoes a transition from the independent behavior of two disparate oscillators to the phase-locking that is the signature of quantum synchronization. Besides being of fundamental importance in nonequilibrium quantum many-body physics, this work could have broad implications for many practical applications of ultrastable lasers and precision measurements. This work was supported by the DARPA QuASAR program, the NSF, and NIST.
Robust synchronization of spin-torque oscillators with an LCR load.
Pikovsky, Arkady
2013-09-01
We study dynamics of a serial array of spin-torque oscillators with a parallel inductor-capacitor-resistor (LCR) load. In a large range of parameters the fully synchronous regime, where all the oscillators have the same state and the output field is maximal, is shown to be stable. However, not always such a robust complete synchronization develops from a random initial state; in many cases nontrivial clustering is observed, with a partial synchronization resulting in a quasiperiodic or chaotic mean-field dynamics.
A Pervasive Parallel Processing Framework for Data Visualization and Analysis at Extreme Scale
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moreland, Kenneth; Geveci, Berk
2014-11-01
The evolution of the computing world from teraflop to petaflop has been relatively effortless, with several of the existing programming models scaling effectively to the petascale. The migration to exascale, however, poses considerable challenges. All industry trends infer that the exascale machine will be built using processors containing hundreds to thousands of cores per chip. It can be inferred that efficient concurrency on exascale machines requires a massive amount of concurrent threads, each performing many operations on a localized piece of data. Currently, visualization libraries and applications are based off what is known as the visualization pipeline. In the pipelinemore » model, algorithms are encapsulated as filters with inputs and outputs. These filters are connected by setting the output of one component to the input of another. Parallelism in the visualization pipeline is achieved by replicating the pipeline for each processing thread. This works well for today’s distributed memory parallel computers but cannot be sustained when operating on processors with thousands of cores. Our project investigates a new visualization framework designed to exhibit the pervasive parallelism necessary for extreme scale machines. Our framework achieves this by defining algorithms in terms of worklets, which are localized stateless operations. Worklets are atomic operations that execute when invoked unlike filters, which execute when a pipeline request occurs. The worklet design allows execution on a massive amount of lightweight threads with minimal overhead. Only with such fine-grained parallelism can we hope to fill the billions of threads we expect will be necessary for efficient computation on an exascale machine.« less
Research Ethics Timeline (1932-Present)
... 2 billion Manhattan Project to develop an atomic bomb. 1944-1980s The U.S. government sponsors secret research ... military personnel. 1945 The US drops two atomic bombs on Japan. 1945 Led by Pres. Eisenhower and ...
Android Protection Mechanism: A Signed Code Security Mechanism for Smartphone Applications
2011-03-01
status registers, exceptions, endian support, unaligned access support, synchronization primitives , the Jazelle Extension, and saturated integer...supports comprehensive non-blocking shared-memory synchronization primitives that scale for multiple-processor system designs. This is an improvement... synchronization . Memory semaphores can be loaded and altered without interruption because the load and store operations are atomic. Processor
A Distributed Platform for Global-Scale Agent-Based Models of Disease Transmission
Parker, Jon; Epstein, Joshua M.
2013-01-01
The Global-Scale Agent Model (GSAM) is presented. The GSAM is a high-performance distributed platform for agent-based epidemic modeling capable of simulating a disease outbreak in a population of several billion agents. It is unprecedented in its scale, its speed, and its use of Java. Solutions to multiple challenges inherent in distributing massive agent-based models are presented. Communication, synchronization, and memory usage are among the topics covered in detail. The memory usage discussion is Java specific. However, the communication and synchronization discussions apply broadly. We provide benchmarks illustrating the GSAM’s speed and scalability. PMID:24465120
Reformulation of the relativistic conversion between coordinate time and atomic time
NASA Technical Reports Server (NTRS)
Thomas, J. B.
1975-01-01
The relativistic conversion between coordinate time and atomic time is reformulated to allow simpler time calculations relating analysis in solar system barycentric coordinates (using coordinate time) with earth-fixed observations (measuring 'earth-bound' proper time or atomic time). After an interpretation in terms of relatively well-known concepts, this simplified formulation, which has a rate accuracy of about 10 to the minus 15th, is used to explain the conventions required in the synchronization of a worldwide clock network and to analyze two synchronization techniques - portable clocks and radio interferometry. Finally, pertinent experimental tests of relativity are briefly discussed in terms of the reformulated time conversion.
Problems in characterizing barrier performance
NASA Technical Reports Server (NTRS)
Jordan, Harry F.
1988-01-01
The barrier is a synchronization construct which is useful in separating a parallel program into parallel sections which are executed in sequence. The completion of a barrier requires cooperation among all executing processes. This requirement not only introduces the wait for the slowest process delay which is inherent in the definition of the synchronization, but also has implications for the efficient implementation and measurement of barrier performance in different systems. Types of barrier implementation and their relationship to different multiprocessor environments are described. Then the problem of measuring the performance of barrier implementations on specific machine architecture is discussed. The fact that the barrier synchronization requires the cooperation of all processes makes the problem of performance measurement similarly global. Making non-intrusive measurements of sufficient accuracy can be tricky on systems offering only rudimentary measurement tools.
2015-08-01
Atomic/Molecular Massively Parallel Simulator ( LAMMPS ) Software by N Scott Weingarten and James P Larentzos Approved for...Massively Parallel Simulator ( LAMMPS ) Software by N Scott Weingarten Weapons and Materials Research Directorate, ARL James P Larentzos Engility...Shifted Periodic Boundary Conditions in the Large-Scale Atomic/Molecular Massively Parallel Simulator ( LAMMPS ) Software 5a. CONTRACT NUMBER 5b
DOE Office of Scientific and Technical Information (OSTI.GOV)
Williams, R.C.; Cantelon, P.L.
1984-01-01
In selecting these historical documents the authors have applied three general tests: first, does the document help tell the story of the development of American nuclear policy in a nontechnical way; second, is the source primary rather than secondary, written by an actor in the drama rather than by a member of the audience; third, does the document provide coverage of the major chapters in the story. The Manhattan Project was America's $2 billion secret project to build an atomic bomb. Many documents associated with the project have come to light only in recent years. In Section II they usemore » the letters of J. Robert Oppenheimer and the recently declassified minutes of policy committees to tell the story of how the bomb was designed and built and how the decision was made to drop the first uranium and plutonium devices on the Japanese cities of Hiroshima and Nagasaki in 1945. How did a weapon of war become the key to a peacetime industry. In considering atomic energy after World War II, they focus in Section III on the legislative enabling acts that established the Atomic Energy Commission, the short-lived dream of international control of nuclear weapons under the Baruch Plan, and the ''atoms for peace'' program of President Dwight D. Eisenhower. By 1954 the highly classified work on nuclear weapons paralleled a new development of nuclear energy and power reactors. Knowledge was shared with both private industry and other countries. The fruits of this program are considered in the later section on nuclear power.« less
Proxy-equation paradigm: A strategy for massively parallel asynchronous computations
NASA Astrophysics Data System (ADS)
Mittal, Ankita; Girimaji, Sharath
2017-09-01
Massively parallel simulations of transport equation systems call for a paradigm change in algorithm development to achieve efficient scalability. Traditional approaches require time synchronization of processing elements (PEs), which severely restricts scalability. Relaxing synchronization requirement introduces error and slows down convergence. In this paper, we propose and develop a novel "proxy equation" concept for a general transport equation that (i) tolerates asynchrony with minimal added error, (ii) preserves convergence order and thus, (iii) expected to scale efficiently on massively parallel machines. The central idea is to modify a priori the transport equation at the PE boundaries to offset asynchrony errors. Proof-of-concept computations are performed using a one-dimensional advection (convection) diffusion equation. The results demonstrate the promise and advantages of the present strategy.
Archer, Charles J [Rochester, MN; Blocksome, Michael A [Rochester, MN; Peters, Amanda A [Rochester, MN; Ratterman, Joseph D [Rochester, MN; Smith, Brian E [Rochester, MN
2012-01-10
Methods, apparatus, and products are disclosed for reducing power consumption while synchronizing a plurality of compute nodes during execution of a parallel application that include: beginning, by each compute node, performance of a blocking operation specified by the parallel application, each compute node beginning the blocking operation asynchronously with respect to the other compute nodes; reducing, for each compute node, power to one or more hardware components of that compute node in response to that compute node beginning the performance of the blocking operation; and restoring, for each compute node, the power to the hardware components having power reduced in response to all of the compute nodes beginning the performance of the blocking operation.
Archer, Charles J [Rochester, MN; Blocksome, Michael A [Rochester, MN; Peters, Amanda E [Cambridge, MA; Ratterman, Joseph D [Rochester, MN; Smith, Brian E [Rochester, MN
2012-04-17
Methods, apparatus, and products are disclosed for reducing power consumption while synchronizing a plurality of compute nodes during execution of a parallel application that include: beginning, by each compute node, performance of a blocking operation specified by the parallel application, each compute node beginning the blocking operation asynchronously with respect to the other compute nodes; reducing, for each compute node, power to one or more hardware components of that compute node in response to that compute node beginning the performance of the blocking operation; and restoring, for each compute node, the power to the hardware components having power reduced in response to all of the compute nodes beginning the performance of the blocking operation.
On extending parallelism to serial simulators
NASA Technical Reports Server (NTRS)
Nicol, David; Heidelberger, Philip
1994-01-01
This paper describes an approach to discrete event simulation modeling that appears to be effective for developing portable and efficient parallel execution of models of large distributed systems and communication networks. In this approach, the modeler develops submodels using an existing sequential simulation modeling tool, using the full expressive power of the tool. A set of modeling language extensions permit automatically synchronized communication between submodels; however, the automation requires that any such communication must take a nonzero amount off simulation time. Within this modeling paradigm, a variety of conservative synchronization protocols can transparently support conservative execution of submodels on potentially different processors. A specific implementation of this approach, U.P.S. (Utilitarian Parallel Simulator), is described, along with performance results on the Intel Paragon.
1995-01-01
possible to determine communication points. For this version, a C program spawning Posix threads and using semaphores to synchronize would have to...performance such as the time required for network communication and synchronization as well as issues of asynchrony and memory hierarchy. For example...enhances reusability. Process (or task) parallel computations can also be succinctly expressed with a small set of process creation and synchronization
100 Gbps Wireless System and Circuit Design Using Parallel Spread-Spectrum Sequencing
NASA Astrophysics Data System (ADS)
Scheytt, J. Christoph; Javed, Abdul Rehman; Bammidi, Eswara Rao; KrishneGowda, Karthik; Kallfass, Ingmar; Kraemer, Rolf
2017-09-01
In this article mixed analog/digital signal processing techniques based on parallel spread-spectrum sequencing (PSSS) and radio frequency (RF) carrier synchronization for ultra-broadband wireless communication are investigated on system and circuit level.
Employment of a Dual Status Commander in a Multi-State Disaster Operation
2016-06-10
propagates parallel commands among affected states without a singular organization to synchronize and prioritize efforts. Thus, the central research ...without a singular organization to synchronize and prioritize efforts. Thus, the central research question is: How can laws be changed to support the...1 The Research Question
A tunable, solid, Fabry-Perot etalon for solar seismology
NASA Technical Reports Server (NTRS)
Rust, David M.; Burton, Clive H.; Leistner, Achim J.
1986-01-01
A solid etalon has been designed and fabricated from a 50-mm diameter wafer of optical-quality lithium niobate. The finished etalon has a free spectral range of 0.325 nm at 588 nm. The parallel faces are coated with silver, and the central 15-mm aperture of the etalon has a finesse of 18.6. The reflective faces double as electrodes, and application of voltage will shift the passband. This feature was used in a servo circuit to stabilize the passband against temperature and tilt-induced drifts to better than three parts in one billion. Operated in the stabilized mode for day-long sessions, this filter alternately samples the wings of a narrow atomic absorption line in the solar spectrum and produces a signal proportional to velocity on the solar disk. The Fourier transform of this signal yields information on acoustic waves in the solar interior.
PCTDSE: A parallel Cartesian-grid-based TDSE solver for modeling laser-atom interactions
NASA Astrophysics Data System (ADS)
Fu, Yongsheng; Zeng, Jiaolong; Yuan, Jianmin
2017-01-01
We present a parallel Cartesian-grid-based time-dependent Schrödinger equation (TDSE) solver for modeling laser-atom interactions. It can simulate the single-electron dynamics of atoms in arbitrary time-dependent vector potentials. We use a split-operator method combined with fast Fourier transforms (FFT), on a three-dimensional (3D) Cartesian grid. Parallelization is realized using a 2D decomposition strategy based on the Message Passing Interface (MPI) library, which results in a good parallel scaling on modern supercomputers. We give simple applications for the hydrogen atom using the benchmark problems coming from the references and obtain repeatable results. The extensions to other laser-atom systems are straightforward with minimal modifications of the source code.
Trace Element Analysis of Biological Samples.
ERIC Educational Resources Information Center
Veillon, Claude
1986-01-01
Reviews background of atomic absorption spectrometry techniques. Discusses problems encountered and precautions to be taken in determining trace elements in the parts-per-billion concentration range and below. Concentrates on determining chromium in biological samples by graphite furnace atomic absorption. Considers other elements, matrices, and…
Classical synchronization indicates persistent entanglement in isolated quantum systems
Witthaut, Dirk; Wimberger, Sandro; Burioni, Raffaella; Timme, Marc
2017-01-01
Synchronization and entanglement constitute fundamental collective phenomena in multi-unit classical and quantum systems, respectively, both equally implying coordinated system states. Here, we present a direct link for a class of isolated quantum many-body systems, demonstrating that synchronization emerges as an intrinsic system feature. Intriguingly, quantum coherence and entanglement arise persistently through the same transition as synchronization. This direct link between classical and quantum cooperative phenomena may further our understanding of strongly correlated quantum systems and can be readily observed in state-of-the-art experiments, for example, with ultracold atoms. PMID:28401881
Classical synchronization indicates persistent entanglement in isolated quantum systems.
Witthaut, Dirk; Wimberger, Sandro; Burioni, Raffaella; Timme, Marc
2017-04-12
Synchronization and entanglement constitute fundamental collective phenomena in multi-unit classical and quantum systems, respectively, both equally implying coordinated system states. Here, we present a direct link for a class of isolated quantum many-body systems, demonstrating that synchronization emerges as an intrinsic system feature. Intriguingly, quantum coherence and entanglement arise persistently through the same transition as synchronization. This direct link between classical and quantum cooperative phenomena may further our understanding of strongly correlated quantum systems and can be readily observed in state-of-the-art experiments, for example, with ultracold atoms.
ASSET: Analysis of Sequences of Synchronous Events in Massively Parallel Spike Trains
Canova, Carlos; Denker, Michael; Gerstein, George; Helias, Moritz
2016-01-01
With the ability to observe the activity from large numbers of neurons simultaneously using modern recording technologies, the chance to identify sub-networks involved in coordinated processing increases. Sequences of synchronous spike events (SSEs) constitute one type of such coordinated spiking that propagates activity in a temporally precise manner. The synfire chain was proposed as one potential model for such network processing. Previous work introduced a method for visualization of SSEs in massively parallel spike trains, based on an intersection matrix that contains in each entry the degree of overlap of active neurons in two corresponding time bins. Repeated SSEs are reflected in the matrix as diagonal structures of high overlap values. The method as such, however, leaves the task of identifying these diagonal structures to visual inspection rather than to a quantitative analysis. Here we present ASSET (Analysis of Sequences of Synchronous EvenTs), an improved, fully automated method which determines diagonal structures in the intersection matrix by a robust mathematical procedure. The method consists of a sequence of steps that i) assess which entries in the matrix potentially belong to a diagonal structure, ii) cluster these entries into individual diagonal structures and iii) determine the neurons composing the associated SSEs. We employ parallel point processes generated by stochastic simulations as test data to demonstrate the performance of the method under a wide range of realistic scenarios, including different types of non-stationarity of the spiking activity and different correlation structures. Finally, the ability of the method to discover SSEs is demonstrated on complex data from large network simulations with embedded synfire chains. Thus, ASSET represents an effective and efficient tool to analyze massively parallel spike data for temporal sequences of synchronous activity. PMID:27420734
NASA Astrophysics Data System (ADS)
Shoemaker, C. A.; Pang, M.; Akhtar, T.; Bindel, D.
2016-12-01
New parallel surrogate global optimization algorithms are developed and applied to objective functions that are expensive simulations (possibly with multiple local minima). The algorithms can be applied to most geophysical simulations, including those with nonlinear partial differential equations. The optimization does not require simulations be parallelized. Asynchronous (and synchronous) parallel execution is available in the optimization toolbox "pySOT". The parallel algorithms are modified from serial to eliminate fine grained parallelism. The optimization is computed with open source software pySOT, a Surrogate Global Optimization Toolbox that allows user to pick the type of surrogate (or ensembles), the search procedure on surrogate, and the type of parallelism (synchronous or asynchronous). pySOT also allows the user to develop new algorithms by modifying parts of the code. In the applications here, the objective function takes up to 30 minutes for one simulation, and serial optimization can take over 200 hours. Results from Yellowstone (NSF) and NCSS (Singapore) supercomputers are given for groundwater contaminant hydrology simulations with applications to model parameter estimation and decontamination management. All results are compared with alternatives. The first results are for optimization of pumping at many wells to reduce cost for decontamination of groundwater at a superfund site. The optimization runs with up to 128 processors. Superlinear speed up is obtained for up to 16 processors, and efficiency with 64 processors is over 80%. Each evaluation of the objective function requires the solution of nonlinear partial differential equations to describe the impact of spatially distributed pumping and model parameters on model predictions for the spatial and temporal distribution of groundwater contaminants. The second application uses an asynchronous parallel global optimization for groundwater quality model calibration. The time for a single objective function evaluation varies unpredictably, so efficiency is improved with asynchronous parallel calculations to improve load balancing. The third application (done at NCSS) incorporates new global surrogate multi-objective parallel search algorithms into pySOT and applies it to a large watershed calibration problem.
NASA Technical Reports Server (NTRS)
Ayguade, Eduard; Gonzalez, Marc; Martorell, Xavier; Jost, Gabriele
2004-01-01
In this paper we describe the parallelization of the multi-zone code versions of the NAS Parallel Benchmarks employing multi-level OpenMP parallelism. For our study we use the NanosCompiler, which supports nesting of OpenMP directives and provides clauses to control the grouping of threads, load balancing, and synchronization. We report the benchmark results, compare the timings with those of different hybrid parallelization paradigms and discuss OpenMP implementation issues which effect the performance of multi-level parallel applications.
More Effective Distributed ML via a Stale Synchronous Parallel Parameter Server
Ho, Qirong; Cipar, James; Cui, Henggang; Kim, Jin Kyu; Lee, Seunghak; Gibbons, Phillip B.; Gibson, Garth A.; Ganger, Gregory R.; Xing, Eric P.
2014-01-01
We propose a parameter server system for distributed ML, which follows a Stale Synchronous Parallel (SSP) model of computation that maximizes the time computational workers spend doing useful work on ML algorithms, while still providing correctness guarantees. The parameter server provides an easy-to-use shared interface for read/write access to an ML model’s values (parameters and variables), and the SSP model allows distributed workers to read older, stale versions of these values from a local cache, instead of waiting to get them from a central storage. This significantly increases the proportion of time workers spend computing, as opposed to waiting. Furthermore, the SSP model ensures ML algorithm correctness by limiting the maximum age of the stale values. We provide a proof of correctness under SSP, as well as empirical results demonstrating that the SSP model achieves faster algorithm convergence on several different ML problems, compared to fully-synchronous and asynchronous schemes. PMID:25400488
Polynuclear aromatic hydrocarbon analysis using the synchronous scanning luminoscope
NASA Astrophysics Data System (ADS)
Hyfantis, George J., Jr.; Teglas, Matthew S.; Wilbourn, Robert G.
2001-02-01
12 The Synchronous Scanning Luminoscope (SSL) is a field- portable, synchronous luminescence spectrofluorometer that was developed for on-site analysis of contaminated soil and ground water. The SSL is capable of quantitative analysis of total polynuclear aromatic hydrocarbons (PAHs) using phosphorescence and fluorescence techniques with a high correlation to laboratory data as illustrated by this study. The SSL is also capable of generating benzo(a)pyrene equivalency results, based on seven carcinogenic PAHs and Navy risk numbers, with a high correlation to laboratory data as illustrated by this study. These techniques allow rapid field assessments of total PAHs and benzo(a)pyrene equivalent concentrations. The Luminoscope is capable of detecting total PAHs to the parts per billion range. This paper describes standard field methods for using the SSL and describes the results of field/laboratory testing of PAHs. SSL results from two different hazardous waste sites are discussed.
Quantum synchronization and the no-photon laser
NASA Astrophysics Data System (ADS)
Holland, Murray
2014-03-01
This talk will present a new approach to lasers that is based on the quantum synchronization of many atoms. Such lasers are predicted to produce light of unprecedented spectral purity and coherence, some two orders of magnitude better than any system available today. The idea is based on superradiant emission, where an ensemble of atoms with an extremely narrow atomic transition can phase-lock and form a macroscopic dipole that radiates light collectively. This is quite unlike a typical laser where atoms essentially act independently. The resulting light source is expected to have a spectral linewidth of just a few millihertz and could lead to more accurate and stable atomic clocks. Atomic clocks based on optical transitions have improved tremendously in recent years, giving clocks that tick 1015 times per second, and can have a fractional stability exceeding one part in 1016. This new sharper light source aims to push the frontier even further, so that fundamental tests of physics, such as the time variation of constants and tests of gravity, might even be possible. We acknowledge support from NSF and the DARPA QuASAR program.
On-the-fly transition search and applications to temperature-accelerated dynamics
NASA Astrophysics Data System (ADS)
Shim, Yunsic; Amar, Jacques
2015-03-01
Temperature-accelerated dynamics (TAD) is a powerful method to study non-equilibrium processes and has been providing surprising insights for a variety of systems. While serial TAD simulations have been limited by the roughly N3 increase in the computational cost as a function of the number of atoms N in the system, recently we have shown that by carrying out parallel TAD simulations which combine spatial decomposition with our semi-rigorous synchronous sublattice algorithm, significantly improved scaling is possible. However, in this approach the size of activated events is limited by the processor size while the dynamics is not exact. Here we discuss progress in improving the scaling of serial TAD by combining the use of on-the-fly transition searching with our previously developed localized saddle-point method. We demonstrate improved performance for the cases of Ag/Ag(100) annealing and Cu/Cu(100) growth. Supported by NSF DMR-1410840.
2008-02-09
Campbell, S. Ogata, and F. Shimojo, “ Multimillion atom simulations of nanosystems on parallel computers,” in Proceedings of the International...nanomesas: multimillion -atom molecular dynamics simulations on parallel computers,” J. Appl. Phys. 94, 6762 (2003). 21. P. Vashishta, R. K. Kalia...and A. Nakano, “ Multimillion atom molecular dynamics simulations of nanoparticles on parallel computers,” Journal of Nanoparticle Research 5, 119-135
Li, Le-Bao; Sun, Ling-Ling; Zhang, Sheng-Zhou; Yang, Qing-Quan
2015-09-01
A new control approach for speed tracking and synchronization of multiple motors is developed, by incorporating an adaptive sliding mode control (ASMC) technique into a ring coupling synchronization control structure. This control approach can stabilize speed tracking of each motor and synchronize its motion with other motors' motion so that speed tracking errors and synchronization errors converge to zero. Moreover, an adaptive law is exploited to estimate the unknown bound of uncertainty, which is obtained in the sense of Lyapunov stability theorem to minimize the control effort and attenuate chattering. Performance comparisons with parallel control, relative coupling control and conventional PI control are investigated on a four-motor synchronization control system. Extensive simulation results show the effectiveness of the proposed control scheme. Copyright © 2015 ISA. Published by Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Gareev, F. A.; Zhidkova, I. E.
2007-03-01
We come to the conclusion that all atomic models based on either the Newton equation and the Kepler laws, or the Maxwell equations, or the Schrodinger and Dirac equations are in reasonable agreement with experimental data. We can only suspect that these equations are grounded on the same fundamental principle(s) which is (are) not known or these equations can be transformed into each other. We proposed a new mechanism of LENR: cooperative processes in the whole system nuclei + atoms + condensed matter - nuclear reactions in plasma - can occur at smaller threshold energies than the corresponding ones on free constituents. We were able to quantize phenomenologically the first time the differences between atomic and nuclear rest masses by the formula: δδM =n1/n2 X 0.0076294 (in MeV/ c^2), ni=1,2,3,.... Note that this quantization rule is justified for atoms and nuclei with different A, N and Z and the nuclei and atoms represent a coherent synchronized systems - a complex of coupled oscillators (resonators). The cooperative resonance synchronization mechanisms can explain how electron volt (atomic-) scale processes can induce and control nuclear MeV (nuclear-) scale processes and reactions., F.A. Gareev, I.E. Zhidkova, E-print arXiv Nucl-th/ 0610002 2006.
NASA Astrophysics Data System (ADS)
Lü, Hua-Ping; Wang, Shi-Hong; Li, Xiao-Wen; Tang, Guo-Ning; Kuang, Jin-Yu; Ye, Wei-Ping; Hu, Gang
2004-06-01
Two-dimensional one-way coupled map lattices are used for cryptography where multiple space units produce chaotic outputs in parallel. One of the outputs plays the role of driving for synchronization of the decryption system while the others perform the function of information encoding. With this separation of functions the receiver can establish a self-checking and self-correction mechanism, and enjoys the advantages of both synchronous and self-synchronizing schemes. A comparison between the present system with the system of advanced encryption standard (AES) is presented in the aspect of channel noise influence. Numerical investigations show that our system is much stronger than AES against channel noise perturbations, and thus can be better used for secure communications with large channel noise.
NASA Technical Reports Server (NTRS)
Arenstorf, Norbert S.; Jordan, Harry F.
1987-01-01
A barrier is a method for synchronizing a large number of concurrent computer processes. After considering some basic synchronization mechanisms, a collection of barrier algorithms with either linear or logarithmic depth are presented. A graphical model is described that profiles the execution of the barriers and other parallel programming constructs. This model shows how the interaction between the barrier algorithms and the work that they synchronize can impact their performance. One result is that logarithmic tree structured barriers show good performance when synchronizing fixed length work, while linear self-scheduled barriers show better performance when synchronizing fixed length work with an imbedded critical section. The linear barriers are better able to exploit the process skew associated with critical sections. Timing experiments, performed on an eighteen processor Flex/32 shared memory multiprocessor, that support these conclusions are detailed.
Design of object-oriented distributed simulation classes
NASA Technical Reports Server (NTRS)
Schoeffler, James D. (Principal Investigator)
1995-01-01
Distributed simulation of aircraft engines as part of a computer aided design package is being developed by NASA Lewis Research Center for the aircraft industry. The project is called NPSS, an acronym for 'Numerical Propulsion Simulation System'. NPSS is a flexible object-oriented simulation of aircraft engines requiring high computing speed. It is desirable to run the simulation on a distributed computer system with multiple processors executing portions of the simulation in parallel. The purpose of this research was to investigate object-oriented structures such that individual objects could be distributed. The set of classes used in the simulation must be designed to facilitate parallel computation. Since the portions of the simulation carried out in parallel are not independent of one another, there is the need for communication among the parallel executing processors which in turn implies need for their synchronization. Communication and synchronization can lead to decreased throughput as parallel processors wait for data or synchronization signals from other processors. As a result of this research, the following have been accomplished. The design and implementation of a set of simulation classes which result in a distributed simulation control program have been completed. The design is based upon MIT 'Actor' model of a concurrent object and uses 'connectors' to structure dynamic connections between simulation components. Connectors may be dynamically created according to the distribution of objects among machines at execution time without any programming changes. Measurements of the basic performance have been carried out with the result that communication overhead of the distributed design is swamped by the computation time of modules unless modules have very short execution times per iteration or time step. An analytical performance model based upon queuing network theory has been designed and implemented. Its application to realistic configurations has not been carried out.
Design of Object-Oriented Distributed Simulation Classes
NASA Technical Reports Server (NTRS)
Schoeffler, James D.
1995-01-01
Distributed simulation of aircraft engines as part of a computer aided design package being developed by NASA Lewis Research Center for the aircraft industry. The project is called NPSS, an acronym for "Numerical Propulsion Simulation System". NPSS is a flexible object-oriented simulation of aircraft engines requiring high computing speed. It is desirable to run the simulation on a distributed computer system with multiple processors executing portions of the simulation in parallel. The purpose of this research was to investigate object-oriented structures such that individual objects could be distributed. The set of classes used in the simulation must be designed to facilitate parallel computation. Since the portions of the simulation carried out in parallel are not independent of one another, there is the need for communication among the parallel executing processors which in turn implies need for their synchronization. Communication and synchronization can lead to decreased throughput as parallel processors wait for data or synchronization signals from other processors. As a result of this research, the following have been accomplished. The design and implementation of a set of simulation classes which result in a distributed simulation control program have been completed. The design is based upon MIT "Actor" model of a concurrent object and uses "connectors" to structure dynamic connections between simulation components. Connectors may be dynamically created according to the distribution of objects among machines at execution time without any programming changes. Measurements of the basic performance have been carried out with the result that communication overhead of the distributed design is swamped by the computation time of modules unless modules have very short execution times per iteration or time step. An analytical performance model based upon queuing network theory has been designed and implemented. Its application to realistic configurations has not been carried out.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cortese, Luca; Catinella, Barbara; Janowiecki, Steven, E-mail: luca.cortese@uwa.edu.au
Cold hydrogen gas is the raw fuel for star formation in galaxies, and its partition into atomic and molecular phases is a key quantity for galaxy evolution. In this Letter, we combine Atacama Large Millimeter/submillimeter Array and Arecibo single-dish observations to estimate the molecular-to-atomic hydrogen mass ratio for massive star-forming galaxies at z ∼ 0.2 extracted from the HIGHz survey, i.e., some of the most massive gas-rich systems currently known. We show that the balance between atomic and molecular hydrogen in these galaxies is similar to that of local main-sequence disks, implying that atomic hydrogen has been dominating the coldmore » gas mass budget of star-forming galaxies for at least the past three billion years. In addition, despite harboring gas reservoirs that are more typical of objects at the cosmic noon, HIGHz galaxies host regular rotating disks with low gas velocity dispersions suggesting that high total gas fractions do not necessarily drive high turbulence in the interstellar medium.« less
Measure synchronization in a spin-orbit-coupled bosonic Josephson junction
NASA Astrophysics Data System (ADS)
Wang, Wen-Yuan; Liu, Jie; Fu, Li-Bin
2015-11-01
We present measure synchronization (MS) in a bosonic Josephson junction with spin-orbit coupling. The two atomic hyperfine states are coupled by a Raman dressing scheme, and they are regarded as two orientations of a pseudo-spin-1 /2 system. A feature specific to a spin-orbit-coupled (SOC) bosonic Josephson junction is that the transition from non-MS to MS dynamics can be modulated by Raman laser intensity, even in the absence of interspin atomic interaction. A phase diagram of non-MS and MS dynamics as functions of Raman laser intensity and Josephson tunneling amplitude is presented. Taking into account interspin atomic interactions, the system exhibits MS breaking dynamics resulting from the competition between intraspin and interspin atomic interactions. When interspin atomic interactions dominate in the competition, the system always exhibits MS dynamics. For interspin interaction weaker than intraspin interaction, a window for non-MS dynamics is present. Since SOC Bose-Einstein condensates provide a powerful platform for studies on physical problems in various fields, the study of MS dynamics is valuable in researching the collective coherent dynamical behavior in a spin-orbit-coupled bosonic Josephson junction.
Synchronization Design and Error Analysis of Near-Infrared Cameras in Surgical Navigation.
Cai, Ken; Yang, Rongqian; Chen, Huazhou; Huang, Yizhou; Wen, Xiaoyan; Huang, Wenhua; Ou, Shanxing
2016-01-01
The accuracy of optical tracking systems is important to scientists. With the improvements reported in this regard, such systems have been applied to an increasing number of operations. To enhance the accuracy of these systems further and to reduce the effect of synchronization and visual field errors, this study introduces a field-programmable gate array (FPGA)-based synchronization control method, a method for measuring synchronous errors, and an error distribution map in field of view. Synchronization control maximizes the parallel processing capability of FPGA, and synchronous error measurement can effectively detect the errors caused by synchronization in an optical tracking system. The distribution of positioning errors can be detected in field of view through the aforementioned error distribution map. Therefore, doctors can perform surgeries in areas with few positioning errors, and the accuracy of optical tracking systems is considerably improved. The system is analyzed and validated in this study through experiments that involve the proposed methods, which can eliminate positioning errors attributed to asynchronous cameras and different fields of view.
Isomorphic Properties of Atoms, Molecules, Water, DNA, Crystals, Earth, SolarSystem and Galaxies
NASA Astrophysics Data System (ADS)
Gareev, F. A.; Gareeva, G. F.; Zhidkova, I. E.
2009-03-01
We discuss the cooperative resonance synchronization enhancement mechanisms of Low Energy Nuclear Reactions (LENR). Some of the low energy external fields can be used as triggers for starting and enhancing exothermic LENR. Any external field shortening distances between protons in nuclei and electrons in atoms should enhance beta-decay (capture) or double-beta decay (capture). We have proposed a new mechanism of LENR: cooperative resonance synchronization processes in the whole system nuclei+atoms+condensed matter+gaseuos+plasma medium, which we suggest can occur at a smaller threshold than the corresponding ones on free constituents. The cooperative processes can be induced and enhanced by low energy external fields. The excess heat is the emission of internal energy, and transmutations at LENR are the result of redistribution inner energy of the whole system.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Saprykin, E. G., E-mail: Saprykin@gorodok.net
2016-02-15
Four types of anomalous optical magnetic resonances shifted with respect to the zero magnetic field and with different shapes are found in radiation of a glow discharge in a mixture of even neon isotopes placed in a swept longitudinal magnetic field. This testifies to the manifestation of collective processes of synchronous light emission by oscillators belonging to isotopically different spatially separated atoms in discharge plasma. The origin of resonances is associated with nonstationary interference of reactive fields in the near radiation-field zones of emission of atoms, averaged over the lifetime of the fields (interference), while different types of resonances aremore » associated with different methods of synchronization of the phases of the fields.« less
Hong, Ie-Hong; Liao, Yung-Cheng; Tsai, Yung-Feng
2013-11-05
The perfectly ordered parallel arrays of periodic Ce silicide nanowires can self-organize with atomic precision on single-domain Si(110)-16 × 2 surfaces. The growth evolution of self-ordered parallel Ce silicide nanowire arrays is investigated over a broad range of Ce coverages on single-domain Si(110)-16 × 2 surfaces by scanning tunneling microscopy (STM). Three different types of well-ordered parallel arrays, consisting of uniformly spaced and atomically identical Ce silicide nanowires, are self-organized through the heteroepitaxial growth of Ce silicides on a long-range grating-like 16 × 2 reconstruction at the deposition of various Ce coverages. Each atomically precise Ce silicide nanowire consists of a bundle of chains and rows with different atomic structures. The atomic-resolution dual-polarity STM images reveal that the interchain coupling leads to the formation of the registry-aligned chain bundles within individual Ce silicide nanowire. The nanowire width and the interchain coupling can be adjusted systematically by varying the Ce coverage on a Si(110) surface. This natural template-directed self-organization of perfectly regular parallel nanowire arrays allows for the precise control of the feature size and positions within ±0.2 nm over a large area. Thus, it is a promising route to produce parallel nanowire arrays in a straightforward, low-cost, high-throughput process.
2013-01-01
The perfectly ordered parallel arrays of periodic Ce silicide nanowires can self-organize with atomic precision on single-domain Si(110)-16 × 2 surfaces. The growth evolution of self-ordered parallel Ce silicide nanowire arrays is investigated over a broad range of Ce coverages on single-domain Si(110)-16 × 2 surfaces by scanning tunneling microscopy (STM). Three different types of well-ordered parallel arrays, consisting of uniformly spaced and atomically identical Ce silicide nanowires, are self-organized through the heteroepitaxial growth of Ce silicides on a long-range grating-like 16 × 2 reconstruction at the deposition of various Ce coverages. Each atomically precise Ce silicide nanowire consists of a bundle of chains and rows with different atomic structures. The atomic-resolution dual-polarity STM images reveal that the interchain coupling leads to the formation of the registry-aligned chain bundles within individual Ce silicide nanowire. The nanowire width and the interchain coupling can be adjusted systematically by varying the Ce coverage on a Si(110) surface. This natural template-directed self-organization of perfectly regular parallel nanowire arrays allows for the precise control of the feature size and positions within ±0.2 nm over a large area. Thus, it is a promising route to produce parallel nanowire arrays in a straightforward, low-cost, high-throughput process. PMID:24188092
RAMA: A file system for massively parallel computers
NASA Technical Reports Server (NTRS)
Miller, Ethan L.; Katz, Randy H.
1993-01-01
This paper describes a file system design for massively parallel computers which makes very efficient use of a few disks per processor. This overcomes the traditional I/O bottleneck of massively parallel machines by storing the data on disks within the high-speed interconnection network. In addition, the file system, called RAMA, requires little inter-node synchronization, removing another common bottleneck in parallel processor file systems. Support for a large tertiary storage system can easily be integrated in lo the file system; in fact, RAMA runs most efficiently when tertiary storage is used.
Parallelization of the FLAPW method
NASA Astrophysics Data System (ADS)
Canning, A.; Mannstadt, W.; Freeman, A. J.
2000-08-01
The FLAPW (full-potential linearized-augmented plane-wave) method is one of the most accurate first-principles methods for determining structural, electronic and magnetic properties of crystals and surfaces. Until the present work, the FLAPW method has been limited to systems of less than about a hundred atoms due to the lack of an efficient parallel implementation to exploit the power and memory of parallel computers. In this work, we present an efficient parallelization of the method by division among the processors of the plane-wave components for each state. The code is also optimized for RISC (reduced instruction set computer) architectures, such as those found on most parallel computers, making full use of BLAS (basic linear algebra subprograms) wherever possible. Scaling results are presented for systems of up to 686 silicon atoms and 343 palladium atoms per unit cell, running on up to 512 processors on a CRAY T3E parallel supercomputer.
[Synchronous playing and acquiring of heart sounds and electrocardiogram based on labVIEW].
Dan, Chunmei; He, Wei; Zhou, Jing; Que, Xiaosheng
2008-12-01
In this paper is described a comprehensive system, which can acquire heart sounds and electrocardiogram (ECG) in parallel, synchronize the display; and play of heart sound and make auscultation and check phonocardiogram to tie in. The hardware system with C8051F340 as the core acquires the heart sound and ECG synchronously, and then sends them to indicators, respectively. Heart sounds are displayed and played simultaneously by controlling the moment of writing to indicator and sound output device. In clinical testing, heart sounds can be successfully located with ECG and real-time played.
Stereographic cloud heights from the imagery of two scan-synchronized geostationary satellites
NASA Technical Reports Server (NTRS)
Minzner, R. A.; Teagle, R. D.; Steranka, J.; Shenk, W. E.
1979-01-01
Scan synchronization of the sensors of two SMS-GOES satellites yields imagery from which cloud heights can be derived stereographically with a theoretical two-sigma random uncertainty of + or - 0.25 km for pairs of satellites separated by 60 degrees of longitude. Systematic height errors due to cloud motion can be kept below 100 m for all clouds with east-west components of speed below hurricane speed, provided the scan synchronization is within 40 seconds at the mid-point latitude, and the spin axis of each satellite is parallel to that of the earth.
Support of Multidimensional Parallelism in the OpenMP Programming Model
NASA Technical Reports Server (NTRS)
Jin, Hao-Qiang; Jost, Gabriele
2003-01-01
OpenMP is the current standard for shared-memory programming. While providing ease of parallel programming, the OpenMP programming model also has limitations which often effect the scalability of applications. Examples for these limitations are work distribution and point-to-point synchronization among threads. We propose extensions to the OpenMP programming model which allow the user to easily distribute the work in multiple dimensions and synchronize the workflow among the threads. The proposed extensions include four new constructs and the associated runtime library. They do not require changes to the source code and can be implemented based on the existing OpenMP standard. We illustrate the concept in a prototype translator and test with benchmark codes and a cloud modeling code.
Sakai, Mai; Morisaka, Tadamichi; Kogi, Kazunobu; Hishii, Toru; Kohshima, Shiro
2010-01-01
We quantitatively analysed synchronous breathing for dyads in Indo-Pacific bottlenose dolphins at Mikura Island, Tokyo, Japan. For most cases, we observed dyads swimming in the same direction (97%), in close proximity (i.e., less than 1.5m) and with their body axes parallel as they breathed synchronously. Moreover, the pairs engaged in identical behaviour before and after the synchronous breathing episodes. These results suggest that the dolphins synchronize their movements, and that synchronous breathing is a component of "pair-swimming", an affiliative social behaviour. Same sex pairs of the same age class frequently engaged in synchronous breathing for adults and subadults, as well as mother-calf and escort-calf pairs. The distance between individuals during synchronous breathing for mother-calf pairs was less than for other pairs. The distance observed between individuals for female pairs was less than for male pairs. The time differences between each exhale for each of the two dolphins involved in synchronous breathing episodes for female pairs were smaller than for male pairs, and time differences for adult pairs were smaller than subadult pairs. These results suggest that age and sex class influenced the characteristics of this behaviour. 2009 Elsevier B.V. All rights reserved.
Dynamic programming on a shared-memory multiprocessor
NASA Technical Reports Server (NTRS)
Edmonds, Phil; Chu, Eleanor; George, Alan
1993-01-01
Three new algorithms for solving dynamic programming problems on a shared-memory parallel computer are described. All three algorithms attempt to balance work load, while keeping synchronization cost low. In particular, for a multiprocessor having p processors, an analysis of the best algorithm shows that the arithmetic cost is O(n-cubed/6p) and that the synchronization cost is O(absolute value of log sub C n) if p much less than n, where C = (2p-1)/(2p + 1) and n is the size of the problem. The low synchronization cost is important for machines where synchronization is expensive. Analysis and experiments show that the best algorithm is effective in balancing the work load and producing high efficiency.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Perumalla, Kalyan S.; Alam, Maksudul
A novel parallel algorithm is presented for generating random scale-free networks using the preferential-attachment model. The algorithm, named cuPPA, is custom-designed for single instruction multiple data (SIMD) style of parallel processing supported by modern processors such as graphical processing units (GPUs). To the best of our knowledge, our algorithm is the first to exploit GPUs, and also the fastest implementation available today, to generate scale free networks using the preferential attachment model. A detailed performance study is presented to understand the scalability and runtime characteristics of the cuPPA algorithm. In one of the best cases, when executed on an NVidiamore » GeForce 1080 GPU, cuPPA generates a scale free network of a billion edges in less than 2 seconds.« less
Propulsion and stabilization system for magnetically levitated vehicles
Coffey, Howard T.
1993-06-29
A propulsion and stabilization system for an inductive repulsion type magnetically levitated vehicle which is propelled and stabilized by a system which includes propulsion windings mounted above and parallel to vehicle-borne suspension magnets. A linear synchronous motor is part of the vehicle guideway and is mounted above and parallel to superconducting magnets attached to the magnetically levitated vehicle.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brown, William Michael; Plimpton, Steven James; Wang, Peng
2010-03-01
LAMMPS is a classical molecular dynamics code, and an acronym for Large-scale Atomic/Molecular Massively Parallel Simulator. LAMMPS has potentials for soft materials (biomolecules, polymers) and solid-state materials (metals, semiconductors) and coarse-grained or mesoscopic systems. It can be used to model atoms or, more generically, as a parallel particle simulator at the atomic, meso, or continuum scale. LAMMPS runs on single processors or in parallel using message-passing techniques and a spatial-decomposition of the simulation domain. The code is designed to be easy to modify or extend with new functionality.
Time Warp Operating System (TWOS)
NASA Technical Reports Server (NTRS)
Bellenot, Steven F.
1993-01-01
Designed to support parallel discrete-event simulation, TWOS is complete implementation of Time Warp mechanism - distributed protocol for virtual time synchronization based on process rollback and message annihilation.
Controlador para un Reloj GPS de Referencia en el Protocolo NTP
NASA Astrophysics Data System (ADS)
Hauscarriaga, F.; Bareilles, F. A.
The synchronization between computers in a local network plays a very important role on enviroments similar to IAR. Calculations for exact time are needed before, during and after an observation. For this purpose the IAR's GNU/Linux Software Development Team implemented a driver inside NTP protocol (an internet standard for time synchronization of computers) for a GPS receiver acquired a few years ago by IAR, which did not have support in such protocol. Today our Institute has a stable and reliable time base synchronized to atomic clocks on board GPS Satellites according to computers's synchronization standard, offering precise time services to all scientific community and particularly to the University of La Plata. FULL TEXT IN SPANISH
Parallel computing of physical maps--a comparative study in SIMD and MIMD parallelism.
Bhandarkar, S M; Chirravuri, S; Arnold, J
1996-01-01
Ordering clones from a genomic library into physical maps of whole chromosomes presents a central computational problem in genetics. Chromosome reconstruction via clone ordering is usually isomorphic to the NP-complete Optimal Linear Arrangement problem. Parallel SIMD and MIMD algorithms for simulated annealing based on Markov chain distribution are proposed and applied to the problem of chromosome reconstruction via clone ordering. Perturbation methods and problem-specific annealing heuristics are proposed and described. The SIMD algorithms are implemented on a 2048 processor MasPar MP-2 system which is an SIMD 2-D toroidal mesh architecture whereas the MIMD algorithms are implemented on an 8 processor Intel iPSC/860 which is an MIMD hypercube architecture. A comparative analysis of the various SIMD and MIMD algorithms is presented in which the convergence, speedup, and scalability characteristics of the various algorithms are analyzed and discussed. On a fine-grained, massively parallel SIMD architecture with a low synchronization overhead such as the MasPar MP-2, a parallel simulated annealing algorithm based on multiple periodically interacting searches performs the best. For a coarse-grained MIMD architecture with high synchronization overhead such as the Intel iPSC/860, a parallel simulated annealing algorithm based on multiple independent searches yields the best results. In either case, distribution of clonal data across multiple processors is shown to exacerbate the tendency of the parallel simulated annealing algorithm to get trapped in a local optimum.
Synchronous parallel system for emulation and discrete event simulation
NASA Technical Reports Server (NTRS)
Steinman, Jeffrey S. (Inventor)
1992-01-01
A synchronous parallel system for emulation and discrete event simulation having parallel nodes responds to received messages at each node by generating event objects having individual time stamps, stores only the changes to state variables of the simulation object attributable to the event object, and produces corresponding messages. The system refrains from transmitting the messages and changing the state variables while it determines whether the changes are superseded, and then stores the unchanged state variables in the event object for later restoral to the simulation object if called for. This determination preferably includes sensing the time stamp of each new event object and determining which new event object has the earliest time stamp as the local event horizon, determining the earliest local event horizon of the nodes as the global event horizon, and ignoring the events whose time stamps are less than the global event horizon. Host processing between the system and external terminals enables such a terminal to query, monitor, command or participate with a simulation object during the simulation process.
Synchronous Parallel System for Emulation and Discrete Event Simulation
NASA Technical Reports Server (NTRS)
Steinman, Jeffrey S. (Inventor)
2001-01-01
A synchronous parallel system for emulation and discrete event simulation having parallel nodes responds to received messages at each node by generating event objects having individual time stamps, stores only the changes to the state variables of the simulation object attributable to the event object and produces corresponding messages. The system refrains from transmitting the messages and changing the state variables while it determines whether the changes are superseded, and then stores the unchanged state variables in the event object for later restoral to the simulation object if called for. This determination preferably includes sensing the time stamp of each new event object and determining which new event object has the earliest time stamp as the local event horizon, determining the earliest local event horizon of the nodes as the global event horizon, and ignoring events whose time stamps are less than the global event horizon. Host processing between the system and external terminals enables such a terminal to query, monitor, command or participate with a simulation object during the simulation process.
Automated problem scheduling and reduction of synchronization delay effects
NASA Technical Reports Server (NTRS)
Saltz, Joel H.
1987-01-01
It is anticipated that in order to make effective use of many future high performance architectures, programs will have to exhibit at least a medium grained parallelism. A framework is presented for partitioning very sparse triangular systems of linear equations that is designed to produce favorable preformance results in a wide variety of parallel architectures. Efficient methods for solving these systems are of interest because: (1) they provide a useful model problem for use in exploring heuristics for the aggregation, mapping and scheduling of relatively fine grained computations whose data dependencies are specified by directed acrylic graphs, and (2) because such efficient methods can find direct application in the development of parallel algorithms for scientific computation. Simple expressions are derived that describe how to schedule computational work with varying degrees of granularity. The Encore Multimax was used as a hardware simulator to investigate the performance effects of using the partitioning techniques presented in shared memory architectures with varying relative synchronization costs.
Synchronous parallel spatially resolved stochastic cluster dynamics
Dunn, Aaron; Dingreville, Rémi; Martínez, Enrique; ...
2016-04-23
In this work, a spatially resolved stochastic cluster dynamics (SRSCD) model for radiation damage accumulation in metals is implemented using a synchronous parallel kinetic Monte Carlo algorithm. The parallel algorithm is shown to significantly increase the size of representative volumes achievable in SRSCD simulations of radiation damage accumulation. Additionally, weak scaling performance of the method is tested in two cases: (1) an idealized case of Frenkel pair diffusion and annihilation, and (2) a characteristic example problem including defect cluster formation and growth in α-Fe. For the latter case, weak scaling is tested using both Frenkel pair and displacement cascade damage.more » To improve scaling of simulations with cascade damage, an explicit cascade implantation scheme is developed for cases in which fast-moving defects are created in displacement cascades. For the first time, simulation of radiation damage accumulation in nanopolycrystals can be achieved with a three dimensional rendition of the microstructure, allowing demonstration of the effect of grain size on defect accumulation in Frenkel pair-irradiated α-Fe.« less
Parallel-vector out-of-core equation solver for computational mechanics
NASA Technical Reports Server (NTRS)
Qin, J.; Agarwal, T. K.; Storaasli, O. O.; Nguyen, D. T.; Baddourah, M. A.
1993-01-01
A parallel/vector out-of-core equation solver is developed for shared-memory computers, such as the Cray Y-MP machine. The input/ output (I/O) time is reduced by using the a synchronous BUFFER IN and BUFFER OUT, which can be executed simultaneously with the CPU instructions. The parallel and vector capability provided by the supercomputers is also exploited to enhance the performance. Numerical applications in large-scale structural analysis are given to demonstrate the efficiency of the present out-of-core solver.
Performance Implications of Synchronization Support for Parallel FORTRAN Programs
1991-06-17
applications we used in this study are BDNA and FLO52. BDNA is a molecular dy- I namics simulator for biomolecules in water and it uses ordinary...parallelism structures and loop granularity. In the BDNA program, most of the parallel loops are not nested and the iterations are 200-1000 instructions long...are of concern. The BDNA curve in Figure 21 shows that for this program only 17% of all 32 I I 100 BDNA -4 FLO52 -I 80 3 CumuilatQe percentage of3
Detecting Potential Synchronization Constraint Deadlocks from Formal System Specifications
1992-03-01
family of languages, consisting of the Larch Shared Language and a series of Larch interface languages, specific to particular programming languages...specify sequential (non- concurrent) programs , and explicitly does not include the ability to specify atomic actions (Guttag, 1985). Larch is therefore...synchronized communication between two such agents is ronsidered as a single action. The transitions in CCS trees are labelled to show how they are
SCaLeM: A Framework for Characterizing and Analyzing Execution Models
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chavarría-Miranda, Daniel; Manzano Franco, Joseph B.; Krishnamoorthy, Sriram
2014-10-13
As scalable parallel systems evolve towards more complex nodes with many-core architectures and larger trans-petascale & upcoming exascale deployments, there is a need to understand, characterize and quantify the underlying execution models being used on such systems. Execution models are a conceptual layer between applications & algorithms and the underlying parallel hardware and systems software on which those applications run. This paper presents the SCaLeM (Synchronization, Concurrency, Locality, Memory) framework for characterizing and execution models. SCaLeM consists of three basic elements: attributes, compositions and mapping of these compositions to abstract parallel systems. The fundamental Synchronization, Concurrency, Locality and Memory attributesmore » are used to characterize each execution model, while the combinations of those attributes in the form of compositions are used to describe the primitive operations of the execution model. The mapping of the execution model’s primitive operations described by compositions, to an underlying abstract parallel system can be evaluated quantitatively to determine its effectiveness. Finally, SCaLeM also enables the representation and analysis of applications in terms of execution models, for the purpose of evaluating the effectiveness of such mapping.« less
Estimating phase synchronization in dynamical systems using cellular nonlinear networks
NASA Astrophysics Data System (ADS)
Sowa, Robert; Chernihovskyi, Anton; Mormann, Florian; Lehnertz, Klaus
2005-06-01
We propose a method for estimating phase synchronization between time series using the parallel computing architecture of cellular nonlinear networks (CNN’s). Applying this method to time series of coupled nonlinear model systems and to electroencephalographic time series from epilepsy patients, we show that an accurate approximation of the mean phase coherence R —a bivariate measure for phase synchronization—can be achieved with CNN’s using polynomial-type templates.
Method of synchronizing independent functional unit
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Changhoan
A system for synchronizing parallel processing of a plurality of functional processing units (FPU), a first FPU and a first program counter to control timing of a first stream of program instructions issued to the first FPU by advancement of the first program counter; a second FPU and a second program counter to control timing of a second stream of program instructions issued to the second FPU by advancement of the second program counter, the first FPU is in communication with a second FPU to synchronize the issuance of a first stream of program instructions to the second stream ofmore » program instructions and the second FPU is in communication with the first FPU to synchronize the issuance of the second stream program instructions to the first stream of program instructions.« less
Method of synchronizing independent functional unit
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Changhoan
2017-05-16
A system for synchronizing parallel processing of a plurality of functional processing units (FPU), a first FPU and a first program counter to control timing of a first stream of program instructions issued to the first FPU by advancement of the first program counter; a second FPU and a second program counter to control timing of a second stream of program instructions issued to the second FPU by advancement of the second program counter, the first FPU is in communication with a second FPU to synchronize the issuance of a first stream of program instructions to the second stream ofmore » program instructions and the second FPU is in communication with the first FPU to synchronize the issuance of the second stream program instructions to the first stream of program instructions.« less
Method of synchronizing independent functional unit
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Changhoan
2017-02-14
A system for synchronizing parallel processing of a plurality of functional processing units (FPU), a first FPU and a first program counter to control timing of a first stream of program instructions issued to the first FPU by advancement of the first program counter; a second FPU and a second program counter to control timing of a second stream of program instructions issued to the second FPU by advancement of the second program counter, the first FPU is in communication with a second FPU to synchronize the issuance of a first stream of program instructions to the second stream ofmore » program instructions and the second FPU is in communication with the first FPU to synchronize the issuance of the second stream program instructions to the first stream of program instructions.« less
Anisotropic pitch angle distribution of 50 eV to 50 keV particles at synchronous altitude.
NASA Technical Reports Server (NTRS)
Deforest, S. E.; Mcilwain, C. F.
1972-01-01
At times, the electron pitch angle distributions at synchronous orbit have been observed to be highly anisotropic. In the local morning region, distributions concentrated near 90 deg are often observed in particles of less than approximately 2000 V. This anisotropy decreases with increasing energy from 1 keV to the detector's limit at 50 keV. The time development of anisotropy is consistent with production by pitch angle scattering processes which are not effective on electrons with small velocities parallel to the magnetic field. Another type of distribution has been observed with the low-energy (below 1000 V) electrons concentrated parallel and antiparallel to the magnetic field. These distributions are only seen in the dusk sector, but this may be an orbital artifact.
Parallelization of the FLAPW method and comparison with the PPW method
NASA Astrophysics Data System (ADS)
Canning, Andrew; Mannstadt, Wolfgang; Freeman, Arthur
2000-03-01
The FLAPW (full-potential linearized-augmented plane-wave) method is one of the most accurate first-principles methods for determining electronic and magnetic properties of crystals and surfaces. In the past the FLAPW method has been limited to systems of about a hundred atoms due to the lack of an efficient parallel implementation to exploit the power and memory of parallel computers. In this work we present an efficient parallelization of the method by division among the processors of the plane-wave components for each state. The code is also optimized for RISC (reduced instruction set computer) architectures, such as those found on most parallel computers, making full use of BLAS (basic linear algebra subprograms) wherever possible. Scaling results are presented for systems of up to 686 silicon atoms and 343 palladium atoms per unit cell running on up to 512 processors on a Cray T3E parallel supercomputer. Some results will also be presented on a comparison of the plane-wave pseudopotential method and the FLAPW method on large systems.
Accurate frequency and time dissemination in the optical domain
NASA Astrophysics Data System (ADS)
Khabarova, K. Yu; Kalganova, E. S.; Kolachevsky, N. N.
2018-02-01
The development of the optical frequency comb technique has enabled a wide use of atomic optical clocks by allowing frequency conversion from the optical to the radio frequency range. Today, the fractional instability of such clocks has reached the record eighteen-digit level, two orders of magnitude better than for cesium fountains representing the primary frequency standard. This is paralleled by the development of techniques for transferring accurate time and optical frequency signals, including fiber links. With this technology, the fractional instability of transferred frequency can be lowered to below 10‑18 with an averaging time of 1000 s for a 1000 km optical link. At a distance of 500 km, a time signal uncertainty of 250 ps has been achieved. Optical links allow comparing optical clocks and creating a synchronized time and frequency standard network at a new level of precision. Prospects for solving new problems arise, including the determination of the gravitational potential, the measurement of the continental Sagnac effect, and precise tests of fundamental theories.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Castellana, Vito G.; Tumeo, Antonino; Ferrandi, Fabrizio
Emerging applications such as data mining, bioinformatics, knowledge discovery, social network analysis are irregular. They use data structures based on pointers or linked lists, such as graphs, unbalanced trees or unstructures grids, which generates unpredictable memory accesses. These data structures usually are large, but difficult to partition. These applications mostly are memory bandwidth bounded and have high synchronization intensity. However, they also have large amounts of inherent dynamic parallelism, because they potentially perform a task for each one of the element they are exploring. Several efforts are looking at accelerating these applications on hybrid architectures, which integrate general purpose processorsmore » with reconfigurable devices. Some solutions, which demonstrated significant speedups, include custom-hand tuned accelerators or even full processor architectures on the reconfigurable logic. In this paper we present an approach for the automatic synthesis of accelerators from C, targeted at irregular applications. In contrast to typical High Level Synthesis paradigms, which construct a centralized Finite State Machine, our approach generates dynamically scheduled hardware components. While parallelism exploitation in typical HLS-generated accelerators is usually bound within a single execution flow, our solution allows concurrently running multiple execution flow, thus also exploiting the coarser grain task parallelism of irregular applications. Our approach supports multiple, multi-ported and distributed memories, and atomic memory operations. Its main objective is parallelizing as many memory operations as possible, independently from their execution time, to maximize the memory bandwidth utilization. This significantly differs from current HLS flows, which usually consider a single memory port and require precise scheduling of memory operations. A key innovation of our approach is the generation of a memory interface controller, which dynamically maps concurrent memory accesses to multiple ports. We present a case study on a typical irregular kernel, Graph Breadth First search (BFS), exploring different tradeoffs in terms of parallelism and number of memories.« less
Parallel Processing of Broad-Band PPM Signals
NASA Technical Reports Server (NTRS)
Gray, Andrew; Kang, Edward; Lay, Norman; Vilnrotter, Victor; Srinivasan, Meera; Lee, Clement
2010-01-01
A parallel-processing algorithm and a hardware architecture to implement the algorithm have been devised for timeslot synchronization in the reception of pulse-position-modulated (PPM) optical or radio signals. As in the cases of some prior algorithms and architectures for parallel, discrete-time, digital processing of signals other than PPM, an incoming broadband signal is divided into multiple parallel narrower-band signals by means of sub-sampling and filtering. The number of parallel streams is chosen so that the frequency content of the narrower-band signals is low enough to enable processing by relatively-low speed complementary metal oxide semiconductor (CMOS) electronic circuitry. The algorithm and architecture are intended to satisfy requirements for time-varying time-slot synchronization and post-detection filtering, with correction of timing errors independent of estimation of timing errors. They are also intended to afford flexibility for dynamic reconfiguration and upgrading. The architecture is implemented in a reconfigurable CMOS processor in the form of a field-programmable gate array. The algorithm and its hardware implementation incorporate three separate time-varying filter banks for three distinct functions: correction of sub-sample timing errors, post-detection filtering, and post-detection estimation of timing errors. The design of the filter bank for correction of timing errors, the method of estimating timing errors, and the design of a feedback-loop filter are governed by a host of parameters, the most critical one, with regard to processing very broadband signals with CMOS hardware, being the number of parallel streams (equivalently, the rate-reduction parameter).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shah, Nihar; Wei, Max; Letschert, Virginie
2015-10-01
Hydrofluorocarbons (HFCs) emitted from uses such as refrigerants and thermal insulating foam, are now the fastest growing greenhouse gases (GHGs), with global warming potentials (GWP) thousands of times higher than carbon dioxide (CO2). Because of the short lifetime of these molecules in the atmosphere, mitigating the amount of these short-lived climate pollutants (SLCPs) provides a faster path to climate change mitigation than control of CO2 alone. This has led to proposals from Africa, Europe, India, Island States, and North America to amend the Montreal Protocol on Substances that Deplete the Ozone Layer (Montreal Protocol) to phase-down high-GWP HFCs. Simultaneously, energymore » efficiency market transformation programs such as standards, labeling and incentive programs are endeavoring to improve the energy efficiency for refrigeration and air conditioning equipment to provide life cycle cost, energy, GHG, and peak load savings. In this paper we provide an estimate of the magnitude of such GHG and peak electric load savings potential, for room air conditioning, if the refrigerant transition and energy efficiency improvement policies are implemented either separately or in parallel. We find that implementing HFC refrigerant transition and energy efficiency improvement policies in parallel for room air conditioning, roughly doubles the benefit of either policy implemented separately. We estimate that shifting the 2030 world stock of room air conditioners from the low efficiency technology using high-GWP refrigerants to higher efficiency technology and low-GWP refrigerants in parallel would save between 340-790 gigawatts (GW) of peak load globally, which is roughly equivalent to avoiding 680-1550 peak power plants of 500MW each. This would save 0.85 GT/year annually in China equivalent to over 8 Three Gorges dams and over 0.32 GT/year annually in India equivalent to roughly twice India’s 100GW solar mission target. While there is some uncertainty associated with emissions and growth projections, moving to efficient room air conditioning (~30% more efficient than current technology) in parallel with low-GWP refrigerants in room air conditioning could avoid up to ~25 billion tonnes of CO2 in 2030, ~33 billion in 2040, and ~40 billion in 2050, i.e. cumulative savings up to 98 billion tonnes of CO2 by 2050. Therefore, superefficient room ACs using low-GWP refrigerants merit serious consideration to maximize peak load reduction and GHG savings.« less
Clonal origins and parallel evolution of regionally synchronous colorectal adenoma and carcinoma.
Kim, Tae-Min; An, Chang Hyeok; Rhee, Je-Keun; Jung, Seung-Hyun; Lee, Sung Hak; Baek, In-Pyo; Kim, Min Sung; Lee, Sug Hyung; Chung, Yeun-Jun
2015-09-29
Although the colorectal adenoma-to-carcinoma sequence represents a classical cancer progression model, the evolution of the mutational landscape underlying this model is not fully understood. In this study, we analyzed eight synchronous pairs of colorectal high-grade adenomas and carcinomas, four microsatellite-unstable (MSU) and four-stable (MSS) pairs, using whole-exome sequencing. In the MSU adenoma-carcinoma pairs, we observed no subclonal mutations in adenomas that became fixed in paired carcinomas, suggesting a 'parallel' evolution of synchronous adenoma-to-carcinoma, rather than a 'stepwise' evolution. The abundance of indel (in MSU and MSS pairs) and microsatellite instability (in MSU pairs) was noted in the later adenoma- or carcinoma-specific mutations, indicating that the mutational processes and functional constraints operative in early and late colorectal carcinogenesis are different. All MSU cases exhibited clonal, truncating mutations in ACVR2A, TGFBR2, and DNA mismatch repair genes, but none were present in APC or KRAS. In three MSS pairs, both APC and KRAS mutations were identified as both early and clonal events, often accompanying clonal copy number changes. An MSS case uniquely exhibited clonal ERBB2 amplification, followed by APC and TP53 mutations as carcinoma-specific events. Along with the previously unrecognized clonal origins of synchronous colorectal adenoma-carcinoma pairs, our study revealed that the preferred sequence of mutational events during colorectal carcinogenesis can be context-dependent.
Parallel and Distributed Systems for Probabilistic Reasoning
2012-12-01
work at CMU I had the opportunity to work with Andreas Krause on Gaussian process models for signal quality estimation in wireless sensor networks ...we reviewed the natural parallelization of the belief propagation algorithm using the synchronous schedule and demonstrated both theoretically and...problem is that the power-law sparsity structure, commonly found in graphs derived from natural phenomena (e.g., social networks and the web
Design of a MIMD neural network processor
NASA Astrophysics Data System (ADS)
Saeks, Richard E.; Priddy, Kevin L.; Pap, Robert M.; Stowell, S.
1994-03-01
The Accurate Automation Corporation (AAC) neural network processor (NNP) module is a fully programmable multiple instruction multiple data (MIMD) parallel processor optimized for the implementation of neural networks. The AAC NNP design fully exploits the intrinsic sparseness of neural network topologies. Moreover, by using a MIMD parallel processing architecture one can update multiple neurons in parallel with efficiency approaching 100 percent as the size of the network increases. Each AAC NNP module has 8 K neurons and 32 K interconnections and is capable of 140,000,000 connections per second with an eight processor array capable of over one billion connections per second.
Determination of gold in geologic materials by solvent extraction and atomic-absorption spectrometry
Huffman, Claude; Mensik, J.D.; Riley, L.B.
1967-01-01
The two methods presented for the determination of traces of gold in geologic materials are the cyanide atomic-absorption method and the fire-assay atomic-absorption method. In the cyanide method gold is leached with a sodium-cyanide solution. The monovalent gold is then oxidized to the trivalent state and concentrated by extracting into methyl isobutyl ketone prior to estimation by atomic absorption. In the fire-assay atomic-absorption method, the gold-silver bead obtained from fire assay is dissolved in nitric and hydrochloric acids. Gold is then concentrated by extracting into methyl isobutyl ketone prior to determination by atomic absorption. By either method concentrations as low as 50 parts per billion of gold can be determined in a 15-gram sample.
DGDFT: A massively parallel method for large scale density functional theory calculations.
Hu, Wei; Lin, Lin; Yang, Chao
2015-09-28
We describe a massively parallel implementation of the recently developed discontinuous Galerkin density functional theory (DGDFT) method, for efficient large-scale Kohn-Sham DFT based electronic structure calculations. The DGDFT method uses adaptive local basis (ALB) functions generated on-the-fly during the self-consistent field iteration to represent the solution to the Kohn-Sham equations. The use of the ALB set provides a systematic way to improve the accuracy of the approximation. By using the pole expansion and selected inversion technique to compute electron density, energy, and atomic forces, we can make the computational complexity of DGDFT scale at most quadratically with respect to the number of electrons for both insulating and metallic systems. We show that for the two-dimensional (2D) phosphorene systems studied here, using 37 basis functions per atom allows us to reach an accuracy level of 1.3 × 10(-4) Hartree/atom in terms of the error of energy and 6.2 × 10(-4) Hartree/bohr in terms of the error of atomic force, respectively. DGDFT can achieve 80% parallel efficiency on 128,000 high performance computing cores when it is used to study the electronic structure of 2D phosphorene systems with 3500-14 000 atoms. This high parallel efficiency results from a two-level parallelization scheme that we will describe in detail.
Synchronizing compute node time bases in a parallel computer
Chen, Dong; Faraj, Daniel A; Gooding, Thomas M; Heidelberger, Philip
2015-01-27
Synchronizing time bases in a parallel computer that includes compute nodes organized for data communications in a tree network, where one compute node is designated as a root, and, for each compute node: calculating data transmission latency from the root to the compute node; configuring a thread as a pulse waiter; initializing a wakeup unit; and performing a local barrier operation; upon each node completing the local barrier operation, entering, by all compute nodes, a global barrier operation; upon all nodes entering the global barrier operation, sending, to all the compute nodes, a pulse signal; and for each compute node upon receiving the pulse signal: waking, by the wakeup unit, the pulse waiter; setting a time base for the compute node equal to the data transmission latency between the root node and the compute node; and exiting the global barrier operation.
Synchronizing compute node time bases in a parallel computer
Chen, Dong; Faraj, Daniel A; Gooding, Thomas M; Heidelberger, Philip
2014-12-30
Synchronizing time bases in a parallel computer that includes compute nodes organized for data communications in a tree network, where one compute node is designated as a root, and, for each compute node: calculating data transmission latency from the root to the compute node; configuring a thread as a pulse waiter; initializing a wakeup unit; and performing a local barrier operation; upon each node completing the local barrier operation, entering, by all compute nodes, a global barrier operation; upon all nodes entering the global barrier operation, sending, to all the compute nodes, a pulse signal; and for each compute node upon receiving the pulse signal: waking, by the wakeup unit, the pulse waiter; setting a time base for the compute node equal to the data transmission latency between the root node and the compute node; and exiting the global barrier operation.
Parallel pulse processing and data acquisition for high speed, low error flow cytometry
van den Engh, Gerrit J.; Stokdijk, Willem
1992-01-01
A digitally synchronized parallel pulse processing and data acquisition system for a flow cytometer has multiple parallel input channels with independent pulse digitization and FIFO storage buffer. A trigger circuit controls the pulse digitization on all channels. After an event has been stored in each FIFO, a bus controller moves the oldest entry from each FIFO buffer onto a common data bus. The trigger circuit generates an ID number for each FIFO entry, which is checked by an error detection circuit. The system has high speed and low error rate.
HEP - A semaphore-synchronized multiprocessor with central control. [Heterogeneous Element Processor
NASA Technical Reports Server (NTRS)
Gilliland, M. C.; Smith, B. J.; Calvert, W.
1976-01-01
The paper describes the design concept of the Heterogeneous Element Processor (HEP), a system tailored to the special needs of scientific simulation. In order to achieve high-speed computation required by simulation, HEP features a hierarchy of processes executing in parallel on a number of processors, with synchronization being largely accomplished by hardware. A full-empty-reserve scheme of synchronization is realized by zero-one-valued hardware semaphores. A typical system has, besides the control computer and the scheduler, an algebraic module, a memory module, a first-in first-out (FIFO) module, an integrator module, and an I/O module. The architecture of the scheduler and the algebraic module is examined in detail.
Parallel performance of TORT on the CRAY J90: Model and measurement
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barnett, A.; Azmy, Y.Y.
1997-10-01
A limitation on the parallel performance of TORT on the CRAY J90 is the amount of extra work introduced by the multitasking algorithm itself. The extra work beyond that of the serial version of the code, called overhead, arises from the synchronization of the parallel tasks and the accumulation of results by the master task. The goal of recent updates to TORT was to reduce the time consumed by these activities. To help understand which components of the multitasking algorithm contribute significantly to the overhead, a parallel performance model was constructed and compared to measurements of actual timings of themore » code.« less
Communications oriented programming of parallel iterative solutions of sparse linear systems
NASA Technical Reports Server (NTRS)
Patrick, M. L.; Pratt, T. W.
1986-01-01
Parallel algorithms are developed for a class of scientific computational problems by partitioning the problems into smaller problems which may be solved concurrently. The effectiveness of the resulting parallel solutions is determined by the amount and frequency of communication and synchronization and the extent to which communication can be overlapped with computation. Three different parallel algorithms for solving the same class of problems are presented, and their effectiveness is analyzed from this point of view. The algorithms are programmed using a new programming environment. Run-time statistics and experience obtained from the execution of these programs assist in measuring the effectiveness of these algorithms.
Program For Parallel Discrete-Event Simulation
NASA Technical Reports Server (NTRS)
Beckman, Brian C.; Blume, Leo R.; Geiselman, John S.; Presley, Matthew T.; Wedel, John J., Jr.; Bellenot, Steven F.; Diloreto, Michael; Hontalas, Philip J.; Reiher, Peter L.; Weiland, Frederick P.
1991-01-01
User does not have to add any special logic to aid in synchronization. Time Warp Operating System (TWOS) computer program is special-purpose operating system designed to support parallel discrete-event simulation. Complete implementation of Time Warp mechanism. Supports only simulations and other computations designed for virtual time. Time Warp Simulator (TWSIM) subdirectory contains sequential simulation engine interface-compatible with TWOS. TWOS and TWSIM written in, and support simulations in, C programming language.
On program restructuring, scheduling, and communication for parallel processor systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Polychronopoulos, Constantine D.
1986-08-01
This dissertation discusses several software and hardware aspects of program execution on large-scale, high-performance parallel processor systems. The issues covered are program restructuring, partitioning, scheduling and interprocessor communication, synchronization, and hardware design issues of specialized units. All this work was performed focusing on a single goal: to maximize program speedup, or equivalently, to minimize parallel execution time. Parafrase, a Fortran restructuring compiler was used to transform programs in a parallel form and conduct experiments. Two new program restructuring techniques are presented, loop coalescing and subscript blocking. Compile-time and run-time scheduling schemes are covered extensively. Depending on the program construct, thesemore » algorithms generate optimal or near-optimal schedules. For the case of arbitrarily nested hybrid loops, two optimal scheduling algorithms for dynamic and static scheduling are presented. Simulation results are given for a new dynamic scheduling algorithm. The performance of this algorithm is compared to that of self-scheduling. Techniques for program partitioning and minimization of interprocessor communication for idealized program models and for real Fortran programs are also discussed. The close relationship between scheduling, interprocessor communication, and synchronization becomes apparent at several points in this work. Finally, the impact of various types of overhead on program speedup and experimental results are presented.« less
The 20-minute team--a critical case study from the emergency room.
Berlin, Johan M; Carlström, Eric D
2008-08-01
In this article, the difference between team and group is tested empirically. The research question posed is How are teams formed? Three theoretical concepts that distinguish groups from teams are presented: sequentiality, parallelism and synchronicity. The presumption is that groups cooperate sequentially and teams synchronously, while parallel cooperation is a transition between group and team. To answer the question, a longitudinal case study has been made of a trauma team at a university hospital. Data have been collected through interviews and direct observations. Altogether the work of the trauma team has been studied for a period of 5 years (2002-2006). The results indicate that two factors are of central importance for the creation of a team. The first is related to its management and the other to the forms of cooperation. To allow for a team to act rapidly and to reduce friction between different members, clear leadership is required. The studied team developed cooperation with synchronous elements but never attained a level that corresponds to idealized conceptions of teams. This is used as a basis for challenging ideas that teams are harmonious and free from conflicts and that cooperation takes place without friction.
NASA Astrophysics Data System (ADS)
Yasuda, Shugo; Yamamoto, Ryoichi
2015-11-01
The Synchronized Molecular-Dynamics simulation which was recently proposed by authors is applied to the analysis of polymer lubrication between parallel plates. In the SMD method, the MD simulations are assigned to small fluid elements to calculate the local stresses and temperatures and are synchronized at certain time intervals to satisfy the macroscopic heat- and momentum-transport equations.The rheological properties and conformation of the polymer chains coupled with local viscous heating are investigated with a non-dimensional parameter, the Nahme-Griffith number, which is defined as the ratio of the viscous heating to the thermal conduction at the characteristic temperature required to sufficiently change the viscosity. The present simulation demonstrates that strong shear thinning and a transitional behavior of the conformation of the polymer chains are exhibited with a rapid temperature rise when the Nahme-Griffith number exceeds unity.The results also clarify that the reentrant transition of the linear stress-optical relation occurs for large shear stresses due to the coupling of the conformation of polymer chains with heat generation under shear flows. This study was financially supported by JSPS KAKENHI Grant Nos. 26790080 and 26247069.
Synchronization trigger control system for flow visualization
NASA Technical Reports Server (NTRS)
Chun, K. S.
1987-01-01
The use of cinematography or holographic interferometry for dynamic flow visualization in an internal combustion engine requires a control device that globally synchronizes camera and light source timing at a predefined shaft encoder angle. The device is capable of 0.35 deg resolution for rotational speeds of up to 73 240 rpm. This was achieved by implementing the shaft encoder signal addressed look-up table (LUT) and appropriate latches. The developed digital signal processing technique achieves 25 nsec of high speed triggering angle detection by using direct parallel bit comparison of the shaft encoder digital code with a simulated angle reference code, instead of using angle value comparison which involves more complicated computation steps. In order to establish synchronization to an AC reference signal whose magnitude is variant with the rotating speed, a dynamic peak followup synchronization technique has been devised. This method scrutinizes the reference signal and provides the right timing within 40 nsec. Two application examples are described.
NASA Astrophysics Data System (ADS)
Yang, Ying; Liu, Xiaobao; Wang, Jieci; Jing, Jiliang
2018-03-01
We study how to improve the precision of the quantum estimation of phase for an uniformly accelerated atom in fluctuating electromagnetic field by reflecting boundaries. We find that the precision decreases with increases of the acceleration without the boundary. With the presence of a reflecting boundary, the precision depends on the atomic polarization, position and acceleration, which can be effectively enhanced compared to the case without boundary if we choose the appropriate conditions. In particular, with the presence of two parallel reflecting boundaries, we obtain the optimal precision for atomic parallel polarization and the special distance between two boundaries, as if the atom were shielded from the fluctuation.
Mn-silicide nanostructures aligned on massively parallel silicon nano-ribbons
NASA Astrophysics Data System (ADS)
De Padova, Paola; Ottaviani, Carlo; Ronci, Fabio; Colonna, Stefano; Olivieri, Bruno; Quaresima, Claudio; Cricenti, Antonio; Dávila, Maria E.; Hennies, Franz; Pietzsch, Annette; Shariati, Nina; Le Lay, Guy
2013-01-01
The growth of Mn nanostructures on a 1D grating of silicon nano-ribbons is investigated at atomic scale by means of scanning tunneling microscopy, low energy electron diffraction and core level photoelectron spectroscopy. The grating of silicon nano-ribbons represents an atomic scale template that can be used in a surface-driven route to control the combination of Si with Mn in the development of novel materials for spintronics devices. The Mn atoms show a preferential adsorption site on silicon atoms, forming one-dimensional nanostructures. They are parallel oriented with respect to the surface Si array, which probably predetermines the diffusion pathways of the Mn atoms during the process of nanostructure formation.
Mn-silicide nanostructures aligned on massively parallel silicon nano-ribbons.
De Padova, Paola; Ottaviani, Carlo; Ronci, Fabio; Colonna, Stefano; Olivieri, Bruno; Quaresima, Claudio; Cricenti, Antonio; Dávila, Maria E; Hennies, Franz; Pietzsch, Annette; Shariati, Nina; Le Lay, Guy
2013-01-09
The growth of Mn nanostructures on a 1D grating of silicon nano-ribbons is investigated at atomic scale by means of scanning tunneling microscopy, low energy electron diffraction and core level photoelectron spectroscopy. The grating of silicon nano-ribbons represents an atomic scale template that can be used in a surface-driven route to control the combination of Si with Mn in the development of novel materials for spintronics devices. The Mn atoms show a preferential adsorption site on silicon atoms, forming one-dimensional nanostructures. They are parallel oriented with respect to the surface Si array, which probably predetermines the diffusion pathways of the Mn atoms during the process of nanostructure formation.
Subthalamic nucleus long-range synchronization—an independent hallmark of human Parkinson's disease
Moshel, Shay; Shamir, Reuben R.; Raz, Aeyal; de Noriega, Fernando R.; Eitan, Renana; Bergman, Hagai; Israel, Zvi
2013-01-01
Beta-band synchronous oscillations in the dorsolateral region of the subthalamic nucleus (STN) of human patients with Parkinson's disease (PD) have been frequently reported. However, the correlation between STN oscillations and synchronization has not been thoroughly explored. The simultaneous recordings of 2390 multi-unit pairs recorded by two parallel microelectrodes (separated by fixed distance of 2 mm, n = 72 trajectories with two electrode tracks >4 mm STN span) in 57 PD patients undergoing STN deep brain stimulation surgery were analyzed. Automatic procedures were utilized to divide the STN into dorsolateral oscillatory and ventromedial non-oscillatory regions, and to quantify the intensity of STN oscillations and synchronicity. Finally, the synchronicity of simultaneously vs. non-simultaneously recorded pairs were compared using a shuffling procedure. Synchronization was observed predominately in the beta range and only between multi-unit pairs in the dorsolateral oscillatory region (n = 615). In paired recordings between sites in the dorsolateral and ventromedial (n = 548) and ventromedial-ventromedial region pairs (n = 1227), no synchronization was observed. Oscillation and synchronicity intensity decline along the STN dorsolateral-ventromedial axis suggesting a fuzzy border between the STN regions. Synchronization strength was significantly correlated to the oscillation power, but synchronization was no longer observed following shuffling. We conclude that STN long-range beta oscillatory synchronization is due to increased neuronal coupling in the Parkinsonian brain and does not merely reflect the outcome of oscillations at similar frequency. The neural synchronization in the dorsolateral (probably the motor domain) STN probably augments the pathological changes in firing rate and patterns of subthalamic neurons in PD patients. PMID:24312018
A Real-Time Imaging System for Stereo Atomic Microscopy at SPring-8's BL25SU
DOE Office of Scientific and Technical Information (OSTI.GOV)
Matsushita, Tomohiro; Guo, Fang Zhun; Muro, Takayuki
2007-01-19
We have developed a real-time photoelectron angular distribution (PEAD) and Auger-electron angular distribution (AEAD) imaging system at SPring-8 BL25SU, Japan. In addition, a real-time imaging system for circular dichroism (CD) studies of PEAD/AEAD has been newly developed. Two PEAD images recorded with left- and right-circularly polarized light can be regarded as a stereo image of the atomic arrangement. A two-dimensional display type mirror analyzer (DIANA) has been installed at the beamline, making it possible to record PEAD/AEAD patterns with an acceptance angle of {+-}60 deg. in real-time. The twin-helical undulators at BL25SU enable helicity switching of the circularly polarized lightmore » at 10Hz, 1Hz or 0.1Hz. In order to realize real-time measurements of the CD of the PEAD/AEAD, the CCD camera must be synchronized to the switching frequency. The VME computer that controls the ID is connected to the measurement computer with two BNC cables, and the helicity information is sent using TTL signals. For maximum flexibility, rather than using a hardware shutter synchronizing with the TTL signal we have developed software to synchronize the CCD shutter with the TTL signal. We have succeeded in synchronizing the CCD camera in both the 1Hz and 0.1Hz modes.« less
Performance bounds on parallel self-initiating discrete-event
NASA Technical Reports Server (NTRS)
Nicol, David M.
1990-01-01
The use is considered of massively parallel architectures to execute discrete-event simulations of what is termed self-initiating models. A logical process in a self-initiating model schedules its own state re-evaluation times, independently of any other logical process, and sends its new state to other logical processes following the re-evaluation. The interest is in the effects of that communication on synchronization. The performance is considered of various synchronization protocols by deriving upper and lower bounds on optimal performance, upper bounds on Time Warp's performance, and lower bounds on the performance of a new conservative protocol. The analysis of Time Warp includes the overhead costs of state-saving and rollback. The analysis points out sufficient conditions for the conservative protocol to outperform Time Warp. The analysis also quantifies the sensitivity of performance to message fan-out, lookahead ability, and the probability distributions underlying the simulation.
Conservative parallel simulation of priority class queueing networks
NASA Technical Reports Server (NTRS)
Nicol, David
1992-01-01
A conservative synchronization protocol is described for the parallel simulation of queueing networks having C job priority classes, where a job's class is fixed. This problem has long vexed designers of conservative synchronization protocols because of its seemingly poor ability to compute lookahead: the time of the next departure. For, a job in service having low priority can be preempted at any time by an arrival having higher priority and an arbitrarily small service time. The solution is to skew the event generation activity so that the events for higher priority jobs are generated farther ahead in simulated time than lower priority jobs. Thus, when a lower priority job enters service for the first time, all the higher priority jobs that may preempt it are already known and the job's departure time can be exactly predicted. Finally, the protocol was analyzed and it was demonstrated that good performance can be expected on the simulation of large queueing networks.
Conservative parallel simulation of priority class queueing networks
NASA Technical Reports Server (NTRS)
Nicol, David M.
1990-01-01
A conservative synchronization protocol is described for the parallel simulation of queueing networks having C job priority classes, where a job's class is fixed. This problem has long vexed designers of conservative synchronization protocols because of its seemingly poor ability to compute lookahead: the time of the next departure. For, a job in service having low priority can be preempted at any time by an arrival having higher priority and an arbitrarily small service time. The solution is to skew the event generation activity so that the events for higher priority jobs are generated farther ahead in simulated time than lower priority jobs. Thus, when a lower priority job enters service for the first time, all the higher priority jobs that may preempt it are already known and the job's departure time can be exactly predicted. Finally, the protocol was analyzed and it was demonstrated that good performance can be expected on the simulation of large queueing networks.
Thought Leaders during Crises in Massive Social Networks
DOE Office of Scientific and Technical Information (OSTI.GOV)
Corley, Courtney D.; Farber, Robert M.; Reynolds, William
The vast amount of social media data that can be gathered from the internet coupled with workflows that utilize both commodity systems and massively parallel supercomputers, such as the Cray XMT, open new vistas for research to support health, defense, and national security. Computer technology now enables the analysis of graph structures containing more than 4 billion vertices joined by 34 billion edges along with metrics and massively parallel algorithms that exhibit near-linear scalability according to number of processors. The challenge lies in making this massive data and analysis comprehensible to an analyst and end-users that require actionable knowledge tomore » carry out their duties. Simply stated, we have developed language and content agnostic techniques to reduce large graphs built from vast media corpora into forms people can understand. Specifically, our tools and metrics act as a survey tool to identify thought leaders' -- those members that lead or reflect the thoughts and opinions of an online community, independent of the source language.« less
Compiled MPI: Cost-Effective Exascale Applications Development
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bronevetsky, G; Quinlan, D; Lumsdaine, A
2012-04-10
The complexity of petascale and exascale machines makes it increasingly difficult to develop applications that can take advantage of them. Future systems are expected to feature billion-way parallelism, complex heterogeneous compute nodes and poor availability of memory (Peter Kogge, 2008). This new challenge for application development is motivating a significant amount of research and development on new programming models and runtime systems designed to simplify large-scale application development. Unfortunately, DoE has significant multi-decadal investment in a large family of mission-critical scientific applications. Scaling these applications to exascale machines will require a significant investment that will dwarf the costs of hardwaremore » procurement. A key reason for the difficulty in transitioning today's applications to exascale hardware is their reliance on explicit programming techniques, such as the Message Passing Interface (MPI) programming model to enable parallelism. MPI provides a portable and high performance message-passing system that enables scalable performance on a wide variety of platforms. However, it also forces developers to lock the details of parallelization together with application logic, making it very difficult to adapt the application to significant changes in the underlying system. Further, MPI's explicit interface makes it difficult to separate the application's synchronization and communication structure, reducing the amount of support that can be provided by compiler and run-time tools. This is in contrast to the recent research on more implicit parallel programming models such as Chapel, OpenMP and OpenCL, which promise to provide significantly more flexibility at the cost of reimplementing significant portions of the application. We are developing CoMPI, a novel compiler-driven approach to enable existing MPI applications to scale to exascale systems with minimal modifications that can be made incrementally over the application's lifetime. It includes: (1) New set of source code annotations, inserted either manually or automatically, that will clarify the application's use of MPI to the compiler infrastructure, enabling greater accuracy where needed; (2) A compiler transformation framework that leverages these annotations to transform the original MPI source code to improve its performance and scalability; (3) Novel MPI runtime implementation techniques that will provide a rich set of functionality extensions to be used by applications that have been transformed by our compiler; and (4) A novel compiler analysis that leverages simple user annotations to automatically extract the application's communication structure and synthesize most complex code annotations.« less
Parallel pulse processing and data acquisition for high speed, low error flow cytometry
Engh, G.J. van den; Stokdijk, W.
1992-09-22
A digitally synchronized parallel pulse processing and data acquisition system for a flow cytometer has multiple parallel input channels with independent pulse digitization and FIFO storage buffer. A trigger circuit controls the pulse digitization on all channels. After an event has been stored in each FIFO, a bus controller moves the oldest entry from each FIFO buffer onto a common data bus. The trigger circuit generates an ID number for each FIFO entry, which is checked by an error detection circuit. The system has high speed and low error rate. 17 figs.
Portable parallel portfolio optimization in the Aurora Financial Management System
NASA Astrophysics Data System (ADS)
Laure, Erwin; Moritsch, Hans
2001-07-01
Financial planning problems are formulated as large scale, stochastic, multiperiod, tree structured optimization problems. An efficient technique for solving this kind of problems is the nested Benders decomposition method. In this paper we present a parallel, portable, asynchronous implementation of this technique. To achieve our portability goals we elected the programming language Java for our implementation and used a high level Java based framework, called OpusJava, for expressing the parallelism potential as well as synchronization constraints. Our implementation is embedded within a modular decision support tool for portfolio and asset liability management, the Aurora Financial Management System.
Parallel Gaussian elimination of a block tridiagonal matrix using multiple microcomputers
NASA Technical Reports Server (NTRS)
Blech, Richard A.
1989-01-01
The solution of a block tridiagonal matrix using parallel processing is demonstrated. The multiprocessor system on which results were obtained and the software environment used to program that system are described. Theoretical partitioning and resource allocation for the Gaussian elimination method used to solve the matrix are discussed. The results obtained from running 1, 2 and 3 processor versions of the block tridiagonal solver are presented. The PASCAL source code for these solvers is given in the appendix, and may be transportable to other shared memory parallel processors provided that the synchronization outlines are reproduced on the target system.
The Geologic Time Spiral - A Path to the Past
Graham, Joseph; Newman, William; Stacy, John
2008-01-01
The Earth is very old - 4.5 billion years or more according to scientific estimates. Most of the evidence for an ancient Earth is contained in the rocks that form the Earth's crust. The rock layers themselves - like pages in a long and complicated history - record the events of the past, and buried within them are the remains of life - the plants and animals that evolved from organic structures that existed 3 billion years ago. Also contained in rocks once molten are radioactive elements whose isotopes provide Earth with an atomic clock. Within these rocks, 'parent' isotopes decay at a predictable rate to form 'daughter' isotopes. By determining the relative amounts of parent and daughter isotopes, the age of these rocks can be calculated. Thus, the scientific evidence from rock layers, from fossils, and from the ages of rocks as measured by atomic clocks attests to a very old Earth. See USGS Fact Sheet 2007-3015 at http://pubs.usgs.gov/fs/2007/3015/ for ages of geologic time periods. Ages in the spiral have been rounded from the age estimates in the Fact Sheet. B.Y., billion years; M.Y., million years. For more information, see the booklet on Geologic Time at http://pubs.usgs.gov/gip/geotime/. The Geologic Time Spiral poster is available for purchase from the USGS Store.
Synchronized Schlieren method for vortex shedding in cascade during acoustic resonance
NASA Astrophysics Data System (ADS)
Nagashima, T.; Tanida, Y.
1986-10-01
An evaluation is made of synchronized schlieren optical system methods for the simultaneous visualization of both the acoustic wave and vortex shedding phenomena encountered during acoustic resonance excited by vortex shedding from the trailing edges of cascade blades. Attention is given to the case of parallel flat plate blades in throughflow velocities of up to 100 m/s. The acoustic wavefront is found to appear in the trailing edge region and travel upstream when a pair of vortices of opposite sign are fully developed at the trailing edge.
Anatomical connectivity influences both intra- and inter-brain synchronizations.
Dumas, Guillaume; Chavez, Mario; Nadel, Jacqueline; Martinerie, Jacques
2012-01-01
Recent development in diffusion spectrum brain imaging combined to functional simulation has the potential to further our understanding of how structure and dynamics are intertwined in the human brain. At the intra-individual scale, neurocomputational models have already started to uncover how the human connectome constrains the coordination of brain activity across distributed brain regions. In parallel, at the inter-individual scale, nascent social neuroscience provides a new dynamical vista of the coupling between two embodied cognitive agents. Using EEG hyperscanning to record simultaneously the brain activities of subjects during their ongoing interaction, we have previously demonstrated that behavioral synchrony correlates with the emergence of inter-brain synchronization. However, the functional meaning of such synchronization remains to be specified. Here, we use a biophysical model to quantify to what extent inter-brain synchronizations are related to the anatomical and functional similarity of the two brains in interaction. Pairs of interacting brains were numerically simulated and compared to real data. Results show a potential dynamical property of the human connectome to facilitate inter-individual synchronizations and thus may partly account for our propensity to generate dynamical couplings with others.
Logical synchronization: how evidence and hypotheses steer atomic clocks
NASA Astrophysics Data System (ADS)
Myers, John M.; Madjid, F. Hadi
2014-05-01
A clock steps a computer through a cycle of phases. For the propagation of logical symbols from one computer to another, each computer must mesh its phases with arrivals of symbols from other computers. Even the best atomic clocks drift unforeseeably in frequency and phase; feedback steers them toward aiming points that depend on a chosen wave function and on hypotheses about signal propagation. A wave function, always under-determined by evidence, requires a guess. Guessed wave functions are coded into computers that steer atomic clocks in frequency and position—clocks that step computers through their phases of computations, as well as clocks, some on space vehicles, that supply evidence of the propagation of signals. Recognizing the dependence of the phasing of symbol arrivals on guesses about signal propagation elevates `logical synchronization.' from its practice in computer engineering to a dicipline essential to physics. Within this discipline we begin to explore questions invisible under any concept of time that fails to acknowledge the unforeseeable. In particular, variation of spacetime curvature is shown to limit the bit rate of logical communication.
2013-08-01
potential for HMX / RDX (3, 9). ...................................................................................8 1 1. Purpose This work...6 dispersion and electrostatic interactions. Constants for the SB potential are given in table 1. 8 Table 1. SB potential for HMX / RDX (3, 9...modeling dislocations in the energetic molecular crystal RDX using the Large-Scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) molecular
Thalamocortical Oscillations in the Sleeping and Aroused Brain
NASA Astrophysics Data System (ADS)
Steriade, Mircea; McCormick, David A.; Sejnowski, Terrence J.
1993-10-01
Sleep is characterized by synchronized events in billions of synaptically coupled neurons in thalamocortical systems. The activation of a series of neuromodulatory transmitter systems during awakening blocks low-frequency oscillations, induces fast rhythms, and allows the brain to recover full responsiveness. Analysis of cortical and thalamic networks at many levels, from molecules to single neurons to large neuronal assemblies, with a variety of techniques, ranging from intracellular recordings in vivo and in vitro to computer simulations, is beginning to yield insights into the mechanisms of the generation, modulation, and function of brain oscillations.
A hybrid algorithm for parallel molecular dynamics simulations
NASA Astrophysics Data System (ADS)
Mangiardi, Chris M.; Meyer, R.
2017-10-01
This article describes algorithms for the hybrid parallelization and SIMD vectorization of molecular dynamics simulations with short-range forces. The parallelization method combines domain decomposition with a thread-based parallelization approach. The goal of the work is to enable efficient simulations of very large (tens of millions of atoms) and inhomogeneous systems on many-core processors with hundreds or thousands of cores and SIMD units with large vector sizes. In order to test the efficiency of the method, simulations of a variety of configurations with up to 74 million atoms have been performed. Results are shown that were obtained on multi-core systems with Sandy Bridge and Haswell processors as well as systems with Xeon Phi many-core processors.
Parallel transformation of K-SVD solar image denoising algorithm
NASA Astrophysics Data System (ADS)
Liang, Youwen; Tian, Yu; Li, Mei
2017-02-01
The images obtained by observing the sun through a large telescope always suffered with noise due to the low SNR. K-SVD denoising algorithm can effectively remove Gauss white noise. Training dictionaries for sparse representations is a time consuming task, due to the large size of the data involved and to the complexity of the training algorithms. In this paper, an OpenMP parallel programming language is proposed to transform the serial algorithm to the parallel version. Data parallelism model is used to transform the algorithm. Not one atom but multiple atoms updated simultaneously is the biggest change. The denoising effect and acceleration performance are tested after completion of the parallel algorithm. Speedup of the program is 13.563 in condition of using 16 cores. This parallel version can fully utilize the multi-core CPU hardware resources, greatly reduce running time and easily to transplant in multi-core platform.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dmitriy Morozov, Tom Peterka
2014-07-29
Computing a Voronoi or Delaunay tessellation from a set of points is a core part of the analysis of many simulated and measured datasets. As the scale of simulations and observations surpasses billions of particles, a distributed-memory scalable parallel algorithm is the only feasible approach. The primary contribution of this software is a distributed-memory parallel Delaunay and Voronoi tessellation algorithm based on existing serial computational geometry libraries that automatically determines which neighbor points need to be exchanged among the subdomains of a spatial decomposition. Other contributions include the addition of periodic and wall boundary conditions.
Research on the phase adjustment method for dispersion interferometer on HL-2A tokamak
NASA Astrophysics Data System (ADS)
Tongyu, WU; Wei, ZHANG; Haoxi, WANG; Yan, ZHOU; Zejie, YIN
2018-06-01
A synchronous demodulation system is proposed and deployed for CO2 dispersion interferometer on HL-2A, which aims at high plasma density measurements and real-time feedback control. In order to make sure that the demodulator and the interferometer signal are synchronous in phase, a phase adjustment (PA) method has been developed for the demodulation system. The method takes advantages of the field programmable gate array parallel and pipeline process capabilities to carry out high performance and low latency PA. Some experimental results presented show that the PA method is crucial to the synchronous demodulation system and reliable to follow the fast change of the electron density. The system can measure the line-integrated density with a high precision of 2.0 × 1018 m‑2.
Eberle, Henry; Nasuto, Slawomir J; Hayashi, Yoshikatsu
2018-03-01
We present a novel way of using a dynamical model for predictive tracking control that can adapt to a wide range of delays without parameter update. This is achieved by incorporating the paradigm of anticipating synchronization (AS), where a 'slave' system predicts a 'master' via delayed self-feedback. By treating the delayed output of the plant as one half of a 'sensory' AS coupling, the plant and an internal dynamical model can be synchronized such that the plant consistently leads the target's motion. We use two simulated robotic systems with differing arrangements of the plant and internal model ('parallel' and 'serial') to demonstrate that this form of control adapts to a wide range of delays without requiring the parameters of the controller to be changed.
Network control processor for a TDMA system
NASA Astrophysics Data System (ADS)
Suryadevara, Omkarmurthy; Debettencourt, Thomas J.; Shulman, R. B.
Two unique aspects of designing a network control processor (NCP) to monitor and control a demand-assigned, time-division multiple-access (TDMA) network are described. The first involves the implementation of redundancy by synchronizing the databases of two geographically remote NCPs. The two sets of databases are kept in synchronization by collecting data on both systems, transferring databases, sending incremental updates, and the parallel updating of databases. A periodic audit compares the checksums of the databases to ensure synchronization. The second aspect involves the use of a tracking algorithm to dynamically reallocate TDMA frame space. This algorithm detects and tracks current and long-term load changes in the network. When some portions of the network are overloaded while others have excess capacity, the algorithm automatically calculates and implements a new burst time plan.
Sensorimotor Synchronization with Different Metrical Levels of Point-Light Dance Movements.
Su, Yi-Huang
2016-01-01
Rhythm perception and synchronization have been extensively investigated in the auditory domain, as they underlie means of human communication such as music and speech. Although recent studies suggest comparable mechanisms for synchronizing with periodically moving visual objects, the extent to which it applies to ecologically relevant information, such as the rhythm of complex biological motion, remains unknown. The present study addressed this issue by linking rhythm of music and dance in the framework of action-perception coupling. As a previous study showed that observers perceived multiple metrical periodicities in dance movements that embodied this structure, the present study examined whether sensorimotor synchronization (SMS) to dance movements resembles what is known of auditory SMS. Participants watched a point-light figure performing two basic steps of Swing dance cyclically, in which the trunk bounced at every beat and the limbs moved at every second beat, forming two metrical periodicities. Participants tapped synchronously to the bounce of the trunk with or without the limbs moving in the stimuli (Experiment 1), or tapped synchronously to the leg movements with or without the trunk bouncing simultaneously (Experiment 2). Results showed that, while synchronization with the bounce (lower-level pulse) was not influenced by the presence or absence of limb movements (metrical accent), synchronization with the legs (beat) was improved by the presence of the bounce (metrical subdivision) across different movement types. The latter finding parallels the "subdivision benefit" often demonstrated in auditory tasks, suggesting common sensorimotor mechanisms for visual rhythms in dance and auditory rhythms in music.
Three-body effects in Casimir-Polder repulsion
NASA Astrophysics Data System (ADS)
Milton, Kimball A.; Abalo, E. K.; Parashar, Prachi; Pourtolami, Nima; Brevik, Iver; Ellingsen, Simen Å.; Buhmann, Stefan Yoshi; Scheel, Stefan
2015-04-01
In this paper we study an archetypical scenario in which repulsive Casimir-Polder forces between an atom or molecule and two macroscopic bodies can be achieved. This is an extension of previous studies of the interaction between a polarizable atom and a wedge, in which repulsion occurs if the atom is sufficiently anisotropic and close enough to the symmetry plane of the wedge. A similar repulsion occurs if such an atom passes a thin cylinder or a wire. An obvious extension is to compute the interaction between such an atom and two facing wedges, which includes as a special case the interaction of an atom with a conducting screen possessing a slit, or between two parallel wires. To this end we further extend the electromagnetic multiple-scattering formalism for three-body interactions. To test this machinery we reinvestigate the interaction of a polarizable atom between two parallel conducting plates. In that case, three-body effects are shown to be small and are dominated by three- and four-scattering terms. The atom-wedge calculation is illustrated by an analogous scalar situation, described in the Appendix. The wedge-wedge-atom geometry is difficult to analyze because this is a scale-free problem. However, it is not so hard to investigate the three-body corrections to the interaction between an anisotropic atom or nanoparticle and a pair of parallel conducting cylinders and show that the three-body effects are very small and do not affect the Casimir-Polder repulsion at large distances between the cylinders. Finally, we consider whether such highly anisotropic atoms needed for repulsion are practically realizable. Since this appears rather difficult to accomplish, it may be more feasible to observe such effects with highly anisotropic nanoparticles.
NASA Astrophysics Data System (ADS)
Lian, Yanping; Lin, Stephen; Yan, Wentao; Liu, Wing Kam; Wagner, Gregory J.
2018-05-01
In this paper, a parallelized 3D cellular automaton computational model is developed to predict grain morphology for solidification of metal during the additive manufacturing process. Solidification phenomena are characterized by highly localized events, such as the nucleation and growth of multiple grains. As a result, parallelization requires careful treatment of load balancing between processors as well as interprocess communication in order to maintain a high parallel efficiency. We give a detailed summary of the formulation of the model, as well as a description of the communication strategies implemented to ensure parallel efficiency. Scaling tests on a representative problem with about half a billion cells demonstrate parallel efficiency of more than 80% on 8 processors and around 50% on 64; loss of efficiency is attributable to load imbalance due to near-surface grain nucleation in this test problem. The model is further demonstrated through an additive manufacturing simulation with resulting grain structures showing reasonable agreement with those observed in experiments.
NASA Astrophysics Data System (ADS)
Lian, Yanping; Lin, Stephen; Yan, Wentao; Liu, Wing Kam; Wagner, Gregory J.
2018-01-01
In this paper, a parallelized 3D cellular automaton computational model is developed to predict grain morphology for solidification of metal during the additive manufacturing process. Solidification phenomena are characterized by highly localized events, such as the nucleation and growth of multiple grains. As a result, parallelization requires careful treatment of load balancing between processors as well as interprocess communication in order to maintain a high parallel efficiency. We give a detailed summary of the formulation of the model, as well as a description of the communication strategies implemented to ensure parallel efficiency. Scaling tests on a representative problem with about half a billion cells demonstrate parallel efficiency of more than 80% on 8 processors and around 50% on 64; loss of efficiency is attributable to load imbalance due to near-surface grain nucleation in this test problem. The model is further demonstrated through an additive manufacturing simulation with resulting grain structures showing reasonable agreement with those observed in experiments.
Experimental nonlinear dynamical studies in cesium magneto-optical trap using time-series analysis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anwar, M., E-mail: mamalik2000@gmail.com; Islam, R.; Faisal, M.
2015-03-30
A magneto-optical trap of neutral atoms is essentially a dissipative quantum system. The fast thermal atoms continuously dissipate their energy to the environment via spontaneous emissions during the cooling. The atoms are, therefore, strongly coupled with the vacuum reservoir and the laser field. The vacuum fluctuations as well as the field fluctuations are imparted to the atoms as random photon recoils. Consequently, the external and internal dynamics of atoms becomes stochastic. In this paper, we have investigated the stochastic dynamics of the atoms in a magneto-optical trap during the loading process. The time series analysis of the fluorescence signal showsmore » that the dynamics of the atoms evolves, like all dissipative systems, from deterministic to the chaotic regime. The subsequent disappearance and revival of chaos was attributed to chaos synchronization between spatially different atoms in the magneto-optical trap.« less
ERIC Educational Resources Information Center
Hiatt, Blanchard; Gwynne, Peter
1984-01-01
To make computing power broadly available and truly friendly, both soft and hard meshing and synchronization problems will have to be solved. Possible solutions and research related to these problems are discussed. Topics considered include compilers, parallelism, networks, distributed sensors, dataflow, CEDAR system (using dataflow principles),…
Kantsyrev, V L; Chuvatin, A S; Rudakov, L I; Velikovich, A L; Shrestha, I K; Esaulov, A A; Safronova, A S; Shlyaptseva, V V; Osborne, G C; Astanovitsky, A L; Weller, M E; Stafford, A; Schultz, K A; Cooper, M C; Cuneo, M E; Jones, B; Vesey, R A
2014-12-01
A compact Z-pinch x-ray hohlraum design with parallel-driven x-ray sources is experimentally demonstrated in a configuration with a central target and tailored shine shields at a 1.7-MA Zebra generator. Driving in parallel two magnetically decoupled compact double-planar-wire Z pinches has demonstrated the generation of synchronized x-ray bursts that correlated well in time with x-ray emission from a central reemission target. Good agreement between simulated and measured hohlraum radiation temperature of the central target is shown. The advantages of compact hohlraum design applications for multi-MA facilities are discussed.
Symmetries and stability of chimera states in small, globally-coupled networks
NASA Astrophysics Data System (ADS)
Hart, Joseph D.; Bansal, Kanika; Murphy, Thomas E.; Roy, Rajarshi
It has recently been demonstrated that symmetries in a network's topology can help predict the patterns of synchronized clusters that can emerge in a network of coupled oscillators. This and related discoveries have led to increased interest in both network symmetries and cluster synchronization. In parallel with these discoveries, interest in chimera states-dynamical patterns in which a network separates into coherent and incoherent portions-has grown, and chimeras have now been observed in a variety of experimental systems. We present an opto-electronic experiment in which both chimera states and synchronized clusters are observed in a small, globally-coupled network. We show that the symmetries and sub-symmetries of the network permit the formation of the chimera and cluster states. A recently developed group theoretical approach enables us to predict the stability of the observed chimera and cluster states, and highlights the close relationship between chimera and cluster states as belonging to the broader phenomenon of partial synchronization.
Parallel algorithms for simulating continuous time Markov chains
NASA Technical Reports Server (NTRS)
Nicol, David M.; Heidelberger, Philip
1992-01-01
We have previously shown that the mathematical technique of uniformization can serve as the basis of synchronization for the parallel simulation of continuous-time Markov chains. This paper reviews the basic method and compares five different methods based on uniformization, evaluating their strengths and weaknesses as a function of problem characteristics. The methods vary in their use of optimism, logical aggregation, communication management, and adaptivity. Performance evaluation is conducted on the Intel Touchstone Delta multiprocessor, using up to 256 processors.
A parallel implementation of a multisensor feature-based range-estimation method
NASA Technical Reports Server (NTRS)
Suorsa, Raymond E.; Sridhar, Banavar
1993-01-01
There are many proposed vision based methods to perform obstacle detection and avoidance for autonomous or semi-autonomous vehicles. All methods, however, will require very high processing rates to achieve real time performance. A system capable of supporting autonomous helicopter navigation will need to extract obstacle information from imagery at rates varying from ten frames per second to thirty or more frames per second depending on the vehicle speed. Such a system will need to sustain billions of operations per second. To reach such high processing rates using current technology, a parallel implementation of the obstacle detection/ranging method is required. This paper describes an efficient and flexible parallel implementation of a multisensor feature-based range-estimation algorithm, targeted for helicopter flight, realized on both a distributed-memory and shared-memory parallel computer.
An Autonomous Satellite Time Synchronization System Using Remotely Disciplined VC-OCXOs.
Gu, Xiaobo; Chang, Qing; Glennon, Eamonn P; Xu, Baoda; Dempseter, Andrew G; Wang, Dun; Wu, Jiapeng
2015-07-23
An autonomous remote clock control system is proposed to provide time synchronization and frequency syntonization for satellite to satellite or ground to satellite time transfer, with the system comprising on-board voltage controlled oven controlled crystal oscillators (VC-OCXOs) that are disciplined to a remote master atomic clock or oscillator. The synchronization loop aims to provide autonomous operation over extended periods, be widely applicable to a variety of scenarios and robust. A new architecture comprising the use of frequency division duplex (FDD), synchronous time division (STDD) duplex and code division multiple access (CDMA) with a centralized topology is employed. This new design utilizes dual one-way ranging methods to precisely measure the clock error, adopts least square (LS) methods to predict the clock error and employs a third-order phase lock loop (PLL) to generate the voltage control signal. A general functional model for this system is proposed and the error sources and delays that affect the time synchronization are discussed. Related algorithms for estimating and correcting these errors are also proposed. The performance of the proposed system is simulated and guidance for selecting the clock is provided.
Static inverter with synchronous output waveform synthesized by time-optimal-response feedback
NASA Technical Reports Server (NTRS)
Kernick, A.; Stechschulte, D. L.; Shireman, D. W.
1976-01-01
Time-optimal-response 'bang-bang' or 'bang-hang' technique, using four feedback control loops, synthesizes static-inverter sinusoidal output waveform by self-oscillatory but yet synchronous pulse-frequency-modulation (SPFM). A single modular power stage per phase of ac output entails the minimum of circuit complexity while providing by feedback synthesis individual phase voltage regulation, phase position control and inherent compensation simultaneously for line and load disturbances. Clipped sinewave performance is described under off-limit load or input voltage conditions. Also, approaches to high power levels, 3-phase arraying and parallel modular connection are given.
NASA Technical Reports Server (NTRS)
Monford, L. G., Jr. (Inventor)
1974-01-01
A digital communication system is reported for parallel operation of 16 or more transceiver units with the use of only four interconnecting wires. A remote synchronization circuit produces unit address control words sequentially in data frames of 16 words. Means are provided in each transceiver unit to decode calling signals and to transmit calling and data signals. The transceivers communicate with each other over one data line. The synchronization unit communicates the address control information to the transceiver units over an address line and further provides the timing information over a clock line. A reference voltage level or ground line completes the interconnecting four wire hookup.
Associative architecture for image processing
NASA Astrophysics Data System (ADS)
Adar, Rutie; Akerib, Avidan
1997-09-01
This article presents a new generation in parallel processing architecture for real-time image processing. The approach is implemented in a real time image processor chip, called the XiumTM-2, based on combining a fully associative array which provides the parallel engine with a serial RISC core on the same die. The architecture is fully programmable and can be programmed to implement a wide range of color image processing, computer vision and media processing functions in real time. The associative part of the chip is based on patented pending methodology of Associative Computing Ltd. (ACL), which condenses 2048 associative processors, each of 128 'intelligent' bits. Each bit can be a processing bit or a memory bit. At only 33 MHz and 0.6 micron manufacturing technology process, the chip has a computational power of 3 billion ALU operations per second and 66 billion string search operations per second. The fully programmable nature of the XiumTM-2 chip enables developers to use ACL tools to write their own proprietary algorithms combined with existing image processing and analysis functions from ACL's extended set of libraries.
Xenon Formal Security Policy Model
2007-08-14
munication primitives such as locks or semaphores , machine instruction results, hypercall results, traps, and interrupts. For an informal example...communicated on the corresponding side of the parallel oper- ator. Events that are in X ∪ Y are synchronized over the two processes. So if we define
Flexible language constructs for large parallel programs
NASA Technical Reports Server (NTRS)
Rosing, Matthew; Schnabel, Robert
1993-01-01
The goal of the research described is to develop flexible language constructs for writing large data parallel numerical programs for distributed memory (MIMD) multiprocessors. Previously, several models have been developed to support synchronization and communication. Models for global synchronization include SIMD (Single Instruction Multiple Data), SPMD (Single Program Multiple Data), and sequential programs annotated with data distribution statements. The two primary models for communication include implicit communication based on shared memory and explicit communication based on messages. None of these models by themselves seem sufficient to permit the natural and efficient expression of the variety of algorithms that occur in large scientific computations. An overview of a new language that combines many of these programming models in a clean manner is given. This is done in a modular fashion such that different models can be combined to support large programs. Within a module, the selection of a model depends on the algorithm and its efficiency requirements. An overview of the language and discussion of some of the critical implementation details is given.
Proceedings of the 14th Annual Precise Time and Time Interval (PTTI) Applications Planning Meeting
NASA Technical Reports Server (NTRS)
Wardrip, S. C. (Editor)
1983-01-01
Developments and applications in the field of frequency and time are addressed. Specific topics include rubidium frequency standards, future timing requirements, noise and atomic standards, hydrogen maser technology, synchronization, and quartz technology.
Advances in Parallelization for Large Scale Oct-Tree Mesh Generation
NASA Technical Reports Server (NTRS)
O'Connell, Matthew; Karman, Steve L.
2015-01-01
Despite great advancements in the parallelization of numerical simulation codes over the last 20 years, it is still common to perform grid generation in serial. Generating large scale grids in serial often requires using special "grid generation" compute machines that can have more than ten times the memory of average machines. While some parallel mesh generation techniques have been proposed, generating very large meshes for LES or aeroacoustic simulations is still a challenging problem. An automated method for the parallel generation of very large scale off-body hierarchical meshes is presented here. This work enables large scale parallel generation of off-body meshes by using a novel combination of parallel grid generation techniques and a hybrid "top down" and "bottom up" oct-tree method. Meshes are generated using hardware commonly found in parallel compute clusters. The capability to generate very large meshes is demonstrated by the generation of off-body meshes surrounding complex aerospace geometries. Results are shown including a one billion cell mesh generated around a Predator Unmanned Aerial Vehicle geometry, which was generated on 64 processors in under 45 minutes.
2012-10-01
using the open-source code Large-scale Atomic/Molecular Massively Parallel Simulator ( LAMMPS ) (http://lammps.sandia.gov) (23). The commercial...parameters are proprietary and cannot be ported to the LAMMPS 4 simulation code. In our molecular dynamics simulations at the atomistic resolution, we...IBI iterative Boltzmann inversion LAMMPS Large-scale Atomic/Molecular Massively Parallel Simulator MAPS Materials Processes and Simulations MS
Implementation of highly parallel and large scale GW calculations within the OpenAtom software
NASA Astrophysics Data System (ADS)
Ismail-Beigi, Sohrab
The need to describe electronic excitations with better accuracy than provided by band structures produced by Density Functional Theory (DFT) has been a long-term enterprise for the computational condensed matter and materials theory communities. In some cases, appropriate theoretical frameworks have existed for some time but have been difficult to apply widely due to computational cost. For example, the GW approximation incorporates a great deal of important non-local and dynamical electronic interaction effects but has been too computationally expensive for routine use in large materials simulations. OpenAtom is an open source massively parallel ab initiodensity functional software package based on plane waves and pseudopotentials (http://charm.cs.uiuc.edu/OpenAtom/) that takes advantage of the Charm + + parallel framework. At present, it is developed via a three-way collaboration, funded by an NSF SI2-SSI grant (ACI-1339804), between Yale (Ismail-Beigi), IBM T. J. Watson (Glenn Martyna) and the University of Illinois at Urbana Champaign (Laxmikant Kale). We will describe the project and our current approach towards implementing large scale GW calculations with OpenAtom. Potential applications of large scale parallel GW software for problems involving electronic excitations in semiconductor and/or metal oxide systems will be also be pointed out.
Ethyl 2-[(carbamothioyl-amino)-imino]-propano-ate.
Corrêa, Charlane C; Graúdo, José Eugênio J C; de Oliveira, Luiz Fernando C; de Almeida, Mauro V; Diniz, Renata
2011-08-01
The title compound, C(6)H(11)N(3)O(2)S, consists of a roughly planar mol-ecule (r.m.s deviation from planarity = 0.077 Å for the non-H atoms) and has the S atom in an anti position to the imine N atom. This N atom is the acceptor of a strongly bent inter-nal N-H⋯N hydrogen bond donated by the amino group. In the crystal, mol-ecules are arranged in undulating layers parallel to (010). The mol-ecules are linked via inter-molecular amino-carboxyl N-H⋯O hydrogen bonds, forming chains parallel to [001]. The chains are cross-linked by N(carbazone)-H⋯S and C-H⋯S inter-actions, forming infinite sheets.
Chou, Yi-Chia; Tang, Wei; Chiou, Chien-Jyun; Chen, Kai; Minor, Andrew M; Tu, K N
2015-06-10
Effects of strain impact a range of applications involving mobility change in field-effect-transistors. We report the effect of strain fluctuation on epitaxial growth of NiSi2 in a Si nanowire via point contact and atomic layer reactions, and we discuss the thermodynamic, kinetic, and mechanical implications. The generation and relaxation of strain shown by in situ TEM is periodic and in synchronization with the atomic layer reaction. The Si lattice at the epitaxial interface is under tensile strain, which enables a high solubility of supersaturated interstitial Ni atoms for homogeneous nucleation of an epitaxial atomic layer of the disilicide phase. The tensile strain is reduced locally during the incubation period of nucleation by the dissolution of supersaturated Ni atoms in the Si lattice but the strained-Si state returns once the atomic layer epitaxial growth of NiSi2 occurs by consuming the supersaturated Ni.
PPM Receiver Implemented in Software
NASA Technical Reports Server (NTRS)
Gray, Andrew; Kang, Edward; Lay, Norman; Vilnrotter, Victor; Srinivasan, Meera; Lee, Clement
2010-01-01
A computer program has been written as a tool for developing optical pulse-position- modulation (PPM) receivers in which photodetector outputs are fed to analog-to-digital converters (ADCs) and all subsequent signal processing is performed digitally. The program can be used, for example, to simulate an all-digital version of the PPM receiver described in Parallel Processing of Broad-Band PPM Signals (NPO-40711), which appears elsewhere in this issue of NASA Tech Briefs. The program can also be translated into a design for digital PPM receiver hardware. The most notable innovation embodied in the software and the underlying PPM-reception concept is a digital processing subsystem that performs synchronization of PPM time slots, even though the digital processing is, itself, asynchronous in the sense that no attempt is made to synchronize it with the incoming optical signal a priori and there is no feedback to analog signal processing subsystems or ADCs. Functions performed by the software receiver include time-slot synchronization, symbol synchronization, coding preprocessing, and diagnostic functions. The program is written in the MATLAB and Simulink software system. The software receiver is highly parameterized and, hence, programmable: for example, slot- and symbol-synchronization filters have programmable bandwidths.
Implementation of a Synchronized Oscillator Circuit for Fast Sensing and Labeling of Image Objects
Kowalski, Jacek; Strzelecki, Michal; Kim, Hyongsuk
2011-01-01
We present an application-specific integrated circuit (ASIC) CMOS chip that implements a synchronized oscillator cellular neural network with a matrix size of 32 × 32 for object sensing and labeling in binary images. Networks of synchronized oscillators are a recently developed tool for image segmentation and analysis. Its parallel network operation is based on a “temporary correlation” theory that attempts to describe scene recognition as if performed by the human brain. The synchronized oscillations of neuron groups attract a person’s attention if he or she is focused on a coherent stimulus (image object). For more than one perceived stimulus, these synchronized patterns switch in time between different neuron groups, thus forming temporal maps that code several features of the analyzed scene. In this paper, a new oscillator circuit based on a mathematical model is proposed, and the network architecture and chip functional blocks are presented and discussed. The proposed chip is implemented in AMIS 0.35 μm C035M-D 5M/1P technology. An application of the proposed network chip for the segmentation of insulin-producing pancreatic islets in magnetic resonance liver images is presented. PMID:22163803
Method of detecting system function by measuring frequency response
Morrison, John L.; Morrison, William H.
2008-07-01
Real time battery impedance spectrum is acquired using one time record, Compensated Synchronous Detection (CSD). This parallel method enables battery diagnostics. The excitation current to a test battery is a sum of equal amplitude sin waves of a few frequencies spread over range of interest. The time profile of this signal has duration that is a few periods of the lowest frequency. The voltage response of the battery, average deleted, is the impedance of the battery in the time domain. Since the excitation frequencies are known, synchronous detection processes the time record and each component, both magnitude and phase, is obtained. For compensation, the components, except the one of interest, are reassembled in the time domain. The resulting signal is subtracted from the original signal and the component of interest is synchronously detected. This process is repeated for each component.
Method of Detecting System Function by Measuring Frequency Response
NASA Technical Reports Server (NTRS)
Morrison, John L. (Inventor); Morrison, William H. (Inventor)
2008-01-01
Real time battery impedance spectrum is acquired using one time record, Compensated Synchronous Detection (CSD). This parallel method enables battery diagnostics. The excitation current to a test battery is a sum of equal amplitude sin waves of a few frequencies spread over range of interest. The time profile of this signal has duration that is a few periods of the lowest frequency. The voltage response of the battery, average deleted, is the impedance of the battery in the time domain. Since the excitation frequencies are known, synchronous detection processes the time record and each component, both magnitude and phase, is obtained. For compensation, the components, except the one of interest, are reassembled in the time domain. The resulting signal is subtracted from the original signal and the component of interest is synchronously detected. This process is repeated for each component.
NASA Astrophysics Data System (ADS)
Lyan, Oleg; Jankunas, Valdas; Guseinoviene, Eleonora; Pašilis, Aleksas; Senulis, Audrius; Knolis, Audrius; Kurt, Erol
2018-02-01
In this study, a permanent magnet synchronous generator (PMSG) topology with compensated reactance windings in parallel rod configuration is proposed to reduce the armature reactance X L and to achieve higher efficiency of PMSG. The PMSG was designed using iron-cored bifilar coil topology to overcome problems of market-dominant rotary type generators. Often the problem is a comparatively high armature reactance X L, which is usually bigger than armature resistance R a. Therefore, the topology is proposed to partially compensate or negligibly reduce the PMSG reactance. The study was performed by using finite element method (FEM) analysis and experimental investigation. FEM analysis was used to investigate magnetic field flux distribution and density in PMSG. The PMSG experimental analyses of no-load losses and electromotive force versus frequency (i.e., speed) was performed. Also terminal voltage, power output and efficiency relation with load current at different frequencies have been evaluated. The reactance of PMSG has low value and a linear relation with operating frequency. The low reactance gives a small variation of efficiency (from 90% to 95%) in a wide range of load (from 3 A to 10 A) and operation frequency (from 44 Hz to 114 Hz). The comparison of PMSG characteristics with parallel and series winding connection showed insignificant power variation. The research results showed that compensated reactance winding in parallel rod configuration in PMSG design provides lower reactance and therefore, higher efficiency under wider load and frequency variation.
Computer-Aided Parallelizer and Optimizer
NASA Technical Reports Server (NTRS)
Jin, Haoqiang
2011-01-01
The Computer-Aided Parallelizer and Optimizer (CAPO) automates the insertion of compiler directives (see figure) to facilitate parallel processing on Shared Memory Parallel (SMP) machines. While CAPO currently is integrated seamlessly into CAPTools (developed at the University of Greenwich, now marketed as ParaWise), CAPO was independently developed at Ames Research Center as one of the components for the Legacy Code Modernization (LCM) project. The current version takes serial FORTRAN programs, performs interprocedural data dependence analysis, and generates OpenMP directives. Due to the widely supported OpenMP standard, the generated OpenMP codes have the potential to run on a wide range of SMP machines. CAPO relies on accurate interprocedural data dependence information currently provided by CAPTools. Compiler directives are generated through identification of parallel loops in the outermost level, construction of parallel regions around parallel loops and optimization of parallel regions, and insertion of directives with automatic identification of private, reduction, induction, and shared variables. Attempts also have been made to identify potential pipeline parallelism (implemented with point-to-point synchronization). Although directives are generated automatically, user interaction with the tool is still important for producing good parallel codes. A comprehensive graphical user interface is included for users to interact with the parallelization process.
Jagged Tiling for Intra-tile Parallelism and Fine-Grain Multithreading
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shrestha, Sunil; Manzano Franco, Joseph B.; Marquez, Andres
In this paper, we have developed a novel methodology that takes into consideration multithreaded many-core designs to better utilize memory/processing resources and improve memory residence on tileable applications. It takes advantage of polyhedral analysis and transformation in the form of PLUTO, combined with a highly optimized finegrain tile runtime to exploit parallelism at all levels. The main contributions of this paper include the introduction of multi-hierarchical tiling techniques that increases intra tile parallelism; and a data-flow inspired runtime library that allows the expression of parallel tiles with an efficient synchronization registry. Our current implementation shows performance improvements on an Intelmore » Xeon Phi board up to 32.25% against instances produced by state-of-the-art compiler frameworks for selected stencil applications.« less
Todorov, Petko; Bloch, Daniel
2017-11-21
For a gas at thermal equilibrium, it is usually assumed that the velocity distribution follows an isotropic 3-dimensional Maxwell-Boltzmann (M-B) law. This assumption classically implies the assumption of a "cos θ" law for the flux of atoms leaving the surface. Actually, such a law has no grounds in surface physics, and experimental tests of this assumption have remained very few. In a variety of recently developed sub-Doppler laser spectroscopy techniques for gases one-dimensionally confined in a thin cell, the specific contribution of atoms moving nearly parallel to the boundary of the vapor container becomes essential. We report here on the implementation of an experiment to probe effectively the distribution of atomic velocities parallel to the windows for a thin (60 μm) Cs vapor cell. The principle of the setup relies on a spatially separated pump-probe experiment, where the variations of the signal amplitude with the pump-probe separation provide the information on the velocity distribution. The experiment is performed in a sapphire cell on the Cs resonance line, which benefits from a long-lived hyperfine optical pumping. Presently, we can analyze specifically the density of atoms with slow normal velocities ∼5-20 m/s, already corresponding to unusual grazing flight-at ∼85°-88.5° from the normal to the surface-and no deviation from the M-B law is found within the limits of our elementary setup. Finally we suggest tracks to explore more parallel velocities, when surface details-roughness or structure-and the atom-surface interaction should play a key role to restrict the applicability of an M-B-type distribution.
NASA Astrophysics Data System (ADS)
Todorov, Petko; Bloch, Daniel
2017-11-01
For a gas at thermal equilibrium, it is usually assumed that the velocity distribution follows an isotropic 3-dimensional Maxwell-Boltzmann (M-B) law. This assumption classically implies the assumption of a "cos θ" law for the flux of atoms leaving the surface. Actually, such a law has no grounds in surface physics, and experimental tests of this assumption have remained very few. In a variety of recently developed sub-Doppler laser spectroscopy techniques for gases one-dimensionally confined in a thin cell, the specific contribution of atoms moving nearly parallel to the boundary of the vapor container becomes essential. We report here on the implementation of an experiment to probe effectively the distribution of atomic velocities parallel to the windows for a thin (60 μm) Cs vapor cell. The principle of the setup relies on a spatially separated pump-probe experiment, where the variations of the signal amplitude with the pump-probe separation provide the information on the velocity distribution. The experiment is performed in a sapphire cell on the Cs resonance line, which benefits from a long-lived hyperfine optical pumping. Presently, we can analyze specifically the density of atoms with slow normal velocities ˜5-20 m/s, already corresponding to unusual grazing flight—at ˜85°-88.5° from the normal to the surface—and no deviation from the M-B law is found within the limits of our elementary setup. Finally we suggest tracks to explore more parallel velocities, when surface details—roughness or structure—and the atom-surface interaction should play a key role to restrict the applicability of an M-B-type distribution.
NASA Astrophysics Data System (ADS)
Zhang, Quan; Li, Chaodong; Zhang, Jiantao; Zhang, Xu
2017-11-01
The macro-micro combined approach, as an effective way to realize trans-scale nano-precision positioning with multi-dimensions and high velocity, plays a significant role in integrated circuit manufacturing field. A 3-degree-of-freedoms (3-DOFs) macro-micro manipulator is designed and analyzed to compromise the conflictions among the large stroke, high precision and multi-DOFs. The macro manipulator is a 3-Prismatic-Revolute-Revolute (3-PRR) structure parallel manipulator which is driven by three linear ultrasonic motors. The dynamic model and the cross-coupling error based synchronized motion controller of the 3-PRR parallel manipulator are theoretical analyzed and experimental tested. To further improve the positioning accuracy, a 3-DOFs monolithic compliant manipulator actuated by three piezoelectric stack actuators is designed. Then a multilayer BP neural network based inverse kinematic model identifier is developed to perform the positioning control. Finally, by forming the macro-micro structure, the dual stage manipulator successfully achieved the positioning task from the point (2 mm, 2 mm, 0 rad) back to the original point (0 mm, 0 mm, 0 rad) with the translation errors in X and Y directions less than ±50 nm and the rotation error around Z axis less than ±1 μrad, respectively.
Virtual Oscillator Controls | Grid Modernization | NREL
Virtual Oscillator Controls Virtual Oscillator Controls NREL is developing virtual oscillator Santa-Barbara, and SunPower. Publications Synthesizing Virtual Oscillators To Control Islanded Inverters Synchronization of Parallel Single-Phase Inverters Using Virtual Oscillator Control, IEEE Transactions on Power
Zhou, Xian; Chen, Xue
2011-05-09
The digital coherent receivers combine coherent detection with digital signal processing (DSP) to compensate for transmission impairments, and therefore are a promising candidate for future high-speed optical transmission system. However, the maximum symbol rate supported by such real-time receivers is limited by the processing rate of hardware. In order to cope with this difficulty, the parallel processing algorithms is imperative. In this paper, we propose a novel parallel digital timing recovery loop (PDTRL) based on our previous work. Furthermore, for increasing the dynamic dispersion tolerance range of receivers, we embed a parallel adaptive equalizer in the PDTRL. This parallel joint scheme (PJS) can be used to complete synchronization, equalization and polarization de-multiplexing simultaneously. Finally, we demonstrate that PDTRL and PJS allow the hardware to process 112 Gbit/s POLMUX-DQPSK signal at the hundreds MHz range. © 2011 Optical Society of America
Sensorimotor Synchronization with Different Metrical Levels of Point-Light Dance Movements
Su, Yi-Huang
2016-01-01
Rhythm perception and synchronization have been extensively investigated in the auditory domain, as they underlie means of human communication such as music and speech. Although recent studies suggest comparable mechanisms for synchronizing with periodically moving visual objects, the extent to which it applies to ecologically relevant information, such as the rhythm of complex biological motion, remains unknown. The present study addressed this issue by linking rhythm of music and dance in the framework of action-perception coupling. As a previous study showed that observers perceived multiple metrical periodicities in dance movements that embodied this structure, the present study examined whether sensorimotor synchronization (SMS) to dance movements resembles what is known of auditory SMS. Participants watched a point-light figure performing two basic steps of Swing dance cyclically, in which the trunk bounced at every beat and the limbs moved at every second beat, forming two metrical periodicities. Participants tapped synchronously to the bounce of the trunk with or without the limbs moving in the stimuli (Experiment 1), or tapped synchronously to the leg movements with or without the trunk bouncing simultaneously (Experiment 2). Results showed that, while synchronization with the bounce (lower-level pulse) was not influenced by the presence or absence of limb movements (metrical accent), synchronization with the legs (beat) was improved by the presence of the bounce (metrical subdivision) across different movement types. The latter finding parallels the “subdivision benefit” often demonstrated in auditory tasks, suggesting common sensorimotor mechanisms for visual rhythms in dance and auditory rhythms in music. PMID:27199709
A Parallel Particle Swarm Optimization Algorithm Accelerated by Asynchronous Evaluations
NASA Technical Reports Server (NTRS)
Venter, Gerhard; Sobieszczanski-Sobieski, Jaroslaw
2005-01-01
A parallel Particle Swarm Optimization (PSO) algorithm is presented. Particle swarm optimization is a fairly recent addition to the family of non-gradient based, probabilistic search algorithms that is based on a simplified social model and is closely tied to swarming theory. Although PSO algorithms present several attractive properties to the designer, they are plagued by high computational cost as measured by elapsed time. One approach to reduce the elapsed time is to make use of coarse-grained parallelization to evaluate the design points. Previous parallel PSO algorithms were mostly implemented in a synchronous manner, where all design points within a design iteration are evaluated before the next iteration is started. This approach leads to poor parallel speedup in cases where a heterogeneous parallel environment is used and/or where the analysis time depends on the design point being analyzed. This paper introduces an asynchronous parallel PSO algorithm that greatly improves the parallel e ciency. The asynchronous algorithm is benchmarked on a cluster assembled of Apple Macintosh G5 desktop computers, using the multi-disciplinary optimization of a typical transport aircraft wing as an example.
Research on Multi - Person Parallel Modeling Method Based on Integrated Model Persistent Storage
NASA Astrophysics Data System (ADS)
Qu, MingCheng; Wu, XiangHu; Tao, YongChao; Liu, Ying
2018-03-01
This paper mainly studies the multi-person parallel modeling method based on the integrated model persistence storage. The integrated model refers to a set of MDDT modeling graphics system, which can carry out multi-angle, multi-level and multi-stage description of aerospace general embedded software. Persistent storage refers to converting the data model in memory into a storage model and converting the storage model into a data model in memory, where the data model refers to the object model and the storage model is a binary stream. And multi-person parallel modeling refers to the need for multi-person collaboration, the role of separation, and even real-time remote synchronization modeling.
SAPNEW: Parallel finite element code for thin shell structures on the Alliant FX/80
NASA Astrophysics Data System (ADS)
Kamat, Manohar P.; Watson, Brian C.
1992-02-01
The results of a research activity aimed at providing a finite element capability for analyzing turbo-machinery bladed-disk assemblies in a vector/parallel processing environment are summarized. Analysis of aircraft turbofan engines is very computationally intensive. The performance limit of modern day computers with a single processing unit was estimated at 3 billions of floating point operations per second (3 gigaflops). In view of this limit of a sequential unit, performance rates higher than 3 gigaflops can be achieved only through vectorization and/or parallelization as on Alliant FX/80. Accordingly, the efforts of this critically needed research were geared towards developing and evaluating parallel finite element methods for static and vibration analysis. A special purpose code, named with the acronym SAPNEW, performs static and eigen analysis of multi-degree-of-freedom blade models built-up from flat thin shell elements.
SAPNEW: Parallel finite element code for thin shell structures on the Alliant FX/80
NASA Technical Reports Server (NTRS)
Kamat, Manohar P.; Watson, Brian C.
1992-01-01
The results of a research activity aimed at providing a finite element capability for analyzing turbo-machinery bladed-disk assemblies in a vector/parallel processing environment are summarized. Analysis of aircraft turbofan engines is very computationally intensive. The performance limit of modern day computers with a single processing unit was estimated at 3 billions of floating point operations per second (3 gigaflops). In view of this limit of a sequential unit, performance rates higher than 3 gigaflops can be achieved only through vectorization and/or parallelization as on Alliant FX/80. Accordingly, the efforts of this critically needed research were geared towards developing and evaluating parallel finite element methods for static and vibration analysis. A special purpose code, named with the acronym SAPNEW, performs static and eigen analysis of multi-degree-of-freedom blade models built-up from flat thin shell elements.
Molecular dynamics simulations of large macromolecular complexes.
Perilla, Juan R; Goh, Boon Chong; Cassidy, C Keith; Liu, Bo; Bernardi, Rafael C; Rudack, Till; Yu, Hang; Wu, Zhe; Schulten, Klaus
2015-04-01
Connecting dynamics to structural data from diverse experimental sources, molecular dynamics simulations permit the exploration of biological phenomena in unparalleled detail. Advances in simulations are moving the atomic resolution descriptions of biological systems into the million-to-billion atom regime, in which numerous cell functions reside. In this opinion, we review the progress, driven by large-scale molecular dynamics simulations, in the study of viruses, ribosomes, bioenergetic systems, and other diverse applications. These examples highlight the utility of molecular dynamics simulations in the critical task of relating atomic detail to the function of supramolecular complexes, a task that cannot be achieved by smaller-scale simulations or existing experimental approaches alone. Copyright © 2015 Elsevier Ltd. All rights reserved.
Quantifying In Situ Metal and Organic Contaminant Mobility in Marine Sediments
2009-01-01
and west of Ford Island, within the Pearl Harbor Naval Base. Sediments are fine grain silts and clays of basaltic origins and contain various... fiber filters for organics), and check valves (Figure 8) connected to synchronized parallel rotary valves connected to the collection chamber. Samples
System software for the finite element machine
NASA Technical Reports Server (NTRS)
Crockett, T. W.; Knott, J. D.
1985-01-01
The Finite Element Machine is an experimental parallel computer developed at Langley Research Center to investigate the application of concurrent processing to structural engineering analysis. This report describes system-level software which has been developed to facilitate use of the machine by applications researchers. The overall software design is outlined, and several important parallel processing issues are discussed in detail, including processor management, communication, synchronization, and input/output. Based on experience using the system, the hardware architecture and software design are critiqued, and areas for further work are suggested.
Ethyl 2-[(carbamothioylamino)imino]propanoate
Corrêa, Charlane C.; Graúdo, José Eugênio J.C.; de Oliveira, Luiz Fernando C.; de Almeida, Mauro V.; Diniz, Renata
2011-01-01
The title compound, C6H11N3O2S, consists of a roughly planar molecule (r.m.s deviation from planarity = 0.077 Å for the non-H atoms) and has the S atom in an anti position to the imine N atom. This N atom is the acceptor of a strongly bent internal N—H⋯N hydrogen bond donated by the amino group. In the crystal, molecules are arranged in undulating layers parallel to (010). The molecules are linked via intermolecular amino–carboxyl N—H⋯O hydrogen bonds, forming chains parallel to [001]. The chains are cross-linked by Ncarbazone—H⋯S and C—H⋯S interactions, forming infinite sheets. PMID:22091006
Jung, Jaewoon; Mori, Takaharu; Kobayashi, Chigusa; Matsunaga, Yasuhiro; Yoda, Takao; Feig, Michael; Sugita, Yuji
2015-07-01
GENESIS (Generalized-Ensemble Simulation System) is a new software package for molecular dynamics (MD) simulations of macromolecules. It has two MD simulators, called ATDYN and SPDYN. ATDYN is parallelized based on an atomic decomposition algorithm for the simulations of all-atom force-field models as well as coarse-grained Go-like models. SPDYN is highly parallelized based on a domain decomposition scheme, allowing large-scale MD simulations on supercomputers. Hybrid schemes combining OpenMP and MPI are used in both simulators to target modern multicore computer architectures. Key advantages of GENESIS are (1) the highly parallel performance of SPDYN for very large biological systems consisting of more than one million atoms and (2) the availability of various REMD algorithms (T-REMD, REUS, multi-dimensional REMD for both all-atom and Go-like models under the NVT, NPT, NPAT, and NPγT ensembles). The former is achieved by a combination of the midpoint cell method and the efficient three-dimensional Fast Fourier Transform algorithm, where the domain decomposition space is shared in real-space and reciprocal-space calculations. Other features in SPDYN, such as avoiding concurrent memory access, reducing communication times, and usage of parallel input/output files, also contribute to the performance. We show the REMD simulation results of a mixed (POPC/DMPC) lipid bilayer as a real application using GENESIS. GENESIS is released as free software under the GPLv2 licence and can be easily modified for the development of new algorithms and molecular models. WIREs Comput Mol Sci 2015, 5:310-323. doi: 10.1002/wcms.1220.
A Fault Oblivious Extreme-Scale Execution Environment
DOE Office of Scientific and Technical Information (OSTI.GOV)
McKie, Jim
The FOX project, funded under the ASCR X-stack I program, developed systems software and runtime libraries for a new approach to the data and work distribution for massively parallel, fault oblivious application execution. Our work was motivated by the premise that exascale computing systems will provide a thousand-fold increase in parallelism and a proportional increase in failure rate relative to today’s machines. To deliver the capability of exascale hardware, the systems software must provide the infrastructure to support existing applications while simultaneously enabling efficient execution of new programming models that naturally express dynamic, adaptive, irregular computation; coupled simulations; and massivemore » data analysis in a highly unreliable hardware environment with billions of threads of execution. Our OS research has prototyped new methods to provide efficient resource sharing, synchronization, and protection in a many-core compute node. We have experimented with alternative task/dataflow programming models and shown scalability in some cases to hundreds of thousands of cores. Much of our software is in active development through open source projects. Concepts from FOX are being pursued in next generation exascale operating systems. Our OS work focused on adaptive, application tailored OS services optimized for multi → many core processors. We developed a new operating system NIX that supports role-based allocation of cores to processes which was released to open source. We contributed to the IBM FusedOS project, which promoted the concept of latency-optimized and throughput-optimized cores. We built a task queue library based on distributed, fault tolerant key-value store and identified scaling issues. A second fault tolerant task parallel library was developed, based on the Linda tuple space model, that used low level interconnect primitives for optimized communication. We designed fault tolerance mechanisms for task parallel computations employing work stealing for load balancing that scaled to the largest existing supercomputers. Finally, we implemented the Elastic Building Blocks runtime, a library to manage object-oriented distributed software components. To support the research, we won two INCITE awards for time on Intrepid (BG/P) and Mira (BG/Q). Much of our work has had impact in the OS and runtime community through the ASCR Exascale OS/R workshop and report, leading to the research agenda of the Exascale OS/R program. Our project was, however, also affected by attrition of multiple PIs. While the PIs continued to participate and offer guidance as time permitted, losing these key individuals was unfortunate both for the project and for the DOE HPC community.« less
Frog: Asynchronous Graph Processing on GPU with Hybrid Coloring Model
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shi, Xuanhua; Luo, Xuan; Liang, Junling
GPUs have been increasingly used to accelerate graph processing for complicated computational problems regarding graph theory. Many parallel graph algorithms adopt the asynchronous computing model to accelerate the iterative convergence. Unfortunately, the consistent asynchronous computing requires locking or atomic operations, leading to significant penalties/overheads when implemented on GPUs. As such, coloring algorithm is adopted to separate the vertices with potential updating conflicts, guaranteeing the consistency/correctness of the parallel processing. Common coloring algorithms, however, may suffer from low parallelism because of a large number of colors generally required for processing a large-scale graph with billions of vertices. We propose a light-weightmore » asynchronous processing framework called Frog with a preprocessing/hybrid coloring model. The fundamental idea is based on Pareto principle (or 80-20 rule) about coloring algorithms as we observed through masses of realworld graph coloring cases. We find that a majority of vertices (about 80%) are colored with only a few colors, such that they can be read and updated in a very high degree of parallelism without violating the sequential consistency. Accordingly, our solution separates the processing of the vertices based on the distribution of colors. In this work, we mainly answer three questions: (1) how to partition the vertices in a sparse graph with maximized parallelism, (2) how to process large-scale graphs that cannot fit into GPU memory, and (3) how to reduce the overhead of data transfers on PCIe while processing each partition. We conduct experiments on real-world data (Amazon, DBLP, YouTube, RoadNet-CA, WikiTalk and Twitter) to evaluate our approach and make comparisons with well-known non-preprocessed (such as Totem, Medusa, MapGraph and Gunrock) and preprocessed (Cusha) approaches, by testing four classical algorithms (BFS, PageRank, SSSP and CC). On all the tested applications and datasets, Frog is able to significantly outperform existing GPU-based graph processing systems except Gunrock and MapGraph. MapGraph gets better performance than Frog when running BFS on RoadNet-CA. The comparison between Gunrock and Frog is inconclusive. Frog can outperform Gunrock more than 1.04X when running PageRank and SSSP, while the advantage of Frog is not obvious when running BFS and CC on some datasets especially for RoadNet-CA.« less
NASA Astrophysics Data System (ADS)
Takenaka, N.; Kadowaki, T.; Kawabata, Y.; Lim, I. C.; Sim, C. M.
2005-04-01
Visualization of cavitation phenomena in a Diesel engine fuel injection nozzle was carried out by using neutron radiography system at KUR in Research Reactor Institute in Kyoto University and at HANARO in Korea Atomic Energy Research Institute. A neutron chopper was synchronized to the engine rotation for high shutter speed exposures. A multi-exposure method was applied to obtain a clear image as an ensemble average of the synchronized images. Some images were successfully obtained and suggested new understanding of the cavitation phenomena in a Diesel engine fuel injection nozzle.
Synchronization algorithm for three-phase voltages of an inverter and a grid
NASA Astrophysics Data System (ADS)
Nos, O. V.
2017-07-01
This paper presents the results of designing a joint phase-locked loop for adjusting the phase shifts (speed) and Euclidean norm of three-phase voltages of an inverter to the same grid parameters. The design can be used, in particular, to match the potentials of two parallel-connected power sources for the fundamental harmonic at the moments of switching the stator windings of an induction AC motor from a converter to a centralized power-supply system and back. Technical implementation of the developed synchronization algorithm will significantly reduce the inductance of the current-balancing reactor and exclude emergency operation modes in the electric motor power circuit.
An Autonomous Satellite Time Synchronization System Using Remotely Disciplined VC-OCXOs
Gu, Xiaobo; Chang, Qing; Glennon, Eamonn P.; Xu, Baoda; Dempseter, Andrew G.; Wang, Dun; Wu, Jiapeng
2015-01-01
An autonomous remote clock control system is proposed to provide time synchronization and frequency syntonization for satellite to satellite or ground to satellite time transfer, with the system comprising on-board voltage controlled oven controlled crystal oscillators (VC-OCXOs) that are disciplined to a remote master atomic clock or oscillator. The synchronization loop aims to provide autonomous operation over extended periods, be widely applicable to a variety of scenarios and robust. A new architecture comprising the use of frequency division duplex (FDD), synchronous time division (STDD) duplex and code division multiple access (CDMA) with a centralized topology is employed. This new design utilizes dual one-way ranging methods to precisely measure the clock error, adopts least square (LS) methods to predict the clock error and employs a third-order phase lock loop (PLL) to generate the voltage control signal. A general functional model for this system is proposed and the error sources and delays that affect the time synchronization are discussed. Related algorithms for estimating and correcting these errors are also proposed. The performance of the proposed system is simulated and guidance for selecting the clock is provided. PMID:26213929
[Rapid identification of hogwash oil by using synchronous fluorescence spectroscopy].
Sun, Yan-Hui; An, Hai-Yang; Jia, Xiao-Li; Wang, Juan
2012-10-01
To identify hogwash oil quickly, the characteristic delta lambda of hogwash oil was analyzed by three dimensional fluorescence spectroscopy with parallel factor analysis, and the model was built up by using synchronous fluorescence spectroscopy with support vector machines (SVM). The results showed that the characteristic delta lambda of hogwash oil was 60 nm. Collecting original spectrum of different samples under the condition of characteristic delta lambda 60 nm, the best model was established while 5 principal components were selected from original spectrum and the radial basis function (RBF) was used as the kernel function, and the optimal penalty factor C and kernel function g were 512 and 0.5 respectively obtained by the grid searching and 6-fold cross validation. The discrimination rate of the model was 100% for both training sets and prediction sets. Thus, it is quick and accurate to apply synchronous fluorescence spectroscopy to identification of hogwash oil.
NASA Astrophysics Data System (ADS)
Luo, Bingyang; Chi, Shangjie; Fang, Man; Li, Mengchao
2017-03-01
Permanent magnet synchronous motor is used widely in industry, the performance requirements wouldn't be met by adopting traditional PID control in some of the occasions with high requirements. In this paper, a hybrid control strategy - nonlinear neural network PID and traditional PID parallel control are adopted. The high stability and reliability of traditional PID was combined with the strong adaptive ability and robustness of neural network. The permanent magnet synchronous motor will get better control performance when switch different working modes according to different controlled object conditions. As the results showed, the speed response adopting the composite control strategy in this paper was faster than the single control strategy. And in the case of sudden disturbance, the recovery time adopting the composite control strategy designed in this paper was shorter, the recovery ability and the robustness were stronger.
Multithreaded Model for Dynamic Load Balancing Parallel Adaptive PDE Computations
NASA Technical Reports Server (NTRS)
Chrisochoides, Nikos
1995-01-01
We present a multithreaded model for the dynamic load-balancing of numerical, adaptive computations required for the solution of Partial Differential Equations (PDE's) on multiprocessors. Multithreading is used as a means of exploring concurrency in the processor level in order to tolerate synchronization costs inherent to traditional (non-threaded) parallel adaptive PDE solvers. Our preliminary analysis for parallel, adaptive PDE solvers indicates that multithreading can be used an a mechanism to mask overheads required for the dynamic balancing of processor workloads with computations required for the actual numerical solution of the PDE's. Also, multithreading can simplify the implementation of dynamic load-balancing algorithms, a task that is very difficult for traditional data parallel adaptive PDE computations. Unfortunately, multithreading does not always simplify program complexity, often makes code re-usability not an easy task, and increases software complexity.
An efficient parallel algorithm for the calculation of canonical MP2 energies.
Baker, Jon; Pulay, Peter
2002-09-01
We present the parallel version of a previous serial algorithm for the efficient calculation of canonical MP2 energies (Pulay, P.; Saebo, S.; Wolinski, K. Chem Phys Lett 2001, 344, 543). It is based on the Saebo-Almlöf direct-integral transformation, coupled with an efficient prescreening of the AO integrals. The parallel algorithm avoids synchronization delays by spawning a second set of slaves during the bin-sort prior to the second half-transformation. Results are presented for systems with up to 2000 basis functions. MP2 energies for molecules with 400-500 basis functions can be routinely calculated to microhartree accuracy on a small number of processors (6-8) in a matter of minutes with modern PC-based parallel computers. Copyright 2002 Wiley Periodicals, Inc. J Comput Chem 23: 1150-1156, 2002
How does life emerge out of chaos?
NASA Astrophysics Data System (ADS)
Grebosz, Jerzy
2013-10-01
How it is possible that unanimated matter can spontaneously create an exquisite beauty? How the simple dust or group of billion atoms can spontaneously create living human beings? These questions for hundred years were a domain of different religions or philosophy. Now, for the first time in a history - the science starts to answer such questions.
Time-synchronized continuous wave laser-induced fluorescence on an oscillatory xenon discharge.
MacDonald, N A; Cappelli, M A; Hargus, W A
2012-11-01
A novel approach to time-synchronizing laser-induced fluorescence measurements to an oscillating current in a 60 Hz xenon discharge lamp using a continuous wave laser is presented. A sample-hold circuit is implemented to separate out signals at different phases along a current cycle, and is followed by a lock-in amplifier to pull out the resulting time-synchronized fluorescence trace from the large background signal. The time evolution of lower state population is derived from the changes in intensity of the fluorescence excitation line shape resulting from laser-induced fluorescence measurements of the 6s(')[1/2](1)(0)-6p(')[3/2](2) xenon atomic transition at λ = 834.68 nm. Results show that the lower state population oscillates at twice the frequency of the discharge current, 120 Hz.
Parallel adaptive wavelet collocation method for PDEs
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nejadmalayeri, Alireza, E-mail: Alireza.Nejadmalayeri@gmail.com; Vezolainen, Alexei, E-mail: Alexei.Vezolainen@Colorado.edu; Brown-Dymkoski, Eric, E-mail: Eric.Browndymkoski@Colorado.edu
2015-10-01
A parallel adaptive wavelet collocation method for solving a large class of Partial Differential Equations is presented. The parallelization is achieved by developing an asynchronous parallel wavelet transform, which allows one to perform parallel wavelet transform and derivative calculations with only one data synchronization at the highest level of resolution. The data are stored using tree-like structure with tree roots starting at a priori defined level of resolution. Both static and dynamic domain partitioning approaches are developed. For the dynamic domain partitioning, trees are considered to be the minimum quanta of data to be migrated between the processes. This allowsmore » fully automated and efficient handling of non-simply connected partitioning of a computational domain. Dynamic load balancing is achieved via domain repartitioning during the grid adaptation step and reassigning trees to the appropriate processes to ensure approximately the same number of grid points on each process. The parallel efficiency of the approach is discussed based on parallel adaptive wavelet-based Coherent Vortex Simulations of homogeneous turbulence with linear forcing at effective non-adaptive resolutions up to 2048{sup 3} using as many as 2048 CPU cores.« less
Scalable Parallel Density-based Clustering and Applications
NASA Astrophysics Data System (ADS)
Patwary, Mostofa Ali
2014-04-01
Recently, density-based clustering algorithms (DBSCAN and OPTICS) have gotten significant attention of the scientific community due to their unique capability of discovering arbitrary shaped clusters and eliminating noise data. These algorithms have several applications, which require high performance computing, including finding halos and subhalos (clusters) from massive cosmology data in astrophysics, analyzing satellite images, X-ray crystallography, and anomaly detection. However, parallelization of these algorithms are extremely challenging as they exhibit inherent sequential data access order, unbalanced workload resulting in low parallel efficiency. To break the data access sequentiality and to achieve high parallelism, we develop new parallel algorithms, both for DBSCAN and OPTICS, designed using graph algorithmic techniques. For example, our parallel DBSCAN algorithm exploits the similarities between DBSCAN and computing connected components. Using datasets containing up to a billion floating point numbers, we show that our parallel density-based clustering algorithms significantly outperform the existing algorithms, achieving speedups up to 27.5 on 40 cores on shared memory architecture and speedups up to 5,765 using 8,192 cores on distributed memory architecture. In our experiments, we found that while achieving the scalability, our algorithms produce clustering results with comparable quality to the classical algorithms.
An improved limit on the charge of antihydrogen from stochastic acceleration.
Ahmadi, M; Baquero-Ruiz, M; Bertsche, W; Butler, E; Capra, A; Carruth, C; Cesar, C L; Charlton, M; Charman, A E; Eriksson, S; Evans, L T; Evetts, N; Fajans, J; Friesen, T; Fujiwara, M C; Gill, D R; Gutierrez, A; Hangst, J S; Hardy, W N; Hayden, M E; Isaac, C A; Ishida, A; Jones, S A; Jonsell, S; Kurchaninov, L; Madsen, N; Maxwell, D; McKenna, J T K; Menary, S; Michan, J M; Momose, T; Munich, J J; Nolan, P; Olchanski, K; Olin, A; Povilus, A; Pusa, P; Rasmussen, C Ø; Robicheaux, F; Sacramento, R L; Sameed, M; Sarid, E; Silveira, D M; So, C; Tharp, T D; Thompson, R I; van der Werf, D P; Wurtele, J S; Zhmoginov, A I
2016-01-21
Antimatter continues to intrigue physicists because of its apparent absence in the observable Universe. Current theory requires that matter and antimatter appeared in equal quantities after the Big Bang, but the Standard Model of particle physics offers no quantitative explanation for the apparent disappearance of half the Universe. It has recently become possible to study trapped atoms of antihydrogen to search for possible, as yet unobserved, differences in the physical behaviour of matter and antimatter. Here we consider the charge neutrality of the antihydrogen atom. By applying stochastic acceleration to trapped antihydrogen atoms, we determine an experimental bound on the antihydrogen charge, Qe, of |Q| < 0.71 parts per billion (one standard deviation), in which e is the elementary charge. This bound is a factor of 20 less than that determined from the best previous measurement of the antihydrogen charge. The electrical charge of atoms and molecules of normal matter is known to be no greater than about 10(-21)e for a diverse range of species including H2, He and SF6. Charge-parity-time symmetry and quantum anomaly cancellation demand that the charge of antihydrogen be similarly small. Thus, our measurement constitutes an improved limit and a test of fundamental aspects of the Standard Model. If we assume charge superposition and use the best measured value of the antiproton charge, then we can place a new limit on the positron charge anomaly (the relative difference between the positron and elementary charge) of about one part per billion (one standard deviation), a 25-fold reduction compared to the current best measurement.
Reaching extended length-scales with temperature-accelerated dynamics
NASA Astrophysics Data System (ADS)
Amar, Jacques G.; Shim, Yunsic
2013-03-01
In temperature-accelerated dynamics (TAD) a high-temperature molecular dynamics (MD) simulation is used to accelerate the search for the next low-temperature activated event. While TAD has been quite successful in extending the time-scales of simulations of non-equilibrium processes, due to the fact that the computational work scales approximately as the cube of the number of atoms, until recently only simulations of relatively small systems have been carried out. Recently, we have shown that by combining spatial decomposition with our synchronous sublattice algorithm, significantly improved scaling is possible. However, in this approach the size of activated events is limited by the processor size while the dynamics is not exact. Here we discuss progress in developing an alternate approach in which high-temperature parallel MD along with localized saddle-point (LSAD) calculations, are used to carry out TAD simulations without restricting the size of activated events while keeping the dynamics ``exact'' within the context of harmonic transition-state theory. In tests of our LSAD method applied to Ag/Ag(100) annealing and Cu/Cu(100) growth simulations we find significantly improved scaling of TAD, while maintaining a negligibly small error in the energy barriers. Supported by NSF DMR-0907399.
Bioinspired architecture approach for a one-billion transistor smart CMOS camera chip
NASA Astrophysics Data System (ADS)
Fey, Dietmar; Komann, Marcus
2007-05-01
In the paper we present a massively parallel VLSI architecture for future smart CMOS camera chips with up to one billion transistors. To exploit efficiently the potential offered by future micro- or nanoelectronic devices traditional on central structures oriented parallel architectures based on MIMD or SIMD approaches will fail. They require too long and too many global interconnects for the distribution of code or the access to common memory. On the other hand nature developed self-organising and emergent principles to manage successfully complex structures based on lots of interacting simple elements. Therefore we developed a new as Marching Pixels denoted emergent computing paradigm based on a mixture of bio-inspired computing models like cellular automaton and artificial ants. In the paper we present different Marching Pixels algorithms and the corresponding VLSI array architecture. A detailed synthesis result for a 0.18 μm CMOS process shows that a 256×256 pixel image is processed in less than 10 ms assuming a moderate 100 MHz clock rate for the processor array. Future higher integration densities and a 3D chip stacking technology will allow the integration and processing of Mega pixels within the same time since our architecture is fully scalable.
Parallel discrete event simulation: A shared memory approach
NASA Technical Reports Server (NTRS)
Reed, Daniel A.; Malony, Allen D.; Mccredie, Bradley D.
1987-01-01
With traditional event list techniques, evaluating a detailed discrete event simulation model can often require hours or even days of computation time. Parallel simulation mimics the interacting servers and queues of a real system by assigning each simulated entity to a processor. By eliminating the event list and maintaining only sufficient synchronization to insure causality, parallel simulation can potentially provide speedups that are linear in the number of processors. A set of shared memory experiments is presented using the Chandy-Misra distributed simulation algorithm to simulate networks of queues. Parameters include queueing network topology and routing probabilities, number of processors, and assignment of network nodes to processors. These experiments show that Chandy-Misra distributed simulation is a questionable alternative to sequential simulation of most queueing network models.
Parallel Splash Belief Propagation
2010-08-01
s/ ROGER J. DZIEGIEL, Jr. MICHAEL J. WESSING, Deputy Chief Work Unit Manager For... Management and Budget, Paperwork Reduction Project (0704-0188) Washington, DC 20503. PLEASE DO NOT RETURN YOUR FORM TO THE ABOVE ADDRESS. 1. REPORT DATE...Propagation algorithm outperforms synchronous, round-robin, wild-fire ( Ranganathan et al., 2007), and residual (Elidan et al., 2006) belief propagation
NASA Astrophysics Data System (ADS)
Kumari, Komal; Donzis, Diego
2017-11-01
Highly resolved computational simulations on massively parallel machines are critical in understanding the physics of a vast number of complex phenomena in nature governed by partial differential equations. Simulations at extreme levels of parallelism present many challenges with communication between processing elements (PEs) being a major bottleneck. In order to fully exploit the computational power of exascale machines one needs to devise numerical schemes that relax global synchronizations across PEs. This asynchronous computations, however, have a degrading effect on the accuracy of standard numerical schemes.We have developed asynchrony-tolerant (AT) schemes that maintain order of accuracy despite relaxed communications. We show, analytically and numerically, that these schemes retain their numerical properties with multi-step higher order temporal Runge-Kutta schemes. We also show that for a range of optimized parameters,the computation time and error for AT schemes is less than their synchronous counterpart. Stability of the AT schemes which depends upon history and random nature of delays, are also discussed. Support from NSF is gratefully acknowledged.
Flexible Language Constructs for Large Parallel Programs
Rosing, Matt; Schnabel, Robert
1994-01-01
The goal of the research described in this article is to develop flexible language constructs for writing large data parallel numerical programs for distributed memory (multiple instruction multiple data [MIMD]) multiprocessors. Previously, several models have been developed to support synchronization and communication. Models for global synchronization include single instruction multiple data (SIMD), single program multiple data (SPMD), and sequential programs annotated with data distribution statements. The two primary models for communication include implicit communication based on shared memory and explicit communication based on messages. None of these models by themselves seem sufficient to permit the natural and efficient expression ofmore » the variety of algorithms that occur in large scientific computations. In this article, we give an overview of a new language that combines many of these programming models in a clean manner. This is done in a modular fashion such that different models can be combined to support large programs. Within a module, the selection of a model depends on the algorithm and its efficiency requirements. In this article, we give an overview of the language and discuss some of the critical implementation details.« less
Jung, Jaewoon; Mori, Takaharu; Kobayashi, Chigusa; Matsunaga, Yasuhiro; Yoda, Takao; Feig, Michael; Sugita, Yuji
2015-01-01
GENESIS (Generalized-Ensemble Simulation System) is a new software package for molecular dynamics (MD) simulations of macromolecules. It has two MD simulators, called ATDYN and SPDYN. ATDYN is parallelized based on an atomic decomposition algorithm for the simulations of all-atom force-field models as well as coarse-grained Go-like models. SPDYN is highly parallelized based on a domain decomposition scheme, allowing large-scale MD simulations on supercomputers. Hybrid schemes combining OpenMP and MPI are used in both simulators to target modern multicore computer architectures. Key advantages of GENESIS are (1) the highly parallel performance of SPDYN for very large biological systems consisting of more than one million atoms and (2) the availability of various REMD algorithms (T-REMD, REUS, multi-dimensional REMD for both all-atom and Go-like models under the NVT, NPT, NPAT, and NPγT ensembles). The former is achieved by a combination of the midpoint cell method and the efficient three-dimensional Fast Fourier Transform algorithm, where the domain decomposition space is shared in real-space and reciprocal-space calculations. Other features in SPDYN, such as avoiding concurrent memory access, reducing communication times, and usage of parallel input/output files, also contribute to the performance. We show the REMD simulation results of a mixed (POPC/DMPC) lipid bilayer as a real application using GENESIS. GENESIS is released as free software under the GPLv2 licence and can be easily modified for the development of new algorithms and molecular models. WIREs Comput Mol Sci 2015, 5:310–323. doi: 10.1002/wcms.1220 PMID:26753008
Computational experience with a parallel algorithm for tetrangle inequality bound smoothing.
Rajan, K; Deo, N
1999-09-01
Determining molecular structure from interatomic distances is an important and challenging problem. Given a molecule with n atoms, lower and upper bounds on interatomic distances can usually be obtained only for a small subset of the 2(n(n-1)) atom pairs, using NMR. Given the bounds so obtained on the distances between some of the atom pairs, it is often useful to compute tighter bounds on all the 2(n(n-1)) pairwise distances. This process is referred to as bound smoothing. The initial lower and upper bounds for the pairwise distances not measured are usually assumed to be 0 and infinity. One method for bound smoothing is to use the limits imposed by the triangle inequality. The distance bounds so obtained can often be tightened further by applying the tetrangle inequality--the limits imposed on the six pairwise distances among a set of four atoms (instead of three for the triangle inequalities). The tetrangle inequality is expressed by the Cayley-Menger determinants. For every quadruple of atoms, each pass of the tetrangle inequality bound smoothing procedure finds upper and lower limits on each of the six distances in the quadruple. Applying the tetrangle inequalities to each of the (4n) quadruples requires O(n4) time. Here, we propose a parallel algorithm for bound smoothing employing the tetrangle inequality. Each pass of our algorithm requires O(n3 log n) time on a REW PRAM (Concurrent Read Exclusive Write Parallel Random Access Machine) with O(log(n)n) processors. An implementation of this parallel algorithm on the Intel Paragon XP/S and its performance are also discussed.
Droplet-based microfluidic washing module for magnetic particle-based assays
Lee, Hun; Xu, Linfeng; Oh, Kwang W.
2014-01-01
In this paper, we propose a continuous flow droplet-based microfluidic platform for magnetic particle-based assays by employing in-droplet washing. The droplet-based washing was implemented by traversing functionalized magnetic particles across a laterally merged droplet from one side (containing sample and reagent) to the other (containing buffer) by an external magnetic field. Consequently, the magnetic particles were extracted to a parallel-synchronized train of washing buffer droplets, and unbound reagents were left in an original train of sample droplets. To realize the droplet-based washing function, the following four procedures were sequentially carried in a droplet-based microfluidic device: parallel synchronization of two trains of droplets by using a ladder-like channel network; lateral electrocoalescence by an electric field; magnetic particle manipulation by a magnetic field; and asymmetrical splitting of merged droplets. For the stable droplet synchronization and electrocoalescence, we optimized droplet generation conditions by varying the flow rate ratio (or droplet size). Image analysis was carried out to determine the fluorescent intensity of reagents before and after the washing step. As a result, the unbound reagents in sample droplets were significantly removed by more than a factor of 25 in the single washing step, while the magnetic particles were successfully extracted into washing buffer droplets. As a proof-of-principle, we demonstrate a magnetic particle-based immunoassay with streptavidin-coated magnetic particles and fluorescently labelled biotin in the proposed continuous flow droplet-based microfluidic platform. PMID:25379098
Heralded entangling quantum gate via cavity-assisted photon scattering
NASA Astrophysics Data System (ADS)
Borges, Halyne S.; Rossatto, Daniel Z.; Luiz, Fabrício S.; Villas-Boas, Celso J.
2018-01-01
We theoretically investigate the generation of heralded entanglement between two identical atoms via cavity-assisted photon scattering in two different configurations, namely, either both atoms confined in the same cavity or trapped into locally separated ones. Our protocols are given by a very simple and elegant single-step process, the key mechanism of which is a controlled-phase-flip gate implemented by impinging a single photon on single-sided cavities. In particular, when the atoms are localized in remote cavities, we introduce a single-step parallel quantum circuit instead of the serial process extensively adopted in the literature. We also show that such parallel circuit can be straightforwardly applied to entangle two macroscopic clouds of atoms. Both protocols proposed here predict a high entanglement degree with a success probability close to unity for state-of-the-art parameters. Among other applications, our proposal and its extension to multiple atom-cavity systems step toward a suitable route for quantum networking, in particular for quantum state transfer, quantum teleportation, and nonlocal quantum memory.
Massively parallel algorithms for real-time wavefront control of a dense adaptive optics system
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fijany, A.; Milman, M.; Redding, D.
1994-12-31
In this paper massively parallel algorithms and architectures for real-time wavefront control of a dense adaptive optic system (SELENE) are presented. The authors have already shown that the computation of a near optimal control algorithm for SELENE can be reduced to the solution of a discrete Poisson equation on a regular domain. Although, this represents an optimal computation, due the large size of the system and the high sampling rate requirement, the implementation of this control algorithm poses a computationally challenging problem since it demands a sustained computational throughput of the order of 10 GFlops. They develop a novel algorithm,more » designated as Fast Invariant Imbedding algorithm, which offers a massive degree of parallelism with simple communication and synchronization requirements. Due to these features, this algorithm is significantly more efficient than other Fast Poisson Solvers for implementation on massively parallel architectures. The authors also discuss two massively parallel, algorithmically specialized, architectures for low-cost and optimal implementation of the Fast Invariant Imbedding algorithm.« less
Essential issues in multiprocessor systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gajski, D.D.; Peir, J.K.
1985-06-01
During the past several years, a great number of proposals have been made with the objective to increase supercomputer performance by an order of magnitude on the basis of a utilization of new computer architectures. The present paper is concerned with a suitable classification scheme for comparing these architectures. It is pointed out that there are basically four schools of thought as to the most important factor for an enhancement of computer performance. According to one school, the development of faster circuits will make it possible to retain present architectures, except, possibly, for a mechanism providing synchronization of parallel processes.more » A second school assigns priority to the optimization and vectorization of compilers, which will detect parallelism and help users to write better parallel programs. A third school believes in the predominant importance of new parallel algorithms, while the fourth school supports new models of computation. The merits of the four approaches are critically evaluated. 50 references.« less
Archer, Charles Jens; Musselman, Roy Glenn; Peters, Amanda; Pinnow, Kurt Walter; Swartz, Brent Allen; Wallenfelt, Brian Paul
2010-03-16
A massively parallel computer system contains an inter-nodal communications network of node-to-node links. Each node implements a respective routing strategy for routing data through the network, the routing strategies not necessarily being the same in every node. The routing strategies implemented in the nodes are dynamically adjusted during application execution to shift network workload as required. Preferably, adjustment of routing policies in selective nodes is performed at synchronization points. The network may be dynamically monitored, and routing strategies adjusted according to detected network conditions.
Ibrahim, Khaled Z.; Madduri, Kamesh; Williams, Samuel; ...
2013-07-18
The Gyrokinetic Toroidal Code (GTC) uses the particle-in-cell method to efficiently simulate plasma microturbulence. This paper presents novel analysis and optimization techniques to enhance the performance of GTC on large-scale machines. We introduce cell access analysis to better manage locality vs. synchronization tradeoffs on CPU and GPU-based architectures. Finally, our optimized hybrid parallel implementation of GTC uses MPI, OpenMP, and NVIDIA CUDA, achieves up to a 2× speedup over the reference Fortran version on multiple parallel systems, and scales efficiently to tens of thousands of cores.
Runtime support for data parallel tasks
NASA Technical Reports Server (NTRS)
Haines, Matthew; Hess, Bryan; Mehrotra, Piyush; Vanrosendale, John; Zima, Hans
1994-01-01
We have recently introduced a set of Fortran language extensions that allow for integrated support of task and data parallelism, and provide for shared data abstractions (SDA's) as a method for communications and synchronization among these tasks. In this paper we discuss the design and implementation issues of the runtime system necessary to support these extensions, and discuss the underlying requirements for such a system. To test the feasibility of this approach, we implement a prototype of the runtime system and use this to support an abstract multidisciplinary optimization (MDO) problem for aircraft design. We give initial results and discuss future plans.
MTP: An atomic multicast transport protocol
NASA Technical Reports Server (NTRS)
Freier, Alan O.; Marzullo, Keith
1990-01-01
Multicast transport protocol (MTP); a reliable transport protocol that utilizes the multicast strategy of applicable lower layer network architectures is described. In addition to transporting data reliably and efficiently, MTP provides the client synchronization necessary for agreement on the receipt of data and the joining of the group of communicants.
Polymorphous Computing Architectures
2007-12-12
provide a multiprocessor implementation. In this work, we introduce the Atomos transactional programming language, which is the first to include...implicit transactions, strong atomicity, and a scalable multiprocessor implementation [47]. Atomos is derived from Java, but replaces its synchronization...and conditional waiting constructs with transactional alternatives. The Atomos conditional waiting proposal is tailored to allow efficient
A Multiscale Parallel Computing Architecture for Automated Segmentation of the Brain Connectome
Knobe, Kathleen; Newton, Ryan R.; Schlimbach, Frank; Blower, Melanie; Reid, R. Clay
2015-01-01
Several groups in neurobiology have embarked into deciphering the brain circuitry using large-scale imaging of a mouse brain and manual tracing of the connections between neurons. Creating a graph of the brain circuitry, also called a connectome, could have a huge impact on the understanding of neurodegenerative diseases such as Alzheimer’s disease. Although considerably smaller than a human brain, a mouse brain already exhibits one billion connections and manually tracing the connectome of a mouse brain can only be achieved partially. This paper proposes to scale up the tracing by using automated image segmentation and a parallel computing approach designed for domain experts. We explain the design decisions behind our parallel approach and we present our results for the segmentation of the vasculature and the cell nuclei, which have been obtained without any manual intervention. PMID:21926011
Constructive polarization modulation for coherent population trapping clock
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yun, Peter, E-mail: enxue.yun@obspm.fr; Danet, Jean-Marie; Holleville, David
2014-12-08
We propose a constructive polarization modulation scheme for atomic clocks based on coherent population trapping (CPT). In this scheme, the polarization of a bichromatic laser beam is modulated between two opposite circular polarizations to avoid trapping the atomic populations in the extreme Zeeman sublevels. We show that if an appropriate phase modulation between the two optical components of the bichromatic laser is applied synchronously, the two CPT dark states which are produced successively by the alternate polarizations add constructively. Measured CPT resonance contrasts up to 20% in one-pulse CPT and 12% in two-pulse Ramsey-CPT experiments are reported, demonstrating the potentialmore » of this scheme for applications to high performance atomic clocks.« less
Parallel software support for computational structural mechanics
NASA Technical Reports Server (NTRS)
Jordan, Harry F.
1987-01-01
The application of the parallel programming methodology known as the Force was conducted. Two application issues were addressed. The first involves the efficiency of the implementation and its completeness in terms of satisfying the needs of other researchers implementing parallel algorithms. Support for, and interaction with, other Computational Structural Mechanics (CSM) researchers using the Force was the main issue, but some independent investigation of the Barrier construct, which is extremely important to overall performance, was also undertaken. Another efficiency issue which was addressed was that of relaxing the strong synchronization condition imposed on the self-scheduled parallel DO loop. The Force was extended by the addition of logical conditions to the cases of a parallel case construct and by the inclusion of a self-scheduled version of this construct. The second issue involved applying the Force to the parallelization of finite element codes such as those found in the NICE/SPAR testbed system. One of the more difficult problems encountered is the determination of what information in COMMON blocks is actually used outside of a subroutine and when a subroutine uses a COMMON block merely as scratch storage for internal temporary results.
Zhang, Qun; Hepburn, John W
2008-08-15
We propose a novel method that uses the oscillation of an atomic excited wave packet observed through a pump-probe technique to accurately determine the zero time delay between a pair of ultrashort laser pulses. This physically based approach provides an easy fix for the intractable problem of synchronizing two different femtosecond laser pulses in a practical experimental environment, especially where an in situ time zero measurement with high accuracy is required.
On time scales and time synchronization using LORAN-C as a time reference signal
NASA Technical Reports Server (NTRS)
Chi, A. R.
1974-01-01
The long term performance of the eight LORAN-C chains is presented in terms of the Coordinated Universal Time (UTC) of the U.S. Naval Observatory (USNO); and the use of the LORAN-C navigation system for maintaining the user's clock to a UTC scale is described. The atomic time scale and the UTC of several national laboratories and observatories relative to the international atomic time are reported. Typical performance of several NASA tracking station clocks, relative to the USNO master clock, is also presented.
High Fidelity Simulations of Large-Scale Wireless Networks
DOE Office of Scientific and Technical Information (OSTI.GOV)
Onunkwo, Uzoma; Benz, Zachary
The worldwide proliferation of wireless connected devices continues to accelerate. There are 10s of billions of wireless links across the planet with an additional explosion of new wireless usage anticipated as the Internet of Things develops. Wireless technologies do not only provide convenience for mobile applications, but are also extremely cost-effective to deploy. Thus, this trend towards wireless connectivity will only continue and Sandia must develop the necessary simulation technology to proactively analyze the associated emerging vulnerabilities. Wireless networks are marked by mobility and proximity-based connectivity. The de facto standard for exploratory studies of wireless networks is discrete event simulationsmore » (DES). However, the simulation of large-scale wireless networks is extremely difficult due to prohibitively large turnaround time. A path forward is to expedite simulations with parallel discrete event simulation (PDES) techniques. The mobility and distance-based connectivity associated with wireless simulations, however, typically doom PDES and fail to scale (e.g., OPNET and ns-3 simulators). We propose a PDES-based tool aimed at reducing the communication overhead between processors. The proposed solution will use light-weight processes to dynamically distribute computation workload while mitigating communication overhead associated with synchronizations. This work is vital to the analytics and validation capabilities of simulation and emulation at Sandia. We have years of experience in Sandia’s simulation and emulation projects (e.g., MINIMEGA and FIREWHEEL). Sandia’s current highly-regarded capabilities in large-scale emulations have focused on wired networks, where two assumptions prevent scalable wireless studies: (a) the connections between objects are mostly static and (b) the nodes have fixed locations.« less
Leonardy, Simone; Freymark, Gerald; Hebener, Sabrina; Ellehauge, Eva; Søgaard-Andersen, Lotte
2007-01-01
Myxococcus xanthus cells harbor two motility machineries, type IV pili (Tfp) and the A-engine. During reversals, the two machineries switch polarity synchronously. We present a mechanism that synchronizes this polarity switching. We identify the required for motility response regulator (RomR) as essential for A-motility. RomR localizes in a bipolar, asymmetric pattern with a large cluster at the lagging cell pole. The large RomR cluster relocates to the new lagging pole in parallel with cell reversals. Dynamic RomR localization is essential for cell reversals, suggesting that RomR relocalization induces the polarity switching of the A-engine. The analysis of RomR mutants shows that the output domain targets RomR to the poles and the receiver domain is essential for dynamic localization. The small GTPase MglA establishes correct RomR polarity, and the Frz two-component system regulates dynamic RomR localization. FrzS localizes with Tfp at the leading pole and relocates in an Frz-dependent manner to the opposite pole during reversals; FrzS and RomR localize and oscillate independently. The Frz system synchronizes these oscillations and thus the synchronous polarity switching of the motility machineries. PMID:17932488
Reflex effects on components of synchronized renal sympathetic nerve activity.
DiBona, G F; Jones, S Y
1998-09-01
The effects of peripheral thermal receptor stimulation (tail in hot water, n = 8, anesthetized) and cardiac baroreceptor stimulation (volume loading, n = 8, conscious) on components of synchronized renal sympathetic nerve activity (RSNA) were examined in rats. The peak height and peak frequency of synchronized RSNA were determined. The renal sympathoexcitatory response to peripheral thermal receptor stimulation was associated with an increase in the peak height. The renal sympathoinhibitory response to cardiac baroreceptor stimulation was associated with a decrease in the peak height. Although heart rate was significantly increased with peripheral thermal receptor stimulation and significantly decreased with cardiac baroreceptor stimulation, peak frequency was unchanged. As peak height reflects the number of active fibers, reflex increases and decreases in synchronized RSNA are mediated by parallel increases and decreases in the number of active renal nerve fibers rather than changes in the centrally based rhythm or peak frequency. The increase in the number of active renal nerve fibers produced by peripheral thermal receptor stimulation reflects the engagement of a unique group of silent renal sympathetic nerve fibers with a characteristic response pattern to stimulation of arterial baroreceptors, peripheral and central chemoreceptors, and peripheral thermal receptors.
Huang, Yu; Guo, Feng; Li, Yongling; Liu, Yufeng
2015-01-01
Parameter estimation for fractional-order chaotic systems is an important issue in fractional-order chaotic control and synchronization and could be essentially formulated as a multidimensional optimization problem. A novel algorithm called quantum parallel particle swarm optimization (QPPSO) is proposed to solve the parameter estimation for fractional-order chaotic systems. The parallel characteristic of quantum computing is used in QPPSO. This characteristic increases the calculation of each generation exponentially. The behavior of particles in quantum space is restrained by the quantum evolution equation, which consists of the current rotation angle, individual optimal quantum rotation angle, and global optimal quantum rotation angle. Numerical simulation based on several typical fractional-order systems and comparisons with some typical existing algorithms show the effectiveness and efficiency of the proposed algorithm. PMID:25603158
A convenient and accurate parallel Input/Output USB device for E-Prime.
Canto, Rosario; Bufalari, Ilaria; D'Ausilio, Alessandro
2011-03-01
Psychological and neurophysiological experiments require the accurate control of timing and synchrony for Input/Output signals. For instance, a typical Event-Related Potential (ERP) study requires an extremely accurate synchronization of stimulus delivery with recordings. This is typically done via computer software such as E-Prime, and fast communications are typically assured by the Parallel Port (PP). However, the PP is an old and disappearing technology that, for example, is no longer available on portable computers. Here we propose a convenient USB device enabling parallel I/O capabilities. We tested this device against the PP on both a desktop and a laptop machine in different stress tests. Our data demonstrate the accuracy of our system, which suggests that it may be a good substitute for the PP with E-Prime.
Idle waves in high-performance computing
NASA Astrophysics Data System (ADS)
Markidis, Stefano; Vencels, Juris; Peng, Ivy Bo; Akhmetova, Dana; Laure, Erwin; Henri, Pierre
2015-01-01
The vast majority of parallel scientific applications distributes computation among processes that are in a busy state when computing and in an idle state when waiting for information from other processes. We identify the propagation of idle waves through processes in scientific applications with a local information exchange between the two processes. Idle waves are nondispersive and have a phase velocity inversely proportional to the average busy time. The physical mechanism enabling the propagation of idle waves is the local synchronization between two processes due to remote data dependency. This study provides a description of the large number of processes in parallel scientific applications as a continuous medium. This work also is a step towards an understanding of how localized idle periods can affect remote processes, leading to the degradation of global performance in parallel scientific applications.
2015-12-07
Only rarely does an astronomical object have a political association. However, the spiral galaxy NGC 7252 acquired exactly that when it was given an unusual nickname. In December 1953, the US President Dwight D. Eisenhower gave a speech advocating the use of nuclear power for peaceful purposes. This “Atoms for Peace” speech was significant for the scientific community, as it brought nuclear research into the public domain, and NGC 7252, which has a superficial resemblance to an atomic nucleus surrounded by the loops of electronic orbits, was dubbed the Atoms for Peace galaxy in honour of this. These loops are well visible in a wider field of view image. This nickname is quite ironic, as the galaxy’s past was anything but peaceful. Its peculiar appearance is the result of a collision between two galaxies that took place about a billion years ago, which ripped both galaxies apart. The loop-like outer structures, likely made up of dust and stars flung outwards by the crash, but recalling orbiting electrons in an atom, are partly responsible for the galaxy’s nickname. This NASA/ESA Hubble Space Telescope image shows the inner parts of the galaxy, revealing a pinwheel-shaped disc that is rotating in a direction opposite to the rest of the galaxy. This disc resembles a spiral galaxy like our own galaxy, the Milky Way, but is only about 10 000 light-years across — about a tenth of the size of the Milky Way. It is believed that this whirling structure is a remnant of the galactic collision. It will most likely have vanished in a few billion years’ time, when NGC 7252 will have completed its merging process.
Hubble View of a Galaxy Resembling an Atomic Nucleus
2017-12-08
The spiral galaxy NGC 7252 has a superficial resemblance to an atomic nucleus surrounded by the loops of electronic orbits, and was informally dubbed the "Atoms for Peace" galaxy. These loops are well visible in a wider field of view image. This nickname is quite ironic, as the galaxy’s past was anything but peaceful. Its peculiar appearance is the result of a collision between two galaxies that took place about a billion years ago, which ripped both galaxies apart. The loop-like outer structures, likely made up of dust and stars flung outwards by the crash, but recalling orbiting electrons in an atom, are partly responsible for the galaxy’s nickname. This NASA/ESA Hubble Space Telescope image shows the inner parts of the galaxy, revealing a pinwheel-shaped disk that is rotating in a direction opposite to the rest of the galaxy. This disk resembles a spiral galaxy like our own galaxy, the Milky Way, but is only about 10,000 light-years across — about a tenth of the size of the Milky Way. It is believed that this whirling structure is a remnant of the galactic collision. It will most likely have vanished in a few billion years’ time, when NGC 7252 will have completed its merging process. Image credit: NASA & ESA, Acknowledgements: Judy Schmidt NASA image use policy. NASA Goddard Space Flight Center enables NASA’s mission through four scientific endeavors: Earth Science, Heliophysics, Solar System Exploration, and Astrophysics. Goddard plays a leading role in NASA’s accomplishments by contributing compelling scientific knowledge to advance the Agency’s mission. Follow us on Twitter Like us on Facebook Find us on Instagram
NASA Astrophysics Data System (ADS)
Tsunegi, Sumito; Lebrun, Romain; Grimaldi, Eva; Jenkins, Alex S.; Kubota, Hitoshi; Yakushiji, Kay; Bortolotti, Paolo; Grollier, Julie; Fukushima, Akio; Yuasa, Shinji; Cros, Vincent
2016-10-01
The rich physics of spin transfer nano-oscillators (STNO) has provoked a huge interest to create a new generation of multi-functional microwave spintronic devices [1]. It has been often emphasized that their nonlinear behavior gives a unique opportunity to tune their radiofrequency (rf) properties but at the cost of large phase noise, not compatible with practical applications. To tackle this issue as well as to open the opportunities to new developments for non-boolean computations [1], one strategy is to use electrical synchronization of STOs through the rf current. Thereby, it is crucial to understand how the synchronization forces transmitted through the electric current. In this talk, we will first present the results of an experimental study showing the self-synchronization of STNO by re-injecting its rf current after a certain delay time [2]. In the second part, we demonstrate that the synchronization of two vortex-STNOs connected in parallel can be tuned either by an artificial delay or by the spin transfer torques [3]. The synchronization of spin-torque oscillators, combined with the drastic improvement of the rf-features (linewidth decreases by a factor of 2 and power increases by a factor of 4) in the synchronized state, marks an important milestone towards a new generation of rf-devices based on STNO. The authors acknowledge the financial support from ANR agency (SPINNOVA: ANR-11-NANO-0016) and EU grant (MOSAIC: ICT-FP7-317950). [1] N. Locatelli, V. Cros, and J. Grollier, Nat Mater 13, 11 (2014). [2] S. Tsunegi et al., arXiv:1509.05583 (2015) [3] R. Lebrun et al., arXiv:1601.01247 (2016)
The specificity of learned parallelism in dual-memory retrieval.
Strobach, Tilo; Schubert, Torsten; Pashler, Harold; Rickard, Timothy
2014-05-01
Retrieval of two responses from one visually presented cue occurs sequentially at the outset of dual-retrieval practice. Exclusively for subjects who adopt a mode of grouping (i.e., synchronizing) their response execution, however, reaction times after dual-retrieval practice indicate a shift to learned retrieval parallelism (e.g., Nino & Rickard, in Journal of Experimental Psychology: Learning, Memory, and Cognition, 29, 373-388, 2003). In the present study, we investigated how this learned parallelism is achieved and why it appears to occur only for subjects who group their responses. Two main accounts were considered: a task-level versus a cue-level account. The task-level account assumes that learned retrieval parallelism occurs at the level of the task as a whole and is not limited to practiced cues. Grouping response execution may thus promote a general shift to parallel retrieval following practice. The cue-level account states that learned retrieval parallelism is specific to practiced cues. This type of parallelism may result from cue-specific response chunking that occurs uniquely as a consequence of grouped response execution. The results of two experiments favored the second account and were best interpreted in terms of a structural bottleneck model.
Neural synchronization as a hypothetical explanation of the psychoanalytic unconscious.
Ceylan, Mehmet Emin; Dönmez, Aslıhan; Ünsalver, Barış Önen; Evrensel, Alper
2016-02-01
Cognitive scientists have tried to explain the neural mechanisms of unconscious mental states such as coma, epileptic seizures, and anesthesia-induced unconsciousness. However these types of unconscious states are different from the psychoanalytic unconscious. In this review, we aim to present our hypothesis about the neural correlates underlying psychoanalytic unconscious. To fulfill this aim, we firstly review the previous explanations about the neural correlates of conscious and unconscious mental states, such as brain oscillations, synchronicity of neural networks, and cognitive binding. By doing so, we hope to lay a neuroscientific ground for our hypothesis about neural correlates of psychoanalytic unconscious; parallel but unsynchronized neural networks between different layers of consciousness and unconsciousness. Next, we propose a neuroscientific mechanism about how the repressed mental events reach the conscious awareness; the lock of neural synchronization between two mental layers of conscious and unconscious. At the last section, we will discuss the data about schizophrenia as a clinical example of our proposed hypothesis. Copyright © 2015 Elsevier Inc. All rights reserved.
Current harmonics elimination control method for six-phase PM synchronous motor drives.
Yuan, Lei; Chen, Ming-liang; Shen, Jian-qing; Xiao, Fei
2015-11-01
To reduce the undesired 5th and 7th stator harmonic current in the six-phase permanent magnet synchronous motor (PMSM), an improved vector control algorithm was proposed based on vector space decomposition (VSD) transformation method, which can control the fundamental and harmonic subspace separately. To improve the traditional VSD technology, a novel synchronous rotating coordinate transformation matrix was presented in this paper, and only using the traditional PI controller in d-q subspace can meet the non-static difference adjustment, the controller parameter design method is given by employing internal model principle. Moreover, the current PI controller parallel with resonant controller is employed in x-y subspace to realize the specific 5th and 7th harmonic component compensation. In addition, a new six-phase SVPWM algorithm based on VSD transformation theory is also proposed. Simulation and experimental results verify the effectiveness of current decoupling vector controller. Copyright © 2015 ISA. Published by Elsevier Ltd. All rights reserved.
Science 101: How Do Atomic Clocks Work?
ERIC Educational Resources Information Center
Science and Children, 2008
2008-01-01
You might be wondering why in the world we need such precise measures of time. Well, many systems we use everyday, such as Global Positioning Systems, require precise synchronization of time. This comes into play in telecommunications and wireless communications, also. For purely scientific reasons, we can use precise measurement of time to…
On-chip photonic transistor based on the spike synchronization in circuit QED
NASA Astrophysics Data System (ADS)
Gül, Yusuf
2018-03-01
We consider the single photon transistor in coupled cavity system of resonators interacting with multilevel superconducting artificial atom simultaneously. Effective single mode transformation is used for the diagonalization of the Hamiltonian and impedance matching in terms of the normal modes. Storage and transmission of the incident field are described by the interactions between the cavities controlling the atomic transitions of lowest lying states. Rabi splitting of vacuum-induced multiphoton transitions is considered in input/output relations by the quadrature operators in the absence of the input field. Second-order coherence functions are employed to investigate the photon blockade and delocalization-localization transitions of cavity fields. Spontaneous virtual photon conversion into real photons is investigated in localized and oscillating regimes. Reflection and transmission of cavity output fields are investigated in the presence of the multilevel transitions. Accumulation and firing of the reflected and transmitted fields are used to investigate the synchronization of the bunching spike train of transmitted field and population imbalance of cavity fields. In the presence of single photon gate field, gain enhancement is explained for transmitted regime.
ERIC Educational Resources Information Center
Taber, Keith S.
2013-01-01
Comparing the atom to a "tiny solar system" is a common teaching analogy, and the extent to which learners saw the systems as analogous was investigated. English upper secondary students were asked parallel questions about the physical interactions between the components of a simple atomic system and a simple solar system to investigate…
Parallel Algorithms for Switching Edges in Heterogeneous Graphs.
Bhuiyan, Hasanuzzaman; Khan, Maleq; Chen, Jiangzhuo; Marathe, Madhav
2017-06-01
An edge switch is an operation on a graph (or network) where two edges are selected randomly and one of their end vertices are swapped with each other. Edge switch operations have important applications in graph theory and network analysis, such as in generating random networks with a given degree sequence, modeling and analyzing dynamic networks, and in studying various dynamic phenomena over a network. The recent growth of real-world networks motivates the need for efficient parallel algorithms. The dependencies among successive edge switch operations and the requirement to keep the graph simple (i.e., no self-loops or parallel edges) as the edges are switched lead to significant challenges in designing a parallel algorithm. Addressing these challenges requires complex synchronization and communication among the processors leading to difficulties in achieving a good speedup by parallelization. In this paper, we present distributed memory parallel algorithms for switching edges in massive networks. These algorithms provide good speedup and scale well to a large number of processors. A harmonic mean speedup of 73.25 is achieved on eight different networks with 1024 processors. One of the steps in our edge switch algorithms requires the computation of multinomial random variables in parallel. This paper presents the first non-trivial parallel algorithm for the problem, achieving a speedup of 925 using 1024 processors.
Parallel Algorithms for Switching Edges in Heterogeneous Graphs☆
Khan, Maleq; Chen, Jiangzhuo; Marathe, Madhav
2017-01-01
An edge switch is an operation on a graph (or network) where two edges are selected randomly and one of their end vertices are swapped with each other. Edge switch operations have important applications in graph theory and network analysis, such as in generating random networks with a given degree sequence, modeling and analyzing dynamic networks, and in studying various dynamic phenomena over a network. The recent growth of real-world networks motivates the need for efficient parallel algorithms. The dependencies among successive edge switch operations and the requirement to keep the graph simple (i.e., no self-loops or parallel edges) as the edges are switched lead to significant challenges in designing a parallel algorithm. Addressing these challenges requires complex synchronization and communication among the processors leading to difficulties in achieving a good speedup by parallelization. In this paper, we present distributed memory parallel algorithms for switching edges in massive networks. These algorithms provide good speedup and scale well to a large number of processors. A harmonic mean speedup of 73.25 is achieved on eight different networks with 1024 processors. One of the steps in our edge switch algorithms requires the computation of multinomial random variables in parallel. This paper presents the first non-trivial parallel algorithm for the problem, achieving a speedup of 925 using 1024 processors. PMID:28757680
High-Performance Computation of Distributed-Memory Parallel 3D Voronoi and Delaunay Tessellation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peterka, Tom; Morozov, Dmitriy; Phillips, Carolyn
2014-11-14
Computing a Voronoi or Delaunay tessellation from a set of points is a core part of the analysis of many simulated and measured datasets: N-body simulations, molecular dynamics codes, and LIDAR point clouds are just a few examples. Such computational geometry methods are common in data analysis and visualization; but as the scale of simulations and observations surpasses billions of particles, the existing serial and shared-memory algorithms no longer suffice. A distributed-memory scalable parallel algorithm is the only feasible approach. The primary contribution of this paper is a new parallel Delaunay and Voronoi tessellation algorithm that automatically determines which neighbormore » points need to be exchanged among the subdomains of a spatial decomposition. Other contributions include periodic and wall boundary conditions, comparison of our method using two popular serial libraries, and application to numerous science datasets.« less
Parallel discrete event simulation using shared memory
NASA Technical Reports Server (NTRS)
Reed, Daniel A.; Malony, Allen D.; Mccredie, Bradley D.
1988-01-01
With traditional event-list techniques, evaluating a detailed discrete-event simulation-model can often require hours or even days of computation time. By eliminating the event list and maintaining only sufficient synchronization to ensure causality, parallel simulation can potentially provide speedups that are linear in the numbers of processors. A set of shared-memory experiments, using the Chandy-Misra distributed-simulation algorithm, to simulate networks of queues is presented. Parameters of the study include queueing network topology and routing probabilities, number of processors, and assignment of network nodes to processors. These experiments show that Chandy-Misra distributed simulation is a questionable alternative to sequential-simulation of most queueing network models.
Clock Agreement Among Parallel Supercomputer Nodes
Jones, Terry R.; Koenig, Gregory A.
2014-04-30
This dataset presents measurements that quantify the clock synchronization time-agreement characteristics among several high performance computers including the current world's most powerful machine for open science, the U.S. Department of Energy's Titan machine sited at Oak Ridge National Laboratory. These ultra-fast machines derive much of their computational capability from extreme node counts (over 18000 nodes in the case of the Titan machine). Time-agreement is commonly utilized by parallel programming applications and tools, distributed programming application and tools, and system software. Our time-agreement measurements detail the degree of time variance between nodes and how that variance changes over time. The dataset includes empirical measurements and the accompanying spreadsheets.
NASA Astrophysics Data System (ADS)
Tornow, Ralf P.; Milczarek, Aleksandra; Odstrcilik, Jan; Kolar, Radim
2017-07-01
A parallel video ophthalmoscope was developed to acquire short video sequences (25 fps, 250 frames) of both eyes simultaneously with exact synchronization. Video sequences were registered off-line to compensate for eye movements. From registered video sequences dynamic parameters like cardiac cycle induced reflection changes and eye movements can be calculated and compared between eyes.
Linux OS Jitter Measurements at Large Node Counts using a BlueGene/L
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jones, Terry R; Tauferner, Mr. Andrew; Inglett, Mr. Todd
2010-01-01
We present experimental results for a coordinated scheduling implementation of the Linux operating system. Results were collected on an IBM Blue Gene/L machine at scales up to 16K nodes. Our results indicate coordinated scheduling was able to provide a dramatic improvement in scaling performance for two applications characterized as bulk synchronous parallel programs.
Lee, Li-Yu; Lin, Gigin; Chen, Shu-Jen; Lu, Yen-Jung; Huang, Huei-Jean; Yen, Chi-Feng; Han, Chien Min; Lee, Yun-Shien; Wang, Tzu-Hao; Chao, Angel
2017-01-01
Benign metastasizing leiomyoma (BML) is a rare disease entity typically presenting as multiple extrauterine leiomyomas associated with a uterine leiomyoma. It has been hypothesized that the extrauterine leiomyomata represent distant metastasis of the uterine leiomyoma. To date, the only molecular evidence supporting this hypothesis was derived from clonality analyses based on X-chromosome inactivation assays. Here, we sought to address this issue by examining paired specimens of synchronous pulmonary and uterine leiomyomata from three patients using targeted massively parallel sequencing and molecular inversion probe array analysis for detecting somatic mutations and copy number aberrations. We detected identical non-hot-spot somatic mutations and similar patterns of copy number aberrations (CNAs) in paired pulmonary and uterine leiomyomata from two patients, indicating the clonal relationship between pulmonary and uterine leiomyomata. In addition to loss of chromosome 22q found in the literature, we identified additional recurrent CNAs including losses of chromosome 3q and 11q. In conclusion, our findings of the clonal relationship between synchronous pulmonary and uterine leiomyomas support the hypothesis that BML represents a condition wherein a uterine leiomyoma disseminates to distant extrauterine locations. PMID:28533481
Wu, Ren-Chin; Chao, An-Shine; Lee, Li-Yu; Lin, Gigin; Chen, Shu-Jen; Lu, Yen-Jung; Huang, Huei-Jean; Yen, Chi-Feng; Han, Chien Min; Lee, Yun-Shien; Wang, Tzu-Hao; Chao, Angel
2017-07-18
Benign metastasizing leiomyoma (BML) is a rare disease entity typically presenting as multiple extrauterine leiomyomas associated with a uterine leiomyoma. It has been hypothesized that the extrauterine leiomyomata represent distant metastasis of the uterine leiomyoma. To date, the only molecular evidence supporting this hypothesis was derived from clonality analyses based on X-chromosome inactivation assays. Here, we sought to address this issue by examining paired specimens of synchronous pulmonary and uterine leiomyomata from three patients using targeted massively parallel sequencing and molecular inversion probe array analysis for detecting somatic mutations and copy number aberrations. We detected identical non-hot-spot somatic mutations and similar patterns of copy number aberrations (CNAs) in paired pulmonary and uterine leiomyomata from two patients, indicating the clonal relationship between pulmonary and uterine leiomyomata. In addition to loss of chromosome 22q found in the literature, we identified additional recurrent CNAs including losses of chromosome 3q and 11q. In conclusion, our findings of the clonal relationship between synchronous pulmonary and uterine leiomyomas support the hypothesis that BML represents a condition wherein a uterine leiomyoma disseminates to distant extrauterine locations.
Parallel odor processing by mitral and middle tufted cells in the olfactory bulb.
Cavarretta, Francesco; Burton, Shawn D; Igarashi, Kei M; Shepherd, Gordon M; Hines, Michael L; Migliore, Michele
2018-05-16
The olfactory bulb (OB) transforms sensory input into spatially and temporally organized patterns of activity in principal mitral (MC) and middle tufted (mTC) cells. Thus far, the mechanisms underlying odor representations in the OB have been mainly investigated in MCs. However, experimental findings suggest that MC and mTC may encode parallel and complementary odor representations. We have analyzed the functional roles of these pathways by using a morphologically and physiologically realistic three-dimensional model to explore the MC and mTC microcircuits in the glomerular layer and deeper plexiform layer. The model makes several predictions. MCs and mTCs are controlled by similar computations in the glomerular layer but are differentially modulated in deeper layers. The intrinsic properties of mTCs promote their synchronization through a common granule cell input. Finally, the MC and mTC pathways can be coordinated through the deep short-axon cells in providing input to the olfactory cortex. The results suggest how these mechanisms can dynamically select the functional network connectivity to create the overall output of the OB and promote the dynamic synchronization of glomerular units for any given odor stimulus.
Self-Assembly of Parallel Atomic Wires and Periodic Clusters of Silicon on a Vicinal Si(111) Surface
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sekiguchi, Takeharu; Yoshida, Shunji; Itoh, Kohei M.
2005-09-02
Silicon self-assembly at step edges in the initial stage of homoepitaxial growth on a vicinal Si(111) surface is studied by scanning tunneling microscopy. The resulting atomic structures change dramatically from a parallel array of 0.7 nm wide wires to one-dimensionally aligned periodic clusters of diameter {approx}2 nm and periodicity 2.7 nm in the very narrow range of growth temperatures between 400 and 300 deg. C. These nanostructures are expected to play important roles in future developments of silicon quantum computers. Mechanisms leading to such distinct structures are discussed.
NASA Astrophysics Data System (ADS)
Xie, Lizhe; Hu, Yining; Chen, Yang; Shi, Luyao
2015-03-01
Projection and back-projection are the most computational consuming parts in Computed Tomography (CT) reconstruction. Parallelization strategies using GPU computing techniques have been introduced. We in this paper present a new parallelization scheme for both projection and back-projection. The proposed method is based on CUDA technology carried out by NVIDIA Corporation. Instead of build complex model, we aimed on optimizing the existing algorithm and make it suitable for CUDA implementation so as to gain fast computation speed. Besides making use of texture fetching operation which helps gain faster interpolation speed, we fixed sampling numbers in the computation of projection, to ensure the synchronization of blocks and threads, thus prevents the latency caused by inconsistent computation complexity. Experiment results have proven the computational efficiency and imaging quality of the proposed method.
Sublattice parallel replica dynamics.
Martínez, Enrique; Uberuaga, Blas P; Voter, Arthur F
2014-06-01
Exascale computing presents a challenge for the scientific community as new algorithms must be developed to take full advantage of the new computing paradigm. Atomistic simulation methods that offer full fidelity to the underlying potential, i.e., molecular dynamics (MD) and parallel replica dynamics, fail to use the whole machine speedup, leaving a region in time and sample size space that is unattainable with current algorithms. In this paper, we present an extension of the parallel replica dynamics algorithm [A. F. Voter, Phys. Rev. B 57, R13985 (1998)] by combining it with the synchronous sublattice approach of Shim and Amar [ and , Phys. Rev. B 71, 125432 (2005)], thereby exploiting event locality to improve the algorithm scalability. This algorithm is based on a domain decomposition in which events happen independently in different regions in the sample. We develop an analytical expression for the speedup given by this sublattice parallel replica dynamics algorithm and compare it with parallel MD and traditional parallel replica dynamics. We demonstrate how this algorithm, which introduces a slight additional approximation of event locality, enables the study of physical systems unreachable with traditional methodologies and promises to better utilize the resources of current high performance and future exascale computers.
White, Lauren; Oh, Paul; Kwan, Matthew; Gove, Peter; Leahey, Tricia; Faulkner, Guy
2016-01-01
Background The economic burden of physical inactivity in Canada is estimated at Can $6.8 billion (US $5 billion) per year. Employers bear a substantial proportion of the economic costs, as they pay more for inactive workers in health care and other organizational costs. In response, many Canadian employers offer wellness programs, though these are often underutilized. While financial health incentives have been proposed as one way of increasing participation, their longer term effects (ie postintervention effects) are not clear. Objective The objective of this paper is to outline the methodology for a randomized control trial (RCT) examining the longer term impact of an existing physical activity promotion program that is enhanced by adding guaranteed rewards (Can $1 [US $0.74] per day step goal met) in a lower active hospital employee population (less than 10,000 steps per day). Methods A 12-week, parallel-arm RCT (with a 12-week postintervention follow-up) will be employed. Employees using Change4Life (a fully automated, incentive-based wellness program) and accumulating fewer than 10,000 steps per day at baseline (weeks 1 to 2) will be randomly allocated (1:1) to standard care (wellness program, accelerometer) or an intervention group (standard care plus guaranteed incentives). All study participants will be asked to wear the accelerometer and synchronize it to Change4Life daily, although only intervention group participants will receive guaranteed incentives for reaching tailored daily step count goals (Can $1 [US $0.74] per day; weeks 3 to 12). The primary study outcome will be mean proportion of participant-days step goal reached during the postintervention follow-up period (week 24). Mean proportion of participant-days step goal reached during the intervention period (week 12) will be a secondary outcome. Results Enrollment for the study will be completed in February 2017. Data analysis will commence in September 2017. Study results are to be published in the winter of 2018. Conclusions This protocol was designed to examine the impact of guaranteed rewards on physical activity maintenance in lower active hospital employees. ClinicalTrial ClinicalTrials.gov NCT02638675; https://clinicaltrials.gov/ct2/show/NCT0 2638675 (Archived by WebCite at http://www.webcitation.org/6g4pvZvhW) PMID:27956377
Rethinking key–value store for parallel I/O optimization
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kougkas, Anthony; Eslami, Hassan; Sun, Xian-He
2015-01-26
Key-value stores are being widely used as the storage system for large-scale internet services and cloud storage systems. However, they are rarely used in HPC systems, where parallel file systems are the dominant storage solution. In this study, we examine the architecture differences and performance characteristics of parallel file systems and key-value stores. We propose using key-value stores to optimize overall Input/Output (I/O) performance, especially for workloads that parallel file systems cannot handle well, such as the cases with intense data synchronization or heavy metadata operations. We conducted experiments with several synthetic benchmarks, an I/O benchmark, and a real application.more » We modeled the performance of these two systems using collected data from our experiments, and we provide a predictive method to identify which system offers better I/O performance given a specific workload. The results show that we can optimize the I/O performance in HPC systems by utilizing key-value stores.« less
NASA Technical Reports Server (NTRS)
Schutz, Bob E.; Baker, Gregory A.
1997-01-01
The recovery of a high resolution geopotential from satellite gradiometer observations motivates the examination of high performance computational techniques. The primary subject matter addresses specifically the use of satellite gradiometer and GPS observations to form and invert the normal matrix associated with a large degree and order geopotential solution. Memory resident and out-of-core parallel linear algebra techniques along with data parallel batch algorithms form the foundation of the least squares application structure. A secondary topic includes the adoption of object oriented programming techniques to enhance modularity and reusability of code. Applications implementing the parallel and object oriented methods successfully calculate the degree variance for a degree and order 110 geopotential solution on 32 processors of the Cray T3E. The memory resident gradiometer application exhibits an overall application performance of 5.4 Gflops, and the out-of-core linear solver exhibits an overall performance of 2.4 Gflops. The combination solution derived from a sun synchronous gradiometer orbit produce average geoid height variances of 17 millimeters.
NASA Astrophysics Data System (ADS)
Baker, Gregory Allen
The recovery of a high resolution geopotential from satellite gradiometer observations motivates the examination of high performance computational techniques. The primary subject matter addresses specifically the use of satellite gradiometer and GPS observations to form and invert the normal matrix associated with a large degree and order geopotential solution. Memory resident and out-of-core parallel linear algebra techniques along with data parallel batch algorithms form the foundation of the least squares application structure. A secondary topic includes the adoption of object oriented programming techniques to enhance modularity and reusability of code. Applications implementing the parallel and object oriented methods successfully calculate the degree variance for a degree and order 110 geopotential solution on 32 processors of the Cray T3E. The memory resident gradiometer application exhibits an overall application performance of 5.4 Gflops, and the out-of-core linear solver exhibits an overall performance of 2.4 Gflops. The combination solution derived from a sun synchronous gradiometer orbit produce average geoid height variances of 17 millimeters.
Parallel Discrete Molecular Dynamics Simulation With Speculation and In-Order Commitment*†
Khan, Md. Ashfaquzzaman; Herbordt, Martin C.
2011-01-01
Discrete molecular dynamics simulation (DMD) uses simplified and discretized models enabling simulations to advance by event rather than by timestep. DMD is an instance of discrete event simulation and so is difficult to scale: even in this multi-core era, all reported DMD codes are serial. In this paper we discuss the inherent difficulties of scaling DMD and present our method of parallelizing DMD through event-based decomposition. Our method is microarchitecture inspired: speculative processing of events exposes parallelism, while in-order commitment ensures correctness. We analyze the potential of this parallelization method for shared-memory multiprocessors. Achieving scalability required extensive experimentation with scheduling and synchronization methods to mitigate serialization. The speed-up achieved for a variety of system sizes and complexities is nearly 6× on an 8-core and over 9× on a 12-core processor. We present and verify analytical models that account for the achieved performance as a function of available concurrency and architectural limitations. PMID:21822327
Parallel Discrete Molecular Dynamics Simulation With Speculation and In-Order Commitment.
Khan, Md Ashfaquzzaman; Herbordt, Martin C
2011-07-20
Discrete molecular dynamics simulation (DMD) uses simplified and discretized models enabling simulations to advance by event rather than by timestep. DMD is an instance of discrete event simulation and so is difficult to scale: even in this multi-core era, all reported DMD codes are serial. In this paper we discuss the inherent difficulties of scaling DMD and present our method of parallelizing DMD through event-based decomposition. Our method is microarchitecture inspired: speculative processing of events exposes parallelism, while in-order commitment ensures correctness. We analyze the potential of this parallelization method for shared-memory multiprocessors. Achieving scalability required extensive experimentation with scheduling and synchronization methods to mitigate serialization. The speed-up achieved for a variety of system sizes and complexities is nearly 6× on an 8-core and over 9× on a 12-core processor. We present and verify analytical models that account for the achieved performance as a function of available concurrency and architectural limitations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baker, Ann E; Barker, Ashley D; Bland, Arthur S Buddy
Oak Ridge National Laboratory's Leadership Computing Facility (OLCF) continues to deliver the most powerful resources in the U.S. for open science. At 2.33 petaflops peak performance, the Cray XT Jaguar delivered more than 1.4 billion core hours in calendar year (CY) 2011 to researchers around the world for computational simulations relevant to national and energy security; advancing the frontiers of knowledge in physical sciences and areas of biological, medical, environmental, and computer sciences; and providing world-class research facilities for the nation's science enterprise. Users reported more than 670 publications this year arising from their use of OLCF resources. Of thesemore » we report the 300 in this review that are consistent with guidance provided. Scientific achievements by OLCF users cut across all range scales from atomic to molecular to large-scale structures. At the atomic scale, researchers discovered that the anomalously long half-life of Carbon-14 can be explained by calculating, for the first time, the very complex three-body interactions between all the neutrons and protons in the nucleus. At the molecular scale, researchers combined experimental results from LBL's light source and simulations on Jaguar to discover how DNA replication continues past a damaged site so a mutation can be repaired later. Other researchers combined experimental results from ORNL's Spallation Neutron Source and simulations on Jaguar to reveal the molecular structure of ligno-cellulosic material used in bioethanol production. This year, Jaguar has been used to do billion-cell CFD calculations to develop shock wave compression turbo machinery as a means to meet DOE goals for reducing carbon sequestration costs. General Electric used Jaguar to calculate the unsteady flow through turbo machinery to learn what efficiencies the traditional steady flow assumption is hiding from designers. Even a 1% improvement in turbine design can save the nation billions of gallons of fuel.« less
Synchronization crossover of polariton condensates in weakly disordered lattices
NASA Astrophysics Data System (ADS)
Ohadi, H.; del Valle-Inclan Redondo, Y.; Ramsay, A. J.; Hatzopoulos, Z.; Liew, T. C. H.; Eastham, P. R.; Savvidis, P. G.; Baumberg, J. J.
2018-05-01
We demonstrate that the synchronization of a lattice of solid-state condensates when intersite tunneling is switched on depends strongly on the weak local disorder. This finding is vital for implementation of condensate arrays as computation devices. The condensates here are nonlinear bosonic fluids of exciton-polaritons trapped in a weakly disordered Bose-Hubbard potential, where the nearest-neighboring tunneling rate (Josephson coupling) can be dynamically tuned. The system can thus be tuned from a localized to a delocalized fluid as the number density or the Josephson coupling between nearest neighbors increases. The localized fluid is observed as a lattice of unsynchronized condensates emitting at different energies set by the disorder potential. In the delocalized phase, the condensates synchronize and long-range order appears, evidenced by narrowing of momentum and energy distributions, new diffraction peaks in momentum space, and spatial coherence between condensates. Our paper identifies similarities and differences of this nonequilibrium crossover to the traditional Bose-glass to superfluid transition in atomic condensates.
Xu, Jingxiang; Higuchi, Yuji; Ozawa, Nobuki; Sato, Kazuhisa; Hashida, Toshiyuki; Kubo, Momoji
2017-09-20
Ni sintering in the Ni/YSZ porous anode of a solid oxide fuel cell changes the porous structure, leading to degradation. Preventing sintering and degradation during operation is a great challenge. Usually, a sintering molecular dynamics (MD) simulation model consisting of two particles on a substrate is used; however, the model cannot reflect the porous structure effect on sintering. In our previous study, a multi-nanoparticle sintering modeling method with tens of thousands of atoms revealed the effect of the particle framework and porosity on sintering. However, the method cannot reveal the effect of the particle size on sintering and the effect of sintering on the change in the porous structure. In the present study, we report a strategy to reveal them in the porous structure by using our multi-nanoparticle modeling method and a parallel large-scale multimillion-atom MD simulator. We used this method to investigate the effect of YSZ particle size and tortuosity on sintering and degradation in the Ni/YSZ anodes. Our parallel large-scale MD simulation showed that the sintering degree decreased as the YSZ particle size decreased. The gas fuel diffusion path, which reflects the overpotential, was blocked by pore coalescence during sintering. The degradation of gas diffusion performance increased as the YSZ particle size increased. Furthermore, the gas diffusion performance was quantified by a tortuosity parameter and an optimal YSZ particle size, which is equal to that of Ni, was found for good diffusion after sintering. These findings cannot be obtained by previous MD sintering studies with tens of thousands of atoms. The present parallel large-scale multimillion-atom MD simulation makes it possible to clarify the effects of the particle size and tortuosity on sintering and degradation.
Throughput analysis of the IEEE 802.4 token bus standard under heavy load
NASA Technical Reports Server (NTRS)
Pang, Joseph; Tobagi, Fouad
1987-01-01
It has become clear in the last few years that there is a trend towards integrated digital services. Parallel to the development of public Integrated Services Digital Network (ISDN) is service integration in the local area (e.g., a campus, a building, an aircraft). The types of services to be integrated depend very much on the specific local environment. However, applications tend to generate data traffic belonging to one of two classes. According to IEEE 802.4 terminology, the first major class of traffic is termed synchronous, such as packetized voice and data generated from other applications with real-time constraints, and the second class is called asynchronous which includes most computer data traffic such as file transfer or facsimile. The IEEE 802.4 token bus protocol which was designed to support both synchronous and asynchronous traffic is examined. The protocol is basically a timer-controlled token bus access scheme. By a suitable choice of the design parameters, it can be shown that access delay is bounded for synchronous traffic. As well, the bandwidth allocated to asynchronous traffic can be controlled. A throughput analysis of the protocol under heavy load with constant channel occupation of synchronous traffic and constant token-passing times is presented.
Clapping in time parallels literacy and calls upon overlapping neural mechanisms in early readers.
Bonacina, Silvia; Krizman, Jennifer; White-Schwoch, Travis; Kraus, Nina
2018-05-12
The auditory system is extremely precise in processing the temporal information of perceptual events and using these cues to coordinate action. Synchronizing movement to a steady beat relies on this bidirectional connection between sensory and motor systems, and activates many of the auditory and cognitive processes used when reading. Here, we use Interactive Metronome, a clinical intervention technology requiring an individual to clap her hands in time with a steady beat, to investigate whether the links between literacy and synchronization skills, previously established in older children, are also evident in children who are learning to read. We tested 64 typically developing children (ages 5-7 years) on their synchronization abilities, neurophysiological responses to speech in noise, and literacy skills. We found that children who have lower variability in synchronizing have higher phase consistency, higher stability, and more accurate envelope encoding-all neurophysiological response components linked to language skills. Moreover, performing the same task with visual feedback reveals links with literacy skills, notably processing speed, phonological processing, word reading, spelling, morphology, and syntax. These results suggest that rhythm skills and literacy call on overlapping neural mechanisms, supporting the idea that rhythm training may boost literacy in part by engaging sensory-motor systems. © 2018 New York Academy of Sciences.
War in the Information Age: A Primer for Cyberspace Operations in 21st Century Warfare
2010-01-01
funds transfers ( EFT ). 30 Paralleling the rapid expansion of civilian cyberspace use is the increasing use of cyberspace by modern militaries...company files by using a thumb drive to tap the corporate system. Boeing estimated that the stolen documents would have cost it between $5 billion...tactics and intelligence operations such as collecting data, recruiting members of state security services, and setting up phone taps .‖ 69
ICASE Semiannual Report, 1 April 1990 - 30 September 1990
1990-11-01
underlies parallel simulation protocols that synchronize based on logical time (all known approaches). This framework describes a suf- ficient set of...conducted primarily by visiting scientists from universities and from industry, who have resident appointments for limited periods of time , and by consultants...wave equation with point sources and semireflecting impedance boundary conditions. For sources that are piece- wise polynomial in time we get a finite
USSOCOM’s Role in Addressing Human Trafficking
2010-12-02
global issue runs parallel and at times intersects with the increasing prevalence of VEOs as a transnational threat. Already tasked to synchronize...There are 104 countries without laws, policies, or regulations to prevent victims’ deportation.8 These numbers indicate both the size of global HT...whole of government response through USSOCOM integration. HT exhibits the global connectivity of other transnational crimes, but is also
Petrović, Z Lj; Phelps, A V
2009-12-01
Absolute spectral emissivities for Doppler broadened H(alpha) profiles are measured and compared with predictions of energetic hydrogen ion, atom, and molecule behavior in low-current electrical discharges in H2 at very high electric field E to gas density N ratios E/N and low values of Nd , where d is the parallel-plate electrode separation. These observations reflect the energy and angular distributions for the excited atoms and quantitatively test features of multiple-scattering kinetic models in weakly ionized hydrogen in the presence of an electric field that are not tested by the spatial distributions of H(alpha) emission. Absolute spectral intensities agree well with predictions. Asymmetries in Doppler profiles observed parallel to the electric field at 4
Study on the dynamics responses of a transmission system made from carbon nanotubes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yin, Hang; Cai, Kun, E-mail: kuicansj@163.com; Wei, Ning
2015-06-21
A rotational transmission system from coaxial carbon nanotubes (CNTs) is investigated using a computational molecular dynamics approach. The system consists of a motor from a single-walled carbon nanotube and a bearing from a double-walled carbon nanotube. The motor has a high fixed rotational frequency and the two ends of the outer tube in the bearing are fixed. The inner tube in the bearing works as a rotor. Because of the interlayer friction in the bearing, configurations of the joint between the adjacent ends of motor and rotor have significant effects on rotational transmission properties. Four factors are considered in simulation,more » i.e., the bonding types of atoms (sp{sup 1} and sp{sup 2}) on the ends of motor and rotor, the difference between motor and rotor radii, the rotational speed of motor, and the environmental temperature. It is found that the synchronous transmission happens if the sp{sup 1} atoms on the jointed ends of motor and rotor are bonded each other and become new sp{sup 2} atoms. Therefore, the lower difference between radii of motor and rotor, higher temperature of environment leads to synchronous rotational transmission easily. If the environmental temperature is too low (e.g., <150 K), the end of motor adjacent to rotor is easily under buckling and new sp{sup 2} atoms appear, too. With capped CNTs or higher radii difference between rotor and motor at an appropriate temperature, a stable asynchronous rotation of rotor can be generated, and the rotor's frequency varying linearly with motor's frequency between 230 and 270 GHz. A multi-signal transmission device combined with oscillating and rotational motion is proposed for motor and stator shares a same size in radius.« less
Scalable asynchronous execution of cellular automata
NASA Astrophysics Data System (ADS)
Folino, Gianluigi; Giordano, Andrea; Mastroianni, Carlo
2016-10-01
The performance and scalability of cellular automata, when executed on parallel/distributed machines, are limited by the necessity of synchronizing all the nodes at each time step, i.e., a node can execute only after the execution of the previous step at all the other nodes. However, these synchronization requirements can be relaxed: a node can execute one step after synchronizing only with the adjacent nodes. In this fashion, different nodes can execute different time steps. This can be a notable advantageous in many novel and increasingly popular applications of cellular automata, such as smart city applications, simulation of natural phenomena, etc., in which the execution times can be different and variable, due to the heterogeneity of machines and/or data and/or executed functions. Indeed, a longer execution time at a node does not slow down the execution at all the other nodes but only at the neighboring nodes. This is particularly advantageous when the nodes that act as bottlenecks vary during the application execution. The goal of the paper is to analyze the benefits that can be achieved with the described asynchronous implementation of cellular automata, when compared to the classical all-to-all synchronization pattern. The performance and scalability have been evaluated through a Petri net model, as this model is very useful to represent the synchronization barrier among nodes. We examined the usual case in which the territory is partitioned into a number of regions, and the computation associated with a region is assigned to a computing node. We considered both the cases of mono-dimensional and two-dimensional partitioning. The results show that the advantage obtained through the asynchronous execution, when compared to the all-to-all synchronous approach is notable, and it can be as large as 90% in terms of speedup.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tao, Juan-Juan; Zhou, Min-Kang, E-mail: zkhu@hust.edu.cn, E-mail: zmk@hust.edu.cn; Zhang, Qiao-Zhen
2015-09-15
During gravity measurements with Raman type atom interferometry, the frequency of the laser used to drive Raman transition is scanned by chirping the frequency of a direct digital synthesizer (DDS), and the local gravity is determined by precisely measuring the chip rate α of DDS. We present an effective method that can directly evaluate the frequency chirp rate stability of our DDS. By mixing a pair of synchronous linear sweeping signals, the chirp rate fluctuation is precisely measured with a frequency counter. The measurement result shows that the relative α instability can reach 5.7 × 10{sup −11} in 1 s,more » which is neglectable in a 10{sup −9} g level atom interferometry gravimeter.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Söngen, Hagen, E-mail: soengen@uni-mainz.de; Graduate School Materials Science in Mainz, Staudinger Weg 9, 55128 Mainz; Nalbach, Martin
2016-06-15
We present the implementation of a three-dimensional mapping routine for probing solid-liquid interfaces using frequency modulation atomic force microscopy. Our implementation enables fast and flexible data acquisition of up to 20 channels simultaneously. The acquired data can be directly synchronized with commercial atomic force microscope controllers, making our routine easily extendable for related techniques that require additional data channels, e.g., Kelvin probe force microscopy. Moreover, the closest approach of the tip to the sample is limited by a user-defined threshold, providing the possibility to prevent potential damage to the tip. The performance of our setup is demonstrated by visualizing themore » hydration structure above the calcite (10.4) surface in water.« less
NASA Astrophysics Data System (ADS)
Esmaily, M.; Jofre, L.; Mani, A.; Iaccarino, G.
2018-03-01
A geometric multigrid algorithm is introduced for solving nonsymmetric linear systems resulting from the discretization of the variable density Navier-Stokes equations on nonuniform structured rectilinear grids and high-Reynolds number flows. The restriction operation is defined such that the resulting system on the coarser grids is symmetric, thereby allowing for the use of efficient smoother algorithms. To achieve an optimal rate of convergence, the sequence of interpolation and restriction operations are determined through a dynamic procedure. A parallel partitioning strategy is introduced to minimize communication while maintaining the load balance between all processors. To test the proposed algorithm, we consider two cases: 1) homogeneous isotropic turbulence discretized on uniform grids and 2) turbulent duct flow discretized on stretched grids. Testing the algorithm on systems with up to a billion unknowns shows that the cost varies linearly with the number of unknowns. This O (N) behavior confirms the robustness of the proposed multigrid method regarding ill-conditioning of large systems characteristic of multiscale high-Reynolds number turbulent flows. The robustness of our method to density variations is established by considering cases where density varies sharply in space by a factor of up to 104, showing its applicability to two-phase flow problems. Strong and weak scalability studies are carried out, employing up to 30,000 processors, to examine the parallel performance of our implementation. Excellent scalability of our solver is shown for a granularity as low as 104 to 105 unknowns per processor. At its tested peak throughput, it solves approximately 4 billion unknowns per second employing over 16,000 processors with a parallel efficiency higher than 50%.
Cylindrical Vector Beams for Rapid Polarization-Dependent Measurements in Atomic Systems
2011-12-05
www.opticsinfobase.org/abstract.cfm?URI=oe-18-24-25035. 16. S. Tripathi and K. C. Toussaint, Jr., “Rapid Mueller matrix polarimetry based on parallelized...optical trapping [11], atom guiding [12], laser machining [13], charged particle acceleration [14,15], and polarimetry [16]. Yet despite numerous
DOE Office of Scientific and Technical Information (OSTI.GOV)
Marrinan, Thomas; Leigh, Jason; Renambot, Luc
Mixed presence collaboration involves remote collaboration between multiple collocated groups. This paper presents the design and results of a user study that focused on mixed presence collaboration using large-scale tiled display walls. The research was conducted in order to compare data synchronization schemes for multi-user visualization applications. Our study compared three techniques for sharing data between display spaces with varying constraints and affordances. The results provide empirical evidence that using data sharing techniques with continuous synchronization between the sites lead to improved collaboration for a search and analysis task between remotely located groups. We have also identified aspects of synchronizedmore » sessions that result in increased remote collaborator awareness and parallel task coordination. It is believed that this research will lead to better utilization of large-scale tiled display walls for distributed group work.« less
Recurrence spectra of a helium atom in parallel electric and magnetic fields
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wang, Dehua; Department of Mathematics and Physics, Shandong Architecture and Engineering Institute, Jinan 250014, People's Republic of China; Ding, Shiliang
2003-08-01
A model potential for the general Rydberg atom is put forward, which includes not only the Coulomb interaction potential and the core-attractive potential, but also the exchange potential between the excited electron and other electrons. Using the region-splitting consistent and iterative method, we calculated the scaled recurrence spectra of the helium atom in parallel electric and magnetic fields and the closed orbits in the corresponding classical system have also been obtained. In order to remove the Coulomb singularity of the classical motion of Hamiltonian, we implement the Kustaanheimo-Stiefel transformation, which transforms the system from a three-dimensional to a four-dimensional one.more » The Fourier-transformed spectra of the helium atom has allowed direct comparison between peaks in such a plot and the scaled action values of closed orbits. Considering the exchange potential, the number of the closed orbits increased, which led to more peaks in the recurrence spectra. The results are compared with those of the hydrogen case, which shows that the core-scattered effects and the electron exchange potential play an important role in the multielectron Rydberg atom.« less
A density functional theory study on the acetylene cyclotrimerization on Pd-modified Au(111) surface
NASA Astrophysics Data System (ADS)
Ren, Bohua; Dong, Xiuqin; Yu, Yingzhe; Zhang, Minhua
2017-10-01
Calculations based on the first-principle density functional theory were carried out to study the possible acetylene cyclotrimerization reactions on Pd-Au(111) surface and to investigate the effect of Au atom alloying with Pd. The adsorption of C2H2, C4H4, C6H6 and the PDOS of 4d orbitals of surface Pd and Au atoms were studied. The comparison of d-band center of Pd and Au atom before and after C2H2 or C4H4 adsorption suggests that these molecules affect the activity of Pd-Au(111) surface to some degree due to the high binding energy of the adsorption. In our study, the second neighboring Pd ensembles on Pd-Au(111) surface can adsorb two acetylene molecules on parallel-bridge site of two Au atoms and one Pd atom, respectively. Csbnd C bonds are parallel to each other and two acetylenes are adsorbed face to face to produce four-membered ring C4H4 firstly. The geometric effect and electronic effect of Pd-Au(111) surface with the second neighboring Pd ensembles both help to reduce this activation barrier.
Io: Escape and ionization of atmospheric gases
NASA Technical Reports Server (NTRS)
Smyth, W. H.
1981-01-01
Models for the Io oxygen clouds were improved to calculate the two dimensional sky plane intensity of the 1304 A emission and the 880 A emission of atomic oxygen, in addition to the 6300 A emission intensity. These three wavelength emissions are those for which observational measurements have been performed by ground based, rocket, Earth orbiting satellite and Voyager spacecraft instruments. Comparison of model results and observations suggests that an oxygen flux from Io of about 3 billion atoms sq cm sec is required for agreement. Quantitative analysis of the Io sodium cloud has focused upon the initial tasks of acquiring and preliminary evaluation of new sodium cloud and Io plasma torus data.
NASA Technical Reports Server (NTRS)
Wang, H. T.
1979-01-01
Three kinds of frequency measuring systems are described: frequency comparison, phase comparison, and time comparison. With the help of the portable cesium clock in determining the time delay between two stations, a time synchronization, experiment was conducted using the Symphonie satellite. A result with an accuracy of 30 ns and an uncertainty of about 10 ns was obtained. Another experiment, applying the television pulse technique for time synchronization, yielded a result with an error of about 0.5 mu s in 24 hours. In order to measure the short term frequency stability of crystal oscillators or other frequency sources, a rubidium maser atomic frequency standard was developed as well as a short term stability measuring system.
Steady state quantum discord for circularly accelerated atoms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, Jiawei, E-mail: hujiawei@nbu.edu.cn; Yu, Hongwei, E-mail: hwyu@hunnu.edu.cn; Synergetic Innovation Center for Quantum Effects and Applications, Hunan Normal University, Changsha, Hunan 410081
2015-12-15
We study, in the framework of open quantum systems, the dynamics of quantum entanglement and quantum discord of two mutually independent circularly accelerated two-level atoms in interaction with a bath of fluctuating massless scalar fields in the Minkowski vacuum. We assume that the two atoms rotate synchronically with their separation perpendicular to the rotating plane. The time evolution of the quantum entanglement and quantum discord of the two-atom system is investigated. For a maximally entangled initial state, the entanglement measured by concurrence diminishes to zero within a finite time, while the quantum discord can either decrease monotonically to an asymptoticmore » value or diminish to zero at first and then followed by a revival depending on whether the initial state is antisymmetric or symmetric. When both of the two atoms are initially excited, the generation of quantum entanglement shows a delayed feature, while quantum discord is created immediately. Remarkably, the quantum discord for such a circularly accelerated two-atom system takes a nonvanishing value in the steady state, and this is distinct from what happens in both the linear acceleration case and the case of static atoms immersed in a thermal bath.« less
Collective network for computer structures
Blumrich, Matthias A; Coteus, Paul W; Chen, Dong; Gara, Alan; Giampapa, Mark E; Heidelberger, Philip; Hoenicke, Dirk; Takken, Todd E; Steinmacher-Burow, Burkhard D; Vranas, Pavlos M
2014-01-07
A system and method for enabling high-speed, low-latency global collective communications among interconnected processing nodes. The global collective network optimally enables collective reduction operations to be performed during parallel algorithm operations executing in a computer structure having a plurality of the interconnected processing nodes. Router devices are included that interconnect the nodes of the network via links to facilitate performance of low-latency global processing operations at nodes of the virtual network. The global collective network may be configured to provide global barrier and interrupt functionality in asynchronous or synchronized manner. When implemented in a massively-parallel supercomputing structure, the global collective network is physically and logically partitionable according to the needs of a processing algorithm.
Collective network for computer structures
Blumrich, Matthias A [Ridgefield, CT; Coteus, Paul W [Yorktown Heights, NY; Chen, Dong [Croton On Hudson, NY; Gara, Alan [Mount Kisco, NY; Giampapa, Mark E [Irvington, NY; Heidelberger, Philip [Cortlandt Manor, NY; Hoenicke, Dirk [Ossining, NY; Takken, Todd E [Brewster, NY; Steinmacher-Burow, Burkhard D [Wernau, DE; Vranas, Pavlos M [Bedford Hills, NY
2011-08-16
A system and method for enabling high-speed, low-latency global collective communications among interconnected processing nodes. The global collective network optimally enables collective reduction operations to be performed during parallel algorithm operations executing in a computer structure having a plurality of the interconnected processing nodes. Router devices ate included that interconnect the nodes of the network via links to facilitate performance of low-latency global processing operations at nodes of the virtual network and class structures. The global collective network may be configured to provide global barrier and interrupt functionality in asynchronous or synchronized manner. When implemented in a massively-parallel supercomputing structure, the global collective network is physically and logically partitionable according to needs of a processing algorithm.
NASA Astrophysics Data System (ADS)
Konduri, Aditya
Many natural and engineering systems are governed by nonlinear partial differential equations (PDEs) which result in a multiscale phenomena, e.g. turbulent flows. Numerical simulations of these problems are computationally very expensive and demand for extreme levels of parallelism. At realistic conditions, simulations are being carried out on massively parallel computers with hundreds of thousands of processing elements (PEs). It has been observed that communication between PEs as well as their synchronization at these extreme scales take up a significant portion of the total simulation time and result in poor scalability of codes. This issue is likely to pose a bottleneck in scalability of codes on future Exascale systems. In this work, we propose an asynchronous computing algorithm based on widely used finite difference methods to solve PDEs in which synchronization between PEs due to communication is relaxed at a mathematical level. We show that while stability is conserved when schemes are used asynchronously, accuracy is greatly degraded. Since message arrivals at PEs are random processes, so is the behavior of the error. We propose a new statistical framework in which we show that average errors drop always to first-order regardless of the original scheme. We propose new asynchrony-tolerant schemes that maintain accuracy when synchronization is relaxed. The quality of the solution is shown to depend, not only on the physical phenomena and numerical schemes, but also on the characteristics of the computing machine. A novel algorithm using remote memory access communications has been developed to demonstrate excellent scalability of the method for large-scale computing. Finally, we present a path to extend this method in solving complex multi-scale problems on Exascale machines.
JPRS Report Nuclear Developments
1988-06-21
Radio Nacional da Amazonia Network] 23 NEAR EAST & SOUTH ASIA BANGLADESH Energy Minister Addresses Atomic Energy Officials [Dhaka THE BANGLADESH...SATURDAY WINDSOR STAR in English 23 May 88pA5 [Text] Ottawa—MP Herb Gray (L-Windsor West) is asking an affiliate of the UN to investigate the safety of...nuclear power plants at a cost of $7.7 billion. Seabra submitted this proposal to the Senate CPI [Congressional Investigating Commission] that is
Statistical evaluation of synchronous spike patterns extracted by frequent item set mining
Torre, Emiliano; Picado-Muiño, David; Denker, Michael; Borgelt, Christian; Grün, Sonja
2013-01-01
We recently proposed frequent itemset mining (FIM) as a method to perform an optimized search for patterns of synchronous spikes (item sets) in massively parallel spike trains. This search outputs the occurrence count (support) of individual patterns that are not trivially explained by the counts of any superset (closed frequent item sets). The number of patterns found by FIM makes direct statistical tests infeasible due to severe multiple testing. To overcome this issue, we proposed to test the significance not of individual patterns, but instead of their signatures, defined as the pairs of pattern size z and support c. Here, we derive in detail a statistical test for the significance of the signatures under the null hypothesis of full independence (pattern spectrum filtering, PSF) by means of surrogate data. As a result, injected spike patterns that mimic assembly activity are well detected, yielding a low false negative rate. However, this approach is prone to additionally classify patterns resulting from chance overlap of real assembly activity and background spiking as significant. These patterns represent false positives with respect to the null hypothesis of having one assembly of given signature embedded in otherwise independent spiking activity. We propose the additional method of pattern set reduction (PSR) to remove these false positives by conditional filtering. By employing stochastic simulations of parallel spike trains with correlated activity in form of injected spike synchrony in subsets of the neurons, we demonstrate for a range of parameter settings that the analysis scheme composed of FIM, PSF and PSR allows to reliably detect active assemblies in massively parallel spike trains. PMID:24167487
Role of polyamines at the G1/S boundary and G2/M phase of the cell cycle.
Yamashita, Tomoko; Nishimura, Kazuhiro; Saiki, Ryotaro; Okudaira, Hiroyuki; Tome, Mayuko; Higashi, Kyohei; Nakamura, Mizuho; Terui, Yusuke; Fujiwara, Kunio; Kashiwagi, Keiko; Igarashi, Kazuei
2013-06-01
The role of polyamines at the G1/S boundary and in the G2/M phase of the cell cycle was studied using synchronized HeLa cells treated with thymidine or with thymidine and aphidicolin. Synchronized cells were cultured in the absence or presence of α-difluoromethylornithine (DFMO), an inhibitor of ornithine decarboxylase, plus ethylglyoxal bis(guanylhydrazone) (EGBG), an inhibitor of S-adenosylmethionine decarboxylase. When polyamine content was reduced by treatment with DFMO and EGBG, the transition from G1 to S phase was delayed. In parallel, the level of p27(Kip1) was greatly increased, so its mechanism was studied in detail. Synthesis of p27(Kip1) was stimulated at the level of translation by a decrease in polyamine levels, because of the existence of long 5'-untranslated region (5'-UTR) in p27(Kip1) mRNA. Similarly, the transition from the G2/M to the G1 phase was delayed by a reduction in polyamine levels. In parallel, the number of multinucleate cells increased by 3-fold. This was parallel with the inhibition of cytokinesis due to an unusual distribution of actin and α-tubulin at the M phase. Since an association of polyamines with chromosomes was not observed by immunofluorescence microscopy at the M phase, polyamines may have only a minor role in structural changes of chromosomes at the M phase. In general, the involvement of polyamines at the G2/M phase was smaller than that at the G1/S boundary. Copyright © 2013 Elsevier Ltd. All rights reserved.
High-order asynchrony-tolerant finite difference schemes for partial differential equations
NASA Astrophysics Data System (ADS)
Aditya, Konduri; Donzis, Diego A.
2017-12-01
Synchronizations of processing elements (PEs) in massively parallel simulations, which arise due to communication or load imbalances between PEs, significantly affect the scalability of scientific applications. We have recently proposed a method based on finite-difference schemes to solve partial differential equations in an asynchronous fashion - synchronization between PEs is relaxed at a mathematical level. While standard schemes can maintain their stability in the presence of asynchrony, their accuracy is drastically affected. In this work, we present a general methodology to derive asynchrony-tolerant (AT) finite difference schemes of arbitrary order of accuracy, which can maintain their accuracy when synchronizations are relaxed. We show that there are several choices available in selecting a stencil to derive these schemes and discuss their effect on numerical and computational performance. We provide a simple classification of schemes based on the stencil and derive schemes that are representative of different classes. Their numerical error is rigorously analyzed within a statistical framework to obtain the overall accuracy of the solution. Results from numerical experiments are used to validate the performance of the schemes.
Apollo and the geology of the moon /Twenty-eighth William Smith Lecture/
NASA Technical Reports Server (NTRS)
Schmitt, H. H.
1975-01-01
Lunar geology evidence is examined for clues to the origin and evolution of the moon and earth. Seven evolutionary episodes, the last covering three billion years to the present day, are constructed for the moon. Parallel episodes in the earth's evolution are masked by the dynamic continuing evolution of the earth over a 4.5 billion year span, in contrast to the moon's quiescence and inability to retain fluids. Comparisons are drawn between the geochemistry and tectonics of the lunar basaltic maria and the earth's ocean basins. Lunar maria rocks differ strikingly in chemical composition from meteoritic matter and solar material. Inundation of frontside lunar maria basins by vast oceans of dark basalt mark the last of the major internally generated evolutionary episodes, and is attributed to consequences of meltdown of the lunar mantle and crust by radioisotope decay from below. Data are drawn primarily from Apollo missions 11-17, supplemented by other sources.
Stepanyuk, Andrey R.; Belan, Pavel V.; Kononenko, Nikolai I.
2014-01-01
When dispersed and cultured in a multielectrode dish (MED), suprachiasmatic nucleus (SCN) neurons express fast oscillations of firing rate (FOFR; fast relative to the circadian cycle), with burst duration ∼10 min, and interburst interval varying from 20 to 60 min in different cells but remaining nevertheless rather regular in individual cells. In many cases, separate neurons in distant parts of the 1 mm recording area of a MED exhibited correlated FOFR. Neither the mechanism of FOFR nor the mechanism of their synchronization among neurons is known. Based on recent data implicating vasoactive intestinal polypeptide (VIP) as a key intercellular synchronizing agent, we built a model in which VIP acts as both a feedback regulator to generate FOFR in individual neurons, and a diffusible synchronizing agent to produce coherent electrical output of a neuronal network. In our model, VIP binding to its (VPAC2) receptors acts through Gs G-proteins to activate adenylyl cyclase (AC), increase intracellular cAMP, and open cyclic-nucleotide-gated (CNG) cation channels, thus depolarizing the cell and generating neuronal firing to release VIP. In parallel, slowly developing homologous desensitization and internalization of VPAC2 receptors terminates elevation of cAMP and thereby provides an interpulse silent interval. Through mathematical modeling, we show that this VIP/VPAC2/AC/cAMP/CNG-channel mechanism is sufficient for generating reliable FOFR in single neurons. When our model for FOFR is combined with a published model of synchronization of circadian rhythms based on VIP/VPAC2 and Per gene regulation synchronization of circadian rhythms is significantly accelerated. These results suggest that (a) auto/paracrine regulation by VIP/VPAC2 and intracellular AC/cAMP/CNG-channels are sufficient to provide robust FOFR and synchrony among neurons in a heterogeneous network, and (b) this system may also participate in synchronization of circadian rhythms. PMID:25192180
Communication overhead on the Intel Paragon, IBM SP2 and Meiko CS-2
NASA Technical Reports Server (NTRS)
Bokhari, Shahid H.
1995-01-01
Interprocessor communication overhead is a crucial measure of the power of parallel computing systems-its impact can severely limit the performance of parallel programs. This report presents measurements of communication overhead on three contemporary commercial multicomputer systems: the Intel Paragon, the IBM SP2 and the Meiko CS-2. In each case the time to communicate between processors is presented as a function of message length. The time for global synchronization and memory access is discussed. The performance of these machines in emulating hypercubes and executing random pairwise exchanges is also investigated. It is shown that the interprocessor communication time depends heavily on the specific communication pattern required. These observations contradict the commonly held belief that communication overhead on contemporary machines is independent of the placement of tasks on processors. The information presented in this report permits the evaluation of the efficiency of parallel algorithm implementations against standard baselines.
A Big Data Approach to Analyzing Market Volatility
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, Kesheng; Bethel, E. Wes; Gu, Ming
2013-06-05
Understanding the microstructure of the financial market requires the processing of a vast amount of data related to individual trades, and sometimes even multiple levels of quotes. Analyzing such a large volume of data requires tremendous computing power that is not easily available to financial academics and regulators. Fortunately, public funded High Performance Computing (HPC) power is widely available at the National Laboratories in the US. In this paper we demonstrate that the HPC resource and the techniques for data-intensive sciences can be used to greatly accelerate the computation of an early warning indicator called Volume-synchronized Probability of Informed tradingmore » (VPIN). The test data used in this study contains five and a half year's worth of trading data for about 100 most liquid futures contracts, includes about 3 billion trades, and takes 140GB as text files. By using (1) a more efficient file format for storing the trading records, (2) more effective data structures and algorithms, and (3) parallelizing the computations, we are able to explore 16,000 different ways of computing VPIN in less than 20 hours on a 32-core IBM DataPlex machine. Our test demonstrates that a modest computer is sufficient to monitor a vast number of trading activities in real-time – an ability that could be valuable to regulators. Our test results also confirm that VPIN is a strong predictor of liquidity-induced volatility. With appropriate parameter choices, the false positive rates are about 7% averaged over all the futures contracts in the test data set. More specifically, when VPIN values rise above a threshold (CDF > 0.99), the volatility in the subsequent time windows is higher than the average in 93% of the cases.« less
Parallel Quantum Circuit in a Tunnel Junction
NASA Astrophysics Data System (ADS)
Faizy Namarvar, Omid; Dridi, Ghassen; Joachim, Christian; GNS theory Group Team
In between 2 metallic nanopads, adding identical and independent electron transfer paths in parallel increases the electronic effective coupling between the 2 nanopads through the quantum circuit defined by those paths. Measuring this increase of effective coupling using the tunnelling current intensity can lead for example for 2 paths in parallel to the now standard G =G1 +G2 + 2√{G1 .G2 } conductance superposition law (1). This is only valid for the tunnelling regime (2). For large electronic coupling to the nanopads (or at resonance), G can saturate and even decay as a function of the number of parallel paths added in the quantum circuit (3). We provide here the explanation of this phenomenon: the measurement of the effective Rabi oscillation frequency using the current intensity is constrained by the normalization principle of quantum mechanics. This limits the quantum conductance G for example to go when there is only one channel per metallic nanopads. This ef fect has important consequences for the design of Boolean logic gates at the atomic scale using atomic scale or intramolecular circuits. References: This has the financial support by European PAMS project.
Production of an ordered (B2) CuPd nanoalloy by low-temperature annealing under hydrogen atmosphere.
Yamauchi, Miho; Tsukuda, Tatsuya
2011-05-14
CuPd (1/1) nanoalloys composed of disordered body-centered-cubic crystals (crystal size = 1.6 nm) were prepared by synchronous reduction of Cu and Pd precursor ions with NaBH(4). In situ XRD measurement revealed that Cu and Pd atoms in the CuPd nanoalloys are arranged into an ordered B2 structure under exposure to H(2) (5 kPa) at 373 K. Ordering of Cu and Pd atoms over a longer distance (up to 3.6 nm) was achieved by annealing the nanoalloys for a longer time under a H(2) atmosphere.
Time-resolved atomic inner-shell spectroscopy
NASA Astrophysics Data System (ADS)
Drescher, M.; Hentschel, M.; Kienberger, R.; Uiberacker, M.; Yakovlev, V.; Scrinzi, A.; Westerwalbesloh, Th.; Kleineberg, U.; Heinzmann, U.; Krausz, F.
2002-10-01
The characteristic time constants of the relaxation dynamics of core-excited atoms have hitherto been inferred from the linewidths of electronic transitions measured by continuous-wave extreme ultraviolet or X-ray spectroscopy. Here we demonstrate that a laser-based sampling system, consisting of a few-femtosecond visible light pulse and a synchronized sub-femtosecond soft X-ray pulse, allows us to trace these dynamics directly in the time domain with attosecond resolution. We have measured a lifetime of 7.9
Spin Self-Rephasing and Very Long Coherence Times in a Trapped Atomic Ensemble
DOE Office of Scientific and Technical Information (OSTI.GOV)
Deutsch, C.; Reinhard, F.; Schneider, T.
2010-07-09
We perform Ramsey spectroscopy on the ground state of ultracold {sup 87}Rb atoms magnetically trapped on a chip in the Knudsen regime. Field inhomogeneities over the sample should limit the 1/e contrast decay time to about 3 s, while decay times of 58{+-}12 s are actually observed. We explain this surprising result by a spin self-rephasing mechanism induced by the identical spin rotation effect originating from particle indistinguishability. We propose a theory of this synchronization mechanism and obtain good agreement with the experimental observations. The effect is general and may appear in other physical systems.
Cumulative Reports and Publications through December 31, 1990.
1991-02-01
visiting scientists from universities and industry who have resident appointments for limited periods of time , and by consultants. Members of NASA’s...David M.: The cost of conservative synchronization in parallel discrete event simula- tions. ICASE Report No. 90-20, May 9, 1990, 31 pages. Submitted...Computing Conference, Charleston, South Carolina, Vol. II, pp. 1028-1037, April 1990. Saltz, Joel H., Ravi Mirchandaney and Kay Crowley: Run- time
Tradeoffs Between Synchronization, Communication, and Work in Parallel Linear Algebra Computations
2014-01-25
Demmel Electrical Engineering and Computer Sciences University of California at Berkeley Technical Report No. UCB/EECS-2014- 8 http...www.eecs.berkeley.edu/Pubs/TechRpts/2014/EECS-2014- 8 .html January 25, 2014 Report Documentation Page Form ApprovedOMB No. 0704-0188 Public reporting burden for the...University of California at Berkeley,Electrical Engineering and Computer Sciences,Berkeley,CA,94720 8 . PERFORMING ORGANIZATION REPORT NUMBER 9. SPONSORING
Time Warp Operating System, Version 2.5.1
NASA Technical Reports Server (NTRS)
Bellenot, Steven F.; Gieselman, John S.; Hawley, Lawrence R.; Peterson, Judy; Presley, Matthew T.; Reiher, Peter L.; Springer, Paul L.; Tupman, John R.; Wedel, John J., Jr.; Wieland, Frederick P.;
1993-01-01
Time Warp Operating System, TWOS, is special purpose computer program designed to support parallel simulation of discrete events. Complete implementation of Time Warp software mechanism, which implements distributed protocol for virtual synchronization based on rollback of processes and annihilation of messages. Supports simulations and other computations in which both virtual time and dynamic load balancing used. Program utilizes underlying resources of operating system. Written in C programming language.
Principles for problem aggregation and assignment in medium scale multiprocessors
NASA Technical Reports Server (NTRS)
Nicol, David M.; Saltz, Joel H.
1987-01-01
One of the most important issues in parallel processing is the mapping of workload to processors. This paper considers a large class of problems having a high degree of potential fine grained parallelism, and execution requirements that are either not predictable, or are too costly to predict. The main issues in mapping such a problem onto medium scale multiprocessors are those of aggregation and assignment. We study a method of parameterized aggregation that makes few assumptions about the workload. The mapping of aggregate units of work onto processors is uniform, and exploits locality of workload intensity to balance the unknown workload. In general, a finer aggregate granularity leads to a better balance at the price of increased communication/synchronization costs; the aggregation parameters can be adjusted to find a reasonable granularity. The effectiveness of this scheme is demonstrated on three model problems: an adaptive one-dimensional fluid dynamics problem with message passing, a sparse triangular linear system solver on both a shared memory and a message-passing machine, and a two-dimensional time-driven battlefield simulation employing message passing. Using the model problems, the tradeoffs are studied between balanced workload and the communication/synchronization costs. Finally, an analytical model is used to explain why the method balances workload and minimizes the variance in system behavior.
A broader classification of damage zones
NASA Astrophysics Data System (ADS)
Peacock, D. C. P.; Dimmen, V.; Rotevatn, A.; Sanderson, D. J.
2017-09-01
Damage zones have previously been classified in terms of their positions at fault tips, walls or areas of linkage, with the latter being described in terms of sub-parallel and synchronously active faults. We broaden the idea of linkage to include structures around the intersections of non-parallel and/or non-synchronous faults. These interaction damage zones can be divided into approaching damage zones, where the faults kinematically interact but are not physically connected, and intersection damage zones, where the faults either abut or cross-cut. The damage zone concept is applied to other settings in which strain or displacement variations are taken up by a range of structures, such as at fault bends. It is recommended that a prefix can be added to a wide range of damage zones, to describe the locations in which they formed, e.g., approaching, intersection and fault bend damage zone. Such interpretations are commonly based on limited knowledge of the 3D geometries of the structures, such as from exposure surfaces, and there may be spatial variations. For example, approaching faults and related damage seen in outcrop may be intersecting elsewhere on the fault planes. Dilation in intersection damage zones can represent narrow and localised channels for fluid flow, and such dilation can be influenced by post-faulting stress patterns.
A DFT study of the stability of SIAs and small SIA clusters in the vicinity of solute atoms in Fe
NASA Astrophysics Data System (ADS)
Becquart, C. S.; Ngayam Happy, R.; Olsson, P.; Domain, C.
2018-03-01
The energetics, defect volume and magnetic properties of single SIAs and small SIA clusters up to size 6 have been calculated by DFT for different configurations like the parallel 〈110〉 dumbbell, the non parallel 〈110〉 dumbbell and the C15 structure. The most stable configurations of each type have been further analyzed to determine the influence on their stability of various solute atoms (Ti, V, Cr, Mn, Co, Ni, Cu, Mo, W, Pd, Al, Si, P), relevant for steels used under irradiation. The results show that the presence of solute atoms does not change the relative stability order among SIA clusters. The small SIA clusters investigated can bind to both undersized and oversized solutes. Several descriptors have been considered to derive interesting trends from results. It appears that the local atomic volume available for the solute is the main physical quantity governing the binding energy evolution, whatever the solute type (undersized or oversized) and the cluster configuration (size and type).
Synchronous Spike Patterns in Macaque Motor Cortex during an Instructed-Delay Reach-to-Grasp Task
Torre, Emiliano; Quaglio, Pietro; Denker, Michael; Brochier, Thomas; Riehle, Alexa
2016-01-01
The computational role of spike time synchronization at millisecond precision among neurons in the cerebral cortex is hotly debated. Studies performed on data of limited size provided experimental evidence that low-order correlations occur in relation to behavior. Advances in electrophysiological technology to record from hundreds of neurons simultaneously provide the opportunity to observe coordinated spiking activity of larger populations of cells. We recently published a method that combines data mining and statistical evaluation to search for significant patterns of synchronous spikes in massively parallel spike trains (Torre et al., 2013). The method solves the computational and multiple testing problems raised by the high dimensionality of the data. In the current study, we used our method on simultaneous recordings from two macaque monkeys engaged in an instructed-delay reach-to-grasp task to determine the emergence of spike synchronization in relation to behavior. We found a multitude of synchronous spike patterns aligned in both monkeys along a preferential mediolateral orientation in brain space. The occurrence of the patterns is highly specific to behavior, indicating that different behaviors are associated with the synchronization of different groups of neurons (“cell assemblies”). However, pooled patterns that overlap in neuronal composition exhibit no specificity, suggesting that exclusive cell assemblies become active during different behaviors, but can recruit partly identical neurons. These findings are consistent across multiple recording sessions analyzed across the two monkeys. SIGNIFICANCE STATEMENT Neurons in the brain communicate via electrical impulses called spikes. How spikes are coordinated to process information is still largely unknown. Synchronous spikes are effective in triggering a spike emission in receiving neurons and have been shown to occur in relation to behavior in a number of studies on simultaneous recordings of few neurons. We recently published a method to extend this type of investigation to larger data. Here, we apply it to simultaneous recordings of hundreds of neurons from the motor cortex of macaque monkeys performing a motor task. Our analysis reveals groups of neurons selectively synchronizing their activity in relation to behavior, which sheds new light on the role of synchrony in information processing in the cerebral cortex. PMID:27511007
Synchronous Spike Patterns in Macaque Motor Cortex during an Instructed-Delay Reach-to-Grasp Task.
Torre, Emiliano; Quaglio, Pietro; Denker, Michael; Brochier, Thomas; Riehle, Alexa; Grün, Sonja
2016-08-10
The computational role of spike time synchronization at millisecond precision among neurons in the cerebral cortex is hotly debated. Studies performed on data of limited size provided experimental evidence that low-order correlations occur in relation to behavior. Advances in electrophysiological technology to record from hundreds of neurons simultaneously provide the opportunity to observe coordinated spiking activity of larger populations of cells. We recently published a method that combines data mining and statistical evaluation to search for significant patterns of synchronous spikes in massively parallel spike trains (Torre et al., 2013). The method solves the computational and multiple testing problems raised by the high dimensionality of the data. In the current study, we used our method on simultaneous recordings from two macaque monkeys engaged in an instructed-delay reach-to-grasp task to determine the emergence of spike synchronization in relation to behavior. We found a multitude of synchronous spike patterns aligned in both monkeys along a preferential mediolateral orientation in brain space. The occurrence of the patterns is highly specific to behavior, indicating that different behaviors are associated with the synchronization of different groups of neurons ("cell assemblies"). However, pooled patterns that overlap in neuronal composition exhibit no specificity, suggesting that exclusive cell assemblies become active during different behaviors, but can recruit partly identical neurons. These findings are consistent across multiple recording sessions analyzed across the two monkeys. Neurons in the brain communicate via electrical impulses called spikes. How spikes are coordinated to process information is still largely unknown. Synchronous spikes are effective in triggering a spike emission in receiving neurons and have been shown to occur in relation to behavior in a number of studies on simultaneous recordings of few neurons. We recently published a method to extend this type of investigation to larger data. Here, we apply it to simultaneous recordings of hundreds of neurons from the motor cortex of macaque monkeys performing a motor task. Our analysis reveals groups of neurons selectively synchronizing their activity in relation to behavior, which sheds new light on the role of synchrony in information processing in the cerebral cortex. Copyright © 2016 Torre, et al.
Komarov, Ivan; D'Souza, Roshan M
2012-01-01
The Gillespie Stochastic Simulation Algorithm (GSSA) and its variants are cornerstone techniques to simulate reaction kinetics in situations where the concentration of the reactant is too low to allow deterministic techniques such as differential equations. The inherent limitations of the GSSA include the time required for executing a single run and the need for multiple runs for parameter sweep exercises due to the stochastic nature of the simulation. Even very efficient variants of GSSA are prohibitively expensive to compute and perform parameter sweeps. Here we present a novel variant of the exact GSSA that is amenable to acceleration by using graphics processing units (GPUs). We parallelize the execution of a single realization across threads in a warp (fine-grained parallelism). A warp is a collection of threads that are executed synchronously on a single multi-processor. Warps executing in parallel on different multi-processors (coarse-grained parallelism) simultaneously generate multiple trajectories. Novel data-structures and algorithms reduce memory traffic, which is the bottleneck in computing the GSSA. Our benchmarks show an 8×-120× performance gain over various state-of-the-art serial algorithms when simulating different types of models.
Molecular-dynamics simulations of self-assembled monolayers (SAM) on parallel computers
NASA Astrophysics Data System (ADS)
Vemparala, Satyavani
The purpose of this dissertation is to investigate the properties of self-assembled monolayers, particularly alkanethiols and Poly (ethylene glycol) terminated alkanethiols. These simulations are based on realistic interatomic potentials and require scalable and portable multiresolution algorithms implemented on parallel computers. Large-scale molecular dynamics simulations of self-assembled alkanethiol monolayer systems have been carried out using an all-atom model involving a million atoms to investigate their structural properties as a function of temperature, lattice spacing and molecular chain-length. Results show that the alkanethiol chains tilt from the surface normal by a collective angle of 25° along next-nearest neighbor direction at 300K. At 350K the system transforms to a disordered phase characterized by small tilt angle, flexible tilt direction, and random distribution of backbone planes. With increasing lattice spacing, a, the tilt angle increases rapidly from a nearly zero value at a = 4.7A to as high as 34° at a = 5.3A at 300K. We also studied the effect of end groups on the tilt structure of SAM films. We characterized the system with respect to temperature, the alkane chain length, lattice spacing, and the length of the end group. We found that the gauche defects were predominant only in the tails, and the gauche defects increased with the temperature and number of EG units. Effect of electric field on the structure of poly (ethylene glycol) (PEG) terminated alkanethiol self assembled monolayer (SAM) on gold has been studied using parallel molecular dynamics method. An applied electric field triggers a conformational transition from all-trans to a mostly gauche conformation. The polarity of the electric field has a significant effect on the surface structure of PEG leading to a profound effect on the hydrophilicity of the surface. The electric field applied anti-parallel to the surface normal causes a reversible transition to an ordered state in which the oxygen atoms are exposed. On the other hand, an electric field applied in a direction parallel to the surface normal introduces considerable disorder in the system and the oxygen atoms are buried inside.
Chen, Chenghao; Xu, Min; Anantaprakorn, Yuto; Rosing, Mechthild; Stanewsky, Ralf
2018-05-21
Circadian clocks organize biological processes to occur at optimized times of day and thereby contribute to overall fitness. While the regular daily changes of environmental light and temperature synchronize circadian clocks, extreme external conditions can bypass the temporal constraints dictated by the clock. Despite advanced knowledge about how the daily light-dark changes synchronize the clock, relatively little is known with regard to how the daily temperature changes influence daily timing and how temperature and light signals are integrated. In Drosophila, a network of ∼150 brain clock neurons exhibit 24-hr oscillations of clock gene expression to regulate daily activity and sleep. We show here that a temperature input pathway from peripheral sensory organs, which depends on the gene nocte, targets specific subsets of these clock neurons to synchronize molecular and behavioral rhythms to temperature cycles. Strikingly, while nocte 1 mutant flies synchronize normally to light-dark cycles at constant temperatures, the combined presence of light-dark and temperature cycles inhibits synchronization. nocte 1 flies exhibit altered siesta sleep, suggesting that the sleep-regulating clock neurons are an important target for nocte-dependent temperature input, which dominates a parallel light input into these cells. In conclusion, we reveal a nocte-dependent temperature input pathway to central clock neurons and show that this pathway and its target neurons are important for the integration of sensory light and temperature information in order to temporally regulate activity and sleep during daily light and temperature cycles. Copyright © 2018 Elsevier Ltd. All rights reserved.
Balancing Contention and Synchronization on the Intel Paragon
NASA Technical Reports Server (NTRS)
Bokhari, Shahid H.; Nicol, David M.
1996-01-01
The Intel Paragon is a mesh-connected distributed memory parallel computer. It uses an oblivious and deterministic message routing algorithm: this permits us to develop highly optimized schedules for frequently needed communication patterns. The complete exchange is one such pattern. Several approaches are available for carrying it out on the mesh. We study an algorithm developed by Scott. This algorithm assumes that a communication link can carry one message at a time and that a node can only transmit one message at a time. It requires global synchronization to enforce a schedule of transmissions. Unfortunately global synchronization has substantial overhead on the Paragon. At the same time the powerful interconnection mechanism of this machine permits 2 or 3 messages to share a communication link with minor overhead. It can also overlap multiple message transmission from the same node to some extent. We develop a generalization of Scott's algorithm that executes complete exchange with a prescribed contention. Schedules that incur greater contention require fewer synchronization steps. This permits us to tradeoff contention against synchronization overhead. We describe the performance of this algorithm and compare it with Scott's original algorithm as well as with a naive algorithm that does not take interconnection structure into account. The Bounded contention algorithm is always better than Scott's algorithm and outperforms the naive algorithm for all but the smallest message sizes. The naive algorithm fails to work on meshes larger than 12 x 12. These results show that due consideration of processor interconnect and machine performance parameters is necessary to obtain peak performance from the Paragon and its successor mesh machines.
The Tera Multithreaded Architecture and Unstructured Meshes
NASA Technical Reports Server (NTRS)
Bokhari, Shahid H.; Mavriplis, Dimitri J.
1998-01-01
The Tera Multithreaded Architecture (MTA) is a new parallel supercomputer currently being installed at San Diego Supercomputing Center (SDSC). This machine has an architecture quite different from contemporary parallel machines. The computational processor is a custom design and the machine uses hardware to support very fine grained multithreading. The main memory is shared, hardware randomized and flat. These features make the machine highly suited to the execution of unstructured mesh problems, which are difficult to parallelize on other architectures. We report the results of a study carried out during July-August 1998 to evaluate the execution of EUL3D, a code that solves the Euler equations on an unstructured mesh, on the 2 processor Tera MTA at SDSC. Our investigation shows that parallelization of an unstructured code is extremely easy on the Tera. We were able to get an existing parallel code (designed for a shared memory machine), running on the Tera by changing only the compiler directives. Furthermore, a serial version of this code was compiled to run in parallel on the Tera by judicious use of directives to invoke the "full/empty" tag bits of the machine to obtain synchronization. This version achieves 212 and 406 Mflop/s on one and two processors respectively, and requires no attention to partitioning or placement of data issues that would be of paramount importance in other parallel architectures.
Using Histories to Implement Atomic Objects
NASA Technical Reports Server (NTRS)
Ng, Pui
1987-01-01
In this paper we describe an approach of implementing atomicity. Atomicity requires that computations appear to be all-or-nothing and executed in a serialization order. The approach we describe has three characteristics. First, it utilizes the semantics of an application to improve concurrency. Second, it reduces the complexity of application-dependent synchronization code by analyzing the process of writing it. In fact, the process can be automated with logic programming. Third, our approach hides the protocol used to arrive at a serialization order from the applications. As a result, different protocols can be used without affecting the applications. Our approach uses a history tree abstraction. The history tree captures the ordering relationship among concurrent computations. By determining what types of computations exist in the history tree and their parameters, a computation can determine whether it can proceed.
Japanese project aims at supercomputer that executes 10 gflops
DOE Office of Scientific and Technical Information (OSTI.GOV)
Burskey, D.
1984-05-03
Dubbed supercom by its multicompany design team, the decade-long project's goal is an engineering supercomputer that can execute 10 billion floating-point operations/s-about 20 times faster than today's supercomputers. The project, guided by Japan's Ministry of International Trade and Industry (MITI) and the Agency of Industrial Science and Technology encompasses three parallel research programs, all aimed at some angle of the superconductor. One program should lead to superfast logic and memory circuits, another to a system architecture that will afford the best performance, and the last to the software that will ultimately control the computer. The work on logic and memorymore » chips is based on: GAAS circuit; Josephson junction devices; and high electron mobility transistor structures. The architecture will involve parallel processing.« less
A simple mercury vapor detector for geochemical prospecting
Vaughn, William W.
1967-01-01
The detector utilizes a large-volume atomic-absorption technique for quantitative determinations of mercury vapor thermally released from crushed rock. A quartz-enclosed noble-metal amalgamative stage, which is temperature controlled and is actuated by a radio-frequency induction heater, selectively traps the mercury and eliminates low-level contamination. As little as 1 part per billion of mercury can be detected in a 1-gram sample in a 1-minute analytical period.
Sullivan, Shane Z; DeWalt, Emma L; Schmitt, Paul D; Muir, Ryan M; Simpson, Garth J
2015-03-09
Fast beam-scanning non-linear optical microscopy, coupled with fast (8 MHz) polarization modulation and analytical modeling have enabled simultaneous nonlinear optical Stokes ellipsometry (NOSE) and linear Stokes ellipsometry imaging at video rate (15 Hz). NOSE enables recovery of the complex-valued Jones tensor that describes the polarization-dependent observables, in contrast to polarimetry, in which the polarization stated of the exciting beam is recorded. Each data acquisition consists of 30 images (10 for each detector, with three detectors operating in parallel), each of which corresponds to polarization-dependent results. Processing of this image set by linear fitting contracts down each set of 10 images to a set of 5 parameters for each detector in second harmonic generation (SHG) and three parameters for the transmittance of the fundamental laser beam. Using these parameters, it is possible to recover the Jones tensor elements of the sample at video rate. Video rate imaging is enabled by performing synchronous digitization (SD), in which a PCIe digital oscilloscope card is synchronized to the laser (the laser is the master clock.) Fast polarization modulation was achieved by modulating an electro-optic modulator synchronously with the laser and digitizer, with a simple sine-wave at 1/10th the period of the laser, producing a repeating pattern of 10 polarization states. This approach was validated using Z-cut quartz, and NOSE microscopy was performed for micro-crystals of naproxen.
NASA Astrophysics Data System (ADS)
Sullivan, Shane Z.; DeWalt, Emma L.; Schmitt, Paul D.; Muir, Ryan D.; Simpson, Garth J.
2015-03-01
Fast beam-scanning non-linear optical microscopy, coupled with fast (8 MHz) polarization modulation and analytical modeling have enabled simultaneous nonlinear optical Stokes ellipsometry (NOSE) and linear Stokes ellipsometry imaging at video rate (15 Hz). NOSE enables recovery of the complex-valued Jones tensor that describes the polarization-dependent observables, in contrast to polarimetry, in which the polarization stated of the exciting beam is recorded. Each data acquisition consists of 30 images (10 for each detector, with three detectors operating in parallel), each of which corresponds to polarization-dependent results. Processing of this image set by linear fitting contracts down each set of 10 images to a set of 5 parameters for each detector in second harmonic generation (SHG) and three parameters for the transmittance of the fundamental laser beam. Using these parameters, it is possible to recover the Jones tensor elements of the sample at video rate. Video rate imaging is enabled by performing synchronous digitization (SD), in which a PCIe digital oscilloscope card is synchronized to the laser (the laser is the master clock.) Fast polarization modulation was achieved by modulating an electro-optic modulator synchronously with the laser and digitizer, with a simple sine-wave at 1/10th the period of the laser, producing a repeating pattern of 10 polarization states. This approach was validated using Z-cut quartz, and NOSE microscopy was performed for micro-crystals of naproxen.
Providing a parallel and distributed capability for JMASS using SPEEDES
NASA Astrophysics Data System (ADS)
Valinski, Maria; Driscoll, Jonathan; McGraw, Robert M.; Meyer, Bob
2002-07-01
The Joint Modeling And Simulation System (JMASS) is a Tri-Service simulation environment that supports engineering and engagement-level simulations. As JMASS is expanded to support other Tri-Service domains, the current set of modeling services must be expanded for High Performance Computing (HPC) applications by adding support for advanced time-management algorithms, parallel and distributed topologies, and high speed communications. By providing support for these services, JMASS can better address modeling domains requiring parallel computationally intense calculations such clutter, vulnerability and lethality calculations, and underwater-based scenarios. A risk reduction effort implementing some HPC services for JMASS using the SPEEDES (Synchronous Parallel Environment for Emulation and Discrete Event Simulation) Simulation Framework has recently concluded. As an artifact of the JMASS-SPEEDES integration, not only can HPC functionality be brought to the JMASS program through SPEEDES, but an additional HLA-based capability can be demonstrated that further addresses interoperability issues. The JMASS-SPEEDES integration provided a means of adding HLA capability to preexisting JMASS scenarios through an implementation of the standard JMASS port communication mechanism that allows players to communicate.
On Parallelizing Single Dynamic Simulation Using HPC Techniques and APIs of Commercial Software
DOE Office of Scientific and Technical Information (OSTI.GOV)
Diao, Ruisheng; Jin, Shuangshuang; Howell, Frederic
Time-domain simulations are heavily used in today’s planning and operation practices to assess power system transient stability and post-transient voltage/frequency profiles following severe contingencies to comply with industry standards. Because of the increased modeling complexity, it is several times slower than real time for state-of-the-art commercial packages to complete a dynamic simulation for a large-scale model. With the growing stochastic behavior introduced by emerging technologies, power industry has seen a growing need for performing security assessment in real time. This paper presents a parallel implementation framework to speed up a single dynamic simulation by leveraging the existing stability model librarymore » in commercial tools through their application programming interfaces (APIs). Several high performance computing (HPC) techniques are explored such as parallelizing the calculation of generator current injection, identifying fast linear solvers for network solution, and parallelizing data outputs when interacting with APIs in the commercial package, TSAT. The proposed method has been tested on a WECC planning base case with detailed synchronous generator models and exhibits outstanding scalable performance with sufficient accuracy.« less
Arpaia, P; Cimmino, P; Girone, M; La Commara, G; Maisto, D; Manna, C; Pezzetti, M
2014-09-01
Evolutionary approach to centralized multiple-faults diagnostics is extended to distributed transducer networks monitoring large experimental systems. Given a set of anomalies detected by the transducers, each instance of the multiple-fault problem is formulated as several parallel communicating sub-tasks running on different transducers, and thus solved one-by-one on spatially separated parallel processes. A micro-genetic algorithm merges evaluation time efficiency, arising from a small-size population distributed on parallel-synchronized processors, with the effectiveness of centralized evolutionary techniques due to optimal mix of exploitation and exploration. In this way, holistic view and effectiveness advantages of evolutionary global diagnostics are combined with reliability and efficiency benefits of distributed parallel architectures. The proposed approach was validated both (i) by simulation at CERN, on a case study of a cold box for enhancing the cryogeny diagnostics of the Large Hadron Collider, and (ii) by experiments, under the framework of the industrial research project MONDIEVOB (Building Remote Monitoring and Evolutionary Diagnostics), co-funded by EU and the company Del Bo srl, Napoli, Italy.
Experiences with Cray multi-tasking
NASA Technical Reports Server (NTRS)
Miya, E. N.
1985-01-01
The issues involved in modifying an existing code for multitasking is explored. They include Cray extensions to FORTRAN, an examination of the application code under study, designing workable modifications, specific code modifications to the VAX and Cray versions, performance, and efficiency results. The finished product is a faster, fully synchronous, parallel version of the original program. A production program is partitioned by hand to run on two CPUs. Loop splitting multitasks three key subroutines. Simply dividing subroutine data and control structure down the middle of a subroutine is not safe. Simple division produces results that are inconsistent with uniprocessor runs. The safest way to partition the code is to transfer one block of loops at a time and check the results of each on a test case. Other issues include debugging and performance. Task startup and maintenance (e.g., synchronization) are potentially expensive.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cao, Wei; Warrick, Erika R.; Neumark, Daniel M.
Using attosecond transient absorption, the dipole response of an argon atom in the vacuum ultraviolet (VUV) region is studied when an external electromagnetic field is present. An isolated attosecond VUV pulse populates Rydberg states lying 15 eV above the argon ground state. A synchronized few-cycle near infrared (NIR) pulse modifies the oscillating dipoles of argon impulsively, leading to alterations in the VUV absorption spectra. As the NIR pulse is delayed with respect to the VUV pulse, multiple features in the absorption profile emerge simultaneously including line broadening, sideband structure, sub-cycle fast modulations, and 5-10 fs slow modulations. These features indicatemore » the coexistence of two general processes of the light-matter interaction: the energy shift of individual atomic levels and coherent population transfer between atomic eigenstates, revealing coherent superpositions. Finally, an intuitive formula is derived to treat both effects in a unifying framework, allowing one to identify and quantify the two processes in a single absorption spectrogram.« less
NASA Astrophysics Data System (ADS)
Cao, Wei; Warrick, Erika R.; Neumark, Daniel M.; Leone, Stephen R.
2016-01-01
Using attosecond transient absorption, the dipole response of an argon atom in the vacuum ultraviolet (VUV) region is studied when an external electromagnetic field is present. An isolated attosecond VUV pulse populates Rydberg states lying 15 eV above the argon ground state. A synchronized few-cycle near infrared (NIR) pulse modifies the oscillating dipoles of argon impulsively, leading to alterations in the VUV absorption spectra. As the NIR pulse is delayed with respect to the VUV pulse, multiple features in the absorption profile emerge simultaneously including line broadening, sideband structure, sub-cycle fast modulations, and 5-10 fs slow modulations. These features indicate the coexistence of two general processes of the light-matter interaction: the energy shift of individual atomic levels and coherent population transfer between atomic eigenstates, revealing coherent superpositions. An intuitive formula is derived to treat both effects in a unifying framework, allowing one to identify and quantify the two processes in a single absorption spectrogram.
Cao, Wei; Warrick, Erika R.; Neumark, Daniel M.; ...
2016-01-18
Using attosecond transient absorption, the dipole response of an argon atom in the vacuum ultraviolet (VUV) region is studied when an external electromagnetic field is present. An isolated attosecond VUV pulse populates Rydberg states lying 15 eV above the argon ground state. A synchronized few-cycle near infrared (NIR) pulse modifies the oscillating dipoles of argon impulsively, leading to alterations in the VUV absorption spectra. As the NIR pulse is delayed with respect to the VUV pulse, multiple features in the absorption profile emerge simultaneously including line broadening, sideband structure, sub-cycle fast modulations, and 5-10 fs slow modulations. These features indicatemore » the coexistence of two general processes of the light-matter interaction: the energy shift of individual atomic levels and coherent population transfer between atomic eigenstates, revealing coherent superpositions. Finally, an intuitive formula is derived to treat both effects in a unifying framework, allowing one to identify and quantify the two processes in a single absorption spectrogram.« less
Galloro, Vince; Vesely, Rebecca; Zigmond, Jessica
2010-08-16
With more government involvement comes more government attention. That's the lesson about executive pay healthcare CEOs could learn as the reform law and its ramifications settle into place. "It's important for the people who are scraping together the dollars to pay their health insurance premiums to know what luxurious lives these CEOs are leading. They're living in a parallel universe", says U.S. Rep. Jan Schakowsky, left.
Military Engagement and Forward Presence: Down But Not Out as Tools to Shape and Win
2016-01-01
the $8 billion spent by the United States on military engagement in Colombia , as part of the “Plan Colombia ” initiative, is largely viewed as money...well- 11 spent.22 Colombia is not free of violence, but it is far from the near-failed narco-state that some feared it would become over a decade ago...Philippines, Colombia , Iraq, and Afghanistan a key independent variable is the unmistakable presence of parallel interests and policy preferences on the
Atomic switch networks as complex adaptive systems
NASA Astrophysics Data System (ADS)
Scharnhorst, Kelsey S.; Carbajal, Juan P.; Aguilera, Renato C.; Sandouk, Eric J.; Aono, Masakazu; Stieg, Adam Z.; Gimzewski, James K.
2018-03-01
Complexity is an increasingly crucial aspect of societal, environmental and biological phenomena. Using a dense unorganized network of synthetic synapses it is shown that a complex adaptive system can be physically created on a microchip built especially for complex problems. These neuro-inspired atomic switch networks (ASNs) are a dynamic system with inherent and distributed memory, recurrent pathways, and up to a billion interacting elements. We demonstrate key parameters describing self-organized behavior such as non-linearity, power law dynamics, and multistate switching regimes. Device dynamics are then investigated using a feedback loop which provides control over current and voltage power-law behavior. Wide ranging prospective applications include understanding and eventually predicting future events that display complex emergent behavior in the critical regime.
Atomic force microscopy of torus-bearing pit membranes
Roland R. Dute; Thomas Elder
2011-01-01
Atomic force microscopy was used to compare the structures of dried, torus-bearing pit membranes from four woody species, three angiosperms and one gymnosperm. Tori of Osmanthus armatus are bipartite consisting of a pustular zone overlying parallel sets of microfibrils that form a peripheral corona. Microfibrils of the corona form radial spokes as they traverse the...
Quantum Evaporation from Liquid 4He by Rotons
NASA Astrophysics Data System (ADS)
Hope, F. R.; Baird, M. J.; Wyatt, A. F. G.
1984-04-01
We have shown that rotons as well as phonons can evaporate 4He atoms in a single-quantum process. Measurements of the time of flight and the angular distribution of the evaporated atoms clearly distinguish between evaporation by phonons and rotons. The results indicate that energy and the parallel component of momentum are conserved at the free liquid surface.
Integrated 3-D vision system for autonomous vehicles
NASA Astrophysics Data System (ADS)
Hou, Kun M.; Shawky, Mohamed; Tu, Xiaowei
1992-03-01
Nowadays, autonomous vehicles have become a multidiscipline field. Its evolution is taking advantage of the recent technological progress in computer architectures. As the development tools became more sophisticated, the trend is being more specialized, or even dedicated architectures. In this paper, we will focus our interest on a parallel vision subsystem integrated in the overall system architecture. The system modules work in parallel, communicating through a hierarchical blackboard, an extension of the 'tuple space' from LINDA concepts, where they may exchange data or synchronization messages. The general purpose processing elements are of different skills, built around 40 MHz i860 Intel RISC processors for high level processing and pipelined systolic array processors based on PLAs or FPGAs for low-level processing.
Double-trap measurement of the proton magnetic moment at 0.3 parts per billion precision.
Schneider, Georg; Mooser, Andreas; Bohman, Matthew; Schön, Natalie; Harrington, James; Higuchi, Takashi; Nagahama, Hiroki; Sellner, Stefan; Smorra, Christian; Blaum, Klaus; Matsuda, Yasuyuki; Quint, Wolfgang; Walz, Jochen; Ulmer, Stefan
2017-11-24
Precise knowledge of the fundamental properties of the proton is essential for our understanding of atomic structure as well as for precise tests of fundamental symmetries. We report on a direct high-precision measurement of the magnetic moment μ p of the proton in units of the nuclear magneton μ N The result, μ p = 2.79284734462 (±0.00000000082) μ N , has a fractional precision of 0.3 parts per billion, improves the previous best measurement by a factor of 11, and is consistent with the currently accepted value. This was achieved with the use of an optimized double-Penning trap technique. Provided a similar measurement of the antiproton magnetic moment can be performed, this result will enable a test of the fundamental symmetry between matter and antimatter in the baryonic sector at the 10 -10 level. Copyright © 2017, American Association for the Advancement of Science.
Hubble Spins a Web Into a Giant Red Spider Nebula
2017-12-08
Huge waves are sculpted in this two-lobed nebula called the Red Spider Nebula, located some 3,000 light-years away in the constellation of Sagittarius. This warm planetary nebula harbors one of the hottest stars known and its powerful stellar winds generate waves 100 billion kilometers (62.4 billion miles) high. The waves are caused by supersonic shocks, formed when the local gas is compressed and heated in front of the rapidly expanding lobes. The atoms caught in the shock emit the spectacular radiation seen in this image. Image credit: ESA/Garrelt Mellema (Leiden University, the Netherlands) NASA image use policy. NASA Goddard Space Flight Center enables NASA’s mission through four scientific endeavors: Earth Science, Heliophysics, Solar System Exploration, and Astrophysics. Goddard plays a leading role in NASA’s accomplishments by contributing compelling scientific knowledge to advance the Agency’s mission. Follow us on Twitter Like us on Facebook Find us on Instagram
Vertical nuclear proliferation.
Sidel, Victor W
2007-01-01
All the nuclear-weapon states are working to develop new nuclear-weapon systems and upgrade their existing ones. Although the US Congress has recently blocked further development of small nuclear weapons and earth-penetrating nuclear weapons, the United States is planning a range of new warheads under the Reliable Replacement Warhead programme, and renewing its nuclear weapons infrastructure. The United Kingdom is spending 1 billion pounds sterling on updating the Atomic Weapons Establishment at Aldermaston, and about 20 billion pounds sterling on replacing its Vanguard submarines and maintaining its Trident warhead stockpile. The US has withdrawn from the Anti-Ballistic Missile Treaty and plans to install missile defence systems in Poland and the Czech Republic; Russia threatens to upgrade its nuclear countermeasures. The nuclear-weapon states should comply with their obligations under Article VI of the Non-Proliferation Treaty, as summarised in the 13-point plan agreed at the 2000 NPT Review Conference, and they should negotiate a Nuclear Weapons Convention.
PFLOTRAN: Reactive Flow & Transport Code for Use on Laptops to Leadership-Class Supercomputers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hammond, Glenn E.; Lichtner, Peter C.; Lu, Chuan
PFLOTRAN, a next-generation reactive flow and transport code for modeling subsurface processes, has been designed from the ground up to run efficiently on machines ranging from leadership-class supercomputers to laptops. Based on an object-oriented design, the code is easily extensible to incorporate additional processes. It can interface seamlessly with Fortran 9X, C and C++ codes. Domain decomposition parallelism is employed, with the PETSc parallel framework used to manage parallel solvers, data structures and communication. Features of the code include a modular input file, implementation of high-performance I/O using parallel HDF5, ability to perform multiple realization simulations with multiple processors permore » realization in a seamless manner, and multiple modes for multiphase flow and multicomponent geochemical transport. Chemical reactions currently implemented in the code include homogeneous aqueous complexing reactions and heterogeneous mineral precipitation/dissolution, ion exchange, surface complexation and a multirate kinetic sorption model. PFLOTRAN has demonstrated petascale performance using 2{sup 17} processor cores with over 2 billion degrees of freedom. Accomplishments achieved to date include applications to the Hanford 300 Area and modeling CO{sub 2} sequestration in deep geologic formations.« less
Parallel Low-Loss Measurement of Multiple Atomic Qubits
NASA Astrophysics Data System (ADS)
Kwon, Minho; Ebert, Matthew F.; Walker, Thad G.; Saffman, M.
2017-11-01
We demonstrate low-loss measurement of the hyperfine ground state of rubidium atoms by state dependent fluorescence detection in a dipole trap array of five sites. The presence of atoms and their internal states are minimally altered by utilizing circularly polarized probe light and a strictly controlled quantization axis. We achieve mean state detection fidelity of 97% without correcting for imperfect state preparation or background losses, and 98.7% when corrected. After state detection and correction for background losses, the probability of atom loss due to the state measurement is <2 % and the initial hyperfine state is preserved with >98 % probability.
Reducing Response Time Bounds for DAG-Based Task Systems on Heterogeneous Multicore Platforms
2016-01-01
synchronous parallel tasks on multicore platforms. In 25th ECRTS, 2013. [10] U. Devi. Soft Real - Time Scheduling on Multiprocessors. PhD thesis...report, Washington University in St Louis, 2014. [18] C. Liu and J. Anderson. Supporting soft real - time DAG-based sys- tems on multiprocessors with...analysis for DAG-based real - time task systems im- plemented on heterogeneous multicore platforms. The spe- cific analysis problem that is considered was
XMOS XC-2 Development Board for Mechanical Control and Data Collection
NASA Technical Reports Server (NTRS)
Jarnot, Robert F.; Bowden, William J.
2011-01-01
The scanning microwave limb sounder (SMLS) will use technological improvements in low-noise mixers to provide precise data on the Earth s atmospheric composition with high spatial resolution. This project focuses on the design and implementation of a realtime control system needed for airborne engineering tests of the SMLS. The system must coordinate the actuation of optical components using four motors with encoder readback, while collecting synchronized telemetric data from a GPS receiver and 3-axis gyrometric system. A graphical user interface for testing the control system was also designed using Python. Although the system could have been implemented with an FPGA(fieldprogrammable gate array)-based setup, a processor development kit manufactured by XMOS was chosen. The XMOS architecture allows parallel execution of multiple tasks on separate threads, making it ideal for this application. It is easily programmed using XC (a subset of C). The necessary communication interfaces were implemented in software, including Ethernet, with significant cost and time reduction compared to an FPGA-based approach. A simple approach to control the chopper, calibration mirror, and gimbal for the airborne SMLS was needed. The XMOS board allows for multiple threads and real-time data acquisition. The XC-2 development kit is an attractive choice for synchronized, real-time, event-driven applications. The XMOS is based on the transputer microprocessor architecture developed for parallel computing, which is being revamped in this new platform. The XMOS device has multiple cores capable of running parallel applications on separate threads. The threads communicate with each other via user-defined channels capable of transmitting data within the device. XMOS provides a C-based development environment using XC, which eliminates the need for custom tool kits associated with FPGA programming. The XC-2 has four cores and necessary hardware for Ethernet I/O.
Wilkinson, Karl A; Hine, Nicholas D M; Skylaris, Chris-Kriton
2014-11-11
We present a hybrid MPI-OpenMP implementation of Linear-Scaling Density Functional Theory within the ONETEP code. We illustrate its performance on a range of high performance computing (HPC) platforms comprising shared-memory nodes with fast interconnect. Our work has focused on applying OpenMP parallelism to the routines which dominate the computational load, attempting where possible to parallelize different loops from those already parallelized within MPI. This includes 3D FFT box operations, sparse matrix algebra operations, calculation of integrals, and Ewald summation. While the underlying numerical methods are unchanged, these developments represent significant changes to the algorithms used within ONETEP to distribute the workload across CPU cores. The new hybrid code exhibits much-improved strong scaling relative to the MPI-only code and permits calculations with a much higher ratio of cores to atoms. These developments result in a significantly shorter time to solution than was possible using MPI alone and facilitate the application of the ONETEP code to systems larger than previously feasible. We illustrate this with benchmark calculations from an amyloid fibril trimer containing 41,907 atoms. We use the code to study the mechanism of delamination of cellulose nanofibrils when undergoing sonification, a process which is controlled by a large number of interactions that collectively determine the structural properties of the fibrils. Many energy evaluations were needed for these simulations, and as these systems comprise up to 21,276 atoms this would not have been feasible without the developments described here.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nogami, Keisuke; Sakai, Yasuhiro; Mineta, Shota
2015-11-15
Visible emission spectra were acquired from neutral atoms sputtered by 35–60 keV Kr{sup +} ions from a polycrystalline tungsten surface. Mean velocities of excited tungsten atoms in seven different 6p states were also obtained via the dependence of photon intensities on the distance from the surface. The average velocities parallel to the surface normal varied by factors of 2–4 for atoms in the different 6p energy levels. However, they were almost independent of the incident ion kinetic energy. The 6p-level energy dependence indicated that the velocities of the excited atoms were determined by inelastic processes that involve resonant charge exchange.
The Beginning and End of the Universe
NASA Technical Reports Server (NTRS)
Gardner, Jonathan P.
2007-01-01
Cosmology is the scientific study of how the Universe began more than 13 billion years ago, how its properties have changed, and what its future might be. The balance of forces and energy cause the Universe to expand, first accelerating, then decelerating and then accelerating again. Within this overall structure, the interplay of atoms and light with the mysterious dark matter and dark energy causes stars and galaxies to form and evolve, leading to galaxies like our own home, the Milky Way. Observational cosmology uses telescopes on Earth and in space to reach back in time to find the faint remaining echoes of the Big Bang and to trace the formation and evolution of the galaxies and structures that fill the Universe. In this lecture, Dr. Gardner will give an overview of cosmology, outlining the 13-billion year history of the Universe, and highlighting the very rapid progress this field has made in the last decade. He will discuss the role that NASA space telescopes have played in this progress and will continue to play in the years to come. He will give a time-based history of the Universe, discussing the successive processes that formed matter, particles, atoms, stars and galaxies. In particular, he will focus on cosmological inflation, the rapid accelerated expansion that marks the beginning of the Universe, and dark energy, a tenuous substance that overcomes gravity and whose properties will determine its final fate.
The Beginning and End of the Universe
NASA Technical Reports Server (NTRS)
Gardner, Jonathan
2008-01-01
Cosmology is the scientific study of how the Universe began more than 13 billion years ago, how its properties have changed, and what its future might be. The balance of forces and energy cause the Universe to expand, first accelerating, then decelerating and then accelerating again. Within this overall structure, the interplay of atoms and light with the mysterious dark matter and dark energy causes stars and galaxies to form and evolve, leading to galaxies like our own home, the Milky Way. Observational cosmology uses telescopes on Earth and in space to reach back in time to find the faint remaining echoes of the Big Bang and to trace the formation and evolution of the galaxies and structures that fill the Universe. In this lecture, Dr. Gradner will give an overview of cosmology, outlining the 13-billion year history of the Universe, and highlighting the very rapid progress this field has made i the last decade. He will discuss the role that NASA space telescopes have played in this progress and wil continue to play in the years to come. He will give a time-based history of the Universe, discussing the successive processes that formed matter, particles, atoms, stars and galaxies. In particular, he will focus on cosmological inflation, the rapid accelerated expansion that marks the beginning of the Universe, and dark energy, a tenuous substance that overcomes gravity and whose properties will determine its final fate.
Hemani, H; Warrier, M; Sakthivel, N; Chaturvedi, S
2014-05-01
Molecular dynamics (MD) simulations are used in the study of void nucleation and growth in crystals that are subjected to tensile deformation. These simulations are run for typically several hundred thousand time steps depending on the problem. We output the atom positions at a required frequency for post processing to determine the void nucleation, growth and coalescence due to tensile deformation. The simulation volume is broken up into voxels of size equal to the unit cell size of crystal. In this paper, we present the algorithm to identify the empty unit cells (voids), their connections (void size) and dynamic changes (growth and coalescence of voids) for MD simulations of large atomic systems (multi-million atoms). We discuss the parallel algorithms that were implemented and discuss their relative applicability in terms of their speedup and scalability. We also present the results on scalability of our algorithm when it is incorporated into MD software LAMMPS. Copyright © 2014 Elsevier Inc. All rights reserved.
DOE R&D Accomplishments Database
Chu, S.
1976-10-01
A measurement of the 6{sup 2}P{sub ?} --> 7{sup 2}P{sub ?} forbidden magnetic dipole matrix element in atomic thallium is described. A pulsed, linearly polarized dye laser tuned to the transition frequency is used to excite the thallium vapor from the 6{sup 2}P{sub ?} ground state to the 7{sup 2}P{sub ?} excited state. Interference between the magnetic dipole M1 amplitude and a static electric field induced E1 amplitude results in an atomic polarization of the 7{sup 2}P{sub ?} state, and the subsequent circular polarization of 535 nm fluorescence. The circular polarization is seen to be proportional to / as expected, and measured for several transitions between hyperfine levels of the 6{sup 2}P{sub ?} and 7{sup 2}P{sub ?} states. The result is = -(2.11 +- 0.30) x 10{sup -5} parallel bar e parallel bar dirac constant/2mc, in agreement with theory.
Inversion of the resonance line of Sr/+/ produced by optically pumping Sr atoms
NASA Technical Reports Server (NTRS)
Green, W. R.; Falcone, R. W.
1978-01-01
A description is presented of an experiment which demonstrates the selective production of excited-state ions by an optical absorption from neutrals. An inversion on the resonance line of Sr(+) was produced by laser excitation of a two-electron transition, followed by ionization of one of the excited electrons by the same laser. A pulsed, mode-locked laser operating at 2680 A was used to excite atoms from the Sr ground level. The same laser then ionized the excited atoms. The 2680-A pump beam was generated by frequency doubling the output of a synchronously pumped mode-locked dye laser in a KDP crystal. It is pointed out that the reported results are significant for the construction of vacuum-ultraviolet and X-ray lasers. Many of the proposed methods for making such lasers depend on the selective production of excited-state ions.
A dynamic magneto-optical trap for atom chips
NASA Astrophysics Data System (ADS)
Rushton, Jo; Roy, Ritayan; Bateman, James; Himsworth, Matt
2016-11-01
We describe a dynamic magneto-optical trap (MOT) suitable for the use with vacuum systems in which optical access is limited to a single window. This technique facilitates the long-standing desire of producing integrated atom chips, many of which are likely to have severely restricted optical access compared with conventional vacuum chambers. This ‘switching-MOT’ relies on the synchronized pulsing of optical and magnetic fields at audio frequencies. The trap’s beam geometry is obtained using a planar mirror surface, and does not require a patterned substrate or bulky optics inside the vacuum chamber. Central to the design is a novel magnetic field geometry that requires no external quadrupole or bias coils which leads toward a very compact system. We have implemented the trap for 85Rb and shown that it is capable of capturing 2 million atoms and directly cooling below the Doppler temperature.
Ndiaye, Mamadou; Samb, Abdoulaye; Diop, Libasse; Maris, Thierry
2016-01-01
In the structure of the title salt, {(C5H14N3)[CdCl3]}n, the CdII atom of the complex anion is five-coordinated by one terminal and four bridging Cl atoms. The corresponding coordination polyhedron is a distorted trigonal bipyramid, with Cd—Cl distances in the range 2.4829 (4)–2.6402 (4) Å. The bipyramids are condensed into a polyanionic zigzag chain extending parallel to [101]. The tetramethylguanidinium cations are situated between the polyanionic chains and are linked to them through N—H⋯Cl hydrogen bonds, forming a layered network parallel to (010). PMID:26870572
A proposed experimental search for chameleons using asymmetric parallel plates
DOE Office of Scientific and Technical Information (OSTI.GOV)
Burrage, Clare; Copeland, Edmund J.; Stevenson, James A., E-mail: Clare.Burrage@nottingham.ac.uk, E-mail: ed.copeland@nottingham.ac.uk, E-mail: james.stevenson@nottingham.ac.uk
2016-08-01
Light scalar fields coupled to matter are a common consequence of theories of dark energy and attempts to solve the cosmological constant problem. The chameleon screening mechanism is commonly invoked in order to suppress the fifth forces mediated by these scalars, sufficiently to avoid current experimental constraints, without fine tuning. The force is suppressed dynamically by allowing the mass of the scalar to vary with the local density. Recently it has been shown that near future cold atoms experiments using atom-interferometry have the ability to access a large proportion of the chameleon parameter space. In this work we demonstrate howmore » experiments utilising asymmetric parallel plates can push deeper into the remaining parameter space available to the chameleon.« less
Developing a Complete and Effective ACT-R Architecture
2008-01-01
of computational primitives , as contrasted with the predominant “one-off” and “grab-bag” cognitive models in the field. These architectures have...transport/ semaphore protocols connected via a glue script. Both protocols rely on the fact that file rename and file remove operations are atomic...the Trial Log file until just prior to processing the next input request. Thus, to perform synchronous identifications it is necessary to run an
Battle Staff Operations: Synchronization of Planning at Battalion and Brigade Level
1989-06-02
tactical employment of the tank and mechanized infantry battalion task force. In consonance with Airland Battle doctrine, the document emphasizes ...letter entitled *Emphasis on Rapid Estimates and Decisions on the Atomic Battlefield.’ Emphasizing the increased tempo of the post war mechanized army...If so, a copy of the message is automatically canted to the user(s) identified in the distribution field. Queries are messages retrived records from
Parallel algorithms for the molecular conformation problem
NASA Astrophysics Data System (ADS)
Rajan, Kumar
Given a set of objects, and some of the pairwise distances between them, the problem of identifying the positions of the objects in the Euclidean space is referred to as the molecular conformation problem. This problem is known to be computationally difficult. One of the most important applications of this problem is the determination of the structure of molecules. In the case of molecular structure determination, usually only the lower and upper bounds on some of the interatomic distances are available. The process of obtaining a tighter set of bounds between all pairs of atoms, using the available interatomic distance bounds is referred to as bound-smoothing . One method for bound-smoothing is to use the limits imposed by the triangle inequality. The distance bounds so obtained can often be tightened further by applying the tetrangle inequality---the limits imposed on the six pairwise distances among a set of four atoms (instead of three for the triangle inequalities). The tetrangle inequality is expressed by the Cayley-Menger determinants. The sequential tetrangle-inequality bound-smoothing algorithm considers a quadruple of atoms at a time, and tightens the bounds on each of its six distances. The sequential algorithm is computationally expensive, and its application is limited to molecules with up to a few hundred atoms. Here, we conduct an experimental study of tetrangle-inequality bound-smoothing and reduce the sequential time by identifying the most computationally expensive portions of the process. We also present a simple criterion to determine which of the quadruples of atoms are likely to be tightened the most by tetrangle-inequality bound-smoothing. This test could be used to enhance the applicability of this process to large molecules. We map the problem of parallelizing tetrangle-inequality bound-smoothing to that of generating disjoint packing designs of a certain kind. We map this, in turn, to a regular-graph coloring problem, and present a simple, parallel algorithm for tetrangle-inequality bound-smoothing. We implement the parallel algorithm on the Intel Paragon X/PS, and apply it to real-life molecules. Our results show that with this parallel algorithm, tetrangle inequality can be applied to large molecules in a reasonable amount of time. We extend the regular graph to represent more general packing designs, and present a coloring algorithm for this graph. This can be used to generate constant-weight binary codes in parallel. Once a tighter set of distance bounds is obtained, the molecular conformation problem is usually formulated as a non-linear optimization problem, and a global optimization algorithm is then used to solve the problem. Here we present a parallel, deterministic algorithm for the optimization problem based on Interval Analysis. We implement our algorithm, using dynamic load balancing, on a network of Sun Ultra-Sparc workstations. Our experience with this algorithm shows that its application is limited to small instances of the molecular conformation problem, where the number of measured, pairwise distances is close to the maximum value. However, since the interval method eliminates a substantial portion of the initial search space very quickly, it can be used to prune the search space before any of the more efficient, nondeterministic methods can be applied.
Distinguishing between evidence and its explanations in the steering of atomic clocks
NASA Astrophysics Data System (ADS)
Myers, John M.; Hadi Madjid, F.
2014-11-01
Quantum theory reflects within itself a separation of evidence from explanations. This separation leads to a known proof that: (1) no wave function can be determined uniquely by evidence, and (2) any chosen wave function requires a guess reaching beyond logic to things unforeseeable. Chosen wave functions are encoded into computer-mediated feedback essential to atomic clocks, including clocks that step computers through their phases of computation and clocks in space vehicles that supply evidence of signal propagation explained by hypotheses of spacetimes with metric tensor fields. The propagation of logical symbols from one computer to another requires a shared rhythm-like a bucket brigade. Here we show how hypothesized metric tensors, dependent on guesswork, take part in the logical synchronization by which clocks are steered in rate and position toward aiming points that satisfy phase constraints, thereby linking the physics of signal propagation with the sharing of logical symbols among computers. Recognizing the dependence of the phasing of symbol arrivals on guesses about signal propagation transports logical synchronization from the engineering of digital communications to a discipline essential to physics. Within this discipline we begin to explore questions invisible under any concept of time that fails to acknowledge unforeseeable events. In particular, variation of spacetime curvature is shown to limit the bit rate of logical communication.
Jerath, Ravinder; Cearley, Shannon M; Barnes, Vernon A; Jensen, Mike
2018-01-01
A fundamental function of the visual system is detecting motion, yet visual perception is poorly understood. Current research has determined that the retina and ganglion cells elicit responses for motion detection; however, the underlying mechanism for this is incompletely understood. Previously we proposed that retinogeniculo-cortical oscillations and photoreceptors work in parallel to process vision. Here we propose that motion could also be processed within the retina, and not in the brain as current theory suggests. In this paper, we discuss: 1) internal neural space formation; 2) primary, secondary, and tertiary roles of vision; 3) gamma as the secondary role; and 4) synchronization and coherence. Movement within the external field is instantly detected by primary processing within the space formed by the retina, providing a unified view of the world from an internal point of view. Our new theory begins to answer questions about: 1) perception of space, erect images, and motion, 2) purpose of lateral inhibition, 3) speed of visual perception, and 4) how peripheral color vision occurs without a large population of cones located peripherally in the retina. We explain that strong oscillatory activity influences on brain activity and is necessary for: 1) visual processing, and 2) formation of the internal visuospatial area necessary for visual consciousness, which could allow rods to receive precise visual and visuospatial information, while retinal waves could link the lateral geniculate body with the cortex to form a neural space formed by membrane potential-based oscillations and photoreceptors. We propose that vision is tripartite, with three components that allow a person to make sense of the world, terming them "primary, secondary, and tertiary roles" of vision. Finally, we propose that Gamma waves that are higher in strength and volume allow communication among the retina, thalamus, and various areas of the cortex, and synchronization brings cortical faculties to the retina, while the thalamus is the link that couples the retina to the rest of the brain through activity by gamma oscillations. This novel theory lays groundwork for further research by providing a theoretical understanding that expands upon the functions of the retina, photoreceptors, and retinal plexus to include parallel processing needed to form the internal visual space that we perceive as the external world. Copyright © 2017 Elsevier Ltd. All rights reserved.
The Small Satellites of Pluto as Observed by New Horizons
NASA Technical Reports Server (NTRS)
Weaver, H. A.; Buie, M. W; Buratti, B. J.; Grundy, W. M.; Lauer, T. R.; Olkin, C. B.; Parker, A .H.; Porter, S. B.; Showalter, M. R.; Spencer, J. R.;
2016-01-01
The New Horizons mission has provided resolved measurements of Pluto's moons Styx, Nix, Kerberos, and Hydra. All four are small, with equivalent spherical diameters of approx.40 kilometers for Nix and Hydra and approx. 10 kilometers for Styx and Kerberos. They are also highly elongated, with maximum to minimum axis ratios of approx. 2. All four moons have high albedos (approx.50 to 90%) suggestive of a water-ice surface composition. Crater densities on Nix and Hydra imply surface ages of at least 4 billion years. The small moons rotate much faster than synchronous, with rotational poles clustered nearly orthogonal to the common pole directions of Pluto and Charon. These results reinforce the hypothesis that the small moons formed in the aftermath of a collision that produced the Pluto-Charon binary.
High voltage pulse generator. [Patent application
Fasching, G.E.
1975-06-12
An improved high-voltage pulse generator is described which is especially useful in ultrasonic testing of rock core samples. An N number of capacitors are charged in parallel to V volts and at the proper instance are coupled in series to produce a high-voltage pulse of N times V volts. Rapid switching of the capacitors from the paralleled charging configuration to the series discharging configuration is accomplished by using silicon-controlled rectifiers which are chain self-triggered following the initial triggering of the first rectifier connected between the first and second capacitors. A timing and triggering circuit is provided to properly synchronize triggering pulses to the first SCR at a time when the charging voltage is not being applied to the parallel-connected charging capacitors. The output voltage can be readily increased by adding additional charging networks. The circuit allows the peak level of the output to be easily varied over a wide range by using a variable autotransformer in the charging circuit.
NASA Technical Reports Server (NTRS)
Barnes, George H. (Inventor); Lundstrom, Stephen F. (Inventor); Shafer, Philip E. (Inventor)
1983-01-01
A high speed parallel array data processing architecture fashioned under a computational envelope approach includes a data base memory for secondary storage of programs and data, and a plurality of memory modules interconnected to a plurality of processing modules by a connection network of the Omega gender. Programs and data are fed from the data base memory to the plurality of memory modules and from hence the programs are fed through the connection network to the array of processors (one copy of each program for each processor). Execution of the programs occur with the processors operating normally quite independently of each other in a multiprocessing fashion. For data dependent operations and other suitable operations, all processors are instructed to finish one given task or program branch before all are instructed to proceed in parallel processing fashion on the next instruction. Even when functioning in the parallel processing mode however, the processors are not locked-step but execute their own copy of the program individually unless or until another overall processor array synchronization instruction is issued.
Aono, Masashi; Gunji, Yukio-Pegio
2003-10-01
The emergence derived from errors is the key importance for both novel computing and novel usage of the computer. In this paper, we propose an implementable experimental plan for the biological computing so as to elicit the emergent property of complex systems. An individual plasmodium of the true slime mold Physarum polycephalum acts in the slime mold computer. Modifying the Elementary Cellular Automaton as it entails the global synchronization problem upon the parallel computing provides the NP-complete problem solved by the slime mold computer. The possibility to solve the problem by giving neither all possible results nor explicit prescription of solution-seeking is discussed. In slime mold computing, the distributivity in the local computing logic can change dynamically, and its parallel non-distributed computing cannot be reduced into the spatial addition of multiple serial computings. The computing system based on exhaustive absence of the super-system may produce, something more than filling the vacancy.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lichtner, Peter C.; Hammond, Glenn E.; Lu, Chuan
PFLOTRAN solves a system of generally nonlinear partial differential equations describing multi-phase, multicomponent and multiscale reactive flow and transport in porous materials. The code is designed to run on massively parallel computing architectures as well as workstations and laptops (e.g. Hammond et al., 2011). Parallelization is achieved through domain decomposition using the PETSc (Portable Extensible Toolkit for Scientific Computation) libraries for the parallelization framework (Balay et al., 1997). PFLOTRAN has been developed from the ground up for parallel scalability and has been run on up to 218 processor cores with problem sizes up to 2 billion degrees of freedom. Writtenmore » in object oriented Fortran 90, the code requires the latest compilers compatible with Fortran 2003. At the time of this writing this requires gcc 4.7.x, Intel 12.1.x and PGC compilers. As a requirement of running problems with a large number of degrees of freedom, PFLOTRAN allows reading input data that is too large to fit into memory allotted to a single processor core. The current limitation to the problem size PFLOTRAN can handle is the limitation of the HDF5 file format used for parallel IO to 32 bit integers. Noting that 2 32 = 4; 294; 967; 296, this gives an estimate of the maximum problem size that can be currently run with PFLOTRAN. Hopefully this limitation will be remedied in the near future.« less
Distributed Optimal Power Flow of AC/DC Interconnected Power Grid Using Synchronous ADMM
NASA Astrophysics Data System (ADS)
Liang, Zijun; Lin, Shunjiang; Liu, Mingbo
2017-05-01
Distributed optimal power flow (OPF) is of great importance and challenge to AC/DC interconnected power grid with different dispatching centres, considering the security and privacy of information transmission. In this paper, a fully distributed algorithm for OPF problem of AC/DC interconnected power grid called synchronous ADMM is proposed, and it requires no form of central controller. The algorithm is based on the fundamental alternating direction multiplier method (ADMM), by using the average value of boundary variables of adjacent regions obtained from current iteration as the reference values of both regions for next iteration, which realizes the parallel computation among different regions. The algorithm is tested with the IEEE 11-bus AC/DC interconnected power grid, and by comparing the results with centralized algorithm, we find it nearly no differences, and its correctness and effectiveness can be validated.
A Parameter Communication Optimization Strategy for Distributed Machine Learning in Sensors.
Zhang, Jilin; Tu, Hangdi; Ren, Yongjian; Wan, Jian; Zhou, Li; Li, Mingwei; Wang, Jue; Yu, Lifeng; Zhao, Chang; Zhang, Lei
2017-09-21
In order to utilize the distributed characteristic of sensors, distributed machine learning has become the mainstream approach, but the different computing capability of sensors and network delays greatly influence the accuracy and the convergence rate of the machine learning model. Our paper describes a reasonable parameter communication optimization strategy to balance the training overhead and the communication overhead. We extend the fault tolerance of iterative-convergent machine learning algorithms and propose the Dynamic Finite Fault Tolerance (DFFT). Based on the DFFT, we implement a parameter communication optimization strategy for distributed machine learning, named Dynamic Synchronous Parallel Strategy (DSP), which uses the performance monitoring model to dynamically adjust the parameter synchronization strategy between worker nodes and the Parameter Server (PS). This strategy makes full use of the computing power of each sensor, ensures the accuracy of the machine learning model, and avoids the situation that the model training is disturbed by any tasks unrelated to the sensors.
Grand, Laszlo; Ftomov, Sergiu; Timofeev, Igor
2012-01-01
Parallel electrophysiological recording and behavioral monitoring of freely moving animals is essential for a better understanding of the neural mechanisms underlying behavior. In this paper we describe a novel wireless recording technique, which is capable of synchronously recording in vivo multichannel electrophysiological (LFP, MUA, EOG, EMG) and activity data (accelerometer, video) from freely moving cats. The method is based on the integration of commercially available components into a simple monitoring system and is complete with accelerometers and the needed signal processing tools. LFP activities of freely moving group-housed cats were recorded from multiple intracortical areas and from the hippocampus. EMG, EOG, accelerometer and video were simultaneously acquired with LFP activities 24-h a day for 3 months. These recordings confirm the possibility of using our wireless method for 24-h long-term monitoring of neurophysiological and behavioral data of freely moving experimental animals such as cats, ferrets, rabbits and other large animals. PMID:23099345
Haydon, D. T.; Stenseth, N. C.; Boyce, M. S.; Greenwood, P. E.
2001-01-01
Population ecologists have traditionally focused on the patterns and causes of population variation in the temporal domain for which a substantial body of practical analytic techniques have been developed. More recently, numerous studies have documented how populations may fluctuate synchronously over large spatial areas; analyses of such spatially extended time-series have started to provide additional clues regarding the causes of these population fluctuations and explanations for their synchronous occurrence. Here, we report on the development of a phase-based method for identifying coupling between temporally coincident but spatially distributed cyclic time-series, which we apply to the numbers of muskrat and mink recorded at 81 locations across Canada. The analysis reveals remarkable parallel clines in the strength of coupling between proximate populations of both species—declining from west to east—together with a corresponding increase in observed synchrony between these populations the further east they are located. PMID:11606729
1986-12-31
synthesize synchronization skeletons"Science of Computer Programming 2, 1982, pp. 241-266 [Gel85] Gelernter, David, "Generative communication in...effective computation based on given primitives . An architecture is an abstract object-type, whose instances are computing systems. By a parallel computing...explaining the language primitives on this basis. We explain how such a basis can be "simpler" than a general-purpose manual-programming language such as
Cumulative Reports and Publications through December 31, 1992
1993-04-01
periodls of time , and by consultants. Members of NASA’s research staff also may be residenit at l( ASE for limited pieriodls. The major categories of the...To appear in Theoretical and Computational Fluid Dynamics. Nicol, David M.: Optimistic Barricr Synchronization . I(CASE Report No. 92-34, July 27, 1992...continuous time Markov chains. ICASE Report No. 92-60, November 18, 1992, 23 pages. Submitted to the 7th Annual Workshop on Parallel and Distributed
Human tumors show a high level of genetic heterogeneity, but the processes that influence the timing and route of metastatic dissemination of the subclones are unknown. Here we have used whole-exome sequencing of 103 matched benign, malignant and metastatic skin tumors from genetically heterogeneous mice to demonstrate that most metastases disseminate synchronously from the primary tumor, supporting parallel rather than linear evolution as the predominant model of metastasis.
Tracing the Fuel for Forming Stars
NASA Astrophysics Data System (ADS)
Kohler, Susanna
2017-11-01
Huge reservoirs of cold hydrogen gas the raw fuel for star formation lurk in galaxies throughout the universe. A new study examines whether these reservoirs have always been similar, or whether those in distant galaxies are very different from those in local galaxies today.Left: Optical SLOAN images of the five HIGHz galaxies in this study. Right: ALMA images of the molecular gas in these galaxies. Both images are 30 wide. [Adapted from Cortese et al. 2017]Molecular or Atomic?The formation of stars is a crucial process that determines how galaxies are built and evolve over time. Weve observed that star formation takes place in cold clouds of molecular gas, and that star-formation rates increase in galaxies with a larger surface density of molecular hydrogen so we know that molecular hydrogen feeds the star-forming process.But not all cold gas in the interstellar medium of galaxies exists in molecular form. In the local universe, only around 30% of cold gas is found in molecular form (H2) and able to directly feed star formation; the rest is atomic hydrogen (H I). But is this true of galaxies earlier in the universe as well?Studying Distant GalaxiesCosmological simulations have predicted that earlier in our universes history, the ratio of molecular to atomic hydrogen could be larger i.e., more cold hydrogen may be in a form ready to fuel star formation but this prediction is difficult to test observationally. Currently, radio telescopes are not able to measure the atomic hydrogen in very distant galaxies, such as those at the peak of star formation in the universe, 10 billion years ago.Recently, however, we have measured atomic hydrogen in closer galaxies: those at a redshift of about z 0.20.4, a few billion years ago. One recent study of seven galaxies at this distance, usinga sample from a survey known as COOL BUDHIES, showed that the hydrogen reservoirs of these galaxies are dominated by molecular hydrogen, unlike in the local universe. If this is true of most galaxies at this distance, it would suggest that gas reservoirs have drastically changed in the short time between then and now.But a team of scientists from the International Centre for Radio Astronomy Research in Australia, led by Luca Cortese, has now challenged this conclusion.Top: molecular vs. atomic hydrogen gas in galaxies between z = 0 and z = 1.5. Bottom: the evolution of the molecular-to-atomic mass ratio with redshift. [Adapted from Cortese et al. 2017]Adding to the SampleCortese and collaborators combined observations from the Atacama Large Millimeter/submillimeter Array (ALMA) and Arecibo to estimate the ratio of molecular to atomic hydrogen in five HIGHz-survey massive star-forming galaxies at a redshift of z 0.2. They then combine these results with those of the COOL BUDHIES survey; they argue that, since the two surveys use different selection criteria, the combination of the two samples provides a fairer view of the overall population of star-forming galaxies at z 0.2.Intriguingly, the HIGHz galaxies do not show the molecular-gas dominance that the COOL BUDHIES galaxies do. Cortese and collaborators demonstrate that the addition of the HIGHz galaxies to the sample reveals that the gas reservoirs of star-forming disks 3 billion years ago are, in fact, still the same as what we see today, suggesting that star formation in galaxies at z 0.2 is likely fueled in much the same way as it is today.As telescope capabilities increase, we may be able to explore whether this continues to hold true for more distant galaxies. In the meantime, increasing our sample size within the range that we can observe will help us to further explore how galaxies have formed stars over time.CitationLuca Cortese et al 2017 ApJL 848 L7. doi:10.3847/2041-8213/aa8cc3
Recent progress of the research work on frequency and time at the NIM. [China
NASA Technical Reports Server (NTRS)
Bingying, H.
1979-01-01
Chinese activities reported include (1) research and development on the primary cesium beam standard and the high precision crystal oscillator; (2) keeping the atomic time and calibrating frequency standards; (3) determining methods for transferring the standard frequency at the highest precision. The primary beam installation gives an accuracy of 1.2 x 10 to the minus 12 power (1 sigma). Improvements are being made to attain an uncertainity goal of the order of 10 to the minus 13 power. Two experiments conducted are described. One involved standard frequency transfer via TV color subcarrier; the other involved time synchronization via Symphonie satellite. The best results are the random fluctuation of direct measurement data is 1 sigma sub r (RMS) 10 ns, and the absolute error of clock synchronization is 1 sigma sub A (RMS) 30 ns.
Parallel design patterns for a low-power, software-defined compressed video encoder
NASA Astrophysics Data System (ADS)
Bruns, Michael W.; Hunt, Martin A.; Prasad, Durga; Gunupudi, Nageswara R.; Sonachalam, Sekar
2011-06-01
Video compression algorithms such as H.264 offer much potential for parallel processing that is not always exploited by the technology of a particular implementation. Consumer mobile encoding devices often achieve real-time performance and low power consumption through parallel processing in Application Specific Integrated Circuit (ASIC) technology, but many other applications require a software-defined encoder. High quality compression features needed for some applications such as 10-bit sample depth or 4:2:2 chroma format often go beyond the capability of a typical consumer electronics device. An application may also need to efficiently combine compression with other functions such as noise reduction, image stabilization, real time clocks, GPS data, mission/ESD/user data or software-defined radio in a low power, field upgradable implementation. Low power, software-defined encoders may be implemented using a massively parallel memory-network processor array with 100 or more cores and distributed memory. The large number of processor elements allow the silicon device to operate more efficiently than conventional DSP or CPU technology. A dataflow programming methodology may be used to express all of the encoding processes including motion compensation, transform and quantization, and entropy coding. This is a declarative programming model in which the parallelism of the compression algorithm is expressed as a hierarchical graph of tasks with message communication. Data parallel and task parallel design patterns are supported without the need for explicit global synchronization control. An example is described of an H.264 encoder developed for a commercially available, massively parallel memorynetwork processor device.
Mitchell, Marc; White, Lauren; Oh, Paul; Kwan, Matthew; Gove, Peter; Leahey, Tricia; Faulkner, Guy
2016-12-12
The economic burden of physical inactivity in Canada is estimated at Can $6.8 billion (US $5 billion) per year. Employers bear a substantial proportion of the economic costs, as they pay more for inactive workers in health care and other organizational costs. In response, many Canadian employers offer wellness programs, though these are often underutilized. While financial health incentives have been proposed as one way of increasing participation, their longer term effects (ie postintervention effects) are not clear. The objective of this paper is to outline the methodology for a randomized control trial (RCT) examining the longer term impact of an existing physical activity promotion program that is enhanced by adding guaranteed rewards (Can $1 [US $0.74] per day step goal met) in a lower active hospital employee population (less than 10,000 steps per day). A 12-week, parallel-arm RCT (with a 12-week postintervention follow-up) will be employed. Employees using Change4Life (a fully automated, incentive-based wellness program) and accumulating fewer than 10,000 steps per day at baseline (weeks 1 to 2) will be randomly allocated (1:1) to standard care (wellness program, accelerometer) or an intervention group (standard care plus guaranteed incentives). All study participants will be asked to wear the accelerometer and synchronize it to Change4Life daily, although only intervention group participants will receive guaranteed incentives for reaching tailored daily step count goals (Can $1 [US $0.74] per day; weeks 3 to 12). The primary study outcome will be mean proportion of participant-days step goal reached during the postintervention follow-up period (week 24). Mean proportion of participant-days step goal reached during the intervention period (week 12) will be a secondary outcome. Enrollment for the study will be completed in February 2017. Data analysis will commence in September 2017. Study results are to be published in the winter of 2018. This protocol was designed to examine the impact of guaranteed rewards on physical activity maintenance in lower active hospital employees. ClinicalTrials.gov NCT02638675; https://clinicaltrials.gov/ct2/show/NCT0 2638675 (Archived by WebCite at http://www.webcitation.org/6g4pvZvhW). ©Marc Mitchell, Lauren White, Paul Oh, Matthew Kwan, Peter Gove, Tricia Leahey, Guy Faulkner. Originally published in JMIR Research Protocols (http://www.researchprotocols.org), 12.12.2016.
Petascale turbulence simulation using a highly parallel fast multipole method on GPUs
NASA Astrophysics Data System (ADS)
Yokota, Rio; Barba, L. A.; Narumi, Tetsu; Yasuoka, Kenji
2013-03-01
This paper reports large-scale direct numerical simulations of homogeneous-isotropic fluid turbulence, achieving sustained performance of 1.08 petaflop/s on GPU hardware using single precision. The simulations use a vortex particle method to solve the Navier-Stokes equations, with a highly parallel fast multipole method (FMM) as numerical engine, and match the current record in mesh size for this application, a cube of 40963 computational points solved with a spectral method. The standard numerical approach used in this field is the pseudo-spectral method, relying on the FFT algorithm as the numerical engine. The particle-based simulations presented in this paper quantitatively match the kinetic energy spectrum obtained with a pseudo-spectral method, using a trusted code. In terms of parallel performance, weak scaling results show the FMM-based vortex method achieving 74% parallel efficiency on 4096 processes (one GPU per MPI process, 3 GPUs per node of the TSUBAME-2.0 system). The FFT-based spectral method is able to achieve just 14% parallel efficiency on the same number of MPI processes (using only CPU cores), due to the all-to-all communication pattern of the FFT algorithm. The calculation time for one time step was 108 s for the vortex method and 154 s for the spectral method, under these conditions. Computing with 69 billion particles, this work exceeds by an order of magnitude the largest vortex-method calculations to date.
Phase synchronization motion and neural coding in dynamic transmission of neural information.
Wang, Rubin; Zhang, Zhikang; Qu, Jingyi; Cao, Jianting
2011-07-01
In order to explore the dynamic characteristics of neural coding in the transmission of neural information in the brain, a model of neural network consisting of three neuronal populations is proposed in this paper using the theory of stochastic phase dynamics. Based on the model established, the neural phase synchronization motion and neural coding under spontaneous activity and stimulation are examined, for the case of varying network structure. Our analysis shows that, under the condition of spontaneous activity, the characteristics of phase neural coding are unrelated to the number of neurons participated in neural firing within the neuronal populations. The result of numerical simulation supports the existence of sparse coding within the brain, and verifies the crucial importance of the magnitudes of the coupling coefficients in neural information processing as well as the completely different information processing capability of neural information transmission in both serial and parallel couplings. The result also testifies that under external stimulation, the bigger the number of neurons in a neuronal population, the more the stimulation influences the phase synchronization motion and neural coding evolution in other neuronal populations. We verify numerically the experimental result in neurobiology that the reduction of the coupling coefficient between neuronal populations implies the enhancement of lateral inhibition function in neural networks, with the enhancement equivalent to depressing neuronal excitability threshold. Thus, the neuronal populations tend to have a stronger reaction under the same stimulation, and more neurons get excited, leading to more neurons participating in neural coding and phase synchronization motion.
Precise time and time interval applications to electric power systems
NASA Technical Reports Server (NTRS)
Wilson, Robert E.
1992-01-01
There are many applications of precise time and time interval (frequency) in operating modern electric power systems. Many generators and customer loads are operated in parallel. The reliable transfer of electrical power to the consumer partly depends on measuring power system frequency consistently in many locations. The internal oscillators in the widely dispersed frequency measuring units must be syntonized. Elaborate protection and control systems guard the high voltage equipment from short and open circuits. For the highest reliability of electric service, engineers need to study all control system operations. Precise timekeeping networks aid in the analysis of power system operations by synchronizing the clocks on recording instruments. Utility engineers want to reproduce events that caused loss of service to customers. Precise timekeeping networks can synchronize protective relay test-sets. For dependable electrical service, all generators and large motors must remain close to speed synchronism. The stable response of a power system to perturbations is critical to continuity of electrical service. Research shows that measurement of the power system state vector can aid in the monitoring and control of system stability. If power system operators know that a lightning storm is approaching a critical transmission line or transformer, they can modify operating strategies. Knowledge of the location of a short circuit fault can speed the re-energizing of a transmission line. One fault location technique requires clocks synchronized to one microsecond. Current research seeks to find out if one microsecond timekeeping can aid and improve power system control and operation.
Assaad, Aziz; Pontvianne, Steve; Pons, Marie-Noëlle
2017-05-01
To rapidly monitor the surface water quality in terms of organic pollution of an industrial river undergoing restoration, optical methods (UV-visible spectrometry and fluorescence) were applied in parallel to classical physical-chemical analyses. UV-visible spectra were analyzed using the maximum of the second derivative at 225 nm (related to nitrates), specific absorbance at 254 nm (SUVA 254 ), and the spectral slope between 275 and 295 nm (S 275-295 ) (related to the aromaticity and molecular weight of dissolved organic carbon). The synchronous fluorescence spectra (wavelength difference = 50 nm) exhibited a high variability in the composition of dissolved organic material between the upstream and downstream sections and also versus time. The principal components analysis of the entire set of synchronous fluorescence spectra helped to define three river sections with different pollution characteristics. Spectral decomposition was applied to the two most upstream sections: five fluorophores, classical in rivers impacted by domestic sewage and related to protein-like (λ ex = 280 nm) and humic-like fluorescence (M-type with λ ex ≈ 305-310 nm and C-type with λ ex ≥ 335 nm), were identified. The irregular shape of the synchronous fluorescence spectra in the most downstream section is likely due to organic pollutants of industrial origin; however, their variability and the complexity of the spectra did not allow the further elucidation of their nature.
NASA Technical Reports Server (NTRS)
Wheatley, Thomas E.; Michaloski, John L.; Lumia, Ronald
1989-01-01
Analysis of a robot control system leads to a broad range of processing requirements. One fundamental requirement of a robot control system is the necessity of a microcomputer system in order to provide sufficient processing capability.The use of multiple processors in a parallel architecture is beneficial for a number of reasons, including better cost performance, modular growth, increased reliability through replication, and flexibility for testing alternate control strategies via different partitioning. A survey of the progression from low level control synchronizing primitives to higher level communication tools is presented. The system communication and control mechanisms of existing robot control systems are compared to the hierarchical control model. The impact of this design methodology on the current robot control systems is explored.
Multi-Kilowatt Power Module for High-Power Hall Thrusters
NASA Technical Reports Server (NTRS)
Pinero, Luis R.; Bowers, Glen E.
2005-01-01
Future NASA missions will require high-performance electric propulsion systems. Hall thrusters are being developed at NASA Glenn for high-power, high-specific impulse operation. These thrusters operate at power levels up to 50 kW of power and discharge voltages in excess of 600 V. A parallel effort is being conducted to develop power electronics for these thrusters that push the technology beyond the 5kW state-of-the-art power level. A 10 kW power module was designed to produce an output of 500 V and 20 A from a nominal 100 V input. Resistive load tests revealed efficiencies in excess of 96 percent. Load current share and phase synchronization circuits were designed and tested that will allow connecting multiple modules in parallel to process higher power.
FEM analysis of an single stator dual PM rotors axial synchronous machine
NASA Astrophysics Data System (ADS)
Tutelea, L. N.; Deaconu, S. I.; Popa, G. N.
2017-01-01
The actual e - continuously variable transmission (e-CVT) solution for the parallel Hybrid Electric Vehicle (HEV) requires two electric machines, two inverters, and a planetary gear. A distinct electric generator and a propulsion electric motor, both with full power converters, are typical for a series HEV. In an effort to simplify the planetary-geared e-CVT for the parallel HEV or the series HEV we hereby propose to replace the basically two electric machines and their two power converters by a single, axial-air-gap, electric machine central stator, fed from a single PWM converter with dual frequency voltage output and two independent PM rotors. The proposed topologies, the magneto-motive force analysis and quasi 3D-FEM analysis are the core of the paper.
Parallel algorithm for multiscale atomistic/continuum simulations using LAMMPS
NASA Astrophysics Data System (ADS)
Pavia, F.; Curtin, W. A.
2015-07-01
Deformation and fracture processes in engineering materials often require simultaneous descriptions over a range of length and time scales, with each scale using a different computational technique. Here we present a high-performance parallel 3D computing framework for executing large multiscale studies that couple an atomic domain, modeled using molecular dynamics and a continuum domain, modeled using explicit finite elements. We use the robust Coupled Atomistic/Discrete-Dislocation (CADD) displacement-coupling method, but without the transfer of dislocations between atoms and continuum. The main purpose of the work is to provide a multiscale implementation within an existing large-scale parallel molecular dynamics code (LAMMPS) that enables use of all the tools associated with this popular open-source code, while extending CADD-type coupling to 3D. Validation of the implementation includes the demonstration of (i) stability in finite-temperature dynamics using Langevin dynamics, (ii) elimination of wave reflections due to large dynamic events occurring in the MD region and (iii) the absence of spurious forces acting on dislocations due to the MD/FE coupling, for dislocations further than 10 Å from the coupling boundary. A first non-trivial example application of dislocation glide and bowing around obstacles is shown, for dislocation lengths of ∼50 nm using fewer than 1 000 000 atoms but reproducing results of extremely large atomistic simulations at much lower computational cost.
Potassium in the atmosphere of Mercury
NASA Technical Reports Server (NTRS)
Potter, A. E.; Morgan, T. H.
1986-01-01
Spectral data are reported from a search for potassium in the Mercury atmosphere. The data were collected with instrumentation at Kitt Peak (7699 A) and at McDonald Observatory (7698.98 and 7664.86 A). The equivalent mean widths of the potassium emission lines observed are tabulated, along with the estimated abundances, which are compared with sodium abundances as determined by resonance lines. The average column abundance of potassium is projected to be 1 billion atoms/sq cm, about 1 percent the column abundance of sodium.
Chow, Tze-Show
1988-04-22
A photon calorimeter is provided that comprises a laminar substrate that is uniform in density and homogeneous in atomic composition. A plasma-sprayed coating, that is generally uniform in density and homogeneous in atomic composition within the proximity of planes that are parallel to the surfaces of the substrate, is applied to either one or both sides of the laminar substrate. The plasma-sprayed coatings may be very efficiently spectrally tailored in atomic number. Thermocouple measuring junctions, are positioned within the plasma-sprayed coatings. The calorimeter is rugged, inexpensive, and equilibrates in temperature very rapidly. 4 figs.
GPURFSCREEN: a GPU based virtual screening tool using random forest classifier.
Jayaraj, P B; Ajay, Mathias K; Nufail, M; Gopakumar, G; Jaleel, U C A
2016-01-01
In-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale computations, making it a highly time consuming task. This process can be speeded up by implementing parallelized algorithms on a Graphical Processing Unit (GPU). Random Forest is a robust classification algorithm that can be employed in the virtual screening. A ligand based virtual screening tool (GPURFSCREEN) that uses random forests on GPU systems has been proposed and evaluated in this paper. This tool produces optimized results at a lower execution time for large bioassay data sets. The quality of results produced by our tool on GPU is same as that on a regular serial environment. Considering the magnitude of data to be screened, the parallelized virtual screening has a significantly lower running time at high throughput. The proposed parallel tool outperforms its serial counterpart by successfully screening billions of molecules in training and prediction phases.
Bromidotetra-kis-(2-isopropyl-1H-imidazole-κN)copper(II) bromide.
Godlewska, Sylwia; Socha, Joanna; Baranowska, Katarzyna; Dołęga, Anna
2011-10-01
The Cu(II) atom in the title salt, [CuBr(C(6)H(10)N(2))(4)]Br, is coordinated in a square-pyramidal geometry by four imidazole N atoms and one bromide anion that is located at the apex of the pyramid. The cations and the anions form a two-dimensional network parallel to (001) through N-H⋯Br hydrogen bonds.
Visual Data-Analytics of Large-Scale Parallel Discrete-Event Simulations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ross, Caitlin; Carothers, Christopher D.; Mubarak, Misbah
Parallel discrete-event simulation (PDES) is an important tool in the codesign of extreme-scale systems because PDES provides a cost-effective way to evaluate designs of highperformance computing systems. Optimistic synchronization algorithms for PDES, such as Time Warp, allow events to be processed without global synchronization among the processing elements. A rollback mechanism is provided when events are processed out of timestamp order. Although optimistic synchronization protocols enable the scalability of large-scale PDES, the performance of the simulations must be tuned to reduce the number of rollbacks and provide an improved simulation runtime. To enable efficient large-scale optimistic simulations, one has tomore » gain insight into the factors that affect the rollback behavior and simulation performance. We developed a tool for ROSS model developers that gives them detailed metrics on the performance of their large-scale optimistic simulations at varying levels of simulation granularity. Model developers can use this information for parameter tuning of optimistic simulations in order to achieve better runtime and fewer rollbacks. In this work, we instrument the ROSS optimistic PDES framework to gather detailed statistics about the simulation engine. We have also developed an interactive visualization interface that uses the data collected by the ROSS instrumentation to understand the underlying behavior of the simulation engine. The interface connects real time to virtual time in the simulation and provides the ability to view simulation data at different granularities. We demonstrate the usefulness of our framework by performing a visual analysis of the dragonfly network topology model provided by the CODES simulation framework built on top of ROSS. The instrumentation needs to minimize overhead in order to accurately collect data about the simulation performance. To ensure that the instrumentation does not introduce unnecessary overhead, we perform a scaling study that compares instrumented ROSS simulations with their noninstrumented counterparts in order to determine the amount of perturbation when running at different simulation scales.« less
Supercomputer modeling of flow past hypersonic flight vehicles
NASA Astrophysics Data System (ADS)
Ermakov, M. K.; Kryukov, I. A.
2017-02-01
A software platform for MPI-based parallel solution of the Navier-Stokes (Euler) equations for viscous heat-conductive compressible perfect gas on 3-D unstructured meshes is developed. The discretization and solution of the Navier-Stokes equations are constructed on generalized S.K. Godunov’s method and the second order approximation in space and time. Developed software platform allows to carry out effectively flow past hypersonic flight vehicles simulations for the Mach numbers 6 and higher, and numerical meshes with up to 1 billion numerical cells and with up to 128 processors.
Spellberg, B; Miller, L G; Kuo, M N; Bradley, J; Scheld, W M; Edwards, J E
2007-06-01
Over the last two decades, an alarming rise in infections caused by antibiotic-resistant microbes has been paralleled by an equally alarming decline in the development of new antibiotics to deal with the threat. In response to this brewing "perfect storm" of infectious diseases, the Infectious Diseases Society of America (IDSA) has released a white paper that proposes incentives to stimulate critically needed antibiotic development by pharmaceutical companies. A cornerstone of the recommendations is establishment of a "wild-card patent extension" program. This program would allow a company receiving United States (US) Food and Drug Administration (FDA) approval for a new anti-infective agent targeting a drug-resistant pathogen to extend the patent on a drug within their active portfolio. However, wild-card patent extension legislation is highly controversial due to concerns regarding its societal cost. We performed a systematic literature review to estimate the societal cost of wild-card patent extension compared to the savings resulting from the availability of one new antibiotic to treat multi-drug-resistant Pseudomonas aeruginosa. We conservatively estimate that wild-card patent extension applied to one new antibiotic would cost $7.7 billion over the first 2 years, and $3.9 billion over the next 18 years. Thus, even if the new antibiotic abrogated only 50% of the annual societal cost of multidrug-resistant P. aeruginosa (estimated $2.7 billion), wild-card patent extension would be cost neutral by 10 years after approval of the new antibiotic, and would save society approximately $4.6 billion by 20 years after approval. Wild-card patent extension appears to be a cost-effective strategy to spur anti-infective development. Although our analysis is limited by the precision of published data, our model employed conservative assumptions.
NASA Astrophysics Data System (ADS)
Peters, Bradley J.; Carlson, Richard W.; Day, James M. D.; Horan, Mary F.
2018-03-01
Active volcanic hotspots can tap into domains in Earth’s deep interior that were formed more than two billion years ago. High-precision data on variability in tungsten isotopes have shown that some of these domains resulted from differentiation events that occurred within the first fifty million years of Earth history. However, it has not proved easy to resolve analogous variability in neodymium isotope compositions that would track regions of Earth’s interior whose composition was established by events occurring within roughly the first five hundred million years of Earth history. Here we report 142Nd/144Nd ratios for Réunion Island igneous rocks, some of which are resolvably either higher or lower than the ratios in modern upper-mantle domains. We also find that Réunion 142Nd/144Nd ratios correlate with helium-isotope ratios (3He/4He), suggesting parallel behaviour of these isotopic systems during very early silicate differentiation, perhaps as early as 4.39 billion years ago. The range of 142Nd/144Nd ratios in Réunion basalts is inconsistent with a single-stage differentiation process, and instead requires mixing of a conjugate melt and residue formed in at least one melting event during the Hadean eon, 4.56 billion to 4 billion years ago. Efficient post-Hadean mixing nearly erased the ancient, anomalous 142Nd/144Nd signatures, and produced the relatively homogeneous 143Nd/144Nd composition that is characteristic of Réunion basalts. Our results show that Réunion magmas tap into a particularly ancient, primitive source compared with other volcanic hotspots, offering insight into the formation and preservation of ancient heterogeneities in Earth’s interior.
Friction force microscopy at a regularly stepped Au(665) electrode: Anisotropy effects
NASA Astrophysics Data System (ADS)
Podgaynyy, Nikolay; Iqbal, Shahid; Baltruschat, Helmut
2015-01-01
Using friction force microscopy, friction was determined for the AFM-tip scanning parallel and vertically to the monoatomic steps of Au(665) electrode for different coverages of Cu in sulfuric acid. When the tip was scanning parallel to the steps, the results were similar to those obtained before for a Au(111) surface: a higher coverage of Cu leads to an increased friction. However, differently from Au(111), no transitions in the friction coefficient were observed with increasing load. Atomic stick slip was observed both for the Au surface and the √{ 3} × √{ 3} honeycomb Cu adlayer with a Cu coverage of 2/3. When the tip was scanning perpendicular to the steps, friction did not depend much on coverage; astonishingly, atomic stick slip was also observed.
NASA Technical Reports Server (NTRS)
Silva, P. M.; Silva, I. M.
1974-01-01
Various methods presently used for the dissemination of time at several levels of precision are described along with future projects in the field. Different aspects of time coordination are reviewed and a list of future laboratories participating in a National Time Scale will be presented. A Brazilian Atomic Time Scale will be obtained from as many of these laboratories as possible. The problem of intercomparison between the Brazilian National Time Scale and the International one will be presented and probable solutions will be discussed. Needs related to the TV Line-10 method will be explained and comments will be made on the legal aspects of time dissemination throughout the country.
In-Process Atomic-Force Microscopy (AFM) Based Inspection
Mekid, Samir
2017-01-01
A new in-process atomic-force microscopy (AFM) based inspection is presented for nanolithography to compensate for any deviation such as instantaneous degradation of the lithography probe tip. Traditional method used the AFM probes for lithography work and retract to inspect the obtained feature but this practice degrades the probe tip shape and hence, affects the measurement quality. This paper suggests a second dedicated lithography probe that is positioned back-to-back to the AFM probe under two synchronized controllers to correct any deviation in the process compared to specifications. This method shows that the quality improvement of the nanomachining, in progress probe tip wear, and better understanding of nanomachining. The system is hosted in a recently developed nanomanipulator for educational and research purposes. PMID:28561747
1992-12-01
Dynamics and Free Energy Perturbation Methods." Reviews in Computational Chem- istry edited by Kenny B. Lipkowitz and Donald B. Boyd, chapter 8, 295-320...atomic motions during annealing, allows the search to probabilistically move in a locally non-optimal direction. The probability of doing so is...Network processors communicate via communication links. This type of communication is generally very slow relative to other processor activities
Configuration of twins in glass-embedded silver nanoparticles of various origin
NASA Astrophysics Data System (ADS)
Hofmeister, H.; Dubiel, M.; Tan, G. L.; Schicke, K.-D.
2005-09-01
Structural characterization using high resolution electron microscopy and diffractogram analysis of silver nanoparticles embedded in glass by various routes of fabrication was aimed at revealing the characteristic features of twin faults occuring in such particles. Nearly spherical silver nanoparticles well below 10 nm size embedded in commercial soda-lime silicate float glass have been fabricated either by silver/sodium ion exchange or by Ag+ ion implantation. Twinned nanoparticles, besides single crystalline species, have frequently been observed for both fabrication routes, mainly at sizes above 5 nm, but also at smaller sizes, even around 1 nm. The variety of particle forms comprises single crystalline particles of nearly cuboctahedron shape, particles containing single twin faults, and multiply twinned particles containing parallel twin lamellae, or cyclic twinned segments arranged around axes of fivefold symmetry. Parallel twinning is distinctly favoured by ion implantation whereas cyclic twinning preferably occurs upon ion exchange processing. Regardless of single or repeated twinning, parallel or cyclic twin arrangement, one may classify simple twin faults of regular atomic configuration and compound twin faults whose irregular configuration consists of additional planar defects like associated stacking faults or secondary twin faults. Besides, a particular superstructure composed of parallel twin lamellae of only three atomic layers thickness is observed.
GPU accelerated cell-based adaptive mesh refinement on unstructured quadrilateral grid
NASA Astrophysics Data System (ADS)
Luo, Xisheng; Wang, Luying; Ran, Wei; Qin, Fenghua
2016-10-01
A GPU accelerated inviscid flow solver is developed on an unstructured quadrilateral grid in the present work. For the first time, the cell-based adaptive mesh refinement (AMR) is fully implemented on GPU for the unstructured quadrilateral grid, which greatly reduces the frequency of data exchange between GPU and CPU. Specifically, the AMR is processed with atomic operations to parallelize list operations, and null memory recycling is realized to improve the efficiency of memory utilization. It is found that results obtained by GPUs agree very well with the exact or experimental results in literature. An acceleration ratio of 4 is obtained between the parallel code running on the old GPU GT9800 and the serial code running on E3-1230 V2. With the optimization of configuring a larger L1 cache and adopting Shared Memory based atomic operations on the newer GPU C2050, an acceleration ratio of 20 is achieved. The parallelized cell-based AMR processes have achieved 2x speedup on GT9800 and 18x on Tesla C2050, which demonstrates that parallel running of the cell-based AMR method on GPU is feasible and efficient. Our results also indicate that the new development of GPU architecture benefits the fluid dynamics computing significantly.
Scattered Atomic Oxygen Effects on Spacecraft Materials
NASA Technical Reports Server (NTRS)
Banks, Bruce A.; Miller, Sharon K. R.; deGroh, Kim K.; Demko, Rikako
2003-01-01
Low Earth orbital (LEO) atomic oxygen cannot only erode the external surfaces of polymers on spacecraft, but can cause degradation of surfaces internal to components on the spacecraft where openings to the space environment exist. Although atomic oxygen attack on internal or interior surfaces may not have direct exposure to the LEO atomic oxygen flux scattered impingement can have serious degradation effects where sensitive interior surfaces are present. The effects of atomic oxygen erosion of polymer interior to an aperture on a spacecraft is simulated using Monte Carlo computational techniques. A 2-dimensional model is used to provide quantitative indications of the attenuation of atomic oxygen flux as a function of distance into a parallel walled cavity. The degree of erosion re1ative is compared between the various interior locations and the external surface of a LEO spacecraft.
Atomic Oxygen Effects on Spacecraft Materials
NASA Technical Reports Server (NTRS)
Banks, Bruce A.; Miller, Sharon K. R.; deGroh, Kim K.; Demko, Rikako
2003-01-01
Low Earth orbital (LEO) atomic oxygen cannot only erode the external surfaces of polymers on spacecraft, but can cause degradation of surfaces internal to components on the spacecraft where openings to the space environment exist. Although atomic oxygen attack on internal or interior surfaces may not have direct exposure to the LEO atomic oxygen flux, scattered impingement can have can have serious degradation effects where sensitive interior surfaces are present. The effects of atomic oxygen erosion of polymers interior to an aperture on a spacecraft is simulated using Monte Carlo computational techniques. A 2-dimensional model is used to provide quantitative indications of the attenuation of atomic oxygen flux as a function of distance into a parallel walled cavity. The degree of erosion relative is compared between the various interior locations and the external surface of an LEO spacecraft.
Distinguishing between evidence and its explanations in the steering of atomic clocks
DOE Office of Scientific and Technical Information (OSTI.GOV)
Myers, John M., E-mail: myers@seas.harvard.edu; Hadi Madjid, F., E-mail: gmadjid@aol.com
2014-11-15
Quantum theory reflects within itself a separation of evidence from explanations. This separation leads to a known proof that: (1) no wave function can be determined uniquely by evidence, and (2) any chosen wave function requires a guess reaching beyond logic to things unforeseeable. Chosen wave functions are encoded into computer-mediated feedback essential to atomic clocks, including clocks that step computers through their phases of computation and clocks in space vehicles that supply evidence of signal propagation explained by hypotheses of spacetimes with metric tensor fields. The propagation of logical symbols from one computer to another requires a shared rhythm—likemore » a bucket brigade. Here we show how hypothesized metric tensors, dependent on guesswork, take part in the logical synchronization by which clocks are steered in rate and position toward aiming points that satisfy phase constraints, thereby linking the physics of signal propagation with the sharing of logical symbols among computers. Recognizing the dependence of the phasing of symbol arrivals on guesses about signal propagation transports logical synchronization from the engineering of digital communications to a discipline essential to physics. Within this discipline we begin to explore questions invisible under any concept of time that fails to acknowledge unforeseeable events. In particular, variation of spacetime curvature is shown to limit the bit rate of logical communication. - Highlights: • Atomic clocks are steered in frequency toward an aiming point. • The aiming point depends on a chosen wave function. • No evidence alone can determine the wave function. • The unknowability of the wave function has implications for spacetime curvature. • Variability in spacetime curvature limits the bit rate of communications.« less
2016-05-11
the phases of the system load and ground, so to size the voltage divider appropriately Vsys is set equal to the maximum phase-to-ground voltage. The...civilian and military systems is increasing due to technological improvements in power conversion and changing requirements in system loads. The development...of high-power pulsed loads on naval platforms, such as the Laser Weapon System (LaWS) and the electromagnetic railgun, calls for the ability to
1992-02-01
universities and industry who have resident appointments for limited periods of time , and by consultants. Members of NASA’s research staff also may be...Submitted to Journal of Computational Physics. Banks, H. T., G. Propst, and R. J. Silcox: A comparison of time domain boundary conditions for acoustic...2, pp. 117-145, i991. Nicol, David M.: T/ cost of conservative synchronization in parallel discrete event sim- ulations. ICASE Report No. 90-20, May
High-performance parallel interface to synchronous optical network gateway
St. John, Wallace B.; DuBois, David H.
1998-08-11
A digital system provides sending and receiving gateways for HIPPI interfaces. Electronic logic circuitry formats data signals and overhead signals in a data frame that is suitable for transmission over a connecting fiber optic link. Multiplexers route the data and overhead signals to a framer module. The framer module allocates the data and overhead signals to a plurality of 9-byte words that are arranged in a selected protocol. The formatted words are stored in a storage register for output through the gateway.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yoshii, Kazutomo; Llopis, Pablo; Zhang, Kaicheng
As CMOS scaling nears its end, parameter variations (process, temperature and voltage) are becoming a major concern. To overcome parameter variations and provide stability, modern processors are becoming dynamic, opportunistically adjusting voltage and frequency based on thermal and energy constraints, which negatively impacts traditional bulk-synchronous parallelism-minded hardware and software designs. As node-level architecture is growing in complexity, implementing variation control mechanisms only with hardware can be a challenging task. In this paper we investigate a software strategy to manage hardwareinduced variations, leveraging low-level monitoring/controlling mechanisms.
Comprehensive Synchronization Elimination for Java (PREPRINT)
2003-01-01
e : % thread-local % reentrant % enclosed Figure...0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 ca ss ow ar y ja va c ja va cu p ja va do c jg l jle x pi zz a ar ra y in st an td b jlo go pl as m a sl ic e Figure 6...1998. [DR98] P. Diniz and M. Rinard. Lock Coarsening: Eliminating Lock Overhead in Automatically Parallelized Object-based Programs. In Journal
Qu, Zhechao; Steinvall, Erik; Ghorbani, Ramin; Schmidt, Florian M
2016-04-05
Potassium (K) is an important element related to ash and fine-particle formation in biomass combustion processes. In situ measurements of gaseous atomic potassium, K(g), using robust optical absorption techniques can provide valuable insight into the K chemistry. However, for typical parts per billion K(g) concentrations in biomass flames and reactor gases, the product of atomic line strength and absorption path length can give rise to such high absorbance that the sample becomes opaque around the transition line center. We present a tunable diode laser atomic absorption spectroscopy (TDLAAS) methodology that enables accurate, calibration-free species quantification even under optically thick conditions, given that Beer-Lambert's law is valid. Analyte concentration and collisional line shape broadening are simultaneously determined by a least-squares fit of simulated to measured absorption profiles. Method validation measurements of K(g) concentrations in saturated potassium hydroxide vapor in the temperature range 950-1200 K showed excellent agreement with equilibrium calculations, and a dynamic range from 40 pptv cm to 40 ppmv cm. The applicability of the compact TDLAAS sensor is demonstrated by real-time detection of K(g) concentrations close to biomass pellets during atmospheric combustion in a laboratory reactor.
Interconnect-free parallel logic circuits in a single mechanical resonator
Mahboob, I.; Flurin, E.; Nishiguchi, K.; Fujiwara, A.; Yamaguchi, H.
2011-01-01
In conventional computers, wiring between transistors is required to enable the execution of Boolean logic functions. This has resulted in processors in which billions of transistors are physically interconnected, which limits integration densities, gives rise to huge power consumption and restricts processing speeds. A method to eliminate wiring amongst transistors by condensing Boolean logic into a single active element is thus highly desirable. Here, we demonstrate a novel logic architecture using only a single electromechanical parametric resonator into which multiple channels of binary information are encoded as mechanical oscillations at different frequencies. The parametric resonator can mix these channels, resulting in new mechanical oscillation states that enable the construction of AND, OR and XOR logic gates as well as multibit logic circuits. Moreover, the mechanical logic gates and circuits can be executed simultaneously, giving rise to the prospect of a parallel logic processor in just a single mechanical resonator. PMID:21326230
Interconnect-free parallel logic circuits in a single mechanical resonator.
Mahboob, I; Flurin, E; Nishiguchi, K; Fujiwara, A; Yamaguchi, H
2011-02-15
In conventional computers, wiring between transistors is required to enable the execution of Boolean logic functions. This has resulted in processors in which billions of transistors are physically interconnected, which limits integration densities, gives rise to huge power consumption and restricts processing speeds. A method to eliminate wiring amongst transistors by condensing Boolean logic into a single active element is thus highly desirable. Here, we demonstrate a novel logic architecture using only a single electromechanical parametric resonator into which multiple channels of binary information are encoded as mechanical oscillations at different frequencies. The parametric resonator can mix these channels, resulting in new mechanical oscillation states that enable the construction of AND, OR and XOR logic gates as well as multibit logic circuits. Moreover, the mechanical logic gates and circuits can be executed simultaneously, giving rise to the prospect of a parallel logic processor in just a single mechanical resonator.
GPU-accelerated Tersoff potentials for massively parallel Molecular Dynamics simulations
NASA Astrophysics Data System (ADS)
Nguyen, Trung Dac
2017-03-01
The Tersoff potential is one of the empirical many-body potentials that has been widely used in simulation studies at atomic scales. Unlike pair-wise potentials, the Tersoff potential involves three-body terms, which require much more arithmetic operations and data dependency. In this contribution, we have implemented the GPU-accelerated version of several variants of the Tersoff potential for LAMMPS, an open-source massively parallel Molecular Dynamics code. Compared to the existing MPI implementation in LAMMPS, the GPU implementation exhibits a better scalability and offers a speedup of 2.2X when run on 1000 compute nodes on the Titan supercomputer. On a single node, the speedup ranges from 2.0 to 8.0 times, depending on the number of atoms per GPU and hardware configurations. The most notable features of our GPU-accelerated version include its design for MPI/accelerator heterogeneous parallelism, its compatibility with other functionalities in LAMMPS, its ability to give deterministic results and to support both NVIDIA CUDA- and OpenCL-enabled accelerators. Our implementation is now part of the GPU package in LAMMPS and accessible for public use.
Trapped Atoms in One-Dimensional Photonic Crystals
2013-08-09
a single silicon -nitride nanobeam (refractive index n = 2) with a 1D array of filleted rectangular holes along the propagation direction; atoms are...trapped in the centers of the holes (figure 1( a )). The second waveguide consists of two parallel silicon nitride nanobeams, each with a periodic array...the refractive index of silicon nitride is approximately constant across the optical domain, we adopt the approximation based on a frequency
Bromidotetrakis(2-isopropyl-1H-imidazole-κN 3)copper(II) bromide
Godlewska, Sylwia; Socha, Joanna; Baranowska, Katarzyna; Dołęga, Anna
2011-01-01
The CuII atom in the title salt, [CuBr(C6H10N2)4]Br, is coordinated in a square-pyramidal geometry by four imidazole N atoms and one bromide anion that is located at the apex of the pyramid. The cations and the anions form a two-dimensional network parallel to (001) through N—H⋯Br hydrogen bonds. PMID:22064905
Crystal structure of poly[{μ-N,N′-bis[(pyridin-4-yl)methyl]oxalamide}-μ-oxalato-cobalt(II)
Zou, Hengye; Qi, Yanjuan
2014-01-01
In the polymeric title compound, [Co(C2O4)(C14H14N4O2)]n, the CoII atom is six-coordinated by two N atoms from symmetry-related bis[(pyridin-4-yl)methyl]oxalamide (BPMO) ligands and four O atoms from two centrosymmetric oxalate anions in a distorted octahedral coordination geometry. The CoII atoms are linked by the oxalate anions into a chain running parallel to [100]. The chains are linked by the BPMO ligands into a three-dimensional architecture. In addition, N—H⋯O hydrogen bonds stabilize the crystal packing. PMID:25309173
Ballestero-Martínez, Ernesto; Campos-Fernández, Cristian Saul; Soto-Tellini, Victor Hugo; Gonzalez-Montiel, Simplicio; Martínez-Otero, Diego
2013-06-01
In the title compound, {[Cu(C10H8N4)3(H2O)2](ClO4)2} n , the coordination environment of the cationic Cu(II) atom is distorted octa-hedral, formed by pairs of symmetry-equivalent 1,2-bis-(pyridin-4-yl)diazene ligands, bridging 1,2-bis-(pyridin-4-yl)diazene ligands and two non-equivalent water mol-ecules. The 1,2-bis-(pyridin-4-yl)diazene mol-ecules form polymeric chains parallel to [-101] via azo bonds which are situated about inversion centres. Since the Cu(II) atom is situated on a twofold rotation axis, the monomeric unit has point symmetry 2. The perchlorate anions are disordered in a 0.536 (9):0.464 (9) ratio and are acceptors of water H atoms in medium-strong O-H⋯O hydrogen bonds with graph set R 4 (4)(12). The water mol-ecules, which are coordinated to the Cu(II) atom and are hydrogen-bonded to the perchlorate anions, form columns parallel to [010]. A π-π inter-action [centroid-centroid distance = 3.913 (2) Å] occurs between pyridine rings, and weak C-H⋯O inter-actions also occur.
Zhang, Hong; Zapol, Peter; Dixon, David A.; ...
2015-11-17
The Shift-and-invert parallel spectral transformations (SIPs), a computational approach to solve sparse eigenvalue problems, is developed for massively parallel architectures with exceptional parallel scalability and robustness. The capabilities of SIPs are demonstrated by diagonalization of density-functional based tight-binding (DFTB) Hamiltonian and overlap matrices for single-wall metallic carbon nanotubes, diamond nanowires, and bulk diamond crystals. The largest (smallest) example studied is a 128,000 (2000) atom nanotube for which ~330,000 (~5600) eigenvalues and eigenfunctions are obtained in ~190 (~5) seconds when parallelized over 266,144 (16,384) Blue Gene/Q cores. Weak scaling and strong scaling of SIPs are analyzed and the performance of SIPsmore » is compared with other novel methods. Different matrix ordering methods are investigated to reduce the cost of the factorization step, which dominates the time-to-solution at the strong scaling limit. As a result, a parallel implementation of assembling the density matrix from the distributed eigenvectors is demonstrated.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Hong; Zapol, Peter; Dixon, David A.
The Shift-and-invert parallel spectral transformations (SIPs), a computational approach to solve sparse eigenvalue problems, is developed for massively parallel architectures with exceptional parallel scalability and robustness. The capabilities of SIPs are demonstrated by diagonalization of density-functional based tight-binding (DFTB) Hamiltonian and overlap matrices for single-wall metallic carbon nanotubes, diamond nanowires, and bulk diamond crystals. The largest (smallest) example studied is a 128,000 (2000) atom nanotube for which ~330,000 (~5600) eigenvalues and eigenfunctions are obtained in ~190 (~5) seconds when parallelized over 266,144 (16,384) Blue Gene/Q cores. Weak scaling and strong scaling of SIPs are analyzed and the performance of SIPsmore » is compared with other novel methods. Different matrix ordering methods are investigated to reduce the cost of the factorization step, which dominates the time-to-solution at the strong scaling limit. As a result, a parallel implementation of assembling the density matrix from the distributed eigenvectors is demonstrated.« less
Gao, Qi; Zhou, Min; Han, Chengyin; Li, Shangyan; Zhang, Shuang; Yao, Yuan; Li, Bo; Qiao, Hao; Ai, Di; Lou, Ge; Zhang, Mengya; Jiang, Yanyi; Bi, Zhiyi; Ma, Longsheng; Xu, Xinye
2018-05-22
Optical clocks are the most precise measurement devices. Here we experimentally characterize one such clock based on the 1 S 0 - 3 P 0 transition of neutral 171 Yb atoms confined in an optical lattice. Given that the systematic evaluation using an interleaved stabilization scheme is unable to avoid noise from the clock laser, synchronous comparisons against a second 171 Yb lattice system were implemented to accelerate the evaluation. The fractional instability of one clock falls below 4 × 10 -17 after an averaging over a time of 5,000 seconds. The systematic frequency shifts were corrected with a total uncertainty of 1.7 × 10 -16 . The lattice polarizability shift currently contributes the largest source. This work paves the way to measuring the absolute clock transition frequency relative to the primary Cs standard or against the International System of Units (SI) second.
NASA Astrophysics Data System (ADS)
Girish, B. S.; Pandey, Deepak; Ramachandran, Hema
2017-08-01
We present a compact, inexpensive multichannel module, APODAS (Avalanche Photodiode Output Data Acquisition System), capable of detecting 0.8 billion photons per second and providing real-time recording on a computer hard-disk, of channel- and time-tagged information of the arrival of upto 0.4 billion photons per second. Built around a Virtex-5 Field Programmable Gate Array (FPGA) unit, APODAS offers a temporal resolution of 5 nanoseconds with zero deadtime in data acquisition, utilising an efficient scheme for time and channel tagging and employing Gigabit ethernet for the transfer of data. Analysis tools have been developed on a Linux platform for multi-fold coincidence studies and time-delayed intensity interferometry. As illustrative examples, the second-order intensity correlation function ( g 2) of light from two commonly used sources in quantum optics —a coherent laser source and a dilute atomic vapour emitting spontaneously, constituting a thermal source— are presented. With easy reconfigurability and with no restriction on the total record length, APODAS can be readily used for studies over various time scales. This is demonstrated by using APODAS to reveal Rabi oscillations on nanosecond time scales in the emission of ultracold atoms, on the one hand, and, on the other hand, to measure the second-order correlation function on the millisecond time scales from tailored light sources. The efficient and versatile performance of APODAS promises its utility in diverse fields, like quantum optics, quantum communication, nuclear physics, astrophysics and biology.
Diffusion phenomenon at the interface of Cu-brass under a strong gravitational field
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ogata, Yudai; Tokuda, Makoto; Januszko, Kamila
2015-03-28
To investigate diffusion phenomenon at the interface between Cu and brass under a strong gravitational field generated by ultracentrifuge apparatus, we performed gravity experiments on samples prepared by electroplating with interfaces normal and parallel to the direction of gravity. For the parallel-mode sample, for which sedimentation cannot occur thorough the interface, the concentration change was significant within the lower gravity region; many pores were observed in this region. Many vacancies arising from crystal strain due to the strong gravitational field moved into the lower gravity region, and enhanced the atoms mobilities. For the two normal-mode samples, which have interface normalmore » to the direction of gravity, the composition gradient of the brass-on-Cu sample was steeper than that for Cu-on-brass. This showed that the atoms of denser Cu diffuse in the direction of gravity, whereas Zn atoms diffuse in the opposite direction by sedimentation. The interdiffusion coefficients became higher in the Cu-on-brass sample, and became lower in the brass-on-Cu sample. This rise may be related to the behavior of the vacancies.« less
Human Embryonic Stem Cell-Derived Cardiomyocytes Regenerate Non-Human Primate Hearts
Chong, James J.H.; Yang, Xiulan; Don, Creighton W.; Minami, Elina; Liu, Yen-Wen; Weyers, Jill J; Mahoney, William M.; Van Biber, Benjamin; Cook, Savannah M.; Palpant, Nathan J; Gantz, Jay; Fugate, James A.; Muskheli, Veronica; Gough, G. Michael; Vogel, Keith W.; Astley, Cliff A.; Hotchkiss, Charlotte E.; Baldessari, Audrey; Pabon, Lil; Reinecke, Hans; Gill, Edward A.; Nelson, Veronica; Kiem, Hans-Peter; Laflamme, Michael A.; Murry, Charles E.
2014-01-01
Pluripotent stem cells provide a potential solution to current epidemic rates of heart failure 1 by providing human cardiomyocytes to support heart regeneration 2. Studies of human embryonic stem cell-derived cardiomyocytes (hESC-CMs) in small animal models have shown favorable effects of this treatment 3–7. It remains unknown, however, whether clinical scale hESC-CMs transplantation is feasible, safe or can provide large-scale myocardial regeneration. Here we show that hESC-CMs can be produced at a clinical scale (>1 billion cells/batch) and cryopreserved with good viability. Using a non-human primate (NHP) model of myocardial ischemia-reperfusion, we show that that cryopreservation and intra-myocardial delivery of 1 billion hESC-CMs generates significant remuscularization of the infarcted heart. The hESC-CMs showed progressive but incomplete maturation over a three-month period. Grafts were perfused by host vasculature, and electromechanical junctions between graft and host myocytes were present within 2 weeks of engraftment. Importantly, grafts showed regular calcium transients that were synchronized to the host electrocardiogram, indicating electromechanical coupling. In contrast to small animal models 7, non-fatal ventricular arrhythmias were observed in hESC-CM engrafted primates. Thus, hESC-CMs can remuscularize substantial amounts of the infarcted monkey heart. Comparable remuscularization of a human heart should be possible, but potential arrhythmic complications need to be overcome. PMID:24776797
Recent progress on the National Ignition Facility advanced radiographic capability
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wegner, P.; Bowers, M.; Chen, H.
2016-01-08
The National Ignition Facility (NIF) is a megajoule (million-joule)-class laser and experimental facility built for Stockpile Stewardship and High Energy Density (HED) science research [1]. Up to several times a day, 192 laser pulses from NIF's 192 laser beamlines converge on a millimeter-scale target located at the center of the facility's 10-meter diameter target chamber. The carefully synchronized pulses, typically a few nanoseconds (billionths of a second) in duration and co-times to better than 20 picoseconds (trillionths of a second), a deliver a combined energy of up to 1.8 megajoules and a peak power of 500 terawatts (trillion watts). Furthermore,more » this drives temperatures inside the target to tens of millions of degrees and pressures to many billion times greater than Earth's atmosphere.« less
The small satellites of Pluto as observed by New Horizons.
Weaver, H A; Buie, M W; Buratti, B J; Grundy, W M; Lauer, T R; Olkin, C B; Parker, A H; Porter, S B; Showalter, M R; Spencer, J R; Stern, S A; Verbiscer, A J; McKinnon, W B; Moore, J M; Robbins, S J; Schenk, P; Singer, K N; Barnouin, O S; Cheng, A F; Ernst, C M; Lisse, C M; Jennings, D E; Lunsford, A W; Reuter, D C; Hamilton, D P; Kaufmann, D E; Ennico, K; Young, L A; Beyer, R A; Binzel, R P; Bray, V J; Chaikin, A L; Cook, J C; Cruikshank, D P; Dalle Ore, C M; Earle, A M; Gladstone, G R; Howett, C J A; Linscott, I R; Nimmo, F; Parker, J Wm; Philippe, S; Protopapa, S; Reitsema, H J; Schmitt, B; Stryk, T; Summers, M E; Tsang, C C C; Throop, H H B; White, O L; Zangari, A M
2016-03-18
The New Horizons mission has provided resolved measurements of Pluto's moons Styx, Nix, Kerberos, and Hydra. All four are small, with equivalent spherical diameters of ~40 kilometers for Nix and Hydra and ~10 kilometers for Styx and Kerberos. They are also highly elongated, with maximum to minimum axis ratios of ~2. All four moons have high albedos (~50 to 90%) suggestive of a water-ice surface composition. Crater densities on Nix and Hydra imply surface ages of at least 4 billion years. The small moons rotate much faster than synchronous, with rotational poles clustered nearly orthogonal to the common pole directions of Pluto and Charon. These results reinforce the hypothesis that the small moons formed in the aftermath of a collision that produced the Pluto-Charon binary. Copyright © 2016, American Association for the Advancement of Science.
Gilmore, John H.; Shen, Dinggang; Smith, Jeffery Keith; Zhu, Hongtu
2013-01-01
An anticorrelated interaction between the dorsal attention and the default-mode networks has been observed, although how these 2 networks establish such relationship remains elusive. Behavioral studies have reported the emergence of attention and default network–related functions and a preliminary competing relationship between them at early infancy. This study attempted to test the hypothesis—resting-state functional magnetic resonance imaging will demonstrate not only improved network synchronization of the dorsal attention and the default networks, respectively, during the first 2 years of life but also an anticorrelated network interaction pattern between the 2 networks at 1 year which will be further enhanced at 2 years old. Our results demonstrate that both networks start from an isolated region in neonates but evolve to highly synchronized networks at 1 year old. Paralleling the individual network maturation process, the anticorrelated behaviors are absent at birth but become apparent at 1 year and are further enhanced during the second year of life. Our studies elucidate not only the individual maturation process of the dorsal attention and default networks but also offer evidence that the maturation of the individual networks may be needed prior exhibiting the adult-like interaction patterns between the 2 networks. PMID:22368080
Bingham, Adrian; Arjunan, Sridhar P; Kumar, Dinesh K
2016-08-01
In this study we have tested the hypothesis regarding the increase in synchronization with the onset of muscle fatigue. For this aim, we have investigated the difference in the synchronicity between high density surface electromyogram (sEMG) channels of the rested muscles and when at the limit of endurance. Synchronization was measured by computing and normalizing the mutual information between the sEMG signals recorded from the high-density array electrode locations. Ten volunteers (Age range: 21 and 35 years; Mean age = 26 years; Male = 6, Female = 4) participated in our experiment. The participants performed isometric dorsiflexion of their dominate foot at two levels of contraction; 40% and 80% of their maximum voluntary contraction (MVC) until task failure. During the experiment an array of 64 electrodes (16 by 4) placed over the TA parallel to the muscle fiber was used to record the HD-sEMG. Normalized Mutual Information (NMI) between electrodes was calculated using the HD-sEMG data and then analyzed. The results show that that the average NMI of the TA significantly increased during fatigue at both levels of contraction. There was a statistically significant difference between NMI of the rested muscle compared with it being at the point of task failure.
NASA Astrophysics Data System (ADS)
Karlstrom, K. E.; Williams, M. L.
1995-01-01
The syntectonic 1.70 Ga Crazy Basin Monzogranite provides an example of the complex spatial and temporal interactions between metamorphism, deformation, and plutonism. Synchronous plutonism and deformation is indicated by syn-shortening dikes, sills, and veins; parallel magmatic and solid state fabrics; fabrics in xenoliths; and a foliation triple point. Synchronous plutonism and metamorphism is indicated by a systematic increase from 400 °C to 630 °C towards the pluton at a constant pressure of 300 MPa (3 kb). Temperatures are consistent with a conductive cooling model in which a 700 °C pluton was emplaced into country rocks undergoing greenschist facies regional metamorphism. Synchronous deformation and metamorphism is indicated by porphyroblast inclusion geometries that document the synmetamorphic development of the S2 cleavage. The pluton was emplaced adjacent to the Shylock shear zone during progressive shortening. Emplacement of granite as NE-trending sheets was facilitated by temporal partitioning of transpressional convergence into strike-slip and dip-slip components. At the scale of the pluton's aureole and on the relatively rapid time scale of 10 3-10 6 y, regional deformation and metamorphism were punctuated by thermal softening and increased diffusion rates. Data suggests that accretion of Proterozoic arcs in Arizona involved diachronous pluton-enhanced deformation and associated high temperature-low pressure regional metamorphism.
Prenucleation Induced by Crystalline Substrates
NASA Astrophysics Data System (ADS)
Men, H.; Fan, Z.
2018-04-01
Prenucleation refers to the phenomenon of atomic ordering in the liquid adjacent to the substrate/liquid interface at temperatures above the liquidus. In this paper, we have systematically investigated and holistically quantified the prenucleation phenomenon as a function of temperature and the lattice misfit between the substrate and the solid, using molecular dynamics (MD) simulations. Our results have confirmed that at temperatures above the liquidus, the atoms in the liquid at the interface may exhibit pronounced atomic ordering, manifested by atomic layering normal to the interface, in-plane atomic ordering parallel to the interface, and the formation of a 2-dimensional (2D) ordered structure (a few atomic layers in thickness) on the substrate surface. Holistic quantification of such atomic ordering at the interface has revealed that the atomic layering is independent of lattice misfit and is only slightly enhanced by reducing temperature while both in-plane atomic ordering and the formation of the 2D ordered structure are significantly enhanced by reducing the lattice misfit and/or temperature. This substrate-induced atomic ordering in the liquid may have a significant influence on the subsequent heterogeneous nucleation process.
Giant Gas Cloud Made of Atoms Formed in First Stars Revealed in Universe's Most Distant Quasar
NASA Astrophysics Data System (ADS)
2003-07-01
Astronomers studying the most distant quasar yet found in the Universe have discovered a massive reservoir of gas containing atoms made in the cores of some of the first stars ever formed. The carbon-monoxide gas was revealed by the National Science Foundation's Very Large Array (VLA) and the Plateau de Bure radio interferometer in Europe. The gas, along with the young galaxy containing it, is seen as it was when the Universe was only one-sixteenth its current age, just emerging from the primeval "Dark Ages" before light could travel freely through the cosmos. VLA Image of Quasar VLA Image of J1148+5251 CREDIT: NRAO/AUI/NSF (Click on Image for Larger Version) "Our discovery of this much carbon monoxide gas in such an extremely distant and young galaxy is surprising. It means that, even at a very early time in the history of the Universe, galaxies already had huge amounts of molecular gas that would eventually form new generations of stars," said Chris Carilli, of the National Radio Astronomy Observatory (NRAO) in Socorro, New Mexico. The distant galaxy, dubbed J1148+5251, contains a bright quasar powered by a black hole at least a billion times more massive than the Sun. The galaxy is seen as it was only 870 million years after the Big Bang. The Universe now is 13.7 billion years old. J1148+5251 would have been among the first luminous objects in the Universe. The original atoms formed in the Universe within the first three minutes of the Big Bang were only hydrogen and helium. Carbon and oxygen -- the atoms making up carbon monoxide -- had to be made in the thermonuclear furnaces at the cores of the earliest stars. "The carbon and oxygen atoms in the gas we detected were made by some of the first stars ever formed, only about 650 million years after the Big Bang. In the next 200 million years or so, those stars -- probably very different stars from those we see today -- exploded as supernovae, spreading the carbon and oxygen out into space. Those atoms then cooled and combined into the carbon monoxide molecules we detected with our radio telescopes," said Fabian Walter, a Jansky Postdoctoral Fellow at the NRAO. Walter is lead author of a research paper in the July 24 issue of the scientific journal Nature, and, with Carilli and K.Y. Lo of NRAO, did the VLA observations. Frank Bertoldi of the Max-Planck Institute in Germany and Pierre Cox of the Institute of Space Astrophysics in Orsay, France, led the collaborators using the Plateau de Bure telescope. J1148+5251 Timeline Time Since Big Bang Event <300,000 years Universe Fully Ionized 300,000 years Hot charged particles cool and combine into neutral atoms; Universe becomes opaque; "Dark Ages" begin. ~200 million years First luminous objects form; Reionization begins. ~650 million years Stars forming in galaxy J1148+5251; Make carbon, oxygen atoms and begin to blast these atoms into interstellar space 870 million years J1148+5251 has accumulated massive reservoir of cool molecular gas containing Carbon Monoxide (CO) molecules; Radio waves from these molecules begin their journey to Earth. One billion years Reionization complete; Universe is transparent, ending "Dark Ages." 13.7 billion years Radio waves from J1148+5251's CO molecules arrive at radio telescopes on Earth. The discovery gives scientists a tantalizing direct view of one of the earliest galaxies in the young Universe, and raises questions about the nature of the first stars and how galaxies and quasars formed. "The Universe in which this galaxy existed is a very different Universe from the one we know today," Walter said. For about 300,000 years after the Big Bang, the Universe was filled with very hot gas which eventually became protons and electrons. Then, through expansion, the Universe cooled and the protons and electrons combined into neutral atoms that absorbed light and other forms of electromagnetic radiation. This period, from 300,000 years after the Big Bang, until a few hundred million years later when the first stars and galaxies began forming, is known as the cosmic Dark Ages. As the first stars and galaxies formed, intense radiation from the stars began to break apart -- or ionize -- the neutral atoms, allowing light once again to pass. As each new star's radiation ionized interstellar atoms, it formed a transparent "bubble" in the opaque Universe. The Universe began to resemble a cosmic Swiss cheese, with the holes growing larger until, about a billion years after the Big Bang, the holes all met each other and the Universe became fully transparent once again. This period is known as the Reionization Era of the Universe. In fact, combining the radio observations with data from optical telescopes shows that the transparent "bubble" around J1148+5251 is about 30 million light-years in diameter. "This is direct evidence that we are seeing this object helping reionize the Universe," Walter said. The amount of molecular gas in the galaxy -- a mass more than 10 billion times that of the Sun -- tells the scientists that things were happening quickly in the early Universe. "This is as much mass as we see in big galaxies today, and it had little time, astronomically speaking, to accumulate," said Carilli. Also, the most popular theory for how big galaxies formed is that they were built up over long spans of time by multiple mergers of smaller galaxies. "That's why it's so surprising to see such a massive galaxy so early in the Universe," said Walter. Studies of J1148+5251 and other distant objects yet to be discovered will help scientists find the answers to their questions about the Universe's early stars and galaxies. The radio observations of J1148+5251 gave astronomers a look at the galaxy itself, Walter emphasized, while optical telescopes showed only light coming from the bright quasar "engine" at the galaxy's core. Walter added that more VLA observations now being planned are aimed at producing an image of the young galaxy. Discovery Image of J1148+5251 SDSS Discovery Image of J1148+5251: Quasar is Red Dot Pointed Out by Arrow CREDIT: Sloan Digital Sky Survey At Apache Point Observatory (Click on Image for Larger Version) In addition, Walter also looks forward to studying other objects deeper into the era of reionization, both with the expanded VLA (EVLA) and with the Atacama Large Millimeter Array (ALMA), a joint North America-Europe project to be built in Chile. "With the EVLA and ALMA, we will be able to study the structures and dynamics of similar systems in great detail," Walter said. J1148+5251 was discovered by the Sloan Digital Sky Survey, using a 2.5-meter optical telescope at Apache Point, NM, earlier this year. At a distance of more than 12.8 billion light-years, it is the most distant quasar yet found in the Universe. Followup observations at the W.M. Keck Observatory in Hawaii showed a clear signature of light absorption indicating that the object is seen at the end of the reionization era. This signature, found using a spectroscope to analyze light from the object, is known as the Gunn-Peterson Effect, after James Gunn and Bruce Peterson, who predicted it in 1965. The carbon monoxide gas was found using radio telescopes that detected radio waves emitted by the gas molecules. The wavelength of this radio emission was greatly increased by the Doppler Effect produced by the expansion of the Universe. For example, at the great distance of J1148+5251, waves that left the galaxy with a length of less than one millimeter were received by the VLA at a wavelength of more than six millimeters. In addition to Walter, Carilli and Lo, who used the VLA to observe J1148+5251, other team members led by Bertoldi and Cox used the Institute of Millimeter Radio Astronomy's (IRAM) Plateau de Bure radio interferometer in France. These included Roberto Neri of IRAM; Alain Omont of the Paris Institute of Astrophysics; and Karl Menten of Germany's Max Planck Instutute for Radioastronomy. Xiaohui Fan of the University of Arizona's Steward Observatory and Michael Strauss of Princeton University were the Sloan Digital Sky Survey collaborators on the Nature paper. The National Radio Astronomy Observatory is a facility of the National Science Foundation, operated under cooperative agreement by Associated Universities, Inc.
The ARES High-level Intermediate Representation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moss, Nicholas David
The LLVM intermediate representation (IR) lacks semantic constructs for depicting common high-performance operations such as parallel and concurrent execution, communication and synchronization. Currently, representing such semantics in LLVM requires either extending the intermediate form (a signi cant undertaking) or the use of ad hoc indirect means such as encoding them as intrinsics and/or the use of metadata constructs. In this paper we discuss a work in progress to explore the design and implementation of a new compilation stage and associated high-level intermediate form that is placed between the abstract syntax tree and when it is lowered to LLVM's IR. Thismore » highlevel representation is a superset of LLVM IR and supports the direct representation of these common parallel computing constructs along with the infrastructure for supporting analysis and transformation passes on this representation.« less
NASA Technical Reports Server (NTRS)
2003-01-01
Topics covered include: Stable, Thermally Conductive Fillers for Bolted Joints; Connecting to Thermocouples with Fewer Lead Wires; Zipper Connectors for Flexible Electronic Circuits; Safety Interlock for Angularly Misdirected Power Tool; Modular, Parallel Pulse-Shaping Filter Architectures; High-Fidelity Piezoelectric Audio Device; Photovoltaic Power Station with Ultracapacitors for Storage; Time Analyzer for Time Synchronization and Monitor of the Deep Space Network; Program for Computing Albedo; Integrated Software for Analyzing Designs of Launch Vehicles; Abstract-Reasoning Software for Coordinating Multiple Agents; Software Searches for Better Spacecraft-Navigation Models; Software for Partly Automated Recognition of Targets; Antistatic Polycarbonate/Copper Oxide Composite; Better VPS Fabrication of Crucibles and Furnace Cartridges; Burn-Resistant, Strong Metal-Matrix Composites; Self-Deployable Spring-Strip Booms; Explosion Welding for Hermetic Containerization; Improved Process for Fabricating Carbon Nanotube Probes; Automated Serial Sectioning for 3D Reconstruction; and Parallel Subconvolution Filtering Architectures.
Plagianakos, V P; Magoulas, G D; Vrahatis, M N
2006-03-01
Distributed computing is a process through which a set of computers connected by a network is used collectively to solve a single problem. In this paper, we propose a distributed computing methodology for training neural networks for the detection of lesions in colonoscopy. Our approach is based on partitioning the training set across multiple processors using a parallel virtual machine. In this way, interconnected computers of varied architectures can be used for the distributed evaluation of the error function and gradient values, and, thus, training neural networks utilizing various learning methods. The proposed methodology has large granularity and low synchronization, and has been implemented and tested. Our results indicate that the parallel virtual machine implementation of the training algorithms developed leads to considerable speedup, especially when large network architectures and training sets are used.
Hökelek, Tuncer; Akduran, Nurcan; Özen, Azer; Uğurlu, Güventürk; Necefoğlu, Hacali
2017-03-01
The asymmetric unit of the title compound, [Cd 2 (C 7 H 4 NO 4 ) 4 (C 6 H 4 N 2 ) 4 ], contains one Cd II atom, two 3-nitro-benzoate (NB) anions and two 3-cyano-pyridine (CPy) ligands. The two CPy ligands act as monodentate N(pyridine)-bonding ligands, while the two NB anions act as bidentate ligands through the carboxyl-ate O atoms. The centrosymmetric dinuclear complex is generated by application of inversion symmetry, whereby the Cd II atoms are bridged by the carboxyl-ate O atoms of two symmetry-related NB anions, thus completing the distorted N 2 O 5 penta-gonal-bipyramidal coordination sphere of each Cd II atom. The benzene and pyridine rings are oriented at dihedral angles of 10.02 (7) and 5.76 (9)°, respectively. In the crystal, C-H⋯N hydrogen bonds link the mol-ecules, enclosing R 2 2 (26) ring motifs, in which they are further linked via C-H⋯O hydrogen bonds, resulting in a three-dimensional network. In addition, π-π stacking inter-actions between parallel benzene rings and between parallel pyridine rings of adjacent mol-ecules [shortest centroid-to-centroid distances = 3.885 (1) and 3.712 (1) Å, respectively], as well as a weak C-H⋯π inter-action, may further stabilize the crystal structure.
Comparison of stochastic optimization methods for all-atom folding of the Trp-Cage protein.
Schug, Alexander; Herges, Thomas; Verma, Abhinav; Lee, Kyu Hwan; Wenzel, Wolfgang
2005-12-09
The performances of three different stochastic optimization methods for all-atom protein structure prediction are investigated and compared. We use the recently developed all-atom free-energy force field (PFF01), which was demonstrated to correctly predict the native conformation of several proteins as the global optimum of the free energy surface. The trp-cage protein (PDB-code 1L2Y) is folded with the stochastic tunneling method, a modified parallel tempering method, and the basin-hopping technique. All the methods correctly identify the native conformation, and their relative efficiency is discussed.
Poly[[[μ3-N′-(carboxymethyl)ethylenediamine-N,N,N′-triacetato]dysprosium(III)] trihydrate
Zhuang, Xiaomei; Long, Qingping; Wang, Jun
2010-01-01
In the title coordination polymer, {[Dy(C10H13N2O8)]·3H2O}n, the dysprosium(III) ion is coordinated by two N atoms and six O atoms from three different (carboxymethyl)ethylenediaminetriacetate ligands in a distorted square-antiprismatic geometry. The ligands connect the metal atoms, forming layers parallel to the ab plane. O—H⋯O hydrogen bonds further assemble adjacent layers into a three-dimensional supramolecular network. PMID:21588859
Chow, Tze-Show
1989-01-01
A photon calorimeter (20, 40) is provided that comprises a laminar substrate (10, 22, 42) that is uniform in density and homogeneous in atomic composition. A plasma-sprayed coating (28, 48, 52), that is generally uniform in density and homogeneous in atomic composition within the proximity of planes that are parallel to the surfaces of the substrate, is applied to either one or both sides of the laminar substrate. The plasma-sprayed coatings may be very efficiently spectrally tailored in atomic number. Thermocouple measuring junctions (30, 50, 54) are positioned within the plasma-sprayed coatings. The calorimeter is rugged, inexpensive, and equilibrates in temperature very rapidly.
The force on the flex: Global parallelism and portability
NASA Technical Reports Server (NTRS)
Jordan, H. F.
1986-01-01
A parallel programming methodology, called the force, supports the construction of programs to be executed in parallel by an unspecified, but potentially large, number of processes. The methodology was originally developed on a pipelined, shared memory multiprocessor, the Denelcor HEP, and embodies the primitive operations of the force in a set of macros which expand into multiprocessor Fortran code. A small set of primitives is sufficient to write large parallel programs, and the system has been used to produce 10,000 line programs in computational fluid dynamics. The level of complexity of the force primitives is intermediate. It is high enough to mask detailed architectural differences between multiprocessors but low enough to give the user control over performance. The system is being ported to a medium scale multiprocessor, the Flex/32, which is a 20 processor system with a mixture of shared and local memory. Memory organization and the type of processor synchronization supported by the hardware on the two machines lead to some differences in efficient implementations of the force primitives, but the user interface remains the same. An initial implementation was done by retargeting the macros to Flexible Computer Corporation's ConCurrent C language. Subsequently, the macros were caused to directly produce the system calls which form the basis for ConCurrent C. The implementation of the Fortran based system is in step with Flexible Computer Corporations's implementation of a Fortran system in the parallel environment.
Concurrent Probabilistic Simulation of High Temperature Composite Structural Response
NASA Technical Reports Server (NTRS)
Abdi, Frank
1996-01-01
A computational structural/material analysis and design tool which would meet industry's future demand for expedience and reduced cost is presented. This unique software 'GENOA' is dedicated to parallel and high speed analysis to perform probabilistic evaluation of high temperature composite response of aerospace systems. The development is based on detailed integration and modification of diverse fields of specialized analysis techniques and mathematical models to combine their latest innovative capabilities into a commercially viable software package. The technique is specifically designed to exploit the availability of processors to perform computationally intense probabilistic analysis assessing uncertainties in structural reliability analysis and composite micromechanics. The primary objectives which were achieved in performing the development were: (1) Utilization of the power of parallel processing and static/dynamic load balancing optimization to make the complex simulation of structure, material and processing of high temperature composite affordable; (2) Computational integration and synchronization of probabilistic mathematics, structural/material mechanics and parallel computing; (3) Implementation of an innovative multi-level domain decomposition technique to identify the inherent parallelism, and increasing convergence rates through high- and low-level processor assignment; (4) Creating the framework for Portable Paralleled architecture for the machine independent Multi Instruction Multi Data, (MIMD), Single Instruction Multi Data (SIMD), hybrid and distributed workstation type of computers; and (5) Market evaluation. The results of Phase-2 effort provides a good basis for continuation and warrants Phase-3 government, and industry partnership.
Taboo search algorithm for item assignment in synchronized zone automated order picking system
NASA Astrophysics Data System (ADS)
Wu, Yingying; Wu, Yaohua
2014-07-01
The idle time which is part of the order fulfillment time is decided by the number of items in the zone; therefore the item assignment method affects the picking efficiency. Whereas previous studies only focus on the balance of number of kinds of items between different zones but not the number of items and the idle time in each zone. In this paper, an idle factor is proposed to measure the idle time exactly. The idle factor is proven to obey the same vary trend with the idle time, so the object of this problem can be simplified from minimizing idle time to minimizing idle factor. Based on this, the model of item assignment problem in synchronized zone automated order picking system is built. The model is a form of relaxation of parallel machine scheduling problem which had been proven to be NP-complete. To solve the model, a taboo search algorithm is proposed. The main idea of the algorithm is minimizing the greatest idle factor of zones with the 2-exchange algorithm. Finally, the simulation which applies the data collected from a tobacco distribution center is conducted to evaluate the performance of the algorithm. The result verifies the model and shows the algorithm can do a steady work to reduce idle time and the idle time can be reduced by 45.63% on average. This research proposed an approach to measure the idle time in synchronized zone automated order picking system. The approach can improve the picking efficiency significantly and can be seen as theoretical basis when optimizing the synchronized automated order picking systems.
Rokszin, Alice; Gombköto, Péter; Berényi, Antal; Márkus, Zita; Braunitzer, Gábor; Benedek, György; Nagy, Attila
2011-10-18
Recent morphological and physiological studies have suggested a strong relationship between the suprageniculate nucleus (Sg) of the posterior thalamus and the input structure of the basal ganglia, the caudate nucleus (CN) of the feline brain. Accordingly, to clarify if there is a real functional relationship between Sg and CN during visual information processing, we investigated the temporal relations of simultaneously recorded neuronal spike trains of these two structures, looking for any significant cross-correlation between the spiking of the simultaneously recorded neurons. For the purposes of statistical analysis, we used the shuffle and jittering resampling methods. Of the recorded 288 Sg-CN neuron pairs, 26 (9.2%) showed significantly correlated spontaneous activity. Nineteen pairs (6.7%) showed correlated activity during stationary visual stimulation, while 21 (7.4%) pairs during stimulus movement. There was no overlap between the neuron pairs that showed cross-correlated spontaneous activity and the pairs that synchronized their activity during visual stimulation. Thus visual stimulation seems to have been able to synchronize, and also, by other neuron pairs, desynchronize the activity of CN and Sg. In about half of the cases, the activation of Sg preceded the activation of CN by a few milliseconds, while in the other half, CN was activated earlier. Our results provide the first piece of evidence for the existence of a functional cooperation between Sg and CN. We argue that either a monosynaptic bidirectional direct connection should exist between these structures, or a common input comprising of parallel pathways synchronizing them. Copyright © 2011 Elsevier B.V. All rights reserved.
Gadolinium photoionization process
Paisner, J.A.; Comaskey, B.J.; Haynam, C.A.; Eggert, J.H.
1993-04-13
A method is provided for selective photoionization of the odd-numbered atomic mass gadolinium isotopes 155 and 157. The selective photoionization is accomplished by circular or linear parallel polarized laser beam energy effecting a three-step photoionization pathway.
Gadolinium photoionization process
Paisner, Jeffrey A.; Comaskey, Brian J.; Haynam, Christopher A.; Eggert, Jon H.
1993-01-01
A method is provided for selective photoionization of the odd-numbered atomic mass gadolinium isotopes 155 and 157. The selective photoionization is accomplished by circular or linear parallel polarized laser beam energy effecting a three-step photoionization pathway.
Fast and Accurate Support Vector Machines on Large Scale Systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Vishnu, Abhinav; Narasimhan, Jayenthi; Holder, Larry
Support Vector Machines (SVM) is a supervised Machine Learning and Data Mining (MLDM) algorithm, which has become ubiquitous largely due to its high accuracy and obliviousness to dimensionality. The objective of SVM is to find an optimal boundary --- also known as hyperplane --- which separates the samples (examples in a dataset) of different classes by a maximum margin. Usually, very few samples contribute to the definition of the boundary. However, existing parallel algorithms use the entire dataset for finding the boundary, which is sub-optimal for performance reasons. In this paper, we propose a novel distributed memory algorithm to eliminatemore » the samples which do not contribute to the boundary definition in SVM. We propose several heuristics, which range from early (aggressive) to late (conservative) elimination of the samples, such that the overall time for generating the boundary is reduced considerably. In a few cases, a sample may be eliminated (shrunk) pre-emptively --- potentially resulting in an incorrect boundary. We propose a scalable approach to synchronize the necessary data structures such that the proposed algorithm maintains its accuracy. We consider the necessary trade-offs of single/multiple synchronization using in-depth time-space complexity analysis. We implement the proposed algorithm using MPI and compare it with libsvm--- de facto sequential SVM software --- which we enhance with OpenMP for multi-core/many-core parallelism. Our proposed approach shows excellent efficiency using up to 4096 processes on several large datasets such as UCI HIGGS Boson dataset and Offending URL dataset.« less
Development of high precision digital driver of acoustic-optical frequency shifter for ROG
NASA Astrophysics Data System (ADS)
Zhang, Rong; Kong, Mei; Xu, Yameng
2016-10-01
We develop a high precision digital driver of the acoustic-optical frequency shifter (AOFS) based on the parallel direct digital synthesizer (DDS) technology. We use an atomic clock as the phase-locked loop (PLL) reference clock, and the PLL is realized by a dual digital phase-locked loop. A DDS sampling clock up to 320 MHz with a frequency stability as low as 10-12 Hz is obtained. By constructing the RF signal measurement system, it is measured that the frequency output range of the AOFS-driver is 52-58 MHz, the center frequency of the band-pass filter is 55 MHz, the ripple in the band is less than 1 dB@3MHz, the single channel output power is up to 0.3 W, the frequency stability is 1 ppb (1 hour duration), and the frequency-shift precision is 0.1 Hz. The obtained frequency stability has two orders of improvement compared to that of the analog AOFS-drivers. For the designed binary frequency shift keying (2-FSK) and binary phase shift keying (2-PSK) modulation system, the demodulating frequency of the input TTL synchronous level signal is up to 10 kHz. The designed digital-bus coding/decoding system is compatible with many conventional digital bus protocols. It can interface with the ROG signal detecting software through the integrated drive electronics (IDE) and exchange data with the two DDS frequency-shift channels through the signal detecting software.
NASA Astrophysics Data System (ADS)
Tolson, B.; Matott, L. S.; Gaffoor, T. A.; Asadzadeh, M.; Shafii, M.; Pomorski, P.; Xu, X.; Jahanpour, M.; Razavi, S.; Haghnegahdar, A.; Craig, J. R.
2015-12-01
We introduce asynchronous parallel implementations of the Dynamically Dimensioned Search (DDS) family of algorithms including DDS, discrete DDS, PA-DDS and DDS-AU. These parallel algorithms are unique from most existing parallel optimization algorithms in the water resources field in that parallel DDS is asynchronous and does not require an entire population (set of candidate solutions) to be evaluated before generating and then sending a new candidate solution for evaluation. One key advance in this study is developing the first parallel PA-DDS multi-objective optimization algorithm. The other key advance is enhancing the computational efficiency of solving optimization problems (such as model calibration) by combining a parallel optimization algorithm with the deterministic model pre-emption concept. These two efficiency techniques can only be combined because of the asynchronous nature of parallel DDS. Model pre-emption functions to terminate simulation model runs early, prior to completely simulating the model calibration period for example, when intermediate results indicate the candidate solution is so poor that it will definitely have no influence on the generation of further candidate solutions. The computational savings of deterministic model preemption available in serial implementations of population-based algorithms (e.g., PSO) disappear in synchronous parallel implementations as these algorithms. In addition to the key advances above, we implement the algorithms across a range of computation platforms (Windows and Unix-based operating systems from multi-core desktops to a supercomputer system) and package these for future modellers within a model-independent calibration software package called Ostrich as well as MATLAB versions. Results across multiple platforms and multiple case studies (from 4 to 64 processors) demonstrate the vast improvement over serial DDS-based algorithms and highlight the important role model pre-emption plays in the performance of parallel, pre-emptable DDS algorithms. Case studies include single- and multiple-objective optimization problems in water resources model calibration and in many cases linear or near linear speedups are observed.
Multimillion to billion atom simulations of nanosystems under extreme conditions
NASA Astrophysics Data System (ADS)
Vashishta, P.
2008-12-01
Advanced materials and devices with nanometer grain/feature sizes are being developed to achieve higher strength and toughness in ceramic materials and greater speeds in electronic devices. Below 100 nm, however, continuum description of materials and devices must be supplemented by atomistic descriptions. Current state of the art atomistic simulations involve 10 million - 1 billion atoms. We investigate initiation, growth and healing of wing cracks in confined silica glass by multimillion atom molecular dynamics (MD) simulations. Under dynamic compression, frictional sliding of pre-crack surfaces nucleates nanovoids, which evolve into nanocrack columns at the pre-crack tip. Nanocrack columns merge to form a wing crack, which grows via coalescence with nanovoids in the direction of maximum compression. Lateral confinement arrests the growth and partially heals the wing crack. Growth and arrest of the wing crack occur repeatedly, as observed in dynamic compression experiments on brittle solids under lateral confinement. MD simulation of hypervelocity projectile impact in aluminum nitride and alumina has also been studied. The simulations reveal strong interplay between shock- induced structural phase transformation, plastic deformation and brittle cracks. The shock wave splits into an elastic precursor and a wurtzite-to-rocksalt structural transformation wave. When the elastic wave reflected from the boundary of the sample interacts with the transformation wave front, nanocavities are generated along the penetration path of the projectile and dislocations in adjacent regions. The nanocavities coalesce to form mode I brittle cracks while dislocations generate kink bands that give rise to mode II cracks. These simulations provide a microscopic view of defects associated with simultaneous tensile and shear cracking at the structural phase transformation boundary due to shock impact in high-strength ceramics. Initiation of chemical reactions at shock fronts prior to detonation and dynamic transition in the shock structure of an energetic material (RDX) and reaction of aluminium nanoparticles in oxygen atmosphere followed by explosive burning is also discussed.
Su, Hao; Dickstein-Fischer, Laurie; Harrington, Kevin; Fu, Qiushi; Lu, Weina; Huang, Haibo; Cole, Gregory; Fischer, Gregory S
2010-01-01
This paper presents the development of new prismatic actuation approach and its application in human-safe humanoid head design. To reduce actuator output impedance and mitigate unexpected external shock, the prismatic actuation method uses cables to drive a piston with preloaded spring. By leveraging the advantages of parallel manipulator and cable-driven mechanism, the developed neck has a parallel manipulator embodiment with two cable-driven limbs embedded with preloaded springs and one passive limb. The eye mechanism is adapted for low-cost webcam with succinct "ball-in-socket" structure. Based on human head anatomy and biomimetics, the neck has 3 degree of freedom (DOF) motion: pan, tilt and one decoupled roll while each eye has independent pan and synchronous tilt motion (3 DOF eyes). A Kalman filter based face tracking algorithm is implemented to interact with the human. This neck and eye structure is translatable to other human-safe humanoid robots. The robot's appearance reflects a non-threatening image of a penguin, which can be translated into a possible therapeutic intervention for children with Autism Spectrum Disorders.
Multi-mode sensor processing on a dynamically reconfigurable massively parallel processor array
NASA Astrophysics Data System (ADS)
Chen, Paul; Butts, Mike; Budlong, Brad; Wasson, Paul
2008-04-01
This paper introduces a novel computing architecture that can be reconfigured in real time to adapt on demand to multi-mode sensor platforms' dynamic computational and functional requirements. This 1 teraOPS reconfigurable Massively Parallel Processor Array (MPPA) has 336 32-bit processors. The programmable 32-bit communication fabric provides streamlined inter-processor connections with deterministically high performance. Software programmability, scalability, ease of use, and fast reconfiguration time (ranging from microseconds to milliseconds) are the most significant advantages over FPGAs and DSPs. This paper introduces the MPPA architecture, its programming model, and methods of reconfigurability. An MPPA platform for reconfigurable computing is based on a structural object programming model. Objects are software programs running concurrently on hundreds of 32-bit RISC processors and memories. They exchange data and control through a network of self-synchronizing channels. A common application design pattern on this platform, called a work farm, is a parallel set of worker objects, with one input and one output stream. Statically configured work farms with homogeneous and heterogeneous sets of workers have been used in video compression and decompression, network processing, and graphics applications.
Many-Body Physics in Long-Range Interacting Quantum Systems
NASA Astrophysics Data System (ADS)
Zhu, Bihui
Ultracold atomic and molecular systems provide a useful platform for understanding quantum many-body physics. Recent progresses in AMO experiments enable access to systems exhibiting long-range interactions, opening a window for exploring the interplay between long-range interactions and dissipation. In this thesis, I develop theoretical approaches to study non-equilibrium dynamics in systems where such interplay is crucial. I first focus on a system of KRb molecules, where dipolar interactions and fast chemical reactions coexist. Using a classical kinetic theory and Monte Carlo methods, I study the evaporative cooling in a quasi-two-dimensional trap, and develop a protocol to reach quantum degeneracy. I also study the case where molecules are loaded into an optical lattice, and show that the strong dissipation induces a quantum Zeno effect, which suppresses the molecule loss. The analysis requires including multiple bands to explain recent experimental measurements, and can be used to determine the molecular filling fraction. I also investigate a system of radiating atoms, which experience long-range elastic and dissipative interactions. I explore the collective behavior of atoms and the role of atomic motion. The model is validated by comparison with a recent light scattering experiment using Sr atoms. I also show that incoherently pumped dipoles can undergo a dynamical phase transition to synchronization, and study its signature in the quantum regime.
Interference of resonance fluorescence from two four-level atoms
NASA Astrophysics Data System (ADS)
Wong, T.; Tan, S. M.; Collett, M. J.; Walls, D. F.
1997-02-01
In a recent experiment by Eichmann et al. [Phys. Rev. Lett. 70, 2359 (1993)], polarization-sensitive measurements of the fluorescence from two four-level ions driven by a linearly polarized laser were made. Depending on the polarization chosen, different degrees of interference were observed. We carry out a theoretical and numerical study of this system, showing that the results can largely be understood by treating the atoms as independent radiators which are synchronized by the phase of the incident laser field. The interference and its loss may be described in terms of the difference between coherent and incoherent driving of the various atomic transitions in the steady state. In the numerical simulations, which are carried out using the Monte Carlo wave-function method, we remove the assumption that the atoms radiate independently and consider the photodetection process in detail. This allows us to see the total interference pattern build up from individual photodetections and also to see the effects of superfluorescence, which become important when the atomic separation is comparable to an optical wavelength. The results of the calculations are compared with the experiment. We also carry out simulations in the non-steady-state regime and discuss the relationship between the visibility of the interference pattern and which-path considerations.
Santos, Elson C; Neto, Abel F G; Maneschy, Carlos E; Chen, James; Ramalho, Teodorico C; Neto, A M J C
2015-05-01
Here we analyzed several physical behaviors through computational simulation of systems consisting of a zig-zag type carbon nanotube and relaxed cold atoms (Rb, Au, Si and Ar). These atoms were chosen due to their different chemical properties. The atoms individually were relaxed on the outside of the nanotube during the simulations. Each system was found under the influence of a uniform electric field parallel to the carbon nanotube and under the thermal effect of the initial temperature at the simulations. Because of the electric field, the cold atoms orbited the carbon nanotube while increasing the initial temperature allowed the variation of the radius of the orbiting atoms. We calculated the following quantities: kinetic energy, potential energy and total energy and in situ temperature, molar entropy variation and average radius of the orbit of the atoms. Our data suggest that only the action of electric field is enough to generate the attractive potential and this system could be used as a selected atoms sensor.
A Red Carpet for Iron Metabolism
Muckenthaler, Martina U.; Rivella, Stefano; Hentze, Matthias W.; Galy, Bruno
2017-01-01
200 billion red blood cells (RBCs) are produced every day, requiring more than 2 × 3 1015 iron atoms every second to maintain adequate erythropoiesis. These numbers translate into 20 mL of blood being produced each day, containing 6 g of hemoglobin and 20 mg of iron. These impressive numbers illustrate why the making and breaking of RBCs is at the heart of iron physiology, providing an ideal context to discuss recent progress in understanding the systemic and cellular mechanisms that underlie the regulation of iron homeostasis and its disorders. PMID:28129536
Redetermination of clinobarylite, BaBe2Si2O7
Domizio, Adrien J. Di; Downs, Robert T.; Yang, Hexiong
2012-01-01
Clinobarylite, ideally BaBe2Si2O7 (chemical name barium diberyllium disilicate), is a sorosilicate mineral and dimorphic with barylite. It belongs to a group of compounds characterized by the general formula BaM 2+ 2Si2O7, with M 2+ = Be, Mg, Fe, Mn, Zn, Co, or Cu, among which the Be-, Fe-, and Cu-members have been found in nature. The crystal structure of clinobarylite has been re-examined in this study based on single-crystal X-ray diffraction data collected from a natural sample from the type locality (Khibiny Massif, Kola Peninsula, Russia). The structure of clinobarylite can be considered as a framework of BeO4 and SiO4 tetrahedra, with one of the O atoms coordinated to two Be and one Si, one coordinated to two Si, and two O atoms coordinated to one Si and one Be atom. The BeO4 tetrahedra share corners, forming chains parallel to the c axis, which are interlinked by the Si2O7 units oriented parallel to the a axis. The Ba2+ cations (site symmetry m..) are in the framework channels and are coordinated by eleven O atoms in form of an irregular polyhedron. The Si—Obr (bridging O atom, at site symmetry m..) bond length, the Si—Onbr (non-bridging O atoms) bond lengths, and the Si—O—Si angle within the Si2O7 unit are in marked contrast to the corresponding values determined in the previous study [Krivovichev et al. (2004 ▶). N. Jb. Miner. Mh. pp. 373–384]. PMID:23125568
Redetermination of clinobaryl-ite, BaBe(2)Si(2)O(7).
Domizio, Adrien J Di; Downs, Robert T; Yang, Hexiong
2012-10-01
Clinobaryl-ite, ideally BaBe(2)Si(2)O(7) (chemical name barium diberyllium disilicate), is a sorosilicate mineral and dimorphic with baryl-ite. It belongs to a group of compounds characterized by the general formula BaM(2+) (2)Si(2)O(7), with M(2+) = Be, Mg, Fe, Mn, Zn, Co, or Cu, among which the Be-, Fe-, and Cu-members have been found in nature. The crystal structure of clinobaryl-ite has been re-examined in this study based on single-crystal X-ray diffraction data collected from a natural sample from the type locality (Khibiny Massif, Kola Peninsula, Russia). The structure of clinobaryl-ite can be considered as a framework of BeO(4) and SiO(4) tetra-hedra, with one of the O atoms coordinated to two Be and one Si, one coordinated to two Si, and two O atoms coordinated to one Si and one Be atom. The BeO(4) tetra-hedra share corners, forming chains parallel to the c axis, which are inter-linked by the Si(2)O(7) units oriented parallel to the a axis. The Ba(2+) cations (site symmetry m..) are in the framework channels and are coordinated by eleven O atoms in form of an irregular polyhedron. The Si-O(br) (bridging O atom, at site symmetry m..) bond length, the Si-O(nbr) (non-bridging O atoms) bond lengths, and the Si-O-Si angle within the Si(2)O(7) unit are in marked contrast to the corresponding values determined in the previous study [Krivovichev et al. (2004 ▶). N. Jb. Miner. Mh. pp. 373-384].
Insights into the Hydrogen-Atom Transfer of the Blue Aroxyl.
Bächle, Josua; Marković, Marijana; Kelterer, Anne-Marie; Grampp, Günter
2017-10-19
An experimental and theoretical study on hydrogen-atom transfer dynamics in the hydrogen-bonded substituted phenol/phenoxyl complex of the blue aroxyl (2,4,6-tri-tert-butylphenoxyl) is presented. The experimental exchange dynamics is determined in different organic solvents from the temperature-dependent alternating line-width effect in the continuous-wave ESR spectrum. From bent Arrhenius plots, effective tunnelling contributions with parallel heavy-atom motion are concluded. To clarify the transfer mechanism, reaction paths for different conformers of the substituted phenol/phenoxyl complex are modelled theoretically. Various DFT and post-Hartree-Fock methods including multireference methods are applied. From the comparison of experimental and theoretical data it is concluded that the system favours concerted hydrogen-atom transfer along a parabolic reaction path caused by heavy-atom motion. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
An open-source, extensible system for laboratory timing and control
NASA Astrophysics Data System (ADS)
Gaskell, Peter E.; Thorn, Jeremy J.; Alba, Sequoia; Steck, Daniel A.
2009-11-01
We describe a simple system for timing and control, which provides control of analog, digital, and radio-frequency signals. Our system differs from most common laboratory setups in that it is open source, built from off-the-shelf components, synchronized to a common and accurate clock, and connected over an Ethernet network. A simple bus architecture facilitates creating new and specialized devices with only moderate experience in circuit design. Each device operates independently, requiring only an Ethernet network connection to the controlling computer, a clock signal, and a trigger signal. This makes the system highly robust and scalable. The devices can all be connected to a single external clock, allowing synchronous operation of a large number of devices for situations requiring precise timing of many parallel control and acquisition channels. Provided an accurate enough clock, these devices are capable of triggering events separated by one day with near-microsecond precision. We have achieved precisions of ˜0.1 ppb (parts per 109) over 16 s.
A Parameter Communication Optimization Strategy for Distributed Machine Learning in Sensors
Zhang, Jilin; Tu, Hangdi; Ren, Yongjian; Wan, Jian; Zhou, Li; Li, Mingwei; Wang, Jue; Yu, Lifeng; Zhao, Chang; Zhang, Lei
2017-01-01
In order to utilize the distributed characteristic of sensors, distributed machine learning has become the mainstream approach, but the different computing capability of sensors and network delays greatly influence the accuracy and the convergence rate of the machine learning model. Our paper describes a reasonable parameter communication optimization strategy to balance the training overhead and the communication overhead. We extend the fault tolerance of iterative-convergent machine learning algorithms and propose the Dynamic Finite Fault Tolerance (DFFT). Based on the DFFT, we implement a parameter communication optimization strategy for distributed machine learning, named Dynamic Synchronous Parallel Strategy (DSP), which uses the performance monitoring model to dynamically adjust the parameter synchronization strategy between worker nodes and the Parameter Server (PS). This strategy makes full use of the computing power of each sensor, ensures the accuracy of the machine learning model, and avoids the situation that the model training is disturbed by any tasks unrelated to the sensors. PMID:28934163
Furjan Mandic, Gordana; Peric, Mia; Krzelj, Lucijana; Stankovic, Sladana; Zenic, Natasa
2013-01-01
Although nutrition and doping are important factors in sports, neither is often investigated in synchronized swimming (Synchro).This study aimed to define and compare Synchro athletes and their coaches on their knowledge of sports nutrition (KSN)and knowledge of doping (KD); and to study factors related to KSN and KD in each of these groups. Additionally, the KSNand KD questionnaires were evaluated for their reliability and validity. Altogether, 82 athletes (17.2 ± 1.92 years of age) and 28 coaches (30.8 ± 5.26 years of age) from Croatia and Serbia were included in the study, with a 99% response rate. The testand retest correlations were 0.94 and 0.90 for the KD and KSN,respectively. Subjects responded equally to 91% queries of the KD and 89% queries of the KSN. Although most of the coache sare highly educated, they declared self-education as the primary source of information about doping and sport-nutrition. Coaches scored higher than their athletes on both questionnaires which defined appropriate discriminative validity of the questionnaires. Variables such as age, sports experience and formal education are positively correlated to KSN and KD scores among athletes. The athletes who scored better on the KD are less prone to doping behavior in the future. These data reinforce the need for systematic educational programs on doping and sports nutrition in synchronized swimming. Special attention should be placed on younger athletes. Key PointsAlthough most of the synchro coaches are highly educated, self-education is declared as the primary source of information about doping and sportnutrition.The knowledge of doping and doping-health hazards are negatively related to potential doping behavior in the future among synchronized swimmersThe data reinforce the need for systematic educational programs on doping and sports nutrition in synchronized swimming.We advocate improving the knowledge of sports nutrition among older coaches and the knowledge of doping among younger coaches, while among athletes,younger swimmers should be targeted.
Furjan Mandic, Gordana; Peric, Mia; Krzelj, Lucijana; Stankovic, Sladana; Zenic, Natasa
2013-01-01
Although nutrition and doping are important factors in sports, neither is often investigated in synchronized swimming (Synchro).This study aimed to define and compare Synchro athletes and their coaches on their knowledge of sports nutrition (KSN)and knowledge of doping (KD); and to study factors related to KSN and KD in each of these groups. Additionally, the KSNand KD questionnaires were evaluated for their reliability and validity. Altogether, 82 athletes (17.2 ± 1.92 years of age) and 28 coaches (30.8 ± 5.26 years of age) from Croatia and Serbia were included in the study, with a 99% response rate. The testand retest correlations were 0.94 and 0.90 for the KD and KSN,respectively. Subjects responded equally to 91% queries of the KD and 89% queries of the KSN. Although most of the coache sare highly educated, they declared self-education as the primary source of information about doping and sport-nutrition. Coaches scored higher than their athletes on both questionnaires which defined appropriate discriminative validity of the questionnaires. Variables such as age, sports experience and formal education are positively correlated to KSN and KD scores among athletes. The athletes who scored better on the KD are less prone to doping behavior in the future. These data reinforce the need for systematic educational programs on doping and sports nutrition in synchronized swimming. Special attention should be placed on younger athletes. Key Points Although most of the synchro coaches are highly educated, self-education is declared as the primary source of information about doping and sportnutrition. The knowledge of doping and doping-health hazards are negatively related to potential doping behavior in the future among synchronized swimmers The data reinforce the need for systematic educational programs on doping and sports nutrition in synchronized swimming. We advocate improving the knowledge of sports nutrition among older coaches and the knowledge of doping among younger coaches, while among athletes,younger swimmers should be targeted PMID:24421736
Research in Application of Geodetic GPS Receivers in Time Synchronization
NASA Astrophysics Data System (ADS)
Zhang, Q.; Zhang, P.; Sun, Z.; Wang, F.; Wang, X.
2018-04-01
In recent years, with the development of satellite orbit and clock parameters accurately determining technology and the popularity of geodetic GPS receivers, Common-View (CV) which proposed in 1980 by Allan has gained widespread application and achieved higher accuracy time synchronization results. GPS Common View (GPS CV) is the technology that based on multi-channel geodetic GPS receivers located in different place and under the same common-view schedule to receiving same GPS satellite signal at the same time, and then calculating the time difference between respective local receiver time and GPST by weighted theory, we will obtain the difference between above local time of receivers that installed in different station with external atomic clock. Multi-channel geodetic GPS receivers have significant advantages such as higher stability, higher accuracy and more common-view satellites in long baseline time synchronization application over the single-channel geodetic GPS receivers. At present, receiver hardware delay and surrounding environment influence are main error factors that affect the accuracy of GPS common-view result. But most error factors will be suppressed by observation data smoothing and using of observation data from different satellites in multi-channel geodetic GPS receiver. After the SA (Selective Availability) cancellation, using a combination of precise satellite ephemeris, ionospheric-free dual-frequency P-code observations and accurately measuring of receiver hardware delay, we can achieve time synchronization result on the order of nanoseconds (ns). In this paper, 6 days observation data of two IGS core stations with external atomic clock (PTB, USNO distance of two stations about 6000 km) were used to verify the GPS common-view theory. Through GPS observation data analysis, there are at least 2-4 common-view satellites and 5 satellites in a few tracking periods between two stations when the elevation angle is 15°, even there will be at least 2 common-view satellites for each tracking period when the elevation angle is 30°. Data processing used precise GPS satellite ephemeris, double-frequency P-code combination observations without ionosphere effects and the correction of the Black troposphere Delay Model. the weighted average of all common-viewed GPS satellites in the same tracking period is taken by weighting the root-mean-square error of each satellite, finally a time comparison data between two stations is obtained, and then the time synchronization result between the two stations (PTB and USNO) is obtained. It can be seen from the analysis of time synchronization result that the root mean square error of REFSV (the difference between the local frequency standard at the mid-point of the actual tracking length and the tracked satellite time in unit of 0.1 ns) shows a linear change within one day, However the jump occurs when jumping over the day which is mainly caused by satellites position being changed due to the interpolation of two-day precise satellite ephemeris across the day. the overall trend of time synchronization result is declining and tends to be stable within a week-long time. We compared the time synchronization results (without considering the hardware delay correction) with those published by the International Bureau of Weights and Measures (BIPM), and the comparing result from a week earlier shows that the trend is same but there is a systematic bias which was mainly caused by hardware delays of geodetic GPS receiver. Regardless of the hardware delay, the comparing result is about between 102 ns and 106 ns. the vast majority of the difference within 2 ns but the difference of individual moment does not exceed 4ns when taking into account the systemic bias which mainly caused by hardware delay. Therefore, it is feasible to use the geodetic GPS receiver to achieve the time synchronization result in nanosecond order between two stations which separated by thousands kilometers, and multi-channel geodetic GPS receivers have obvious advantages over single-channel geodetic GPS receivers in the number of common-viewing satellites. In order to obtain higher precision (e.g sub-nanosecond order) time synchronization results, we shall take account into carrier phase observations, hardware delay ,and more error-influencing factors should be considered such as troposphere delay correction, multipath effects, and hardware delays changes due to temperature changes.
FOREWORD: Modern Applications of Timescales Modern Applications of Timescales
NASA Astrophysics Data System (ADS)
Arias, E. F.; Lewandowski, W.
2011-08-01
The development of the first atomic frequency standard by Louis Essen in the 1950s is at the origin of the adoption of the atomic definition of the SI second by the 13th General Conference on Weights and Measures in 1967 and the consequent adoption of the atomic timescale. After the short reign of ephemeris time as the world's reference timescale from 1954 until 1967, Coordinated Universal Time (UTC), synchronized to universal time UT1, appeared as the best compromise for satisfying the requests of all users. At the moment of the discussion on the adoption of an atomic timescale to replace ephemeris time, the possibility of having both an astronomical time and an atomic time to serve different purposes was discussed. In the words of Essen [1], this 'would cause endless confusion as well as involving duplication of equipment'. Forty years after the adoption of the definition of Coordinated Universal Time at the International Telecommunication Union (ITU), we are close to the moment of making a decision on whether or not to decouple UTC from its tight link to the rotation of the Earth embodied in UT1. It has been a ten-year process of discussion, mainly at the ITU with the input of the International Astronomical Union, the BIPM, the Consultative Committee for Time and Frequency and other organizations. The majority opinion supported the change based on developers and users of systems that need time synchronization to a stable and continuous reference timescale; others insist on the necessity of keeping the leap-second strategy for serving some applications or just for tradition. It is our hope that, as happened in the seventies, the most appropriate definition to serve all modern applications will be adopted with the consensus of the different sectors. The redirection of international timekeeping from astronomy to metrology can be considered the benchmark that started the era of modern timescales, all based on atomic properties. The aim of this special issue of Metrologia is to review timescales in use today, either the internationally recognized references or those adapted to some specific applications, to discuss new and future developments and to present the sometimes complex procedures for making international recommendations. We are grateful to our colleagues who, without exception, accepted our invitation to contribute to this special issue. Reference Henderson D 2005 Metrologia 42 S4-29 The pdf file contains an appendix: "Glossary of acronyms related to timescales used in this issue".
Three holes bound to a double acceptor - Be(+) in germanium
NASA Technical Reports Server (NTRS)
Haller, E. E.; Mcmurray, R. E., Jr.; Falicov, L. M.; Haegel, N. M.; Hansen, W. L.
1983-01-01
A double acceptor binding three holes has been observed for the first time with photoconductive far-infrared spectroscopy in beryllium-doped germanium single crystals. This new center, Be(+), has a hole binding energy of about 5 meV and is only present when free holes are generated by ionization of either neutral shallow acceptors or neutral Be double acceptors. The Be(+) center thermally ionizes above 4 K. It disappears at a uniaxial stress higher than about a billion dyn/sq cm parallel to (111) as a result of the lifting of the valence-band degeneracy.
Hinds, Joanne M; Payne, Stephen J
2018-04-01
Collaborative inhibition is a phenomenon where collaborating groups experience a decrement in recall when interacting with others. Despite this, collaboration has been found to improve subsequent individual recall. We explore these effects in semantic recall, which is seldom studied in collaborative retrieval. We also examine "parallel CMC", a synchronous form of computer-mediated communication that has previously been found to improve collaborative recall [Hinds, J. M., & Payne, S. J. (2016). Collaborative inhibition and semantic recall: Improving collaboration through computer-mediated communication. Applied Cognitive Psychology, 30(4), 554-565]. Sixty three triads completed a semantic recall task, which involved generating words beginning with "PO" or "HE" across three recall trials, in one of three retrieval conditions: Individual-Individual-Individual (III), Face-to-face-Face-to-Face-Individual (FFI) and Parallel-Parallel-Individual (PPI). Collaborative inhibition was present across both collaborative conditions. Individual recall in Recall 3 was higher when participants had previously collaborated in comparison to recalling three times individually. There was no difference between face-to-face and parallel CMC recall, however subsidiary analyses of instance repetitions and subjective organisation highlighted differences in group members' approaches to recall in terms of organisation and attention to others' contributions. We discuss the implications of these findings in relation to retrieval strategy disruption.
Bit error rate tester using fast parallel generation of linear recurring sequences
Pierson, Lyndon G.; Witzke, Edward L.; Maestas, Joseph H.
2003-05-06
A fast method for generating linear recurring sequences by parallel linear recurring sequence generators (LRSGs) with a feedback circuit optimized to balance minimum propagation delay against maximal sequence period. Parallel generation of linear recurring sequences requires decimating the sequence (creating small contiguous sections of the sequence in each LRSG). A companion matrix form is selected depending on whether the LFSR is right-shifting or left-shifting. The companion matrix is completed by selecting a primitive irreducible polynomial with 1's most closely grouped in a corner of the companion matrix. A decimation matrix is created by raising the companion matrix to the (n*k).sup.th power, where k is the number of parallel LRSGs and n is the number of bits to be generated at a time by each LRSG. Companion matrices with 1's closely grouped in a corner will yield sparse decimation matrices. A feedback circuit comprised of XOR logic gates implements the decimation matrix in hardware. Sparse decimation matrices can be implemented with minimum number of XOR gates, and therefore a minimum propagation delay through the feedback circuit. The LRSG of the invention is particularly well suited to use as a bit error rate tester on high speed communication lines because it permits the receiver to synchronize to the transmitted pattern within 2n bits.
A high-sensitivity push-pull magnetometer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Breschi, E.; Grujić, Z. D.; Knowles, P.
2014-01-13
We describe our approach to atomic magnetometry based on the push-pull optical pumping technique. Cesium vapor is pumped and probed by a resonant laser beam whose circular polarization is modulated synchronously with the spin evolution dynamics induced by a static magnetic field. The magnetometer is operated in a phase-locked loop, and it has an intrinsic sensitivity below 20fT/√(Hz), using a room temperature paraffin-coated cell. We use the magnetometer to monitor magnetic field fluctuations with a sensitivity of 300fT/√(Hz)
Progress of the LASSO experiment
NASA Technical Reports Server (NTRS)
Serene, B. E. H.
1981-01-01
The LASSO (Later Synchronisation from Stationary Orbit) experiment, designed to demonstrate the feasibility of achieving time synchronization between remote atomic clocks with an accuracy of one nanosecond or better by using laser techniques for the first time is described. The experiment uses groundbased laser stations and the SIRIO-2 geostationary satellite to be launched towards the end of 1981. The qualification of the LASSO on-board equipment is discussed with a brief description of the electrical and optical test equipment used. The progress of the operational organization is included.
Scalable domain decomposition solvers for stochastic PDEs in high performance computing
Desai, Ajit; Khalil, Mohammad; Pettit, Chris; ...
2017-09-21
Stochastic spectral finite element models of practical engineering systems may involve solutions of linear systems or linearized systems for non-linear problems with billions of unknowns. For stochastic modeling, it is therefore essential to design robust, parallel and scalable algorithms that can efficiently utilize high-performance computing to tackle such large-scale systems. Domain decomposition based iterative solvers can handle such systems. And though these algorithms exhibit excellent scalabilities, significant algorithmic and implementational challenges exist to extend them to solve extreme-scale stochastic systems using emerging computing platforms. Intrusive polynomial chaos expansion based domain decomposition algorithms are extended here to concurrently handle high resolutionmore » in both spatial and stochastic domains using an in-house implementation. Sparse iterative solvers with efficient preconditioners are employed to solve the resulting global and subdomain level local systems through multi-level iterative solvers. We also use parallel sparse matrix–vector operations to reduce the floating-point operations and memory requirements. Numerical and parallel scalabilities of these algorithms are presented for the diffusion equation having spatially varying diffusion coefficient modeled by a non-Gaussian stochastic process. Scalability of the solvers with respect to the number of random variables is also investigated.« less
Scaling Semantic Graph Databases in Size and Performance
DOE Office of Scientific and Technical Information (OSTI.GOV)
Morari, Alessandro; Castellana, Vito G.; Villa, Oreste
In this paper we present SGEM, a full software system for accelerating large-scale semantic graph databases on commodity clusters. Unlike current approaches, SGEM addresses semantic graph databases by only employing graph methods at all the levels of the stack. On one hand, this allows exploiting the space efficiency of graph data structures and the inherent parallelism of graph algorithms. These features adapt well to the increasing system memory and core counts of modern commodity clusters. On the other hand, however, these systems are optimized for regular computation and batched data transfers, while graph methods usually are irregular and generate fine-grainedmore » data accesses with poor spatial and temporal locality. Our framework comprises a SPARQL to data parallel C compiler, a library of parallel graph methods and a custom, multithreaded runtime system. We introduce our stack, motivate its advantages with respect to other solutions and show how we solved the challenges posed by irregular behaviors. We present the result of our software stack on the Berlin SPARQL benchmarks with datasets up to 10 billion triples (a triple corresponds to a graph edge), demonstrating scaling in dataset size and in performance as more nodes are added to the cluster.« less
Scalable domain decomposition solvers for stochastic PDEs in high performance computing
DOE Office of Scientific and Technical Information (OSTI.GOV)
Desai, Ajit; Khalil, Mohammad; Pettit, Chris
Stochastic spectral finite element models of practical engineering systems may involve solutions of linear systems or linearized systems for non-linear problems with billions of unknowns. For stochastic modeling, it is therefore essential to design robust, parallel and scalable algorithms that can efficiently utilize high-performance computing to tackle such large-scale systems. Domain decomposition based iterative solvers can handle such systems. And though these algorithms exhibit excellent scalabilities, significant algorithmic and implementational challenges exist to extend them to solve extreme-scale stochastic systems using emerging computing platforms. Intrusive polynomial chaos expansion based domain decomposition algorithms are extended here to concurrently handle high resolutionmore » in both spatial and stochastic domains using an in-house implementation. Sparse iterative solvers with efficient preconditioners are employed to solve the resulting global and subdomain level local systems through multi-level iterative solvers. We also use parallel sparse matrix–vector operations to reduce the floating-point operations and memory requirements. Numerical and parallel scalabilities of these algorithms are presented for the diffusion equation having spatially varying diffusion coefficient modeled by a non-Gaussian stochastic process. Scalability of the solvers with respect to the number of random variables is also investigated.« less
Parallel algorithms for quantum chemistry. I. Integral transformations on a hypercube multiprocessor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Whiteside, R.A.; Binkley, J.S.; Colvin, M.E.
1987-02-15
For many years it has been recognized that fundamental physical constraints such as the speed of light will limit the ultimate speed of single processor computers to less than about three billion floating point operations per second (3 GFLOPS). This limitation is becoming increasingly restrictive as commercially available machines are now within an order of magnitude of this asymptotic limit. A natural way to avoid this limit is to harness together many processors to work on a single computational problem. In principle, these parallel processing computers have speeds limited only by the number of processors one chooses to acquire. Themore » usefulness of potentially unlimited processing speed to a computationally intensive field such as quantum chemistry is obvious. If these methods are to be applied to significantly larger chemical systems, parallel schemes will have to be employed. For this reason we have developed distributed-memory algorithms for a number of standard quantum chemical methods. We are currently implementing these on a 32 processor Intel hypercube. In this paper we present our algorithm and benchmark results for one of the bottleneck steps in quantum chemical calculations: the four index integral transformation.« less
Coherent population trapping with polarization modulation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yun, Peter, E-mail: enxue.yun@obspm.fr; Guérandel, Stéphane; Clercq, Emeric de
Coherent population trapping (CPT) is extensively studied for future vapor cell clocks of high frequency stability. In the constructive polarization modulation CPT scheme, a bichromatic laser field with polarization and phase synchronously modulated is applied on an atomic medium. A high contrast CPT signal is observed in this so-called double-modulation configuration, due to the fact that the atomic population does not leak to the extreme Zeeman states, and that the two CPT dark states, which are produced successively by the alternate polarizations, add constructively. Here, we experimentally investigate CPT signal dynamics first in the usual configuration, a single circular polarization.more » The double-modulation scheme is then addressed in both cases: one pulse Rabi interaction and two pulses Ramsey interaction. The impact and the optimization of the experimental parameters involved in the time sequence are reviewed. We show that a simple seven-level model explains the experimental observations. The double-modulation scheme yields a high contrast similar to the one of other high contrast configurations like push-pull optical pumping or crossed linear polarization scheme, with a setup allowing a higher compactness. The constructive polarization modulation is attractive for atomic clock, atomic magnetometer, and high precision spectroscopy applications.« less
Atomic resolution characterization of a SrTiO{sub 3} grain boundary in the STEM
DOE Office of Scientific and Technical Information (OSTI.GOV)
McGibbon, M.M.; Browning, N.D.; Chisholm, M.F.
This paper uses the complementary techniques of high resolution Z-contrast imaging and PEELS (parallel detection electron energy loss spectroscopy) to investigate the atomic structure and chemistry of a 25 degree symmetric tilt boundary in a bicrystal of the electroceramic SrTiO{sub 3}. The gain boundary is composed of two different boundary structural units which occur in about equal numbers: one which contains Ti-O columns and the other without.
Crystal structure of 4,5-dinitro-1 H-imidazole
Windler, G. Kenneth; Scott, Brian L.; Tomson, Neil C.; ...
2015-01-01
Here, the title compound, C 3H 2N 4O 4, forms crystals with two molecules in the asymmetric unit which are conformationally similar. With the exception of the O atoms of the nitro groups, the molecules are essentially planar. In the crystal, adjacent molecules are associated by N—H...N hydrogen bonds involving the imidazole N—H donors and N-atom acceptors of the unsaturated nitrogen of neighboring rings, forming layers parallel to (010).
Ballestero-Martínez, Ernesto; Campos-Fernández, Cristian Saul; Soto-Tellini, Victor Hugo; Gonzalez-Montiel, Simplicio; Martínez-Otero, Diego
2013-01-01
In the title compound, {[Cu(C10H8N4)3(H2O)2](ClO4)2}n, the coordination environment of the cationic CuII atom is distorted octahedral, formed by pairs of symmetry-equivalent 1,2-bis(pyridin-4-yl)diazene ligands, bridging 1,2-bis(pyridin-4-yl)diazene ligands and two non-equivalent water molecules. The 1,2-bis(pyridin-4-yl)diazene molecules form polymeric chains parallel to [-101] via azo bonds which are situated about inversion centres. Since the CuII atom is situated on a twofold rotation axis, the monomeric unit has point symmetry 2. The perchlorate anions are disordered in a 0.536 (9):0.464 (9) ratio and are acceptors of water H atoms in medium–strong O—H⋯O hydrogen bonds with graph set R 4 4(12). The water molecules, which are coordinated to the CuII atom and are hydrogen-bonded to the perchlorate anions, form columns parallel to [010]. A π–π interaction [centroid–centroid distance = 3.913 (2) Å] occurs between pyridine rings, and weak C—H⋯O interactions also occur. PMID:23794983
An evaluation of the state of time synchronization on leadership class supercomputers
Jones, Terry; Ostrouchov, George; Koenig, Gregory A.; ...
2017-10-09
We present a detailed examination of time agreement characteristics for nodes within extreme-scale parallel computers. Using a software tool we introduce in this paper, we quantify attributes of clock skew among nodes in three representative high-performance computers sited at three national laboratories. Our measurements detail the statistical properties of time agreement among nodes and how time agreement drifts over typical application execution durations. We discuss the implications of our measurements, why the current state of the field is inadequate, and propose strategies to address observed shortcomings.
Multitasking-Pascal extensions solve concurrency problems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mackie, P.H.
1982-09-29
To avoid deadlock (one process waiting for a resource than another process can't release) and indefinite postponement (one process being continually denied a resource request) in a multitasking-system application, it is possible to use a high-level development language with built-in concurrency handlers. Parallel Pascal is one such language; it extends standard Pascal via special task synchronizers: a new data type called signal, new system procedures called wait and send and a Boolean function termed awaited. To understand the language's use the author examines the problems it helps solve.
Tunable terahertz radiation source
Boulaevskii, Lev; Feldmann, David M; Jia, Quanxi; Koshelev, Alexei; Moody, Nathan A
2014-01-21
Terahertz radiation source and method of producing terahertz radiation, said source comprising a junction stack, said junction stack comprising a crystalline material comprising a plurality of self-synchronized intrinsic Josephson junctions; an electrically conductive material in contact with two opposing sides of said crystalline material; and a substrate layer disposed upon at least a portion of both the crystalline material and the electrically-conductive material, wherein the crystalline material has a c-axis which is parallel to the substrate layer, and wherein the source emits at least 1 mW of power.
Television animation store: Recording pictures on a parallel transfer magnetic disc
NASA Astrophysics Data System (ADS)
Durey, A. J.
1984-12-01
The recording and replaying of digital video signals using a computer-type magnetic disc-drive as part of an electronic rostrum camera animation system is described. The system was developed to enable picture sequences to be generated directly as television signals, instead of using cine film. The characteristics of the disc-drive are described together with data processing, error protection and signal synchronization systems, which enable digital television YUV component signals, sampled at 12 MHz, 4 MHz and 4 MHz respectively, to be recorded and replayed in real time.
An evaluation of the state of time synchronization on leadership class supercomputers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jones, Terry; Ostrouchov, George; Koenig, Gregory A.
We present a detailed examination of time agreement characteristics for nodes within extreme-scale parallel computers. Using a software tool we introduce in this paper, we quantify attributes of clock skew among nodes in three representative high-performance computers sited at three national laboratories. Our measurements detail the statistical properties of time agreement among nodes and how time agreement drifts over typical application execution durations. We discuss the implications of our measurements, why the current state of the field is inadequate, and propose strategies to address observed shortcomings.
Brushless exciters using a high temperature superconducting field winding
Garces, Luis Jose [Schenectady, NY; Delmerico, Robert William [Clifton Park, NY; Jansen, Patrick Lee [Scotia, NY; Parslow, John Harold [Scotia, NY; Sanderson, Harold Copeland [Tribes Hill, NY; Sinha, Gautam [Chesterfield, MO
2008-03-18
A brushless exciter for a synchronous generator or motor generally includes a stator and a rotor rotatably disposed within the stator. The rotor has a field winding and a voltage rectifying bridge circuit connected in parallel to the field winding. A plurality of firing circuits are connected the voltage rectifying bridge circuit. The firing circuit is configured to fire a signal at an angle of less than 90.degree. or at an angle greater than 90.degree.. The voltage rectifying bridge circuit rectifies the AC voltage to excite or de-excite the field winding.
NASA Technical Reports Server (NTRS)
Mehandru, S. P.; Anderson, A. B.; Ross, P. N.
1985-01-01
The CO adsorption on a 40 atom cluster model of the (111) surface and a 36 atom cluster model of the (100) surface of the Pt3Ti alloy was studied. Parallel binding to high coordinate sites associated with Ti and low CO bond scission barriers are predicted for both surfaces. The binding of CO to Pt sites occurs in an upright orientation. These orientations are a consequence of the nature of the CO pi donation interactions with the surface. On the Ti sites the orbitals donate to the nearly empty Ti 3d band and the antibonding counterpart orbitals are empty. On the Pt sites, however, they are in the filled Pt 5d region of the alloy band, which causes CO to bond in a vertical orientation by 5 delta donation from the carbon end.
ADSORPTION AND DISSOCIATION OF O2 ON Ti3Al (0001) STUDIED BY FIRST-PRINCIPLES
NASA Astrophysics Data System (ADS)
Wei, Li-Jing; Guo, Jian-Xin; Dai, Xiu-Hong; Wang, Ying-Long; Liu, Bao-Ting
2015-05-01
The adsorption and dissociation of oxygen molecule on Ti3Al (0001) surface have been investigated by density functional theory (DFT) with the generalized gradient approximation (GGA). All possible adsorption sites including nine vertical and fifteen parallel sites of O2 are considered on Ti3Al (0001) surface. It is found that all oxygen molecules dissociate except for three vertical adsorption sites after structure optimization. This indicates that oxygen molecules prefer to dissociate on the junction site between Ti and Al atoms. Oxygen atoms coming from dissociation of oxygen molecule tend to occupy the most stable adsorption sites of the Ti3Al (0001) surface. The distance of O-O is related to the surface dissociation distance of Ti3Al (0001) surface. The valence electron localization function (ELF) and projected density of states (DOS) show that the bonds of O-O are breakaway at parallel adsorption end structures.
Decarboxylation of furfural on Pd(111): Ab initio molecular dynamics simulations
NASA Astrophysics Data System (ADS)
Xue, Wenhua; Dang, Hongli; Shields, Darwin; Liu, Yingdi; Jentoft, Friederike; Resasco, Daniel; Wang, Sanwu
2013-03-01
Furfural conversion over metal catalysts plays an important role in the studies of biomass-derived feedstocks. We report ab initio molecular dynamics simulations for the decarboxylation process of furfural on the palladium surface at finite temperatures. We observed and analyzed the atomic-scale dynamics of furfural on the Pd(111) surface and the fluctuations of the bondlengths between the atoms in furfural. We found that the dominant bonding structure is the parallel structure in which the furfural plane, while slightly distorted, is parallel to the Pd surface. Analysis of the bondlength fluctuations indicates that the C-H bond is the aldehyde group of a furfural molecule is likely to be broken first, while the C =O bond has a tendency to be isolated as CO. Our results show that the reaction of decarbonylation dominates, consistent with the experimental measurements. Supported by DOE (DE-SC0004600). Simulations and calculations were performed on XSEDE's and NERSC's supercomputers.
LAMMPS strong scaling performance optimization on Blue Gene/Q
DOE Office of Scientific and Technical Information (OSTI.GOV)
Coffman, Paul; Jiang, Wei; Romero, Nichols A.
2014-11-12
LAMMPS "Large-scale Atomic/Molecular Massively Parallel Simulator" is an open-source molecular dynamics package from Sandia National Laboratories. Significant performance improvements in strong-scaling and time-to-solution for this application on IBM's Blue Gene/Q have been achieved through computational optimizations of the OpenMP versions of the short-range Lennard-Jones term of the CHARMM force field and the long-range Coulombic interaction implemented with the PPPM (particle-particle-particle mesh) algorithm, enhanced by runtime parameter settings controlling thread utilization. Additionally, MPI communication performance improvements were made to the PPPM calculation by re-engineering the parallel 3D FFT to use MPICH collectives instead of point-to-point. Performance testing was done using anmore » 8.4-million atom simulation scaling up to 16 racks on the Mira system at Argonne Leadership Computing Facility (ALCF). Speedups resulting from this effort were in some cases over 2x.« less
The Nature of Bonding in Bulk Tellurium Composed of One-Dimensional Helical Chains.
Yi, Seho; Zhu, Zhili; Cai, Xiaolin; Jia, Yu; Cho, Jun-Hyung
2018-05-07
Bulk tellurium (Te) is composed of one-dimensional (1D) helical chains which have been considered to be coupled by van der Waals (vdW) interactions. However, on the basis of first-principles density functional theory calculations, we here propose a different bonding nature between neighboring chains: i.e., helical chains made of normal covalent bonds are connected together by coordinate covalent bonds. It is revealed that the lone pairs of electrons of Te atoms participate in forming coordinate covalent bonds between neighboring chains, where each Te atom behaves as both an electron donor to neighboring chains and an electron acceptor from neighboring chains. This ligand-metal-like bonding nature in bulk Te results in the same order of bulk moduli along the directions parallel and perpendicular to the chains, contrasting with the large anisotropy of bulk moduli in vdW crystals. We further find that the electron effective masses parallel and perpendicular to the chains are almost the same as each other, consistent with the observed nearly isotropic electrical resistivity. It is thus demonstrated that the normal/coordinate covalent bonds parallel/perpendicular to the chains in bulk Te lead to a minor anisotropy in structural and transport properties.
Raman-Ramsey multizone spectroscopy in a pure rubidium vapor cell
DOE Office of Scientific and Technical Information (OSTI.GOV)
Failache, H.; Lenci, L.; Lezama, A.
2010-02-15
In view of application to a miniaturized spectroscopy system, we consider an optical setup that splits a laser beam into several parallel narrow light sheets allowing an effective beam expansion and consequently longer atom-light interaction times. We analyze the multizone coherent population trapping (MZCPT) spectroscopy of alkali-metal-vapor atoms, without buffer gas, in the presence of a split light beam. We show that the MZCPT signal is largely insensitive to intensity broadening. Experimentally observed spectra are in qualitative agreement with the predictions of a simplified model that describes each spectrum as an integral over the atomic velocity distribution of Ramsey multizonemore » spectra.« less
Blackbody emission from laser breakdown in high-pressure gases.
Bataller, A; Plateau, G R; Kappus, B; Putterman, S
2014-08-15
Laser induced breakdown of pressurized gases is used to generate plasmas under conditions where the atomic density and temperature are similar to those found in sonoluminescing bubbles. Calibrated streak spectroscopy reveals that a blackbody persists well after the exciting femtosecond laser pulse has turned off. Deviation from Saha's equation of state and an accompanying large reduction in ionization potential are observed at unexpectedly low atomic densities-in parallel with sonoluminescence. In laser breakdown, energy input proceeds via excitation of electrons whereas in sonoluminescence it is initiated via the atoms. The similar responses indicate that these systems are revealing the thermodynamics and transport of a strongly coupled plasma.
Blackbody Emission from Laser Breakdown in High-Pressure Gases
NASA Astrophysics Data System (ADS)
Bataller, A.; Plateau, G. R.; Kappus, B.; Putterman, S.
2014-08-01
Laser induced breakdown of pressurized gases is used to generate plasmas under conditions where the atomic density and temperature are similar to those found in sonoluminescing bubbles. Calibrated streak spectroscopy reveals that a blackbody persists well after the exciting femtosecond laser pulse has turned off. Deviation from Saha's equation of state and an accompanying large reduction in ionization potential are observed at unexpectedly low atomic densities—in parallel with sonoluminescence. In laser breakdown, energy input proceeds via excitation of electrons whereas in sonoluminescence it is initiated via the atoms. The similar responses indicate that these systems are revealing the thermodynamics and transport of a strongly coupled plasma.
N′-[(E)-3-Chloro-2-fluorobenzylidene]-6-methylnicotinohydrazide monohydrate
Fun, Hoong-Kun; Quah, Ching Kheng; Shyma, P. C.; Kalluraya, Balakrishna; Vidyashree, J. H. S.
2012-01-01
The title compound, C14H11ClFN3O·H2O, exists in an E conformation with respect to the N=C bond. The pyridine ring forms a dihedral angle of 5.00 (9)° with the benzene ring. In the crystal, the ketone O atom accepts one O—H⋯O and one C—H⋯O hydrogen bond, the water O atom accepts one N—H⋯O and two C—H⋯O hydrogen bonds and the pyridine N atom accepts one O—H⋯N hydrogen bond, forming layers parallel to the ab plane. PMID:22798798
Structures of 38-atom gold-platinum nanoalloy clusters
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ong, Yee Pin; Yoon, Tiem Leong; Lim, Thong Leng
2015-04-24
Bimetallic nanoclusters, such as gold-platinum nanoclusters, are nanomaterials promising wide range of applications. We perform a numerical study of 38-atom gold-platinum nanoalloy clusters, Au{sub n}Pt{sub 38−n} (0 ≤ n ≤ 38), to elucidate the geometrical structures of these clusters. The lowest-energy structures of these bimetallic nanoclusters at the semi-empirical level are obtained via a global-minimum search algorithm known as parallel tempering multi-canonical basin hopping plus genetic algorithm (PTMBHGA), in which empirical Gupta many-body potential is used to describe the inter-atomic interactions among the constituent atoms. The structures of gold-platinum nanoalloy clusters are predicted to be core-shell segregated nanoclusters. Gold atomsmore » are observed to preferentially occupy the surface of the clusters, while platinum atoms tend to occupy the core due to the slightly smaller atomic radius of platinum as compared to gold’s. The evolution of the geometrical structure of 38-atom Au-Pt clusters displays striking similarity with that of 38-atom Au-Cu nanoalloy clusters as reported in the literature.« less
High performance computing in biology: multimillion atom simulations of nanoscale systems
Sanbonmatsu, K. Y.; Tung, C.-S.
2007-01-01
Computational methods have been used in biology for sequence analysis (bioinformatics), all-atom simulation (molecular dynamics and quantum calculations), and more recently for modeling biological networks (systems biology). Of these three techniques, all-atom simulation is currently the most computationally demanding, in terms of compute load, communication speed, and memory load. Breakthroughs in electrostatic force calculation and dynamic load balancing have enabled molecular dynamics simulations of large biomolecular complexes. Here, we report simulation results for the ribosome, using approximately 2.64 million atoms, the largest all-atom biomolecular simulation published to date. Several other nanoscale systems with different numbers of atoms were studied to measure the performance of the NAMD molecular dynamics simulation program on the Los Alamos National Laboratory Q Machine. We demonstrate that multimillion atom systems represent a 'sweet spot' for the NAMD code on large supercomputers. NAMD displays an unprecedented 85% parallel scaling efficiency for the ribosome system on 1024 CPUs. We also review recent targeted molecular dynamics simulations of the ribosome that prove useful for studying conformational changes of this large biomolecular complex in atomic detail. PMID:17187988
A search for isotopic anomalies in uranium. [in chondritic meteorites and terrestrial basalt
NASA Technical Reports Server (NTRS)
Chen, J. H.; Wasserburg, G. J.
1980-01-01
The U-238/U-235 ratios for nine bulk chondritic meteorites and a terrestrial basalt were measured. The total range in U-238/U-235 determined for both total meteorites and for acid leaches was from 137.2 terrestrial U. The typical errors in a single determination are plus or minus 6 per thousand (2 sigma m) for a 2 ng U sample from a chondrite. Taking the extreme values of delta U-235 for each measurement the maximum amount of excess U-235 that can be allowed to be present ranges from 200 million to 2 billion atoms per gram of bulk meteorite. These results do not support the claims of variations in U-238/U-235 at the percentage levels or number of excess U-235 atoms in some of the same meteorites as reported by several other previous workers.
High sensitive formaldehyde graphene gas sensor modified by atomic layer deposition zinc oxide films
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mu, Haichuan; Zhang, Zhiqiang; Wang, Keke
2014-07-21
Zinc oxide (ZnO) thin films with various thicknesses were fabricated by Atomic Layer Deposition on Chemical Vapor Deposition grown graphene films and their response to formaldehyde has been investigated. It was found that 0.5 nm ZnO films modified graphene sensors showed high response to formaldehyde with the resistance change up to 52% at the concentration of 9 parts-per-million (ppm) at room temperature. Meanwhile, the detection limit could reach 180 parts-per-billion (ppb) and fast response of 36 s was also obtained. The high sensitivity could be attributed to the combining effect from the highly reactive, top mounted ZnO thin films, and high conductivemore » graphene base network. The dependence of ZnO films surface morphology and its sensitivity on the ZnO films thickness was also investigated.« less