DOE Office of Scientific and Technical Information (OSTI.GOV)
Chen, Chao; Pouransari, Hadi; Rajamanickam, Sivasankaran
We present a parallel hierarchical solver for general sparse linear systems on distributed-memory machines. For large-scale problems, this fully algebraic algorithm is faster and more memory-efficient than sparse direct solvers because it exploits the low-rank structure of fill-in blocks. Depending on the accuracy of low-rank approximations, the hierarchical solver can be used either as a direct solver or as a preconditioner. The parallel algorithm is based on data decomposition and requires only local communication for updating boundary data on every processor. Moreover, the computation-to-communication ratio of the parallel algorithm is approximately the volume-to-surface-area ratio of the subdomain owned by everymore » processor. We also provide various numerical results to demonstrate the versatility and scalability of the parallel algorithm.« less
NASA Technical Reports Server (NTRS)
Voellmer, George
1992-01-01
Compliant element for robot wrist accepts small displacements in one direction only (to first approximation). Three such elements combined to obtain translational compliance along three orthogonal directions, without rotational compliance along any of them. Element is double-blade flexure joint in which two sheets of spring steel attached between opposing blocks, forming rectangle. Blocks moved parallel to each other in one direction only. Sheets act as double cantilever beams deforming in S-shape, keeping blocks parallel.
NASA Technical Reports Server (NTRS)
Sanger, Eugen
1932-01-01
A method is presented for approximate static calculation, which is based on the customary assumption of rigid ribs, while taking into account the systematic errors in the calculation results due to this arbitrary assumption. The procedure is given in greater detail for semicantilever and cantilever wings with polygonal spar plan form and for wings under direct loading only. The last example illustrates the advantages of the use of influence lines for such wing structures and their practical interpretation.
Discovery of Grooves on Gaspra
Veverka, J.; Thomas, P.; Simonelli, D.; Belton, M.J.S.; Carr, M.; Chapman, C.; Davies, M.E.; Greeley, R.; Greenberg, R.; Head, J.; Klaasen, K.; Johnson, T.V.; Morrison, D.; Neukum, G.
1994-01-01
We report the discovery of grooves in Galileo high-resolution images of Gaspra. These features, previously seen only on Mars' satellite Phobos, are most likely related to severe impacts. Grooves on Gaspra occur as linear and pitted depressions, typically 100-200 m wide, 0.8 to 2.5 km long, and 10-20 m deep. Most occur in two major groups, one of which trends approximately parallel to the asteroid's long axis, but is offset by some 15??; the other is approximately perpendicular to this trend. The first of these directions falls along a family of planes which parallel three extensive flat facets identified by Thomas et al., Icarus 107. The occurrence of grooves on Gaspra is consistent with other indications (irregular shape, cratering record) that this asteroid has evolved through a violent collisional history. The bodywide congruence of major groove directions and other structural elements suggests that present-day Gaspra is a globally coherent body. ?? 1994 Academic Press. All rights reserved.
Trapped Atoms in One-Dimensional Photonic Crystals
2013-08-09
a single silicon -nitride nanobeam (refractive index n = 2) with a 1D array of filleted rectangular holes along the propagation direction; atoms are...trapped in the centers of the holes (figure 1( a )). The second waveguide consists of two parallel silicon nitride nanobeams, each with a periodic array...the refractive index of silicon nitride is approximately constant across the optical domain, we adopt the approximation based on a frequency
Parallel Anisotropic Tetrahedral Adaptation
NASA Technical Reports Server (NTRS)
Park, Michael A.; Darmofal, David L.
2008-01-01
An adaptive method that robustly produces high aspect ratio tetrahedra to a general 3D metric specification without introducing hybrid semi-structured regions is presented. The elemental operators and higher-level logic is described with their respective domain-decomposed parallelizations. An anisotropic tetrahedral grid adaptation scheme is demonstrated for 1000-1 stretching for a simple cube geometry. This form of adaptation is applicable to more complex domain boundaries via a cut-cell approach as demonstrated by a parallel 3D supersonic simulation of a complex fighter aircraft. To avoid the assumptions and approximations required to form a metric to specify adaptation, an approach is introduced that directly evaluates interpolation error. The grid is adapted to reduce and equidistribute this interpolation error calculation without the use of an intervening anisotropic metric. Direct interpolation error adaptation is illustrated for 1D and 3D domains.
NASA Astrophysics Data System (ADS)
Allphin, Devin
Computational fluid dynamics (CFD) solution approximations for complex fluid flow problems have become a common and powerful engineering analysis technique. These tools, though qualitatively useful, remain limited in practice by their underlying inverse relationship between simulation accuracy and overall computational expense. While a great volume of research has focused on remedying these issues inherent to CFD, one traditionally overlooked area of resource reduction for engineering analysis concerns the basic definition and determination of functional relationships for the studied fluid flow variables. This artificial relationship-building technique, called meta-modeling or surrogate/offline approximation, uses design of experiments (DOE) theory to efficiently approximate non-physical coupling between the variables of interest in a fluid flow analysis problem. By mathematically approximating these variables, DOE methods can effectively reduce the required quantity of CFD simulations, freeing computational resources for other analytical focuses. An idealized interpretation of a fluid flow problem can also be employed to create suitably accurate approximations of fluid flow variables for the purposes of engineering analysis. When used in parallel with a meta-modeling approximation, a closed-form approximation can provide useful feedback concerning proper construction, suitability, or even necessity of an offline approximation tool. It also provides a short-circuit pathway for further reducing the overall computational demands of a fluid flow analysis, again freeing resources for otherwise unsuitable resource expenditures. To validate these inferences, a design optimization problem was presented requiring the inexpensive estimation of aerodynamic forces applied to a valve operating on a simulated piston-cylinder heat engine. The determination of these forces was to be found using parallel surrogate and exact approximation methods, thus evidencing the comparative benefits of this technique. For the offline approximation, latin hypercube sampling (LHS) was used for design space filling across four (4) independent design variable degrees of freedom (DOF). Flow solutions at the mapped test sites were converged using STAR-CCM+ with aerodynamic forces from the CFD models then functionally approximated using Kriging interpolation. For the closed-form approximation, the problem was interpreted as an ideal 2-D converging-diverging (C-D) nozzle, where aerodynamic forces were directly mapped by application of the Euler equation solutions for isentropic compression/expansion. A cost-weighting procedure was finally established for creating model-selective discretionary logic, with a synthesized parallel simulation resource summary provided.
A distributed-memory approximation algorithm for maximum weight perfect bipartite matching
DOE Office of Scientific and Technical Information (OSTI.GOV)
Azad, Ariful; Buluc, Aydin; Li, Xiaoye S.
We design and implement an efficient parallel approximation algorithm for the problem of maximum weight perfect matching in bipartite graphs, i.e. the problem of finding a set of non-adjacent edges that covers all vertices and has maximum weight. This problem differs from the maximum weight matching problem, for which scalable approximation algorithms are known. It is primarily motivated by finding good pivots in scalable sparse direct solvers before factorization where sequential implementations of maximum weight perfect matching algorithms, such as those available in MC64, are widely used due to the lack of scalable alternatives. To overcome this limitation, we proposemore » a fully parallel distributed memory algorithm that first generates a perfect matching and then searches for weightaugmenting cycles of length four in parallel and iteratively augments the matching with a vertex disjoint set of such cycles. For most practical problems the weights of the perfect matchings generated by our algorithm are very close to the optimum. An efficient implementation of the algorithm scales up to 256 nodes (17,408 cores) on a Cray XC40 supercomputer and can solve instances that are too large to be handled by a single node using the sequential algorithm.« less
The transference of heat from a hot plate to an air stream
NASA Technical Reports Server (NTRS)
Elias, Franz
1931-01-01
The object of the present study was to define experimentally the field of temperature and velocity in a heated flat plate when exposed to an air stream whose direction is parallel to it, then calculate therefrom the heat transference and the friction past the flat plate, and lastly, compare the test data with the mathematical theory. To ensure comparable results, we were to actually obtain or else approximate: a) two-dimensional flow; b) constant plate temperature in the direction of the stream. To approximate the flow in two dimensions, we chose a relatively wide plate and measured the velocity and temperature in the median plane.
Anisotropic Behaviour of Magnetic Power Spectra in Solar Wind Turbulence.
NASA Astrophysics Data System (ADS)
Banerjee, S.; Saur, J.; Gerick, F.; von Papen, M.
2017-12-01
Introduction:High altitude fast solar wind turbulence (SWT) shows different spectral properties as a function of the angle between the flow direction and the scale dependent mean magnetic field (Horbury et al., PRL, 2008). The average magnetic power contained in the near perpendicular direction (80º-90º) was found to be approximately 5 times larger than the average power in the parallel direction (0º- 10º). In addition, the parallel power spectra was found to give a steeper (-2) power law than the perpendicular power spectral density (PSD) which followed a near Kolmogorov slope (-5/3). Similar anisotropic behaviour has also been observed (Chen et al., MNRAS, 2011) for slow solar wind (SSW), but using a different method exploiting multi-spacecraft data of Cluster. Purpose:In the current study, using Ulysses data, we investigate (i) the anisotropic behaviour of near ecliptic slow solar wind using the same methodology (described below) as that of Horbury et al. (2008) and (ii) the dependence of the anisotropic behaviour of SWT as a function of the heliospheric latitude.Method:We apply the wavelet method to calculate the turbulent power spectra of the magnetic field fluctuations parallel and perpendicular to the local mean magnetic field (LMF). According to Horbury et al., LMF for a given scale (or size) is obtained using an envelope of the envelope of that size. Results:(i) SSW intervals always show near -5/3 perpendicular spectra. Unlike the fast solar wind (FSW) intervals, for SSW, we often find intervals where power parallel to the mean field is not observed. For a few intervals with sufficient power in parallel direction, slow wind turbulence also exhibit -2 parallel spectra similar to FSW.(ii) The behaviours of parallel and perpendicular power spectra are found to be independent of the heliospheric latitude. Conclusion:In the current study we do not find significant influence of the heliospheric latitude on the spectral slopes of parallel and perpendicular magnetic spectra. This indicates that the spectral anisotropy in parallel and perpendicular direction is governed by intrinsic properties of SWT.
Block iterative restoration of astronomical images with the massively parallel processor
NASA Technical Reports Server (NTRS)
Heap, Sara R.; Lindler, Don J.
1987-01-01
A method is described for algebraic image restoration capable of treating astronomical images. For a typical 500 x 500 image, direct algebraic restoration would require the solution of a 250,000 x 250,000 linear system. The block iterative approach is used to reduce the problem to solving 4900 121 x 121 linear systems. The algorithm was implemented on the Goddard Massively Parallel Processor, which can solve a 121 x 121 system in approximately 0.06 seconds. Examples are shown of the results for various astronomical images.
Campione, Salvatore; Warne, Larry K.; Basilio, Lorena I.
2017-09-29
In this paper we develop a fully-retarded, dipole approximation model to estimate the effective polarizabilities of a dimer made of dielectric resonators. They are computed from the polarizabilities of the two resonators composing the dimer. We analyze the situation of full-cubes as well as split-cubes, which have been shown to exhibit overlapping electric and magnetic resonances. We compare the effective dimer polarizabilities to ones retrieved via full-wave simulations as well as ones computed via a quasi-static, dipole approximation. We observe good agreement between the fully-retarded solution and the full-wave results, whereas the quasi-static approximation is less accurate for the problemmore » at hand. The developed model can be used to predict the electric and magnetic resonances of a dimer under parallel or orthogonal (to the dimer axis) excitation. This is particularly helpful when interested in locating frequencies at which the dimer will emit directional radiation.« less
NASA Astrophysics Data System (ADS)
Mortensen, Kell; Borger, Anine L.; Kirkensgaard, Jacob J. K.; Garvey, Christopher J.; Almdal, Kristoffer; Dorokhin, Andriy; Huang, Qian; Hassager, Ole
2018-05-01
We present structural small-angle neutron scattering studies of a three-armed polystyrene star polymer with short deuterated segments at the end of each arm. We show that the form factor of the three-armed star molecules in the relaxed state agrees with that of the random phase approximation of Gaussian chains. Upon exposure to large extensional flow conditions, the star polymers change conformation resulting in a highly stretched structure that mimics a fully extended three-armed tube model. All three arms are parallel to the flow, one arm being either in positive or negative stretching direction, while the two other arms are oriented parallel, right next to each other in the direction opposite to the first arm.
27 CFR 9.125 - Fredericksburg in the Texas Hill Country.
Code of Federal Regulations, 2010 CFR
2010-04-01
...) 1504, at the junction of a light-duty road known locally as Jung Road. (1) From the beginning point, the boundary proceeds on Jung Road in a northwesterly direction across the Pedernales River. (2) Then northwesterly approximately 1 mile along Jung Road as it parallels the Pedernales River. (3) Then north along...
NASA Technical Reports Server (NTRS)
Bame, S. J.; Asbridge, J. R.; Feldman, W. C.; Gosling, J. T.; Zwickl, R. D.
1981-01-01
In near time coincidence with the arrival of helium enriched plasma driving the shock wave disturbance of November 12-13, 1978, strong bi-directional streaming of solar wind electrons greater than about 80 eV was observed with Los Alamos instrumentation on ISEE 3. The streaming persisted for many hours simultaneously parallel and anti-parallel to the interplanetary magnetic field which was directed roughly perpendicular to the sun-satellite line. This example of bidirectional streaming cannot be explained by field line connection to the earth's bow shock or the outward propagating interplanetary shock which passed ISEE 3 approximately 16 hours earlier. The event is explained if the local interplanetary field was a part of a magnetic bottle rooted at the sun or a disconnected loop propagating outward.
NASA Technical Reports Server (NTRS)
Gutmann, R. J.; Borrego, J. M.
1978-01-01
Rectenna conversion efficiencies (RF to dc) approximating 85 percent were demonstrated on a small scale, clearly indicating the feasibility and potential of efficiency of microwave power to dc. The overall cost estimates of the solar power satellite indicate that the baseline rectenna subsystem will be between 25 to 40 percent of the system cost. The directional receiving elements and element extensions were studied, along with power combining evaluation and evaluation extensions.
Tie Points Extraction for SAR Images Based on Differential Constraints
NASA Astrophysics Data System (ADS)
Xiong, X.; Jin, G.; Xu, Q.; Zhang, H.
2018-04-01
Automatically extracting tie points (TPs) on large-size synthetic aperture radar (SAR) images is still challenging because the efficiency and correct ratio of the image matching need to be improved. This paper proposes an automatic TPs extraction method based on differential constraints for large-size SAR images obtained from approximately parallel tracks, between which the relative geometric distortions are small in azimuth direction and large in range direction. Image pyramids are built firstly, and then corresponding layers of pyramids are matched from the top to the bottom. In the process, the similarity is measured by the normalized cross correlation (NCC) algorithm, which is calculated from a rectangular window with the long side parallel to the azimuth direction. False matches are removed by the differential constrained random sample consensus (DC-RANSAC) algorithm, which appends strong constraints in azimuth direction and weak constraints in range direction. Matching points in the lower pyramid images are predicted with the local bilinear transformation model in range direction. Experiments performed on ENVISAT ASAR and Chinese airborne SAR images validated the efficiency, correct ratio and accuracy of the proposed method.
Wave Number Selection for Incompressible Parallel Jet Flows Periodic in Space
NASA Technical Reports Server (NTRS)
Miles, Jeffrey Hilton
1997-01-01
The temporal instability of a spatially periodic parallel flow of an incompressible inviscid fluid for various jet velocity profiles is studied numerically using Floquet Analysis. The transition matrix at the end of a period is evaluated by direct numerical integration. For verification, a method based on approximating a continuous function by a series of step functions was used. Unstable solutions were found only over a limited range of wave numbers and have a band type structure. The results obtained are analogous to the behavior observed in systems exhibiting complexity at the edge of order and chaos.
Parallel spatial direct numerical simulations on the Intel iPSC/860 hypercube
NASA Technical Reports Server (NTRS)
Joslin, Ronald D.; Zubair, Mohammad
1993-01-01
The implementation and performance of a parallel spatial direct numerical simulation (PSDNS) approach on the Intel iPSC/860 hypercube is documented. The direct numerical simulation approach is used to compute spatially evolving disturbances associated with the laminar-to-turbulent transition in boundary-layer flows. The feasibility of using the PSDNS on the hypercube to perform transition studies is examined. The results indicate that the direct numerical simulation approach can effectively be parallelized on a distributed-memory parallel machine. By increasing the number of processors nearly ideal linear speedups are achieved with nonoptimized routines; slower than linear speedups are achieved with optimized (machine dependent library) routines. This slower than linear speedup results because the Fast Fourier Transform (FFT) routine dominates the computational cost and because the routine indicates less than ideal speedups. However with the machine-dependent routines the total computational cost decreases by a factor of 4 to 5 compared with standard FORTRAN routines. The computational cost increases linearly with spanwise wall-normal and streamwise grid refinements. The hypercube with 32 processors was estimated to require approximately twice the amount of Cray supercomputer single processor time to complete a comparable simulation; however it is estimated that a subgrid-scale model which reduces the required number of grid points and becomes a large-eddy simulation (PSLES) would reduce the computational cost and memory requirements by a factor of 10 over the PSDNS. This PSLES implementation would enable transition simulations on the hypercube at a reasonable computational cost.
Magnetic intermittency of solar wind turbulence in the dissipation range
NASA Astrophysics Data System (ADS)
Pei, Zhongtian; He, Jiansen; Tu, Chuanyi; Marsch, Eckart; Wang, Linghua
2016-04-01
The feature, nature, and fate of intermittency in the dissipation range are an interesting topic in the solar wind turbulence. We calculate the distribution of flatness for the magnetic field fluctuations as a functionof angle and scale. The flatness distribution shows a "butterfly" pattern, with two wings located at angles parallel/anti-parallel to local mean magnetic field direction and main body located at angles perpendicular to local B0. This "butterfly" pattern illustrates that the flatness profile in (anti-) parallel direction approaches to the maximum value at larger scale and drops faster than that in perpendicular direction. The contours for probability distribution functions at different scales illustrate a "vase" pattern, more clear in parallel direction, which confirms the scale-variation of flatness and indicates the intermittency generation and dissipation. The angular distribution of structure function in the dissipation range shows an anisotropic pattern. The quasi-mono-fractal scaling of structure function in the dissipation range is also illustrated and investigated with the mathematical model for inhomogeneous cascading (extended p-model). Different from the inertial range, the extended p-model for the dissipation range results in approximate uniform fragmentation measure. However, more complete mathematicaland physical model involving both non-uniform cascading and dissipation is needed. The nature of intermittency may be strong structures or large amplitude fluctuations, which may be tested with magnetic helicity. In one case study, we find the heating effect in terms of entropy for large amplitude fluctuations seems to be more obvious than strong structures.
Domain decomposition methods in aerodynamics
NASA Technical Reports Server (NTRS)
Venkatakrishnan, V.; Saltz, Joel
1990-01-01
Compressible Euler equations are solved for two-dimensional problems by a preconditioned conjugate gradient-like technique. An approximate Riemann solver is used to compute the numerical fluxes to second order accuracy in space. Two ways to achieve parallelism are tested, one which makes use of parallelism inherent in triangular solves and the other which employs domain decomposition techniques. The vectorization/parallelism in triangular solves is realized by the use of a recording technique called wavefront ordering. This process involves the interpretation of the triangular matrix as a directed graph and the analysis of the data dependencies. It is noted that the factorization can also be done in parallel with the wave front ordering. The performances of two ways of partitioning the domain, strips and slabs, are compared. Results on Cray YMP are reported for an inviscid transonic test case. The performances of linear algebra kernels are also reported.
Three-Dimensional High-Lift Analysis Using a Parallel Unstructured Multigrid Solver
NASA Technical Reports Server (NTRS)
Mavriplis, Dimitri J.
1998-01-01
A directional implicit unstructured agglomeration multigrid solver is ported to shared and distributed memory massively parallel machines using the explicit domain-decomposition and message-passing approach. Because the algorithm operates on local implicit lines in the unstructured mesh, special care is required in partitioning the problem for parallel computing. A weighted partitioning strategy is described which avoids breaking the implicit lines across processor boundaries, while incurring minimal additional communication overhead. Good scalability is demonstrated on a 128 processor SGI Origin 2000 machine and on a 512 processor CRAY T3E machine for reasonably fine grids. The feasibility of performing large-scale unstructured grid calculations with the parallel multigrid algorithm is demonstrated by computing the flow over a partial-span flap wing high-lift geometry on a highly resolved grid of 13.5 million points in approximately 4 hours of wall clock time on the CRAY T3E.
EFFECT OF MASSIVE NEUTRON EXPOSURE ON THE DISTORTION OF REACTOR GRAPHITE
DOE Office of Scientific and Technical Information (OSTI.GOV)
Helm, J.W.; Davidson, J.M.
1963-05-28
Distortion of reactor-grade graphites was studied at varying neutron exposures ranging up to 14 x 10/sup 21/ neutrons per cm/sup 2/ (nvt)/sup */ at temperatures of irradiation ranging from 425 to 800 deg C. This exposure level corresponds to approximately 100,000 megawatt days per adjacent ton of fuel (Mwd/ At) in a graphite-moderated reactor. A conventionalcoke graphite, CSF, and two needle-coke graphites, NC-7 and NC-8, were studied. At all temperatures of irradiation the contraction rate of the samples cut parallel to the extrusion axis increased with increasing neutron exposure. For parallel samples the needle- coke graphites and the CSF graphitemore » contracted approximately the same amount. In the transverse direction the rate of cortraction at the higher irradiation temperntures appeared to be decreasing. Volume contractions derived from the linear contractions are discussed. (auth)« less
NASA Astrophysics Data System (ADS)
Yin, An; Pappalardo, Robert T.
2015-11-01
Despite a decade of intense research the mechanical origin of the tiger-stripe fractures (TSF) and their geologic relationship to the hosting South Polar Terrain (SPT) of Enceladus remain poorly understood. Here we show via systematic photo-geological mapping that the semi-squared SPT is bounded by right-slip, left-slip, extensional, and contractional zones on its four edges. Discrete deformation along the edges in turn accommodates translation of the SPT as a single sheet with its transport direction parallel to the regional topographic gradient. This parallel relationship implies that the gradient of gravitational potential energy drove the SPT motion. In map view, internal deformation of the SPT is expressed by distributed right-slip shear parallel to the SPT transport direction. The broad right-slip shear across the whole SPT was facilitated by left-slip bookshelf faulting along the parallel TSF. We suggest that the flow-like tectonics, to the first approximation across the SPT on Enceladus, is best explained by the occurrence of a transient thermal event, which allowed the release of gravitational potential energy via lateral viscous flow within the thermally weakened ice shell.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Anovitz, Lawrence; Mamontov, Eugene; Ishai, Paul ben
2013-01-01
The properties of fluids can be significantly altered by the geometry of their confining environments. While there has been significant work on the properties of such confined fluids, the properties of fluids under ultraconfinement, environments where, at least in one plane, the dimensions of the confining environment are similar to that of the confined molecule, have not been investigated. This paper investigates the dynamic properties of water in beryl (Be3Al2Si6O18), the structure of which contains approximately 5-A-diam channels parallel to the c axis. Three techniques, inelastic neutron scattering, quasielastic neutron scattering, and dielectric spectroscopy, have been used to quantify thesemore » properties over a dynamic range covering approximately 16 orders of magnitude. Because beryl can be obtained in large single crystals we were able to quantify directional variations, perpendicular and parallel to the channel directions, in the dynamics of the confined fluid. These are significantly anisotropic and, somewhat counterintuitively, show that vibrations parallel to the c-axis channels are significantly more hindered than those perpendicular to the channels. The effective potential for vibrations in the c direction is harder than the potential in directions perpendicular to it. There is evidence of single-file diffusion of water molecules along the channels at higher temperatures, but below 150 K this diffusion is strongly suppressed. No such suppression, however, has been observed in the channel-perpendicular direction. Inelastic neutron scattering spectra include an intramolecular stretching O-H peak at 465 meV. As this is nearly coincident with that known for free water molecules and approximately 30 meV higher than that in liquid water or ice, this suggests that there is no hydrogen bonding constraining vibrations between the channel water and the beryl structure. However, dielectric spectroscopic measurements at higher temperatures and lower frequencies yield an activation energy for the dipole reorientation of 16.4 0.14 kJ/mol, close to the energy required to break a hydrogen bond in bulk water. This may suggest the presence of some other form of bonding between the water molecules and the structure, but the resolution of the apparent contradiction between the inelastic neutron and dielectric spectroscopic results remains uncertain.« less
Ji, Hong-Mei; Zhang, Wen-Qian; Wang, Xu; Li, Xiao-Wu
2015-01-01
The three-point bending strength and fracture behavior of single oriented crossed-lamellar structure in Scapharca broughtonii shell were investigated. The samples for bending tests were prepared with two different orientations perpendicular and parallel to the radial ribs of the shell, which corresponds to the tiled and stacked directions of the first-order lamellae, respectively. The bending strength in the tiled direction is approximately 60% higher than that in the stacked direction, primarily because the regularly staggered arrangement of the second-order lamellae in the tiled direction can effectively hinder the crack propagation, whereas the cracks can easily propagate along the interfaces between lamellae in the stacked direction. PMID:28793557
Effect of magnetic pulses on Caribbean spiny lobsters: implications for magnetoreception.
Ernst, David A; Lohmann, Kenneth J
2016-06-15
The Caribbean spiny lobster, Panulirus argus, is a migratory crustacean that uses Earth's magnetic field as a navigational cue, but how these lobsters detect magnetic fields is not known. Magnetic material thought to be magnetite has previously been detected in spiny lobsters, but its role in magnetoreception, if any, remains unclear. As a first step toward investigating whether lobsters might have magnetite-based magnetoreceptors, we subjected lobsters to strong, pulsed magnetic fields capable of reversing the magnetic dipole moment of biogenic magnetite crystals. Lobsters were subjected to a single pulse directed from posterior to anterior and either: (1) parallel to the horizontal component of the geomagnetic field (i.e. toward magnetic north); or (2) antiparallel to the horizontal field (i.e. toward magnetic south). An additional control group was handled but not subjected to a magnetic pulse. After treatment, each lobster was tethered in a water-filled arena located within 200 m of the capture location and allowed to walk in any direction. Control lobsters walked in seemingly random directions and were not significantly oriented as a group. In contrast, the two groups exposed to pulsed fields were significantly oriented in approximately opposite directions. Lobsters subjected to a magnetic pulse applied parallel to the geomagnetic horizontal component walked westward; those subjected to a pulse directed antiparallel to the geomagnetic horizontal component oriented approximately northeast. The finding that a magnetic pulse alters subsequent orientation behavior is consistent with the hypothesis that magnetoreception in spiny lobsters is based at least partly on magnetite-based magnetoreceptors. © 2016. Published by The Company of Biologists Ltd.
STOCHSIMGPU: parallel stochastic simulation for the Systems Biology Toolbox 2 for MATLAB.
Klingbeil, Guido; Erban, Radek; Giles, Mike; Maini, Philip K
2011-04-15
The importance of stochasticity in biological systems is becoming increasingly recognized and the computational cost of biologically realistic stochastic simulations urgently requires development of efficient software. We present a new software tool STOCHSIMGPU that exploits graphics processing units (GPUs) for parallel stochastic simulations of biological/chemical reaction systems and show that significant gains in efficiency can be made. It is integrated into MATLAB and works with the Systems Biology Toolbox 2 (SBTOOLBOX2) for MATLAB. The GPU-based parallel implementation of the Gillespie stochastic simulation algorithm (SSA), the logarithmic direct method (LDM) and the next reaction method (NRM) is approximately 85 times faster than the sequential implementation of the NRM on a central processing unit (CPU). Using our software does not require any changes to the user's models, since it acts as a direct replacement of the stochastic simulation software of the SBTOOLBOX2. The software is open source under the GPL v3 and available at http://www.maths.ox.ac.uk/cmb/STOCHSIMGPU. The web site also contains supplementary information. klingbeil@maths.ox.ac.uk Supplementary data are available at Bioinformatics online.
NASA Technical Reports Server (NTRS)
Kanemasu, E. T.; Asrar, Ghassem; Myneni, Ranga; Martin, Robert, Jr.; Burnett, R. Bruce
1987-01-01
Research activities for the following study areas are summarized: single scattering of parallel direct and axially symmetric diffuse solar radiation in vegetative canopies; the use of successive orders of scattering approximations (SOSA) for treating multiple scattering in a plant canopy; reflectance of a soybean canopy using the SOSA method; and C-band scatterometer measurements of the Konza tallgrass prairie.
Making almost commuting matrices commute
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hastings, Matthew B
Suppose two Hermitian matrices A, B almost commute ({parallel}[A,B]{parallel} {<=} {delta}). Are they close to a commuting pair of Hermitian matrices, A', B', with {parallel}A-A'{parallel},{parallel}B-B'{parallel} {<=} {epsilon}? A theorem of H. Lin shows that this is uniformly true, in that for every {epsilon} > 0 there exists a {delta} > 0, independent of the size N of the matrices, for which almost commuting implies being close to a commuting pair. However, this theorem does not specifiy how {delta} depends on {epsilon}. We give uniform bounds relating {delta} and {epsilon}. The proof is constructive, giving an explicit algorithm to construct A'more » and B'. We provide tighter bounds in the case of block tridiagonal and tridiagnonal matrices. Within the context of quantum measurement, this implies an algorithm to construct a basis in which we can make a projective measurement that approximately measures two approximately commuting operators simultaneously. Finally, we comment briefly on the case of approximately measuring three or more approximately commuting operators using POVMs (positive operator-valued measures) instead of projective measurements.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cevallos, F. Alex; Stolze, Karoline; Cava, Robert J.
The single crystal growth, structure, and basic magnetic properties of ErMgGaO 4 are reported. The structure consists of triangular layers of magnetic ErO 6 octahedra separated by a double layer of randomly occupied non-magnetic (Ga,Mg)O 5 bipyramids. The Er atoms are positionally disordered. Magnetic measurements parallel and perpendicular to the c axis of a single crystal reveal dominantly antiferromagnetic interactions, with a small degree of magnetic anisotropy. A weighted average of the directional data suggests an antiferromagnetic Curie Weiss temperature of approximately -30 K. Below 10 K the temperature dependences of the inverse susceptibilities in the in-plane and perpendicular-to planemore » directions are parallel, indicative of an isotropic magnetic moment at low temperatures. In conclusion, no sign of magnetic ordering is observed above 1.8 K, suggesting that ErMgGaO 4 is a geometrically frustrated magnet.« less
Cevallos, F. Alex; Stolze, Karoline; Cava, Robert J.
2018-03-23
The single crystal growth, structure, and basic magnetic properties of ErMgGaO 4 are reported. The structure consists of triangular layers of magnetic ErO 6 octahedra separated by a double layer of randomly occupied non-magnetic (Ga,Mg)O 5 bipyramids. The Er atoms are positionally disordered. Magnetic measurements parallel and perpendicular to the c axis of a single crystal reveal dominantly antiferromagnetic interactions, with a small degree of magnetic anisotropy. A weighted average of the directional data suggests an antiferromagnetic Curie Weiss temperature of approximately -30 K. Below 10 K the temperature dependences of the inverse susceptibilities in the in-plane and perpendicular-to planemore » directions are parallel, indicative of an isotropic magnetic moment at low temperatures. In conclusion, no sign of magnetic ordering is observed above 1.8 K, suggesting that ErMgGaO 4 is a geometrically frustrated magnet.« less
MLP: A Parallel Programming Alternative to MPI for New Shared Memory Parallel Systems
NASA Technical Reports Server (NTRS)
Taft, James R.
1999-01-01
Recent developments at the NASA AMES Research Center's NAS Division have demonstrated that the new generation of NUMA based Symmetric Multi-Processing systems (SMPs), such as the Silicon Graphics Origin 2000, can successfully execute legacy vector oriented CFD production codes at sustained rates far exceeding processing rates possible on dedicated 16 CPU Cray C90 systems. This high level of performance is achieved via shared memory based Multi-Level Parallelism (MLP). This programming approach, developed at NAS and outlined below, is distinct from the message passing paradigm of MPI. It offers parallelism at both the fine and coarse grained level, with communication latencies that are approximately 50-100 times lower than typical MPI implementations on the same platform. Such latency reductions offer the promise of performance scaling to very large CPU counts. The method draws on, but is also distinct from, the newly defined OpenMP specification, which uses compiler directives to support a limited subset of multi-level parallel operations. The NAS MLP method is general, and applicable to a large class of NASA CFD codes.
Aben, Ilse; Tanzi, Cristina P; Hartmann, Wouter; Stam, Daphne M; Stammes, Piet
2003-06-20
A method is presented for in-flight validation of space-based polarization measurements based on approximation of the direction of polarization of scattered sunlight by the Rayleigh single-scattering value. This approximation is verified by simulations of radiative transfer calculations for various atmospheric conditions. The simulations show locations along an orbit where the scattering geometries are such that the intensities of the parallel and orthogonal polarization components of the light are equal, regardless of the observed atmosphere and surface. The method can be applied to any space-based instrument that measures the polarization of reflected solar light. We successfully applied the method to validate the Global Ozone Monitoring Experiment (GOME) polarization measurements. The error in the GOME's three broadband polarization measurements appears to be approximately 1%.
A picoliter-volume mixer for microfluidic analytical systems.
He, B; Burke, B J; Zhang, X; Zhang, R; Regnier, F E
2001-05-01
Mixing confluent liquid streams is an important, but difficult operation in microfluidic systems. This paper reports the construction and characterization of a 100-pL mixer for liquids transported by electroosmotic flow. Mixing was achieved in a microfabricated device with multiple intersecting channels of varying lengths and a bimodal width distribution. All channels running parallel to the direction of flow were 5 microm in width whereas larger 27-microm-width channels ran back and forth through the parallel channel network at a 45 degrees angle. The channel network composing the mixer was approximately 10 microm deep. It was observed that little mixing of the confluent solvent streams occurred in the 100-microm-wide, 300-microm-long mixer inlet channel where mixing would be achieved almost exclusively by diffusion. In contrast, after passage through the channel network in the approximately 200-microm-length static mixer bed, mixing was complete as determined by confocal microscopy and CCD detection. Theoretical simulations were also performed in an attempt to describe the extent of mixing in microfabricated systems.
Design and energetic evaluation of a prosthetic knee joint actuator with a lockable parallel spring.
Geeroms, J; Flynn, L; Jimenez-Fabian, R; Vanderborght, B; Lefeber, D
2017-02-03
There are disadvantages to existing damping knee prostheses which cause an asymmetric gait and higher metabolic cost during level walking compared to non-amputees. Most existing active knee prostheses which could benefit the amputees use a significant amount of energy and require a considerable motor. In this work, a novel semi-active actuator with a lockable parallel spring for a prosthetic knee joint has been developed and tested. This actuator is able to provide an approximation of the behavior of a healthy knee during most of the gait cycle of level walking. This actuator is expanded with a series-elastic actuator to mimic the full gait cycle and enable its use in other functional tasks like stair climbing and sit-to-stance. The proposed novel actuator reduces the energy consumption for the same trajectory with respect to a compliant or directly-driven prosthetic active knee joint and improves the approximation of healthy knee behavior during level walking compared to passive or variable damping knee prostheses.
Parallelism Effects and Verb Activation: The Sustained Reactivation Hypothesis
ERIC Educational Resources Information Center
Callahan, Sarah M.; Shapiro, Lewis P.; Love, Tracy
2010-01-01
This study investigated the processes underlying parallelism by evaluating the activation of a parallel element (i.e., a verb) throughout "and"-coordinated sentences. Four points were tested: (1) approximately 1,600ms after the verb in the first conjunct (PP1), (2) immediately following the conjunction (PP2), (3) approximately 1,100ms after the…
NASA Astrophysics Data System (ADS)
Palmesi, P.; Abert, C.; Bruckner, F.; Suess, D.
2018-05-01
Fast stray field calculation is commonly considered of great importance for micromagnetic simulations, since it is the most time consuming part of the simulation. The Fast Multipole Method (FMM) has displayed linear O(N) parallelization behavior on many cores. This article investigates the error of a recent FMM approach approximating sources using linear—instead of constant—finite elements in the singular integral for calculating the stray field and the corresponding potential. After measuring performance in an earlier manuscript, this manuscript investigates the convergence of the relative L2 error for several FMM simulation parameters. Various scenarios either calculating the stray field directly or via potential are discussed.
2013-02-01
November 2012 phosphorus, and vitamin B12. Additionally a reductant reacts directly with hexavalent chromium to reduce it to the trivalent state. SRS...being operated under continuous flow conditions in the laboratory. Entire assembly takes up approximately 5 sq. ft. in a fume hood. . 44 Figure 5-3...61 Figure 5-16. Hexavalent Chromium detected in ISMA effluent post in situ incubation
Iterative methods for 3D implicit finite-difference migration using the complex Padé approximation
NASA Astrophysics Data System (ADS)
Costa, Carlos A. N.; Campos, Itamara S.; Costa, Jessé C.; Neto, Francisco A.; Schleicher, Jörg; Novais, Amélia
2013-08-01
Conventional implementations of 3D finite-difference (FD) migration use splitting techniques to accelerate performance and save computational cost. However, such techniques are plagued with numerical anisotropy that jeopardises the correct positioning of dipping reflectors in the directions not used for the operator splitting. We implement 3D downward continuation FD migration without splitting using a complex Padé approximation. In this way, the numerical anisotropy is eliminated at the expense of a computationally more intensive solution of a large-band linear system. We compare the performance of the iterative stabilized biconjugate gradient (BICGSTAB) and that of the multifrontal massively parallel direct solver (MUMPS). It turns out that the use of the complex Padé approximation not only stabilizes the solution, but also acts as an effective preconditioner for the BICGSTAB algorithm, reducing the number of iterations as compared to the implementation using the real Padé expansion. As a consequence, the iterative BICGSTAB method is more efficient than the direct MUMPS method when solving a single term in the Padé expansion. The results of both algorithms, here evaluated by computing the migration impulse response in the SEG/EAGE salt model, are of comparable quality.
Hong, Ie-Hong; Yen, Shang-Chieh; Lin, Fu-Shiang
2009-08-17
A well-ordered two-dimensional (2D) network consisting of two crossed Au silicide nanowire (NW) arrays is self-organized on a Si(110)-16 x 2 surface by the direct-current heating of approximately 1.5 monolayers of Au on the surface at 1100 K. Such a highly regular crossbar nanomesh exhibits both a perfect long-range spatial order and a high integration density over a mesoscopic area, and these two self-ordering crossed arrays of parallel-aligned NWs have distinctly different sizes and conductivities. NWs are fabricated with widths and pitches as small as approximately 2 and approximately 5 nm, respectively. The difference in the conductivities of two crossed-NW arrays opens up the possibility for their utilization in nanodevices of crossbar architecture. Scanning tunneling microscopy/spectroscopy studies show that the 2D self-organization of this perfect Au silicide nanomesh can be achieved through two different directional electromigrations of Au silicide NWs along different orientations of two nonorthogonal 16 x 2 domains, which are driven by the electrical field of direct-current heating. Prospects for this Au silicide nanomesh are also discussed.
Enhanced directional second harmonic radiation via nonlinear interference in 1D metamaterials
NASA Astrophysics Data System (ADS)
Guo, B. S.; Loo, Y. L.; Zhao, Q.; Ong, C. K.
2018-06-01
By using a one-dimensional nonlinear metamaterial in the experiment, we achieve a directional second harmonic radiation via nonlinear interference at approximately 2.5 GHz. Each meta-atom has the structure of coupled split-ring resonators and two varactors arranged parallel (symmetric) or antiparallel (antisymmetric) to each other. With an incident power of approximately ‑2.7 dBm, the power of the emitted directional wave from the sample is at the scale of nanowatt. This relatively high magnitude of directional nonlinear power is the result of the 1D metamaterial abilities in exhibiting nonlinear magnetoelectric coupling, as well as supporting an electric dipole or magnetic dipole resonance within a narrow second harmonic frequency range.
Chahl, J S
2014-01-20
This paper describes an application for arrays of narrow-field-of-view sensors with parallel optical axes. These devices exhibit some complementary characteristics with respect to conventional perspective projection or angular projection imaging devices. Conventional imaging devices measure rotational egomotion directly by measuring the angular velocity of the projected image. Translational egomotion cannot be measured directly by these devices because the induced image motion depends on the unknown range of the viewed object. On the other hand, a known translational motion generates image velocities which can be used to recover the ranges of objects and hence the three-dimensional (3D) structure of the environment. A new method is presented for computing egomotion and range using the properties of linear arrays of independent narrow-field-of-view optical sensors. An approximate parallel projection can be used to measure translational egomotion in terms of the velocity of the image. On the other hand, a known rotational motion of the paraxial sensor array generates image velocities, which can be used to recover the 3D structure of the environment. Results of tests of an experimental array confirm these properties.
Conductance spectra of asymmetric ferromagnet/ferromagnet/ferromagnet junctions
NASA Astrophysics Data System (ADS)
Pasanai, K.
2017-01-01
A theory of tunneling spectroscopy of ferromagnet/ferromagnet/ferromagnet junctions was studied. We applied a delta-functional approximation for the interface scattering properties under a one-dimensional system of a free electron approach. The reflection and transmission probabilities were calculated in the ballistic regime, and the conductance spectra were then calculated using the Landauer formulation. The magnetization directions were set to be either parallel (P) or anti-parallel (AP) alignments, for comparison. We found that the conductance spectra was suppressed when increasing the interfacial scattering at the interfaces. Moreover, the electron could exhibit direct transmission when the thickness was rather thin. Thus, there was no oscillation in this case. However, in the case of a thick layer the conductance spectra oscillated, and this oscillation was most prominent when the middle layer thickness increased. In the case of direct transmission, the conductance spectra of P and AP systems were definitely suppressed with increased exchange energy of the middle ferromagnet. This also refers to an increase in the magnetoresistance of the junction. In the case of oscillatory behavior, the positions of the resonance peaks were changed as the exchange energy was changed.
NASA Astrophysics Data System (ADS)
Ma, Sangback
In this paper we compare various parallel preconditioners such as Point-SSOR (Symmetric Successive OverRelaxation), ILU(0) (Incomplete LU) in the Wavefront ordering, ILU(0) in the Multi-color ordering, Multi-Color Block SOR (Successive OverRelaxation), SPAI (SParse Approximate Inverse) and pARMS (Parallel Algebraic Recursive Multilevel Solver) for solving large sparse linear systems arising from two-dimensional PDE (Partial Differential Equation)s on structured grids. Point-SSOR is well-known, and ILU(0) is one of the most popular preconditioner, but it is inherently serial. ILU(0) in the Wavefront ordering maximizes the parallelism in the natural order, but the lengths of the wave-fronts are often nonuniform. ILU(0) in the Multi-color ordering is a simple way of achieving a parallelism of the order N, where N is the order of the matrix, but its convergence rate often deteriorates as compared to that of natural ordering. We have chosen the Multi-Color Block SOR preconditioner combined with direct sparse matrix solver, since for the Laplacian matrix the SOR method is known to have a nondeteriorating rate of convergence when used with the Multi-Color ordering. By using block version we expect to minimize the interprocessor communications. SPAI computes the sparse approximate inverse directly by least squares method. Finally, ARMS is a preconditioner recursively exploiting the concept of independent sets and pARMS is the parallel version of ARMS. Experiments were conducted for the Finite Difference and Finite Element discretizations of five two-dimensional PDEs with large meshsizes up to a million on an IBM p595 machine with distributed memory. Our matrices are real positive, i. e., their real parts of the eigenvalues are positive. We have used GMRES(m) as our outer iterative method, so that the convergence of GMRES(m) for our test matrices are mathematically guaranteed. Interprocessor communications were done using MPI (Message Passing Interface) primitives. The results show that in general ILU(0) in the Multi-Color ordering ahd ILU(0) in the Wavefront ordering outperform the other methods but for symmetric and nearly symmetric 5-point matrices Multi-Color Block SOR gives the best performance, except for a few cases with a small number of processors.
GRAVIDY, a GPU modular, parallel direct-summation N-body integrator: dynamics with softening
NASA Astrophysics Data System (ADS)
Maureira-Fredes, Cristián; Amaro-Seoane, Pau
2018-01-01
A wide variety of outstanding problems in astrophysics involve the motion of a large number of particles under the force of gravity. These include the global evolution of globular clusters, tidal disruptions of stars by a massive black hole, the formation of protoplanets and sources of gravitational radiation. The direct-summation of N gravitational forces is a complex problem with no analytical solution and can only be tackled with approximations and numerical methods. To this end, the Hermite scheme is a widely used integration method. With different numerical techniques and special-purpose hardware, it can be used to speed up the calculations. But these methods tend to be computationally slow and cumbersome to work with. We present a new graphics processing unit (GPU), direct-summation N-body integrator written from scratch and based on this scheme, which includes relativistic corrections for sources of gravitational radiation. GRAVIDY has high modularity, allowing users to readily introduce new physics, it exploits available computational resources and will be maintained by regular updates. GRAVIDY can be used in parallel on multiple CPUs and GPUs, with a considerable speed-up benefit. The single-GPU version is between one and two orders of magnitude faster than the single-CPU version. A test run using four GPUs in parallel shows a speed-up factor of about 3 as compared to the single-GPU version. The conception and design of this first release is aimed at users with access to traditional parallel CPU clusters or computational nodes with one or a few GPU cards.
Modeling of hydride precipitation and re-orientation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tikare, Veena; Weck, Philippe F.; Mitchell, John Anthony
In this report, we present a thermodynamic-based model of hydride precipitation in Zr-based claddings. The model considers the state of the cladding immediately following drying, after removal from cooling-pools, and presents the evolution of precipitate formation upon cooling as follows: The pilgering process used to form Zr-based cladding imparts strong crystallographic and grain shape texture, with the basal plane of the hexagonal α-Zr grains being strongly aligned in the rolling-direction and the grains are elongated with grain size being approximately twice as long parallel to the rolling direction, which is also the long axis of the tubular cladding, as itmore » is in the orthogonal directions.« less
"Tools For Analysis and Visualization of Large Time- Varying CFD Data Sets"
NASA Technical Reports Server (NTRS)
Wilhelms, Jane; vanGelder, Allen
1999-01-01
During the four years of this grant (including the one year extension), we have explored many aspects of the visualization of large CFD (Computational Fluid Dynamics) datasets. These have included new direct volume rendering approaches, hierarchical methods, volume decimation, error metrics, parallelization, hardware texture mapping, and methods for analyzing and comparing images. First, we implemented an extremely general direct volume rendering approach that can be used to render rectilinear, curvilinear, or tetrahedral grids, including overlapping multiple zone grids, and time-varying grids. Next, we developed techniques for associating the sample data with a k-d tree, a simple hierarchial data model to approximate samples in the regions covered by each node of the tree, and an error metric for the accuracy of the model. We also explored a new method for determining the accuracy of approximate models based on the light field method described at ACM SIGGRAPH (Association for Computing Machinery Special Interest Group on Computer Graphics) '96. In our initial implementation, we automatically image the volume from 32 approximately evenly distributed positions on the surface of an enclosing tessellated sphere. We then calculate differences between these images under different conditions of volume approximation or decimation.
Modeling Sound Propagation Through Non-Axisymmetric Jets
NASA Technical Reports Server (NTRS)
Leib, Stewart J.
2014-01-01
A method for computing the far-field adjoint Green's function of the generalized acoustic analogy equations under a locally parallel mean flow approximation is presented. The method is based on expanding the mean-flow-dependent coefficients in the governing equation and the scalar Green's function in truncated Fourier series in the azimuthal direction and a finite difference approximation in the radial direction in circular cylindrical coordinates. The combined spectral/finite difference method yields a highly banded system of algebraic equations that can be efficiently solved using a standard sparse system solver. The method is applied to test cases, with mean flow specified by analytical functions, corresponding to two noise reduction concepts of current interest: the offset jet and the fluid shield. Sample results for the Green's function are given for these two test cases and recommendations made as to the use of the method as part of a RANS-based jet noise prediction code.
Magnetic spectral signatures in the Earth's magnetosheath and plasma depletion layer
NASA Technical Reports Server (NTRS)
Anderson, Brian J.; Fuselier, Stephen A.; Gary, S. Peter; Denton, Richard E.
1994-01-01
Correlations between plasma properties and magnetic fluctuations in the sub-solar magnetosheath downstream of a quasi-perpendicular shock have been found and indicate that mirror and ion cyclotronlike fluctuations correlate with the magnetosheath proper and plasma depletion layer, respectively (Anderson and Fueselier, 1993). We explore the entire range of magnetic spectral signatures observed from the Active Magnetospheric Particle Tracer Explorers/Charge Composition Explorer (AMPTE/CCE)spacecraft in the magnetosheath downstream of a quasi-perpendicular shock. The magnetic spectral signatures typically progress from predominantly compressional fluctuations,delta B(sub parallel)/delta B perpendicular to approximately 3, with F/F (sub p) less than 0.2 (F and F (sub p) are the wave frequency and proton gyrofrequency, respectively) to predominantly transverse fluctuations, delta B(sub parallel)/delta B perpendicular to approximately 0.3, extending up to F(sub p). The compressional fluctuations are characterized by anticorrelation between the field magnitude and electron density, n(sub e), and by a small compressibility, C(sub e) identically equal to (delta n(sub e)/n(sub e)) (exp 2) (B/delta B(sub parallel)) (exp 2) approximately 0.13, indicative of mirror waves. The spectral characteristics of the transverse fluctuations are in agreement with predictions of linear Vlasov theory for the H(+) and He(2+) cyclotron modes. The power spectra and local plasma parameters are found to vary in concert: mirror waves occur for beta(s ub parallel p) (beta (sub parallel p) identically = 2 mu(sub zero) n(sub p) kT (sub parallel p) / B(exp 2) approximately = 2, A(sub p) indentically = T(sub perpendicular to p)/T(sub parallel p) - 1 approximately = 0.4, whereas cyclotron waves occur for beta (sub parallel p) approximately = 0.2 and A(sub p) approximately = 2. The transition from mirror to cyclotron modes is predicted by linear theory. The spectral characteristics overlap for intermediate plasma parameters. The plasma observations are described by A(sub p) = 0.85 beta(sub parallel P) (exp - 0.48) with a log regression coefficient of -0.74. This inverse A(sub p) - beta(sub parallel p) correlation corresponds closely to the isocontours of maximum ion anisotropy instability growth, gamma (sub m)/omega(sub p) = 0.01, for the mirror and cyclotron modes. The agreement of observed properties and predictions of local theory suggests that the spectral signatures reflect the local plasma environment and that the anisotropy instabilities regulate A(sub p). We suggest that the spectral characteristics may provide a useful basis for ordering observations in the magnetosheath and that the A(sub p) - beta(sub parallel p) inverse correlation may be used as a beta-dependent upper limit on the proton anisotropy to represent kinetic effects.
NASA Astrophysics Data System (ADS)
Huang, Na; Liu, Richeng; Jiang, Yujing; Li, Bo; Yu, Liyuan
2018-03-01
While shear-flow behavior through fractured media has been so far studied at single fracture scale, a numerical analysis of the shear effect on the hydraulic response of 3D crossed fracture model is presented. The analysis was based on a series of crossed fracture models, in which the effects of fracture surface roughness and shear displacement were considered. The rough fracture surfaces were generated using the modified successive random additions (SRA) algorithm. The shear displacement was applied on one fracture, and at the same time another fracture shifted along with the upper and lower surfaces of the sheared fracture. The simulation results reveal the development and variation of preferential flow paths through the model during the shear, accompanied by the change of the flow rate ratios between two flow planes at the outlet boundary. The average contact area accounts for approximately 5-27% of the fracture planes during shear, but the actual calculated flow area is about 38-55% of the fracture planes, which is much smaller than the noncontact area. The equivalent permeability will either increase or decrease as shear displacement increases from 0 to 4 mm, depending on the aperture distribution of intersection part between two fractures. When the shear displacement continuously increases by up to 20 mm, the equivalent permeability increases sharply first, and then keeps increasing with a lower gradient. The equivalent permeability of rough fractured model is about 26-80% of that calculated from the parallel plate model, and the equivalent permeability in the direction perpendicular to shear direction is approximately 1.31-3.67 times larger than that in the direction parallel to shear direction. These results can provide a fundamental understanding of fluid flow through crossed fracture model under shear.
Optimizing Approximate Weighted Matching on Nvidia Kepler K40
DOE Office of Scientific and Technical Information (OSTI.GOV)
Naim, Md; Manne, Fredrik; Halappanavar, Mahantesh
Matching is a fundamental graph problem with numerous applications in science and engineering. While algorithms for computing optimal matchings are difficult to parallelize, approximation algorithms on the other hand generally compute high quality solutions and are amenable to parallelization. In this paper, we present efficient implementations of the current best algorithm for half-approximate weighted matching, the Suitor algorithm, on Nvidia Kepler K-40 platform. We develop four variants of the algorithm that exploit hardware features to address key challenges for a GPU implementation. We also experiment with different combinations of work assigned to a warp. Using an exhaustive set ofmore » $269$ inputs, we demonstrate that the new implementation outperforms the previous best GPU algorithm by $10$ to $$100\\times$$ for over $100$ instances, and from $100$ to $$1000\\times$$ for $15$ instances. We also demonstrate up to $$20\\times$$ speedup relative to $2$ threads, and up to $$5\\times$$ relative to $16$ threads on Intel Xeon platform with $16$ cores for the same algorithm. The new algorithms and implementations provided in this paper will have a direct impact on several applications that repeatedly use matching as a key compute kernel. Further, algorithm designs and insights provided in this paper will benefit other researchers implementing graph algorithms on modern GPU architectures.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Haut, T. S.; Babb, T.; Martinsson, P. G.
2015-06-16
Our manuscript demonstrates a technique for efficiently solving the classical wave equation, the shallow water equations, and, more generally, equations of the form ∂u/∂t=Lu∂u/∂t=Lu, where LL is a skew-Hermitian differential operator. The idea is to explicitly construct an approximation to the time-evolution operator exp(τL)exp(τL) for a relatively large time-step ττ. Recently developed techniques for approximating oscillatory scalar functions by rational functions, and accelerated algorithms for computing functions of discretized differential operators are exploited. Principal advantages of the proposed method include: stability even for large time-steps, the possibility to parallelize in time over many characteristic wavelengths and large speed-ups over existingmore » methods in situations where simulation over long times are required. Numerical examples involving the 2D rotating shallow water equations and the 2D wave equation in an inhomogenous medium are presented, and the method is compared to the 4th order Runge–Kutta (RK4) method and to the use of Chebyshev polynomials. The new method achieved high accuracy over long-time intervals, and with speeds that are orders of magnitude faster than both RK4 and the use of Chebyshev polynomials.« less
NASA Technical Reports Server (NTRS)
1976-01-01
The structure and direction of bow shock waves and the occurence of Pc 3, 4 micropulsations were investigated. An observational description is given of a quasi-parallel structure in a plasma parameter regime. The use of approximation to estimate the thickness of thin, nearly perpendicular bow shocks at supralaminar Mach numbers is discussed. The pattern of energies of backstreaming protons in the foreshock are predicted.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ghysels, Pieter; Li, Xiaoye S.; Rouet, Francois -Henry
Here, we present a sparse linear system solver that is based on a multifrontal variant of Gaussian elimination and exploits low-rank approximation of the resulting dense frontal matrices. We use hierarchically semiseparable (HSS) matrices, which have low-rank off-diagonal blocks, to approximate the frontal matrices. For HSS matrix construction, a randomized sampling algorithm is used together with interpolative decompositions. The combination of the randomized compression with a fast ULV HSS factoriz ation leads to a solver with lower computational complexity than the standard multifrontal method for many applications, resulting in speedups up to 7 fold for problems in our test suite.more » The implementation targets many-core systems by using task parallelism with dynamic runtime scheduling. Numerical experiments show performance improvements over state-of-the-art sparse direct solvers. The implementation achieves high performance and good scalability on a range of modern shared memory parallel systems, including the Intel Xeon Phi (MIC). The code is part of a software package called STRUMPACK - STRUctured Matrices PACKage, which also has a distributed memory component for dense rank-structured matrices.« less
Ghysels, Pieter; Li, Xiaoye S.; Rouet, Francois -Henry; ...
2016-10-27
Here, we present a sparse linear system solver that is based on a multifrontal variant of Gaussian elimination and exploits low-rank approximation of the resulting dense frontal matrices. We use hierarchically semiseparable (HSS) matrices, which have low-rank off-diagonal blocks, to approximate the frontal matrices. For HSS matrix construction, a randomized sampling algorithm is used together with interpolative decompositions. The combination of the randomized compression with a fast ULV HSS factoriz ation leads to a solver with lower computational complexity than the standard multifrontal method for many applications, resulting in speedups up to 7 fold for problems in our test suite.more » The implementation targets many-core systems by using task parallelism with dynamic runtime scheduling. Numerical experiments show performance improvements over state-of-the-art sparse direct solvers. The implementation achieves high performance and good scalability on a range of modern shared memory parallel systems, including the Intel Xeon Phi (MIC). The code is part of a software package called STRUMPACK - STRUctured Matrices PACKage, which also has a distributed memory component for dense rank-structured matrices.« less
Ho, ThienLuan; Oh, Seung-Rohk
2017-01-01
Approximate string matching with k-differences has a number of practical applications, ranging from pattern recognition to computational biology. This paper proposes an efficient memory-access algorithm for parallel approximate string matching with k-differences on Graphics Processing Units (GPUs). In the proposed algorithm, all threads in the same GPUs warp share data using warp-shuffle operation instead of accessing the shared memory. Moreover, we implement the proposed algorithm by exploiting the memory structure of GPUs to optimize its performance. Experiment results for real DNA packages revealed that the performance of the proposed algorithm and its implementation archived up to 122.64 and 1.53 times compared to that of sequential algorithm on CPU and previous parallel approximate string matching algorithm on GPUs, respectively. PMID:29016700
On the parallel solution of parabolic equations
NASA Technical Reports Server (NTRS)
Gallopoulos, E.; Saad, Youcef
1989-01-01
Parallel algorithms for the solution of linear parabolic problems are proposed. The first of these methods is based on using polynomial approximation to the exponential. It does not require solving any linear systems and is highly parallelizable. The two other methods proposed are based on Pade and Chebyshev approximations to the matrix exponential. The parallelization of these methods is achieved by using partial fraction decomposition techniques to solve the resulting systems and thus offers the potential for increased time parallelism in time dependent problems. Experimental results from the Alliant FX/8 and the Cray Y-MP/832 vector multiprocessors are also presented.
Impact of local diffusion on macroscopic dispersion in three-dimensional porous media
NASA Astrophysics Data System (ADS)
Dartois, Arthur; Beaudoin, Anthony; Huberson, Serge
2018-02-01
While macroscopic longitudinal and transverse dispersion in three-dimensional porous media has been simulated previously mostly under purely advective conditions, the impact of diffusion on macroscopic dispersion in 3D remains an open question. Furthermore, both in 2D and 3D, recurring difficulties have been encountered due to computer limitation or analytical approximation. In this work, we use the Lagrangian velocity covariance function and the temporal derivative of second-order moments to study the influence of diffusion on dispersion in highly heterogeneous 2D and 3D porous media. The first approach characterizes the correlation between the values of Eulerian velocity components sampled by particles undergoing diffusion at two times. The second approach allows the estimation of dispersion coefficients and the analysis of their behaviours as functions of diffusion. These two approaches allowed us to reach new results. The influence of diffusion on dispersion seems to be globally similar between highly heterogeneous 2D and 3D porous media. Diffusion induces a decrease in the dispersion in the direction parallel to the flow direction and an increase in the dispersion in the direction perpendicular to the flow direction. However, the amplification of these two effects with the permeability variance is clearly different between 2D and 3D. For the direction parallel to the flow direction, the amplification is more important in 3D than in 2D. It is reversed in the direction perpendicular to the flow direction.
The energetic ion signature of an O-type neutral line in the geomagnetic tail
NASA Technical Reports Server (NTRS)
Martin, R. F., Jr.; Johnson, D. F.; Speiser, T. W.
1991-01-01
An energetic ion signature is presented which has the potential for remote sensing of an O-type neutral line embedded in a current sheet. A source plasma with a tailward flowing Kappa distribution yields a strongly non-Kappa distribution after interacting with the neutral line: sharp jumps, or ridges, occur in the velocity space distribution function f(nu-perpendicular, nu-parallel) associated with both increases and decreases in f. The jumps occur when orbits are reversed in the x-direction: a reversal causing initially earthward particles (low probability in the source distribution) to be observed results in a decrease in f, while a reversal causing initially tailward particles to be observed produces an increase in f. The reversals, and hence the jumps, occur at approximately constant values of perpendicular velocity in both the positive nu parallel and negative nu parallel half planes. The results were obtained using single particle simulations in a fixed magnetic field model.
Integral manifolding structure for fuel cell core having parallel gas flow
Herceg, Joseph E.
1984-01-01
Disclosed herein are manifolding means for directing the fuel and oxidant gases to parallel flow passageways in a fuel cell core. Each core passageway is defined by electrolyte and interconnect walls. Each electrolyte and interconnect wall consists respectively of anode and cathode materials layered on the opposite sides of electrolyte material, or on the opposite sides of interconnect material. A core wall projects beyond the open ends of the defined core passageways and is disposed approximately midway between and parallel to the adjacent overlaying and underlying interconnect walls to define manifold chambers therebetween on opposite sides of the wall. Each electrolyte wall defining the flow passageways is shaped to blend into and be connected to this wall in order to redirect the corresponding fuel and oxidant passageways to the respective manifold chambers either above or below this intermediate wall. Inlet and outlet connections are made to these separate manifold chambers respectively, for carrying the fuel and oxidant gases to the core, and for carrying their reaction products away from the core.
Integral manifolding structure for fuel cell core having parallel gas flow
Herceg, J.E.
1983-10-12
Disclosed herein are manifolding means for directing the fuel and oxidant gases to parallel flow passageways in a fuel cell core. Each core passageway is defined by electrolyte and interconnect walls. Each electrolyte and interconnect wall consists respectively of anode and cathode materials layered on the opposite sides of electrolyte material, or on the opposite sides of interconnect material. A core wall projects beyond the open ends of the defined core passageways and is disposed approximately midway between and parallel to the adjacent overlaying and underlying interconnect walls to define manifold chambers therebetween on opposite sides of the wall. Each electrolyte wall defining the flow passageways is shaped to blend into and be connected to this wall in order to redirect the corresponding fuel and oxidant passageways to the respective manifold chambers either above or below this intermediate wall. Inlet and outlet connections are made to these separate manifold chambers respectively, for carrying the fuel and oxidant gases to the core, and for carrying their reaction products away from the core.
A formulation of directivity for earthquake sources using isochrone theory
Spudich, Paul; Chiou, Brian S.J.; Graves, Robert; Collins, Nancy; Somerville, Paul
2004-01-01
A functional form for directivity effects can be derived from isochrone theory, in which the measure of the directivity-induced amplification of an S body wave is c, the isochrone velocity. Ground displacement of the near-, intermediate-, and far-field terms of P and S waves is linear in isochrone velocity for a finite source in a whole space. We have developed an approximation c-tilde-prime of isochrone velocity that can easily be implemented as a predictor of directivity effects in empirical ground motion prediction relations. Typically, for a given fault surface, hypocenter, and site geometry, c-tilde-prime is a simple function of the hypocentral distance, the rupture distance, the crustal shear wave speed in the seismogenic zone, and the rupture velocity. c-tilde-prime typically ranges in the interval 0.44, for rupture away from the station, to about 4, for rupture toward the station. In this version of the theory directivity is independent of period. Additionally, we have created another functional form which is c-tilde-prime modified to include the approximate radiation pattern of a finite fault having a given rake. This functional form can be used to model the spatial variations of fault-parallel and fault-normal horizontal ground motions. The strengths of this formulation are 1) the proposed functional form is based on theory, 2) the predictor is unambiguously defined for all possible site locations and source rakes, and 3) it can easily be implemented for well-studied important previous earthquakes. We compare predictions of our functional form with synthetic ground motions calculated for finite strike-slip and dip-slip faults in the magnitude range 6.5 - 7.5. In general our functional form correlates best with computed fault-normal and fault-parallel motions in the synthetic motions calculated for events with M6.5. Correlation degrades but is still useful for larger events and for the geometric average horizontal motions. We have had limited success applying it to geometrically complicated faults.
Assignment Of Finite Elements To Parallel Processors
NASA Technical Reports Server (NTRS)
Salama, Moktar A.; Flower, Jon W.; Otto, Steve W.
1990-01-01
Elements assigned approximately optimally to subdomains. Mapping algorithm based on simulated-annealing concept used to minimize approximate time required to perform finite-element computation on hypercube computer or other network of parallel data processors. Mapping algorithm needed when shape of domain complicated or otherwise not obvious what allocation of elements to subdomains minimizes cost of computation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Campione, Salvatore; Warne, Larry K.; Basilio, Lorena I.
In this paper we develop a fully-retarded, dipole approximation model to estimate the effective polarizabilities of a dimer made of dielectric resonators. They are computed from the polarizabilities of the two resonators composing the dimer. We analyze the situation of full-cubes as well as split-cubes, which have been shown to exhibit overlapping electric and magnetic resonances. We compare the effective dimer polarizabilities to ones retrieved via full-wave simulations as well as ones computed via a quasi-static, dipole approximation. We observe good agreement between the fully-retarded solution and the full-wave results, whereas the quasi-static approximation is less accurate for the problemmore » at hand. The developed model can be used to predict the electric and magnetic resonances of a dimer under parallel or orthogonal (to the dimer axis) excitation. This is particularly helpful when interested in locating frequencies at which the dimer will emit directional radiation.« less
Composite Intelligent Learning Control of Strict-Feedback Systems With Disturbance.
Xu, Bin; Sun, Fuchun
2018-02-01
This paper addresses the dynamic surface control of uncertain nonlinear systems on the basis of composite intelligent learning and disturbance observer in presence of unknown system nonlinearity and time-varying disturbance. The serial-parallel estimation model with intelligent approximation and disturbance estimation is built to obtain the prediction error and in this way the composite law for weights updating is constructed. The nonlinear disturbance observer is developed using intelligent approximation information while the disturbance estimation is guaranteed to converge to a bounded compact set. The highlight is that different from previous work directly toward asymptotic stability, the transparency of the intelligent approximation and disturbance estimation is included in the control scheme. The uniformly ultimate boundedness stability is analyzed via Lyapunov method. Through simulation verification, the composite intelligent learning with disturbance observer can efficiently estimate the effect caused by system nonlinearity and disturbance while the proposed approach obtains better performance with higher accuracy.
The Anisotropic Structure of South China Sea: Using OBS Data to Constrain Mantle Flow
NASA Astrophysics Data System (ADS)
Li, L.; Xue, M.; Yang, T.; Liu, C.; Hua, Q.; Xia, S.; Huang, H.; Le, B. M.; Huo, D.; Pan, M.
2015-12-01
The dynamic mechanism of the formation of South China Sea (SCS) has been debated for decades. The anisotropic structure can provide useful insight into the complex evolution of SCS by indicating its mantle flow direction and strength. In this study, we employ shear wave splitting methods on two half-year seismic data collected from 10 and 6 passive source Ocean Bottom Seismometers (OBS) respectively. These OBSs were deployed along both sides of the extinct ridge in the central basin of SCS by Tongji University in 2012 and 2013 respectively, which were then successfully recovered in 2013 and 2015 respectively. Through processing and inspecting the global and regional earthquakes (with local events being processing) of the 2012 dataset, measurements are made for 2 global events and 24 regional events at 5 OBSs using the tangential energy minimization, the smallest eigenvalue minimization, as well as the correlation methods. We also implement cluster analysis on the splitting results obtained for different time windows as well as filtered at different frequency bands. For teleseismic core phases like SKS and PKS, we find the fast polarization direction beneath the central basin is approximately NE-SW, nearly parallel to the extinct ridge in the central basin of SCS. Whereas for regional events, the splitting analysis on S, PS and ScS phases shows much more complicated fast directions as the ray path varies for different phases. The fast directions observed can be divided into three groups: (1) for the events from the Eurasia plate, a gradual rotation of the fast polarization direction from NNE-SSW to NEE-SWW along the path from the inner Eurasia plate to the central SCS is observed, implying the mantle flow is controlled by the India-Eurasia collision; (2) for the events located at the junction of Pacific plate and Philippine plate, the dominant fast direction is NW-SE, almost perpendicular to Ryukyu Trench as well as sub-parallel to the absolute direction of Philippine plate; (3) for the events occurred in the SE direction near the Philippine Fault zone, the observed NE-SW fast direction is sub-parallel to the subduction direction of the Philippine plate.
Pulling adsorbed polymers at an angle: A low temperature theory
NASA Astrophysics Data System (ADS)
Iliev, Gerasim; Whittington, Stuart
2012-02-01
We consider several partially-directed walk models in two- and three-dimensions to study the problem of a homopolymer interacting with a surface while subject to a force at the terminal monomer. The force is applied with a component parallel to the surface as well as a component perpendicular to the surface. Depending on the relative values of the force in each direction, the force can either enhance the adsorption transition or lead to desorption in an adsorbed polymer. For each model, we determine the associated generating function and extract the phase diagram, identifying states where the polymer is thermally desorbed, adsorbed, and under the influence of the force. We note the different regimes that appear in the problem and provide a low temperature approximation to describe them. The approximation is exact at T=0 and models the exact results extremely well for small values of T. This work is an extension of a model considered by S. Whittington and E. Orlandini.
Approximation algorithms for scheduling unrelated parallel machines with release dates
NASA Astrophysics Data System (ADS)
Avdeenko, T. V.; Mesentsev, Y. A.; Estraykh, I. V.
2017-01-01
In this paper we propose approaches to optimal scheduling of unrelated parallel machines with release dates. One approach is based on the scheme of dynamic programming modified with adaptive narrowing of search domain ensuring its computational effectiveness. We discussed complexity of the exact schedules synthesis and compared it with approximate, close to optimal, solutions. Also we explain how the algorithm works for the example of two unrelated parallel machines and five jobs with release dates. Performance results that show the efficiency of the proposed approach have been given.
NASA Astrophysics Data System (ADS)
Sato, Yasuhiro; Furuki, Makoto; Tian, Minquan; Iwasa, Izumi; Pu, Lyong Sun; Tatsuura, Satoshi
2002-04-01
We demonstrated ultrafast single-shot multichannel demultiplexing by using a squarylium dye J aggregate film as an optical Kerr medium. High efficiency and fast recovery of the optical Kerr responses were achieved when a signal-pulse wavelength was close to the absorption peak of the J aggregate film with off-resonant excitation. The on/off ratio in demultiplexing of 1 Tb/s signals was improved to be approximately 5. By introducing time delay to both horizontal and vertical directions, we succeeded in directly observing the conversion of 1 Tb/s serial signals into two-dimensionally arranged parallel signals.
Single-sided mobile NMR apparatus using the transverse flux of a single permanent magnet.
Chang, Wei-Hao; Chen, Jyh-Horng; Hwang, Lian-Pin
2010-01-01
This study presents a simple design for a mobile, single-sided nuclear magnetic resonance (NMR) apparatus which uses the magnetic flux parallel to the magnetization direction of a single, disc-shaped permanent magnet polarized in radial direction. The stray magnetic field above the magnet is approximately parallel to the magnetization direction of the magnet and is utilized as the B(0) magnetic field of the apparatus. The apparatus weighs 1.8 kg, has a compact structure and can be held in one's palm. The apparatus generates a B(0) field strength of about 0.279 T at the center of apparatus surface and can acquire a clear Hahn echo signal of a pencil eraser block lying on the RF coil in one shot. Moreover, a strong static magnetic field gradient exists in the direction perpendicular to the apparatus surface. The strength of the static magnetic field gradient near the center of the apparatus surface is about 10.2 T/m; one-dimensional imaging of thin objects and liquid self-diffusion coefficient measurements can be performed therein. The available spatial resolution of the one-dimensional imaging experiments using a 5 x 5 mm horizontal sample area is about 200 mum. Several nondestructive inspection applications of the apparatus, including distinguishing between polyethylene grains of different densities, characterizing epoxy putties of distinct set times and evaluating the fat content percentages of milk powders, are also demonstrated. Compared with many previously published designs, the proposed design bears a simple structure and generates a B(0) magnetic field parallel to the apparatus surface, simplifying apparatus construction and simultaneously rendering the selection of the radiofrequency coil relatively flexible.
Computational Challenges of 3D Radiative Transfer in Atmospheric Models
NASA Astrophysics Data System (ADS)
Jakub, Fabian; Bernhard, Mayer
2017-04-01
The computation of radiative heating and cooling rates is one of the most expensive components in todays atmospheric models. The high computational cost stems not only from the laborious integration over a wide range of the electromagnetic spectrum but also from the fact that solving the integro-differential radiative transfer equation for monochromatic light is already rather involved. This lead to the advent of numerous approximations and parameterizations to reduce the cost of the solver. One of the most prominent one is the so called independent pixel approximations (IPA) where horizontal energy transfer is neglected whatsoever and radiation may only propagate in the vertical direction (1D). Recent studies implicate that the IPA introduces significant errors in high resolution simulations and affects the evolution and development of convective systems. However, using fully 3D solvers such as for example MonteCarlo methods is not even on state of the art supercomputers feasible. The parallelization of atmospheric models is often realized by a horizontal domain decomposition, and hence, horizontal transfer of energy necessitates communication. E.g. a cloud's shadow at a low zenith angle will cast a long shadow and potentially needs to communication through a multitude of processors. Especially light in the solar spectral range may travel long distances through the atmosphere. Concerning highly parallel simulations, it is vital that 3D radiative transfer solvers put a special emphasis on parallel scalability. We will present an introduction to intricacies computing 3D radiative heating and cooling rates as well as report on the parallel performance of the TenStream solver. The TenStream is a 3D radiative transfer solver using the PETSc framework to iteratively solve a set of partial differential equation. We investigate two matrix preconditioners, (a) geometric algebraic multigrid preconditioning(MG+GAMG) and (b) block Jacobi incomplete LU (ILU) factorization. The TenStream solver is tested for up to 4096 cores and shows a parallel scaling efficiency of 80-90% on various supercomputers.
Diama, A; Matthies, B; Herwig, K W; Hansen, F Y; Criswell, L; Mo, H; Bai, M; Taub, H
2009-08-28
We present evidence from neutron diffraction measurements and molecular dynamics (MD) simulations of three different monolayer phases of the intermediate-length alkanes tetracosane (n-C(24)H(50) denoted as C24) and dotriacontane (n-C(32)H(66) denoted as C32) adsorbed on a graphite basal-plane surface. Our measurements indicate that the two monolayer films differ principally in the transition temperatures between phases. At the lowest temperatures, both C24 and C32 form a crystalline monolayer phase with a rectangular-centered (RC) structure. The two sublattices of the RC structure each consists of parallel rows of molecules in their all-trans conformation aligned with their long axis parallel to the surface and forming so-called lamellas of width approximately equal to the all-trans length of the molecule. The RC structure is uniaxially commensurate with the graphite surface in its [110] direction such that the distance between molecular rows in a lamella is 4.26 A=sqrt[3a(g)], where a(g)=2.46 A is the lattice constant of the graphite basal plane. Molecules in adjacent rows of a lamella alternate in orientation between the carbon skeletal plane being parallel and perpendicular to the graphite surface. Upon heating, the crystalline monolayers transform to a "smectic" phase in which the inter-row spacing within a lamella expands by approximately 10% and the molecules are predominantly oriented with the carbon skeletal plane parallel to the graphite surface. In the smectic phase, the MD simulations show evidence of broadening of the lamella boundaries as a result of molecules diffusing parallel to their long axis. At still higher temperatures, they indicate that the introduction of gauche defects into the alkane chains drives a melting transition to a monolayer fluid phase as reported previously.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wereszczak, Andrew A.; Emily Cousineau, J.; Bennion, Kevin
The apparent thermal conductivity of packed copper wire test specimens was measured parallel and perpendicular to the axis of the wire using laser flash, transient plane source, and transmittance test methods. Approximately 50% wire packing efficiency was produced in the specimens using either 670- or 925-μm-diameter copper wires that both had an insulation coating thickness of 37 μm. The interstices were filled with a conventional varnish material and also contained some remnant porosity. The apparent thermal conductivity perpendicular to the wire axis was about 0.5–1 W/mK, whereas it was over 200 W/mK in the parallel direction. The Kanzaki model andmore » an finite element analysis (FEA) model were found to reasonably predict the apparent thermal conductivity perpendicular to the wires but thermal conductivity percolation from nonideal wire-packing may result in their underestimation of it.« less
A Two-dimensional Version of the Niblett-Bostick Transformation for Magnetotelluric Interpretations
NASA Astrophysics Data System (ADS)
Esparza, F.
2005-05-01
An imaging technique for two-dimensional magnetotelluric interpretations is developed following the well known Niblett-Bostick transformation for one-dimensional profiles. The algorithm uses a Hopfield artificial neural network to process series and parallel magnetotelluric impedances along with their analytical influence functions. The adaptive, weighted average approximation preserves part of the nonlinearity of the original problem. No initial model in the usual sense is required for the recovery of a functional model. Rather, the built-in relationship between model and data considers automatically, all at the same time, many half spaces whose electrical conductivities vary according to the data. The use of series and parallel impedances, a self-contained pair of invariants of the impedance tensor, avoids the need to decide on best angles of rotation for TE and TM separations. Field data from a given profile can thus be fed directly into the algorithm without much processing. The solutions offered by the Hopfield neural network correspond to spatial averages computed through rectangular windows that can be chosen at will. Applications of the algorithm to simple synthetic models and to the COPROD2 data set illustrate the performance of the approximation.
NASA Technical Reports Server (NTRS)
Eriksson, S.; Wilder, F. D.; Ergun, R. E.; Schwartz, S. J.; Cassak, P. A.; Burch, J. L.; Chen, Li-Jen; Torbert, R. B.; Phan, T. D.; Lavraud, B.;
2016-01-01
We report observations from the Magnetospheric Multiscale (MMS) satellites of a large guide field magnetic reconnection event. The observations suggest that two of the four MMS spacecraft sampled the electron diffusion region, whereas the other two spacecraft detected the exhaust jet from the event. The guide magnetic field amplitude is approximately 4 times that of the reconnecting field. The event is accompanied by a significant parallel electric field (E(sub parallel lines) that is larger than predicted by simulations. The high-speed (approximately 300 km/s) crossing of the electron diffusion region limited the data set to one complete electron distribution inside of the electron diffusion region, which shows significant parallel heating. The data suggest that E(sub parallel lines) is balanced by a combination of electron inertia and a parallel gradient of the gyrotropic electron pressure.
Implementation of parallel moment equations in NIMROD
NASA Astrophysics Data System (ADS)
Lee, Hankyu Q.; Held, Eric D.; Ji, Jeong-Young
2017-10-01
As collisionality is low (the Knudsen number is large) in many plasma applications, kinetic effects become important, particularly in parallel dynamics for magnetized plasmas. Fluid models can capture some kinetic effects when integral parallel closures are adopted. The adiabatic and linear approximations are used in solving general moment equations to obtain the integral closures. In this work, we present an effort to incorporate non-adiabatic (time-dependent) and nonlinear effects into parallel closures. Instead of analytically solving the approximate moment system, we implement exact parallel moment equations in the NIMROD fluid code. The moment code is expected to provide a natural convergence scheme by increasing the number of moments. Work in collaboration with the PSI Center and supported by the U.S. DOE under Grant Nos. DE-SC0014033, DE-SC0016256, and DE-FG02-04ER54746.
Computer-Aided Parallelizer and Optimizer
NASA Technical Reports Server (NTRS)
Jin, Haoqiang
2011-01-01
The Computer-Aided Parallelizer and Optimizer (CAPO) automates the insertion of compiler directives (see figure) to facilitate parallel processing on Shared Memory Parallel (SMP) machines. While CAPO currently is integrated seamlessly into CAPTools (developed at the University of Greenwich, now marketed as ParaWise), CAPO was independently developed at Ames Research Center as one of the components for the Legacy Code Modernization (LCM) project. The current version takes serial FORTRAN programs, performs interprocedural data dependence analysis, and generates OpenMP directives. Due to the widely supported OpenMP standard, the generated OpenMP codes have the potential to run on a wide range of SMP machines. CAPO relies on accurate interprocedural data dependence information currently provided by CAPTools. Compiler directives are generated through identification of parallel loops in the outermost level, construction of parallel regions around parallel loops and optimization of parallel regions, and insertion of directives with automatic identification of private, reduction, induction, and shared variables. Attempts also have been made to identify potential pipeline parallelism (implemented with point-to-point synchronization). Although directives are generated automatically, user interaction with the tool is still important for producing good parallel codes. A comprehensive graphical user interface is included for users to interact with the parallelization process.
Automatic Generation of Directive-Based Parallel Programs for Shared Memory Parallel Systems
NASA Technical Reports Server (NTRS)
Jin, Hao-Qiang; Yan, Jerry; Frumkin, Michael
2000-01-01
The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. As great progress was made in hardware and software technologies, performance of parallel programs with compiler directives has demonstrated large improvement. The introduction of OpenMP directives, the industrial standard for shared-memory programming, has minimized the issue of portability. Due to its ease of programming and its good performance, the technique has become very popular. In this study, we have extended CAPTools, a computer-aided parallelization toolkit, to automatically generate directive-based, OpenMP, parallel programs. We outline techniques used in the implementation of the tool and present test results on the NAS parallel benchmarks and ARC3D, a CFD application. This work demonstrates the great potential of using computer-aided tools to quickly port parallel programs and also achieve good performance.
Automatic Multilevel Parallelization Using OpenMP
NASA Technical Reports Server (NTRS)
Jin, Hao-Qiang; Jost, Gabriele; Yan, Jerry; Ayguade, Eduard; Gonzalez, Marc; Martorell, Xavier; Biegel, Bryan (Technical Monitor)
2002-01-01
In this paper we describe the extension of the CAPO (CAPtools (Computer Aided Parallelization Toolkit) OpenMP) parallelization support tool to support multilevel parallelism based on OpenMP directives. CAPO generates OpenMP directives with extensions supported by the NanosCompiler to allow for directive nesting and definition of thread groups. We report some results for several benchmark codes and one full application that have been parallelized using our system.
Polymer scaling and dynamics in steady-state sedimentation at infinite Péclet number.
Lehtola, V; Punkkinen, O; Ala-Nissila, T
2007-11-01
We consider the static and dynamical behavior of a flexible polymer chain under steady-state sedimentation using analytic arguments and computer simulations. The model system comprises a single coarse-grained polymer chain of N segments, which resides in a Newtonian fluid as described by the Navier-Stokes equations. The chain is driven into nonequilibrium steady state by gravity acting on each segment. The equations of motion for the segments and the Navier-Stokes equations are solved simultaneously using an immersed boundary method, where thermal fluctuations are neglected. To characterize the chain conformation, we consider its radius of gyration RG(N). We find that the presence of gravity explicitly breaks the spatial symmetry leading to anisotropic scaling of the components of RG with N along the direction of gravity RG, parallel and perpendicular to it RG, perpendicular, respectively. We numerically estimate the corresponding anisotropic scaling exponents nu parallel approximately 0.79 and nu perpendicular approximately 0.45, which differ significantly from the equilibrium scaling exponent nue=0.588 in three dimensions. This indicates that on the average, the chain becomes elongated along the sedimentation direction for large enough N. We present a generalization of the Flory scaling argument, which is in good agreement with the numerical results. It also reveals an explicit dependence of the scaling exponents on the Reynolds number. To study the dynamics of the chain, we compute its effective diffusion coefficient D(N), which does not contain Brownian motion. For the range of values of N used here, we find that both the parallel and perpendicular components of D increase with the chain length N, in contrast to the case of thermal diffusion in equilibrium. This is caused by the fluid-driven fluctuations in the internal configuration of the polymer that are magnified as polymer size becomes larger.
Approximate kernel competitive learning.
Wu, Jian-Sheng; Zheng, Wei-Shi; Lai, Jian-Huang
2015-03-01
Kernel competitive learning has been successfully used to achieve robust clustering. However, kernel competitive learning (KCL) is not scalable for large scale data processing, because (1) it has to calculate and store the full kernel matrix that is too large to be calculated and kept in the memory and (2) it cannot be computed in parallel. In this paper we develop a framework of approximate kernel competitive learning for processing large scale dataset. The proposed framework consists of two parts. First, it derives an approximate kernel competitive learning (AKCL), which learns kernel competitive learning in a subspace via sampling. We provide solid theoretical analysis on why the proposed approximation modelling would work for kernel competitive learning, and furthermore, we show that the computational complexity of AKCL is largely reduced. Second, we propose a pseudo-parallelled approximate kernel competitive learning (PAKCL) based on a set-based kernel competitive learning strategy, which overcomes the obstacle of using parallel programming in kernel competitive learning and significantly accelerates the approximate kernel competitive learning for large scale clustering. The empirical evaluation on publicly available datasets shows that the proposed AKCL and PAKCL can perform comparably as KCL, with a large reduction on computational cost. Also, the proposed methods achieve more effective clustering performance in terms of clustering precision against related approximate clustering approaches. Copyright © 2014 Elsevier Ltd. All rights reserved.
Calculation of Vertical and Horizontal Mobilities in InAs/GaSb Superlattices (Postprint)
2011-10-13
width 2a and GaSb having width 2b, with the period = 2a + 2b. For energies near the band gap edges, the carrier wave function can be approximated by a...online) Electron energy bands along the growth direction for three combinations of InAs/ GaSb layer widths. For typical carrier densities, at low...Fermi energies , parallel masses, and band gaps from the 8×8 EFA model. Sheet carrier Calculated Measured Calculated InAs GaSb concentration per period
Inband radar cross section of phased arrays with parallel feeds
NASA Astrophysics Data System (ADS)
Flokas, Vassilios
1994-06-01
Approximate formulas for the inband radar cross section of arrays with parallel feeds are presented. To obtain the formulas, multiple reflections are neglected, and devices of the same type are assumed to have identical electrical performance. The approximate results were compared to the results obtained using a scattering matrix formulation. Both methods were in agreement in predicting RCS lobe positions, levels, and behavior with scanning. The advantages of the approximate method are its computational efficiency and its flexibility in handling an arbitrary number of coupler levels.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yoon, Peter H., E-mail: yoonp@umd.edu; School of Space Research, Kyung Hee University, Yongin, Gyeonggi 446-701
2015-09-15
A previous paper [P. H. Yoon, “Kinetic theory of turbulence for parallel propagation revisited: Formal results,” Phys. Plasmas 22, 082309 (2015)] revisited the second-order nonlinear kinetic theory for turbulence propagating in directions parallel/anti-parallel to the ambient magnetic field, in which the original work according to Yoon and Fang [Phys. Plasmas 15, 122312 (2008)] was refined, following the paper by Gaelzer et al. [Phys. Plasmas 22, 032310 (2015)]. The main finding involved the dimensional correction pertaining to discrete-particle effects in Yoon and Fang's theory. However, the final result was presented in terms of formal linear and nonlinear susceptibility response functions. Inmore » the present paper, the formal equations are explicitly written down for the case of low-to-intermediate frequency regime by making use of approximate forms for the response functions. The resulting equations are sufficiently concrete so that they can readily be solved by numerical means or analyzed by theoretical means. The derived set of equations describe nonlinear interactions of quasi-parallel modes whose frequency range covers the Alfvén wave range to ion-cyclotron mode, but is sufficiently lower than the electron cyclotron mode. The application of the present formalism may range from the nonlinear evolution of whistler anisotropy instability in the high-beta regime, and the nonlinear interaction of electrons with whistler-range turbulence.« less
NASA Technical Reports Server (NTRS)
Dahlback, Arne; Stamnes, Knut
1991-01-01
Accurate computation of atmospheric photodissociation and heating rates is needed in photochemical models. These quantities are proportional to the mean intensity of the solar radiation penetrating to various levels in the atmosphere. For large solar zenith angles a solution of the radiative transfer equation valid for a spherical atmosphere is required in order to obtain accurate values of the mean intensity. Such a solution based on a perturbation technique combined with the discrete ordinate method is presented. Mean intensity calculations are carried out for various solar zenith angles. These results are compared with calculations from a plane parallel radiative transfer model in order to assess the importance of using correct geometry around sunrise and sunset. This comparison shows, in agreement with previous investigations, that for solar zenith angles less than 90 deg adequate solutions are obtained for plane parallel geometry as long as spherical geometry is used to compute the direct beam attenuation; but for solar zenith angles greater than 90 deg this pseudospherical plane parallel approximation overstimates the mean intensity.
NASA Technical Reports Server (NTRS)
Keyes, David E.; Smooke, Mitchell D.
1987-01-01
A parallelized finite difference code based on the Newton method for systems of nonlinear elliptic boundary value problems in two dimensions is analyzed in terms of computational complexity and parallel efficiency. An approximate cost function depending on 15 dimensionless parameters is derived for algorithms based on stripwise and boxwise decompositions of the domain and a one-to-one assignment of the strip or box subdomains to processors. The sensitivity of the cost functions to the parameters is explored in regions of parameter space corresponding to model small-order systems with inexpensive function evaluations and also a coupled system of nineteen equations with very expensive function evaluations. The algorithm was implemented on the Intel Hypercube, and some experimental results for the model problems with stripwise decompositions are presented and compared with the theory. In the context of computational combustion problems, multiprocessors of either message-passing or shared-memory type may be employed with stripwise decompositions to realize speedup of O(n), where n is mesh resolution in one direction, for reasonable n.
Switching of the Spin-Density-Wave in CeCoIn5 probed by Thermal Conductivity
NASA Astrophysics Data System (ADS)
Kim, Duk Y.; Lin, Shi-Zeng; Weickert, Franziska; Bauer, Eric D.; Ronning, Filip; Thompson, Joe D.; Movshovich, Roman
Unconventional superconductor CeCoIn5 orders magnetically in a spin-density-wave (SDW) in the low-temperature and high-field corner of the superconducting phase. Recent neutron scattering experiment revealed that the single-domain SDW's ordering vector Q depends strongly on the direction of the magnetic field, switching sharply as the field is rotated through the anti-nodal direction. This switching may be manifestation of a pair-density-wave (PDW) p-wave order parameter, which develops in addition to the well-established d-wave order parameter due to the SDW formation. We have investigated the hypersensitivity of the magnetic domain with a thermal conductivity measurement. The heat current (J) was applied along the [110] direction such that the Q vector is either perpendicular or parallel to J, depending on the magnetic field direction. A discontinuous change of the thermal conductivity was observed when the magnetic field is rotated around the [100] direction within 0 . 2° . The thermal conductivity with the Q parallel to the heat current (J ∥Q) is approximately 15% lager than that with the Q perpendicular to the heat current (J ⊥Q). This result is consistent with additional gapping of the nodal quasiparticle by the p-wave PDW coupled to SDW. Work at Los Alamos was performed under the auspices of the U.S. Department of Energy, Office of Basic Energy Sciences, Division of Materials Sciences and Engineering.
Automatic Multilevel Parallelization Using OpenMP
NASA Technical Reports Server (NTRS)
Jin, Hao-Qiang; Jost, Gabriele; Yan, Jerry; Ayguade, Eduard; Gonzalez, Marc; Martorell, Xavier; Biegel, Bryan (Technical Monitor)
2002-01-01
In this paper we describe the extension of the CAPO parallelization support tool to support multilevel parallelism based on OpenMP directives. CAPO generates OpenMP directives with extensions supported by the NanosCompiler to allow for directive nesting and definition of thread groups. We report first results for several benchmark codes and one full application that have been parallelized using our system.
Brennan; Biddison; Frauendorf; Schwarcz; Keen; Ecker; Davis; Tinder; Swayze
1998-01-01
An automated, 96-well parallel array synthesizer for solid-phase organic synthesis has been designed and constructed. The instrument employs a unique reagent array delivery format, in which each reagent utilized has a dedicated plumbing system. An inert atmosphere is maintained during all phases of a synthesis, and temperature can be controlled via a thermal transfer plate which holds the injection molded reaction block. The reaction plate assembly slides in the X-axis direction, while eight nozzle blocks holding the reagent lines slide in the Y-axis direction, allowing for the extremely rapid delivery of any of 64 reagents to 96 wells. In addition, there are six banks of fixed nozzle blocks, which deliver the same reagent or solvent to eight wells at once, for a total of 72 possible reagents. The instrument is controlled by software which allows the straightforward programming of the synthesis of a larger number of compounds. This is accomplished by supplying a general synthetic procedure in the form of a command file, which calls upon certain reagents to be added to specific wells via lookup in a sequence file. The bottle position, flow rate, and concentration of each reagent is stored in a separate reagent table file. To demonstrate the utility of the parallel array synthesizer, a small combinatorial library of hydroxamic acids was prepared in high throughput mode for biological screening. Approximately 1300 compounds were prepared on a 10 μmole scale (3-5 mg) in a few weeks. The resulting crude compounds were generally >80% pure, and were utilized directly for high throughput screening in antibacterial assays. Several active wells were found, and the activity was verified by solution-phase synthesis of analytically pure material, indicating that the system described herein is an efficient means for the parallel synthesis of compounds for lead discovery. Copyright 1998 John Wiley & Sons, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Moreland, Kenneth D.
2017-07-01
The FY17Q3 milestone of the ECP/VTK-m project includes the completion of a VTK-m filter that computes normal vectors for surfaces. Normal vectors are those that point perpendicular to the surface and are an important direction when rendering the surface. The implementation includes the parallel algorithm itself, a filter module to simplify integrating it into other software, and documentation in the VTK-m Users’ Guide. With the completion of this milestone, we are able to necessary information to rendering systems to provide appropriate shading of surfaces. This milestone also feeds into subsequent milestones that progressively improve the approximation of surface direction.
Atomistic simulation of frictional anisotropy on quasicrystal approximant surfaces
Ye, Zhijiang; Martini, Ashlie; Thiel, Patricia; ...
2016-06-23
J. Y. Park et al. [Science 309, 1354 (2005)] have reported eight times greater atomic-scale friction in the periodic than in the quasiperiodic direction on the twofold face of a decagonal Al-Ni-Co quasicrystal. Here we present results of molecular-dynamics simulations intended to elucidate mechanisms behind this giant frictional anisotropy. Simulations of a bare atomic-force-microscope tip on several model substrates and under a variety of conditions failed to reproduce experimental results. On the other hand, including the experimental passivation of the tip with chains of hexadecane thiol, we reproduce qualitatively the experimental anisotropy in friction, finding evidence for entrainment of themore » organic chains in surface furrows parallel to the periodic direction.« less
NASA Astrophysics Data System (ADS)
Krämer, Florian; Gratz, Micha; Tschöpe, Andreas
2016-07-01
The magnetic field-dependent optical transmission of dilute Ni nanorod aqueous suspensions was investigated. A series of four samples of nanorods were synthesized using the AAO template method and processed to stable colloids. The distributions of their length and diameter were characterized by analysis of TEM images and revealed average diameters of ˜25 nm and different lengths in the range of 60 nm-1100 nm. The collinear magnetic and optical anisotropy was studied by static field-dependent transmission measurements of linearly polarized light parallel and perpendicular to the magnetic field direction. The experimental results were modelled assuming the field-dependent orientation distribution function of a superparamagnetic ensemble for the uniaxial ferromagnetic nanorods in liquid dispersion and extinction cross sections for longitudinal and transversal optical polarization derived from different approaches, including the electrostatic approximation and the separation of variables method, both applied to spheroidal particles, as well as finite element method simulations of spheroids and capped cylindrical particles. The extinction cross sections were compared to reveal the differences associated with the approximations of homogeneous polarization and/or particle shape. The consequences of these approximations for the quantitative analysis of magnetic field-dependent optical transmission measurements were investigated and a reliable protocol derived. Furthermore, the changes in optical cross sections induced by electromagnetic interaction between two nanorods in parallel end-to-end and side-by-side configuration as a function of their separation were studied.
Anisotropic Thermal Response of Packed Copper Wire
Wereszczak, Andrew A.; Emily Cousineau, J.; Bennion, Kevin; ...
2017-04-19
The apparent thermal conductivity of packed copper wire test specimens was measured parallel and perpendicular to the axis of the wire using laser flash, transient plane source, and transmittance test methods. Approximately 50% wire packing efficiency was produced in the specimens using either 670- or 925-μm-diameter copper wires that both had an insulation coating thickness of 37 μm. The interstices were filled with a conventional varnish material and also contained some remnant porosity. The apparent thermal conductivity perpendicular to the wire axis was about 0.5–1 W/mK, whereas it was over 200 W/mK in the parallel direction. The Kanzaki model andmore » an finite element analysis (FEA) model were found to reasonably predict the apparent thermal conductivity perpendicular to the wires but thermal conductivity percolation from nonideal wire-packing may result in their underestimation of it.« less
NASA Technical Reports Server (NTRS)
Loeb, N. G.; Varnai, Tamas; Winker, David M.
1998-01-01
Recent observational studies have shown that satellite retrievals of cloud optical depth based on plane-parallel model theory suffer from systematic biases that depend on viewing geometry, even when observations are restricted to overcast marine stratus layers, arguably the closest to plane parallel in nature. At moderate to low sun elevations, the plane-parallel model significantly overestimates the reflectance dependence on view angle in the forward-scattering direction but shows a similar dependence in the backscattering direction. Theoretical simulations are performed that show that the likely cause for this discrepancy is because the plane-parallel model assumption does not account for subpixel, scale variations in cloud-top height (i.e., "cloud bumps"). Monte Carlo simulation, comparing ID model radiances to radiances from overcast cloud field with 1) cloud-top height variation, but constant cloud volume extinction; 2) flat tops but horizontal variations in cloud volume extinction; and 3) variations in both cloud top height and cloud extinction are performed over a approximately equal to 4 km x 4 km domain (roughly the size of an individual GAC AVHRR pixel). The comparisons show that when cloud-top height variations are included, departures from 1D theory are remarkably similar (qualitatively) to those obtained observationally. In contrast, when clouds are assumed flat and only cloud extinction is variable, reflectance differences are much smaller and do not show any view-angle dependence. When both cloud-top height and cloud extinction variations are included, however, large increases in cloud extinction variability can enhance reflectance difference. The reason 3D-1D reflectance differences are more sensitive to cloud-top height variations in the forward-scattering direction (at moderate to low, sun elevations) is because photons leaving the cloud field in that direction experience fewer scattering events (low-order scattering) and are restricted to the topmost portions of the cloud. While reflectance deviations from 1D theory are much larger for bumpy clouds than for flat clouds with variable cloud extinction, differences in cloud albedo are comparable for these two cases.
Parallel implementation of approximate atomistic models of the AMOEBA polarizable model
NASA Astrophysics Data System (ADS)
Demerdash, Omar; Head-Gordon, Teresa
2016-11-01
In this work we present a replicated data hybrid OpenMP/MPI implementation of a hierarchical progression of approximate classical polarizable models that yields speedups of up to ∼10 compared to the standard OpenMP implementation of the exact parent AMOEBA polarizable model. In addition, our parallel implementation exhibits reasonable weak and strong scaling. The resulting parallel software will prove useful for those who are interested in how molecular properties converge in the condensed phase with respect to the MBE, it provides a fruitful test bed for exploring different electrostatic embedding schemes, and offers an interesting possibility for future exascale computing paradigms.
Anandakrishnan, Ramu; Scogland, Tom R. W.; Fenley, Andrew T.; Gordon, John C.; Feng, Wu-chun; Onufriev, Alexey V.
2010-01-01
Tools that compute and visualize biomolecular electrostatic surface potential have been used extensively for studying biomolecular function. However, determining the surface potential for large biomolecules on a typical desktop computer can take days or longer using currently available tools and methods. Two commonly used techniques to speed up these types of electrostatic computations are approximations based on multi-scale coarse-graining and parallelization across multiple processors. This paper demonstrates that for the computation of electrostatic surface potential, these two techniques can be combined to deliver significantly greater speed-up than either one separately, something that is in general not always possible. Specifically, the electrostatic potential computation, using an analytical linearized Poisson Boltzmann (ALPB) method, is approximated using the hierarchical charge partitioning (HCP) multiscale method, and parallelized on an ATI Radeon 4870 graphical processing unit (GPU). The implementation delivers a combined 934-fold speed-up for a 476,040 atom viral capsid, compared to an equivalent non-parallel implementation on an Intel E6550 CPU without the approximation. This speed-up is significantly greater than the 42-fold speed-up for the HCP approximation alone or the 182-fold speed-up for the GPU alone. PMID:20452792
The quiet evening auroral arc and the structure of the growth phase near-Earth plasma sheet
NASA Astrophysics Data System (ADS)
Coroniti, F. V.; Pritchett, P. L.
2014-03-01
The plasma pressure and current configuration of the near-Earth plasma sheet that creates and sustains the quiet evening auroral arc during the growth phase of magnetospheric substorms is investigated. We propose that the quiet evening arc (QEA) connects to the thin near-Earth current sheet, which forms during the development of the growth phase enhancement of convection. The current sheet's large polarization electric fields are shielded from the ionosphere by an Inverted-V parallel potential drop, thereby producing the electron precipitation responsible for the arc's luminosity. The QEA is located in the plasma sheet region of maximal radial pressure gradient and, in the east-west direction, follows the vanishing of the approximately dawn-dusk-directed gradient or fold in the plasma pressure. In the evening sector, the boundary between the Region1 and Region 2 current systems occurs where the pressure maximizes (approximately radial gradient of the pressure vanishes) and where the approximately radial gradient of the magnetic flux tube volume also vanishes in an inflection region. The proposed intricate balance of plasma sheet pressure and currents may well be very sensitive to disruption by the arrival of equatorward traveling auroral streamers and their associated earthward traveling dipolarization fronts.
A simple hyperbolic model for communication in parallel processing environments
NASA Technical Reports Server (NTRS)
Stoica, Ion; Sultan, Florin; Keyes, David
1994-01-01
We introduce a model for communication costs in parallel processing environments called the 'hyperbolic model,' which generalizes two-parameter dedicated-link models in an analytically simple way. Dedicated interprocessor links parameterized by a latency and a transfer rate that are independent of load are assumed by many existing communication models; such models are unrealistic for workstation networks. The communication system is modeled as a directed communication graph in which terminal nodes represent the application processes that initiate the sending and receiving of the information and in which internal nodes, called communication blocks (CBs), reflect the layered structure of the underlying communication architecture. The direction of graph edges specifies the flow of the information carried through messages. Each CB is characterized by a two-parameter hyperbolic function of the message size that represents the service time needed for processing the message. The parameters are evaluated in the limits of very large and very small messages. Rules are given for reducing a communication graph consisting of many to an equivalent two-parameter form, while maintaining an approximation for the service time that is exact in both large and small limits. The model is validated on a dedicated Ethernet network of workstations by experiments with communication subprograms arising in scientific applications, for which a tight fit of the model predictions with actual measurements of the communication and synchronization time between end processes is demonstrated. The model is then used to evaluate the performance of two simple parallel scientific applications from partial differential equations: domain decomposition and time-parallel multigrid. In an appropriate limit, we also show the compatibility of the hyperbolic model with the recently proposed LogP model.
Hybrid Parallel-Slant Hole Collimators for SPECT Imaging
NASA Astrophysics Data System (ADS)
Bai, Chuanyong; Shao, Ling; Ye, Jinghan; Durbin, M.; Petrillo, M.
2004-06-01
We propose a new collimator geometry, the hybrid parallel-slant (HPS) hole geometry, to improve sensitivity for SPECT imaging with large field of view (LFOV) gamma cameras. A HPS collimator has one segment with parallel holes and one or more segments with slant holes. The collimator can be mounted on a conventional SPECT LFOV system that uses parallel-beam collimators, and no additional detector or collimator motion is required for data acquisition. The parallel segment of the collimator allows for the acquisition of a complete data set of the organs-of-interest and the slant segments provide additional data. In this work, simulation studies of an MCAT phantom were performed with a HPS collimator with one slant segment. The slant direction points from patient head to patient feet with a slant angle of 30/spl deg/. We simulated 64 projection views over 180/spl deg/ with the modeling of nonuniform attenuation effect, and then reconstructed images using an MLEM algorithm that incorporated the hybrid geometry. It was shown that sensitivity to the cardiac region of the phantom was increased by approximately 50% when using the HPS collimator compared with a parallel-hole collimator. No visible artifacts were observed in the myocardium and the signal-to-noise ratio (SNR) of the myocardium walls was improved. Compared with collimators with other geometries, using a HPS collimator has the following advantages: (a) significant sensitivity increase; (b) a complete data set obtained from the parallel segment that allows for artifact-free image reconstruction; and (c) no additional collimator or detector motion. This work demonstrates the potential value of hybrid geometry in collimator design for LFOV SPECT imaging.
Parallelized direct execution simulation of message-passing parallel programs
NASA Technical Reports Server (NTRS)
Dickens, Phillip M.; Heidelberger, Philip; Nicol, David M.
1994-01-01
As massively parallel computers proliferate, there is growing interest in findings ways by which performance of massively parallel codes can be efficiently predicted. This problem arises in diverse contexts such as parallelizing computers, parallel performance monitoring, and parallel algorithm development. In this paper we describe one solution where one directly executes the application code, but uses a discrete-event simulator to model details of the presumed parallel machine such as operating system and communication network behavior. Because this approach is computationally expensive, we are interested in its own parallelization specifically the parallelization of the discrete-event simulator. We describe methods suitable for parallelized direct execution simulation of message-passing parallel programs, and report on the performance of such a system, Large Application Parallel Simulation Environment (LAPSE), we have built on the Intel Paragon. On all codes measured to date, LAPSE predicts performance well typically within 10 percent relative error. Depending on the nature of the application code, we have observed low slowdowns (relative to natively executing code) and high relative speedups using up to 64 processors.
Rideaux, Reuben; Apthorp, Deborah; Edwards, Mark
2015-02-12
Recent findings have indicated the capacity to consolidate multiple items into visual short-term memory in parallel varies as a function of the type of information. That is, while color can be consolidated in parallel, evidence suggests that orientation cannot. Here we investigated the capacity to consolidate multiple motion directions in parallel and reexamined this capacity using orientation. This was achieved by determining the shortest exposure duration necessary to consolidate a single item, then examining whether two items, presented simultaneously, could be consolidated in that time. The results show that parallel consolidation of direction and orientation information is possible, and that parallel consolidation of direction appears to be limited to two. Additionally, we demonstrate the importance of adequate separation between feature intervals used to define items when attempting to consolidate in parallel, suggesting that when multiple items are consolidated in parallel, as opposed to serially, the resolution of representations suffer. Finally, we used facilitation of spatial attention to show that the deterioration of item resolution occurs during parallel consolidation, as opposed to storage. © 2015 ARVO.
NASA Technical Reports Server (NTRS)
Waheed, Abdul; Yan, Jerry
1998-01-01
This paper presents a model to evaluate the performance and overhead of parallelizing sequential code using compiler directives for multiprocessing on distributed shared memory (DSM) systems. With increasing popularity of shared address space architectures, it is essential to understand their performance impact on programs that benefit from shared memory multiprocessing. We present a simple model to characterize the performance of programs that are parallelized using compiler directives for shared memory multiprocessing. We parallelized the sequential implementation of NAS benchmarks using native Fortran77 compiler directives for an Origin2000, which is a DSM system based on a cache-coherent Non Uniform Memory Access (ccNUMA) architecture. We report measurement based performance of these parallelized benchmarks from four perspectives: efficacy of parallelization process; scalability; parallelization overhead; and comparison with hand-parallelized and -optimized version of the same benchmarks. Our results indicate that sequential programs can conveniently be parallelized for DSM systems using compiler directives but realizing performance gains as predicted by the performance model depends primarily on minimizing architecture-specific data locality overhead.
NASA Technical Reports Server (NTRS)
Grossman, Bernard
1999-01-01
Compressible and incompressible versions of a three-dimensional unstructured mesh Reynolds-averaged Navier-Stokes flow solver have been differentiated and resulting derivatives have been verified by comparisons with finite differences and a complex-variable approach. In this implementation, the turbulence model is fully coupled with the flow equations in order to achieve this consistency. The accuracy demonstrated in the current work represents the first time that such an approach has been successfully implemented. The accuracy of a number of simplifying approximations to the linearizations of the residual have been examined. A first-order approximation to the dependent variables in both the adjoint and design equations has been investigated. The effects of a "frozen" eddy viscosity and the ramifications of neglecting some mesh sensitivity terms were also examined. It has been found that none of the approximations yielded derivatives of acceptable accuracy and were often of incorrect sign. However, numerical experiments indicate that an incomplete convergence of the adjoint system often yield sufficiently accurate derivatives, thereby significantly lowering the time required for computing sensitivity information. The convergence rate of the adjoint solver relative to the flow solver has been examined. Inviscid adjoint solutions typically require one to four times the cost of a flow solution, while for turbulent adjoint computations, this ratio can reach as high as eight to ten. Numerical experiments have shown that the adjoint solver can stall before converging the solution to machine accuracy, particularly for viscous cases. A possible remedy for this phenomenon would be to include the complete higher-order linearization in the preconditioning step, or to employ a simple form of mesh sequencing to obtain better approximations to the solution through the use of coarser meshes. An efficient surface parameterization based on a free-form deformation technique has been utilized and the resulting codes have been integrated with an optimization package. Lastly, sample optimizations have been shown for inviscid and turbulent flow over an ONERA M6 wing. Drag reductions have been demonstrated by reducing shock strengths across the span of the wing. In order for large scale optimization to become routine, the benefits of parallel architectures should be exploited. Although the flow solver has been parallelized using compiler directives. The parallel efficiency is under 50 percent. Clearly, parallel versions of the codes will have an immediate impact on the ability to design realistic configurations on fine meshes, and this effort is currently underway.
Propagation of coherent light pulses with PHASE
NASA Astrophysics Data System (ADS)
Bahrdt, J.; Flechsig, U.; Grizzoli, W.; Siewert, F.
2014-09-01
The current status of the software package PHASE for the propagation of coherent light pulses along a synchrotron radiation beamline is presented. PHASE is based on an asymptotic expansion of the Fresnel-Kirchhoff integral (stationary phase approximation) which is usually truncated at the 2nd order. The limits of this approximation as well as possible extensions to higher orders are discussed. The accuracy is benchmarked against a direct integration of the Fresnel-Kirchhoff integral. Long range slope errors of optical elements can be included by means of 8th order polynomials in the optical element coordinates w and l. Only recently, a method for the description of short range slope errors has been implemented. The accuracy of this method is evaluated and examples for realistic slope errors are given. PHASE can be run either from a built-in graphical user interface or from any script language. The latter method provides substantial flexibility. Optical elements including apertures can be combined. Complete wave packages can be propagated, as well. Fourier propagators are included in the package, thus, the user may choose between a variety of propagators. Several means to speed up the computation time were tested - among them are the parallelization in a multi core environment and the parallelization on a cluster.
Parallel Implementation of a High Order Implicit Collocation Method for the Heat Equation
NASA Technical Reports Server (NTRS)
Kouatchou, Jules; Halem, Milton (Technical Monitor)
2000-01-01
We combine a high order compact finite difference approximation and collocation techniques to numerically solve the two dimensional heat equation. The resulting method is implicit arid can be parallelized with a strategy that allows parallelization across both time and space. We compare the parallel implementation of the new method with a classical implicit method, namely the Crank-Nicolson method, where the parallelization is done across space only. Numerical experiments are carried out on the SGI Origin 2000.
On the suitability of the connection machine for direct particle simulation
NASA Technical Reports Server (NTRS)
Dagum, Leonard
1990-01-01
The algorithmic structure was examined of the vectorizable Stanford particle simulation (SPS) method and the structure is reformulated in data parallel form. Some of the SPS algorithms can be directly translated to data parallel, but several of the vectorizable algorithms have no direct data parallel equivalent. This requires the development of new, strictly data parallel algorithms. In particular, a new sorting algorithm is developed to identify collision candidates in the simulation and a master/slave algorithm is developed to minimize communication cost in large table look up. Validation of the method is undertaken through test calculations for thermal relaxation of a gas, shock wave profiles, and shock reflection from a stationary wall. A qualitative measure is provided of the performance of the Connection Machine for direct particle simulation. The massively parallel architecture of the Connection Machine is found quite suitable for this type of calculation. However, there are difficulties in taking full advantage of this architecture because of lack of a broad based tradition of data parallel programming. An important outcome of this work has been new data parallel algorithms specifically of use for direct particle simulation but which also expand the data parallel diction.
Analysis of Variscan dynamics; early bending of the Cantabria-Asturias Arc, northern Spain
NASA Astrophysics Data System (ADS)
Kollmeier, J. M.; van der Pluijm, B. A.; Van der Voo, R.
2000-08-01
Calcite twinning analysis in the Cantabria-Asturias Arc (CAA) of northern Spain provides a basis for evaluating conditions of Variscan stress and constrains the arc's structural evolution. Twinning typically occurs during earliest layer-parallel shortening, offering the ability to define early conditions of regional stress. Results from the Somiedo-Correcilla region are of two kinds: early maximum compressive stress oriented layer-parallel and at high angles to bedding strike (D1 σ1) and later twin producing compression oriented sub-parallel to strike (D2 σ1). When all D1 compressions are rotated into a uniform east-west reference orientation, a quite linear, north-south trending fold-thrust belt results showing a slight deflection of the southern zone to the south-southeast. North-south-directed D2 σ1 compression was recorded prior to bending of the belt. Calcite twinning data elucidate earliest structural conditions that could not be obtained by other means, whereas the kinematics of arc tightening during D2 is constrained by paleomagnetism. A large and perhaps protracted D2 σ1 is suggested by our results, as manifested by approximately 50% arc tightening prior to acquisition of paleomagnetic remagnetizations throughout the CAA. Early east-west compression (D1 σ1) likely resulted from the Ebro-Aquitaine massif docking to Laurussia whereas the north-directed collision of Africa (D2 σ1) produced clockwise bending in the northern zone, radial folding in the hinge, and rotation of thrusts in the southern zone.
Parallelism Effects and Verb Activation: The Sustained Reactivation Hypothesis
Shapiro, Lewis P.; Love, Tracy
2010-01-01
This study investigated the processes underlying parallelism by evaluating the activation of a parallel element (i.e., a verb) throughout and-coordinated sentences. Four points were tested: (1) approximately 1,600ms after the verb in the first conjunct (PP1), (2) immediately following the conjunction (PP2), (3) approximately 1,100ms after the conjunction (PP3), (4) at the end of the second conjunct (PP4). The results revealed no activation at PP1, suggesting activation related to the initial presentation had decayed by this point; however, activation was observed at PP2, PP3, and PP4, suggesting the conjunction elicits reactivation that is sustained throughout the second conjunct. These findings support a specific hypothesis about parallelism, the sustained reactivation hypothesis. This hypothesis claims that, in conjoined structures, a cue that is associated with parallelism elicits the reactivation of material from the first conjunct and that this activation is sustained until integration with the second conjunct can be completed. PMID:19774464
Parallelism effects and verb activation: the sustained reactivation hypothesis.
Callahan, Sarah M; Shapiro, Lewis P; Love, Tracy
2010-04-01
This study investigated the processes underlying parallelism by evaluating the activation of a parallel element (i.e., a verb) throughout and-coordinated sentences. Four points were tested: (1) approximately 1,600 ms after the verb in the first conjunct (PP1), (2) immediately following the conjunction (PP2), (3) approximately 1,100 ms after the conjunction (PP3), (4) at the end of the second conjunct (PP4). The results revealed no activation at PP1, suggesting activation related to the initial presentation had decayed by this point; however, activation was observed at PP2, PP3, and PP4, suggesting the conjunction elicits reactivation that is sustained throughout the second conjunct. These findings support a specific hypothesis about parallelism, the sustained reactivation hypothesis. This hypothesis claims that, in conjoined structures, a cue that is associated with parallelism elicits the reactivation of material from the first conjunct and that this activation is sustained until integration with the second conjunct can be completed.
NASA Astrophysics Data System (ADS)
Sheykina, Nadiia; Bogatina, Nina
The following variants of roots location relatively to static and alternative components of magnetic field were studied. At first variant the static magnetic field was directed parallel to the gravitation vector, the alternative magnetic field was directed perpendicular to static one; roots were directed perpendicular to both two fields’ components and gravitation vector. At the variant the negative gravitropysm for cress roots was observed. At second variant the static magnetic field was directed parallel to the gravitation vector, the alternative magnetic field was directed perpendicular to static one; roots were directed parallel to alternative magnetic field. At third variant the alternative magnetic field was directed parallel to the gravitation vector, the static magnetic field was directed perpendicular to the gravitation vector, roots were directed perpendicular to both two fields components and gravitation vector; At forth variant the alternative magnetic field was directed parallel to the gravitation vector, the static magnetic field was directed perpendicular to the gravitation vector, roots were directed parallel to static magnetic field. In all cases studied the alternative magnetic field frequency was equal to Ca ions cyclotron frequency. In 2, 3 and 4 variants gravitropism was positive. But the gravitropic reaction speeds were different. In second and forth variants the gravitropic reaction speed in error limits coincided with the gravitropic reaction speed under Earth’s conditions. At third variant the gravitropic reaction speed was slowed essentially.
Large-scale trench-perpendicular mantle flow beneath northern Chile
NASA Astrophysics Data System (ADS)
Reiss, M. C.; Rumpker, G.; Woelbern, I.
2017-12-01
We investigate the anisotropic properties of the forearc region of the central Andean margin by analyzing shear-wave splitting from teleseismic and local earthquakes from the Nazca slab. The data stems from the Integrated Plate boundary Observatory Chile (IPOC) located in northern Chile, covering an approximately 120 km wide coastal strip between 17°-25° S with an average station spacing of 60 km. With partly over ten years of data, this data set is uniquely suited to address the long-standing debate about the mantle flow field at the South American margin and in particular whether the flow field beneath the slab is parallel or perpendicular to the trench. Our measurements yield two distinct anisotropic layers. The teleseismic measurements show a change of fast polarizations directions from North to South along the trench ranging from parallel to subparallel to the absolute plate motion and, given the geometry of absolute plate motion and strike of the trench, mostly perpendicular to the trench. Shear-wave splitting from local earthquakes shows fast polarizations roughly aligned trench-parallel but exhibit short-scale variations which are indicative of a relatively shallow source. Comparisons between fast polarization directions and the strike of the local fault systems yield a good agreement. We use forward modelling to test the influence of the upper layer on the teleseismic measurements. We show that the observed variations of teleseismic measurements along the trench are caused by the anisotropy in the upper layer. Accordingly, the mantle layer is best characterized by an anisotropic fast axes parallel to the absolute plate motion which is roughly trench-perpendicular. This anisotropy is likely caused by a combination of crystallographic preferred orientation of the mantle mineral olivine as fossilized anisotropy in the slab and entrained flow beneath the slab. We interpret the upper anisotropic layer to be confined to the crust of the overriding continental plate. This is explained by the shape-preferred orientation of micro-cracks in relation to local fault zones which are oriented parallel the overall strike of the Andean range. Our results do not provide any evidence for a significant contribution of trench-parallel mantle flow beneath the subducting slab to the measurements.
Earthquake focal mechanisms and the intraplate setting of the Bermuda Rise
NASA Astrophysics Data System (ADS)
Nishenko, S. P.; Kafka, A. L.
1982-05-01
A number of intraplate earthquakes occurring in the western North Atlantic Ocean are located near the perimeter of the Bermuda rise. Focal mechanisms and depths of two earthquakes, November 24, 1976 (mb 5.1; M0 = 2.96 × 1023 dyne cm) and March 24, 1978 (mb 6.1; M0 = 3.58 × 1025 dyne cm), were determined using Rayleigh wave amplitude data in the period range 20-50 s. The 1978 earthquake occurred approximately 380 km southwest of Bermuda, near magnetic anomaly M4 (≈118 m.y. B.P.). The focal mechanism for the 1978 event is of thrust type and has nodal planes striking 340°. The depth of this event is 6 km below the seafloor, near the local depth to Mono. The strike of the fault planes does not parallel the trends of either fracture zones (300°) or magnetic lineations (035°) in the area. The fault planes do, however, parallel the strike of a magnetic gradient in the epicentral area. The 1976 earthquake occurred approximately 300 km northeast of Bermuda, near Muir seamount. The depth of this event is 10 km below the seafloor. The available data are suggestive of one nodal plane striking between 320° and 340° and nearly parallel to the trend of Muir seamount and other volcanic features in the region. In contrast to the 1978 event, the 1976 earthquake appears to exhibit a significant component of strike slip motion. P axes of both mechanisms are subparallel to the direction of absolute plate motion for North America. We suggest, however, that strain release in the Bermuda rise area is not occurring along major fracture zones or topography parallel to seafloor spreading anomalies but rather on smaller-scale structures. The stresses induced by variations of crustal thickness may be responsible for triggering intraplate seismicity in this region.
Bender, Donald A.; Kuklo, Thomas
1994-01-01
An optical mount, which directs a laser beam to a point by controlling the position of a light-transmitting optic, is stiffened so that a lowest resonant frequency of the mount is approximately one kilohertz. The optical mount, which is cylindrically-shaped, positions the optic by individually moving a plurality of carriages which are positioned longitudinally within a sidewall of the mount. The optical mount is stiffened by allowing each carriage, which is attached to the optic, to move only in a direction which is substantially parallel to a center axis of the optic. The carriage is limited to an axial movement by flexures or linear bearings which connect the carriage to the mount. The carriage is moved by a piezoelectric transducer. By limiting the carriage to axial movement, the optic can be kinematically clamped to a carriage.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Franci, Luca; INFN-Sezione di Firenze, Via G. Sansone 1, I-50019 Sesto F.no; Hellinger, Petr, E-mail: petr.hellinger@asu.cas.cz
2016-03-25
Proton temperature anisotropies between the directions parallel and perpendicular to the mean magnetic field are usually observed in the solar wind plasma. Here, we employ a high-resolution hybrid particle-in-cell simulation in order to investigate the relation between spatial properties of the proton temperature and the peaks in the current density and in the flow vorticity. Our results indicate that, although regions where the proton temperature is enhanced and temperature anisotropies are larger correspond approximately to regions where many thin current sheets form, no firm quantitative evidence supports the idea of a direct causality between the two phenomena. On the othermore » hand, quite a clear correlation between the behavior of the proton temperature and the out-of-plane vorticity is obtained.« less
Anandakrishnan, Ramu; Scogland, Tom R W; Fenley, Andrew T; Gordon, John C; Feng, Wu-chun; Onufriev, Alexey V
2010-06-01
Tools that compute and visualize biomolecular electrostatic surface potential have been used extensively for studying biomolecular function. However, determining the surface potential for large biomolecules on a typical desktop computer can take days or longer using currently available tools and methods. Two commonly used techniques to speed-up these types of electrostatic computations are approximations based on multi-scale coarse-graining and parallelization across multiple processors. This paper demonstrates that for the computation of electrostatic surface potential, these two techniques can be combined to deliver significantly greater speed-up than either one separately, something that is in general not always possible. Specifically, the electrostatic potential computation, using an analytical linearized Poisson-Boltzmann (ALPB) method, is approximated using the hierarchical charge partitioning (HCP) multi-scale method, and parallelized on an ATI Radeon 4870 graphical processing unit (GPU). The implementation delivers a combined 934-fold speed-up for a 476,040 atom viral capsid, compared to an equivalent non-parallel implementation on an Intel E6550 CPU without the approximation. This speed-up is significantly greater than the 42-fold speed-up for the HCP approximation alone or the 182-fold speed-up for the GPU alone. Copyright (c) 2010 Elsevier Inc. All rights reserved.
Ha, S; Matej, S; Ispiryan, M; Mueller, K
2013-02-01
We describe a GPU-accelerated framework that efficiently models spatially (shift) variant system response kernels and performs forward- and back-projection operations with these kernels for the DIRECT (Direct Image Reconstruction for TOF) iterative reconstruction approach. Inherent challenges arise from the poor memory cache performance at non-axis aligned TOF directions. Focusing on the GPU memory access patterns, we utilize different kinds of GPU memory according to these patterns in order to maximize the memory cache performance. We also exploit the GPU instruction-level parallelism to efficiently hide long latencies from the memory operations. Our experiments indicate that our GPU implementation of the projection operators has slightly faster or approximately comparable time performance than FFT-based approaches using state-of-the-art FFTW routines. However, most importantly, our GPU framework can also efficiently handle any generic system response kernels, such as spatially symmetric and shift-variant as well as spatially asymmetric and shift-variant, both of which an FFT-based approach cannot cope with.
Large amplitude MHD waves upstream of the Jovian bow shock
NASA Technical Reports Server (NTRS)
Goldstein, M. L.; Smith, C. W.; Matthaeus, W. H.
1983-01-01
Observations of large amplitude magnetohydrodynamics (MHD) waves upstream of Jupiter's bow shock are analyzed. The waves are found to be right circularly polarized in the solar wind frame which suggests that they are propagating in the fast magnetosonic mode. A complete spectral and minimum variance eigenvalue analysis of the data was performed. The power spectrum of the magnetic fluctuations contains several peaks. The fluctuations at 2.3 mHz have a direction of minimum variance along the direction of the average magnetic field. The direction of minimum variance of these fluctuations lies at approximately 40 deg. to the magnetic field and is parallel to the radial direction. We argue that these fluctuations are waves excited by protons reflected off the Jovian bow shock. The inferred speed of the reflected protons is about two times the solar wind speed in the plasma rest frame. A linear instability analysis is presented which suggests an explanation for many of the observed features of the observations.
NASA Astrophysics Data System (ADS)
Ha, S.; Matej, S.; Ispiryan, M.; Mueller, K.
2013-02-01
We describe a GPU-accelerated framework that efficiently models spatially (shift) variant system response kernels and performs forward- and back-projection operations with these kernels for the DIRECT (Direct Image Reconstruction for TOF) iterative reconstruction approach. Inherent challenges arise from the poor memory cache performance at non-axis aligned TOF directions. Focusing on the GPU memory access patterns, we utilize different kinds of GPU memory according to these patterns in order to maximize the memory cache performance. We also exploit the GPU instruction-level parallelism to efficiently hide long latencies from the memory operations. Our experiments indicate that our GPU implementation of the projection operators has slightly faster or approximately comparable time performance than FFT-based approaches using state-of-the-art FFTW routines. However, most importantly, our GPU framework can also efficiently handle any generic system response kernels, such as spatially symmetric and shift-variant as well as spatially asymmetric and shift-variant, both of which an FFT-based approach cannot cope with.
Login, I S; Pal, S N; Adams, D T; Gold, P E
1998-01-01
Because GabaA ligands increase acetylcholine (ACh) release from adult striatal slices, we hypothesized that activation of GabaA receptors on striatal cholinergic interneurons directly stimulates ACh secretion. Fractional [3H]ACh release was recorded during perifusion of acutely dissociated, [3H]choline-labeled, adult male rat striata. The GabaA agonist, muscimol, immediately stimulated release maximally approximately 300% with EC50 = approximately 1 microM. This action was enhanced by the allosteric GabaA receptor modulators, diazepam and secobarbital, and inhibited by the GabaA antagonist, bicuculline, by ligands for D2 or muscarinic cholinergic receptors or by low calcium buffer, tetrodotoxin or vesamicol. Membrane depolarization inversely regulated muscimol-stimulated secretion. Release of endogenous and newly synthesized ACh was stimulated in parallel by muscimol without changing choline release. Muscimol pretreatment inhibited release evoked by K+ depolarization or by receptor-mediated stimulation with glutamate. Thus, GabaA receptors on adult striatal cholinergic interneurons directly stimulate voltage- and calcium-dependent exocytosis of ACh stored in vesamicol-sensitive synaptic vesicles. The action depends on the state of membrane polarization and apparently depolarizes the membrane in turn. This functional assay demonstrates that excitatory GabaA actions are not limited to neonatal tissues. GabaA-stimulated ACh release may be prevented in situ by normal tonic dopaminergic and muscarinic input to cholinergic neurons.
NASA Technical Reports Server (NTRS)
Ierotheou, C.; Johnson, S.; Leggett, P.; Cross, M.; Evans, E.; Jin, Hao-Qiang; Frumkin, M.; Yan, J.; Biegel, Bryan (Technical Monitor)
2001-01-01
The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. Historically, the lack of a programming standard for using directives and the rather limited performance due to scalability have affected the take-up of this programming model approach. Significant progress has been made in hardware and software technologies, as a result the performance of parallel programs with compiler directives has also made improvements. The introduction of an industrial standard for shared-memory programming with directives, OpenMP, has also addressed the issue of portability. In this study, we have extended the computer aided parallelization toolkit (developed at the University of Greenwich), to automatically generate OpenMP based parallel programs with nominal user assistance. We outline the way in which loop types are categorized and how efficient OpenMP directives can be defined and placed using the in-depth interprocedural analysis that is carried out by the toolkit. We also discuss the application of the toolkit on the NAS Parallel Benchmarks and a number of real-world application codes. This work not only demonstrates the great potential of using the toolkit to quickly parallelize serial programs but also the good performance achievable on up to 300 processors for hybrid message passing and directive-based parallelizations.
Graf, Daniel; Beuerle, Matthias; Schurkus, Henry F; Luenser, Arne; Savasci, Gökcen; Ochsenfeld, Christian
2018-05-08
An efficient algorithm for calculating the random phase approximation (RPA) correlation energy is presented that is as accurate as the canonical molecular orbital resolution-of-the-identity RPA (RI-RPA) with the important advantage of an effective linear-scaling behavior (instead of quartic) for large systems due to a formulation in the local atomic orbital space. The high accuracy is achieved by utilizing optimized minimax integration schemes and the local Coulomb metric attenuated by the complementary error function for the RI approximation. The memory bottleneck of former atomic orbital (AO)-RI-RPA implementations ( Schurkus, H. F.; Ochsenfeld, C. J. Chem. Phys. 2016 , 144 , 031101 and Luenser, A.; Schurkus, H. F.; Ochsenfeld, C. J. Chem. Theory Comput. 2017 , 13 , 1647 - 1655 ) is addressed by precontraction of the large 3-center integral matrix with the Cholesky factors of the ground state density reducing the memory requirements of that matrix by a factor of [Formula: see text]. Furthermore, we present a parallel implementation of our method, which not only leads to faster RPA correlation energy calculations but also to a scalable decrease in memory requirements, opening the door for investigations of large molecules even on small- to medium-sized computing clusters. Although it is known that AO methods are highly efficient for extended systems, where sparsity allows for reaching the linear-scaling regime, we show that our work also extends the applicability when considering highly delocalized systems for which no linear scaling can be achieved. As an example, the interlayer distance of two covalent organic framework pore fragments (comprising 384 atoms in total) is analyzed.
Parallelization of NAS Benchmarks for Shared Memory Multiprocessors
NASA Technical Reports Server (NTRS)
Waheed, Abdul; Yan, Jerry C.; Saini, Subhash (Technical Monitor)
1998-01-01
This paper presents our experiences of parallelizing the sequential implementation of NAS benchmarks using compiler directives on SGI Origin2000 distributed shared memory (DSM) system. Porting existing applications to new high performance parallel and distributed computing platforms is a challenging task. Ideally, a user develops a sequential version of the application, leaving the task of porting to new generations of high performance computing systems to parallelization tools and compilers. Due to the simplicity of programming shared-memory multiprocessors, compiler developers have provided various facilities to allow the users to exploit parallelism. Native compilers on SGI Origin2000 support multiprocessing directives to allow users to exploit loop-level parallelism in their programs. Additionally, supporting tools can accomplish this process automatically and present the results of parallelization to the users. We experimented with these compiler directives and supporting tools by parallelizing sequential implementation of NAS benchmarks. Results reported in this paper indicate that with minimal effort, the performance gain is comparable with the hand-parallelized, carefully optimized, message-passing implementations of the same benchmarks.
Efficient solution of parabolic equations by Krylov approximation methods
NASA Technical Reports Server (NTRS)
Gallopoulos, E.; Saad, Y.
1990-01-01
Numerical techniques for solving parabolic equations by the method of lines is addressed. The main motivation for the proposed approach is the possibility of exploiting a high degree of parallelism in a simple manner. The basic idea of the method is to approximate the action of the evolution operator on a given state vector by means of a projection process onto a Krylov subspace. Thus, the resulting approximation consists of applying an evolution operator of a very small dimension to a known vector which is, in turn, computed accurately by exploiting well-known rational approximations to the exponential. Because the rational approximation is only applied to a small matrix, the only operations required with the original large matrix are matrix-by-vector multiplications, and as a result the algorithm can easily be parallelized and vectorized. Some relevant approximation and stability issues are discussed. We present some numerical experiments with the method and compare its performance with a few explicit and implicit algorithms.
Automatic Generation of OpenMP Directives and Its Application to Computational Fluid Dynamics Codes
NASA Technical Reports Server (NTRS)
Yan, Jerry; Jin, Haoqiang; Frumkin, Michael; Yan, Jerry (Technical Monitor)
2000-01-01
The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. As great progress was made in hardware and software technologies, performance of parallel programs with compiler directives has demonstrated large improvement. The introduction of OpenMP directives, the industrial standard for shared-memory programming, has minimized the issue of portability. In this study, we have extended CAPTools, a computer-aided parallelization toolkit, to automatically generate OpenMP-based parallel programs with nominal user assistance. We outline techniques used in the implementation of the tool and discuss the application of this tool on the NAS Parallel Benchmarks and several computational fluid dynamics codes. This work demonstrates the great potential of using the tool to quickly port parallel programs and also achieve good performance that exceeds some of the commercial tools.
Parallelizing alternating direction implicit solver on GPUs
USDA-ARS?s Scientific Manuscript database
We present a parallel Alternating Direction Implicit (ADI) solver on GPUs. Our implementation significantly improves existing implementations in two aspects. First, we address the scalability issue of existing Parallel Cyclic Reduction (PCR) implementations by eliminating their hardware resource con...
Recent Advances in 3D Time-Resolved Contrast-Enhanced MR Angiography
Riederer, Stephen J.; Haider, Clifton R.; Borisch, Eric A.; Weavers, Paul T.; Young, Phillip M.
2015-01-01
Contrast-enhanced MR angiography (CE-MRA) was first introduced for clinical studies approximately 20 years ago. Early work provided 3 to 4 mm spatial resolution with acquisition times in the 30 sec range. Since that time there has been continuing effort to provide improved spatial resolution with reduced acquisition time, allowing high resolution three-dimensional (3D) time-resolved studies. The purpose of this work is to describe how this has been accomplished. Specific technical enablers have been: improved gradients allowing reduced repetition times, improved k-space sampling and reconstruction methods, parallel acquisition particularly in two directions, and improved and higher count receiver coil arrays. These have collectively made high resolution time-resolved studies readily available for many anatomic regions. Depending on the application, approximate 1 mm isotropic resolution is now possible with frame times of several seconds. Clinical applications of time-resolved CE-MRA are briefly reviewed. PMID:26032598
NASA Astrophysics Data System (ADS)
Mughnetsyan, V. N.; Barseghyan, M. G.; Kirakosyan, A. A.
2008-01-01
We consider the photoionization of a hydrogen-like impurity centre in a quantum wire approximated by a cylindrical well of finite depth in a magnetic field directed along the wire axis. The ground state energy and the wave function of the electron localized on on-axis impurity centre are calculated using the variational method. The wave functions and energies of the final states in an one-dimensional conduction subband are also presented. The dependences of photoionization cross-section of a donor centre on magnetic field and frequency of incident radiation both for parallel and perpendicular polarizations and corresponding selection rules for the allowed transitions are found in the dipole approximation. The estimates of photoionization cross-section for various values of wire radius and magnetic field induction for GaAs quantum wire embedded in Ga 1-xAl 1-xAs matrix are given.
Edge Vortex Flow Due to Inhomogeneous Ion Concentration
NASA Astrophysics Data System (ADS)
Sugioka, Hideyuki
2017-04-01
The ion distribution of an open parallel electrode system is not known even though it is often used to measure the electrical characteristics of an electrolyte. Thus, for an open electrode system, we perform a non-steady direct multiphysics simulation based on the coupled Poisson-Nernst-Planck and Stokes equations and find that inhomogeneous ion concentrations at edges cause vortex flows and suppress the anomalous increase in the ion concentration near the electrodes. A surprising aspect of our findings is that the large vortex flows at the edges approximately maintain the ion-conserving condition, and thus the ion distribution of an open electrode system can be approximated by the solution of a closed electrode system that considers the ion-conserving condition rather than the Gouy-Chapman solution, which neglects the ion-conserving condition. We believe that our findings make a significant contribution to the understanding of surface science.
Parallel Plate System for Collecting Data Used to Determine Viscosity
NASA Technical Reports Server (NTRS)
Ethridge, Edwin C. (Inventor); Kaukler, William (Inventor)
2013-01-01
A parallel-plate system collects data used to determine viscosity. A first plate is coupled to a translator so that the first plate can be moved along a first direction. A second plate has a pendulum device coupled thereto such that the second plate is suspended above and parallel to the first plate. The pendulum device constrains movement of the second plate to a second direction that is aligned with the first direction and is substantially parallel thereto. A force measuring device is coupled to the second plate for measuring force along the second direction caused by movement of the second plate.
Deformational sequence of a portion of the Michipicoten greenstone belt, Chabanel Township, Ontario
NASA Technical Reports Server (NTRS)
Shrady, C. H.; Mcgill, G. E.
1986-01-01
Detailed mapping at a scale of one inch = 400 feet is being carried out within a fume kill, having excellent exposure, located in the southwestern portion of the Michipicoten Greenstone Belt near Wawa, Ontario. The rocks are metasediments and metavolcanics of lower greenschist facies. U-Pb geochronology indicates that they are at least 2698 + or - 11 Ma old. The lithologic packages strike northeast to northwest, but the dominant strike is approximately east-west. Sedimentary structures and graded bedding are well preserved, aiding in the structural interpretation of this multiply deformed area. At least six phases of deformation within a relatively small area of the Michipicoten Greenstone Belt have been tentatively identified. These include the following structural features in approximate order of occurrence: (0) soft-sediment structures; (1) regionally overturned rocks, flattened pebbles, bedding parallel cleavage, and early, approximately bedding parallel faults; (2) northwest to north striking cleavage; (3) northeast striking cleavage and associated folds, and at least some late movement on approximately bedding parallel faults; (4) north-northwest and northeast trending faults; and (5) diabase dikes and associated fracture cleavages. Minor displacement of the diabase dikes occurs on faults that appear to be reactivated older structures.
The cost of parallel consolidation into visual working memory.
Rideaux, Reuben; Edwards, Mark
2016-01-01
A growing body of evidence indicates that information can be consolidated into visual working memory in parallel. Initially, it was suggested that color information could be consolidated in parallel while orientation was strictly limited to serial consolidation (Liu & Becker, 2013). However, we recently found evidence suggesting that both orientation and motion direction items can be consolidated in parallel, with different levels of accuracy (Rideaux, Apthorp, & Edwards, 2015). Here we examine whether there is a cost associated with parallel consolidation of orientation and direction information by comparing performance, in terms of precision and guess rate, on a target recall task where items are presented either sequentially or simultaneously. The results compellingly indicate that motion direction can be consolidated in parallel, but the evidence for orientation is less conclusive. Further, we find that there is a twofold cost associated with parallel consolidation of direction: Both the probability of failing to consolidate one (or both) item/s increases and the precision at which representations are encoded is reduced. Additionally, we find evidence indicating that the increased consolidation failure may be due to interference between items presented simultaneously, and is moderated by item similarity. These findings suggest that a biased competition model may explain differences in parallel consolidation between features.
A force-based, parallel assay for the quantification of protein-DNA interactions.
Limmer, Katja; Pippig, Diana A; Aschenbrenner, Daniela; Gaub, Hermann E
2014-01-01
Analysis of transcription factor binding to DNA sequences is of utmost importance to understand the intricate regulatory mechanisms that underlie gene expression. Several techniques exist that quantify DNA-protein affinity, but they are either very time-consuming or suffer from possible misinterpretation due to complicated algorithms or approximations like many high-throughput techniques. We present a more direct method to quantify DNA-protein interaction in a force-based assay. In contrast to single-molecule force spectroscopy, our technique, the Molecular Force Assay (MFA), parallelizes force measurements so that it can test one or multiple proteins against several DNA sequences in a single experiment. The interaction strength is quantified by comparison to the well-defined rupture stability of different DNA duplexes. As a proof-of-principle, we measured the interaction of the zinc finger construct Zif268/NRE against six different DNA constructs. We could show the specificity of our approach and quantify the strength of the protein-DNA interaction.
NASA Astrophysics Data System (ADS)
Li, Xinhua; Song, Zhenyu; Zhan, Yongjie; Wu, Qiongzhi
2009-12-01
Since the system capacity is severely limited, reducing the multiple access interfere (MAI) is necessary in the multiuser direct-sequence code division multiple access (DS-CDMA) system which is used in the telecommunication terminals data-transferred link system. In this paper, we adopt an adaptive multistage parallel interference cancellation structure in the demodulator based on the least mean square (LMS) algorithm to eliminate the MAI on the basis of overviewing various of multiuser dectection schemes. Neither a training sequence nor a pilot signal is needed in the proposed scheme, and its implementation complexity can be greatly reduced by a LMS approximate algorithm. The algorithm and its FPGA implementation is then derived. Simulation results of the proposed adaptive PIC can outperform some of the existing interference cancellation methods in AWGN channels. The hardware setup of mutiuser demodulator is described, and the experimental results based on it demonstrate that the simulation results shows large performance gains over the conventional single-user demodulator.
Electrostatic ion-cyclotron waves in a nonuniform magnetic field
NASA Technical Reports Server (NTRS)
Cartier, S. L.; Dangelo, N.; Merlino, R. L.
1985-01-01
The properties of electrostatic ion-cyclotron waves excited in a single-ended cesium Q machine with a nonuniform magnetic field are described. The electrostatic ion-cyclotron waves are generated in the usual manner by drawing an electron current to a small exciter disk immersed in the plasma column. The parallel and perpendicular (to B) wavelengths and phase velocities are determined by mapping out two-dimensional wave phase contours. The wave frequency f depends on the location of the exciter disk in the nonuniform magnetic field, and propagating waves are only observed in the region where f is approximately greater than fci, where fci is the local ion-cyclotron frequency. The parallel phase velocity is in the direction of the electron drift. From measurements of the plasma properties along the axis, it is inferred that the electron drift velocity is not uniform along the entire current channel. The evidence suggests that the waves begin being excited at that axial position where the critical drift velocity is first exceeded, consistent with a current-driven excitation mechanism.
Proactive action preparation: seeing action preparation as a continuous and proactive process.
Pezzulo, Giovanni; Ognibene, Dimitri
2012-07-01
In this paper, we aim to elucidate the processes that occur during action preparation from both a conceptual and a computational point of view. We first introduce the traditional, serial model of goal-directed action and discuss from a computational viewpoint its subprocesses occurring during the two phases of covert action preparation and overt motor control. Then, we discuss recent evidence indicating that these subprocesses are highly intertwined at representational and neural levels, which undermines the validity of the serial model and points instead to a parallel model of action specification and selection. Within the parallel view, we analyze the case of delayed choice, arguing that action preparation can be proactive, and preparatory processes can take place even before decisions are made. Specifically, we discuss how prior knowledge and prospective abilities can be used to maximize utility even before deciding what to do. To support our view, we present a computational implementation of (an approximated version of) proactive action preparation, showing its advantages in a simulated tennis-like scenario.
Code of Federal Regulations, 2011 CFR
2011-04-01
... northeast approximately 16.5 miles along the Missouri River Pacific Railroad, as it parallels the Missouri... southeast approximately 8.5 miles to the intersection Big Berger Creek. (4) Then southwest along the winding course of Big Berger Creek for approximately 20 miles (eight miles due southwest) to Township line T.44...
Code of Federal Regulations, 2014 CFR
2014-04-01
... northeast approximately 16.5 miles along the Missouri River Pacific Railroad, as it parallels the Missouri... southeast approximately 8.5 miles to the intersection Big Berger Creek. (4) Then southwest along the winding course of Big Berger Creek for approximately 20 miles (eight miles due southwest) to Township line T.44...
Code of Federal Regulations, 2012 CFR
2012-04-01
... northeast approximately 16.5 miles along the Missouri River Pacific Railroad, as it parallels the Missouri... southeast approximately 8.5 miles to the intersection Big Berger Creek. (4) Then southwest along the winding course of Big Berger Creek for approximately 20 miles (eight miles due southwest) to Township line T.44...
Code of Federal Regulations, 2013 CFR
2013-04-01
... northeast approximately 16.5 miles along the Missouri River Pacific Railroad, as it parallels the Missouri... southeast approximately 8.5 miles to the intersection Big Berger Creek. (4) Then southwest along the winding course of Big Berger Creek for approximately 20 miles (eight miles due southwest) to Township line T.44...
Application of a Phase-resolving, Directional Nonlinear Spectral Wave Model
NASA Astrophysics Data System (ADS)
Davis, J. R.; Sheremet, A.; Tian, M.; Hanson, J. L.
2014-12-01
We describe several applications of a phase-resolving, directional nonlinear spectral wave model. The model describes a 2D surface gravity wave field approaching a mildly sloping beach with parallel depth contours at an arbitrary angle accounting for nonlinear, quadratic triad interactions. The model is hyperbolic, with the initial wave spectrum specified in deep water. Complex amplitudes are generated based on the random phase approximation. The numerical implementation includes unidirectional propagation as a special case. In directional mode, it solves the system of equations in the frequency-alongshore wave number space. Recent enhancements of the model include the incorporation of dissipation caused by breaking and propagation over a viscous mud layer and the calculation of wave induced setup. Applications presented include: a JONSWAP spectrum with a cos2s directional distribution, for shore-perpendicular and oblique propagation, a study of the evolution of a single directional triad, and several preliminary comparisons to wave spectra collected at the USACE-FRF in Duck, NC which show encouraging results although further validation with a wider range of beach slopes and wave conditions is needed.
Perception of straightness and parallelism with minimal distance information.
Rogers, Brian; Naumenko, Olga
2016-07-01
The ability of human observers to judge the straightness and parallelism of extended lines has been a neglected topic of study since von Helmholtz's initial observations 150 years ago. He showed that there were significant misperceptions of the straightness of extended lines seen in the peripheral visual field. The present study focused on the perception of extended lines (spanning 90° visual angle) that were directly fixated in the visual environment of a planetarium where there was only minimal information about the distance to the lines. Observers were asked to vary the curvature of 1 or more lines until they appeared to be straight and/or parallel, ignoring any perceived curvature in depth. When the horizon between the ground and the sky was visible, the results showed that observers' judgements of the straightness of a single line were significantly biased away from the veridical, great circle locations, and towards equal elevation settings. Similar biases can be seen in the jet trails of aircraft flying across the sky and in Rogers and Anstis's new moon illusion (Perception, 42(Abstract supplement) 18, 2013, 2016). The biasing effect of the horizon was much smaller when observers were asked to judge the straightness and parallelism of 2 or more extended lines. We interpret the results as showing that, in the absence of adequate distance information, observers tend to perceive the projected lines as lying on an approximately equidistant, hemispherical surface and that their judgements of straightness and parallelism are based on the perceived separation of the lines superimposed on that surface.
Dose in bone and tissue near bone-tissue interface from electron beam.
Shiu, A S; Hogstrom, K R
1991-08-01
This work has quantitatively studied the variation of dose both within bone and in unit density tissue near bone-tissue interfaces. Dose upstream of a bone-tissue interface is increased because of an increase in the backscattered electrons from the bone. The magnitude of this effect was measured using a thin parallel-plate ionization chamber upstream of a polymethyl methacrylate (PMMA)-hard bone interface. The electron backscatter factor (EBF) increased rapidly with bone thickness until a full EBF was achieved. This occurred at approximately 3.5 mm at 2 MeV and 6 mm at 13.1 MeV. The full EBF at the interface ranged from approximately 1.018 at 13.1 MeV to 1.05 at 2 MeV. It was also observed that the EBF had a dependence on the energy spectrum at the interface. The penetration of the backscattered electrons in the upstream direction of PMMA was also measured. The dose penetration fell off rapidly in the upstream direction of the interface. Dose enhancement to unit density tissue in bone was measured for an electron beam by placing thermoluminescent dosimeters (TLDs) in a PMMA-bone-PMMA phantom. The maximum dose enhancement in bone was approximately 7% of the maximum dose in water. However, the pencil-beam algorithm of Hogstrom et al. predicted an increase of only 1%, primarily owing to the inverse-square correction. Film was also used to measure the dose enhancement in bone. The film plane was aligned either perpendicular or parallel to the central axis of the beam. The film data indicated that the maximum dose enhancement in bone was approximately 8% for the former film alignment (which was similarly predicted by the TLD measurements) and 13% for the latter film alignment. These results confirm that the X ray film is not suitable to be irritated "edge on" in an inhomogeneous phantom without making perturbation corrections resulting from the film acting as a long narrow inhomogeneous cavity within the bone. In addition, the results give the radiotherapist a basis for clinical judgment when electron beams are used to treat lesions behind bone or near bony structures. We feel these data enhance the ability to recognize the shortcomings of the current dose calculation algorithm used clinically.
The OpenMP Implementation of NAS Parallel Benchmarks and its Performance
NASA Technical Reports Server (NTRS)
Jin, Hao-Qiang; Frumkin, Michael; Yan, Jerry
1999-01-01
As the new ccNUMA architecture became popular in recent years, parallel programming with compiler directives on these machines has evolved to accommodate new needs. In this study, we examine the effectiveness of OpenMP directives for parallelizing the NAS Parallel Benchmarks. Implementation details will be discussed and performance will be compared with the MPI implementation. We have demonstrated that OpenMP can achieve very good results for parallelization on a shared memory system, but effective use of memory and cache is very important.
NASA Astrophysics Data System (ADS)
Lin, W.; Tadai, O.; Shigematsu, N.; Nishikawa, O.; Mori, H.; Townend, J.; Capova, L.; Saito, S.; Kinoshita, M.
2015-12-01
The Alpine Fault is a mature active fault zone likely to rupture in the near future and DFDP aims to measure physical and chemical conditions within the fault. DFDP-2B borehole was drilled into hanging wall of the Alpine Fault. Downhole temperature measurements carried out in DFDP-2B borehole showed that the geothermal gradient in the hanging wall of the fault is very high, likely reaching to 130-150 °C/km (Sutherland et al., 2015 AGU Fall Meeting). To explain this abnormal feature, the determination of thermal properties of all the rock types in the hanging wall of the Alpine Fault is essential. To measure thermal properties and elastic wave velocities, we collected six typical rock block samples from outcrops in Stony creek and Gaunt creek. These include ultramylonite, mylonite, muscovite schist, garnet amphibolite, protomylonite and schist, which are representative of the hanging wall of the Alpine Fault. Their wet bulk densities are 2.7 - 2.8 g/cm3, and porosities are 1.4 - 3.0%. We prepared a pair of 4 cm cube specimens of each rock type with one flat plane parallel to the foliation. First, we measured thermal conductivity by the transient plane heat source (hot disc) method in a bulk mode, i.e. to deal with the rock as an isotropic material. However, several samples have clearly visible foliation and are likely to be anisotropic. Thus, the data measured in bulk mode provided an average value of the rocks in the range of approximately 2.4 - 3.2 W/mK. The next step will be to measure thermal conductivity in an anisotropic mode. We also measured P wave velocity (Vp) using the same samples, but in two directions, i.e. parallel and perpendicular to the foliation, respectively. Our preliminary results suggested that Vp is anisotropic in all the six rocks. Generally, Vp parallel to foliation is higher than that in the perpendicular direction. Vp in the parallel direction ranged in 5.5 - 6.0 km/s, whereas in the perpendicular direction it was 4.4 - 5.5 km/s. We thank the PIs and onsite staffs of the DFDP-2 project for their helps to collecting rock samples, and the financial support by JSPS (Japan-New Zealand Joint Research Program).
A wavelet approach to binary blackholes with asynchronous multitasking
NASA Astrophysics Data System (ADS)
Lim, Hyun; Hirschmann, Eric; Neilsen, David; Anderson, Matthew; Debuhr, Jackson; Zhang, Bo
2016-03-01
Highly accurate simulations of binary black holes and neutron stars are needed to address a variety of interesting problems in relativistic astrophysics. We present a new method for the solving the Einstein equations (BSSN formulation) using iterated interpolating wavelets. Wavelet coefficients provide a direct measure of the local approximation error for the solution and place collocation points that naturally adapt to features of the solution. Further, they exhibit exponential convergence on unevenly spaced collection points. The parallel implementation of the wavelet simulation framework presented here deviates from conventional practice in combining multi-threading with a form of message-driven computation sometimes referred to as asynchronous multitasking.
Parallel Wavefront Analysis for a 4D Interferometer
NASA Technical Reports Server (NTRS)
Rao, Shanti R.
2011-01-01
This software provides a programming interface for automating data collection with a PhaseCam interferometer from 4D Technology, and distributing the image-processing algorithm across a cluster of general-purpose computers. Multiple instances of 4Sight (4D Technology s proprietary software) run on a networked cluster of computers. Each connects to a single server (the controller) and waits for instructions. The controller directs the interferometer to several images, then assigns each image to a different computer for processing. When the image processing is finished, the server directs one of the computers to collate and combine the processed images, saving the resulting measurement in a file on a disk. The available software captures approximately 100 images and analyzes them immediately. This software separates the capture and analysis processes, so that analysis can be done at a different time and faster by running the algorithm in parallel across several processors. The PhaseCam family of interferometers can measure an optical system in milliseconds, but it takes many seconds to process the data so that it is usable. In characterizing an adaptive optics system, like the next generation of astronomical observatories, thousands of measurements are required, and the processing time quickly becomes excessive. A programming interface distributes data processing for a PhaseCam interferometer across a Windows computing cluster. A scriptable controller program coordinates data acquisition from the interferometer, storage on networked hard disks, and parallel processing. Idle time of the interferometer is minimized. This architecture is implemented in Python and JavaScript, and may be altered to fit a customer s needs.
Langton, Christian M; Wille, Marie-Luise; Flegg, Mark B
2014-04-01
The acceptance of broadband ultrasound attenuation for the assessment of osteoporosis suffers from a limited understanding of ultrasound wave propagation through cancellous bone. It has recently been proposed that the ultrasound wave propagation can be described by a concept of parallel sonic rays. This concept approximates the detected transmission signal to be the superposition of all sonic rays that travel directly from transmitting to receiving transducer. The transit time of each ray is defined by the proportion of bone and marrow propagated. An ultrasound transit time spectrum describes the proportion of sonic rays having a particular transit time, effectively describing lateral inhomogeneity of transit times over the surface of the receiving ultrasound transducer. The aim of this study was to provide a proof of concept that a transit time spectrum may be derived from digital deconvolution of input and output ultrasound signals. We have applied the active-set method deconvolution algorithm to determine the ultrasound transit time spectra in the three orthogonal directions of four cancellous bone replica samples and have compared experimental data with the prediction from the computer simulation. The agreement between experimental and predicted ultrasound transit time spectrum analyses derived from Bland-Altman analysis ranged from 92% to 99%, thereby supporting the concept of parallel sonic rays for ultrasound propagation in cancellous bone. In addition to further validation of the parallel sonic ray concept, this technique offers the opportunity to consider quantitative characterisation of the material and structural properties of cancellous bone, not previously available utilising ultrasound.
Kinetic Alfven turbulence: Electron and ion heating by particle-in-cell simulations
NASA Astrophysics Data System (ADS)
Gary, S. P.; Hughes, R. S.; Wang, J.; Parashar, T. N.
2017-12-01
Three-dimensional particle-in-cell simulations of the forward cascade of decaying kinetic Alfvén turbulence have been carried out as an initial-value problem on a collisionless, homogeneous, magnetized, electron-ion plasma model with betae = betai =0.50 and mi/me=100 where subscripts e and i represent electrons and ions respectively. Initial anisotropic narrowband spectra of relatively long wavelength modes with approximately gyrotropic distributions in kperp undergo a forward cascade to broadband spectra of magnetic fluctuations at shorter wavelengths. Maximum electron and ion heating rates are computed as functions of the initial fluctuating magnetic field energy density eo on the range 0.05 < eo < 0.50. In contrast to dissipation by whistler turbulence, the maximum ion heating rate due to kinetic Alfvén turbulence is substantially greater than the maximum electron heating rate. Furthermore, ion heating as well as electron heating due to kinetic Alfvén turbulence scale approximately with eo. Finally, electron heating leads to anisotropies of the type T||e> Tperpe where the parallel and perpendicular symbols refer to directions parallel and perpendicular, respectively, to the background magnetic field, whereas the heated ions remain relatively isotropic. This implies that, for the range of eo values considered, the Landau wave-particle resonance is a likely heating mechanism for the electrons and may also contribute to ion heating.
Bender, D.A.; Kuklo, T.
1994-11-08
An optical mount, which directs a laser beam to a point by controlling the position of a light-transmitting optic, is stiffened so that a lowest resonant frequency of the mount is approximately one kilohertz. The optical mount, which is cylindrically-shaped, positions the optic by individually moving a plurality of carriages which are positioned longitudinally within a sidewall of the mount. The optical mount is stiffened by allowing each carriage, which is attached to the optic, to move only in a direction which is substantially parallel to a center axis of the optic. The carriage is limited to an axial movement by flexures or linear bearings which connect the carriage to the mount. The carriage is moved by a piezoelectric transducer. By limiting the carriage to axial movement, the optic can be kinematically clamped to a carriage. 5 figs.
Free-flow variability on the Jess and Souza Ranches, Altamont Pass. [Final report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nierenberg, R.
1988-04-25
A central monitoring computer was installed on each ranch. The computers were connected by communication cables to 50 turbines on the Souza Ranch and 150 turbines on the Jess Ranch. Anemometers were installed on every other turbine on 12-foot booms at 35 feet above ground level (AGL). Spacing between anemometers was approximately 200 feet in the crosswind direction by 500 feet in the parallel direction. A total of 23 turbines on the Souza Ranch was instrumented in this fashion, as well as two multi-level meteorological towers. On the Jess Ranch, 77 turbines were instrumented; about half at 35 feet AGLmore » and half at 50 feet AGL, plus four additional towers. Wind data were collected for approximately a 100 hour period on each ranch. All turbines were shut down during these periods so that no turbine wakes would be present. The data periods were selected by the meteorologist to insure that they occurred during typical spring-summer flow regimes. The terrain features upwind of the site appear to play as significant a role in the flow variability as terrain features within the site.« less
Free-flow variability on the Jess and Souza Ranches, Altamont Pass
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nierenberg, R.
1988-04-25
A central monitoring computer was installed on each ranch. The computers were connected by communication cables to 50 turbines on the Souza Ranch and 150 turbines on the Jess Ranch. Anemometers were installed on every other turbine on 12-foot booms at 35 feet above ground level (AGL). Spacing between anemometers was approximately 200 feet in the crosswind direction by 500 feet in the parallel direction. A total of 23 turbines on the Souza Ranch was instrumented in this fashion, as well as two multi-level meteorological towers. On the Jess Ranch, 77 turbines were instrumented; about half at 35 feet AGLmore » and half at 50 feet AGL, plus four additional towers. Wind data were collected for approximately a 100 hour period on each ranch. All turbines were shut down during these periods so that no turbine wakes would be present. The data periods were selected by the meteorologist to insure that they occurred during typical spring-summer flow regimes. The terrain features upwind of the site appear to play as significant a role in the flow variability as terrain features within the site.« less
Cao, Jianfang; Chen, Lichao; Wang, Min; Tian, Yun
2018-01-01
The Canny operator is widely used to detect edges in images. However, as the size of the image dataset increases, the edge detection performance of the Canny operator decreases and its runtime becomes excessive. To improve the runtime and edge detection performance of the Canny operator, in this paper, we propose a parallel design and implementation for an Otsu-optimized Canny operator using a MapReduce parallel programming model that runs on the Hadoop platform. The Otsu algorithm is used to optimize the Canny operator's dual threshold and improve the edge detection performance, while the MapReduce parallel programming model facilitates parallel processing for the Canny operator to solve the processing speed and communication cost problems that occur when the Canny edge detection algorithm is applied to big data. For the experiments, we constructed datasets of different scales from the Pascal VOC2012 image database. The proposed parallel Otsu-Canny edge detection algorithm performs better than other traditional edge detection algorithms. The parallel approach reduced the running time by approximately 67.2% on a Hadoop cluster architecture consisting of 5 nodes with a dataset of 60,000 images. Overall, our approach system speeds up the system by approximately 3.4 times when processing large-scale datasets, which demonstrates the obvious superiority of our method. The proposed algorithm in this study demonstrates both better edge detection performance and improved time performance.
Cao, Jianfang; Cui, Hongyan; Shi, Hao; Jiao, Lijuan
2016-01-01
A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decreases as well. To improve the classification accuracy and runtime efficiency of the BP neural network algorithm, we proposed a parallel design and realization method for a particle swarm optimization (PSO)-optimized BP neural network based on MapReduce on the Hadoop platform using both the PSO algorithm and a parallel design. The PSO algorithm was used to optimize the BP neural network's initial weights and thresholds and improve the accuracy of the classification algorithm. The MapReduce parallel programming model was utilized to achieve parallel processing of the BP algorithm, thereby solving the problems of hardware and communication overhead when the BP neural network addresses big data. Datasets on 5 different scales were constructed using the scene image library from the SUN Database. The classification accuracy of the parallel PSO-BP neural network algorithm is approximately 92%, and the system efficiency is approximately 0.85, which presents obvious advantages when processing big data. The algorithm proposed in this study demonstrated both higher classification accuracy and improved time efficiency, which represents a significant improvement obtained from applying parallel processing to an intelligent algorithm on big data.
Asymmetry in the Farley-Buneman dispersion relation caused by parallel electric fields
NASA Astrophysics Data System (ADS)
Forsythe, Victoriya V.; Makarevich, Roman A.
2016-11-01
An implicit assumption utilized in studies of E region plasma waves generated by the Farley-Buneman instability (FBI) is that the FBI dispersion relation and its solutions for the growth rate and phase velocity are perfectly symmetric with respect to the reversal of the wave propagation component parallel to the magnetic field. In the present study, a recently derived general dispersion relation that describes fundamental plasma instabilities in the lower ionosphere including FBI is considered and it is demonstrated that the dispersion relation is symmetric only for background electric fields that are perfectly perpendicular to the magnetic field. It is shown that parallel electric fields result in significant differences between the growth rates and phase velocities for propagation of parallel components of opposite signs. These differences are evaluated using numerical solutions of the general dispersion relation and shown to exhibit an approximately linear relationship with the parallel electric field near the E region peak altitude of 110 km. An analytic expression for the differences is also derived from an approximate version of the dispersion relation, with comparisons between numerical and analytic results agreeing near 110 km. It is further demonstrated that parallel electric fields do not change the overall symmetry when the full 3-D wave propagation vector is reversed, with no symmetry seen when either the perpendicular or parallel component is reversed. The present results indicate that moderate-to-strong parallel electric fields of 0.1-1.0 mV/m can result in experimentally measurable differences between the characteristics of plasma waves with parallel propagation components of opposite polarity.
Incomplete Sparse Approximate Inverses for Parallel Preconditioning
Anzt, Hartwig; Huckle, Thomas K.; Bräckle, Jürgen; ...
2017-10-28
In this study, we propose a new preconditioning method that can be seen as a generalization of block-Jacobi methods, or as a simplification of the sparse approximate inverse (SAI) preconditioners. The “Incomplete Sparse Approximate Inverses” (ISAI) is in particular efficient in the solution of sparse triangular linear systems of equations. Those arise, for example, in the context of incomplete factorization preconditioning. ISAI preconditioners can be generated via an algorithm providing fine-grained parallelism, which makes them attractive for hardware with a high concurrency level. Finally, in a study covering a large number of matrices, we identify the ISAI preconditioner as anmore » attractive alternative to exact triangular solves in the context of incomplete factorization preconditioning.« less
Kryzhevoi, Nikolai V; Mateo, David; Pi, Martí; Barranco, Manuel; Cederbaum, Lorenz S
2013-11-07
Interatomic Coulombic decay (ICD) represents an efficient electronic relaxation mechanism of an ionized or an excited system embedded in an environment. The type of this environment and its size have a great impact on the ICD performance. It is stressed that ICD is sensitive to the arrangement of neighboring atoms when the initially created vacancy has a polarization direction. This is demonstrated in the present paper for the case of a 3p-ionized Ca surrounded by He atoms. Useful explicit expressions are derived for the ICD widths which show that the neighbors located along the polarization direction of the ionized orbital have the largest contribution to the ICD rate. By comparison with ab initio results for small clusters, we also show that in a helium environment, the pairwise approximation represents a reliable approach for computing ICD widths. Using this approximation and the density distribution of the helium atoms obtained within density functional theory, we explore ICD in large isotopically mixed helium droplets doped with Ca. A special emphasis is given to the difference between the ICD widths for the Ca3p orbitals directed perpendicular and parallel to the droplet surface. Depending on the size and isotopic composition of the droplet, Ca resides in the interfacial layer between the (4)He core and the (3)He outer shell. Hence, ICD studies in these droplets may provide valuable information on the properties of this interface.
A Hybrid MPI/OpenMP Approach for Parallel Groundwater Model Calibration on Multicore Computers
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tang, Guoping; D'Azevedo, Ed F; Zhang, Fan
2010-01-01
Groundwater model calibration is becoming increasingly computationally time intensive. We describe a hybrid MPI/OpenMP approach to exploit two levels of parallelism in software and hardware to reduce calibration time on multicore computers with minimal parallelization effort. At first, HydroGeoChem 5.0 (HGC5) is parallelized using OpenMP for a uranium transport model with over a hundred species involving nearly a hundred reactions, and a field scale coupled flow and transport model. In the first application, a single parallelizable loop is identified to consume over 97% of the total computational time. With a few lines of OpenMP compiler directives inserted into the code,more » the computational time reduces about ten times on a compute node with 16 cores. The performance is further improved by selectively parallelizing a few more loops. For the field scale application, parallelizable loops in 15 of the 174 subroutines in HGC5 are identified to take more than 99% of the execution time. By adding the preconditioned conjugate gradient solver and BICGSTAB, and using a coloring scheme to separate the elements, nodes, and boundary sides, the subroutines for finite element assembly, soil property update, and boundary condition application are parallelized, resulting in a speedup of about 10 on a 16-core compute node. The Levenberg-Marquardt (LM) algorithm is added into HGC5 with the Jacobian calculation and lambda search parallelized using MPI. With this hybrid approach, compute nodes at the number of adjustable parameters (when the forward difference is used for Jacobian approximation), or twice that number (if the center difference is used), are used to reduce the calibration time from days and weeks to a few hours for the two applications. This approach can be extended to global optimization scheme and Monte Carol analysis where thousands of compute nodes can be efficiently utilized.« less
Petrenko, Taras; Kossmann, Simone; Neese, Frank
2011-02-07
In this paper, we present the implementation of efficient approximations to time-dependent density functional theory (TDDFT) within the Tamm-Dancoff approximation (TDA) for hybrid density functionals. For the calculation of the TDDFT/TDA excitation energies and analytical gradients, we combine the resolution of identity (RI-J) algorithm for the computation of the Coulomb terms and the recently introduced "chain of spheres exchange" (COSX) algorithm for the calculation of the exchange terms. It is shown that for extended basis sets, the RIJCOSX approximation leads to speedups of up to 2 orders of magnitude compared to traditional methods, as demonstrated for hydrocarbon chains. The accuracy of the adiabatic transition energies, excited state structures, and vibrational frequencies is assessed on a set of 27 excited states for 25 molecules with the configuration interaction singles and hybrid TDDFT/TDA methods using various basis sets. Compared to the canonical values, the typical error in transition energies is of the order of 0.01 eV. Similar to the ground-state results, excited state equilibrium geometries differ by less than 0.3 pm in the bond distances and 0.5° in the bond angles from the canonical values. The typical error in the calculated excited state normal coordinate displacements is of the order of 0.01, and relative error in the calculated excited state vibrational frequencies is less than 1%. The errors introduced by the RIJCOSX approximation are, thus, insignificant compared to the errors related to the approximate nature of the TDDFT methods and basis set truncation. For TDDFT/TDA energy and gradient calculations on Ag-TB2-helicate (156 atoms, 2732 basis functions), it is demonstrated that the COSX algorithm parallelizes almost perfectly (speedup ~26-29 for 30 processors). The exchange-correlation terms also parallelize well (speedup ~27-29 for 30 processors). The solution of the Z-vector equations shows a speedup of ~24 on 30 processors. The parallelization efficiency for the Coulomb terms can be somewhat smaller (speedup ~15-25 for 30 processors), but their contribution to the total calculation time is small. Thus, the parallel program completes a Becke3-Lee-Yang-Parr energy and gradient calculation on the Ag-TB2-helicate in less than 4 h on 30 processors. We also present the necessary extension of the Lagrangian formalism, which enables the calculation of the TDDFT excited state properties in the frozen-core approximation. The algorithms described in this work are implemented into the ORCA electronic structure system.
Novel molecular targets for kRAS downregulation: promoter G-quadruplexes
2016-11-01
conditions, and described the structure as having mixed parallel/anti-parallel loops of lengths 2:8:10 in the 5’-3’ direction. Using selective small...and anti-parallel loop directionality of lengths 4:10:8 in the 5’–3’ direction, three tetrads stacked, and involving guanines in runs B, C, E, and F...a tri-stacked structure incorporating runs B, C, E and F with intervening loops of 2, 10, and 8 bases in the 5’–3’ direction. G = black circles, C
NASA Astrophysics Data System (ADS)
Shi, Sheng-bing; Chen, Zhen-xing; Qin, Shao-gang; Song, Chun-yan; Jiang, Yun-hong
2014-09-01
With the development of science and technology, photoelectric equipment comprises visible system, infrared system, laser system and so on, integration, information and complication are higher than past. Parallelism and jumpiness of optical axis are important performance of photoelectric equipment,directly affect aim, ranging, orientation and so on. Jumpiness of optical axis directly affect hit precision of accurate point damage weapon, but we lack the facility which is used for testing this performance. In this paper, test system which is used fo testing parallelism and jumpiness of optical axis is devised, accurate aim isn't necessary and data processing are digital in the course of testing parallelism, it can finish directly testing parallelism of multi-axes, aim axis and laser emission axis, parallelism of laser emission axis and laser receiving axis and first acuualizes jumpiness of optical axis of optical sighting device, it's a universal test system.
The polarization patterns of skylight reflected off wave water surface.
Zhou, Guanhua; Xu, Wujian; Niu, Chunyue; Zhao, Huijie
2013-12-30
In this paper we propose a model to understand the polarization patterns of skylight when reflected off the surface of waves. The semi-empirical Rayleigh model is used to analyze the polarization of scattered skylight; the Harrison and Coombes model is used to analyze light radiance distribution; and the Cox-Munk model and Mueller matrix are used to analyze reflections from wave surface. First, we calculate the polarization patterns and intensity distribution of light reflected off wave surface. Then we investigate their relationship with incident radiation, solar zenith angle, wind speed and wind direction. Our results show that the polarization patterns of reflected skylight from waves and flat water are different, while skylight reflected on both kinds of water is generally highly polarized at the Brewster angle and the polarization direction is approximately parallel to the water's surface. The backward-reflecting Brewster zone has a relatively low reflectance and a high DOP in all observing directions. This can be used to optimally diminish the reflected skylight and avoid sunglint in ocean optics measurements.
Tensor methodology and computational geometry in direct computational experiments in fluid mechanics
NASA Astrophysics Data System (ADS)
Degtyarev, Alexander; Khramushin, Vasily; Shichkina, Julia
2017-07-01
The paper considers a generalized functional and algorithmic construction of direct computational experiments in fluid dynamics. Notation of tensor mathematics is naturally embedded in the finite - element operation in the construction of numerical schemes. Large fluid particle, which have a finite size, its own weight, internal displacement and deformation is considered as an elementary computing object. Tensor representation of computational objects becomes strait linear and uniquely approximation of elementary volumes and fluid particles inside them. The proposed approach allows the use of explicit numerical scheme, which is an important condition for increasing the efficiency of the algorithms developed by numerical procedures with natural parallelism. It is shown that advantages of the proposed approach are achieved among them by considering representation of large particles of a continuous medium motion in dual coordinate systems and computing operations in the projections of these two coordinate systems with direct and inverse transformations. So new method for mathematical representation and synthesis of computational experiment based on large particle method is proposed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kuritsyn, A.; Fiksel, G.; Almagri, A. F.
2009-05-15
In this paper measurements of momentum and current transport caused by current driven tearing instability are reported. The measurements are done in the Madison Symmetric Torus reversed-field pinch [R. N. Dexter, D. W. Kerst, T. W. Lovell, S. C. Prager, and J. C. Sprott, Fusion Technol. 19, 131 (1991)] in a regime with repetitive bursts of tearing instability causing magnetic field reconnection. It is established that the plasma parallel momentum profile flattens during these reconnection events: The flow decreases in the core and increases at the edge. The momentum relaxation phenomenon is similar in nature to the well established relaxationmore » of the parallel electrical current and could be a general feature of self-organized systems. The measured fluctuation-induced Maxwell and Reynolds stresses, which govern the dynamics of plasma flow, are large and almost balance each other such that their difference is approximately equal to the rate of change of plasma momentum. The Hall dynamo, which is directly related to the Maxwell stress, drives the parallel current profile relaxation at resonant surfaces at the reconnection events. These results qualitatively agree with analytical calculations and numerical simulations. It is plausible that current-driven instabilities can be responsible for momentum transport in other laboratory and astrophysical plasmas.« less
Large-Scale Parallel Viscous Flow Computations using an Unstructured Multigrid Algorithm
NASA Technical Reports Server (NTRS)
Mavriplis, Dimitri J.
1999-01-01
The development and testing of a parallel unstructured agglomeration multigrid algorithm for steady-state aerodynamic flows is discussed. The agglomeration multigrid strategy uses a graph algorithm to construct the coarse multigrid levels from the given fine grid, similar to an algebraic multigrid approach, but operates directly on the non-linear system using the FAS (Full Approximation Scheme) approach. The scalability and convergence rate of the multigrid algorithm are examined on the SGI Origin 2000 and the Cray T3E. An argument is given which indicates that the asymptotic scalability of the multigrid algorithm should be similar to that of its underlying single grid smoothing scheme. For medium size problems involving several million grid points, near perfect scalability is obtained for the single grid algorithm, while only a slight drop-off in parallel efficiency is observed for the multigrid V- and W-cycles, using up to 128 processors on the SGI Origin 2000, and up to 512 processors on the Cray T3E. For a large problem using 25 million grid points, good scalability is observed for the multigrid algorithm using up to 1450 processors on a Cray T3E, even when the coarsest grid level contains fewer points than the total number of processors.
Nonadiabatic electron response in the Hasegawa-Wakatani equations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stoltzfus-Dueck, T.; Scott, B. D.; Krommes, J. A.
2013-08-15
Tokamak edge turbulence is strongly influenced by parallel electron physics, which relaxes density and potential fluctuations towards electron adiabatic response. Beginning with the paradigmatic Hasegawa-Wakatani equations (HWEs) for resistive tokamak edge turbulence, a unique decomposition of the electric potential (φ) into adiabatic (a) and nonadiabatic (b) portions is derived, based on the requirement that a neither drive nor respond to the parallel current j{sub ∥}. The form of the decomposition clarifies that, at perpendicular scales large relative to the sound radius, the electron adiabatic response controls the nonzonal φ, not the fluctuating density n. Simple energy balance arguments allow onemore » to rigorously bound the ratio of rms nonzonal nonadiabatic fluctuations (b(tilde sign)) relative to adiabatic ones (ã). The role of the vorticity nonlinearity in transferring energy between adiabatic and nonadiabatic fluctuations aids intuitive understanding of self-sustained turbulence in the HWEs. When the normalized parallel resistivity is weak, b(tilde sign) becomes effectively slaved, allowing the reduction to an approximate one-field model that remains valid for strong turbulence. In addition to guiding physical intuition, the one-field reduction should greatly ease further analytical manipulations. Direct numerical simulation of the 2D HWEs confirms the convergence of the asymptotic formula for b(tilde sign)« less
Proposed scheme for parallel 10Gb/s VSR system and its verilog HDL realization
NASA Astrophysics Data System (ADS)
Zhou, Yi; Chen, Hongda; Zuo, Chao; Jia, Jiuchun; Shen, Rongxuan; Chen, Xiongbin
2005-02-01
This paper proposes a novel and innovative scheme for 10Gb/s parallel Very Short Reach (VSR) optical communication system. The optimized scheme properly manages the SDH/SONET redundant bytes and adjusts the position of error detecting bytes and error correction bytes. Compared with the OIF-VSR4-01.0 proposal, the scheme has a coding process module. The SDH/SONET frames in transmission direction are disposed as follows: (1) The Framer-Serdes Interface (FSI) gets 16×622.08Mb/s STM-64 frame. (2) The STM-64 frame is byte-wise stripped across 12 channels, all channels are data channels. During this process, the parity bytes and CRC bytes are generated in the similar way as OIF-VSR4-01.0 and stored in the code process module. (3) The code process module will regularly convey the additional parity bytes and CRC bytes to all 12 data channels. (4) After the 8B/10B coding, the 12 channels is transmitted to the parallel VCSEL array. The receive process approximately in reverse order of transmission process. By applying this scheme to 10Gb/s VSR system, the frame size in VSR system is reduced from 15552×12 bytes to 14040×12 bytes, the system redundancy is reduced obviously.
NASA Astrophysics Data System (ADS)
Margheriti, L.; Ferulano, M. F.; Di Bona, M.
2006-11-01
Shear wave splitting is measured at 14 seismic stations in the Reggio Emilia region above local background seismicity and two sequences of seismic events. The good quality of the waveforms together with the favourable distribution of earthquake foci allows us to place strong constraints on the geometry and the depth of the anisotropic volume. It is about 60 km2 wide and located between 6 and 11 km depth, inside Mesozoic age carbonate rocks. The splitting results suggest also the presence of a shallower anisotropic layer about 1 km thick and few km wide in the Pliocene-Quaternary alluvium above the Mesozoic layer. The fast polarization directions (N30°E) are approximately parallel to the maximum horizontal stress (σ1 is SSW-NNE) in the region and also parallel to the strike of the main structural features in the Reggio Emilia area. The size of the delay times suggests about 4.5 per cent shear wave velocity anisotropy. These parameters agree with an interpretation of seismic anisotropy in terms of the extensive-dilatancy anisotropy model which considers the rock volume to be pervaded by fluid-saturated microcracks aligned by the active stress field. We cannot completely rule out the contribution of aligned macroscopic fractures as the cause of the shear wave anisotropy even if the parallel shear wave polarizations we found are diagnostic of transverse isotropy with a horizontal axis of symmetry. This symmetry is commonly explained by parallel stress-aligned microcracks.
Deep crustal deformation by sheath folding in the Adirondack Mountains, USA
NASA Technical Reports Server (NTRS)
Mclelland, J. M.
1988-01-01
As described by McLelland and Isachsen, the southern half of the Adirondacks are underlain by major isoclinal (F sub 1) and open-upright (F sub 2) folds whose axes are parallel, trend approximately E-W, and plunge gently about the horizontal. These large structures are themselves folded by open upright folds trending NNE (F sub 3). It is pointed out that elongation lineations in these rocks are parallel to X of the finite strain ellipsoid developed during progressive rotational strain. The parallelism between F sub 1 and F sub 2 fold axes and elongation lineations led to the hypothesis that progressive rotational strain, with a west-directed tectonic transport, rotated earlier F sub 1-folds into parallelism with the evolving elongation lineation. Rotation is accomplished by ductile, passive flow of F sub 1-axes into extremely arcuate, E-W hinges. In order to test these hypotheses a number of large folds were mapped in the eastern Adirondacks. Other evidence supporting the existence of sheath folds in the Adirondacks is the presence, on a map scale, of synforms whose limbs pass through the vertical and into antiforms. This type of outcrop pattern is best explained by intersecting a horizontal plane with the double curvature of sheath folds. It is proposed that sheath folding is a common response of hot, ductile rocks to rotational strain at deep crustal levels. The recognition of sheath folds in the Adirondacks reconciles the E-W orientation of fold axes with an E-W elongation lineation.
Feng, Yanqiu; Song, Yanli; Wang, Cong; Xin, Xuegang; Feng, Qianjin; Chen, Wufan
2013-10-01
To develop and test a new algorithm for fast direct Fourier transform (DrFT) reconstruction of MR data on non-Cartesian trajectories composed of lines with equally spaced points. The DrFT, which is normally used as a reference in evaluating the accuracy of other reconstruction methods, can reconstruct images directly from non-Cartesian MR data without interpolation. However, DrFT reconstruction involves substantially intensive computation, which makes the DrFT impractical for clinical routine applications. In this article, the Chirp transform algorithm was introduced to accelerate the DrFT reconstruction of radial and Periodically Rotated Overlapping ParallEL Lines with Enhanced Reconstruction (PROPELLER) MRI data located on the trajectories that are composed of lines with equally spaced points. The performance of the proposed Chirp transform algorithm-DrFT algorithm was evaluated by using simulation and in vivo MRI data. After implementing the algorithm on a graphics processing unit, the proposed Chirp transform algorithm-DrFT algorithm achieved an acceleration of approximately one order of magnitude, and the speed-up factor was further increased to approximately three orders of magnitude compared with the traditional single-thread DrFT reconstruction. Implementation the Chirp transform algorithm-DrFT algorithm on the graphics processing unit can efficiently calculate the DrFT reconstruction of the radial and PROPELLER MRI data. Copyright © 2012 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Beck, R.; Carilli, C. L.; Holdaway, M. A.; Klein, U.
1994-12-01
Radio continuum observations of the spiral galaxy NGC 253 with the Effelsberg and Very Large Array (VLA) telescopes reveal polarized emission from the bar and halo regions. Within the bar Faraday depolarization is strong at 1.5 and 5 GHz, due to ionized gas with ne approximately equal 0.1 - 3/cu cm which is mixed with turbulent magnetic fields of approximately equal 17 microG estimated strength. Even at 10 GHz the degree of polarization in the bar is low (only approximately equal 5% east and approximately equal 2% west of the nucleus) due to beam depolarization by unresolved tangled fields. In contrast, the magnetic fields in the halo are highly uniform, as indicated by fractional polarizations up to 40% at 10 GHz. Faraday depolarization in the halo at 1.5 GHz calls for a warm, clumpy gas component with ne approximately equal 0.02/cu cm and approximately equal 6 microG turbulent fields. We detected Faraday rotation in the bar, with rotation measures absolute value of RM approximately equal 100 rad/sq m (between 10 and 5 GHz) having different signs east and west of the nucleus. Below 5 GHz Faraday rotation is strongly reduced by the limited transparency for polarized emission in the bar. Faraday rotation in the halo in two regions at approximately 5 kpc above and below the plane with RM approximately equal -7 rad/sq m between 10 and 1.5 GHz can be ascribed to hot gas with mean value of ne approximately equal 0.002/cu cm and uniform fields along the line of sight of mean value of Bu parallel approximately equal -2 microG. The magnetic field structure in the bar and halo of NGC 253 is best described by the quadrupole-type dynamo mode SO, with a ring-like field in the bar and a field mainly parallel to the plane in a co-rotating halo. A major perturbation occurs in the east where the field is perpendicular to the plane and follows a 'spur'. The galactic wind is suppressed by the dominating plane-parallel field, except along the spur.
Creating IRT-Based Parallel Test Forms Using the Genetic Algorithm Method
ERIC Educational Resources Information Center
Sun, Koun-Tem; Chen, Yu-Jen; Tsai, Shu-Yen; Cheng, Chien-Fen
2008-01-01
In educational measurement, the construction of parallel test forms is often a combinatorial optimization problem that involves the time-consuming selection of items to construct tests having approximately the same test information functions (TIFs) and constraints. This article proposes a novel method, genetic algorithm (GA), to construct parallel…
Issues of planning trajectory of parallel robots taking into account zones of singularity
NASA Astrophysics Data System (ADS)
Rybak, L. A.; Khalapyan, S. Y.; Gaponenko, E. V.
2018-03-01
A method for determining the design characteristics of a parallel robot necessary to provide specified parameters of its working space that satisfy the controllability requirement is developed. The experimental verification of the proposed method was carried out using an approximate planar 3-RPR mechanism.
NASA Astrophysics Data System (ADS)
Qin, Cheng-Zhi; Zhan, Lijun
2012-06-01
As one of the important tasks in digital terrain analysis, the calculation of flow accumulations from gridded digital elevation models (DEMs) usually involves two steps in a real application: (1) using an iterative DEM preprocessing algorithm to remove the depressions and flat areas commonly contained in real DEMs, and (2) using a recursive flow-direction algorithm to calculate the flow accumulation for every cell in the DEM. Because both algorithms are computationally intensive, quick calculation of the flow accumulations from a DEM (especially for a large area) presents a practical challenge to personal computer (PC) users. In recent years, rapid increases in hardware capacity of the graphics processing units (GPUs) provided in modern PCs have made it possible to meet this challenge in a PC environment. Parallel computing on GPUs using a compute-unified-device-architecture (CUDA) programming model has been explored to speed up the execution of the single-flow-direction algorithm (SFD). However, the parallel implementation on a GPU of the multiple-flow-direction (MFD) algorithm, which generally performs better than the SFD algorithm, has not been reported. Moreover, GPU-based parallelization of the DEM preprocessing step in the flow-accumulation calculations has not been addressed. This paper proposes a parallel approach to calculate flow accumulations (including both iterative DEM preprocessing and a recursive MFD algorithm) on a CUDA-compatible GPU. For the parallelization of an MFD algorithm (MFD-md), two different parallelization strategies using a GPU are explored. The first parallelization strategy, which has been used in the existing parallel SFD algorithm on GPU, has the problem of computing redundancy. Therefore, we designed a parallelization strategy based on graph theory. The application results show that the proposed parallel approach to calculate flow accumulations on a GPU performs much faster than either sequential algorithms or other parallel GPU-based algorithms based on existing parallelization strategies.
Ground Motion Analysis of Co-Located DAS and Seismometer Sensors
NASA Astrophysics Data System (ADS)
Wang, H. F.; Fratta, D.; Lord, N. E.; Lancelle, C.; Thurber, C. H.; Zeng, X.; Parker, L.; Chalari, A.; Miller, D.; Feigl, K. L.; Team, P.
2016-12-01
The PoroTomo research team deployed 8700-meters of Distributed Acoustic Sensing (DAS) cable in a shallow trench and 400-meters in a borehole at Brady Hot Springs, Nevada in March 2016 together with an array of 246, three-component geophones. The seismic sensors occupied a natural laboratory 1500 x 500 x 400 meters overlying the Brady geothermal field. The DAS cable was laid out in three parallel zig-zag lines with line segments approximately 100-meters in length and geophones were spaced at approximately 50-m intervals. In several line segments, geophones were co-located within one meter of the DAS cable. Both DAS and the conventional geophones recorded continuously over 15 days. A large Vibroseis truck (T-Rex) provided the seismic source at approximately 250 locations outside and within the array. The Vibroseis protocol called for excitation in one vertical and two orthogonal horizontal directions at each location. For each mode, three, 5-to-80-Hz upsweeps were made over 20 seconds. In addition, a moderate-sized earthquake with a local magnitude of 4.3 was recorded on March 21, 2016. Its epicenter was approximately 150-km away. Several DAS line segments with co-located geophone stations were used to test relationships between the strain rate recorded by DAS and ground velocity recorded by the geophones.
Establishing use of crutches by a mentally retarded spina bifida child1
Horner, R. Don
1971-01-01
A 5-yr-old mentally retarded spina bifida child was taught to walk with the aid of crutches. This behavior was developed through fading of physical prompting within a 10-step successive approximation sequence. Preliminary training to establish gait consisted of developing use of parallel bars through fading of physically modelled responses within a six-step successive approximation sequence. Use of parallel bars ceased during an extinction period and completely recovered upon being primed with one “free” reinforcement. Systematic use of natural reinforcers was employed as an aid in maintaining use of crutches. PMID:16795294
Parallel tempering for the traveling salesman problem
DOE Office of Scientific and Technical Information (OSTI.GOV)
Percus, Allon; Wang, Richard; Hyman, Jeffrey
We explore the potential of parallel tempering as a combinatorial optimization method, applying it to the traveling salesman problem. We compare simulation results of parallel tempering with a benchmark implementation of simulated annealing, and study how different choices of parameters affect the relative performance of the two methods. We find that a straightforward implementation of parallel tempering can outperform simulated annealing in several crucial respects. When parameters are chosen appropriately, both methods yield close approximation to the actual minimum distance for an instance with 200 nodes. However, parallel tempering yields more consistently accurate results when a series of independent simulationsmore » are performed. Our results suggest that parallel tempering might offer a simple but powerful alternative to simulated annealing for combinatorial optimization problems.« less
Highly Parallel Alternating Directions Algorithm for Time Dependent Problems
NASA Astrophysics Data System (ADS)
Ganzha, M.; Georgiev, K.; Lirkov, I.; Margenov, S.; Paprzycki, M.
2011-11-01
In our work, we consider the time dependent Stokes equation on a finite time interval and on a uniform rectangular mesh, written in terms of velocity and pressure. For this problem, a parallel algorithm based on a novel direction splitting approach is developed. Here, the pressure equation is derived from a perturbed form of the continuity equation, in which the incompressibility constraint is penalized in a negative norm induced by the direction splitting. The scheme used in the algorithm is composed of two parts: (i) velocity prediction, and (ii) pressure correction. This is a Crank-Nicolson-type two-stage time integration scheme for two and three dimensional parabolic problems in which the second-order derivative, with respect to each space variable, is treated implicitly while the other variable is made explicit at each time sub-step. In order to achieve a good parallel performance the solution of the Poison problem for the pressure correction is replaced by solving a sequence of one-dimensional second order elliptic boundary value problems in each spatial direction. The parallel code is implemented using the standard MPI functions and tested on two modern parallel computer systems. The performed numerical tests demonstrate good level of parallel efficiency and scalability of the studied direction-splitting-based algorithm.
The Potsdam Parallel Ice Sheet Model (PISM-PIK) - Part 1: Model description
NASA Astrophysics Data System (ADS)
Winkelmann, R.; Martin, M. A.; Haseloff, M.; Albrecht, T.; Bueler, E.; Khroulev, C.; Levermann, A.
2011-09-01
We present the Potsdam Parallel Ice Sheet Model (PISM-PIK), developed at the Potsdam Institute for Climate Impact Research to be used for simulations of large-scale ice sheet-shelf systems. It is derived from the Parallel Ice Sheet Model (Bueler and Brown, 2009). Velocities are calculated by superposition of two shallow stress balance approximations within the entire ice covered region: the shallow ice approximation (SIA) is dominant in grounded regions and accounts for shear deformation parallel to the geoid. The plug-flow type shallow shelf approximation (SSA) dominates the velocity field in ice shelf regions and serves as a basal sliding velocity in grounded regions. Ice streams can be identified diagnostically as regions with a significant contribution of membrane stresses to the local momentum balance. All lateral boundaries in PISM-PIK are free to evolve, including the grounding line and ice fronts. Ice shelf margins in particular are modeled using Neumann boundary conditions for the SSA equations, reflecting a hydrostatic stress imbalance along the vertical calving face. The ice front position is modeled using a subgrid-scale representation of calving front motion (Albrecht et al., 2011) and a physically-motivated calving law based on horizontal spreading rates. The model is tested in experiments from the Marine Ice Sheet Model Intercomparison Project (MISMIP). A dynamic equilibrium simulation of Antarctica under present-day conditions is presented in Martin et al. (2011).
2D Seismic Imaging of Elastic Parameters by Frequency Domain Full Waveform Inversion
NASA Astrophysics Data System (ADS)
Brossier, R.; Virieux, J.; Operto, S.
2008-12-01
Thanks to recent advances in parallel computing, full waveform inversion is today a tractable seismic imaging method to reconstruct physical parameters of the earth interior at different scales ranging from the near- surface to the deep crust. We present a massively parallel 2D frequency-domain full-waveform algorithm for imaging visco-elastic media from multi-component seismic data. The forward problem (i.e. the resolution of the frequency-domain 2D PSV elastodynamics equations) is based on low-order Discontinuous Galerkin (DG) method (P0 and/or P1 interpolations). Thanks to triangular unstructured meshes, the DG method allows accurate modeling of both body waves and surface waves in case of complex topography for a discretization of 10 to 15 cells per shear wavelength. The frequency-domain DG system is solved efficiently for multiple sources with the parallel direct solver MUMPS. The local inversion procedure (i.e. minimization of residuals between observed and computed data) is based on the adjoint-state method which allows to efficiently compute the gradient of the objective function. Applying the inversion hierarchically from the low frequencies to the higher ones defines a multiresolution imaging strategy which helps convergence towards the global minimum. In place of expensive Newton algorithm, the combined use of the diagonal terms of the approximate Hessian matrix and optimization algorithms based on quasi-Newton methods (Conjugate Gradient, LBFGS, ...) allows to improve the convergence of the iterative inversion. The distribution of forward problem solutions over processors driven by a mesh partitioning performed by METIS allows to apply most of the inversion in parallel. We shall present the main features of the parallel modeling/inversion algorithm, assess its scalability and illustrate its performances with realistic synthetic case studies.
Cao, Jianfang; Cui, Hongyan; Shi, Hao; Jiao, Lijuan
2016-01-01
A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decreases as well. To improve the classification accuracy and runtime efficiency of the BP neural network algorithm, we proposed a parallel design and realization method for a particle swarm optimization (PSO)-optimized BP neural network based on MapReduce on the Hadoop platform using both the PSO algorithm and a parallel design. The PSO algorithm was used to optimize the BP neural network’s initial weights and thresholds and improve the accuracy of the classification algorithm. The MapReduce parallel programming model was utilized to achieve parallel processing of the BP algorithm, thereby solving the problems of hardware and communication overhead when the BP neural network addresses big data. Datasets on 5 different scales were constructed using the scene image library from the SUN Database. The classification accuracy of the parallel PSO-BP neural network algorithm is approximately 92%, and the system efficiency is approximately 0.85, which presents obvious advantages when processing big data. The algorithm proposed in this study demonstrated both higher classification accuracy and improved time efficiency, which represents a significant improvement obtained from applying parallel processing to an intelligent algorithm on big data. PMID:27304987
Wang, Min; Tian, Yun
2018-01-01
The Canny operator is widely used to detect edges in images. However, as the size of the image dataset increases, the edge detection performance of the Canny operator decreases and its runtime becomes excessive. To improve the runtime and edge detection performance of the Canny operator, in this paper, we propose a parallel design and implementation for an Otsu-optimized Canny operator using a MapReduce parallel programming model that runs on the Hadoop platform. The Otsu algorithm is used to optimize the Canny operator's dual threshold and improve the edge detection performance, while the MapReduce parallel programming model facilitates parallel processing for the Canny operator to solve the processing speed and communication cost problems that occur when the Canny edge detection algorithm is applied to big data. For the experiments, we constructed datasets of different scales from the Pascal VOC2012 image database. The proposed parallel Otsu-Canny edge detection algorithm performs better than other traditional edge detection algorithms. The parallel approach reduced the running time by approximately 67.2% on a Hadoop cluster architecture consisting of 5 nodes with a dataset of 60,000 images. Overall, our approach system speeds up the system by approximately 3.4 times when processing large-scale datasets, which demonstrates the obvious superiority of our method. The proposed algorithm in this study demonstrates both better edge detection performance and improved time performance. PMID:29861711
A comparison of energetic ions in the plasma depletion layer and the quasi-parallel magnetosheath
NASA Technical Reports Server (NTRS)
Fuselier, Stephen A.
1994-01-01
Energetic ion spectra measured by the Active Magnetospheric Particle Tracer Explorers/Charge Composition Explorer (AMPTE/CCE) downstream from the Earth's quasi-parallel bow shock (in the quasi-parallel magnetosheath) and in the plasma depletion layer are compared. In the latter region, energetic ions are from a single source, leakage of magnetospheric ions across the magnetopause and into the plasma depletion layer. In the former region, both the magnetospheric source and shock acceleration of the thermal solar wind population at the quasi-parallel shock can contribute to the energetic ion spectra. The relative strengths of these two energetic ion sources are determined through the comparison of spectra from the two regions. It is found that magnetospheric leakage can provide an upper limit of 35% of the total energetic H(+) population in the quasi-parallel magnetosheath near the magnetopause in the energy range from approximately 10 to approximately 80 keV/e and substantially less than this limit for the energetic He(2+) population. The rest of the energetic H(+) population and nearly all of the energetic He(2+) population are accelerated out of the thermal solar wind population through shock acceleration processes. By comparing the energetic and thermal He(2+) and H(+) populations in the quasi-parallel magnetosheath, it is found that the quasi-parallel bow shock is 2 to 3 times more efficient at accelerating He(2+) than H(+). This result is consistent with previous estimates from shock acceleration theory and simulati ons.
Semi-automated 96-well liquid-liquid extraction for quantitation of drugs in biological fluids.
Zhang, N; Hoffman, K L; Li, W; Rossi, D T
2000-02-01
A semi-automated liquid-liquid extraction (LLE) technique for biological fluid sample preparation was introduced for the quantitation of four drugs in rat plasma. All liquid transferring during the sample preparation was automated using a Tomtec Quadra 96 Model 320 liquid handling robot, which processed up to 96 samples in parallel. The samples were either in 96-deep-well plate or tube-rack format. One plate of samples can be prepared in approximately 1.5 h, and the 96-well plate is directly compatible with the autosampler of an LC/MS system. Selection of organic solvents and recoveries are discussed. Also, precision, relative error, linearity and quantitation of the semi automated LLE method are estimated for four example drugs using LC/MS/MS with a multiple reaction monitoring (MRM) approach. The applicability of this method and future directions are evaluated.
Fukuda, Muneyuki; Tomimatsu, Satoshi; Nakamura, Kuniyasu; Koguchi, Masanari; Shichi, Hiroyasu; Umemura, Kaoru
2004-01-01
A new method to prepare micropillar specimens with a high aspect ratio that is suitable for three-dimensional scanning transmission electron microscopy (3D-STEM) was developed. The key features of the micropillar fabrication are: first, microsampling to extract a small piece including the structure of interest in an IC chip, and second, an ion-beam with an incident direction of 60 degrees to the pillar's axis that enables the parallel sidewalls of the pillar to be produced with a high aspect ratio. A memory-cell structure (length: 6 microm; width: 300 x 500 nm) was fabricated in the micropillar and observed from various directions with a 3D-STEM. A planiform capacitor covered with granular surfaces and a solid crossing gate and metal lines was successfully observed threedimensionally at a resolution of approximately 5 nm.
Okamoto, Naoya; Yoshimatsu, Katsunori; Schneider, Kai; Farge, Marie
2014-03-01
Small-scale anisotropic intermittency is examined in three-dimensional incompressible magnetohydrodynamic turbulence subjected to a uniformly imposed magnetic field. Orthonormal wavelet analyses are applied to direct numerical simulation data at moderate Reynolds number and for different interaction parameters. The magnetic Reynolds number is sufficiently low such that the quasistatic approximation can be applied. Scale-dependent statistical measures are introduced to quantify anisotropy in terms of the flow components, either parallel or perpendicular to the imposed magnetic field, and in terms of the different directions. Moreover, the flow intermittency is shown to increase with increasing values of the interaction parameter, which is reflected in strongly growing flatness values when the scale decreases. The scale-dependent anisotropy of energy is found to be independent of scale for all considered values of the interaction parameter. The strength of the imposed magnetic field does amplify the anisotropy of the flow.
Mineral exploration potential of ERTS-1 data
NASA Technical Reports Server (NTRS)
Brewer, W. A. (Principal Investigator); Erskine, M. C., Jr.; Prindle, R. O.
1972-01-01
The author has identified the following significant results. Preliminary analysis of a mosaic composing eight individual ERTS frames (1:1,000,000) extending well beyond the test site has revealed a number of tectonic structural trends that are controlled by regional lineations. So far most of the regional lineations fall into three general directions: east by northeast, northwest, and north-south. From preliminary examination, it appears that the older Precambrian basement predominates in the NE-bearing structural trends, whereas the predominate NW trend is most likely associated with the Texas Structural Zone, and the north-south trend being the Utah-Arizona belt and/or part of the southern Basin and Range Province. One major lineation, made up of many parallel lineations, is noticeable just north of Lake Pleasant which extends for approximately 100 miles in a northern direction out of the target area. This feature corresponds to a Precambrian schist formation shown on the USGS geologic map of Arizona.
Detection and tracking of a low energy swell system off the U.S. East Coast with the Seasat SAR
NASA Technical Reports Server (NTRS)
Beal, R. C.
1980-01-01
It is noted that on the morning of September 28, 1978, at 1520 GMT, Seasat approached the East Coast of the U.S. with the 100 km swath of its synthetic aperture radar (SAR) running approximately parallel to the coast but displayed eastward by about 20 km. This pass is analyzed and the following conclusions are drawn: (1) the SAR can successfully detect low-energy swell systems with wave heights under 1 m (actually 0.65 + or - 0.25 m); (2) the refraction of low-energy but well-organized swells deriving from changes in the local depth of the ocean is clearly detectable in both wavelength and direction; and (3) the complexity of the ocean spectrum (whether composed of more than one system or spread in direction and wave number) appears to have little bearing on the threshold detection limits.
Parallel/distributed direct method for solving linear systems
NASA Technical Reports Server (NTRS)
Lin, Avi
1990-01-01
A new family of parallel schemes for directly solving linear systems is presented and analyzed. It is shown that these schemes exhibit a near optimal performance and enjoy several important features: (1) For large enough linear systems, the design of the appropriate paralleled algorithm is insensitive to the number of processors as its performance grows monotonically with them; (2) It is especially good for large matrices, with dimensions large relative to the number of processors in the system; (3) It can be used in both distributed parallel computing environments and tightly coupled parallel computing systems; and (4) This set of algorithms can be mapped onto any parallel architecture without any major programming difficulties or algorithmical changes.
LAMMPS framework for dynamic bonding and an application modeling DNA
NASA Astrophysics Data System (ADS)
Svaneborg, Carsten
2012-08-01
We have extended the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) to support directional bonds and dynamic bonding. The framework supports stochastic formation of new bonds, breakage of existing bonds, and conversion between bond types. Bond formation can be controlled to limit the maximal functionality of a bead with respect to various bond types. Concomitant with the bond dynamics, angular and dihedral interactions are dynamically introduced between newly connected triplets and quartets of beads, where the interaction type is determined from the local pattern of bead and bond types. When breaking bonds, all angular and dihedral interactions involving broken bonds are removed. The framework allows chemical reactions to be modeled, and use it to simulate a simplistic, coarse-grained DNA model. The resulting DNA dynamics illustrates the power of the present framework. Catalogue identifier: AEME_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AEME_v1_0.html Program obtainable from: CPC Program Library, Queen's University, Belfast, N. Ireland Licensing provisions: GNU General Public Licence No. of lines in distributed program, including test data, etc.: 2 243 491 No. of bytes in distributed program, including test data, etc.: 771 Distribution format: tar.gz Programming language: C++ Computer: Single and multiple core servers Operating system: Linux/Unix/Windows Has the code been vectorized or parallelized?: Yes. The code has been parallelized by the use of MPI directives. RAM: 1 Gb Classification: 16.11, 16.12 Nature of problem: Simulating coarse-grain models capable of chemistry e.g. DNA hybridization dynamics. Solution method: Extending LAMMPS to handle dynamic bonding and directional bonds. Unusual features: Allows bonds to be created and broken while angular and dihedral interactions are kept consistent. Additional comments: The distribution file for this program is approximately 36 Mbytes and therefore is not delivered directly when download or E-mail is requested. Instead an html file giving details of how the program can be obtained is sent. Running time: Hours to days. The examples provided in the distribution take just seconds to run.
Structure of scintillations in Neptune's occultation shadow
NASA Technical Reports Server (NTRS)
Hubbard, W. B.; Lellouch, Emmanuel; Sicardy, Bruno; Brahic, Andre; Vilas, Faith
1988-01-01
An exceptionally high-quality data set from a Neptune occultation is used here to derive a number of new results about the statistical properties of the fluctuations of the intensity distribution in various parts of Neptune's occultation shadow. An approximate numerical ray-tracing model which successfully accounts for many of the qualitative aspects of the observed intensity fluctuation distribution is introduced. Strong refractive scintillation is simulated by including the effects of 'turbulence' with projected atmospheric properties allowed to vary in both the direction perpendicular and parallel to the limb, and an explicit two-dimensional picture of a typical intensity distribution throughout an occulting planet's shadow is presented. The results confirm the existence of highly anisotropic turbulence.
Nicol, Thomas H.; Niemann, Ralph C.; Gonczy, John D.
1988-01-01
A support system is disclosed for restraining large masses at very low or cryogenic temperatures. The support system employs a tie bar that is pivotally connected at opposite ends to an anchoring support member and a sliding support member. The tie bar extends substantially parallel to the longitudinal axis of the cold mass assembly, and comprises a rod that lengthens when cooled and a pair of end attachments that contract when cooled. The rod and end attachments are sized so that when the tie bar is cooled to cryogenic temperature, the net change in tie bar length is approximately zero. Longitudinal force directed against the cold mass assembly is distributed by the tie bar between the anchoring support member and the sliding support member.
A law of the wall for turbulent boundary layers with suction: Stevenson's formula revisited
NASA Astrophysics Data System (ADS)
Vigdorovich, Igor
2016-08-01
The turbulent velocity field in the viscous sublayer of the boundary layer with suction to a first approximation is homogeneous in any direction parallel to the wall and is determined by only three constant quantities — the wall shear stress, the suction velocity, and the fluid viscosity. This means that there exists a finite algebraic relation between the turbulent shear stress and the longitudinal mean-velocity gradient, using which as a closure condition for the equations of motion, we establish an exact asymptotic behavior of the velocity profile at the outer edge of the viscous sublayer. The obtained relationship provides a generalization of the logarithmic law to the case of wall suction.
Direct numerical simulation of instabilities in parallel flow with spherical roughness elements
NASA Technical Reports Server (NTRS)
Deanna, R. G.
1992-01-01
Results from a direct numerical simulation of laminar flow over a flat surface with spherical roughness elements using a spectral-element method are given. The numerical simulation approximates roughness as a cellular pattern of identical spheres protruding from a smooth wall. Periodic boundary conditions on the domain's horizontal faces simulate an infinite array of roughness elements extending in the streamwise and spanwise directions, which implies the parallel-flow assumption, and results in a closed domain. A body force, designed to yield the horizontal Blasius velocity in the absence of roughness, sustains the flow. Instabilities above a critical Reynolds number reveal negligible oscillations in the recirculation regions behind each sphere and in the free stream, high-amplitude oscillations in the layer directly above the spheres, and a mean profile with an inflection point near the sphere's crest. The inflection point yields an unstable layer above the roughness (where U''(y) is less than 0) and a stable region within the roughness (where U''(y) is greater than 0). Evidently, the instability begins when the low-momentum or wake region behind an element, being the region most affected by disturbances (purely numerical in this case), goes unstable and moves. In compressible flow with periodic boundaries, this motion sends disturbances to all regions of the domain. In the unstable layer just above the inflection point, the disturbances grow while being carried downstream with a propagation speed equal to the local mean velocity; they do not grow amid the low energy region near the roughness patch. The most amplified disturbance eventually arrives at the next roughness element downstream, perturbing its wake and inducing a global response at a frequency governed by the streamwise spacing between spheres and the mean velocity of the most amplified layer.
Carlson, D.
2010-01-01
Joints within unconsolidated material such as glacial till can be primary avenues for the flow of electrical charge, water, and contaminants. To facilitate the siting and design of remediation programs, a need exists to map anisotropic distribution of such pathways within glacial tills by determining the azimuth of the dominant joint set. The azimuthal survey method uses standard resistivity equipment with a Wenner array rotated about a fixed center point at selected degree intervals that yields an apparent resistivity ellipse. From this ellipse, joint set orientation can be determined. Azimuthal surveys were conducted at 21 sites in a 500-km2 (193 mi2) area around Milwaukee, Wisconsin, and more specifically, at sites having more than 30 m (98 ft) of glacial till (to minimize the influence of underlying bedrock joints). The 26 azimuthal surveys revealed a systematic pattern to the trend of the dominant joint set within the tills, which is approximately parallel to ice flow direction during till deposition. The average orientation of the joint set parallel with the ice flow direction is N77??E and N37??E for the Oak Creek and Ozaukee tills, respectively. The mean difference between average direct observation of joint set orientations and average azimuthal resistivity results is 8??, which is one fifth of the difference of ice flow direction between the Ozaukee and Oak Creek tills. The results of this study suggest that the surface azimuthal electrical resistivity survey method used for local in situ studies can be a useful noninvasive method for delineating joint sets within shallow geologic material for regional studies. Copyright ?? 2010 The American Association of Petroleum Geologists/Division of Environmental Geosciences. All rights reserved.
Monte Carlo charged-particle tracking and energy deposition on a Lagrangian mesh.
Yuan, J; Moses, G A; McKenty, P W
2005-10-01
A Monte Carlo algorithm for alpha particle tracking and energy deposition on a cylindrical computational mesh in a Lagrangian hydrodynamics code used for inertial confinement fusion (ICF) simulations is presented. The straight line approximation is used to follow propagation of "Monte Carlo particles" which represent collections of alpha particles generated from thermonuclear deuterium-tritium (DT) reactions. Energy deposition in the plasma is modeled by the continuous slowing down approximation. The scheme addresses various aspects arising in the coupling of Monte Carlo tracking with Lagrangian hydrodynamics; such as non-orthogonal severely distorted mesh cells, particle relocation on the moving mesh and particle relocation after rezoning. A comparison with the flux-limited multi-group diffusion transport method is presented for a polar direct drive target design for the National Ignition Facility. Simulations show the Monte Carlo transport method predicts about earlier ignition than predicted by the diffusion method, and generates higher hot spot temperature. Nearly linear speed-up is achieved for multi-processor parallel simulations.
Anderson, R.E.; Barnhard, T.P.
1993-01-01
The Virgin River depression and surrounding mountains are Neogene features that are partly contiguous with the little-strained rocks of the structural transition to the Colorado Plateau province. This contiguity makes the area ideally suited for evaluating the sense, magnitude, and kinematics of Neogene deformation. Analysis along the strain boundary shows that, compared to the adjacent little-strained area, large-magnitude vertical deformation greatly exceeds extensional deformation and that significant amounts of lateral displacement approximately parallel the province boundary. Isostatic rebound following tectonic denudation is an unlikely direct cause of the strong vertical structural relief adjacent to the strain boundary. Instead, the observed structures are first-order features defining a three-dimensional strain field produced by approximately east-west extension, vertical structural attenuation, and extension-normal shortening. All major structural elements of the strain-boundary strain field are also found in the adjacent Basin and Range. -from Authors
ERIC Educational Resources Information Center
Laszlo, Sarah; Plaut, David C.
2012-01-01
The Parallel Distributed Processing (PDP) framework has significant potential for producing models of cognitive tasks that approximate how the brain performs the same tasks. To date, however, there has been relatively little contact between PDP modeling and data from cognitive neuroscience. In an attempt to advance the relationship between…
Matthew Parks; Richard Cronn; Aaron Liston
2009-01-01
We reconstruct the infrageneric phylogeny of Pinus from 37 nearly-complete chloroplast genomes (average 109 kilobases each of an approximately 120 kilobase genome) generated using multiplexed massively parallel sequencing. We found that 30/33 ingroup nodes resolved wlth > 95-percent bootstrap support; this is a substantial improvement relative...
Electrostatically confined nanoparticle interactions and dynamics.
Eichmann, Shannon L; Anekal, Samartha G; Bevan, Michael A
2008-02-05
We report integrated evanescent wave and video microscopy measurements of three-dimensional trajectories of 50, 100, and 250 nm gold nanoparticles electrostatically confined between parallel planar glass surfaces separated by 350 and 600 nm silica colloid spacers. Equilibrium analyses of single and ensemble particle height distributions normal to the confining walls produce net electrostatic potentials in excellent agreement with theoretical predictions. Dynamic analyses indicate lateral particle diffusion coefficients approximately 30-50% smaller than expected from predictions including the effects of the equilibrium particle distribution within the gap and multibody hydrodynamic interactions with the confining walls. Consistent analyses of equilibrium and dynamic information in each measurement do not indicate any roles for particle heating or hydrodynamic slip at the particle or wall surfaces, which would both increase diffusivities. Instead, lower than expected diffusivities are speculated to arise from electroviscous effects enhanced by the relative extent (kappaa approximately 1-3) and overlap (kappah approximately 2-4) of electrostatic double layers on the particle and wall surfaces. These results demonstrate direct, quantitative measurements and a consistent interpretation of metal nanoparticle electrostatic interactions and dynamics in a confined geometry, which provides a basis for future similar measurements involving other colloidal forces and specific biomolecular interactions.
A GaAs vector processor based on parallel RISC microprocessors
NASA Astrophysics Data System (ADS)
Misko, Tim A.; Rasset, Terry L.
A vector processor architecture based on the development of a 32-bit microprocessor using gallium arsenide (GaAs) technology has been developed. The McDonnell Douglas vector processor (MVP) will be fabricated completely from GaAs digital integrated circuits. The MVP architecture includes a vector memory of 1 megabyte, a parallel bus architecture with eight processing elements connected in parallel, and a control processor. The processing elements consist of a reduced instruction set CPU (RISC) with four floating-point coprocessor units and necessary memory interface functions. This architecture has been simulated for several benchmark programs including complex fast Fourier transform (FFT), complex inner product, trigonometric functions, and sort-merge routine. The results of this study indicate that the MVP can process a 1024-point complex FFT at a speed of 112 microsec (389 megaflops) while consuming approximately 618 W of power in a volume of approximately 0.1 ft-cubed.
Idealized model of polar cap currents, fields, and auroras
NASA Technical Reports Server (NTRS)
Cornwall, J. M.
1985-01-01
During periods of northward Bz, the electric field applied to the magnetosphere is generally opposite to that occurring during southward Bz and complicated patterns of convection result, showing some features reversed in comparison with the southward Bz case. A study is conducted of a simple generalization of early work on idealized convection models, which allows for coexistence of sunward convection over the central polar cap and antisunward convection elsewhere in the cap. The present model, valid for By approximately 0, has a four-cell convection pattern and is based on the combination of ionospheric current conservation with a relation between parallel auroral currents and parallel potential drops. Global magnetospheric issues involving, e.g., reconnection are not considered. The central result of this paper is an expression giving the parallel potential drop for polar cap auroras (with By approximately 0) in terms of the polar cap convection field profile.
27 CFR 9.222 - Naches Heights.
Code of Federal Regulations, 2012 CFR
2012-04-01
... the intersection of the Burlington Northern single-track rail line and the Congdon (Schuler) Canal... a straight line approximately 0.15 mile to the Congdon (Schuler) Canal, which closely parallels the... Congdon (Schuler) Canal, onto the Selah map, approximately 3.25 miles, returning to the beginning point...
27 CFR 9.222 - Naches Heights.
Code of Federal Regulations, 2014 CFR
2014-04-01
... the intersection of the Burlington Northern single-track rail line and the Congdon (Schuler) Canal... a straight line approximately 0.15 mile to the Congdon (Schuler) Canal, which closely parallels the... Congdon (Schuler) Canal, onto the Selah map, approximately 3.25 miles, returning to the beginning point...
27 CFR 9.222 - Naches Heights.
Code of Federal Regulations, 2013 CFR
2013-04-01
... the intersection of the Burlington Northern single-track rail line and the Congdon (Schuler) Canal... a straight line approximately 0.15 mile to the Congdon (Schuler) Canal, which closely parallels the... Congdon (Schuler) Canal, onto the Selah map, approximately 3.25 miles, returning to the beginning point...
Vortex lattice structures in YNi{sub 2}B{sub 2}C
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yethiraj, M.; Paul, D.M.; Tomy, C.V.
The authors observe a flux lattice with square symmetry in the superconductor YNi{sub 2}B{sub 2}C when the applied field is parallel to the c-axis of the crystal. A square lattice observed previously in the isostructural magnetic analog ErNi{sub 2}B{sub 2}C was attributed to the interaction between magnetic order in that system and the flux lattice. Since the Y-based compound does not order magnetically, it is clear that the structure of the flux lattice is unrelated to magnetic order. In fact, they show that the flux lines have a square cross-section when the applied field is parallel to the c-axis ofmore » the crystal, since the measured penetration depth along the 100 crystal direction is larger than the penetration depth along the 110 by approximately 60%. This is the likely reason for the square symmetry of the lattice. Although they find considerable disorder in the arrangement of the flux lines at 2.5T, no melting of the vortex lattice was observed.« less
Vortex lattice structures in YNi{sub 2}B{sub 2}C
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yethiraj, M.; Paul, D.M.; Tomy, C.V.
We observe a flux lattice with square symmetry in the superconductor YNi{sub 2}B{sub 2}C when the applied field is parallel to the c-axis of the crystal. A square lattice observed previously in the isostructural magnetic analog ErNi{sub 2}B{sub 2}C was attributed to the interaction between magnetic order in that system and the flux lattice. Since the Y-based compound does not order magnetically, it is clear that the structure of the flux lattice is unrelated to magnetic order. In fact, we show that the flux lines have a square cross-section when the applied field is parallel to the c-axis of themore » crystal, since the measured penetration depth along the 110 crystal direction is smaller than the penetration depth along the 100 by approximately 30%. This causes the square symmetry of the lattice. Although we find considerable disorder in the arrangement of the flux lines at 2.5T, no melting of the vortex lattice was observed.« less
Valsalva's maneuver revisited: a quantitative method yielding insights into human autonomic control
NASA Technical Reports Server (NTRS)
Smith, M. L.; Beightol, L. A.; Fritsch-Yelle, J. M.; Ellenbogen, K. A.; Porter, T. R.; Eckberg, D. L.
1996-01-01
Seventeen healthy supine subjects performed graded Valsalva maneuvers. In four subjects, transesophageal echographic aortic cross-sectional areas decreased during and increased after straining. During the first seconds of straining, when aortic cross-sectional area was declining and peripheral arterial pressure was rising, peroneal sympathetic muscle neurons were nearly silent. Then, as aortic cross-sectional area and peripheral pressure both declined, sympathetic muscle nerve activity increased, in proportion to the intensity of straining. Poststraining arterial pressure elevations were proportional to preceding increases of sympathetic activity. Sympathetic inhibition after straining persisted much longer than arterial and right atrial pressure elevations. Similarly, R-R intervals changed in parallel with peripheral arterial pressure, until approximately 45 s after the onset of straining, when R-R intervals were greater and arterial pressures were smaller than prestraining levels. Our conclusions are as follows: opposing changes of carotid and aortic baroreceptor inputs reduce sympathetic muscle and increase vagal cardiac motor neuronal firing; parallel changes of barorsensory inputs provoke reciprocal changes of sympathetic and direct changes of vagal firing; and pressure transients lasting only seconds reset arterial pressure-sympathetic and -vagal response relations.
Hierarchical fractional-step approximations and parallel kinetic Monte Carlo algorithms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Arampatzis, Giorgos, E-mail: garab@math.uoc.gr; Katsoulakis, Markos A., E-mail: markos@math.umass.edu; Plechac, Petr, E-mail: plechac@math.udel.edu
2012-10-01
We present a mathematical framework for constructing and analyzing parallel algorithms for lattice kinetic Monte Carlo (KMC) simulations. The resulting algorithms have the capacity to simulate a wide range of spatio-temporal scales in spatially distributed, non-equilibrium physiochemical processes with complex chemistry and transport micro-mechanisms. Rather than focusing on constructing exactly the stochastic trajectories, our approach relies on approximating the evolution of observables, such as density, coverage, correlations and so on. More specifically, we develop a spatial domain decomposition of the Markov operator (generator) that describes the evolution of all observables according to the kinetic Monte Carlo algorithm. This domain decompositionmore » corresponds to a decomposition of the Markov generator into a hierarchy of operators and can be tailored to specific hierarchical parallel architectures such as multi-core processors or clusters of Graphical Processing Units (GPUs). Based on this operator decomposition, we formulate parallel Fractional step kinetic Monte Carlo algorithms by employing the Trotter Theorem and its randomized variants; these schemes, (a) are partially asynchronous on each fractional step time-window, and (b) are characterized by their communication schedule between processors. The proposed mathematical framework allows us to rigorously justify the numerical and statistical consistency of the proposed algorithms, showing the convergence of our approximating schemes to the original serial KMC. The approach also provides a systematic evaluation of different processor communicating schedules. We carry out a detailed benchmarking of the parallel KMC schemes using available exact solutions, for example, in Ising-type systems and we demonstrate the capabilities of the method to simulate complex spatially distributed reactions at very large scales on GPUs. Finally, we discuss work load balancing between processors and propose a re-balancing scheme based on probabilistic mass transport methods.« less
On the progressive enrichment of the oxygen isotopic composition of water along a leaf.
Farquhar, G. D.; Gan, K. S.
2003-06-01
A model has been derived for the enrichment of heavy isotopes of water in leaves, including progressive enrichment along the leaf. In the model, lighter water is preferentially transpired leaving heavier water to diffuse back into the xylem and be carried further along the leaf. For this pattern to be pronounced, the ratio of advection to diffusion (Péclet number) has to be large in the longitudinal direction, and small in the radial direction. The progressive enrichment along the xylem is less than that occurring at the sites of evaporation in the mesophyll, depending on the isolation afforded by the radial Péclet number. There is an upper bound on enrichment, and effects of ground tissue associated with major veins are included. When transpiration rate is spatially nonuniform, averaging of enrichment occurs more naturally with transpiration weighting than with area-based weighting. This gives zero average enrichment of transpired water, the modified Craig-Gordon equation for average enrichment at the sites of evaporation and the Farquhar and Lloyd (In Stable Isotopes and Plant Carbon-Water Relations, pp. 47-70. Academic Press, New York, USA, 1993) prediction for mesophyll water. Earlier results on the isotopic composition of evolved oxygen and of retro-diffused carbon dioxide are preserved if these processes vary in parallel with transpiration rate. Parallel variation should be indicated approximately by uniform carbon isotope discrimination across the leaf.
Florine-Casteel, K
1990-01-01
Low-light digitized video fluorescence microscopy has been utilized to measure the steady-state polarized fluorescence from the membrane probe diphenylhexatriene (DPH) and its cationic and phosphatidylcholine derivatives 1-(4-trimethylammoniumphenyl)-6-phenyl-1,3,5-hexatriene (TMA-DPH) and 2-[3-(diphenylhexatrienyl)propanoyl]-3-palmitoyl-L-alpha-phosphati dylcholine (DPH-PC), respectively, in cell-size (10-70 microns) unilamellar vesicles composed of gel-or fluid-phase phospholipid. Using an inverted microscope with epi-illumination optics and an intensified silicon intensified target camera interfaced to a minicomputer, fluorescence images of single vesicles were obtained at emission polarizer orientations of 0 degrees, 45 degrees, 90 degrees, and 135 degrees relative to the excitation light polarization direction. Fluorescence intensity ratios F90 degrees/F0 degrees (= F perpendicular/F parallel) and F135 degrees/F45 degrees were calculated on a pixel-by-pixel basis from digitized image pairs. Theoretical expressions were derived for collected polarized fluorescence as a function of position on the membrane surface as well as the degree of lipid order, in terms of the fluorophore's maximum angular motional freedom in the bilayer (identical to theta max), using a modification of the method of D. Axelrod (1979. Biophys. J. 26:557-574) together with the "wobbling-in-a-cone" model of probe rotational diffusion. Comparison of experimental polarization ratios with theoretical ratios yielded the following results. In gel-phase dipalmitoyl-phosphatidylcholine, the data for all three probes correspond to a model in which the cone angle theta max = 17 +/- 2 degrees and there exists a collective tilt of the phospholipid acyl chains of 30 degrees relative to the bilayer normal. In addition, approximately 5% of DPH and TMA-DPH molecules are aligned parallel to the plane of the bilayer. In fluid-phase palmitoyloleoyl-phosphatidylcholine, the data are well fit by models in which theta max = 60 +/- 2 degrees for DPH and DPH-PC and 32 +/- 4 degrees for TMA-DPH, with approximately 20% of DPH molecules and 10% of TMA-DPH molecules aligned parallel to the bilayer plane, and a net phospholipid tilt at or near the headgroup region of approximately 30 degrees. The results demonstrate that lipid order can be measured with a spatial resolution of approximately 1 micron2 in cell-size vesicles even with high aperture observation through a microscope. Images FIGURE 4 FIGURE 7 FIGURE 10 PMID:2393705
Newton-like methods for Navier-Stokes solution
NASA Astrophysics Data System (ADS)
Qin, N.; Xu, X.; Richards, B. E.
1992-12-01
The paper reports on Newton-like methods called SFDN-alpha-GMRES and SQN-alpha-GMRES methods that have been devised and proven as powerful schemes for large nonlinear problems typical of viscous compressible Navier-Stokes solutions. They can be applied using a partially converged solution from a conventional explicit or approximate implicit method. Developments have included the efficient parallelization of the schemes on a distributed memory parallel computer. The methods are illustrated using a RISC workstation and a transputer parallel system respectively to solve a hypersonic vortical flow.
NASA Astrophysics Data System (ADS)
Ohkubo, Toshifumi; Park, Majung; Hirata, Masakazu; Oumi, Manabu; Nakajima, Kunio
In near-field optical recording, the combination of a triangular aperture and a polarized illuminating light is thought to be one of the most promising breakthroughs for improving both spatial resolution and signal-to-noise ratio. In light of this, we have already fabricated a triangular-aperture mounted optical head slider and demonstrated its superior performance while clarifying the influence of the polarization direction on the spatial resolution in the circumferential direction. When the polarization direction was perpendicular to the bottom side (which is parallel to the slider trailing edge) of the aperture, the highest spatial resolution and signal contrast were obtained, in spite of the usage of a fairly large aperture, indicating the presence of clear readout signal waveforms corresponding down to 100 nm line-and-space (L/S) patterns. In this study, we tried to experimentally clarify the influence of the polarization direction of the illuminating light on an aperture's field spread in the radial direction. In order to concretely evaluate the field spread, we prepared 1-mm-long linearly arranged (in the circumferential direction) L/S patterns on a metal-layered medium, and a piezo-electric actuator combined positioner. Intersecting the aperture at two portions of the tracks, directly acquired signal waveforms could be successfully transformed into the waveforms that would be obtained if the aperture had crossed the track at right angles. The field spreads in the radial direction were estimated to be approximately 250 nm when the polarization direction was perpendicular to the bottom side. In contrast, when the polarization direction was 45 degrees, the stationary field spread in the radial direction was estimated to be approximately 350 - 370 nm. It could be confirmed experimentally that both the highest spatial resolution in the circumferential direction and the smallest field spread in the radial direction were realized with the combination of the triangular aperture and the illuminating polarized light whose direction was perpendicular to the bottom side. Based on these results, the signal-to-noise ratio will be evaluated and discussed in the future with respect to the above-mentioned optimum aperture structure and conditions.
NASA Astrophysics Data System (ADS)
Wittek, Peter; Calderaro, Luca
2015-12-01
We extended a parallel and distributed implementation of the Trotter-Suzuki algorithm for simulating quantum systems to study a wider range of physical problems and to make the library easier to use. The new release allows periodic boundary conditions, many-body simulations of non-interacting particles, arbitrary stationary potential functions, and imaginary time evolution to approximate the ground state energy. The new release is more resilient to the computational environment: a wider range of compiler chains and more platforms are supported. To ease development, we provide a more extensive command-line interface, an application programming interface, and wrappers from high-level languages.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tan, H.
1999-03-31
The purpose of this research is to develop a multiplexed sample processing system in conjunction with multiplexed capillary electrophoresis for high-throughput DNA sequencing. The concept from DNA template to called bases was first demonstrated with a manually operated single capillary system. Later, an automated microfluidic system with 8 channels based on the same principle was successfully constructed. The instrument automatically processes 8 templates through reaction, purification, denaturation, pre-concentration, injection, separation and detection in a parallel fashion. A multiplexed freeze/thaw switching principle and a distribution network were implemented to manage flow direction and sample transportation. Dye-labeled terminator cycle-sequencing reactions are performedmore » in an 8-capillary array in a hot air thermal cycler. Subsequently, the sequencing ladders are directly loaded into a corresponding size-exclusion chromatographic column operated at {approximately} 60 C for purification. On-line denaturation and stacking injection for capillary electrophoresis is simultaneously accomplished at a cross assembly set at {approximately} 70 C. Not only the separation capillary array but also the reaction capillary array and purification columns can be regenerated after every run. DNA sequencing data from this system allow base calling up to 460 bases with accuracy of 98%.« less
Smart Optical Material Characterization System and Method
NASA Technical Reports Server (NTRS)
Choi, Sang Hyouk (Inventor); Park, Yeonjoon (Inventor)
2015-01-01
Disclosed is a system and method for characterizing optical materials, using steps and equipment for generating a coherent laser light, filtering the light to remove high order spatial components, collecting the filtered light and forming a parallel light beam, splitting the parallel beam into a first direction and a second direction wherein the parallel beam travelling in the second direction travels toward the material sample so that the parallel beam passes through the sample, applying various physical quantities to the sample, reflecting the beam travelling in the first direction to produce a first reflected beam, reflecting the beam that passes through the sample to produce a second reflected beam that travels back through the sample, combining the second reflected beam after it travels back though the sample with the first reflected beam, sensing the light beam produced by combining the first and second reflected beams, and processing the sensed beam to determine sample characteristics and properties.
A model for optimizing file access patterns using spatio-temporal parallelism
DOE Office of Scientific and Technical Information (OSTI.GOV)
Boonthanome, Nouanesengsy; Patchett, John; Geveci, Berk
2013-01-01
For many years now, I/O read time has been recognized as the primary bottleneck for parallel visualization and analysis of large-scale data. In this paper, we introduce a model that can estimate the read time for a file stored in a parallel filesystem when given the file access pattern. Read times ultimately depend on how the file is stored and the access pattern used to read the file. The file access pattern will be dictated by the type of parallel decomposition used. We employ spatio-temporal parallelism, which combines both spatial and temporal parallelism, to provide greater flexibility to possible filemore » access patterns. Using our model, we were able to configure the spatio-temporal parallelism to design optimized read access patterns that resulted in a speedup factor of approximately 400 over traditional file access patterns.« less
Linearly exact parallel closures for slab geometry
NASA Astrophysics Data System (ADS)
Ji, Jeong-Young; Held, Eric D.; Jhang, Hogun
2013-08-01
Parallel closures are obtained by solving a linearized kinetic equation with a model collision operator using the Fourier transform method. The closures expressed in wave number space are exact for time-dependent linear problems to within the limits of the model collision operator. In the adiabatic, collisionless limit, an inverse Fourier transform is performed to obtain integral (nonlocal) parallel closures in real space; parallel heat flow and viscosity closures for density, temperature, and flow velocity equations replace Braginskii's parallel closure relations, and parallel flow velocity and heat flow closures for density and temperature equations replace Spitzer's parallel transport relations. It is verified that the closures reproduce the exact linear response function of Hammett and Perkins [Phys. Rev. Lett. 64, 3019 (1990)] for Landau damping given a temperature gradient. In contrast to their approximate closures where the vanishing viscosity coefficient numerically gives an exact response, our closures relate the heat flow and nonvanishing viscosity to temperature and flow velocity (gradients).
The Potsdam Parallel Ice Sheet Model (PISM-PIK) - Part 1: Model description
NASA Astrophysics Data System (ADS)
Winkelmann, R.; Martin, M. A.; Haseloff, M.; Albrecht, T.; Bueler, E.; Khroulev, C.; Levermann, A.
2010-08-01
We present the Potsdam Parallel Ice Sheet Model (PISM-PIK), developed at the Potsdam Institute for Climate Impact Research to be used for simulations of large-scale ice sheet-shelf systems. It is derived from the Parallel Ice Sheet Model (Bueler and Brown, 2009). Velocities are calculated by superposition of two shallow stress balance approximations within the entire ice covered region: the shallow ice approximation (SIA) is dominant in grounded regions and accounts for shear deformation parallel to the geoid. The plug-flow type shallow shelf approximation (SSA) dominates the velocity field in ice shelf regions and serves as a basal sliding velocity in grounded regions. Ice streams naturally emerge through this approach and can be identified diagnostically as regions with a significant contribution of membrane stresses to the local momentum balance. All lateral boundaries in PISM-PIK are free to evolve, including the grounding line and ice fronts. Ice shelf margins in particular are modeled using Neumann boundary conditions for the SSA equations, reflecting a hydrostatic stress imbalance along the vertical calving face. The ice front position is modeled using a subgrid scale representation of calving front motion (Albrecht et al., 2010) and a physically motivated dynamic calving law based on horizontal spreading rates. The model is validated within the Marine Ice Sheet Model Intercomparison Project (MISMIP) and is used for a dynamic equilibrium simulation of Antarctica under present-day conditions in the second part of this paper (Martin et al., 2010).
NASA Technical Reports Server (NTRS)
Kouznetsov, Igor; Lotko, William
1995-01-01
The 'radial' transport of energy by internal ULF waves, stimulated by dayside magnetospheric boundary oscillations, is analyzed in the framework of one-fluid magnetohydrodynamics. (the term radial is used here to denote the direction orthogonal to geomagnetic flux surfaces.) The model for the inhomogeneous magnetospheric plasma and background magnetic field is axisymmetric and includes radial and parallel variations in the magnetic field, magnetic curvature, plasma density, and low but finite plasma pressure. The radial mode structure of the coupled fast and intermediate MHD waves is determined by numerical solution of the inhomogeneous wave equation; the parallel mode structure is characterized by a Wentzel-Kramer-Brillouin (WKB) approximation. Ionospheric dissipation is modeled by allowing the parallel wave number to be complex. For boudnary oscillations with frequencies in the range from 10 to 48 mHz, and using a dipole model for the background magnetic field, the combined effects of magnetic curvature and finite plasma pressure are shown to (1) enhance the amplitude of field line resonances by as much as a factor of 2 relative to values obtained in a cold plasma or box-model approximation for the dayside magnetosphere; (2) increase the energy flux delivered to a given resonance by a factor of 2-4; and (3) broaden the spectral width of the resonance by a factor of 2-3. The effects are attributed to the existence of an 'Alfven buoyancy oscillation,' which approaches the usual shear mode Alfven wave at resonance, but unlike the shear Alfven mode, it is dispersive at short perpendicular wavelengths. The form of dispersion is analogous to that of an internal atmospheric gravity wave, with the magnetic tension of the curved background field providing the restoring force and allowing radial propagation of the mode. For nominal dayside parameters, the propagation band of the Alfven buoyancy wave occurs between the location of its (field line) resonance and that of the fast mode cutoff that exists at larger radial distances.
Automatic Management of Parallel and Distributed System Resources
NASA Technical Reports Server (NTRS)
Yan, Jerry; Ngai, Tin Fook; Lundstrom, Stephen F.
1990-01-01
Viewgraphs on automatic management of parallel and distributed system resources are presented. Topics covered include: parallel applications; intelligent management of multiprocessing systems; performance evaluation of parallel architecture; dynamic concurrent programs; compiler-directed system approach; lattice gaseous cellular automata; and sparse matrix Cholesky factorization.
New NAS Parallel Benchmarks Results
NASA Technical Reports Server (NTRS)
Yarrow, Maurice; Saphir, William; VanderWijngaart, Rob; Woo, Alex; Kutler, Paul (Technical Monitor)
1997-01-01
NPB2 (NAS (NASA Advanced Supercomputing) Parallel Benchmarks 2) is an implementation, based on Fortran and the MPI (message passing interface) message passing standard, of the original NAS Parallel Benchmark specifications. NPB2 programs are run with little or no tuning, in contrast to NPB vendor implementations, which are highly optimized for specific architectures. NPB2 results complement, rather than replace, NPB results. Because they have not been optimized by vendors, NPB2 implementations approximate the performance a typical user can expect for a portable parallel program on distributed memory parallel computers. Together these results provide an insightful comparison of the real-world performance of high-performance computers. New NPB2 features: New implementation (CG), new workstation class problem sizes, new serial sample versions, more performance statistics.
An experiment in hurricane track prediction using parallel computing methods
NASA Technical Reports Server (NTRS)
Song, Chang G.; Jwo, Jung-Sing; Lakshmivarahan, S.; Dhall, S. K.; Lewis, John M.; Velden, Christopher S.
1994-01-01
The barotropic model is used to explore the advantages of parallel processing in deterministic forecasting. We apply this model to the track forecasting of hurricane Elena (1985). In this particular application, solutions to systems of elliptic equations are the essence of the computational mechanics. One set of equations is associated with the decomposition of the wind into irrotational and nondivergent components - this determines the initial nondivergent state. Another set is associated with recovery of the streamfunction from the forecasted vorticity. We demonstrate that direct parallel methods based on accelerated block cyclic reduction (BCR) significantly reduce the computational time required to solve the elliptic equations germane to this decomposition and forecast problem. A 72-h track prediction was made using incremental time steps of 16 min on a network of 3000 grid points nominally separated by 100 km. The prediction took 30 sec on the 8-processor Alliant FX/8 computer. This was a speed-up of 3.7 when compared to the one-processor version. The 72-h prediction of Elena's track was made as the storm moved toward Florida's west coast. Approximately 200 km west of Tampa Bay, Elena executed a dramatic recurvature that ultimately changed its course toward the northwest. Although the barotropic track forecast was unable to capture the hurricane's tight cycloidal looping maneuver, the subsequent northwesterly movement was accurately forecasted as was the location and timing of landfall near Mobile Bay.
Variable-Complexity Multidisciplinary Optimization on Parallel Computers
NASA Technical Reports Server (NTRS)
Grossman, Bernard; Mason, William H.; Watson, Layne T.; Haftka, Raphael T.
1998-01-01
This report covers work conducted under grant NAG1-1562 for the NASA High Performance Computing and Communications Program (HPCCP) from December 7, 1993, to December 31, 1997. The objective of the research was to develop new multidisciplinary design optimization (MDO) techniques which exploit parallel computing to reduce the computational burden of aircraft MDO. The design of the High-Speed Civil Transport (HSCT) air-craft was selected as a test case to demonstrate the utility of our MDO methods. The three major tasks of this research grant included: development of parallel multipoint approximation methods for the aerodynamic design of the HSCT, use of parallel multipoint approximation methods for structural optimization of the HSCT, mathematical and algorithmic development including support in the integration of parallel computation for items (1) and (2). These tasks have been accomplished with the development of a response surface methodology that incorporates multi-fidelity models. For the aerodynamic design we were able to optimize with up to 20 design variables using hundreds of expensive Euler analyses together with thousands of inexpensive linear theory simulations. We have thereby demonstrated the application of CFD to a large aerodynamic design problem. For the predicting structural weight we were able to combine hundreds of structural optimizations of refined finite element models with thousands of optimizations based on coarse models. Computations have been carried out on the Intel Paragon with up to 128 nodes. The parallel computation allowed us to perform combined aerodynamic-structural optimization using state of the art models of a complex aircraft configurations.
Partitioning in parallel processing of production systems
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oflazer, K.
1987-01-01
This thesis presents research on certain issues related to parallel processing of production systems. It first presents a parallel production system interpreter that has been implemented on a four-processor multiprocessor. This parallel interpreter is based on Forgy's OPS5 interpreter and exploits production-level parallelism in production systems. Runs on the multiprocessor system indicate that it is possible to obtain speed-up of around 1.7 in the match computation for certain production systems when productions are split into three sets that are processed in parallel. The next issue addressed is that of partitioning a set of rules to processors in a parallel interpretermore » with production-level parallelism, and the extent of additional improvement in performance. The partitioning problem is formulated and an algorithm for approximate solutions is presented. The thesis next presents a parallel processing scheme for OPS5 production systems that allows some redundancy in the match computation. This redundancy enables the processing of a production to be divided into units of medium granularity each of which can be processed in parallel. Subsequently, a parallel processor architecture for implementing the parallel processing algorithm is presented.« less
Determination of backbone chain direction of PDA using FFM
NASA Astrophysics Data System (ADS)
Jo, Sadaharu; Okamoto, Kentaro; Takenaga, Mitsuru
2010-01-01
The effect of backbone chains on friction force was investigated on both Langmuir-Blodgett (LB) films of 10,12-heptacosadiynoic acid and the (0 1 0) surfaces of single crystals of 2,4-hexadiene-1,6-diol using friction force microscopy (FFM). It was observed that friction force decreased when the scanning direction was parallel to the [0 0 1] direction in both samples. Moreover, friction force decreased when the scanning direction was parallel to the crystallographic [1 0 2], [1 0 1], [1 0 0] and [1 0 1¯] directions in only the single crystals. For the LB films, the [0 0 1] direction corresponds to the backbone chain direction of 10,12-heptacosadiynoic acid. For the single crystals, both the [0 0 1] and [1 0 1] directions correspond to the backbone chain direction, and the [1 0 2], [1 0 0] and [1 0 1¯] directions correspond to the low-index crystallographic direction. In both the LB films and single crystals, the friction force was minimized when the directions of scanning and the backbone chain were parallel.
LDRD final report on massively-parallel linear programming : the parPCx system.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Parekh, Ojas; Phillips, Cynthia Ann; Boman, Erik Gunnar
2005-02-01
This report summarizes the research and development performed from October 2002 to September 2004 at Sandia National Laboratories under the Laboratory-Directed Research and Development (LDRD) project ''Massively-Parallel Linear Programming''. We developed a linear programming (LP) solver designed to use a large number of processors. LP is the optimization of a linear objective function subject to linear constraints. Companies and universities have expended huge efforts over decades to produce fast, stable serial LP solvers. Previous parallel codes run on shared-memory systems and have little or no distribution of the constraint matrix. We have seen no reports of general LP solver runsmore » on large numbers of processors. Our parallel LP code is based on an efficient serial implementation of Mehrotra's interior-point predictor-corrector algorithm (PCx). The computational core of this algorithm is the assembly and solution of a sparse linear system. We have substantially rewritten the PCx code and based it on Trilinos, the parallel linear algebra library developed at Sandia. Our interior-point method can use either direct or iterative solvers for the linear system. To achieve a good parallel data distribution of the constraint matrix, we use a (pre-release) version of a hypergraph partitioner from the Zoltan partitioning library. We describe the design and implementation of our new LP solver called parPCx and give preliminary computational results. We summarize a number of issues related to efficient parallel solution of LPs with interior-point methods including data distribution, numerical stability, and solving the core linear system using both direct and iterative methods. We describe a number of applications of LP specific to US Department of Energy mission areas and we summarize our efforts to integrate parPCx (and parallel LP solvers in general) into Sandia's massively-parallel integer programming solver PICO (Parallel Interger and Combinatorial Optimizer). We conclude with directions for long-term future algorithmic research and for near-term development that could improve the performance of parPCx.« less
Anhydrous crystals of DNA bases are wide gap semiconductors.
Maia, F F; Freire, V N; Caetano, E W S; Azevedo, D L; Sales, F A M; Albuquerque, E L
2011-05-07
We present the structural, electronic, and optical properties of anhydrous crystals of DNA nucleobases (guanine, adenine, cytosine, and thymine) found after DFT (Density Functional Theory) calculations within the local density approximation, as well as experimental measurements of optical absorption for powders of these crystals. Guanine and cytosine (adenine and thymine) anhydrous crystals are predicted from the DFT simulations to be direct (indirect) band gap semiconductors, with values 2.68 eV and 3.30 eV (2.83 eV and 3.22 eV), respectively, while the experimentally estimated band gaps we have measured are 3.83 eV and 3.84 eV (3.89 eV and 4.07 eV), in the same order. The electronic effective masses we have obtained at band extremes show that, at low temperatures, these crystals behave like wide gap semiconductors for electrons moving along the nucleobases stacking direction, while the hole transport are somewhat limited. Lastly, the calculated electronic dielectric functions of DNA nucleobases crystals in the parallel and perpendicular directions to the stacking planes exhibit a high degree of anisotropy (except cytosine), in agreement with published experimental results.
Directional guidance from audible pedestrian signals for street crossing.
Wall, Robert S; Ashmead, Daniel H; Bentzen, Billie Louise; Barlow, Janet
2004-10-10
Typical audible pedestrian signals indicate when the pedestrian walk interval is in effect but provide little, or even misleading information for directional alignment. In three experiments, blind and blindfolded sighted adults crossed a simulated crossing with recorded traffic noise to approximate street sounds. This was done to investigate how characteristics of signal presentation affected usefulness of the auditory signal for guiding crossing behaviour. Crossing was more accurate when signals came only from the far end of the crossing rather than the typical practice of presenting signals simultaneously from both ends. Alternating the signal between ends of the crossing was not helpful. Also, the customary practice of signalling two parallel crossings at the same time drew participants somewhat toward the opposite crossing. Providing a locator tone at the end of the crossing during the pedestrian clearance interval improved crossing accuracy. These findings provide a basis for designing audible pedestrian signals to enhance directional guidance. The principal findings were the same for blind and sighted participants and applied across a range of specific signals (e.g. chirps, clicks, voices).
Seismic response of transamerica building. II. System identification
Safak, E.; Celebi, M.
1991-01-01
A detailed analysis of the recorded seismic response of the Transamerica Building during the October 17, 1989 Loma Prieta earthquake is presented. The system identification algorithm used for the analysis is based on the discrete-time linear filtering approach with least-squares approximation, and assumes a multi-input, single-output model for the building. Fifteen modes in the north-south direction, and 18 modes in the east-west direction are identified from the records. The analysis shows that the building's response to the earthquake was dominated by a coupled mode of vibration at 0.28 Hz in the southwest-northeast direction, which is almost parallel to one of the diagonals in the building's square cross section. The reason for this behavior is the symmetry of the building's structural characteristics, as well as the strong polarization of the S-waves of the earthquake. Several higher modes of the building were excited during the strong-motion part of the earthquake. The results also show a significant amount of rocking in the building at a frequency of 2.15 Hz.
Nicol, T.H.; Niemann, R.C.; Gonczy, J.D.
1988-11-01
A support system is disclosed for restraining large masses at very low or cryogenic temperatures. The support system employs a tie bar that is pivotally connected at opposite ends to an anchoring support member and a sliding support member. The tie bar extends substantially parallel to the longitudinal axis of the cold mass assembly, and comprises a rod that lengthens when cooled and a pair of end attachments that contract when cooled. The rod and end attachments are sized so that when the tie bar is cooled to cryogenic temperature, the net change in tie bar length is approximately zero. Longitudinal force directed against the cold mass assembly is distributed by the tie bar between the anchoring support member and the sliding support member. 7 figs.
Broad band antennas and feed methods
DOE Office of Scientific and Technical Information (OSTI.GOV)
Benzel, David M.; Twogood, Richard E.
Two or more Vivaldi antennas, consisting of two plates each, each with the antenna's natural impedance of approximately 100 ohms, are placed in parallel to achieve a 50 ohm impedance in the case of two antennas or other impedances (100/n ohms) for more than two antennas. A single Vivaldi antenna plate (half Vivaldi antenna) over a ground plane can also be used to achieve a 50 ohm impedance, or two or more single plates over a ground plane to achieve other impedances. Unbalanced 50 ohm transmission lines, e.g. coaxial cables, can be used to directly feed, the dual Vivaldi (fourmore » plate) antenna in a center fed angled center departure, or more desirably, a center fed offset departure configuration.« less
A new collimator for I-123-IMP SPECT imaging of the brain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Oyamada, H.; Fukukita, H.; Tanaka, E.
1985-05-01
At present, commercially available I-123-IMP is contaminated with I-124 and its concentration on the assay date is said to be approximately 5%. Therefore, the application of medium energy parallel hole collimator (MEPC) used in many places for SPECT results in deterioration of the image quality. Recently, the authors have developed a new collimator for I-123-IMP SPECT imaging comprised of 4 slat type units; ultrahigh resolution (UHR), high resolution (HR), high sensitivity (HS), and ultrahigh sensitivity (UHS). The slit width/septum thickness in mm for UHR, HR, HS, and UHS are 0.9/0.5, 1.5/0.85, 3.2/1.5, and 5.2/2.0, respectively. In practice, either UHR ormore » HR is set to the detector (Shimadzu LFOV-E, modified type) together with either HS or UHS. The former is always set to the detector with the slit direction parallel to the rotation axis, and the latter is set with its slit direction at a right angle to the former. This is based on an idea that, upon sacrifice of resolution to some extent, sensitivity can be gained on the axial direction while the resolution on the transaxial slice will still be sufficiently preserved. Resolutions (transaxial direction/axial direction) in FWHM (mm) for each combination (UHR-HS, UHR-UHS, HR-HS, and HR-UHS) were 15.9/31.4, 15.9/36.5,23.2/33.3, and 23.9/40.7, respectively, whereas the resolution of MEPC was 28.7/29.5. On the other hand, relative sensitivities to MEPC were 0.57, 0.86, 0.80, and 1.16. The authors conclude that the combination of UHR and HS is best suited for clinical practice and, at present they are obtaining I-123-IMP SPECT images of good quality.« less
CASCADE AND DAMPING OF ALFVEN-CYCLOTRON FLUCTUATIONS: APPLICATION TO SOLAR WIND TURBULENCE
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jiang Yanwei; Petrosian, Vahe; Liu Siming
2009-06-10
It is well recognized that the presence of magnetic fields will lead to anisotropic energy cascade and dissipation of astrophysical turbulence. With the diffusion approximation and linear dissipation rates, we study the cascade and damping of Alfven-cyclotron fluctuations in solar plasmas numerically for two diagonal diffusion tensors, one (isotropic) with identical components for the parallel and perpendicular directions (with respect to the magnetic field) and one with different components (nonisotropic). It is found that for the isotropic case the steady-state turbulence spectra are nearly isotropic in the inertial range and can be fitted by a single power-law function with amore » spectral index of -3/2, similar to the Iroshnikov-Kraichnan phenomenology, while for the nonisotropic case the spectra vary greatly with the direction of propagation. The energy fluxes in both cases are much higher in the perpendicular direction than in the parallel direction due to the angular dependence (or inhomogeneity) of the components. In addition, beyond the MHD regime the kinetic effects make the spectrum softer at higher wavenumbers. In the dissipation range the turbulence spectrum cuts off at the wavenumber, where the damping rate becomes comparable to the cascade rate, and the cutoff wavenumber changes with the wave propagation direction. The angle-averaged turbulence spectrum of the isotropic model resembles a broken power law, which cuts off at the maximum of the cutoff wavenumbers or the {sup 4}He cyclotron frequency. Taking into account the Doppler effects, the model naturally reproduces the broken power-law turbulence spectra observed in the solar wind and predicts that a higher break frequency always comes along with a softer dissipation range spectrum that may be caused by the increase of the turbulence intensity, the reciprocal of the plasma {beta}{sub p}, and/or the angle between the solar wind velocity and the mean magnetic field. These predictions can be tested by detailed comparisons with more accurate observations.« less
Hansen, R; Thogersen, T; Rogalla, F
2007-01-01
In the early 1990s, the Wastewater Treatment Plant (WWTP) of Frederikshavn, Denmark, was extended to meet new requirements for nutrient removal (8 mg/L TN, 1.5 mg TP/L) as well as to increase its average daily flow to 16,500 m(3)/d (4.5 MGD). As the most economical upgrade of the existing activated sludge (AS) plant, a parallel biological aerated filter (BAF) was selected, and started up in 1995. Running two full scale processes in parallel for over ten years on the same wastewater and treatment objectives enabled a direct comparison in relation to operating performance, costs and experience. Common pretreatment consists of screening, an aerated grit and grease removal and three primary settlers with chemical addition. The effluent is then pumped to the two parallel biological treatment stages, AS with recirculation and an upflow BAF with floating media. The wastewater is a mixture of industrial and domestic wastewater, with a dominant discharge of fish processing effluent which can amount to 50% of the flow. The maximum hydraulic load on the pretreatment section as a whole is 1,530 m(3)/h. Approximately 60% of the sewer system is combined with a total of 32 overflow structures. To avoid the direct discharge of combined sewer overflows into the receiving waters, the total hydraulic wet weather capacity of the plant is increased to 4,330 m(3)/h, or 6 times average flow. During rain, some of the raw sewage can be directed through a stormwater bypass to the BAF, which can be modified in its operation to accommodate various treatment needs: either using simultaneous nitrification/denitrification in all filters with recirculation introducing bottom aeration with full nitrification in some filters for storm treatment and/or post-denitrification in one filter. After treatment, the wastewater is discharged to the Baltic Sea through a 500 m outfall. The BAF backwash sludge, approximately 1,900 m(3) per 24 h in dry weather, is redirected to the AS plant. Primary settler sludge and the combined biosolids from the AS plant are anaerobically digested, with methane gas being used for generation of heat and power. On-line measurements for the parameters NO3, NO2, NH4, temperature as well as dissolved oxygen (DO) are used for control of aeration and external carbon source (methanol). Dosing of flocculants for P-removal is carried out based on laboratory analysis and jar tests. This paper discusses the experience gained from the plant operation during the last ten years, compiling comparative performance and cost data of the two processes, as well as their optimisation.
A Knowledge-Based Approach for Item Exposure Control in Computerized Adaptive Testing
ERIC Educational Resources Information Center
Doong, Shing H.
2009-01-01
The purpose of this study is to investigate a functional relation between item exposure parameters (IEPs) and item parameters (IPs) over parallel pools. This functional relation is approximated by a well-known tool in machine learning. Let P and Q be parallel item pools and suppose IEPs for P have been obtained via a Sympson and Hetter-type…
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shearer, C.K.; Papike, J.J.; Burger, P.V.
2012-03-15
The relative proportion of divalent and trivalent Eu has proven to be a useful tool for estimating f{sub O{sub 2}} in various magmatic systems. However, in most cases, direct determination of the Eu valence state has not been made. In this study, direct determination of Eu valence by XANES and REE abundance in merrillite provide insights into the crystal chemistry of these phosphates and their ability to record conditions of magmatism. Merrillite strongly prefers Eu{sup 3+} to Eu{sup 2+}, with the average valence state of Eu ranging between 2.9 and 3 over approximately six orders of magnitude in f{sub O{submore » 2}}. The dramatic shift in the REE patterns of merrillite in martian basaltic magmas, from highly LREE-depleted to LREE-enriched, parallels many other trace element and isotopic variations and reflects the sources for these magmas. The behavior of REE in the merrillite directly reflects the relationship between the eightfold-coordinated Ca1 site and adjacent sixfold Na and tetrahedral P sites that enables charge balancing through coupled substitutions.« less
A queueing network model to analyze the impact of parallelization of care on patient cycle time.
Jiang, Lixiang; Giachetti, Ronald E
2008-09-01
The total time a patient spends in an outpatient facility, called the patient cycle time, is a major contributor to overall patient satisfaction. A frequently recommended strategy to reduce the total time is to perform some activities in parallel thereby shortening patient cycle time. To analyze patient cycle time this paper extends and improves upon existing multi-class open queueing network model (MOQN) so that the patient flow in an urgent care center can be modeled. Results of the model are analyzed using data from an urgent care center contemplating greater parallelization of patient care activities. The results indicate that parallelization can reduce the cycle time for those patient classes which require more than one diagnostic and/ or treatment intervention. However, for many patient classes there would be little if any improvement, indicating the importance of tools to analyze business process reengineering rules. The paper makes contributions by implementing an approximation for fork/join queues in the network and by improving the approximation for multiple server queues in both low traffic and high traffic conditions. We demonstrate the accuracy of the MOQN results through comparisons to simulation results.
Superfocusing terahertz waves below lambda/250 using plasmonic parallel-plate waveguides.
Zhan, Hui; Mendis, Rajind; Mittleman, Daniel M
2010-04-26
We experimentally demonstrate complete two-dimensional (2-D) confinement of terahertz (THz) energy in finite-width parallel-plate waveguides, defying conventional wisdom in the century-old field of microwave waveguide technology. We find that the degree of energy confinement increases exponentially with decreasing plate separation. We propose that this 2-D confinement is mediated by the mutual coupling of plasmonic edge modes, analogous to that observed in slot waveguides at optical wavelengths. By adiabatically tapering the width and the separation, we focus THz waves down to a size of 10 microm (approximately lambda/260) by 18 microm ( approximately lambda/145), which corresponds to a mode area of only 2.6 x 10(-5) lambda(2).
Albedo of an irradiated plane-parallel atmosphere with finite optical depth
NASA Astrophysics Data System (ADS)
Fukue, Jun
2018-03-01
We analytically derive albedo for a plane-parallel atmosphere with finite optical depth, irradiated by an external source, under the local thermodynamic equilibrium approximation. Albedo is expressed as a function of the photon destruction probability ɛ and optical depth τ, with several parameters such as dilution factors of the external source. In the particular case of the infinite optical depth, albedo A is expressed as A=[1 + (1-W_J/W_H)√{3ɛ}/3]/(1+√{3ɛ}), where WJ and WH are the dilution factors for the mean intensity and Eddington flux, respectively. An example of a model atmosphere is also presented under a gray approximation.
Dunn, Katherine E; Trefzer, Martin A; Johnson, Steven; Tyrrell, Andy M
2016-08-01
Molecular computation with DNA has great potential for low power, highly parallel information processing in a biological or biochemical context. However, significant challenges remain for the field of DNA computation. New technology is needed to allow multiplexed label-free readout and to enable regulation of molecular state without addition of new DNA strands. These capabilities could be provided by hybrid bioelectronic systems in which biomolecular computing is integrated with conventional electronics through immobilization of DNA machines on the surface of electronic circuitry. Here we present a quantitative experimental analysis of a surface-immobilized OR gate made from DNA and driven by strand displacement. The purpose of our work is to examine the performance of a simple representative surface-immobilized DNA logic machine, to provide valuable information for future work on hybrid bioelectronic systems involving DNA devices. We used a quartz crystal microbalance to examine a DNA monolayer containing approximately 5×10(11)gatescm(-2), with an inter-gate separation of approximately 14nm, and we found that the ensemble of gates took approximately 6min to switch. The gates could be switched repeatedly, but the switching efficiency was significantly degraded on the second and subsequent cycles when the binding site for the input was near to the surface. Otherwise, the switching efficiency could be 80% or better, and the power dissipated by the ensemble of gates during switching was approximately 0.1nWcm(-2), which is orders of magnitude less than the power dissipated during switching of an equivalent array of transistors. We propose an architecture for hybrid DNA-electronic systems in which information can be stored and processed, either in series or in parallel, by a combination of molecular machines and conventional electronics. In this architecture, information can flow freely and in both directions between the solution-phase and the underlying electronics via surface-immobilized DNA machines that provide the interface between the molecular and electronic domains. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Parallelized CCHE2D flow model with CUDA Fortran on Graphics Process Units
USDA-ARS?s Scientific Manuscript database
This paper presents the CCHE2D implicit flow model parallelized using CUDA Fortran programming technique on Graphics Processing Units (GPUs). A parallelized implicit Alternating Direction Implicit (ADI) solver using Parallel Cyclic Reduction (PCR) algorithm on GPU is developed and tested. This solve...
Aerostructural analysis and design optimization of composite aircraft
NASA Astrophysics Data System (ADS)
Kennedy, Graeme James
High-performance composite materials exhibit both anisotropic strength and stiffness properties. These anisotropic properties can be used to produce highly-tailored aircraft structures that meet stringent performance requirements, but these properties also present unique challenges for analysis and design. New tools and techniques are developed to address some of these important challenges. A homogenization-based theory for beams is developed to accurately predict the through-thickness stress and strain distribution in thick composite beams. Numerical comparisons demonstrate that the proposed beam theory can be used to obtain highly accurate results in up to three orders of magnitude less computational time than three-dimensional calculations. Due to the large finite-element model requirements for thin composite structures used in aerospace applications, parallel solution methods are explored. A parallel direct Schur factorization method is developed. The parallel scalability of the direct Schur approach is demonstrated for a large finite-element problem with over 5 million unknowns. In order to address manufacturing design requirements, a novel laminate parametrization technique is presented that takes into account the discrete nature of the ply-angle variables, and ply-contiguity constraints. This parametrization technique is demonstrated on a series of structural optimization problems including compliance minimization of a plate, buckling design of a stiffened panel and layup design of a full aircraft wing. The design and analysis of composite structures for aircraft is not a stand-alone problem and cannot be performed without multidisciplinary considerations. A gradient-based aerostructural design optimization framework is presented that partitions the disciplines into distinct process groups. An approximate Newton-Krylov method is shown to be an efficient aerostructural solution algorithm and excellent parallel scalability of the algorithm is demonstrated. An induced drag optimization study is performed to compare the trade-off between wing weight and induced drag for wing tip extensions, raked wing tips and winglets. The results demonstrate that it is possible to achieve a 43% induced drag reduction with no weight penalty, a 28% induced drag reduction with a 10% wing weight reduction, or a 20% wing weight reduction with a 5% induced drag penalty from a baseline wing obtained from a structural mass-minimization problem with fixed aerodynamic loads.
NASA Technical Reports Server (NTRS)
Miller, R. H.; Gombosi, T. I.; Gary, S. P.; Winske, D.
1991-01-01
The direction of propagation of low frequency magnetic fluctuations generated by cometary ion pick-up is examined by means of 1D electromagnetic hybrid simulations. The newborn ions are injected at a constant rate, and the helicity and direction of propagation of magnetic fluctuations are explored for cometary ion injection angles of 0 and 90 deg relative to the solar wind magnetic field. The parameter eta represents the relative contribution of wave energy propagating in the direction away from the comet, parallel to the beam. For small (quasi-parallel) injection angles eta was found to be of order unity, while for larger (quasi-perpendicular) angles eta was found to be of order 0.5.
Parallel multigrid smoothing: polynomial versus Gauss-Seidel
NASA Astrophysics Data System (ADS)
Adams, Mark; Brezina, Marian; Hu, Jonathan; Tuminaro, Ray
2003-07-01
Gauss-Seidel is often the smoother of choice within multigrid applications. In the context of unstructured meshes, however, maintaining good parallel efficiency is difficult with multiplicative iterative methods such as Gauss-Seidel. This leads us to consider alternative smoothers. We discuss the computational advantages of polynomial smoothers within parallel multigrid algorithms for positive definite symmetric systems. Two particular polynomials are considered: Chebyshev and a multilevel specific polynomial. The advantages of polynomial smoothing over traditional smoothers such as Gauss-Seidel are illustrated on several applications: Poisson's equation, thin-body elasticity, and eddy current approximations to Maxwell's equations. While parallelizing the Gauss-Seidel method typically involves a compromise between a scalable convergence rate and maintaining high flop rates, polynomial smoothers achieve parallel scalable multigrid convergence rates without sacrificing flop rates. We show that, although parallel computers are the main motivation, polynomial smoothers are often surprisingly competitive with Gauss-Seidel smoothers on serial machines.
7 CFR 52.3753 - Styles of canned ripe olives.
Code of Federal Regulations, 2010 CFR
2010-01-01
...) Halved. “Halved” olives are pitted olives in which each olive is cut lengthwise into two approximately equal parts. (d) Segmented. “Segmented” olives are pitted olives in which each olive is cut lengthwise into three or more approximately equal parts. (e) Sliced. “Sliced” olives consist of parallel slices of...
7 CFR 52.3753 - Styles of canned ripe olives.
Code of Federal Regulations, 2011 CFR
2011-01-01
...) Halved. “Halved” olives are pitted olives in which each olive is cut lengthwise into two approximately equal parts. (d) Segmented. “Segmented” olives are pitted olives in which each olive is cut lengthwise into three or more approximately equal parts. (e) Sliced. “Sliced” olives consist of parallel slices of...
Unstable Resonator Optical Parametric Oscillator Based on Quasi-Phase-Matched RbTiOAsO(4).
Hansson, G; Karlsson, H; Laurell, F
2001-10-20
We demonstrate improved signal and idler-beam quality of a 3-mm-aperture quasi-phase-matched RbTiOAsO(4) optical parametric oscillator through use of a confocal unstable resonator as compared with a plane-parallel resonator. Both oscillators were singly resonant, and the periodically poled RbTiOAsO(4) crystal generated a signal at 1.56 mum and an idler at 3.33 mum when pumped at 1.064 mum. We compared the beam quality produced by the 1.2-magnification confocal unstable resonator with the beam quality produced by the plane-parallel resonator by measuring the signal and the idler beam M(2) value. We also investigated the effect of pump-beam intensity distribution by comparing the result of a Gaussian and a top-hat intensity profile pump beam. We generated a signal beam of M(2) approximately 7 and an idler beam of M(2) approximately 2.5 through use of an unstable resonator and a Gaussian intensity profile pump beam. This corresponds to an increase of a factor of approximately 2 in beam quality for the signal and a factor of 3 for the idler, compared with the beam quality of the plane-parallel resonator optical parametric oscillator.
NASA Astrophysics Data System (ADS)
Shariati, Maryam; Yortsos, Yannis; Talon, Laurent; Martin, Jerome; Rakotomalala, Nicole; Salin, Dominique
2003-11-01
We consider miscible displacement between parallel plates, where the viscosity is a function of the concentration. By selecting a piece-wise representation, the problem can be considered as ``three-phase'' flow. Assuming a lubrication-type approximation, the mathematical description is in terms of two quasi-linear hyperbolic equations. When the mobility of the middle phase is smaller than its neighbors, the system is genuinely hyperbolic and can be solved analytically. However, when it is larger, an elliptic region develops. This change-of-type behavior is for the first time proved here based on sound physical principles. Numerical solutions with a small diffusion are presented. Good agreement is obtained outside the elliptic region, but not inside, where the numerical results show unstable behavior. We conjecture that for the solution of the real problem in the mixed-type case, the full higher-dimensionality problem must be considered inside the elliptic region, in which the lubrication (parallel-flow) approximation is no longer appropriate. This is discussed in a companion presentation.
Generalized kinetic-neoclassical closure for parallel viscosity in a tokamak.
NASA Astrophysics Data System (ADS)
Smolyakov, A.; Callen, J. D.; Hegna, C.
2000-10-01
We develop a drift-kinetic equation for a Chapman Enskog-type calculations of the parallel viscosity in a tokamak. This approach allows us to uniformly obtain closure relations for the parallel viscosity that include the kinetic effects of wave-particle interactions, such as those of Hammet-Perkins closures, as well as standard neoclassical moment closures induced by collisions and the magnetic field strength variation along field lines. Closures for both these cases can be obtained from our expressions; also, their mutual influences can be investigated. The developed equations allow calculation of parallel vicosity in general kinetic-neoclassical regimes while the main conservation properties remain correct even with an approximate treatment of the collisional operator.
Direct Observation of Parallel Folding Pathways Revealed Using a Symmetric Repeat Protein System
Aksel, Tural; Barrick, Doug
2014-01-01
Although progress has been made to determine the native fold of a polypeptide from its primary structure, the diversity of pathways that connect the unfolded and folded states has not been adequately explored. Theoretical and computational studies predict that proteins fold through parallel pathways on funneled energy landscapes, although experimental detection of pathway diversity has been challenging. Here, we exploit the high translational symmetry and the direct length variation afforded by linear repeat proteins to directly detect folding through parallel pathways. By comparing folding rates of consensus ankyrin repeat proteins (CARPs), we find a clear increase in folding rates with increasing size and repeat number, although the size of the transition states (estimated from denaturant sensitivity) remains unchanged. The increase in folding rate with chain length, as opposed to a decrease expected from typical models for globular proteins, is a clear demonstration of parallel pathways. This conclusion is not dependent on extensive curve-fitting or structural perturbation of protein structure. By globally fitting a simple parallel-Ising pathway model, we have directly measured nucleation and propagation rates in protein folding, and have quantified the fluxes along each path, providing a detailed energy landscape for folding. This finding of parallel pathways differs from results from kinetic studies of repeat-proteins composed of sequence-variable repeats, where modest repeat-to-repeat energy variation coalesces folding into a single, dominant channel. Thus, for globular proteins, which have much higher variation in local structure and topology, parallel pathways are expected to be the exception rather than the rule. PMID:24988356
Scan Directed Load Balancing for Highly-Parallel Mesh-Connected Computers
1991-07-01
DTIC ~ ELECTE OCT 2 41991 AD-A242 045 Scan Directed Load Balancing for Highly-Parallel Mesh-Connected Computers’ Edoardo S. Biagioni Jan F. Prins...Department of Computer Science University of North Carolina Chapel Hill, N.C. 27599-3175 USA biagioni @cs.unc.edu prinsOcs.unc.edu Abstract Scan Directed...MasPar Computer Corpora- tion. Bibliography [1] Edoardo S. Biagioni . Scan Directed Load Balancing. PhD thesis., University of North Carolina, Chapel Hill
62. A view looking northeast, parallel to U.S. 24 shows ...
62. A view looking northeast, parallel to U.S. 24 shows an 'outrigger' crib which remains, within a yard of the roadway pavement and approximately 5 feet lower in grade. This actually represents a second register of cribs in the lock. - Wabash & Erie Canal, Lock No. 2, 8 miles east of Fort Wayne, adjacent to U.S. Route 24, New Haven, Allen County, IN
NASA Astrophysics Data System (ADS)
Liakos, Anastasios; Malamataris, Nikolaos A.
2014-05-01
The topology and evolution of flow around a surface mounted cubical object in three dimensional channel flow is examined for low to moderate Reynolds numbers. Direct numerical simulations were performed via a home made parallel finite element code. The computational domain has been designed according to actual laboratory experiment conditions. Analysis of the results is performed using the three dimensional theory of separation. Our findings indicate that a tornado-like vortex by the side of the cube is present for all Reynolds numbers for which flow was simulated. A horseshoe vortex upstream from the cube was formed at Reynolds number approximately 1266. Pressure distributions are shown along with three dimensional images of the tornado-like vortex and the horseshoe vortex at selected Reynolds numbers. Finally, and in accordance to previous work, our results indicate that the upper limit for the Reynolds number for which steady state results are physically realizable is roughly 2000.
A transient FETI methodology for large-scale parallel implicit computations in structural mechanics
NASA Technical Reports Server (NTRS)
Farhat, Charbel; Crivelli, Luis; Roux, Francois-Xavier
1992-01-01
Explicit codes are often used to simulate the nonlinear dynamics of large-scale structural systems, even for low frequency response, because the storage and CPU requirements entailed by the repeated factorizations traditionally found in implicit codes rapidly overwhelm the available computing resources. With the advent of parallel processing, this trend is accelerating because explicit schemes are also easier to parallelize than implicit ones. However, the time step restriction imposed by the Courant stability condition on all explicit schemes cannot yet -- and perhaps will never -- be offset by the speed of parallel hardware. Therefore, it is essential to develop efficient and robust alternatives to direct methods that are also amenable to massively parallel processing because implicit codes using unconditionally stable time-integration algorithms are computationally more efficient when simulating low-frequency dynamics. Here we present a domain decomposition method for implicit schemes that requires significantly less storage than factorization algorithms, that is several times faster than other popular direct and iterative methods, that can be easily implemented on both shared and local memory parallel processors, and that is both computationally and communication-wise efficient. The proposed transient domain decomposition method is an extension of the method of Finite Element Tearing and Interconnecting (FETI) developed by Farhat and Roux for the solution of static problems. Serial and parallel performance results on the CRAY Y-MP/8 and the iPSC-860/128 systems are reported and analyzed for realistic structural dynamics problems. These results establish the superiority of the FETI method over both the serial/parallel conjugate gradient algorithm with diagonal scaling and the serial/parallel direct method, and contrast the computational power of the iPSC-860/128 parallel processor with that of the CRAY Y-MP/8 system.
NASA Technical Reports Server (NTRS)
Noh, H. M.; Pathak, P. H.
1986-01-01
An approximate but sufficiently accurate high frequency solution which combines the uniform geometrical theory of diffraction (UTD) and the aperture integration (AI) method is developed for analyzing the problem of electromagnetic (EM) plane wave scattering by an open-ended, perfectly-conducting, semi-infinite hollow rectangular waveguide (or duct) with a thin, uniform layer of lossy or absorbing material on its inner wall, and with a planar termination inside. In addition, a high frequency solution for the EM scattering by a two dimensional (2-D), semi-infinite parallel plate waveguide with a absorber coating on the inner walls is also developed as a first step before analyzing the open-ended semi-infinite three dimensional (3-D) rectangular waveguide geometry. The total field scattered by the semi-infinite waveguide consists firstly of the fields scattered from the edges of the aperture at the open-end, and secondly of the fields which are coupled into the waveguide from the open-end and then reflected back from the interior termination to radiate out of the open-end. The first contribution to the scattered field can be found directly via the UTD ray method. The second contribution is found via the AI method which employs rays to describe the fields in the aperture that arrive there after reflecting from the interior termination. It is assumed that the direction of the incident plane wave and the direction of observation lie well inside the forward half space tht exists outside the half space containing the semi-infinite waveguide geometry. Also, the medium exterior to the waveguide is assumed to be free space.
NASA Astrophysics Data System (ADS)
Pastori, M.; Piccinini, D.; Margheriti, L.; Improta, L.; Valoroso, L.; Chiaraluce, L.; Chiarabba, C.
2009-10-01
Shear wave splitting is measured at 19 seismic stations of a temporary network deployed in the Val d'Agri area to record low-magnitude seismic activity. The splitting results suggest the presence of an anisotropic layer between the surface and 15 km depth (i.e. above the hypocentres). The dominant fast polarization direction strikes NW-SE parallel to the Apennines orogen and is approximately parallel to the maximum horizontal stress in the region, as well as to major normal faults bordering the Val d'Agri basin. The size of the normalized delay times in the study region is about 0.01 s km-1, suggesting 4.5 percent shear wave velocity anisotropy (SWVA). On the south-western flank of the basin, where most of the seismicity occurs, we found larger values of normalized delay times, between 0.017 and 0.02 s km-1. These high values suggest a 10 percent of SWVA. These parameters agree with an interpretation of seismic anisotropy in terms of the Extensive-Dilatancy Anisotropy (EDA) model that considers the rock volume pervaded by fluid-saturated microcracks aligned by the active stress field. Anisotropic parameters are consistent with borehole image logs from deep exploration wells in the Val d'Agri oil field that detect pervasive fluid saturated microcracks striking NW-SE parallel to the maximum horizontal stress in the carbonatic reservoir. However, we cannot rule out the contribution of aligned macroscopic fractures because the main Quaternary normal faults are parallel to the maximum horizontal stress. The strong anisotropy and the seismicity concentration testify for active deformation along the SW flank of the basin.
Parallel simulation of tsunami inundation on a large-scale supercomputer
NASA Astrophysics Data System (ADS)
Oishi, Y.; Imamura, F.; Sugawara, D.
2013-12-01
An accurate prediction of tsunami inundation is important for disaster mitigation purposes. One approach is to approximate the tsunami wave source through an instant inversion analysis using real-time observation data (e.g., Tsushima et al., 2009) and then use the resulting wave source data in an instant tsunami inundation simulation. However, a bottleneck of this approach is the large computational cost of the non-linear inundation simulation and the computational power of recent massively parallel supercomputers is helpful to enable faster than real-time execution of a tsunami inundation simulation. Parallel computers have become approximately 1000 times faster in 10 years (www.top500.org), and so it is expected that very fast parallel computers will be more and more prevalent in the near future. Therefore, it is important to investigate how to efficiently conduct a tsunami simulation on parallel computers. In this study, we are targeting very fast tsunami inundation simulations on the K computer, currently the fastest Japanese supercomputer, which has a theoretical peak performance of 11.2 PFLOPS. One computing node of the K computer consists of 1 CPU with 8 cores that share memory, and the nodes are connected through a high-performance torus-mesh network. The K computer is designed for distributed-memory parallel computation, so we have developed a parallel tsunami model. Our model is based on TUNAMI-N2 model of Tohoku University, which is based on a leap-frog finite difference method. A grid nesting scheme is employed to apply high-resolution grids only at the coastal regions. To balance the computation load of each CPU in the parallelization, CPUs are first allocated to each nested layer in proportion to the number of grid points of the nested layer. Using CPUs allocated to each layer, 1-D domain decomposition is performed on each layer. In the parallel computation, three types of communication are necessary: (1) communication to adjacent neighbours for the finite difference calculation, (2) communication between adjacent layers for the calculations to connect each layer, and (3) global communication to obtain the time step which satisfies the CFL condition in the whole domain. A preliminary test on the K computer showed the parallel efficiency on 1024 cores was 57% relative to 64 cores. We estimate that the parallel efficiency will be considerably improved by applying a 2-D domain decomposition instead of the present 1-D domain decomposition in future work. The present parallel tsunami model was applied to the 2011 Great Tohoku tsunami. The coarsest resolution layer covers a 758 km × 1155 km region with a 405 m grid spacing. A nesting of five layers was used with the resolution ratio of 1/3 between nested layers. The finest resolution region has 5 m resolution and covers most of the coastal region of Sendai city. To complete 2 hours of simulation time, the serial (non-parallel) computation took approximately 4 days on a workstation. To complete the same simulation on 1024 cores of the K computer, it took 45 minutes which is more than two times faster than real-time. This presentation discusses the updated parallel computational performance and the efficient use of the K computer when considering the characteristics of the tsunami inundation simulation model in relation to the characteristics and capabilities of the K computer.
NASA Astrophysics Data System (ADS)
Diama, A.; Matthies, B.; Herwig, K. W.; Hansen, F. Y.; Criswell, L.; Mo, H.; Bai, M.; Taub, H.
2009-08-01
We present evidence from neutron diffraction measurements and molecular dynamics (MD) simulations of three different monolayer phases of the intermediate-length alkanes tetracosane (n-C24H50 denoted as C24) and dotriacontane (n-C32H66 denoted as C32) adsorbed on a graphite basal-plane surface. Our measurements indicate that the two monolayer films differ principally in the transition temperatures between phases. At the lowest temperatures, both C24 and C32 form a crystalline monolayer phase with a rectangular-centered (RC) structure. The two sublattices of the RC structure each consists of parallel rows of molecules in their all-trans conformation aligned with their long axis parallel to the surface and forming so-called lamellas of width approximately equal to the all-trans length of the molecule. The RC structure is uniaxially commensurate with the graphite surface in its [110] direction such that the distance between molecular rows in a lamella is 4.26 Å=√3 ag, where ag=2.46 Å is the lattice constant of the graphite basal plane. Molecules in adjacent rows of a lamella alternate in orientation between the carbon skeletal plane being parallel and perpendicular to the graphite surface. Upon heating, the crystalline monolayers transform to a "smectic" phase in which the inter-row spacing within a lamella expands by ˜10% and the molecules are predominantly oriented with the carbon skeletal plane parallel to the graphite surface. In the smectic phase, the MD simulations show evidence of broadening of the lamella boundaries as a result of molecules diffusing parallel to their long axis. At still higher temperatures, they indicate that the introduction of gauche defects into the alkane chains drives a melting transition to a monolayer fluid phase as reported previously.
Laboratory Study of the Displacement Coalbed CH4 Process and Efficiency of CO2 and N2 Injection
Wang, Liguo; Wang, Yongkang
2014-01-01
ECBM displacement experiments are a direct way to observe the gas displacement process and efficiency by inspecting the produced gas composition and flow rate. We conducted two sets of ECBM experiments by injecting N2 and CO2 through four large parallel specimens (300 × 50 × 50 mm coal briquette). N2 or CO2 is injected at pressures of 1.5, 1.8, and 2.2 MPa and various crustal stresses. The changes in pressure along the briquette and the concentration of the gas mixture flowing out of the briquette were analyzed. Gas injection significantly enhances CBM recovery. Experimental recoveries of the original extant gas are in excess of 90% for all cases. The results show that the N2 breakthrough occurs earlier than the CO2 breakthrough. The breakthrough time of N2 is approximately 0.5 displaced volumes. Carbon dioxide, however, breaks through at approximately 2 displaced volumes. Coal can adsorb CO2, which results in a slower breakthrough time. In addition, ground stress significantly influences the displacement effect of the gas injection. PMID:24741346
Implementing direct, spatially isolated problems on transputer networks
NASA Technical Reports Server (NTRS)
Ellis, Graham K.
1988-01-01
Parametric studies were performed on transputer networks of up to 40 processors to determine how to implement and maximize the performance of the solution of problems where no processor-to-processor data transfer is required for the problem solution (spatially isolated). Two types of problems are investigated a computationally intensive problem where the solution required the transmission of 160 bytes of data through the parallel network, and a communication intensive example that required the transmission of 3 Mbytes of data through the network. This data consists of solutions being sent back to the host processor and not intermediate results for another processor to work on. Studies were performed on both integer and floating-point transputers. The latter features an on-chip floating-point math unit and offers approximately an order of magnitude performance increase over the integer transputer on real valued computations. The results indicate that a minimum amount of work is required on each node per communication to achieve high network speedups (efficiencies). The floating-point processor requires approximately an order of magnitude more work per communication than the integer processor because of the floating-point unit's increased computing capacity.
NASA Astrophysics Data System (ADS)
Vargas, C.; Arcos, J.; Bautista, O.; Méndez, F.
2017-09-01
The effective dispersion coefficient of a neutral solute in the combined electroosmotic (EO) and magnetohydrodynamic (MHD)-driven flow of a Newtonian fluid through a parallel flat plate microchannel is studied. The walls of the microchannel are assumed to have modulated and low zeta potentials that vary slowly in the axial direction in a sinusoidal manner. The flow field required to obtain the dispersion coefficient is solved using the lubrication approximation theory. The solution of the electrical potential is based on the Debye-Hückel approximation for a symmetric (Z :Z ) electrolyte solution. The EO and MHD effects, together with the variations in the zeta potentials of the walls, are observed to notably modify the axial distribution of the effective dispersion coefficient. The problem is formulated for two cases of the zeta potential function. Note that the dispersion coefficient primarily depends on the Hartmann number, on the ratio of the half height of the microchannel to the Debye length, and on the assumed variation in the zeta potentials of the walls.
Control of parallel manipulators using force feedback
NASA Technical Reports Server (NTRS)
Nanua, Prabjot
1994-01-01
Two control schemes are compared for parallel robotic mechanisms actuated by hydraulic cylinders. One scheme, the 'rate based scheme', uses the position and rate information only for feedback. The second scheme, the 'force based scheme' feeds back the force information also. The force control scheme is shown to improve the response over the rate control one. It is a simple constant gain control scheme better suited to parallel mechanisms. The force control scheme can be easily modified for the dynamic forces on the end effector. This paper presents the results of a computer simulation of both the rate and force control schemes. The gains in the force based scheme can be individually adjusted in all three directions, whereas the adjustment in just one direction of the rate based scheme directly affects the other two directions.
Federal Register 2010, 2011, 2012, 2013, 2014
2010-07-22
... include: demolition of approximately 6,435 feet of Airport Road; construction of approximately 6,405 feet of relocated Airport Road; installation of ILS components on the north end of Runway 20; construction of access roads and equipment shelter buildings; construction of the parallel taxiway/ramp expansion...
Anisotropic surface-state-mediated RKKY interaction between adatoms on a hexagonal lattice
NASA Astrophysics Data System (ADS)
Patrone, Paul N.; Einstein, T. L.
2012-01-01
Motivated by recent numerical studies of Ag on Pt(111), we derive an expression for the RKKY interaction mediated by surface states, considering the effect of anisotropy in the Fermi edge. Our analysis is based on a stationary phase approximation. The main contribution to the interaction comes from electrons whose Fermi velocity vF is parallel to the vector R connecting the interacting adatoms; we show that, in general, the corresponding Fermi wave vector kF is not parallel to R. The interaction is oscillatory; the amplitude and wavelength of oscillations have angular dependence arising from the anisotropy of the surface-state band structure. The wavelength, in particular, is determined by the projection of this kF (corresponding to vF) onto the direction of R. Our analysis is easily generalized to other systems. For Ag on Pt(111), our results indicate that the RKKY interaction between pairs of adatoms should be nearly isotropic and so cannot account for the anisotropy found in the studies motivating our work. However, for metals with surface-state dispersions similar to Be(101¯0), we show that the RKKY interaction should have considerable anisotropy.
NASA Technical Reports Server (NTRS)
Krosel, S. M.; Milner, E. J.
1982-01-01
The application of Predictor corrector integration algorithms developed for the digital parallel processing environment are investigated. The algorithms are implemented and evaluated through the use of a software simulator which provides an approximate representation of the parallel processing hardware. Test cases which focus on the use of the algorithms are presented and a specific application using a linear model of a turbofan engine is considered. Results are presented showing the effects of integration step size and the number of processors on simulation accuracy. Real time performance, interprocessor communication, and algorithm startup are also discussed.
Parallel Directionally Split Solver Based on Reformulation of Pipelined Thomas Algorithm
NASA Technical Reports Server (NTRS)
Povitsky, A.
1998-01-01
In this research an efficient parallel algorithm for 3-D directionally split problems is developed. The proposed algorithm is based on a reformulated version of the pipelined Thomas algorithm that starts the backward step computations immediately after the completion of the forward step computations for the first portion of lines This algorithm has data available for other computational tasks while processors are idle from the Thomas algorithm. The proposed 3-D directionally split solver is based on the static scheduling of processors where local and non-local, data-dependent and data-independent computations are scheduled while processors are idle. A theoretical model of parallelization efficiency is used to define optimal parameters of the algorithm, to show an asymptotic parallelization penalty and to obtain an optimal cover of a global domain with subdomains. It is shown by computational experiments and by the theoretical model that the proposed algorithm reduces the parallelization penalty about two times over the basic algorithm for the range of the number of processors (subdomains) considered and the number of grid nodes per subdomain.
NASA Astrophysics Data System (ADS)
Jiang, Y.; Xing, H. L.
2016-12-01
Micro-seismic events induced by water injection, mining activity or oil/gas extraction are quite informative, the interpretation of which can be applied for the reconstruction of underground stress and monitoring of hydraulic fracturing progress in oil/gas reservoirs. The source characterises and locations are crucial parameters that required for these purposes, which can be obtained through the waveform matching inversion (WMI) method. Therefore it is imperative to develop a WMI algorithm with high accuracy and convergence speed. Heuristic algorithm, as a category of nonlinear method, possesses a very high convergence speed and good capacity to overcome local minimal values, and has been well applied for many areas (e.g. image processing, artificial intelligence). However, its effectiveness for micro-seismic WMI is still poorly investigated; very few literatures exits that addressing this subject. In this research an advanced heuristic algorithm, gravitational search algorithm (GSA) , is proposed to estimate the focal mechanism (angle of strike, dip and rake) and source locations in three dimension. Unlike traditional inversion methods, the heuristic algorithm inversion does not require the approximation of green function. The method directly interacts with a CPU parallelized finite difference forward modelling engine, and updating the model parameters under GSA criterions. The effectiveness of this method is tested with synthetic data form a multi-layered elastic model; the results indicate GSA can be well applied on WMI and has its unique advantages. Keywords: Micro-seismicity, Waveform matching inversion, gravitational search algorithm, parallel computation
NASA Technical Reports Server (NTRS)
Lipatov, A. S.; Farrell, W. M.; Cooper, J. F.; Sittler, E. C., Jr.; Hartle, R. E.
2015-01-01
The interactions between the solar wind and Moon-sized objects are determined by a set of the solar wind parameters and plasma environment of the space objects. The orientation of upstream magnetic field is one of the key factors which determines the formation and structure of bow shock wave/Mach cone or Alfven wing near the obstacle. The study of effects of the direction of the upstream magnetic field on lunar-like plasma environment is the main subject of our investigation in this paper. Photoionization, electron-impact ionization and charge exchange are included in our hybrid model. The computational model includes the self-consistent dynamics of the light (hydrogen (+), helium (+)) and heavy (sodium (+)) pickup ions. The lunar interior is considered as a weakly conducting body. Our previous 2013 lunar work, as reported in this journal, found formation of a triple structure of the Mach cone near the Moon in the case of perpendicular upstream magnetic field. Further advances in modeling now reveal the presence of strong wave activity in the upstream solar wind and plasma wake in the cases of quasiparallel and parallel upstream magnetic fields. However, little wave activity is found for the opposite case with a perpendicular upstream magnetic field. The modeling does not show a formation of the Mach cone in the case of theta(Sub B,U) approximately equal to 0 degrees.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yee, Seonghwan, E-mail: Seonghwan.Yee@Beaumont.edu; Gao, Jia-Hong
Purpose: To investigate whether the direction of spin-lock field, either parallel or antiparallel to the rotating magnetization, has any effect on the spin-lock MRI signal and further on the quantitative measurement of T1ρ, in a clinical 3 T MRI system. Methods: The effects of inverted spin-lock field direction were investigated by acquiring a series of spin-lock MRI signals for an American College of Radiology MRI phantom, while the spin-lock field direction was switched between the parallel and antiparallel directions. The acquisition was performed for different spin-locking methods (i.e., for the single- and dual-field spin-locking methods) and for different levels ofmore » clinically feasible spin-lock field strength, ranging from 100 to 500 Hz, while the spin-lock duration was varied in the range from 0 to 100 ms. Results: When the spin-lock field was inverted into the antiparallel direction, the rate of MRI signal decay was altered and the T1ρ value, when compared to the value for the parallel field, was clearly different. Different degrees of such direction-dependency were observed for different spin-lock field strengths. In addition, the dependency was much smaller when the parallel and the antiparallel fields are mixed together in the dual-field method. Conclusions: The spin-lock field direction could impact the MRI signal and further the T1ρ measurement in a clinical MRI system.« less
Tetreault, J.; Jones, C.H.; Erslev, E.; Larson, S.; Hudson, M.; Holdaway, S.
2008-01-01
Significant fold-axis-parallel slip is accommodated in the folded strata of the Grayback monocline, northeastern Front Range, Colorado, without visible large strike-slip displacement on the fold surface. In many cases, oblique-slip deformation is partitioned; fold-axis-normal slip is accommodated within folds, and fold-axis-parallel slip is resolved onto adjacent strike-slip faults. Unlike partitioning strike-parallel slip onto adjacent strike-slip faults, fold-axis-parallel slip has deformed the forelimb of the Grayback monocline. Mean compressive paleostress orientations in the forelimb are deflected 15??-37?? clockwise from the regional paleostress orientation of the northeastern Front Range. Paleomagnetic directions from the Permian Ingleside Formation in the forelimb are rotated 16??-42?? clockwise about a bedding-normal axis relative to the North American Permian reference direction. The paleostress and paleomagnetic rotations increase with the bedding dip angle and decrease along strike toward the fold tip. These measurements allow for 50-120 m of fold-axis-parallel slip within the forelimb, depending on the kinematics of strike-slip shear. This resolved horizontal slip is nearly equal in magnitude to the ???180 m vertical throw across the fold. For 200 m of oblique-slip displacement (120 m of strike slip and 180 m of reverse slip), the true shortening direction across the fold is N90??E, indistinguishable from the regionally inferred direction of N90??E and quite different from the S53??E fold-normal direction. Recognition of this deformational style means that significant amounts of strike slip can be accommodated within folds without axis-parallel surficial faulting. ?? 2008 Geological Society of America.
QR-decomposition based SENSE reconstruction using parallel architecture.
Ullah, Irfan; Nisar, Habab; Raza, Haseeb; Qasim, Malik; Inam, Omair; Omer, Hammad
2018-04-01
Magnetic Resonance Imaging (MRI) is a powerful medical imaging technique that provides essential clinical information about the human body. One major limitation of MRI is its long scan time. Implementation of advance MRI algorithms on a parallel architecture (to exploit inherent parallelism) has a great potential to reduce the scan time. Sensitivity Encoding (SENSE) is a Parallel Magnetic Resonance Imaging (pMRI) algorithm that utilizes receiver coil sensitivities to reconstruct MR images from the acquired under-sampled k-space data. At the heart of SENSE lies inversion of a rectangular encoding matrix. This work presents a novel implementation of GPU based SENSE algorithm, which employs QR decomposition for the inversion of the rectangular encoding matrix. For a fair comparison, the performance of the proposed GPU based SENSE reconstruction is evaluated against single and multicore CPU using openMP. Several experiments against various acceleration factors (AFs) are performed using multichannel (8, 12 and 30) phantom and in-vivo human head and cardiac datasets. Experimental results show that GPU significantly reduces the computation time of SENSE reconstruction as compared to multi-core CPU (approximately 12x speedup) and single-core CPU (approximately 53x speedup) without any degradation in the quality of the reconstructed images. Copyright © 2018 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Maneva, Yana; Poedts, Stefaan
2017-04-01
The electromagnetic fluctuations in the solar wind represent a zoo of plasma waves with different properties, whose wavelengths range from largest fluid scales to the smallest dissipation scales. By nature the power spectrum of the magnetic fluctuations is anisotropic with different spectral slopes in parallel and perpendicular directions with respect to the background magnetic field. Furthermore, the magnetic field power spectra steepen as one moves from the inertial to the dissipation range and we observe multiple spectral breaks with different slopes in parallel and perpendicular direction at the ion scales and beyond. The turbulent dissipation of magnetic field fluctuations at the sub-ion scales is believed to go into local ion heating and acceleration, so that the spectral breaks are typically associated with particle energization. The gained energy can be in the form of anisotropic heating, formation of non-thermal features in the particle velocity distributions functions, and redistribution of the differential acceleration between the different ion populations. To study the relation between the evolution of the anisotropic turbulent spectra and the particle heating at the ion and sub-ion scales we perform a series of 2.5D hybrid simulations in a collisionless drifting proton-alpha plasma. We neglect the fast electron dynamics and treat the electrons as an isothermal fluid electrons, whereas the protons and a minor population of alpha particles are evolved in a fully kinetic manner. We start with a given wave spectrum and study the evolution of the magnetic field spectral slopes as a function of the parallel and perpendicular wave¬numbers. Simultaneously, we track the particle response and the energy exchange between the parallel and perpendicular scales. We observe anisotropic behavior of the turbulent power spectra with steeper slopes along the dominant energy-containing direction. This means that for parallel and quasi-parallel waves we have steeper spectral slope in parallel direction, whereas for highly oblique waves the dissipation occurs predominantly in perpendicular direction and the spectral slopes are steeper across the background magnetic field. The value of the spectral slopes depends on the angle of propagation, the spectral range, as well as the plasma properties. In general the dissipation is stronger at small scales and the corresponding spectral slopes there are steeper. For parallel and quasi-parallel propagation the prevailing energy cascade remains along the magnetic field, whereas for initially isotropic oblique turbulence the cascade develops mainly in perpendicular direction.
Two-dimensional confinement of 3d{1} electrons in LaTiO_{3}/LaAlO{3} multilayers.
Seo, S S A; Han, M J; Hassink, G W J; Choi, W S; Moon, S J; Kim, J S; Susaki, T; Lee, Y S; Yu, J; Bernhard, C; Hwang, H Y; Rijnders, G; Blank, D H A; Keimer, B; Noh, T W
2010-01-22
We report spectroscopic ellipsometry measurements of the anisotropy of the interband transitions parallel and perpendicular to the planes of (LaTiO3)n(LaAlO3)5 multilayers with n=1-3. These provide direct information about the electronic structure of the two-dimensional (2D) 3d{1} state of the Ti ions. In combination with local density approximation, including a Hubbard U calculation, we suggest that 2D confinement in the TiO2 slabs lifts the degeneracy of the t{2g} states leaving only the planar d{xy} orbitals occupied. We outline that these multilayers can serve as a model system for the study of the t{2g} 2D Hubbard model.
On the application of under-decimated filter banks
NASA Technical Reports Server (NTRS)
Lin, Y.-P.; Vaidyanathan, P. P.
1994-01-01
Maximally decimated filter banks have been extensively studied in the past. A filter bank is said to be under-decimated if the number of channels is more than the decimation ratio in the subbands. A maximally decimated filter bank is well known for its application in subband coding. Another application of maximally decimated filter banks is in block filtering. Convolution through block filtering has the advantages that parallelism is increased and data are processed at a lower rate. However, the computational complexity is comparable to that of direct convolution. More recently, another type of filter bank convolver has been developed. In this scheme, the convolution is performed in the subbands. Quantization and bit allocation of subband signals are based on signal variance, as in subband coding. Consequently, for a fixed rate, the result of convolution is more accurate than is direct convolution. This type of filter bank convolver also enjoys the advantages of block filtering, parallelism, and a lower working rate. Nevertheless, like block filtering, there is no computational saving. In this article, under-decimated systems are introduced to solve the problem. The new system is decimated only by half the number of channels. Two types of filter banks can be used in the under-decimated system: the discrete Fourier transform (DFT) filter banks and the cosine modulated filter banks. They are well known for their low complexity. In both cases, the system is approximately alias free, and the overall response is equivalent to a tunable multilevel filter. Properties of the DFT filter banks and the cosine modulated filter banks can be exploited to simultaneously achieve parallelism, computational saving, and a lower working rate. Furthermore, for both systems, the implementation cost of the analysis or synthesis bank is comparable to that of one prototype filter plus some low-complexity modulation matrices. The individual analysis and synthesis filters have complex coefficients in the DFT filter banks but have real coefficients in the cosine modulated filter banks.
NASA Astrophysics Data System (ADS)
Mackowski, Daniel; Ramezanpour, Bahareh
2018-07-01
A formulation is developed for numerically solving the frequency domain Maxwell's equations in plane parallel layers of inhomogeneous media. As was done in a recent work [1], the plane parallel layer is modeled as an infinite square lattice of W × W × H unit cells, with W being a sample width of the layer and H the layer thickness. As opposed to the 3D volume integral/discrete dipole formulation, the derivation begins with a Fourier expansion of the electric field amplitude in the lateral plane, and leads to a coupled system of 1D ordinary differential equations in the depth direction of the layer. A 1D dyadic Green's function is derived for this system and used to construct a set of coupled 1D integral equations for the field expansion coefficients. The resulting mathematical formulation is considerably simpler and more compact than that derived, for the same system, using the discrete dipole approximation applied to the periodic plane lattice. Furthermore, the fundamental property variable appearing in the formulation is the Fourier transformed complex permittivity distribution in the unit cell, and the method obviates any need to define or calculate a dipole polarizability. Although designed primarily for random media calculations, the method is also capable of predicting the single scattering properties of individual particles; comparisons are presented to demonstrate that the method can accurately reproduce, at scattering angles not too close to 90°, the polarimetric scattering properties of single and multiple spheres. The derivation of the dyadic Green's function allows for an analytical preconditioning of the equations, and it is shown that this can result in significantly accelerated solution times when applied to densely-packed systems of particles. Calculation results demonstrate that the method, when applied to inhomogeneous media, can predict coherent backscattering and polarization opposition effects.
On the application of under-decimated filter banks
NASA Astrophysics Data System (ADS)
Lin, Y.-P.; Vaidyanathan, P. P.
1994-11-01
Maximally decimated filter banks have been extensively studied in the past. A filter bank is said to be under-decimated if the number of channels is more than the decimation ratio in the subbands. A maximally decimated filter bank is well known for its application in subband coding. Another application of maximally decimated filter banks is in block filtering. Convolution through block filtering has the advantages that parallelism is increased and data are processed at a lower rate. However, the computational complexity is comparable to that of direct convolution. More recently, another type of filter bank convolver has been developed. In this scheme, the convolution is performed in the subbands. Quantization and bit allocation of subband signals are based on signal variance, as in subband coding. Consequently, for a fixed rate, the result of convolution is more accurate than is direct convolution. This type of filter bank convolver also enjoys the advantages of block filtering, parallelism, and a lower working rate. Nevertheless, like block filtering, there is no computational saving. In this article, under-decimated systems are introduced to solve the problem. The new system is decimated only by half the number of channels. Two types of filter banks can be used in the under-decimated system: the discrete Fourier transform (DFT) filter banks and the cosine modulated filter banks. They are well known for their low complexity. In both cases, the system is approximately alias free, and the overall response is equivalent to a tunable multilevel filter. Properties of the DFT filter banks and the cosine modulated filter banks can be exploited to simultaneously achieve parallelism, computational saving, and a lower working rate.
Use Computer-Aided Tools to Parallelize Large CFD Applications
NASA Technical Reports Server (NTRS)
Jin, H.; Frumkin, M.; Yan, J.
2000-01-01
Porting applications to high performance parallel computers is always a challenging task. It is time consuming and costly. With rapid progressing in hardware architectures and increasing complexity of real applications in recent years, the problem becomes even more sever. Today, scalability and high performance are mostly involving handwritten parallel programs using message-passing libraries (e.g. MPI). However, this process is very difficult and often error-prone. The recent reemergence of shared memory parallel (SMP) architectures, such as the cache coherent Non-Uniform Memory Access (ccNUMA) architecture used in the SGI Origin 2000, show good prospects for scaling beyond hundreds of processors. Programming on an SMP is simplified by working in a globally accessible address space. The user can supply compiler directives, such as OpenMP, to parallelize the code. As an industry standard for portable implementation of parallel programs for SMPs, OpenMP is a set of compiler directives and callable runtime library routines that extend Fortran, C and C++ to express shared memory parallelism. It promises an incremental path for parallel conversion of existing software, as well as scalability and performance for a complete rewrite or an entirely new development. Perhaps the main disadvantage of programming with directives is that inserted directives may not necessarily enhance performance. In the worst cases, it can create erroneous results. While vendors have provided tools to perform error-checking and profiling, automation in directive insertion is very limited and often failed on large programs, primarily due to the lack of a thorough enough data dependence analysis. To overcome the deficiency, we have developed a toolkit, CAPO, to automatically insert OpenMP directives in Fortran programs and apply certain degrees of optimization. CAPO is aimed at taking advantage of detailed inter-procedural dependence analysis provided by CAPTools, developed by the University of Greenwich, to reduce potential errors made by users. Earlier tests on NAS Benchmarks and ARC3D have demonstrated good success of this tool. In this study, we have applied CAPO to parallelize three large applications in the area of computational fluid dynamics (CFD): OVERFLOW, TLNS3D and INS3D. These codes are widely used for solving Navier-Stokes equations with complicated boundary conditions and turbulence model in multiple zones. Each one comprises of from 50K to 1,00k lines of FORTRAN77. As an example, CAPO took 77 hours to complete the data dependence analysis of OVERFLOW on a workstation (SGI, 175MHz, R10K processor). A fair amount of effort was spent on correcting false dependencies due to lack of necessary knowledge during the analysis. Even so, CAPO provides an easy way for user to interact with the parallelization process. The OpenMP version was generated within a day after the analysis was completed. Due to sequential algorithms involved, code sections in TLNS3D and INS3D need to be restructured by hand to produce more efficient parallel codes. An included figure shows preliminary test results of the generated OVERFLOW with several test cases in single zone. The MPI data points for the small test case were taken from a handcoded MPI version. As we can see, CAPO's version has achieved 18 fold speed up on 32 nodes of the SGI O2K. For the small test case, it outperformed the MPI version. These results are very encouraging, but further work is needed. For example, although CAPO attempts to place directives on the outer- most parallel loops in an interprocedural framework, it does not insert directives based on the best manual strategy. In particular, it lacks the support of parallelization at the multi-zone level. Future work will emphasize on the development of methodology to work in a multi-zone level and with a hybrid approach. Development of tools to perform more complicated code transformation is also needed.
ERIC Educational Resources Information Center
Gil, Arturo; Peidró, Adrián; Reinoso, Óscar; Marín, José María
2017-01-01
This paper presents a tool, LABEL, oriented to the teaching of parallel robotics. The application, organized as a set of tools developed using Easy Java Simulations, enables the study of the kinematics of parallel robotics. A set of classical parallel structures was implemented such that LABEL can solve the inverse and direct kinematic problem of…
Parallel aeroelastic computations for wing and wing-body configurations
NASA Technical Reports Server (NTRS)
Byun, Chansup
1994-01-01
The objective of this research is to develop computationally efficient methods for solving fluid-structural interaction problems by directly coupling finite difference Euler/Navier-Stokes equations for fluids and finite element dynamics equations for structures on parallel computers. This capability will significantly impact many aerospace projects of national importance such as Advanced Subsonic Civil Transport (ASCT), where the structural stability margin becomes very critical at the transonic region. This research effort will have direct impact on the High Performance Computing and Communication (HPCC) Program of NASA in the area of parallel computing.
Ion acceleration and heating by kinetic Alfvén waves associated with magnetic reconnection
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liang, Ji; Lin, Yu; Johnson, Jay R.
In a previous study on the generation and signatures of kinetic Alfv en waves (KAWs) associated with magnetic reconnection in a current sheet revealed that KAWs are a common feature during reconnection [Liang et al. J. Geophys. Res.: Space Phys. 121, 6526 (2016)]. In this paper, ion acceleration and heating by the KAWs generated during magnetic reconnection are investigated with a three-dimensional (3-D) hybrid model. It is found that in the outflow region, a fraction of inflow ions are accelerated by the KAWs generated in the leading bulge region of reconnection, and their parallel velocities gradually increase up to slightly super-Alfv enic. As a result of waveparticle interactions, an accelerated ion beam forms in the direction of the anti-parallel magnetic field, in addition to the core ion population, leading to the development of non-Maxwellian velocity distributions, which include a trapped population with parallel velocities consistent with the wave speed. We then heat ions in both parallel and perpendicular directions. In the parallel direction, the heating results from nonlinear Landau resonance of trapped ions. In the perpendicular direction, however, evidence of stochastic heating by the KAWs is found during the acceleration stage, with an increase of magnetic moment μ. The coherence in the T more » $$\\perp$$ ion temperature and the perpendicular electric and magnetic fields of KAWs also provides evidence for perpendicular heating by KAWs. The parallel and perpendicular heating of the accelerated beam occur simultaneously, leading to the development of temperature anisotropy with the perpendicular temperature T $$\\perp$$>T $$\\parallel$$ temperature. The heating rate agrees with the damping rate of the KAWs, and the heating is dominated by the accelerated ion beam. In the later stage, with the increase of the fraction of the accelerated ions, interaction between the accelerated beam and the core population also contributes to the ion heating, ultimately leading to overlap of the beams and an overall anisotropy with T $$\\perp$$>T $$\\parallel$$.« less
Ion acceleration and heating by kinetic Alfvén waves associated with magnetic reconnection
Liang, Ji; Lin, Yu; Johnson, Jay R.; ...
2017-09-19
In a previous study on the generation and signatures of kinetic Alfv en waves (KAWs) associated with magnetic reconnection in a current sheet revealed that KAWs are a common feature during reconnection [Liang et al. J. Geophys. Res.: Space Phys. 121, 6526 (2016)]. In this paper, ion acceleration and heating by the KAWs generated during magnetic reconnection are investigated with a three-dimensional (3-D) hybrid model. It is found that in the outflow region, a fraction of inflow ions are accelerated by the KAWs generated in the leading bulge region of reconnection, and their parallel velocities gradually increase up to slightly super-Alfv enic. As a result of waveparticle interactions, an accelerated ion beam forms in the direction of the anti-parallel magnetic field, in addition to the core ion population, leading to the development of non-Maxwellian velocity distributions, which include a trapped population with parallel velocities consistent with the wave speed. We then heat ions in both parallel and perpendicular directions. In the parallel direction, the heating results from nonlinear Landau resonance of trapped ions. In the perpendicular direction, however, evidence of stochastic heating by the KAWs is found during the acceleration stage, with an increase of magnetic moment μ. The coherence in the T more » $$\\perp$$ ion temperature and the perpendicular electric and magnetic fields of KAWs also provides evidence for perpendicular heating by KAWs. The parallel and perpendicular heating of the accelerated beam occur simultaneously, leading to the development of temperature anisotropy with the perpendicular temperature T $$\\perp$$>T $$\\parallel$$ temperature. The heating rate agrees with the damping rate of the KAWs, and the heating is dominated by the accelerated ion beam. In the later stage, with the increase of the fraction of the accelerated ions, interaction between the accelerated beam and the core population also contributes to the ion heating, ultimately leading to overlap of the beams and an overall anisotropy with T $$\\perp$$>T $$\\parallel$$.« less
Search asymmetries: parallel processing of uncertain sensory information.
Vincent, Benjamin T
2011-08-01
What is the mechanism underlying search phenomena such as search asymmetry? Two-stage models such as Feature Integration Theory and Guided Search propose parallel pre-attentive processing followed by serial post-attentive processing. They claim search asymmetry effects are indicative of finding pairs of features, one processed in parallel, the other in serial. An alternative proposal is that a 1-stage parallel process is responsible, and search asymmetries occur when one stimulus has greater internal uncertainty associated with it than another. While the latter account is simpler, only a few studies have set out to empirically test its quantitative predictions, and many researchers still subscribe to the 2-stage account. This paper examines three separate parallel models (Bayesian optimal observer, max rule, and a heuristic decision rule). All three parallel models can account for search asymmetry effects and I conclude that either people can optimally utilise the uncertain sensory data available to them, or are able to select heuristic decision rules which approximate optimal performance. Copyright © 2011 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Wichert, Viktoria; Arkenberg, Mario; Hauschildt, Peter H.
2016-10-01
Highly resolved state-of-the-art 3D atmosphere simulations will remain computationally extremely expensive for years to come. In addition to the need for more computing power, rethinking coding practices is necessary. We take a dual approach by introducing especially adapted, parallel numerical methods and correspondingly parallelizing critical code passages. In the following, we present our respective work on PHOENIX/3D. With new parallel numerical algorithms, there is a big opportunity for improvement when iteratively solving the system of equations emerging from the operator splitting of the radiative transfer equation J = ΛS. The narrow-banded approximate Λ-operator Λ* , which is used in PHOENIX/3D, occurs in each iteration step. By implementing a numerical algorithm which takes advantage of its characteristic traits, the parallel code's efficiency is further increased and a speed-up in computational time can be achieved.
NASA Technical Reports Server (NTRS)
Ayguade, Eduard; Gonzalez, Marc; Martorell, Xavier; Jost, Gabriele
2004-01-01
In this paper we describe the parallelization of the multi-zone code versions of the NAS Parallel Benchmarks employing multi-level OpenMP parallelism. For our study we use the NanosCompiler, which supports nesting of OpenMP directives and provides clauses to control the grouping of threads, load balancing, and synchronization. We report the benchmark results, compare the timings with those of different hybrid parallelization paradigms and discuss OpenMP implementation issues which effect the performance of multi-level parallel applications.
[The parallelisms in of sound signal of domestic sheep and Northern fur seals].
Nikol'skiĭ, A A; Lisitsina, T Iu
2011-01-01
The parallelisms in communicative behavior of domestic sheep and Northern fur seals within a herd are accompanied by parallelisms in parameters of sound signal, the calling scream. This signal ensures ties between babies and their mothers at a long distance. The basis of parallelisms is formed by amplitude modulation at two levels: the one being a direct amplitude modulation of the carrier frequency and the other--modulation of the carrier frequency oscillation. Parallelisms in the signal oscillatory process result in corresponding parallelisms in the structure of its frequency spectrum.
Parallel adaptive wavelet collocation method for PDEs
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nejadmalayeri, Alireza, E-mail: Alireza.Nejadmalayeri@gmail.com; Vezolainen, Alexei, E-mail: Alexei.Vezolainen@Colorado.edu; Brown-Dymkoski, Eric, E-mail: Eric.Browndymkoski@Colorado.edu
2015-10-01
A parallel adaptive wavelet collocation method for solving a large class of Partial Differential Equations is presented. The parallelization is achieved by developing an asynchronous parallel wavelet transform, which allows one to perform parallel wavelet transform and derivative calculations with only one data synchronization at the highest level of resolution. The data are stored using tree-like structure with tree roots starting at a priori defined level of resolution. Both static and dynamic domain partitioning approaches are developed. For the dynamic domain partitioning, trees are considered to be the minimum quanta of data to be migrated between the processes. This allowsmore » fully automated and efficient handling of non-simply connected partitioning of a computational domain. Dynamic load balancing is achieved via domain repartitioning during the grid adaptation step and reassigning trees to the appropriate processes to ensure approximately the same number of grid points on each process. The parallel efficiency of the approach is discussed based on parallel adaptive wavelet-based Coherent Vortex Simulations of homogeneous turbulence with linear forcing at effective non-adaptive resolutions up to 2048{sup 3} using as many as 2048 CPU cores.« less
Particle-in-cell simulations of the critical ionization velocity effect in finite size clouds
NASA Technical Reports Server (NTRS)
Moghaddam-Taaheri, E.; Lu, G.; Goertz, C. K.; Nishikawa, K. - I.
1994-01-01
The critical ionization velocity (CIV) mechanism in a finite size cloud is studied with a series of electrostatic particle-in-cell simulations. It is observed that an initial seed ionization, produced by non-CIV mechanisms, generates a cross-field ion beam which excites a modified beam-plasma instability (MBPI) with frequency in the range of the lower hybrid frequency. The excited waves accelerate electrons along the magnetic field up to the ion drift energy that exceeds the ionization energy of the neutral atoms. The heated electrons in turn enhance the ion beam by electron-neutral impact ionization, which establishes a positive feedback loop in maintaining the CIV process. It is also found that the efficiency of the CIV mechanism depends on the finite size of the gas cloud in the following ways: (1) Along the ambient magnetic field the finite size of the cloud, L (sub parallel), restricts the growth of the fastest growing mode, with a wavelength lambda (sub m parallel), of the MBPI. The parallel electron heating at wave saturation scales approximately as (L (sub parallel)/lambda (sub m parallel)) (exp 1/2); (2) Momentum coupling between the cloud and the ambient plasma via the Alfven waves occurs as a result of the finite size of the cloud in the direction perpendicular to both the ambient magnetic field and the neutral drift. This reduces exponentially with time the relative drift between the ambient plasma and the neutrals. The timescale is inversely proportional to the Alfven velocity. (3) The transvers e charge separation field across the cloud was found to result in the modulation of the beam velocity which reduces the parallel heating of electrons and increases the transverse acceleration of electrons. (4) Some energetic electrons are lost from the cloud along the magnetic field at a rate characterized by the acoustic velocity, instead of the electron thermal velocity. The loss of energetic electrons from the cloud seems to be larger in the direction of plasma drift relative to the neutrals, where the loss rate is characterized by the neutral drift velocity. It is also shown that a factor of 4 increase in the ambient plasma density, increases the CIV ionization yield by almost 2 orders of magnitude at the end of a typical run. It is concluded that a larger ambient plasma density can result in a larger CIV yield because of (1) larger seed ion production by non-CIV mechanisms, (2) smaller Alfven velocity and hence weak momentum coupling, and (3) smaller ratio of the ion beam density to the ambient ion density, and therefore a weaker modulation of the beam velocity. The simulation results are used to interpret various chemical release experiments in space.
Low-Rank Correction Methods for Algebraic Domain Decomposition Preconditioners
Li, Ruipeng; Saad, Yousef
2017-08-01
This study presents a parallel preconditioning method for distributed sparse linear systems, based on an approximate inverse of the original matrix, that adopts a general framework of distributed sparse matrices and exploits domain decomposition (DD) and low-rank corrections. The DD approach decouples the matrix and, once inverted, a low-rank approximation is applied by exploiting the Sherman--Morrison--Woodbury formula, which yields two variants of the preconditioning methods. The low-rank expansion is computed by the Lanczos procedure with reorthogonalizations. Numerical experiments indicate that, when combined with Krylov subspace accelerators, this preconditioner can be efficient and robust for solving symmetric sparse linear systems. Comparisonsmore » with pARMS, a DD-based parallel incomplete LU (ILU) preconditioning method, are presented for solving Poisson's equation and linear elasticity problems.« less
Low-Rank Correction Methods for Algebraic Domain Decomposition Preconditioners
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Ruipeng; Saad, Yousef
This study presents a parallel preconditioning method for distributed sparse linear systems, based on an approximate inverse of the original matrix, that adopts a general framework of distributed sparse matrices and exploits domain decomposition (DD) and low-rank corrections. The DD approach decouples the matrix and, once inverted, a low-rank approximation is applied by exploiting the Sherman--Morrison--Woodbury formula, which yields two variants of the preconditioning methods. The low-rank expansion is computed by the Lanczos procedure with reorthogonalizations. Numerical experiments indicate that, when combined with Krylov subspace accelerators, this preconditioner can be efficient and robust for solving symmetric sparse linear systems. Comparisonsmore » with pARMS, a DD-based parallel incomplete LU (ILU) preconditioning method, are presented for solving Poisson's equation and linear elasticity problems.« less
Database Reorganization in Parallel Disk Arrays with I/O Service Stealing
NASA Technical Reports Server (NTRS)
Zabback, Peter; Onyuksel, Ibrahim; Scheuermann, Peter; Weikum, Gerhard
1996-01-01
We present a model for data reorganization in parallel disk systems that is geared towards load balancing in an environment with periodic access patterns. Data reorganization is performed by disk cooling, i.e. migrating files or extents from the hottest disks to the coldest ones. We develop an approximate queueing model for determining the effective arrival rates of cooling requests and discuss its use in assessing the costs versus benefits of cooling.
On the dimensionally correct kinetic theory of turbulence for parallel propagation
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gaelzer, R., E-mail: rudi.gaelzer@ufrgs.br, E-mail: yoonp@umd.edu, E-mail: 007gasun@khu.ac.kr, E-mail: luiz.ziebell@ufrgs.br; Ziebell, L. F., E-mail: rudi.gaelzer@ufrgs.br, E-mail: yoonp@umd.edu, E-mail: 007gasun@khu.ac.kr, E-mail: luiz.ziebell@ufrgs.br; Yoon, P. H., E-mail: rudi.gaelzer@ufrgs.br, E-mail: yoonp@umd.edu, E-mail: 007gasun@khu.ac.kr, E-mail: luiz.ziebell@ufrgs.br
2015-03-15
Yoon and Fang [Phys. Plasmas 15, 122312 (2008)] formulated a second-order nonlinear kinetic theory that describes the turbulence propagating in directions parallel/anti-parallel to the ambient magnetic field. Their theory also includes discrete-particle effects, or the effects due to spontaneously emitted thermal fluctuations. However, terms associated with the spontaneous fluctuations in particle and wave kinetic equations in their theory contain proper dimensionality only for an artificial one-dimensional situation. The present paper extends the analysis and re-derives the dimensionally correct kinetic equations for three-dimensional case. The new formalism properly describes the effects of spontaneous fluctuations emitted in three-dimensional space, while the collectivelymore » emitted turbulence propagates predominantly in directions parallel/anti-parallel to the ambient magnetic field. As a first step, the present investigation focuses on linear wave-particle interaction terms only. A subsequent paper will include the dimensionally correct nonlinear wave-particle interaction terms.« less
Inelastic Strain and Damage in Surface Instability Tests
NASA Astrophysics Data System (ADS)
Kao, Chu-Shu; Tarokh, Ali; Biolzi, Luigi; Labuz, Joseph F.
2016-02-01
Spalling near a free surface in laboratory experiments on two sandstones was characterized using acoustic emission and digital image correlation. A surface instability apparatus was used to reproduce a state of plane strain near a free surface in a modeled semi-infinite medium subjected to far-field compressive stress. Comparison between AE locations and crack trajectory mapped after the test showed good consistency. Digital image correlation was used to find the displacements in directions parallel (axial direction) and perpendicular (lateral direction) to the free surface at various stages of loading. At a load ratio, LR = current load/peak load, of approximately 30 %, elastic deformation was measured. At 70-80 % LR, the free-face effect started to appear in the displacement contours, especially for the lateral displacement measurements. As the axial compressive stress increased close to peak, extensional lateral strain started to show concentrations associated with localized damage. Continuum damage mechanics was used to describe damage evolution in the surface instability test, and it was shown that a critical value of extensional inelastic strain, on the order of -10-3 for the virgin sandstones, may provide an indicator for determining the onset of surface spalling.
Benkert, Thomas; Tian, Ye; Huang, Chenchan; DiBella, Edward V R; Chandarana, Hersh; Feng, Li
2018-07-01
Golden-angle radial sparse parallel (GRASP) MRI reconstruction requires gridding and regridding to transform data between radial and Cartesian k-space. These operations are repeatedly performed in each iteration, which makes the reconstruction computationally demanding. This work aimed to accelerate GRASP reconstruction using self-calibrating GRAPPA operator gridding (GROG) and to validate its performance in clinical imaging. GROG is an alternative gridding approach based on parallel imaging, in which k-space data acquired on a non-Cartesian grid are shifted onto a Cartesian k-space grid using information from multicoil arrays. For iterative non-Cartesian image reconstruction, GROG is performed only once as a preprocessing step. Therefore, the subsequent iterative reconstruction can be performed directly in Cartesian space, which significantly reduces computational burden. Here, a framework combining GROG with GRASP (GROG-GRASP) is first optimized and then compared with standard GRASP reconstruction in 22 prostate patients. GROG-GRASP achieved approximately 4.2-fold reduction in reconstruction time compared with GRASP (∼333 min versus ∼78 min) while maintaining image quality (structural similarity index ≈ 0.97 and root mean square error ≈ 0.007). Visual image quality assessment by two experienced radiologists did not show significant differences between the two reconstruction schemes. With a graphics processing unit implementation, image reconstruction time can be further reduced to approximately 14 min. The GRASP reconstruction can be substantially accelerated using GROG. This framework is promising toward broader clinical application of GRASP and other iterative non-Cartesian reconstruction methods. Magn Reson Med 80:286-293, 2018. © 2017 International Society for Magnetic Resonance in Medicine. © 2017 International Society for Magnetic Resonance in Medicine.
Spencer, J.E.
2000-01-01
The corrugated form of the Harcuvar, South Mountains, and Catalina metamorphic core complexes in Arizona reflects the shape of the middle Tertiary extensional detachment fault that projects over each complex. Corrugation axes are approximately parallel to the fault-displacement direction and to the footwall mylonitic lineation. The core complexes are locally incised by enigmatic, linear drainages that parallel corrugation axes and the inferred extension direction and are especially conspicuous on the crests of antiformal corrugations. These drainages have been attributed to erosional incision on a freshly denuded, planar, inclined fault ramp followed by folding that elevated and preserved some drainages on the crests of rising antiforms. According to this hypothesis, corrugations were produced by folding after subacrial exposure of detachment-fault foot-walls. An alternative hypothesis, proposed here, is as follows. In a setting where preexisting drainages cross an active normal fault, each fault-slip event will cut each drainage into two segments separated by a freshly denuded fault ramp. The upper and lower drainage segments will remain hydraulically linked after each fault-slip event if the drainage in the hanging-wall block is incised, even if the stream is on the flank of an antiformal corrugation and there is a large component of strike-slip fault movement. Maintenance of hydraulic linkage during sequential fault-slip events will guide the lengthening stream down the fault ramp as the ramp is uncovered, and stream incision will form a progressively lengthening, extension-parallel, linear drainage segment. This mechanism for linear drainage genesis is compatible with corrugations as original irregularities of the detachment fault, and does not require folding after early to middle Miocene footwall exhumations. This is desirable because many drainages are incised into nonmylonitic crystalline footwall rocks that were probably not folded under low-temperature, surface conditions. An alternative hypothesis, that drainages were localized by small fault grooves as footwalls were uncovered, is not supported by analysis of a down-plunge fault projection for the southern Rincon Mountains that shows a linear drainage aligned with the crest of a small antiformal groove on the detachment fault, but this process could have been effective elsewhere. Lineation-parallel drainages now plunge gently southwestward on the southwest ends of antiformal corrugations in the South and Buckskin Mountains, but these drainages must have originally plunged northeastward if they formed by either of the two alternative processes proposed here. Footwall exhumation and incision by northeast-flowing streams was apparently followed by core-complex arching and drainage reversal.
NASA Astrophysics Data System (ADS)
Sun, Jicheng; Gao, Xinliang; Lu, Quanming; Chen, Lunjin; Liu, Xu; Wang, Xueyi; Tao, Xin; Wang, Shui
2017-05-01
In this paper, we perform a 1-D particle-in-cell (PIC) simulation model consisting of three species, cold electrons, cold ions, and energetic ion ring, to investigate spectral structures of magnetosonic waves excited by ring distribution protons in the Earth's magnetosphere, and dynamics of charged particles during the excitation of magnetosonic waves. As the wave normal angle decreases, the spectral range of excited magnetosonic waves becomes broader with upper frequency limit extending beyond the lower hybrid resonant frequency, and the discrete spectra tends to merge into a continuous one. This dependence on wave normal angle is consistent with the linear theory. The effects of magnetosonic waves on the background cold plasma populations also vary with wave normal angle. For exactly perpendicular magnetosonic waves (parallel wave number k|| = 0), there is no energization in the parallel direction for both background cold protons and electrons due to the negligible fluctuating electric field component in the parallel direction. In contrast, the perpendicular energization of background plasmas is rather significant, where cold protons follow unmagnetized motion while cold electrons follow drift motion due to wave electric fields. For magnetosonic waves with a finite k||, there exists a nonnegligible parallel fluctuating electric field, leading to a significant and rapid energization in the parallel direction for cold electrons. These cold electrons can also be efficiently energized in the perpendicular direction due to the interaction with the magnetosonic wave fields in the perpendicular direction. However, cold protons can be only heated in the perpendicular direction, which is likely caused by the higher-order resonances with magnetosonic waves. The potential impacts of magnetosonic waves on the energization of the background cold plasmas in the Earth's inner magnetosphere are also discussed in this paper.
NASA Astrophysics Data System (ADS)
Wang, Xin; Tu, Chuanyi; Marsch, Eckart; He, Jiansen; Wang, Linghua
2016-01-01
Turbulence in the solar wind was recently reported to be anisotropic, with the average power spectral index close to -2 when sampling parallel to the local mean magnetic field B0 and close to -5/3 when sampling perpendicular to the local B0. This result was widely considered to be observational evidence for the critical balance theory (CBT), which is derived by making the assumption that the turbulence strength is close to one. However, this basic assumption has not yet been checked carefully with observational data. Here we present for the first time the scale-dependent magnetic-field fluctuation amplitude, which is normalized by the local B0 and evaluated for both parallel and perpendicular sampling directions, using two 30-day intervals of Ulysses data. From our results, the turbulence strength is evaluated as much less than one at small scales in the parallel direction. An even stricter criterion is imposed when selecting the wavelet coefficients for a given sampling direction, so that the time stationarity of the local B0 is better ensured during the local sampling interval. The spectral index for the parallel direction is then found to be -1.75, whereas the spectral index in the perpendicular direction remains close to -1.65. These two new results, namely that the value of the turbulence strength is much less than one in the parallel direction and that the angle dependence of the spectral index is weak, cannot be explained by existing turbulence theories, like CBT, and thus will require new theoretical considerations and promote further observations of solar-wind turbulence.
Automatic Alignment of Displacement-Measuring Interferometer
NASA Technical Reports Server (NTRS)
Halverson, Peter; Regehr, Martin; Spero, Robert; Alvarez-Salazar, Oscar; Loya, Frank; Logan, Jennifer
2006-01-01
A control system strives to maintain the correct alignment of a laser beam in an interferometer dedicated to measuring the displacement or distance between two fiducial corner-cube reflectors. The correct alignment of the laser beam is parallel to the line between the corner points of the corner-cube reflectors: Any deviation from parallelism changes the length of the optical path between the reflectors, thereby introducing a displacement or distance measurement error. On the basis of the geometrical optics of corner-cube reflectors, the length of the optical path can be shown to be L = L(sub 0)cos theta, where L(sub 0) is the distance between the corner points and theta is the misalignment angle. Therefore, the measurement error is given by DeltaL = L(sub 0)(cos theta - 1). In the usual case in which the misalignment is small, this error can be approximated as DeltaL approximately equal to -L(sub 0)theta sup 2/2. The control system (see figure) is implemented partly in hardware and partly in software. The control system includes three piezoelectric actuators for rapid, fine adjustment of the direction of the laser beam. The voltages applied to the piezoelectric actuators include components designed to scan the beam in a circular pattern so that the beam traces out a narrow cone (60 microradians wide in the initial application) about the direction in which it is nominally aimed. This scan is performed at a frequency (2.5 Hz in the initial application) well below the resonance frequency of any vibration of the interferometer. The laser beam makes a round trip to both corner-cube reflectors and then interferes with the launched beam. The interference is detected on a photodiode. The length of the optical path is measured by a heterodyne technique: A 100- kHz frequency shift between the launched beam and a reference beam imposes, on the detected signal, an interferometric phase shift proportional to the length of the optical path. A phase meter comprising analog filters and specialized digital circuitry converts the phase shift to an indication of displacement, generating a digital signal proportional to the path length.
Wakefield Computations for the CLIC PETS using the Parallel Finite Element Time-Domain Code T3P
DOE Office of Scientific and Technical Information (OSTI.GOV)
Candel, A; Kabel, A.; Lee, L.
In recent years, SLAC's Advanced Computations Department (ACD) has developed the high-performance parallel 3D electromagnetic time-domain code, T3P, for simulations of wakefields and transients in complex accelerator structures. T3P is based on advanced higher-order Finite Element methods on unstructured grids with quadratic surface approximation. Optimized for large-scale parallel processing on leadership supercomputing facilities, T3P allows simulations of realistic 3D structures with unprecedented accuracy, aiding the design of the next generation of accelerator facilities. Applications to the Compact Linear Collider (CLIC) Power Extraction and Transfer Structure (PETS) are presented.
NASA Astrophysics Data System (ADS)
Macomber, B.; Woollands, R. M.; Probe, A.; Younes, A.; Bai, X.; Junkins, J.
2013-09-01
Modified Chebyshev Picard Iteration (MCPI) is an iterative numerical method for approximating solutions of linear or non-linear Ordinary Differential Equations (ODEs) to obtain time histories of system state trajectories. Unlike other step-by-step differential equation solvers, the Runge-Kutta family of numerical integrators for example, MCPI approximates long arcs of the state trajectory with an iterative path approximation approach, and is ideally suited to parallel computation. Orthogonal Chebyshev Polynomials are used as basis functions during each path iteration; the integrations of the Picard iteration are then done analytically. Due to the orthogonality of the Chebyshev basis functions, the least square approximations are computed without matrix inversion; the coefficients are computed robustly from discrete inner products. As a consequence of discrete sampling and weighting adopted for the inner product definition, Runge phenomena errors are minimized near the ends of the approximation intervals. The MCPI algorithm utilizes a vector-matrix framework for computational efficiency. Additionally, all Chebyshev coefficients and integrand function evaluations are independent, meaning they can be simultaneously computed in parallel for further decreased computational cost. Over an order of magnitude speedup from traditional methods is achieved in serial processing, and an additional order of magnitude is achievable in parallel architectures. This paper presents a new MCPI library, a modular toolset designed to allow MCPI to be easily applied to a wide variety of ODE systems. Library users will not have to concern themselves with the underlying mathematics behind the MCPI method. Inputs are the boundary conditions of the dynamical system, the integrand function governing system behavior, and the desired time interval of integration, and the output is a time history of the system states over the interval of interest. Examples from the field of astrodynamics are presented to compare the output from the MCPI library to current state-of-practice numerical integration methods. It is shown that MCPI is capable of out-performing the state-of-practice in terms of computational cost and accuracy.
A direct-execution parallel architecture for the Advanced Continuous Simulation Language (ACSL)
NASA Technical Reports Server (NTRS)
Carroll, Chester C.; Owen, Jeffrey E.
1988-01-01
A direct-execution parallel architecture for the Advanced Continuous Simulation Language (ACSL) is presented which overcomes the traditional disadvantages of simulations executed on a digital computer. The incorporation of parallel processing allows the mapping of simulations into a digital computer to be done in the same inherently parallel manner as they are currently mapped onto an analog computer. The direct-execution format maximizes the efficiency of the executed code since the need for a high level language compiler is eliminated. Resolution is greatly increased over that which is available with an analog computer without the sacrifice in execution speed normally expected with digitial computer simulations. Although this report covers all aspects of the new architecture, key emphasis is placed on the processing element configuration and the microprogramming of the ACLS constructs. The execution times for all ACLS constructs are computed using a model of a processing element based on the AMD 29000 CPU and the AMD 29027 FPU. The increase in execution speed provided by parallel processing is exemplified by comparing the derived execution times of two ACSL programs with the execution times for the same programs executed on a similar sequential architecture.
Ultrasonically-assisted Thermal Stir Welding System
NASA Technical Reports Server (NTRS)
Ding, R. Jeffrey (Inventor)
2014-01-01
A welding head assembly has a work piece disposed between its containment plates' opposing surfaces with the work piece being maintained in a plastic state thereof at least in a vicinity of the welding head assembly's stir rod as the rod is rotated about its longitudinal axis. The welding head assembly and the work piece experience relative movement there between in a direction perpendicular to the rod's longitudinal axis as the work piece is subjected to a compressive force applied by the containment plates. A first source coupled to the first containment plate applies a first ultrasonic wave thereto such that the first ultrasonic wave propagates parallel to the direction of relative movement. A second source coupled to the second containment plate applies a second ultrasonic wave thereto such that the second ultrasonic wave propagates parallel to the direction of relative movement.propagates parallel to the direction of relative movement.
NASA Technical Reports Server (NTRS)
Saini, Subhash; Frumkin, Michael; Hribar, Michelle; Jin, Hao-Qiang; Waheed, Abdul; Yan, Jerry
1998-01-01
Porting applications to new high performance parallel and distributed computing platforms is a challenging task. Since writing parallel code by hand is extremely time consuming and costly, porting codes would ideally be automated by using some parallelization tools and compilers. In this paper, we compare the performance of the hand written NAB Parallel Benchmarks against three parallel versions generated with the help of tools and compilers: 1) CAPTools: an interactive computer aided parallelization too] that generates message passing code, 2) the Portland Group's HPF compiler and 3) using compiler directives with the native FORTAN77 compiler on the SGI Origin2000.
Evaluation of Orientation Performance of Attention Patterns for Blind Person.
Fujisawa, Shoichiro; Ishibashi, Tatsuki; Sato, Katsuya; Ito, Sin-Ichi; Sueda, Osamu
2017-01-01
Tactile walking surface indicators (TWSIs) are installed on footpath to support independent travel for the blind. There are two types of TWSIs, attention patterns and guiding patterns. The attention pattern is usually installed at the crosswalk entrances. The direction of the crossing can be acquired by the row of the projection of the attention pattern through the soles of the shoes. In addition, truncated domes or cones of the attention pattern were arranged in a square grid, parallel or diagonal at 45 degrees to the principal direction of travel. However, the international standard organization (ISO) allows a wide-ranging size. In this research, the direction indicating performance was compared at the same intervals for the five diameters specified by the international standard. As a result of the experiment, the diagonal array does not indicate the direction of travel, but the projection row does indicate the direction of travel in the parallel array. When the attention pattern is installed at a crosswalk entrance, a parallel array should be installed in the direction of the crossing.
Efficient Parallel Algorithm For Direct Numerical Simulation of Turbulent Flows
NASA Technical Reports Server (NTRS)
Moitra, Stuti; Gatski, Thomas B.
1997-01-01
A distributed algorithm for a high-order-accurate finite-difference approach to the direct numerical simulation (DNS) of transition and turbulence in compressible flows is described. This work has two major objectives. The first objective is to demonstrate that parallel and distributed-memory machines can be successfully and efficiently used to solve computationally intensive and input/output intensive algorithms of the DNS class. The second objective is to show that the computational complexity involved in solving the tridiagonal systems inherent in the DNS algorithm can be reduced by algorithm innovations that obviate the need to use a parallelized tridiagonal solver.
Two-axis magnetic field sensor
NASA Technical Reports Server (NTRS)
Smith, Carl H. (Inventor); Nordman, Catherine A. (Inventor); Jander, Albrecht (Inventor); Qian, Zhenghong (Inventor)
2006-01-01
A ferromagnetic thin-film based magnetic field sensor with first and second sensitive direction sensing structures each having a nonmagnetic intermediate layer with two major surfaces on opposite sides thereof having a magnetization reference layer on one and an anisotropic ferromagnetic material sensing layer on the other having a length in a selected length direction and a smaller width perpendicular thereto and parallel to the relatively fixed magnetization direction. The relatively fixed magnetization direction of said magnetization reference layer in each is oriented in substantially parallel to the substrate but substantially perpendicular to that of the other. An annealing process is used to form the desired magnetization directions.
NASA Astrophysics Data System (ADS)
Macek, Wiesław M.; Wawrzaszek, Anna; Kucharuk, Beata
2018-01-01
Turbulence is complex behavior that is ubiquitous in space, including the environments of the heliosphere and the magnetosphere. Our studies on solar wind turbulence including the heliosheath, and even at the heliospheric boundaries, also beyond the ecliptic plane, have shown that turbulence is intermittent in the entire heliosphere. As is known, turbulence in space plasmas often exhibits substantial deviations from normal Gaussian distributions. Therefore, we analyze the fluctuations of plasma and magnetic field parameters also in the magnetosheath behind the Earth's bow shock. Based on THEMIS observations, we have already suggested that turbulence behind the quasi-perpendicular shock is more intermittent with larger kurtosis than that behind the quasi-parallel shocks. Following this study, we would like to present a detailed analysis of intermittent anisotropic turbulence in the magnetosheath depending on various characteristics of plasma behind the bow shock and now also near the magnetopause. In particular, for very high Alfvénic Mach numbers and high plasma beta we have clear non-Gaussian statistics in the directions perpendicular to the magnetic field. On the other hand, for directions parallel to this field the kurtosis is small and the plasma is close to equilibrium. However, the level of intermittency for the outgoing fluctuations seems to be similar to that for the ingoing fluctuations, which is consistent with approximate equipartition of energy between the oppositely propagating Alfvén waves. We hope that the difference in characteristic behavior of these fluctuations in various regions of space plasmas can help to detect some complex structures in space missions in the near future.
Generation of low-divergence laser beams
Kronberg, James W.
1993-01-01
Apparatus for transforming a conventional beam of coherent light, having a Gaussian energy distribution and relatively high divergence, into a beam in which the energy distribution approximates a single, non-zero-order Bessel function and which therefore has much lower divergence. The apparatus comprises a zone plate having transmitting and reflecting zones defined by the pattern of light interference produced by the combination of a beam of coherent light with a Gaussian energy distribution and one having such a Bessel distribution. The interference pattern between the two beams is a concentric array of multiple annuli, and is preferably recorded as a hologram. The hologram is then used to form the transmitting and reflecting zones by photo-etching portions of a reflecting layer deposited on a plate made of a transmitting material. A Bessel beam, containing approximately 50% of the energy of the incident beam, is produced by passing a Gaussian beam through such a Bessel zone plate. The reflected beam, also containing approximately 50% of the incident beam energy and having a Bessel energy distribution, can be redirected in the same direction and parallel to the transmitted beam. Alternatively, a filter similar to the Bessel zone plate can be placed within the resonator cavity of a conventional laser system having a front mirror and a rear mirror, preferably axially aligned with the mirrors and just inside the front mirror to generate Bessel energy distribution light beams at the laser source.
Parareal in time 3D numerical solver for the LWR Benchmark neutron diffusion transient model
DOE Office of Scientific and Technical Information (OSTI.GOV)
Baudron, Anne-Marie, E-mail: anne-marie.baudron@cea.fr; CEA-DRN/DMT/SERMA, CEN-Saclay, 91191 Gif sur Yvette Cedex; Lautard, Jean-Jacques, E-mail: jean-jacques.lautard@cea.fr
2014-12-15
In this paper we present a time-parallel algorithm for the 3D neutrons calculation of a transient model in a nuclear reactor core. The neutrons calculation consists in numerically solving the time dependent diffusion approximation equation, which is a simplified transport equation. The numerical resolution is done with finite elements method based on a tetrahedral meshing of the computational domain, representing the reactor core, and time discretization is achieved using a θ-scheme. The transient model presents moving control rods during the time of the reaction. Therefore, cross-sections (piecewise constants) are taken into account by interpolations with respect to the velocity ofmore » the control rods. The parallelism across the time is achieved by an adequate use of the parareal in time algorithm to the handled problem. This parallel method is a predictor corrector scheme that iteratively combines the use of two kinds of numerical propagators, one coarse and one fine. Our method is made efficient by means of a coarse solver defined with large time step and fixed position control rods model, while the fine propagator is assumed to be a high order numerical approximation of the full model. The parallel implementation of our method provides a good scalability of the algorithm. Numerical results show the efficiency of the parareal method on large light water reactor transient model corresponding to the Langenbuch–Maurer–Werner benchmark.« less
Breakdown of Spatial Parallel Coding in Children's Drawing
ERIC Educational Resources Information Center
De Bruyn, Bart; Davis, Alyson
2005-01-01
When drawing real scenes or copying simple geometric figures young children are highly sensitive to parallel cues and use them effectively. However, this sensitivity can break down in surprisingly simple tasks such as copying a single line where robust directional errors occur despite the presence of parallel cues. Before we can conclude that this…
Heinrich, Doris; Sackmann, Erich
2006-11-01
The micro-viscoelasticity of the intracellular space of Dictyostelium discoideum cells is studied by evaluating the intracellular transport of magnetic force probes and their viscoelastic responses to force pulses of 20-700 pN. The role of the actin cortex, the microtubule (MT) aster and their crosstalk is explored by comparing the behaviour of wild-type cells, myosin II null mutants, latrunculin A and benomyl treated cells. The MT coupled beads perform irregular local and long range directed motions which are characterized by measuring their velocity distributions (P(v)). The correlated motion of the MT and the centrosome are evaluated by microfluorescence of GFP-labelled MTs. P(v) can be represented by log-normal distributions with long tails and it is determined by random sweeping motions (v approximately 0.5 microm/s) of the MTs (caused by tangential forces on the filament ends coupled to the actin cortex) and by intermittent bead transports parallel to the MTs (v(max) approximately 1.5 microm/s). The tails are due to spontaneous filament deflections (with speeds up to 10 microm/s) attributed to pre-stressing of the MT by local cortical tensions, generated by dynactin motors generating plus-end directed forces in the MTs. The viscoelastic responses are strongly non-linear and are mostly directed opposite or perpendicular to the force, showing that the cytoplasm behaves as an active viscoplastic body with time and force dependent drag coefficients. Nano-Newton loads exerted on the soft MT are balanced by traction forces arising at the MT ends coupled to the actin cortex and the centrosome, respectively. The mechanical coupling between the soft microtubules and the viscoelastic actin cortex provides cells with high mechanical stability despite the softness of the cytoplasm.
Special cases of friction and applications
NASA Technical Reports Server (NTRS)
Litvin, F. L.; Coy, J. J.
1983-01-01
Two techniques for reducing friction forces are presented. The techniques are applied to the generalized problem of reducing the friction between kinematic pairs which connect a moveable link to a frame. The basic principles are: (1) Let the moveable link be supported by two bearings where the relative velocities of the link with respect to each bearing are of opposite directions. Thus the resultant force (torque) of friction acting on the link due to the bearings is approximately zero. Then, additional perturbation of motion parallel to the main motion of the moveable link will require only a very small force; (2) Let the perturbation in motion be perpendicular to the main motion. Equations are developed which explain these two methods. The results are discussed in relation to friction in geared couplings, gyroscope gimbal bearings and a rotary conveyor system. Design examples are presented.
Cobalt adatoms on graphene: Effects of anisotropies on the correlated electronic structure
NASA Astrophysics Data System (ADS)
Mozara, R.; Valentyuk, M.; Krivenko, I.; Şaşıoǧlu, E.; Kolorenč, J.; Lichtenstein, A. I.
2018-02-01
Impurities on surfaces experience a geometric symmetry breaking induced not only by the on-site crystal-field splitting and the orbital-dependent hybridization, but also by different screening of the Coulomb interaction in different directions. We present a many-body study of the Anderson impurity model representing a Co adatom on graphene, taking into account all anisotropies of the effective Coulomb interaction, which we obtained by the constrained random-phase approximation. The most pronounced differences are naturally displayed by the many-body self-energy projected onto the single-particle states. For the solution of the Anderson impurity model and analytical continuation of the Matsubara data, we employed new implementations of the continuous-time hybridization expansion quantum Monte Carlo and the stochastic optimization method, and we verified the results in parallel with the exact diagonalization method.
Sound-turbulence interaction in transonic boundary layers
NASA Astrophysics Data System (ADS)
Lelostec, Ludovic; Scalo, Carlo; Lele, Sanjiva
2014-11-01
Acoustic wave scattering in a transonic boundary layer is investigated through a novel approach. Instead of simulating directly the interaction of an incoming oblique acoustic wave with a turbulent boundary layer, suitable Dirichlet conditions are imposed at the wall to reproduce only the reflected wave resulting from the interaction of the incident wave with the boundary layer. The method is first validated using the laminar boundary layer profiles in a parallel flow approximation. For this scattering problem an exact inviscid solution can be found in the frequency domain which requires numerical solution of an ODE. The Dirichlet conditions are imposed in a high-fidelity unstructured compressible flow solver for Large Eddy Simulation (LES), CharLESx. The acoustic field of the reflected wave is then solved and the interaction between the boundary layer and sound scattering can be studied.
NASA Technical Reports Server (NTRS)
Goldstein, Marvin E.; Leib, Stewart J.
1999-01-01
An approximate method for calculating the noise generated by a turbulent flow within a semi-infinite duct of arbitrary cross section is developed. It is based on a previously derived high-frequency solution to Lilley's equation, which describes the sound propagation in a transversely-sheared mean flow. The source term is simplified by assuming the turbulence to be axisymmetric about the mean flow direction. Numerical results are presented for the special case of a ring source in a circular duct with an axisymmetric mean flow. They show that the internally generated noise is suppressed at sufficiently large upstream angles in a hard walled duct, and that acoustic liners can significantly reduce the sound radiated in both the upstream and downstream regions, depending upon the source location and Mach number of the flow.
NASA Technical Reports Server (NTRS)
Goldstein, Marvin E.; Leib, Stewart J.
1999-01-01
An approximate method for calculating the noise generated by a turbulent flow within a semi-infinite duct of arbitrary cross section is developed. It is based on a previously derived high-frequency solution to Lilley's equation, which describes the sound propagation in transversely-sheared mean flow. The source term is simplified by assuming the turbulence to be axisymmetric about the mean flow direction. Numerical results are presented for the special case of a ring source in a circular duct with an axisymmetric mean flow. They show that the internally generated noise is suppressed at sufficiently large upstream angles in a hard walled duct, and that acoustic liners can significantly reduce the sound radiated in both the upstream and downstream regions, depending upon the source location and Mach number of the flow.
NASA Astrophysics Data System (ADS)
Kohyama, Sumihiro; Takahashi, Hidetoshi; Yoshida, Satoru; Onoe, Hiroaki; Hirayama-Shoji, Kayoko; Tsukagoshi, Takuya; Takahata, Tomoyuki; Shimoyama, Isao
2018-04-01
This paper reports on a method to measure a spring constant on site using a micro electro mechanical systems (MEMS) force and displacement sensor. The proposed sensor consists of a force-sensing cantilever and a displacement-sensing cantilever. Each cantilever is composed of two beams with a piezoresistor on the sidewall for measuring the in-plane lateral directional force and displacement. The force resolution and displacement resolution of the fabricated sensor were less than 0.8 µN and 0.1 µm, respectively. We measured the spring constants of two types of hydrogel microparticles to demonstrate the effectiveness of the proposed sensor, with values of approximately 4.3 N m-1 and 15.1 N m-1 obtained. The results indicated that the proposed sensor is effective for on-site spring constant measurement.
Computational physics of the mind
NASA Astrophysics Data System (ADS)
Duch, Włodzisław
1996-08-01
In the XIX century and earlier physicists such as Newton, Mayer, Hooke, Helmholtz and Mach were actively engaged in the research on psychophysics, trying to relate psychological sensations to intensities of physical stimuli. Computational physics allows to simulate complex neural processes giving a chance to answer not only the original psychophysical questions but also to create models of the mind. In this paper several approaches relevant to modeling of the mind are outlined. Since direct modeling of the brain functions is rather limited due to the complexity of such models a number of approximations is introduced. The path from the brain, or computational neurosciences, to the mind, or cognitive sciences, is sketched, with emphasis on higher cognitive functions such as memory and consciousness. No fundamental problems in understanding of the mind seem to arise. From a computational point of view realistic models require massively parallel architectures.
NASA Astrophysics Data System (ADS)
Habenicht, Carsten; Schuster, Roman; Knupfer, Martin; Büchner, Bernd
2018-05-01
We have investigated indirect excitons in bulk 2H-MoS2 using transmission electron energy-loss spectroscopy. The electron energy-loss spectra were measured for various momentum transfer values parallel to the and directions of the Brillouin zone. The results allowed the identification of the indirect excitons between the valence band K v and conduction band Λc points, the Γv and K c points as well as adjacent K v and points. The energy-momentum dispersions for the K v-Λc, Γv-K c and K v1- excitons along the line are presented. The former two transitions exhibit a quadratic dispersion which allowed calculating their effective exciton masses based on the effective mass approximation. The K v1- transition follows a more linear dispersion relationship.
Pulsed electromagnetic gas acceleration
NASA Technical Reports Server (NTRS)
Jahn, R. G.; Vonjaskowsky, W. F.; Clark, K. E.
1971-01-01
Experimental data were combined with one-dimensional conservation relations to yield information on the energy deposition ratio in a parallel-plate accelerator, where the downstream flow was confined to a constant area channel. Approximately 70% of the total input power was detected in the exhaust flow, of which only about 20% appeared as directed kinetic energy, thus implying that a downstream expansion to convert chamber enthalpy into kinetic energy must be an important aspect of conventional high power MPD arcs. Spectroscopic experiments on a quasi-steady MPD argon accelerator verified the presence of A(III) and the absence of A(I), and indicated an azimuthal structure in the jet related to the mass injection locations. Measurements of pressure in the arc chamber and impact pressure in the exhaust jet using a piezocrystal backed by a Plexiglas rod were in good agreement with the electromagnetic thrust model.
Atmospheric effects on radiometry from zenith of a plane with dark vertical protrusions
NASA Technical Reports Server (NTRS)
Otterman, J.
1983-01-01
Effects of an optically thin plane-parallel scattering atmosphere on radiometric imaging from the zenith of a specific surface-type are analyzed. The surface model was previously developed to describe arid steppe, where the sparse vegetation forms dark vertical protrusions from the bright soil-plane. The analysis is in terms of the surface reflectivity to the zenith r sub p for the direct beam, which is formulated as r sub p = r sub i exp (-s tan theta sub 0), where v sub i is the Lambert law reflectivity of the soil, the protrusions parameters s is the projection on a vertical plane of protrusions per unit area and theta sub 0 is the zenith angle. The surface reflectivity r sub p is approximately equal to that for the global irradiance (which is directly measured in the field) only for a narrow range of the solar zenith angles. The effects of the atmosphere when imaging large uniform areas of this type are comparable to those in imaging a Lambert surface with a reflectivity r sub p. Thus, the effects can be approximated by those in the case of a dark Lambert surface (analyzed previously), inasmuch as r sub p is smaller than the soil reflectivity r sub i for any off-zenith illumination. The surface becomes effectively darker with increasing solar zenith angle. Adjacency effects of a reflection from one area and scattering in the instantaneous field of view (object pixel) are analyzed as cross radiance and cross irradiance.
Hydrogen-assisted stable crack growth in iron-3 wt% silicon steel
DOE Office of Scientific and Technical Information (OSTI.GOV)
Marrow, T.J.; Prangnell, P.; Aindow, M.
1996-08-01
Observations of internal hydrogen cleavage in Fe-3Si are reported. Hydrogen-assisted stable crack growth (H-SCG) is associated with cleavage striations of a 300 nm spacing, observed using scanning electron microscopy (SEM) and atomic force microscopy (AFM). High resolution SEM revealed finer striations, previously undetected, with a spacing of approximately 30 nm. These were parallel to the coarser striations. Scanning tunneling microscopy (STM) also showed the fine striation spacing, and gave a striation height of approximately 15 nm. The crack front was not parallel to the striations. Transmission electron microscopy (TEM) of crack tip plastic zones showed {l_brace}112{r_brace} and {l_brace}110{r_brace} slip, withmore » a high dislocation density (around 10{sup 14}m{sup {minus}2}). The slip plane spacing was approximately 15--30 nm. Parallel arrays of high dislocation density were observed in the wake of the hydrogen cleavage crack. It is concluded that H-ScG in Fe-3Si occurs by periodic brittle cleavage on the {l_brace}001{r_brace} planes. This is preceded by dislocation emission. The coarse striations are produced by crack tip blunting and the fine striations by dislocations attracted by image forces to the fracture surface after cleavage. The effects of temperature, pressure and yield strength on the kinetics of H-SCG can be predicted using a model for diffusion of hydrogen through the plastic zone.« less
Pulvermüller, Friedemann; Shtyrov, Yury; Hauk, Olaf
2009-08-01
How long does it take the human mind to grasp the idea when hearing or reading a sentence? Neurophysiological methods looking directly at the time course of brain activity indexes of comprehension are critical for finding the answer to this question. As the dominant cognitive approaches, models of serial/cascaded and parallel processing, make conflicting predictions on the time course of psycholinguistic information access, they can be tested using neurophysiological brain activation recorded in MEG and EEG experiments. Seriality and cascading of lexical, semantic and syntactic processes receives support from late (latency approximately 1/2s) sequential neurophysiological responses, especially N400 and P600. However, parallelism is substantiated by early near-simultaneous brain indexes of a range of psycholinguistic processes, up to the level of semantic access and context integration, emerging already 100-250ms after critical stimulus information is present. Crucially, however, there are reliable latency differences of 20-50ms between early cortical area activations reflecting lexical, semantic and syntactic processes, which are left unexplained by current serial and parallel brain models of language. We here offer a mechanistic model grounded in cortical nerve cell circuits that builds upon neuroanatomical and neurophysiological knowledge and explains both near-simultaneous activations and fine-grained delays. A key concept is that of discrete distributed cortical circuits with specific inter-area topographies. The full activation, or ignition, of specifically distributed binding circuits explains the near-simultaneity of early neurophysiological indexes of lexical, syntactic and semantic processing. Activity spreading within circuits determined by between-area conduction delays accounts for comprehension-related regional activation differences in the millisecond range.
Phloem Metabolism and Function Have to Cope with Low Internal Oxygen1
van Dongen, Joost T.; Schurr, Ulrich; Pfister, Michelle; Geigenberger, Peter
2003-01-01
We have investigated the consequences of endogenous limitations in oxygen delivery for phloem transport in Ricinus communis. In situ oxygen profiles were measured directly across stems of plants growing in air (21% [v/v] oxygen), using a microsensor with a tip diameter of approximately 30 μm. Oxygen levels decreased from 21% (v/v) at the surface to 7% (v/v) in the vascular region and increased again to 15% (v/v) toward the hollow center of the stem. Phloem sap exuding from small incisions in the bark of the stem was hypoxic, and the ATP to ADP ratio (4.1) and energy charge (0.78) were also low. When 5-cm stem segments of intact plants were exposed to zero external oxygen for 90 min, oxygen levels within the phloem decreased to approximately 2% (v/v), and ATP to ADP ratio and adenylate energy charge dropped further to 1.92 and 0.68, respectively. This was accompanied by a marked decrease in the phloem sucrose (Suc) concentration and Suc transport rate, which is likely to be explained by the inhibition of retrieval processes in the phloem. Germinating seedlings were used to analyze the effect of a stepwise decrease in oxygen tension on phloem transport and energy metabolism in more detail. Within the endosperm embedding the cotyledons—next to the phloem loading sites—oxygen decreased from approximately 14% (v/v) in 6-d-old seedlings down to approximately 6% (v/v) in 10-d-old seedlings. This was paralleled by a similar decrease of oxygen inside the hypocotyl. When the endosperm was removed and cotyledons incubated in a 100 mm Suc solution with 21%, 6%, 3%, or 0.5% (v/v) oxygen for 3 h before phloem sap was analyzed, decreasing oxygen tensions led to a progressive decrease in phloem energy state, indicating a partial inhibition of respiration. The estimated ratio of NADH to NAD+ in the phloem exudate remained low (approximately 0.0014) when oxygen was decreased to 6% and 3% (v/v) but increased markedly (to approximately 0.008) at 0.5% (v/v) oxygen, paralleled by an increase in lactate and ethanol. Suc concentration and translocation decreased when oxygen was decreased to 3% and 0.5% (v/v). Falling oxygen led to a progressive increase in amino acids, especially of alanine, γ-aminobutyrat, methionine, and isoleucine, a progressive decrease in the C to N ratio, and an increase in the succinate to malate ratio in the phloem. These results show that oxygen concentration is low inside the transport phloem in planta and that this results in adaptive changes in phloem metabolism and function. PMID:12692313
Parallelization and automatic data distribution for nuclear reactor simulations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liebrock, L.M.
1997-07-01
Detailed attempts at realistic nuclear reactor simulations currently take many times real time to execute on high performance workstations. Even the fastest sequential machine can not run these simulations fast enough to ensure that the best corrective measure is used during a nuclear accident to prevent a minor malfunction from becoming a major catastrophe. Since sequential computers have nearly reached the speed of light barrier, these simulations will have to be run in parallel to make significant improvements in speed. In physical reactor plants, parallelism abounds. Fluids flow, controls change, and reactions occur in parallel with only adjacent components directlymore » affecting each other. These do not occur in the sequentialized manner, with global instantaneous effects, that is often used in simulators. Development of parallel algorithms that more closely approximate the real-world operation of a reactor may, in addition to speeding up the simulations, actually improve the accuracy and reliability of the predictions generated. Three types of parallel architecture (shared memory machines, distributed memory multicomputers, and distributed networks) are briefly reviewed as targets for parallelization of nuclear reactor simulation. Various parallelization models (loop-based model, shared memory model, functional model, data parallel model, and a combined functional and data parallel model) are discussed along with their advantages and disadvantages for nuclear reactor simulation. A variety of tools are introduced for each of the models. Emphasis is placed on the data parallel model as the primary focus for two-phase flow simulation. Tools to support data parallel programming for multiple component applications and special parallelization considerations are also discussed.« less
Demonstrating Forces between Parallel Wires.
ERIC Educational Resources Information Center
Baker, Blane
2000-01-01
Describes a physics demonstration that dramatically illustrates the mutual repulsion (attraction) between parallel conductors using insulated copper wire, wooden dowels, a high direct current power supply, electrical tape, and an overhead projector. (WRM)
Hierarchical Parallelism in Finite Difference Analysis of Heat Conduction
NASA Technical Reports Server (NTRS)
Padovan, Joseph; Krishna, Lala; Gute, Douglas
1997-01-01
Based on the concept of hierarchical parallelism, this research effort resulted in highly efficient parallel solution strategies for very large scale heat conduction problems. Overall, the method of hierarchical parallelism involves the partitioning of thermal models into several substructured levels wherein an optimal balance into various associated bandwidths is achieved. The details are described in this report. Overall, the report is organized into two parts. Part 1 describes the parallel modelling methodology and associated multilevel direct, iterative and mixed solution schemes. Part 2 establishes both the formal and computational properties of the scheme.
NASA Technical Reports Server (NTRS)
Sanger, Eugen
1932-01-01
In the present report the computation is actually carried through for the case of parallel spars of equal resistance in bending without direct loading, including plotting of the influence lines; for other cases the method of calculation is explained. The development of large size airplanes can be speeded up by accurate methods of calculation such as this.
Time-Domain Evaluation of Fractional Order Controllers’ Direct Discretization Methods
NASA Astrophysics Data System (ADS)
Ma, Chengbin; Hori, Yoichi
Fractional Order Control (FOC), in which the controlled systems and/or controllers are described by fractional order differential equations, has been applied to various control problems. Though it is not difficult to understand FOC’s theoretical superiority, realization issue keeps being somewhat problematic. Since the fractional order systems have an infinite dimension, proper approximation by finite difference equation is needed to realize the designed fractional order controllers. In this paper, the existing direct discretization methods are evaluated by their convergences and time-domain comparison with the baseline case. Proposed sampling time scaling property is used to calculate the baseline case with full memory length. This novel discretization method is based on the classical trapezoidal rule but with scaled sampling time. Comparative studies show good performance and simple algorithm make the Short Memory Principle method most practically superior. The FOC research is still at its primary stage. But its applications in modeling and robustness against non-linearities reveal the promising aspects. Parallel to the development of FOC theories, applying FOC to various control problems is also crucially important and one of top priority issues.
Propulsion of helical flagella near boundaries
NASA Astrophysics Data System (ADS)
Rodenborn, Bruce; Giesbrecht, Grant; Ni, Katha; Vock, Isaac
The presence of nearby boundaries is known to have dramatic effects on the swimming behavior of microorganisms because of the no-slip condition at the boundary. Microorganisms that use a helical flagellum experience forces both along the axis of the helix and in the direction perpendicular to the axis. These low Reynolds number boundary effects have primarily been studied using live bacteria and using numerical simulations. However, small scale measurements give limited information about the forces and torques on the microorganisms. Furthermore, numerical studies are approximate because they have generally used Stokeslet-based simulations with image Stokeslets to represent the effects of the boundaries. Instead, we directly measure the propulsion of macroscopic helical flagella with diameter 12 mm using a fluid with viscosity 105 times that of water to ensure the Reynolds number in the experiments is much less than unity, just as for bacteria. We measure the parallel and perpendicular forces as a function of boundary distance to determine the nonzero elements of the propulsive matrix for axial rotation near a boundary. We then compare our results to the theory and simulations of Lauga et al. and to biological measurements.
NASA Astrophysics Data System (ADS)
Liakos, Anastasios; Malamataris, Nikolaos
2014-11-01
The topology and evolution of flow around a surface mounted cubical object in three dimensional channel flow is examined for low to moderate Reynolds numbers. Direct numerical simulations were performed via a home made parallel finite element code. The computational domain has been designed according to actual laboratory experimental conditions. Analysis of the results is performed using the three dimensional theory of separation. Our findings indicate that a tornado-like vortex by the side of the cube is present for all Reynolds numbers for which flow was simulated. A horse-shoe vortex upstream from the cube was formed at Reynolds number approximately 1266. Pressure distributions are shown along with three dimensional images of the tornado-like vortex and the horseshoe vortex at selected Reynolds numbers. Finally, and in accordance to previous work, our results indicate that the upper limit for the Reynolds number for which steady state results are physically realizable is roughly 2000. Financial support of author NM from the Office of Naval Research Global (ONRG-VSP, N62909-13-1-V016) is acknowledged.
NASA Astrophysics Data System (ADS)
Abramov, Rafail V.
2018-06-01
For the gas near a solid planar wall, we propose a scaling formula for the mean free path of a molecule as a function of the distance from the wall, under the assumption of a uniform distribution of the incident directions of the molecular free flight. We subsequently impose the same scaling onto the viscosity of the gas near the wall and compute the Navier-Stokes solution of the velocity of a shear flow parallel to the wall. Under the simplifying assumption of constant temperature of the gas, the velocity profile becomes an explicit nonlinear function of the distance from the wall and exhibits a Knudsen boundary layer near the wall. To verify the validity of the obtained formula, we perform the Direct Simulation Monte Carlo computations for the shear flow of argon and nitrogen at normal density and temperature. We find excellent agreement between our velocity approximation and the computed DSMC velocity profiles both within the Knudsen boundary layer and away from it.
NASA Technical Reports Server (NTRS)
Woronowicz, Michael S.
2016-01-01
Analytical expressions for column number density (CND) are developed for optical line of sight paths through a variety of steady free molecule point source models including directionally-constrained effusion (Mach number M = 0) and flow from a sonic orifice (M 1). Sonic orifice solutions are approximate, developed using a fair simulacrum fitted to the free molecule solution. Expressions are also developed for a spherically-symmetric thermal expansion (M = 0). CND solutions are found for the most general paths relative to these sources and briefly explored. It is determined that the maximum CND from a distant location through directed effusion and sonic orifice cases occurs along the path parallel to the source plane that intersects the plume axis. For the effusive case this value is exactly twice the CND found along the ray originating from that point of intersection and extending to infinity along the plumes axis. For sonic plumes this ratio is reduced to about 43. For high Mach number cases the maximum CND will be found along the axial centerline path.
NASA Astrophysics Data System (ADS)
Byun, Hye Suk; El-Naggar, Mohamed Y.; Kalia, Rajiv K.; Nakano, Aiichiro; Vashishta, Priya
2017-10-01
Kinetic Monte Carlo (KMC) simulations are used to study long-time dynamics of a wide variety of systems. Unfortunately, the conventional KMC algorithm is not scalable to larger systems, since its time scale is inversely proportional to the simulated system size. A promising approach to resolving this issue is the synchronous parallel KMC (SPKMC) algorithm, which makes the time scale size-independent. This paper introduces a formal derivation of the SPKMC algorithm based on local transition-state and time-dependent Hartree approximations, as well as its scalable parallel implementation based on a dual linked-list cell method. The resulting algorithm has achieved a weak-scaling parallel efficiency of 0.935 on 1024 Intel Xeon processors for simulating biological electron transfer dynamics in a 4.2 billion-heme system, as well as decent strong-scaling parallel efficiency. The parallel code has been used to simulate a lattice of cytochrome complexes on a bacterial-membrane nanowire, and it is broadly applicable to other problems such as computational synthesis of new materials.
The implementation of an aeronautical CFD flow code onto distributed memory parallel systems
NASA Astrophysics Data System (ADS)
Ierotheou, C. S.; Forsey, C. R.; Leatham, M.
2000-04-01
The parallelization of an industrially important in-house computational fluid dynamics (CFD) code for calculating the airflow over complex aircraft configurations using the Euler or Navier-Stokes equations is presented. The code discussed is the flow solver module of the SAUNA CFD suite. This suite uses a novel grid system that may include block-structured hexahedral or pyramidal grids, unstructured tetrahedral grids or a hybrid combination of both. To assist in the rapid convergence to a solution, a number of convergence acceleration techniques are employed including implicit residual smoothing and a multigrid full approximation storage scheme (FAS). Key features of the parallelization approach are the use of domain decomposition and encapsulated message passing to enable the execution in parallel using a single programme multiple data (SPMD) paradigm. In the case where a hybrid grid is used, a unified grid partitioning scheme is employed to define the decomposition of the mesh. The parallel code has been tested using both structured and hybrid grids on a number of different distributed memory parallel systems and is now routinely used to perform industrial scale aeronautical simulations. Copyright
A hybrid parallel framework for the cellular Potts model simulations
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jiang, Yi; He, Kejing; Dong, Shoubin
2009-01-01
The Cellular Potts Model (CPM) has been widely used for biological simulations. However, most current implementations are either sequential or approximated, which can't be used for large scale complex 3D simulation. In this paper we present a hybrid parallel framework for CPM simulations. The time-consuming POE solving, cell division, and cell reaction operation are distributed to clusters using the Message Passing Interface (MPI). The Monte Carlo lattice update is parallelized on shared-memory SMP system using OpenMP. Because the Monte Carlo lattice update is much faster than the POE solving and SMP systems are more and more common, this hybrid approachmore » achieves good performance and high accuracy at the same time. Based on the parallel Cellular Potts Model, we studied the avascular tumor growth using a multiscale model. The application and performance analysis show that the hybrid parallel framework is quite efficient. The hybrid parallel CPM can be used for the large scale simulation ({approx}10{sup 8} sites) of complex collective behavior of numerous cells ({approx}10{sup 6}).« less
NASA Astrophysics Data System (ADS)
Onishi, Keiji; Tsubokura, Makoto
2016-11-01
A methodology to eliminate the manual work required for correcting the surface imperfections of computer-aided-design (CAD) data, will be proposed. Such a technique is indispensable for CFD analysis of industrial applications involving complex geometries. The CAD geometry is degenerated into cell-oriented values based on Cartesian grid. This enables the parallel pre-processing as well as the ability to handle 'dirty' CAD data that has gaps, overlaps, or sharp edges without necessitating any fixes. An arbitrary boundary representation is used with a dummy-cell technique based on immersed boundary (IB) method. To model the IB, a forcing term is directly imposed at arbitrary ghost cells by linear interpolation of the momentum. The mass conservation is satisfied in the approximate domain that covers fluid region except the wall including cells. Attempts to Satisfy mass conservation in the wall containing cells leads to pressure oscillations near the IB. The consequence of this approximation will be discussed through fundamental study of an LES based channel flow simulation, and high Reynolds number flow around a sphere. And, an analysis comparing our results with wind tunnel experiments of flow around a full-vehicle geometry will also be presented.
Explicit and Implicit Processes Constitute the Fast and Slow Processes of Sensorimotor Learning.
McDougle, Samuel D; Bond, Krista M; Taylor, Jordan A
2015-07-01
A popular model of human sensorimotor learning suggests that a fast process and a slow process work in parallel to produce the canonical learning curve (Smith et al., 2006). Recent evidence supports the subdivision of sensorimotor learning into explicit and implicit processes that simultaneously subserve task performance (Taylor et al., 2014). We set out to test whether these two accounts of learning processes are homologous. Using a recently developed method to assay explicit and implicit learning directly in a sensorimotor task, along with a computational modeling analysis, we show that the fast process closely resembles explicit learning and the slow process approximates implicit learning. In addition, we provide evidence for a subdivision of the slow/implicit process into distinct manifestations of motor memory. We conclude that the two-state model of motor learning is a close approximation of sensorimotor learning, but it is unable to describe adequately the various implicit learning operations that forge the learning curve. Our results suggest that a wider net be cast in the search for the putative psychological mechanisms and neural substrates underlying the multiplicity of processes involved in motor learning. Copyright © 2015 the authors 0270-6474/15/359568-12$15.00/0.
Explicit and Implicit Processes Constitute the Fast and Slow Processes of Sensorimotor Learning
Bond, Krista M.; Taylor, Jordan A.
2015-01-01
A popular model of human sensorimotor learning suggests that a fast process and a slow process work in parallel to produce the canonical learning curve (Smith et al., 2006). Recent evidence supports the subdivision of sensorimotor learning into explicit and implicit processes that simultaneously subserve task performance (Taylor et al., 2014). We set out to test whether these two accounts of learning processes are homologous. Using a recently developed method to assay explicit and implicit learning directly in a sensorimotor task, along with a computational modeling analysis, we show that the fast process closely resembles explicit learning and the slow process approximates implicit learning. In addition, we provide evidence for a subdivision of the slow/implicit process into distinct manifestations of motor memory. We conclude that the two-state model of motor learning is a close approximation of sensorimotor learning, but it is unable to describe adequately the various implicit learning operations that forge the learning curve. Our results suggest that a wider net be cast in the search for the putative psychological mechanisms and neural substrates underlying the multiplicity of processes involved in motor learning. PMID:26134640
Justification of Shallow-Water Theory
NASA Astrophysics Data System (ADS)
Ostapenko, V. V.
2018-01-01
The basic conservation laws of shallow-water theory are derived from multidimensional mass and momentum integral conservation laws describing the plane-parallel flow of an ideal incompressible fluid above the horizontal bottom. This conclusion is based on the concept of hydrostatic approximation, which generalizes the concept of long-wavelength approximation and is used for justifying the applicability of the shallow-water theory in the simulation of wave flows of fluid with hydraulic bores.
DOE Office of Scientific and Technical Information (OSTI.GOV)
2014-01-17
This library is an implementation of the Sparse Approximate Matrix Multiplication (SpAMM) algorithm introduced. It provides a matrix data type, and an approximate matrix product, which exhibits linear scaling computational complexity for matrices with decay. The product error and the performance of the multiply can be tuned by choosing an appropriate tolerance. The library can be compiled for serial execution or parallel execution on shared memory systems with an OpenMP capable compiler
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shadid, John Nicolas; Elman, Howard; Shuttleworth, Robert R.
2007-04-01
In recent years, considerable effort has been placed on developing efficient and robust solution algorithms for the incompressible Navier-Stokes equations based on preconditioned Krylov methods. These include physics-based methods, such as SIMPLE, and purely algebraic preconditioners based on the approximation of the Schur complement. All these techniques can be represented as approximate block factorization (ABF) type preconditioners. The goal is to decompose the application of the preconditioner into simplified sub-systems in which scalable multi-level type solvers can be applied. In this paper we develop a taxonomy of these ideas based on an adaptation of a generalized approximate factorization of themore » Navier-Stokes system first presented in [25]. This taxonomy illuminates the similarities and differences among these preconditioners and the central role played by efficient approximation of certain Schur complement operators. We then present a parallel computational study that examines the performance of these methods and compares them to an additive Schwarz domain decomposition (DD) algorithm. Results are presented for two and three-dimensional steady state problems for enclosed domains and inflow/outflow systems on both structured and unstructured meshes. The numerical experiments are performed using MPSalsa, a stabilized finite element code.« less
NASA Technical Reports Server (NTRS)
Kahler, S.; Lin, R. P.
1994-01-01
The determination of the polarities of interplanetary magnetic fields (whether the field direction is outward from or inward toward the sun) has been based on a comparison of observed field directions with the nominal Parker spiral angle. These polarities can be mapped back to the solar source field polarities. This technique fails when field directions deviate substantially from the Parker angle or when fields are substantially kinked. We introduce a simple new technique to determine the polarities of interplanetary fields using E greater than 2 keV interplanetary electrons which stream along field lines away from the sun. Those electrons usually show distinct unidirectional pitch-angle anisotropies either parallel or anti-parallel to the field. Since the electron flow direction is known to be outward from the sun, the anisotropies parallel to the field indicate outward-pointing, positive-polarity fields, and those anti-parallel indicate inward-pointing, negative-polarity fields. We use data from the UC Berkeley electron experiment on the International Sun Earth Explorer 3 (ISSE-3) spacecraft to compare the field polarities deduced from the electron data, Pe (outward or inward), with the polarities inferred from field directions, Pd, around two sector boundaries in 1979. We show examples of large (greater than 100 deg) changes in azimuthal field direction Phi over short (less than 1 hr) time scales, some with and some without reversals in Pe. The latter cases indicate that such large directional changes can occur in unipolar structures. On the other hand, we found an example of a change in Pe during which the rotation in Phi was less than 30 deg, indicating polarity changes in nearly unidirectional structures. The field directions are poor guides to the polarities in these cases.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rouet, François-Henry; Li, Xiaoye S.; Ghysels, Pieter
In this paper, we present a distributed-memory library for computations with dense structured matrices. A matrix is considered structured if its off-diagonal blocks can be approximated by a rank-deficient matrix with low numerical rank. Here, we use Hierarchically Semi-Separable (HSS) representations. Such matrices appear in many applications, for example, finite-element methods, boundary element methods, and so on. Exploiting this structure allows for fast solution of linear systems and/or fast computation of matrix-vector products, which are the two main building blocks of matrix computations. The compression algorithm that we use, that computes the HSS form of an input dense matrix, reliesmore » on randomized sampling with a novel adaptive sampling mechanism. We discuss the parallelization of this algorithm and also present the parallelization of structured matrix-vector product, structured factorization, and solution routines. The efficiency of the approach is demonstrated on large problems from different academic and industrial applications, on up to 8,000 cores. Finally, this work is part of a more global effort, the STRUctured Matrices PACKage (STRUMPACK) software package for computations with sparse and dense structured matrices. Hence, although useful on their own right, the routines also represent a step in the direction of a distributed-memory sparse solver.« less
Rouet, François-Henry; Li, Xiaoye S.; Ghysels, Pieter; ...
2016-06-30
In this paper, we present a distributed-memory library for computations with dense structured matrices. A matrix is considered structured if its off-diagonal blocks can be approximated by a rank-deficient matrix with low numerical rank. Here, we use Hierarchically Semi-Separable (HSS) representations. Such matrices appear in many applications, for example, finite-element methods, boundary element methods, and so on. Exploiting this structure allows for fast solution of linear systems and/or fast computation of matrix-vector products, which are the two main building blocks of matrix computations. The compression algorithm that we use, that computes the HSS form of an input dense matrix, reliesmore » on randomized sampling with a novel adaptive sampling mechanism. We discuss the parallelization of this algorithm and also present the parallelization of structured matrix-vector product, structured factorization, and solution routines. The efficiency of the approach is demonstrated on large problems from different academic and industrial applications, on up to 8,000 cores. Finally, this work is part of a more global effort, the STRUctured Matrices PACKage (STRUMPACK) software package for computations with sparse and dense structured matrices. Hence, although useful on their own right, the routines also represent a step in the direction of a distributed-memory sparse solver.« less
Parallel k-means++ for Multiple Shared-Memory Architectures
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mackey, Patrick S.; Lewis, Robert R.
2016-09-22
In recent years k-means++ has become a popular initialization technique for improved k-means clustering. To date, most of the work done to improve its performance has involved parallelizing algorithms that are only approximations of k-means++. In this paper we present a parallelization of the exact k-means++ algorithm, with a proof of its correctness. We develop implementations for three distinct shared-memory architectures: multicore CPU, high performance GPU, and the massively multithreaded Cray XMT platform. We demonstrate the scalability of the algorithm on each platform. In addition we present a visual approach for showing which platform performed k-means++ the fastest for varyingmore » data sizes.« less
VCSELs for datacom applications
NASA Astrophysics Data System (ADS)
Wipiejewski, Torsten; Wolf, Hans-Dieter; Korte, Lutz; Huber, Wolfgang; Kristen, Guenter; Hoyler, Charlotte; Hedrich, Harald; Kleinbub, Oliver; Albrecht, Tony; Mueller, Juergen; Orth, Andreas; Spika, Zeljko; Lutgen, Stephan; Pflaeging, Hartwig; Harrasser, Joerg; Droegemueller, Karsten; Plickert, Volker; Kuhl, Detlef; Blank, Juergen; Pietsch, Doris; Stange, Herwig; Karstensen, Holger
1999-04-01
The use of oxide confined VCSELs in datacom applications is demonstrated. The devices exhibit low threshold currents of approximately 3 mA and low electrical series resistance of about 50 (Omega) . The emission wavelength is in the 850 nm range. Life times of the devices are several million hours under normal operating conditions. VCSEL arrays are employed in a high performance parallel optical link called PAROLITM. This optical ink provides 12 parallel channels with a total bandwidth exceeding 12 Gbit/s. The VCSELs optimized for the parallel optical link show excellent threshold current uniformity between channels of < 50 (mu) A. The array life time drops compared to a single device, but is still larger than 1 million hours.
NASA Astrophysics Data System (ADS)
Zimovets, Artem; Matviychuk, Alexander; Ushakov, Vladimir
2016-12-01
The paper presents two different approaches to reduce the time of computer calculation of reachability sets. First of these two approaches use different data structures for storing the reachability sets in the computer memory for calculation in single-threaded mode. Second approach is based on using parallel algorithms with reference to the data structures from the first approach. Within the framework of this paper parallel algorithm of approximate reachability set calculation on computer with SMP-architecture is proposed. The results of numerical modelling are presented in the form of tables which demonstrate high efficiency of parallel computing technology and also show how computing time depends on the used data structure.
SPM oxidation and parallel writing on zirconium nitride thin films
NASA Astrophysics Data System (ADS)
Farkas, N.; Comer, J. R.; Zhang, G.; Evans, E. A.; Ramsier, R. D.; Dagata, J. A.
2005-07-01
Systematic investigation of the SPM oxidation process of sputter-deposited ZrN thin films is reported. During the intrinsic part of the oxidation, the density of the oxide increases until the total oxide thickness is approximately twice the feature height. Further oxide growth is sustainable as the system undergoes plastic flow followed by delamination from the ZrN-silicon interface keeping the oxide density constant. ZrN exhibits superdiffusive oxidation kinetics in these single tip SPM studies. We extend this work to the fabrication of parallel oxide patterns 70 nm in height covering areas in the square centimeter range. This simple, quick, and well-controlled parallel nanolithographic technique has great potential for biomedical template fabrication.
Tondon, Abhishek; Kaunas, Roland
2014-01-01
Cell structure depends on both matrix strain and stiffness, but their interactive effects are poorly understood. We investigated the interactive roles of matrix properties and stretching patterns on cell structure by uniaxially stretching U2OS cells expressing GFP-actin on silicone rubber sheets supporting either a surface-adsorbed coating or thick hydrogel of type-I collagen. Cells and their actin stress fibers oriented perpendicular to the direction of cyclic stretch on collagen-coated sheets, but oriented parallel to the stretch direction on collagen gels. There was significant alignment parallel to the direction of a steady increase in stretch for cells on collagen gels, while cells on collagen-coated sheets did not align in any direction. The extent of alignment was dependent on both strain rate and duration. Stretch-induced alignment on collagen gels was blocked by the myosin light-chain kinase inhibitor ML7, but not by the Rho-kinase inhibitor Y27632. We propose that active orientation of the actin cytoskeleton perpendicular and parallel to direction of stretch on stiff and soft substrates, respectively, are responses that tend to maintain intracellular tension at an optimal level. Further, our results indicate that cells can align along directions of matrix stress without collagen fibril alignment, indicating that matrix stress can directly regulate cell morphology.
NASA Astrophysics Data System (ADS)
Popov, Igor; Sukov, Sergey
2018-02-01
A modification of the adaptive artificial viscosity (AAV) method is considered. This modification is based on one stage time approximation and is adopted to calculation of gasdynamics problems on unstructured grids with an arbitrary type of grid elements. The proposed numerical method has simplified logic, better performance and parallel efficiency compared to the implementation of the original AAV method. Computer experiments evidence the robustness and convergence of the method to difference solution.
Parallel algorithm for computation of second-order sequential best rotations
NASA Astrophysics Data System (ADS)
Redif, Soydan; Kasap, Server
2013-12-01
Algorithms for computing an approximate polynomial matrix eigenvalue decomposition of para-Hermitian systems have emerged as a powerful, generic signal processing tool. A technique that has shown much success in this regard is the sequential best rotation (SBR2) algorithm. Proposed is a scheme for parallelising SBR2 with a view to exploiting the modern architectural features and inherent parallelism of field-programmable gate array (FPGA) technology. Experiments show that the proposed scheme can achieve low execution times while requiring minimal FPGA resources.
Pattern recognition with parallel associative memory
NASA Technical Reports Server (NTRS)
Toth, Charles K.; Schenk, Toni
1990-01-01
An examination is conducted of the feasibility of searching targets in aerial photographs by means of a parallel associative memory (PAM) that is based on the nearest-neighbor algorithm; the Hamming distance is used as a measure of closeness, in order to discriminate patterns. Attention has been given to targets typically used for ground-control points. The method developed sorts out approximate target positions where precise localizations are needed, in the course of the data-acquisition process. The majority of control points in different images were correctly identified.
Yokohama, Noriya
2013-07-01
This report was aimed at structuring the design of architectures and studying performance measurement of a parallel computing environment using a Monte Carlo simulation for particle therapy using a high performance computing (HPC) instance within a public cloud-computing infrastructure. Performance measurements showed an approximately 28 times faster speed than seen with single-thread architecture, combined with improved stability. A study of methods of optimizing the system operations also indicated lower cost.
NASA Technical Reports Server (NTRS)
Ergun, R. E.; Holmes, J. C.; Goodrich, K. A.; Wilder, F. D.; Stawarz, J. E.; Eriksson, S.; Newman, D. L.; Schwartz, S. J.; Goldman, M. V.; Sturner, A. P.;
2016-01-01
We report observations from the Magnetospheric Multiscale satellites of large-amplitude, parallel, electrostatic waves associated with magnetic reconnection at the Earth's magnetopause. The observed waves have parallel electric fields (E(sub parallel)) with amplitudes on the order of 100 mV/m and display nonlinear characteristics that suggest a possible net E(sub parallel). These waves are observed within the ion diffusion region and adjacent to (within several electron skin depths) the electron diffusion region. They are in or near the magnetosphere side current layer. Simulation results support that the strong electrostatic linear and nonlinear wave activities appear to be driven by a two stream instability, which is a consequence of mixing cold (less than 10eV) plasma in the magnetosphere with warm (approximately 100eV) plasma from the magnetosheath on a freshly reconnected magnetic field line. The frequent observation of these waves suggests that cold plasma is often present near the magnetopause.
Parallel-vector solution of large-scale structural analysis problems on supercomputers
NASA Technical Reports Server (NTRS)
Storaasli, Olaf O.; Nguyen, Duc T.; Agarwal, Tarun K.
1989-01-01
A direct linear equation solution method based on the Choleski factorization procedure is presented which exploits both parallel and vector features of supercomputers. The new equation solver is described, and its performance is evaluated by solving structural analysis problems on three high-performance computers. The method has been implemented using Force, a generic parallel FORTRAN language.
Spatial data analytics on heterogeneous multi- and many-core parallel architectures using python
Laura, Jason R.; Rey, Sergio J.
2017-01-01
Parallel vector spatial analysis concerns the application of parallel computational methods to facilitate vector-based spatial analysis. The history of parallel computation in spatial analysis is reviewed, and this work is placed into the broader context of high-performance computing (HPC) and parallelization research. The rise of cyber infrastructure and its manifestation in spatial analysis as CyberGIScience is seen as a main driver of renewed interest in parallel computation in the spatial sciences. Key problems in spatial analysis that have been the focus of parallel computing are covered. Chief among these are spatial optimization problems, computational geometric problems including polygonization and spatial contiguity detection, the use of Monte Carlo Markov chain simulation in spatial statistics, and parallel implementations of spatial econometric methods. Future directions for research on parallelization in computational spatial analysis are outlined.
H+ and O+ dynamics during ultra-low frequency waves in the Earth's magnetotail plasma sheet
NASA Astrophysics Data System (ADS)
De Spiegeleer, Alexandre; Hamrin, Maria; Pitkänen, Timo; Volwerk, Martin; Mouikis, Christopher; Kistler, Lynn; Nilsson, Hans; Norqvist, Patrik; Andersson, Laila
2017-04-01
The concentration of ionospheric oxygen (O^+) in the magnetotail plasma sheet can be relatively elevated depending on, for instance, the geomagnetic activity as well as the solar cycle. The dynamics of the tail plasma sheet can be affected by the presence of O+ via for example the generation of instabilities such as the Kelvin-Helmholtz instability. However, the O+ is not always taken into account when studying the dynamics of the tail plasma sheet. We investigate proton (H^+) and O+ during ultra-low frequency waves (period > 5 min) in the mid-tail plasma sheet (beyond 10R_E) using Cluster data. We observe that the velocity of O+ can be significantly different from that of H^+. When occuring, this velocity difference always seems to be in the direction parallel to the magnetic field. The parallel velocity of the two species can be observed to be somewhat out of phase, meaning that while one species flows in the parallel direction, the other flows in the anti-parallel direction. Possible causes for such large discrepancies between the dynamics of O+ and H+ are discussed.
Development of high temperature fasteners using directionally solidified eutectic alloys
NASA Technical Reports Server (NTRS)
George, F. D.
1972-01-01
The suitability of the eutectics for high temperature fasteners was investigated. Material properties were determined as a function of temperature, and included shear parallel and perpendicular to the growth direction and torsion parallel to it. Techniques for fabricating typical fastener shapes included grinding, creep forming, and direct casting. Both lamellar Ni3Al-Ni3Nb and fibrous (Co,Cr,Al)-(Cr,Co)7C3 alloys showed promise as candidate materials for high temperature fastener applications. A brief evaluation of the performance of the best fabricated fastener design was made.
al3c: high-performance software for parameter inference using Approximate Bayesian Computation.
Stram, Alexander H; Marjoram, Paul; Chen, Gary K
2015-11-01
The development of Approximate Bayesian Computation (ABC) algorithms for parameter inference which are both computationally efficient and scalable in parallel computing environments is an important area of research. Monte Carlo rejection sampling, a fundamental component of ABC algorithms, is trivial to distribute over multiple processors but is inherently inefficient. While development of algorithms such as ABC Sequential Monte Carlo (ABC-SMC) help address the inherent inefficiencies of rejection sampling, such approaches are not as easily scaled on multiple processors. As a result, current Bayesian inference software offerings that use ABC-SMC lack the ability to scale in parallel computing environments. We present al3c, a C++ framework for implementing ABC-SMC in parallel. By requiring only that users define essential functions such as the simulation model and prior distribution function, al3c abstracts the user from both the complexities of parallel programming and the details of the ABC-SMC algorithm. By using the al3c framework, the user is able to scale the ABC-SMC algorithm in parallel computing environments for his or her specific application, with minimal programming overhead. al3c is offered as a static binary for Linux and OS-X computing environments. The user completes an XML configuration file and C++ plug-in template for the specific application, which are used by al3c to obtain the desired results. Users can download the static binaries, source code, reference documentation and examples (including those in this article) by visiting https://github.com/ahstram/al3c. astram@usc.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
NASA Astrophysics Data System (ADS)
Trakumas, S.; Salter, E.
2009-02-01
Adverse health effects due to exposure to airborne particles are associated with particle deposition within the human respiratory tract. Particle size, shape, chemical composition, and the individual physiological characteristics of each person determine to what depth inhaled particles may penetrate and deposit within the respiratory tract. Various particle inertial classification devices are available to fractionate airborne particles according to their aerodynamic size to approximate particle penetration through the human respiratory tract. Cyclones are most often used to sample thoracic or respirable fractions of inhaled particles. Extensive studies of different cyclonic samplers have shown, however, that the sampling characteristics of cyclones do not follow the entire selected convention accurately. In the search for a more accurate way to assess worker exposure to different fractions of inhaled dust, a novel sampler comprising several inertial impactors arranged in parallel was designed and tested. The new design includes a number of separated impactors arranged in parallel. Prototypes of respirable and thoracic samplers each comprising four impactors arranged in parallel were manufactured and tested. Results indicated that the prototype samplers followed closely the penetration characteristics for which they were designed. The new samplers were found to perform similarly for liquid and solid test particles; penetration characteristics remained unchanged even after prolonged exposure to coal mine dust at high concentration. The new parallel impactor design can be applied to approximate any monotonically decreasing penetration curve at a selected flow rate. Personal-size samplers that operate at a few L/min as well as area samplers that operate at higher flow rates can be made based on the suggested design. Performance of such samplers can be predicted with high accuracy employing well-established impaction theory.
A Theoretical Study of Cold Air Damming.
NASA Astrophysics Data System (ADS)
Xu, Qin
1990-12-01
The dynamics of cold air damming are examined analytically with a two-layer steady state model. The upper layer is a warm and saturated cross-mountain (easterly or southeasterly onshore) flow. The lower layer is a cold mountain-parallel (northerly) jet trapped on the windward (eastern) side of the mountain. The interface between the two layers represents a coastal front-a sloping inversion layer coupling the trapped cold dome with the warm onshore flow above through pressure continuity.An analytical expression is obtained for the inviscid upper-layer flow with hydrostatic and moist adiabatic approximations. Blackadar's PBL parameterization of eddy viscosity is used in the lower-layer equations. Solutions for the mountain-parallel jet and its associated secondary transverse circulation are obtained by expanding asymptotically upon a small parameter proportional to the square root of the inertial aspect ratio-the ratio between the mountain height and the radius of inertial oscillation. The geometric shape of the sloping interface is solved numerically from a differential-integral equation derived from the pressure continuity condition imposed at the interface.The observed flow structures and force balances of cold air damming events are produced qualitatively by the model. In the cold dome the mountain-parallel jet is controlled by the competition between the mountain-parallel pressure gradient and friction: the jet is stronger with smoother surfaces, higher mountains, and faster mountain-normal geostrophic winds. In the mountain-normal direction the vertically averaged force balance in the cold dome is nearly geostrophic and controls the geometric shape of the cold dome. The basic mountain-normal pressure gradient generated in the cold dome by the negative buoyancy distribution tends to flatten the sloping interface and expand the cold dome upstream against the mountain-normal pressure gradient (produced by the upper-layer onshore wind) and Coriolis force (induced by the lower-layer mountain-parallel jet). It is found that the interface slope increases and the cold dome shrinks as the Froude number and/or upstream mountain-parallel geostrophic wind increase, or as the Rossby number, upper-layer depth, and/or surface roughness length decrease, and vice versa. The cold dome will either vanish or not be in a steady state if the Froude number is large enough or the roughness length gets too small. The theoretical findings are explained physically based on detailed analyses of the force balance along the inversion interface.
Performance and Application of Parallel OVERFLOW Codes on Distributed and Shared Memory Platforms
NASA Technical Reports Server (NTRS)
Djomehri, M. Jahed; Rizk, Yehia M.
1999-01-01
The presentation discusses recent studies on the performance of the two parallel versions of the aerodynamics CFD code, OVERFLOW_MPI and _MLP. Developed at NASA Ames, the serial version, OVERFLOW, is a multidimensional Navier-Stokes flow solver based on overset (Chimera) grid technology. The code has recently been parallelized in two ways. One is based on the explicit message-passing interface (MPI) across processors and uses the _MPI communication package. This approach is primarily suited for distributed memory systems and workstation clusters. The second, termed the multi-level parallel (MLP) method, is simple and uses shared memory for all communications. The _MLP code is suitable on distributed-shared memory systems. For both methods, the message passing takes place across the processors or processes at the advancement of each time step. This procedure is, in effect, the Chimera boundary conditions update, which is done in an explicit "Jacobi" style. In contrast, the update in the serial code is done in more of the "Gauss-Sidel" fashion. The programming efforts for the _MPI code is more complicated than for the _MLP code; the former requires modification of the outer and some inner shells of the serial code, whereas the latter focuses only on the outer shell of the code. The _MPI version offers a great deal of flexibility in distributing grid zones across a specified number of processors in order to achieve load balancing. The approach is capable of partitioning zones across multiple processors or sending each zone and/or cluster of several zones into a single processor. The message passing across the processors consists of Chimera boundary and/or an overlap of "halo" boundary points for each partitioned zone. The MLP version is a new coarse-grain parallel concept at the zonal and intra-zonal levels. A grouping strategy is used to distribute zones into several groups forming sub-processes which will run in parallel. The total volume of grid points in each group are approximately balanced. A proper number of threads are initially allocated to each group, and in subsequent iterations during the run-time, the number of threads are adjusted to achieve load balancing across the processes. Each process exploits the multitasking directives already established in Overflow.
NASA Astrophysics Data System (ADS)
Osorio-Murillo, C. A.; Over, M. W.; Frystacky, H.; Ames, D. P.; Rubin, Y.
2013-12-01
A new software application called MAD# has been coupled with the HTCondor high throughput computing system to aid scientists and educators with the characterization of spatial random fields and enable understanding the spatial distribution of parameters used in hydrogeologic and related modeling. MAD# is an open source desktop software application used to characterize spatial random fields using direct and indirect information through Bayesian inverse modeling technique called the Method of Anchored Distributions (MAD). MAD relates indirect information with a target spatial random field via a forward simulation model. MAD# executes inverse process running the forward model multiple times to transfer information from indirect information to the target variable. MAD# uses two parallelization profiles according to computational resources available: one computer with multiple cores and multiple computers - multiple cores through HTCondor. HTCondor is a system that manages a cluster of desktop computers for submits serial or parallel jobs using scheduling policies, resources monitoring, job queuing mechanism. This poster will show how MAD# reduces the time execution of the characterization of random fields using these two parallel approaches in different case studies. A test of the approach was conducted using 1D problem with 400 cells to characterize saturated conductivity, residual water content, and shape parameters of the Mualem-van Genuchten model in four materials via the HYDRUS model. The number of simulations evaluated in the inversion was 10 million. Using the one computer approach (eight cores) were evaluated 100,000 simulations in 12 hours (10 million - 1200 hours approximately). In the evaluation on HTCondor, 32 desktop computers (132 cores) were used, with a processing time of 60 hours non-continuous in five days. HTCondor reduced the processing time for uncertainty characterization by a factor of 20 (1200 hours reduced to 60 hours.)
Extending generalized Kubelka-Munk to three-dimensional radiative transfer.
Sandoval, Christopher; Kim, Arnold D
2015-08-10
The generalized Kubelka-Munk (gKM) approximation is a linear transformation of the double spherical harmonics of order one (DP1) approximation of the radiative transfer equation. Here, we extend the gKM approximation to study problems in three-dimensional radiative transfer. In particular, we derive the gKM approximation for the problem of collimated beam propagation and scattering in a plane-parallel slab composed of a uniform absorbing and scattering medium. The result is an 8×8 system of partial differential equations that is much easier to solve than the radiative transfer equation. We compare the solutions of the gKM approximation with Monte Carlo simulations of the radiative transfer equation to identify the range of validity for this approximation. We find that the gKM approximation is accurate for isotropic scattering media that are sufficiently thick and much less accurate for anisotropic, forward-peaked scattering media.
Toward Webscale, Rule-Based Inference on the Semantic Web Via Data Parallelism
2013-02-01
Another work distinct from its peers is the work on approximate reasoning by Rudolph et al. [34] in which multiple inference sys- tems were combined not...Workshop Scalable Semantic Web Knowledge Base Systems, 2010, pp. 17–31. [34] S. Rudolph , T. Tserendorj, and P. Hitzler, “What is approximate reasoning...2013] [55] M. Duerst and M. Suignard. (2005, Jan .). RFC 3987 – internationalized resource identifiers (IRIs). IETF. [Online]. Available: http
Sittig, D. F.; Orr, J. A.
1991-01-01
Various methods have been proposed in an attempt to solve problems in artifact and/or alarm identification including expert systems, statistical signal processing techniques, and artificial neural networks (ANN). ANNs consist of a large number of simple processing units connected by weighted links. To develop truly robust ANNs, investigators are required to train their networks on huge training data sets, requiring enormous computing power. We implemented a parallel version of the backward error propagation neural network training algorithm in the widely portable parallel programming language C-Linda. A maximum speedup of 4.06 was obtained with six processors. This speedup represents a reduction in total run-time from approximately 6.4 hours to 1.5 hours. We conclude that use of the master-worker model of parallel computation is an excellent method for obtaining speedups in the backward error propagation neural network training algorithm. PMID:1807607
Parallel Implicit Runge-Kutta Methods Applied to Coupled Orbit/Attitude Propagation
NASA Astrophysics Data System (ADS)
Hatten, Noble; Russell, Ryan P.
2017-12-01
A variable-step Gauss-Legendre implicit Runge-Kutta (GLIRK) propagator is applied to coupled orbit/attitude propagation. Concepts previously shown to improve efficiency in 3DOF propagation are modified and extended to the 6DOF problem, including the use of variable-fidelity dynamics models. The impact of computing the stage dynamics of a single step in parallel is examined using up to 23 threads and 22 associated GLIRK stages; one thread is reserved for an extra dynamics function evaluation used in the estimation of the local truncation error. Efficiency is found to peak for typical examples when using approximately 8 to 12 stages for both serial and parallel implementations. Accuracy and efficiency compare favorably to explicit Runge-Kutta and linear-multistep solvers for representative scenarios. However, linear-multistep methods are found to be more efficient for some applications, particularly in a serial computing environment, or when parallelism can be applied across multiple trajectories.
Observations of large parallel electric fields in the auroral ionosphere
NASA Technical Reports Server (NTRS)
Mozer, F. S.
1976-01-01
Rocket borne measurements employing a double probe technique were used to gather evidence for the existence of electric fields in the auroral ionosphere having components parallel to the magnetic field direction. An analysis of possible experimental errors leads to the conclusion that no known uncertainties can account for the roughly 10 mV/m parallel electric fields that are observed.
Phase reconstruction using compressive two-step parallel phase-shifting digital holography
NASA Astrophysics Data System (ADS)
Ramachandran, Prakash; Alex, Zachariah C.; Nelleri, Anith
2018-04-01
The linear relationship between the sample complex object wave and its approximated complex Fresnel field obtained using single shot parallel phase-shifting digital holograms (PPSDH) is used in compressive sensing framework and an accurate phase reconstruction is demonstrated. It is shown that the accuracy of phase reconstruction of this method is better than that of compressive sensing adapted single exposure inline holography (SEOL) method. It is derived that the measurement model of PPSDH method retains both the real and imaginary parts of the Fresnel field but with an approximation noise and the measurement model of SEOL retains only the real part exactly equal to the real part of the complex Fresnel field and its imaginary part is completely not available. Numerical simulation is performed for CS adapted PPSDH and CS adapted SEOL and it is demonstrated that the phase reconstruction is accurate for CS adapted PPSDH and can be used for single shot digital holographic reconstruction.
Parallel programming of saccades during natural scene viewing: evidence from eye movement positions.
Wu, Esther X W; Gilani, Syed Omer; van Boxtel, Jeroen J A; Amihai, Ido; Chua, Fook Kee; Yen, Shih-Cheng
2013-10-24
Previous studies have shown that saccade plans during natural scene viewing can be programmed in parallel. This evidence comes mainly from temporal indicators, i.e., fixation durations and latencies. In the current study, we asked whether eye movement positions recorded during scene viewing also reflect parallel programming of saccades. As participants viewed scenes in preparation for a memory task, their inspection of the scene was suddenly disrupted by a transition to another scene. We examined whether saccades after the transition were invariably directed immediately toward the center or were contingent on saccade onset times relative to the transition. The results, which showed a dissociation in eye movement behavior between two groups of saccades after the scene transition, supported the parallel programming account. Saccades with relatively long onset times (>100 ms) after the transition were directed immediately toward the center of the scene, probably to restart scene exploration. Saccades with short onset times (<100 ms) moved to the center only one saccade later. Our data on eye movement positions provide novel evidence of parallel programming of saccades during scene viewing. Additionally, results from the analyses of intersaccadic intervals were also consistent with the parallel programming hypothesis.
Magnetic Fields in Blazar Jets: Jet-Alignment of Radio and Optical Polarization over 20-30 Years
NASA Astrophysics Data System (ADS)
Wills, Beverley J.; Aller, M. F.; Caldwell, C.; Aller, H. D.
2012-01-01
Blazars are highly active nuclei of distant galaxies. They produce synchrotron-emitting relativistic jets on scales of less than a parsec to many Kpc. When viewed head-on, as opposed to in the plane of the sky, the jet motion appears superluminal, and the emission is Doppler boosted. Blazars show rapid radio and optical variability in flux density and polarization. There are two types of blazars that can have strong synchrotron continua: some quasars with strong broad emission lines, and BL Lac objects with weak or undetected broad lines. We have compiled optical linear polarization measurements of more than 100 blazars, including archival data from McDonald Observatory. While the optical data are somewhat sparsely sampled, The University of Michigan Radio Astronomical Observatory observed many blazars over 20-30 years, often well-sampled over days to weeks, enabling quasi-simultaneous comparison of optical and radio polarization position angles (EVPAs). We also collected data on jet direction -- position angles of the jet component nearest the radio core. The project is unique in examining the polarization and jet behavior over many years. BL Lac objects tend to have stable optically thin EVPA in the jet direction, meaning magnetic field is perpendicular to jet flow, often interpreted as the magnetic field compressed by shocks. In quasar-blazars optical and radio EVPA often changes between parallel or perpendicular to the jet direction, even in the same object. The underlying B field of the jet is is parallel to the flow, with approximately 90 degree changes resulting from shocks. For both BL Lac objects & quasars, the scatter in EVPA usually increases from low frequencies (4.8 GHz) through 14.5 GHz through optical. The wide optical-radio frequency range allows us to investigate optical depth effects and the spatial origin of radio and optical emission.
A fault is born: The Landers-Mojave earthquake line
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nur, A.; Ron, H.
1993-04-01
The epicenter and the southern portion of the 1992 Landers earthquake fell on an approximately N-S earthquake line, defined by both epicentral locations and by the rupture directions of four previous M>5 earthquakes in the Mojave: The 1947 Manix; 1975 Galway Lake; 1979 Homestead Valley: and 1992 Joshua Tree events. Another M 5.2 earthquake epicenter in 1965 fell on this line where it intersects the Calico fault. In contrast, the northern part of the Landers rupture followed the NW-SE trending Camp Rock and parallel faults, exhibiting an apparently unusual rupture kink. The block tectonic model (Ron et al., 1984) combiningmore » fault kinematic and mechanics, explains both the alignment of the events, and their ruptures (Nur et al., 1986, 1989), as well as the Landers kink (Nur et al., 1992). Accordingly, the now NW oriented faults have rotated into their present direction away from the direction of maximum shortening, close to becoming locked, whereas a new fault set, optimally oriented relative to the direction of shortening, is developing to accommodate current crustal deformation. The Mojave-Landers line may thus be a new fault in formation. During the transition of faulting from the old, well developed and wak but poorly oriented faults to the strong, but favorably oriented new ones, both can slip simultaneously, giving rise to kinks such as Landers.« less
Wave Turning and Flow Angle in the E-Region Ionosphere
NASA Astrophysics Data System (ADS)
Young, M.; Oppenheim, M. M.; Dimant, Y. S.
2016-12-01
This work presents results of particle-in-cell (PIC) simulations of Farley-Buneman (FB) turbulence at various altitudes in the high-latitude E-region ionosphere. In that region, the FB instability regularly produces meter-scale plasma irregularities. VHF radars observe coherent echoes via Bragg scatter from wave fronts parallel or anti-parallel to the radar line of sight (LoS) but do not necessarily measure the mean direction of wave propagation. Haldoupis (1984) conducted a study of diffuse radar aurora and found that the spectral width of back-scattered power depends critically on the angle between the radar LoS and the true flow direction, called the flow angle. Knowledge of the flow angle will allow researchers to better interpret observations of coherent back-scatter. Experiments designed to observe meter-scale irregularities in the E-region ionosphere created by the FB instability typically assume that the predominant flow direction is the E×B direction. However, linear theory of Dimant and Oppenheim (2004) showed that FB waves should turn away from E×B and particle-in-cell simulations by Oppenheim and Dimant (2013) support the theory. The present study comprises a quantitative analysis of the dependence of back-scattered power, flow velocity, and spectral width as functions of the flow angle. It also demonstrates that the mean direction of meter-scale wave propagation may differ from the E×B direction by tens of degrees. The analysis includes 2-D and 3-D simulations at a range of altitudes in the auroral ionosphere. Comparison between 2-D and 3-D simulations illustrates the relative importance to the irregularity spectrum of a small but finite component in the direction parallel to B. Previous work has shown this small parallel component to be important to turbulent electron heating and nonlinear transport.
Parallel image registration with a thin client interface
NASA Astrophysics Data System (ADS)
Saiprasad, Ganesh; Lo, Yi-Jung; Plishker, William; Lei, Peng; Ahmad, Tabassum; Shekhar, Raj
2010-03-01
Despite its high significance, the clinical utilization of image registration remains limited because of its lengthy execution time and a lack of easy access. The focus of this work was twofold. First, we accelerated our course-to-fine, volume subdivision-based image registration algorithm by a novel parallel implementation that maintains the accuracy of our uniprocessor implementation. Second, we developed a thin-client computing model with a user-friendly interface to perform rigid and nonrigid image registration. Our novel parallel computing model uses the message passing interface model on a 32-core cluster. The results show that, compared with the uniprocessor implementation, the parallel implementation of our image registration algorithm is approximately 5 times faster for rigid image registration and approximately 9 times faster for nonrigid registration for the images used. To test the viability of such systems for clinical use, we developed a thin client in the form of a plug-in in OsiriX, a well-known open source PACS workstation and DICOM viewer, and used it for two applications. The first application registered the baseline and follow-up MR brain images, whose subtraction was used to track progression of multiple sclerosis. The second application registered pretreatment PET and intratreatment CT of radiofrequency ablation patients to demonstrate a new capability of multimodality imaging guidance. The registration acceleration coupled with the remote implementation using a thin client should ultimately increase accuracy, speed, and access of image registration-based interpretations in a number of diagnostic and interventional applications.
An integrated model to simulate the scattering of ultrasounds by inclusions in steels.
Darmon, Michel; Calmon, Pierre; Bèle, Bertrand
2004-04-01
We present a study performed to model and predict the ultrasonic response of alumina inclusions in steels. The Born and the extended quasistatic approximations have been applied and modified to improve their accuracy in the framework of this application. The modified Born approximation, called "doubly distorted wave (D(2)W) Born approximation" allowing to deal with various inclusion shapes, has been selected to be implemented in the CIVA software. The model reliability has been evaluated by comparison with Ying and Truell's exact analytical solution. In parallel, measurements have been carried out upon both natural and artificial alumina inclusions.
NASA Astrophysics Data System (ADS)
Maitarad, Amphawan; Poomsuk, Nattawee; Vilaivan, Chotima; Vilaivan, Tirayut; Siriwong, Khatcharin
2018-04-01
Suitable conformations for peptide nucleic acid (PNA) self-hybrids with (2‧R,4‧R)- and (2‧R,4‧S)-prolyl-(1S,2S)-2-aminocyclopentanecarboxylic acid backbones (namely, acpcPNA and epi-acpcPNA, respectively) were investigated based on molecular dynamics simulations. The results revealed that hybridization of the acpcPNA was observed only in the parallel direction, with a conformation close to the P-type structure. In contrast, self-hybrids of the epi-acpcPNA were formed in the antiparallel and parallel directions; the antiparallel duplex adopted the B-form conformation, and the parallel duplex was between B- and P-forms. The calculated binding energies and the experimental data indicate that the antiparallel epi-acpcPNA self-hybrid was more stable than the parallel duplex.
The parallel programming of voluntary and reflexive saccades.
Walker, Robin; McSorley, Eugene
2006-06-01
A novel two-step paradigm was used to investigate the parallel programming of consecutive, stimulus-elicited ('reflexive') and endogenous ('voluntary') saccades. The mean latency of voluntary saccades, made following the first reflexive saccades in two-step conditions, was significantly reduced compared to that of voluntary saccades made in the single-step control trials. The latency of the first reflexive saccades was modulated by the requirement to make a second saccade: first saccade latency increased when a second voluntary saccade was required in the opposite direction to the first saccade, and decreased when a second saccade was required in the same direction as the first reflexive saccade. A second experiment confirmed the basic effect and also showed that a second reflexive saccade may be programmed in parallel with a first voluntary saccade. The results support the view that voluntary and reflexive saccades can be programmed in parallel on a common motor map.
Blocksome, Michael A.; Mamidala, Amith R.
2013-09-03
Fencing direct memory access (`DMA`) data transfers in a parallel active messaging interface (`PAMI`) of a parallel computer, the PAMI including data communications endpoints, each endpoint including specifications of a client, a context, and a task, the endpoints coupled for data communications through the PAMI and through DMA controllers operatively coupled to segments of shared random access memory through which the DMA controllers deliver data communications deterministically, including initiating execution through the PAMI of an ordered sequence of active DMA instructions for DMA data transfers between two endpoints, effecting deterministic DMA data transfers through a DMA controller and a segment of shared memory; and executing through the PAMI, with no FENCE accounting for DMA data transfers, an active FENCE instruction, the FENCE instruction completing execution only after completion of all DMA instructions initiated prior to execution of the FENCE instruction for DMA data transfers between the two endpoints.
Blocksome, Michael A; Mamidala, Amith R
2014-02-11
Fencing direct memory access (`DMA`) data transfers in a parallel active messaging interface (`PAMI`) of a parallel computer, the PAMI including data communications endpoints, each endpoint including specifications of a client, a context, and a task, the endpoints coupled for data communications through the PAMI and through DMA controllers operatively coupled to segments of shared random access memory through which the DMA controllers deliver data communications deterministically, including initiating execution through the PAMI of an ordered sequence of active DMA instructions for DMA data transfers between two endpoints, effecting deterministic DMA data transfers through a DMA controller and a segment of shared memory; and executing through the PAMI, with no FENCE accounting for DMA data transfers, an active FENCE instruction, the FENCE instruction completing execution only after completion of all DMA instructions initiated prior to execution of the FENCE instruction for DMA data transfers between the two endpoints.
Blocksome, Michael A.; Mamidala, Amith R.
2015-07-07
Fencing direct memory access (`DMA`) data transfers in a parallel active messaging interface (`PAMI`) of a parallel computer, the PAMI including data communications endpoints, each endpoint including specifications of a client, a context, and a task, the endpoints coupled for data communications through the PAMI and through DMA controllers operatively coupled to a deterministic data communications network through which the DMA controllers deliver data communications deterministically, including initiating execution through the PAMI of an ordered sequence of active DMA instructions for DMA data transfers between two endpoints, effecting deterministic DMA data transfers through a DMA controller and the deterministic data communications network; and executing through the PAMI, with no FENCE accounting for DMA data transfers, an active FENCE instruction, the FENCE instruction completing execution only after completion of all DMA instructions initiated prior to execution of the FENCE instruction for DMA data transfers between the two endpoints.
Blocksome, Michael A.; Mamidala, Amith R.
2015-07-14
Fencing direct memory access (`DMA`) data transfers in a parallel active messaging interface (`PAMI`) of a parallel computer, the PAMI including data communications endpoints, each endpoint including specifications of a client, a context, and a task, the endpoints coupled for data communications through the PAMI and through DMA controllers operatively coupled to a deterministic data communications network through which the DMA controllers deliver data communications deterministically, including initiating execution through the PAMI of an ordered sequence of active DMA instructions for DMA data transfers between two endpoints, effecting deterministic DMA data transfers through a DMA controller and the deterministic data communications network; and executing through the PAMI, with no FENCE accounting for DMA data transfers, an active FENCE instruction, the FENCE instruction completing execution only after completion of all DMA instructions initiated prior to execution of the FENCE instruction for DMA data transfers between the two endpoints.
Picosecond phase-velocity dispersion of hypersonic phonons imaged with ultrafast electron microscopy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cremons, Daniel R.; Du, Daniel X.; Flannigan, David J.
We describe the direct imaging—with four-dimensional ultrafast electron microscopy—of the emergence, evolution, dispersion, and decay of photoexcited, hypersonic coherent acoustic phonons in nanoscale germanium wedges. Coherent strain waves generated via ultrafast in situ photoexcitation were imaged propagating with initial phase velocities of up to 35 km/s across discrete micrometer-scale crystal regions. We then observe that, while each wave front travels at a constant velocity, the entire wave train evolves with a time-varying phase-velocity dispersion, displaying a single-exponential decay to the longitudinal speed of sound (5 km/s) and with a mean lifetime of 280 ps. We also find that the wavemore » trains propagate along a single in-plane direction oriented parallel to striations introduced during specimen preparation, independent of crystallographic direction. Elastic-plate modeling indicates the dynamics arise from excitation of a single, symmetric (dilatational) guided acoustic mode. Further, by precisely determining the experiment time-zero position with a plasma-lensing method, we find that wave-front emergence occurs approximately 100 ps after femtosecond photoexcitation, which matches well with Auger recombination times in germanium. We conclude by discussing the similarities between the imaged hypersonic strain-wave dynamics and electron/hole plasma-wave dynamics in strongly photoexcited semiconductors.« less
Picosecond phase-velocity dispersion of hypersonic phonons imaged with ultrafast electron microscopy
Cremons, Daniel R.; Du, Daniel X.; Flannigan, David J.
2017-12-05
We describe the direct imaging—with four-dimensional ultrafast electron microscopy—of the emergence, evolution, dispersion, and decay of photoexcited, hypersonic coherent acoustic phonons in nanoscale germanium wedges. Coherent strain waves generated via ultrafast in situ photoexcitation were imaged propagating with initial phase velocities of up to 35 km/s across discrete micrometer-scale crystal regions. We then observe that, while each wave front travels at a constant velocity, the entire wave train evolves with a time-varying phase-velocity dispersion, displaying a single-exponential decay to the longitudinal speed of sound (5 km/s) and with a mean lifetime of 280 ps. We also find that the wavemore » trains propagate along a single in-plane direction oriented parallel to striations introduced during specimen preparation, independent of crystallographic direction. Elastic-plate modeling indicates the dynamics arise from excitation of a single, symmetric (dilatational) guided acoustic mode. Further, by precisely determining the experiment time-zero position with a plasma-lensing method, we find that wave-front emergence occurs approximately 100 ps after femtosecond photoexcitation, which matches well with Auger recombination times in germanium. We conclude by discussing the similarities between the imaged hypersonic strain-wave dynamics and electron/hole plasma-wave dynamics in strongly photoexcited semiconductors.« less
Efficient parallel and out of core algorithms for constructing large bi-directed de Bruijn graphs.
Kundeti, Vamsi K; Rajasekaran, Sanguthevar; Dinh, Hieu; Vaughn, Matthew; Thapar, Vishal
2010-11-15
Assembling genomic sequences from a set of overlapping reads is one of the most fundamental problems in computational biology. Algorithms addressing the assembly problem fall into two broad categories - based on the data structures which they employ. The first class uses an overlap/string graph and the second type uses a de Bruijn graph. However with the recent advances in short read sequencing technology, de Bruijn graph based algorithms seem to play a vital role in practice. Efficient algorithms for building these massive de Bruijn graphs are very essential in large sequencing projects based on short reads. In an earlier work, an O(n/p) time parallel algorithm has been given for this problem. Here n is the size of the input and p is the number of processors. This algorithm enumerates all possible bi-directed edges which can overlap with a node and ends up generating Θ(nΣ) messages (Σ being the size of the alphabet). In this paper we present a Θ(n/p) time parallel algorithm with a communication complexity that is equal to that of parallel sorting and is not sensitive to Σ. The generality of our algorithm makes it very easy to extend it even to the out-of-core model and in this case it has an optimal I/O complexity of Θ(nlog(n/B)Blog(M/B)) (M being the main memory size and B being the size of the disk block). We demonstrate the scalability of our parallel algorithm on a SGI/Altix computer. A comparison of our algorithm with the previous approaches reveals that our algorithm is faster--both asymptotically and practically. We demonstrate the scalability of our sequential out-of-core algorithm by comparing it with the algorithm used by VELVET to build the bi-directed de Bruijn graph. Our experiments reveal that our algorithm can build the graph with a constant amount of memory, which clearly outperforms VELVET. We also provide efficient algorithms for the bi-directed chain compaction problem. The bi-directed de Bruijn graph is a fundamental data structure for any sequence assembly program based on Eulerian approach. Our algorithms for constructing Bi-directed de Bruijn graphs are efficient in parallel and out of core settings. These algorithms can be used in building large scale bi-directed de Bruijn graphs. Furthermore, our algorithms do not employ any all-to-all communications in a parallel setting and perform better than the prior algorithms. Finally our out-of-core algorithm is extremely memory efficient and can replace the existing graph construction algorithm in VELVET.
NASA Astrophysics Data System (ADS)
Warrier, M.; Bhardwaj, U.; Hemani, H.; Schneider, R.; Mutzke, A.; Valsakumar, M. C.
2015-12-01
We report on molecular Dynamics (MD) simulations carried out in fcc Cu and bcc W using the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) code to study (i) the statistical variations in the number of interstitials and vacancies produced by energetic primary knock-on atoms (PKA) (0.1-5 keV) directed in random directions and (ii) the in-cascade cluster size distributions. It is seen that around 60-80 random directions have to be explored for the average number of displaced atoms to become steady in the case of fcc Cu, whereas for bcc W around 50-60 random directions need to be explored. The number of Frenkel pairs produced in the MD simulations are compared with that from the Binary Collision Approximation Monte Carlo (BCA-MC) code SDTRIM-SP and the results from the NRT model. It is seen that a proper choice of the damage energy, i.e. the energy required to create a stable interstitial, is essential for the BCA-MC results to match the MD results. On the computational front it is seen that in-situ processing saves the need to input/output (I/O) atomic position data of several tera-bytes when exploring a large number of random directions and there is no difference in run-time because the extra run-time in processing data is offset by the time saved in I/O.
Spatio-temporal dynamics of processing non-symbolic number: An ERP source localization study
Hyde, Daniel C.; Spelke, Elizabeth S.
2013-01-01
Coordinated studies with adults, infants, and nonhuman animals provide evidence for two distinct systems of non-verbal number representation. The ‘parallel individuation’ system selects and retains information about 1–3 individual entities and the ‘numerical magnitude’ system establishes representations of the approximate cardinal value of a group. Recent ERP work has demonstrated that these systems reliably evoke functionally and temporally distinct patterns of brain response that correspond to established behavioral signatures. However, relatively little is known about the neural generators of these ERP signatures. To address this question, we targeted known ERP signatures of these systems, by contrasting processing of small versus large non-symbolic numbers, and used a source localization algorithm (LORETA) to identify their cortical origins. Early processing of small numbers, showing the signature effects of parallel individuation on the N1 (∼150 ms), was localized primarily to extrastriate visual regions. In contrast, qualitatively and temporally distinct processing of large numbers, showing the signatures of approximate number representation on the mid-latency P2p (∼200–250 ms), was localized primarily to right intraparietal regions. In comparison, mid-latency small number processing was localized to the right temporal-parietal junction and left-lateralized intraparietal regions. These results add spatial information to the emerging ERP literature documenting the process by which we represent number. Furthermore, these results substantiate recent claims that early attentional processes determine whether a collection of objects will be represented through parallel individuation or as an approximate numerical magnitude by providing evidence that downstream processing diverges to distinct cortical regions. PMID:21830257
Heat loads on poloidal and toroidal edges of castellated plasma-facing components in COMPASS
NASA Astrophysics Data System (ADS)
Dejarnac, R.; Corre, Y.; Vondracek, P.; Gaspar, J.; Gauthier, E.; Gunn, J. P.; Komm, M.; Gardarein, J.-L.; Horacek, J.; Hron, M.; Matejicek, J.; Pitts, R. A.; Panek, R.
2018-06-01
Dedicated experiments have been performed in the COMPASS tokamak to thoroughly study the power deposition processes occurring on poloidal and toroidal edges of castellated plasma-facing components in tokamaks during steady-state L-mode conditions. Surface temperatures measured by a high resolution infra-red camera are compared with reconstructed synthetic data from a 2D thermal model using heat flux profiles derived from both the optical approximation and 2D particle-in-cell (PIC) simulations. In the case of poloidal leading edges, when the contribution from local radiation is taken into account, the parallel heat flux deduced from unperturbed, upstream measurements is fully consistent with the observed temperature increase at the leading edges of various heights, respecting power balance assuming simple projection of the parallel flux density. Smoothing of the heat flux deposition profile due to finite ion Larmor radius predicted by the PIC simulations is found to be weak and the power deposition on misaligned poloidal edges is better described by the optical approximation. This is consistent with an electron-dominated regime associated with a non-ambipolar parallel current flow. In the case of toroidal gap edges, the different contributions of the total incoming flux along the gap have been observed experimentally for the first time. They confirm the results of recent numerical studies performed for ITER showing that in specific cases the heat deposition does not necessarily follow the optical approximation. Indeed, ions can spiral onto the magnetically shadowed toroidal edge. Particle-in-cell simulations emphasize again the role played by local non-ambipolarity in the deposition pattern.
The influence of ozone and aerosols on the brightness and color of the twilight sky
NASA Technical Reports Server (NTRS)
Adams, C. N.; Plass, G. N.; Kattawar, G. W.
1974-01-01
The radiance and color of the twilight sky are calculated for single scattered radiation with the use of spherically symmetric models of the earth's atmosphere. Spherical geometry is used throughout the calculations with no plane-parallel approximations. Refraction effects are taken into account through fine subdivision of the atmosphere into spherical shells of fixed index of refraction. Snell's law of refraction is used to calculate a new direction of travel each time that a photon traverses the interface between layers. Five different models of the atmosphere were used: a pure molecular scattering atmosphere; molecular atmosphere plus ozone absorption; and three models with aerosol concentrations of one, three, and ten times normal together with molecular scattering and ozone absorption. The results of the calculations are shown for various observation positions and local viewing angles in the solar plane for wavelengths in the range from 0.40 to 0.75 micron.
The influence of ozone and aerosols on the brightness and color of the twilight zone
NASA Technical Reports Server (NTRS)
Adams, C. N.; Plass, G. N.; Kattawar, G. W.
1973-01-01
The radiance and color of the twilight sky are calculated for single scattered radiation with the use of spherically symmetric models of the earth's atmosphere. Spherical geometry is used throughout the calculations with no plane parallel approximations. Refraction effects are taken into account through fine subdivision of the atmosphere into spherical shells of fixed index of refraction. Shell's law of refraction is used to calculate a direction of travel each time that a photon traverses the interface between layers. Five different models of the atmosphere were used: a pure molecular scattering atmosphere; molecular atmosphere plus ozone absorption; and three models with aerosol concentrations of 1, 3, and 10 times normal together with molecular scattering and ozone absorption. The results of the calculations are shown for various observation positions and local viewing angles in the solar plane for wavelengths in the range of 0.40 microns to 0.75 microns.
Metabolic molecular markers of the tidal clock in the marine crustacean Eurydice pulchra
O’Neill, John Stuart; Lee, Kate D.; Zhang, Lin; Feeney, Kevin; Webster, Simon George; Blades, Matthew James; Kyriacou, Charalambos Panayiotis; Hastings, Michael Harvey; Wilcockson, David Charles
2015-01-01
Summary In contrast to the well mapped molecular orchestration of circadian timekeeping in terrestrial organisms, the mechanisms that direct tidal and lunar rhythms in marine species are entirely unknown. Using a combination of biochemical and molecular approaches we have identified a series of metabolic markers of the tidal clock of the intertidal isopod Eurydice pulchra. Specifically, we show that the overoxidation of peroxiredoxin (PRX), a conserved marker of circadian timekeeping in terrestrial eukaryotes [1], follows a circatidal (approximately 12.4 hours) pattern in E. pulchra, in register with the tidal pattern of swimming. In parallel, we show that mitochondrially encoded genes are expressed with a circatidal rhythm. Together, these findings demonstrate that PRX overoxidation rhythms are not intrinsically circadian; rather they appear to resonate with the dominant metabolic cycle of an organism, regardless of its frequency. Moreover, they provide the first molecular leads for dissecting the tidal clockwork. PMID:25898100
Combustion of hydrogen injected into a supersonic airstream (a guide to the HISS computer program)
NASA Technical Reports Server (NTRS)
Dyer, D. F.; Maples, G.; Spalding, D. B.
1976-01-01
A computer program based on a finite-difference, implicit numerical integration scheme is described for the prediction of hydrogen injected into a supersonic airstream at an angle ranging from normal to parallel to the airstream main flow direction. Results of calculations for flow and thermal property distributions were compared with 'cold flow data' taken by NASA/Langley and show excellent correlation. Typical results for equilibrium combustion are presented and exhibit qualitatively plausible behavior. Computer time required for a given case is approximately one minute on a CDC 7600. A discussion of the assumption of parabolic flow in the injection region is given which demonstrates that improvement in calculation in this region could be obtained by a partially-parabolic procedure which has been developed. It is concluded that the technique described provides an efficient and reliable means for analyzing hydrogen injection into supersonic airstreams and the subsequent combustion.
Habenicht, Carsten; Schuster, Roman; Knupfer, Martin; Büchner, Bernd
2018-05-23
We have investigated indirect excitons in bulk 2H-MoS 2 using transmission electron energy-loss spectroscopy. The electron energy-loss spectra were measured for various momentum transfer values parallel to the [Formula: see text] and [Formula: see text] directions of the Brillouin zone. The results allowed the identification of the indirect excitons between the valence band K v and conduction band Λ c points, the Γ v and K c points as well as adjacent K v and [Formula: see text] points. The energy-momentum dispersions for the K v -Λ c , Γ v -K c and K v1 -[Formula: see text] excitons along the [Formula: see text] line are presented. The former two transitions exhibit a quadratic dispersion which allowed calculating their effective exciton masses based on the effective mass approximation. The K v1 -[Formula: see text] transition follows a more linear dispersion relationship.
On iterative processes in the Krylov-Sonneveld subspaces
NASA Astrophysics Data System (ADS)
Ilin, Valery P.
2016-10-01
The iterative Induced Dimension Reduction (IDR) methods are considered for solving large systems of linear algebraic equations (SLAEs) with nonsingular nonsymmetric matrices. These approaches are investigated by many authors and are charachterized sometimes as the alternative to the classical processes of Krylov type. The key moments of the IDR algorithms consist in the construction of the embedded Sonneveld subspaces, which have the decreasing dimensions and use the orthogonalization to some fixed subspace. Other independent approaches for research and optimization of the iterations are based on the augmented and modified Krylov subspaces by using the aggregation and deflation procedures with present various low rank approximations of the original matrices. The goal of this paper is to show, that IDR method in Sonneveld subspaces present an original interpretation of the modified algorithms in the Krylov subspaces. In particular, such description is given for the multi-preconditioned semi-conjugate direction methods which are actual for the parallel algebraic domain decomposition approaches.
Toroidal gyrofluid equations for simulations of tokamak turbulence
NASA Astrophysics Data System (ADS)
Beer, M. A.; Hammett, G. W.
1996-11-01
A set of nonlinear gyrofluid equations for simulations of tokamak turbulence are derived by taking moments of the nonlinear toroidal gyrokinetic equation. The moment hierarchy is closed with approximations that model the kinetic effects of parallel Landau damping, toroidal drift resonances, and finite Larmor radius effects. These equations generalize the work of Dorland and Hammett [Phys. Fluids B 5, 812 (1993)] to toroidal geometry by including essential toroidal effects. The closures for phase mixing from toroidal ∇B and curvature drifts take the basic form presented in Waltz et al. [Phys. Fluids B 4, 3138 (1992)], but here a more rigorous procedure is used, including an extension to higher moments, which provides significantly improved accuracy. In addition, trapped ion effects and collisions are incorporated. This reduced set of nonlinear equations accurately models most of the physics considered important for ion dynamics in core tokamak turbulence, and is simple enough to be used in high resolution direct numerical simulations.
The Hanle effect applied to magnetic field measurements
NASA Technical Reports Server (NTRS)
Leroy, J. L.
1985-01-01
The Hanle effect is the modification by a local magnetic field of the polarization due to coherent scattering in spectral lines. It results from the precession of a classical oscillator about the magnetic field direction. The sophisticated quantum-mechanical treatment, which is required to compute the polarization parameters of scattered light, was developed. The main features of the Hanle effect concerning magnetic field measurements are: (1) a good sensitivity within the approximate range 0.1 B gamma rho to 10 B gamma rho where B gamma rho is the field strength yielding a Larmor period equal to the radiative lifetime, (2) there is no Hanle effect for field vectors parallel to the excitating beam, (3) the Hanle effect refers essentially to the linear polarization in a spectral line, (4) various points in the line profile are affected in the same way by change of linear polarization so that polarization parameters can be measured on the integrated line profile.
Flight- and ground-test correlation study of BMDO SDS materials: Phase 1 report
NASA Technical Reports Server (NTRS)
Chung, Shirley Y.; Brinza, David E.; Minton, Timothy K.; Stiegman, Albert E.; Kenny, James T.; Liang, Ranty H.
1993-01-01
The NASA Evaluation of Oxygen Interactions with Materials-3 (EOIM-3) experiment served as a test bed for a variety of materials that are candidates for Ballistic Missile Defense Organization (BMDO) space assets. The materials evaluated on this flight experiment were provided by BMDO contractors and technology laboratories. A parallel ground exposure evaluation was conducted using the FAST atomic-oxygen simulation facility at Physical Sciences, Inc. The EOIM-3 materials were exposed to an atomic oxygen fluence of approximately 2.3 x 10(exp 2) atoms/sq. cm. The ground-exposed materials' fluence of 2.0 - 2.5 x 10(exp 2) atoms/sq. cm permits direct comparison of ground-exposed materials' performance with that of the flight-exposed specimens. The results from the flight test conducted aboard STS-46 and the correlative ground exposure are presented in this publication.
NASA Astrophysics Data System (ADS)
Lu, San; Artemyev, A. V.; Angelopoulos, V.
2017-11-01
Magnetotail current sheet thinning is a distinctive feature of substorm growth phase, during which magnetic energy is stored in the magnetospheric lobes. Investigation of charged particle dynamics in such thinning current sheets is believed to be important for understanding the substorm energy storage and the current sheet destabilization responsible for substorm expansion phase onset. We use Time History of Events and Macroscale Interactions during Substorms (THEMIS) B and C observations in 2008 and 2009 at 18 - 25 RE to show that during magnetotail current sheet thinning, the electron temperature decreases (cooling), and the parallel temperature decreases faster than the perpendicular temperature, leading to a decrease of the initially strong electron temperature anisotropy (isotropization). This isotropization cannot be explained by pure adiabatic cooling or by pitch angle scattering. We use test particle simulations to explore the mechanism responsible for the cooling and isotropization. We find that during the thinning, a fast decrease of a parallel electric field (directed toward the Earth) can speed up the electron parallel cooling, causing it to exceed the rate of perpendicular cooling, and thus lead to isotropization, consistent with observation. If the parallel electric field is too small or does not change fast enough, the electron parallel cooling is slower than the perpendicular cooling, so the parallel electron anisotropy grows, contrary to observation. The same isotropization can also be accomplished by an increasing parallel electric field directed toward the equatorial plane. Our study reveals the existence of a large-scale parallel electric field, which plays an important role in magnetotail particle dynamics during the current sheet thinning process.
Solar Wind Proton Temperature Anisotropy: Linear Theory and WIND/SWE Observations
NASA Technical Reports Server (NTRS)
Hellinger, P.; Travnicek, P.; Kasper, J. C.; Lazarus, A. J.
2006-01-01
We present a comparison between WIND/SWE observations (Kasper et al., 2006) of beta parallel to p and T perpendicular to p/T parallel to p (where beta parallel to p is the proton parallel beta and T perpendicular to p and T parallel to p are the perpendicular and parallel proton are the perpendicular and parallel proton temperatures, respectively; here parallel and perpendicular indicate directions with respect to the ambient magnetic field) and predictions of the Vlasov linear theory. In the slow solar wind, the observed proton temperature anisotropy seems to be constrained by oblique instabilities, by the mirror one and the oblique fire hose, contrary to the results of the linear theory which predicts a dominance of the proton cyclotron instability and the parallel fire hose. The fast solar wind core protons exhibit an anticorrelation between beta parallel to c and T perpendicular to c/T parallel to c (where beta parallel to c is the core proton parallel beta and T perpendicular to c and T parallel to c are the perpendicular and parallel core proton temperatures, respectively) similar to that observed in the HELIOS data (Marsch et al., 2004).
NASA Technical Reports Server (NTRS)
Juang, Hann-Ming Henry; Tao, Wei-Kuo; Zeng, Xi-Ping; Shie, Chung-Lin; Simpson, Joanne; Lang, Steve
2004-01-01
The capability for massively parallel programming (MPP) using a message passing interface (MPI) has been implemented into a three-dimensional version of the Goddard Cumulus Ensemble (GCE) model. The design for the MPP with MPI uses the concept of maintaining similar code structure between the whole domain as well as the portions after decomposition. Hence the model follows the same integration for single and multiple tasks (CPUs). Also, it provides for minimal changes to the original code, so it is easily modified and/or managed by the model developers and users who have little knowledge of MPP. The entire model domain could be sliced into one- or two-dimensional decomposition with a halo regime, which is overlaid on partial domains. The halo regime requires that no data be fetched across tasks during the computational stage, but it must be updated before the next computational stage through data exchange via MPI. For reproducible purposes, transposing data among tasks is required for spectral transform (Fast Fourier Transform, FFT), which is used in the anelastic version of the model for solving the pressure equation. The performance of the MPI-implemented codes (i.e., the compressible and anelastic versions) was tested on three different computing platforms. The major results are: 1) both versions have speedups of about 99% up to 256 tasks but not for 512 tasks; 2) the anelastic version has better speedup and efficiency because it requires more computations than that of the compressible version; 3) equal or approximately-equal numbers of slices between the x- and y- directions provide the fastest integration due to fewer data exchanges; and 4) one-dimensional slices in the x-direction result in the slowest integration due to the need for more memory relocation for computation.
NASA Astrophysics Data System (ADS)
Li, P. H. Y.; Bishop, R. F.
2018-03-01
We implement the coupled cluster method to very high orders of approximation to study the spin-1/2 J1 -J2 Heisenberg model on a cross-striped square lattice. Every nearest-neighbour pair of sites on the square lattice has an isotropic antiferromagnetic exchange bond of strength J1 > 0 , while the basic square plaquettes in alternate columns have either both or neither next-nearest-neighbour (diagonal) pairs of sites connected by an equivalent frustrating bond of strength J2 ≡ αJ1 > 0 . By studying the magnetic order parameter (i.e., the average local on-site magnetization) in the range 0 ≤ α ≤ 1 of the frustration parameter we find that the quasiclassical antiferromagnetic Néel and (so-called) double Néel states form the stable ground-state phases in the respective regions α < α1ac = 0 . 46(1) and α > α1bc = 0.615(5) . The double Néel state has Néel (⋯ ↑↓↑↓ ⋯) ordering along the (column) direction parallel to the stripes of squares with both or no J2 bonds, and spins alternating in a pairwise (⋯ ↑↑↓↓↑↑↓↓ ⋯) fashion along the perpendicular (row) direction, so that the parallel pairs occur on squares with both J2 bonds present. Further explicit calculations of both the triplet spin gap and the zero-field uniform transverse magnetic susceptibility provide compelling evidence that the ground-state phase over all or most of the intermediate regime α1ac < α < α1bc is a gapped state with no discernible long-range magnetic order.
NASA Astrophysics Data System (ADS)
Fujii, Koki; Nomura, Fumimasa; Kaneko, Tomoyuki
2018-03-01
To investigate the optimal conditions for electrical stimulation, communities of lined-up chick embryonic cardiomyocytes were evaluated in terms of their threshold voltage for pacing (PVMin) and the half-maximum paced frequency (PF50), with a focus on the following factors: (1) the orientation of the major axis of cell communities to the electric field (EF) direction as the external factor; (2) the number of cells in a cell community, the length of the cell community, and the mean length of cells comprising the community as the internal factors. Firstly, PVMin decreased with increasing length of the cell network oriented parallel to the EF. PVMin was approximately 0.041 ± 0.025 V/mm when the community was sufficiently long. On the other hand, PVMin in the orthogonal orientation was constant at 1.7 ± 0.047 V/mm with no dependence on the length of the cell network. Secondly, we found that PF50 increased with increasing length of the cell network or the number of cells in the network; the PF50 values were 2.03 ± 0.05 and 3.39 ± 0.05 Hz when the respective cell network lengths were 100 µm (n = 43) and more than 300 µm (n = 6) and the cells were oriented parallel to the EF. These findings indicate that it is important to suppress ventricular fibrillation with minimal efficient stimulation by considering the EF direction with respect to the orientation of cardiomyocytes. Furthermore, expanded cells showed the loss of ability to respond to stimulation at higher frequencies. Cardiomyocytes combined with seeded fibroblasts as a cell network at a low density are a possible model of a ventricular remodeling heart.
Metal-organic framework assembled from erbium and a tetrapodal polyphosphonic acid organic linker.
Mendes, Ricardo F; Firmino, Ana D G; Tomé, João P C; Almeida Paz, Filipe A
2018-06-01
A three-dimensional metal-organic framework (MOF), poly[[μ 6 -5'-pentahydrogen [1,1'-biphenyl]-3,3',5,5'-tetrayltetrakis(phosphonato)]erbium(III)] 2.5-hydrate], formulated as [Er(C 12 H 11 O 12 P 4 )]·2.5H 2 O or [Er(H 5 btp)]·2.5H 2 O (I) and isotypical with a Y 3+ -based MOF reported previously by our research group [Firmino et al. (2017b). Inorg. Chem. 56, 1193-1208], was constructed based solely on Er 3+ and on the polyphosphonic organic linker [1,1'-biphenyl]-3,3',5,5'-tetrakis(phosphonic acid) (H 8 btp). The present work describes our efforts to introduce lanthanide cations into the flexible network, demonstrating that, on the one hand, the compound can be obtained using three distinct experimental methods, i.e. hydro(solvo)thermal (Hy), microwave-assisted (MW) and one-pot (Op), and, on the other hand, that crystallite size can be approximately fine-tuned according to the method employed. MOF I contains hexacoordinated Er 3+ cations which are distributed in a zigzag inorganic chain running parallel to the [100] direction of the unit cell. The chains are, in turn, bridged by the anionic organic linker to form a three-dimensional 6,6-connected binodal network. This connectivity leads to the existence of one-dimensional channels (also running parallel to the [100] direction) filled with disordered and partially occupied water molecules of crystalization which are engaged in O-H...O hydrogen-bonding interactions with the [Er(H 5 btp)] framework. Additional weak π-π interactions [intercentroid distance = 3.957 (7) Å] exist between aromatic rings, which help to maintain the structural integrity of the network.
Directions in parallel programming: HPF, shared virtual memory and object parallelism in pC++
NASA Technical Reports Server (NTRS)
Bodin, Francois; Priol, Thierry; Mehrotra, Piyush; Gannon, Dennis
1994-01-01
Fortran and C++ are the dominant programming languages used in scientific computation. Consequently, extensions to these languages are the most popular for programming massively parallel computers. We discuss two such approaches to parallel Fortran and one approach to C++. The High Performance Fortran Forum has designed HPF with the intent of supporting data parallelism on Fortran 90 applications. HPF works by asking the user to help the compiler distribute and align the data structures with the distributed memory modules in the system. Fortran-S takes a different approach in which the data distribution is managed by the operating system and the user provides annotations to indicate parallel control regions. In the case of C++, we look at pC++ which is based on a concurrent aggregate parallel model.
Permeability of stylolite-bearing chalk
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lind, I.; Nykjaer, O.; Priisholm, S.
1994-11-01
Permeabilities were measured on core plugs from stylolite-bearing chalk of the Gorm field in the Danish North Sea. Air and liquid permeabilities were measured in directions parallel to and perpendicular to the stylolite surface. Permeability was measured with sleeve pressure equal to in-situ reservoir stress. Permeabilities of plugs with stylolites but without stylolite-associated fractures were equal in the two directions. The permeability is equal to the matrix permeability of non-stylolite-bearing chalk. In contrast, when fractures were associated with the stylolites, permeability was enhanced. The enhancement was most significant in the horizontal direction parallel to the stylolites.
NASA Astrophysics Data System (ADS)
Gramajo, A. A.; Della Picca, R.; Arbó, D. G.
2017-08-01
We present a theoretical study of ionization of the hydrogen atom due to an XUV pulse in the presence of an infrared (IR) laser with both fields linearly polarized in the same direction. In particular, we study the energy distribution of photoelectrons emitted perpendicularly to the polarization direction. As we previously showed in Gramajo et al. [Phys. Rev. A 94, 053404 (2016), 10.1103/PhysRevA.94.053404] for parallel emission, by means of a very simple semiclassical model which considers electron trajectories born at different ionization times, the electron energy spectrum can be interpreted as the interplay of intra- and intercycle interferences. However, contrary to the case of parallel emission the intracycle interference pattern stems from the coherent superposition of four electron trajectories giving rise to (i) interference of electron trajectories born during the same half cycle (intra-half-cycle interference) and (ii) interference between electron trajectories born during the first half cycle with those born during the second half cycle (inter-half-cycle interference). The intercycle interference is responsible for the formation of the sidebands. We also show that the destructive inter-half-cycle interference for the absorption and emission of an even number of IR laser photons is responsible for the characteristic sidebands in the perpendicular direction separated by twice the IR photon energy. This contrasts with the emission along the polarization axis (all sideband orders are present) since intra-half-cycle interferences do not exist in that case. The intracycle interference pattern works as a modulation of the sidebands and, in the same way, it is modulated by the intra-half-cycle interference pattern. We analyze the dependence of the energy spectrum on the laser intensity and the time delay between the XUV pulse and the IR laser. Finally, we show that our semiclassical simulations are in very good agreement with quantum calculations within the strong-field approximation and the numerical solution of the time-dependent Schrödinger equation, giving rise to nonzero emission, in contraposition to other theories.
Considering Horn's Parallel Analysis from a Random Matrix Theory Point of View.
Saccenti, Edoardo; Timmerman, Marieke E
2017-03-01
Horn's parallel analysis is a widely used method for assessing the number of principal components and common factors. We discuss the theoretical foundations of parallel analysis for principal components based on a covariance matrix by making use of arguments from random matrix theory. In particular, we show that (i) for the first component, parallel analysis is an inferential method equivalent to the Tracy-Widom test, (ii) its use to test high-order eigenvalues is equivalent to the use of the joint distribution of the eigenvalues, and thus should be discouraged, and (iii) a formal test for higher-order components can be obtained based on a Tracy-Widom approximation. We illustrate the performance of the two testing procedures using simulated data generated under both a principal component model and a common factors model. For the principal component model, the Tracy-Widom test performs consistently in all conditions, while parallel analysis shows unpredictable behavior for higher-order components. For the common factor model, including major and minor factors, both procedures are heuristic approaches, with variable performance. We conclude that the Tracy-Widom procedure is preferred over parallel analysis for statistically testing the number of principal components based on a covariance matrix.
A Parallel Adaboost-Backpropagation Neural Network for Massive Image Dataset Classification
NASA Astrophysics Data System (ADS)
Cao, Jianfang; Chen, Lichao; Wang, Min; Shi, Hao; Tian, Yun
2016-12-01
Image classification uses computers to simulate human understanding and cognition of images by automatically categorizing images. This study proposes a faster image classification approach that parallelizes the traditional Adaboost-Backpropagation (BP) neural network using the MapReduce parallel programming model. First, we construct a strong classifier by assembling the outputs of 15 BP neural networks (which are individually regarded as weak classifiers) based on the Adaboost algorithm. Second, we design Map and Reduce tasks for both the parallel Adaboost-BP neural network and the feature extraction algorithm. Finally, we establish an automated classification model by building a Hadoop cluster. We use the Pascal VOC2007 and Caltech256 datasets to train and test the classification model. The results are superior to those obtained using traditional Adaboost-BP neural network or parallel BP neural network approaches. Our approach increased the average classification accuracy rate by approximately 14.5% and 26.0% compared to the traditional Adaboost-BP neural network and parallel BP neural network, respectively. Furthermore, the proposed approach requires less computation time and scales very well as evaluated by speedup, sizeup and scaleup. The proposed approach may provide a foundation for automated large-scale image classification and demonstrates practical value.
A Parallel Adaboost-Backpropagation Neural Network for Massive Image Dataset Classification.
Cao, Jianfang; Chen, Lichao; Wang, Min; Shi, Hao; Tian, Yun
2016-12-01
Image classification uses computers to simulate human understanding and cognition of images by automatically categorizing images. This study proposes a faster image classification approach that parallelizes the traditional Adaboost-Backpropagation (BP) neural network using the MapReduce parallel programming model. First, we construct a strong classifier by assembling the outputs of 15 BP neural networks (which are individually regarded as weak classifiers) based on the Adaboost algorithm. Second, we design Map and Reduce tasks for both the parallel Adaboost-BP neural network and the feature extraction algorithm. Finally, we establish an automated classification model by building a Hadoop cluster. We use the Pascal VOC2007 and Caltech256 datasets to train and test the classification model. The results are superior to those obtained using traditional Adaboost-BP neural network or parallel BP neural network approaches. Our approach increased the average classification accuracy rate by approximately 14.5% and 26.0% compared to the traditional Adaboost-BP neural network and parallel BP neural network, respectively. Furthermore, the proposed approach requires less computation time and scales very well as evaluated by speedup, sizeup and scaleup. The proposed approach may provide a foundation for automated large-scale image classification and demonstrates practical value.
Sublattice parallel replica dynamics.
Martínez, Enrique; Uberuaga, Blas P; Voter, Arthur F
2014-06-01
Exascale computing presents a challenge for the scientific community as new algorithms must be developed to take full advantage of the new computing paradigm. Atomistic simulation methods that offer full fidelity to the underlying potential, i.e., molecular dynamics (MD) and parallel replica dynamics, fail to use the whole machine speedup, leaving a region in time and sample size space that is unattainable with current algorithms. In this paper, we present an extension of the parallel replica dynamics algorithm [A. F. Voter, Phys. Rev. B 57, R13985 (1998)] by combining it with the synchronous sublattice approach of Shim and Amar [ and , Phys. Rev. B 71, 125432 (2005)], thereby exploiting event locality to improve the algorithm scalability. This algorithm is based on a domain decomposition in which events happen independently in different regions in the sample. We develop an analytical expression for the speedup given by this sublattice parallel replica dynamics algorithm and compare it with parallel MD and traditional parallel replica dynamics. We demonstrate how this algorithm, which introduces a slight additional approximation of event locality, enables the study of physical systems unreachable with traditional methodologies and promises to better utilize the resources of current high performance and future exascale computers.
A Parallel Adaboost-Backpropagation Neural Network for Massive Image Dataset Classification
Cao, Jianfang; Chen, Lichao; Wang, Min; Shi, Hao; Tian, Yun
2016-01-01
Image classification uses computers to simulate human understanding and cognition of images by automatically categorizing images. This study proposes a faster image classification approach that parallelizes the traditional Adaboost-Backpropagation (BP) neural network using the MapReduce parallel programming model. First, we construct a strong classifier by assembling the outputs of 15 BP neural networks (which are individually regarded as weak classifiers) based on the Adaboost algorithm. Second, we design Map and Reduce tasks for both the parallel Adaboost-BP neural network and the feature extraction algorithm. Finally, we establish an automated classification model by building a Hadoop cluster. We use the Pascal VOC2007 and Caltech256 datasets to train and test the classification model. The results are superior to those obtained using traditional Adaboost-BP neural network or parallel BP neural network approaches. Our approach increased the average classification accuracy rate by approximately 14.5% and 26.0% compared to the traditional Adaboost-BP neural network and parallel BP neural network, respectively. Furthermore, the proposed approach requires less computation time and scales very well as evaluated by speedup, sizeup and scaleup. The proposed approach may provide a foundation for automated large-scale image classification and demonstrates practical value. PMID:27905520
Yoneda, Arata; Ito, Takuya; Higaki, Takumi; Kutsuna, Natsumaro; Saito, Tamio; Ishimizu, Takeshi; Osada, Hiroyuki; Hasezawa, Seiichiro; Matsui, Minami; Demura, Taku
2010-11-01
Cellulose and pectin are major components of primary cell walls in plants, and it is believed that their mechanical properties are important for cell morphogenesis. It has been hypothesized that cortical microtubules guide the movement of cellulose microfibril synthase in a direction parallel with the microtubules, but the mechanism by which this alignment occurs remains unclear. We have previously identified cobtorin as an inhibitor that perturbs the parallel relationship between cortical microtubules and nascent cellulose microfibrils. In this study, we searched for the protein target of cobtorin, and we found that overexpression of pectin methylesterase and polygalacturonase suppressed the cobtorin-induced cell-swelling phenotype. Furthermore, treatment with polygalacturonase restored the deposition of cellulose microfibrils in the direction parallel with cortical microtubules, and cobtorin perturbed the distribution of methylated pectin. These results suggest that control over the properties of pectin is important for the deposition of cellulose microfibrils and/or the maintenance of their orientation parallel with the cortical microtubules. © 2010 The Authors. The Plant Journal © 2010 Blackwell Publishing Ltd.
Direct application of Padé approximant for solving nonlinear differential equations.
Vazquez-Leal, Hector; Benhammouda, Brahim; Filobello-Nino, Uriel; Sarmiento-Reyes, Arturo; Jimenez-Fernandez, Victor Manuel; Garcia-Gervacio, Jose Luis; Huerta-Chua, Jesus; Morales-Mendoza, Luis Javier; Gonzalez-Lee, Mario
2014-01-01
This work presents a direct procedure to apply Padé method to find approximate solutions for nonlinear differential equations. Moreover, we present some cases study showing the strength of the method to generate highly accurate rational approximate solutions compared to other semi-analytical methods. The type of tested nonlinear equations are: a highly nonlinear boundary value problem, a differential-algebraic oscillator problem, and an asymptotic problem. The high accurate handy approximations obtained by the direct application of Padé method shows the high potential if the proposed scheme to approximate a wide variety of problems. What is more, the direct application of the Padé approximant aids to avoid the previous application of an approximative method like Taylor series method, homotopy perturbation method, Adomian Decomposition method, homotopy analysis method, variational iteration method, among others, as tools to obtain a power series solutions to post-treat with the Padé approximant. 34L30.
Generation of low-divergence laser beams
Kronberg, J.W.
1993-09-14
Apparatus for transforming a conventional beam of coherent light, having a Gaussian energy distribution and relatively high divergence, into a beam in which the energy distribution approximates a single, non-zero-order Bessel function and which therefore has much lower divergence. The apparatus comprises a zone plate having transmitting and reflecting zones defined by the pattern of light interference produced by the combination of a beam of coherent light with a Gaussian energy distribution and one having such a Bessel distribution. The interference pattern between the two beams is a concentric array of multiple annuli, and is preferably recorded as a hologram. The hologram is then used to form the transmitting and reflecting zones by photo-etching portions of a reflecting layer deposited on a plate made of a transmitting material. A Bessel beam, containing approximately 50% of the energy of the incident beam, is produced by passing a Gaussian beam through such a Bessel zone plate. The reflected beam, also containing approximately 50% of the incident beam energy and having a Bessel energy distribution, can be redirected in the same direction and parallel to the transmitted beam. Alternatively, a filter similar to the Bessel zone plate can be placed within the resonator cavity of a conventional laser system having a front mirror and a rear mirror, preferably axially aligned with the mirrors and just inside the front mirror to generate Bessel energy distribution light beams at the laser source. 11 figures.
Parallel adaptive discontinuous Galerkin approximation for thin layer avalanche modeling
NASA Astrophysics Data System (ADS)
Patra, A. K.; Nichita, C. C.; Bauer, A. C.; Pitman, E. B.; Bursik, M.; Sheridan, M. F.
2006-08-01
This paper describes the development of highly accurate adaptive discontinuous Galerkin schemes for the solution of the equations arising from a thin layer type model of debris flows. Such flows have wide applicability in the analysis of avalanches induced by many natural calamities, e.g. volcanoes, earthquakes, etc. These schemes are coupled with special parallel solution methodologies to produce a simulation tool capable of very high-order numerical accuracy. The methodology successfully replicates cold rock avalanches at Mount Rainier, Washington and hot volcanic particulate flows at Colima Volcano, Mexico.
Ion acceleration and heating by kinetic Alfvén waves associated with magnetic reconnection
NASA Astrophysics Data System (ADS)
Liang, Ji; Lin, Yu; Johnson, Jay R.; Wang, Zheng-Xiong; Wang, Xueyi
2017-10-01
Our previous study on the generation and signatures of kinetic Alfvén waves (KAWs) associated with magnetic reconnection in a current sheet revealed that KAWs are a common feature during reconnection [Liang et al. J. Geophys. Res.: Space Phys. 121, 6526 (2016)]. In this paper, ion acceleration and heating by the KAWs generated during magnetic reconnection are investigated with a three-dimensional (3-D) hybrid model. It is found that in the outflow region, a fraction of inflow ions are accelerated by the KAWs generated in the leading bulge region of reconnection, and their parallel velocities gradually increase up to slightly super-Alfvénic. As a result of wave-particle interactions, an accelerated ion beam forms in the direction of the anti-parallel magnetic field, in addition to the core ion population, leading to the development of non-Maxwellian velocity distributions, which include a trapped population with parallel velocities consistent with the wave speed. The ions are heated in both parallel and perpendicular directions. In the parallel direction, the heating results from nonlinear Landau resonance of trapped ions. In the perpendicular direction, however, evidence of stochastic heating by the KAWs is found during the acceleration stage, with an increase of magnetic moment μ. The coherence in the perpendicular ion temperature T⊥ and the perpendicular electric and magnetic fields of KAWs also provides evidence for perpendicular heating by KAWs. The parallel and perpendicular heating of the accelerated beam occur simultaneously, leading to the development of temperature anisotropy with T⊥>T∥ . The heating rate agrees with the damping rate of the KAWs, and the heating is dominated by the accelerated ion beam. In the later stage, with the increase of the fraction of the accelerated ions, interaction between the accelerated beam and the core population also contributes to the ion heating, ultimately leading to overlap of the beams and an overall anisotropy with T∥>T⊥ .
The Tera Multithreaded Architecture and Unstructured Meshes
NASA Technical Reports Server (NTRS)
Bokhari, Shahid H.; Mavriplis, Dimitri J.
1998-01-01
The Tera Multithreaded Architecture (MTA) is a new parallel supercomputer currently being installed at San Diego Supercomputing Center (SDSC). This machine has an architecture quite different from contemporary parallel machines. The computational processor is a custom design and the machine uses hardware to support very fine grained multithreading. The main memory is shared, hardware randomized and flat. These features make the machine highly suited to the execution of unstructured mesh problems, which are difficult to parallelize on other architectures. We report the results of a study carried out during July-August 1998 to evaluate the execution of EUL3D, a code that solves the Euler equations on an unstructured mesh, on the 2 processor Tera MTA at SDSC. Our investigation shows that parallelization of an unstructured code is extremely easy on the Tera. We were able to get an existing parallel code (designed for a shared memory machine), running on the Tera by changing only the compiler directives. Furthermore, a serial version of this code was compiled to run in parallel on the Tera by judicious use of directives to invoke the "full/empty" tag bits of the machine to obtain synchronization. This version achieves 212 and 406 Mflop/s on one and two processors respectively, and requires no attention to partitioning or placement of data issues that would be of paramount importance in other parallel architectures.
Efficiency of parallel direct optimization
NASA Technical Reports Server (NTRS)
Janies, D. A.; Wheeler, W. C.
2001-01-01
Tremendous progress has been made at the level of sequential computation in phylogenetics. However, little attention has been paid to parallel computation. Parallel computing is particularly suited to phylogenetics because of the many ways large computational problems can be broken into parts that can be analyzed concurrently. In this paper, we investigate the scaling factors and efficiency of random addition and tree refinement strategies using the direct optimization software, POY, on a small (10 slave processors) and a large (256 slave processors) cluster of networked PCs running LINUX. These algorithms were tested on several data sets composed of DNA and morphology ranging from 40 to 500 taxa. Various algorithms in POY show fundamentally different properties within and between clusters. All algorithms are efficient on the small cluster for the 40-taxon data set. On the large cluster, multibuilding exhibits excellent parallel efficiency, whereas parallel building is inefficient. These results are independent of data set size. Branch swapping in parallel shows excellent speed-up for 16 slave processors on the large cluster. However, there is no appreciable speed-up for branch swapping with the further addition of slave processors (>16). This result is independent of data set size. Ratcheting in parallel is efficient with the addition of up to 32 processors in the large cluster. This result is independent of data set size. c2001 The Willi Hennig Society.
Voltage and Current Clamp Transients with Membrane Dielectric Loss
Fitzhugh, R.; Cole, K. S.
1973-01-01
Transient responses of a space-clamped squid axon membrane to step changes of voltage or current are often approximated by exponential functions of time, corresponding to a series resistance and a membrane capacity of 1.0 μF/cm2. Curtis and Cole (1938, J. Gen. Physiol. 21:757) found, however, that the membrane had a constant phase angle impedance z = z1(jωτ)-α, with a mean α = 0.85. (α = 1.0 for an ideal capacitor; α < 1.0 may represent dielectric loss.) This result is supported by more recently published experimental data. For comparison with experiments, we have computed functions expressing voltage and current transients with constant phase angle capacitance, a parallel leakage conductance, and a series resistance, at nine values of α from 0.5 to 1.0. A series in powers of tα provided a good approximation for short times; one in powers of t-α, for long times; for intermediate times, a rational approximation matching both series for a finite number of terms was used. These computations may help in determining experimental series resistances and parallel leakage conductances from membrane voltage or current clamp data. PMID:4754194
The influence of foot position on scrum kinetics during machine scrummaging.
Bayne, Helen; Kat, Cor-Jacques
2018-05-23
The purpose of this study was to investigate the effect of variations in the alignment of the feet on scrum kinetics during machine scrummaging. Twenty nine rugby forwards from amateur-level teams completed maximal scrum efforts against an instrumented scrum machine, with the feet in parallel and non-parallel positions. Three-dimensional forces, the moment about the vertical axis and sagittal plane joint angles were measured during the sustained pushing phase. There was a decrease in the magnitude of the resultant force and compression force in both of the non-parallel conditions compared to parallel and larger compression forces were associated with more extended hip and knee angles. Scrummaging with the left foot forward resulted in the lateral force being directed more towards the left and the turning moment becoming more clockwise. These directional changes were reversed when scrummaging with the right foot forward. Scrummaging with the right foot positioned ahead of the left may serve to counteract the natural clockwise wheel of the live scrum and could be used to achieve an anti-clockwise rotation of the scrum for tactical reasons. However, this would be associated with lower resultant forces and a greater lateral shear force component directed towards the right.
Controllable spin polarization and spin filtering in a zigzag silicene nanoribbon
DOE Office of Scientific and Technical Information (OSTI.GOV)
Farokhnezhad, Mohsen, E-mail: Mohsen-farokhnezhad@physics.iust.ac.ir; Esmaeilzadeh, Mahdi, E-mail: mahdi@iust.ac.ir; Pournaghavi, Nezhat
2015-05-07
Using non-equilibrium Green's function, we study the spin-dependent electron transport properties in a zigzag silicene nanoribbon. To produce and control spin polarization, it is assumed that two ferromagnetic strips are deposited on the both edges of the silicene nanoribbon and an electric field is perpendicularly applied to the nanoribbon plane. The spin polarization is studied for both parallel and anti-parallel configurations of exchange magnetic fields induced by the ferromagnetic strips. We find that complete spin polarization can take place in the presence of perpendicular electric field for anti-parallel configuration and the nanoribbon can work as a perfect spin filter. Themore » spin direction of transmitted electrons can be easily changed from up to down and vice versa by reversing the electric field direction. For parallel configuration, perfect spin filtering can occur even in the absence of electric field. In this case, the spin direction can be changed by changing the electron energy. Finally, we investigate the effects of nonmagnetic Anderson disorder on spin dependent conductance and find that the perfect spin filtering properties of nanoribbon are destroyed by strong disorder, but the nanoribbon retains these properties in the presence of weak disorder.« less
NASA Technical Reports Server (NTRS)
Mishchenko, Michael I.; Zakharova, Nadia T.
1999-01-01
Many remote sensing applications rely on accurate knowledge of the bidirectional reflection function (BRF) of surfaces composed of discrete, randomly positioned scattering particles. Theoretical computations of BRFs for plane-parallel particulate layers are usually reduced to solving the radiative transfer equation (RTE) using one of existing exact or approximate techniques. Since semi-empirical approximate approaches are notorious for their low accuracy, violation of the energy conservation law, and ability to produce unphysical results, the use of numerically exact solutions of RTE has gained justified popularity. For example, the computation of BRFs for macroscopically flat particulate surfaces in many geophysical publications is based on the adding-doubling (AD) and discrete ordinate (DO) methods. A further saving of computer resources can be achieved by using a more efficient technique to solve the plane-parallel RTE than the AD and DO methods. Since many natural particulate surfaces can be well represented by the model of an optically semi-infinite, homogeneous scattering layer, one can find the BRF directly by solving the Ambartsumian's nonlinear integral equation using a simple iterative technique. In this way, the computation of the internal radiation field is avoided and the computer code becomes highly efficient and very accurate and compact. Furthermore, the BRF thus obtained fully obeys the fundamental physical laws of energy conservation and reciprocity. In this paper, we discuss numerical aspects and the computer implementation of this technique, examine the applicability of the Henyey-Greenstein phase function and the sigma-Eddington approximation in BRF and flux calculations, and describe sample applications demonstrating the potential effect of particle shape on the bidirectional reflectance of flat regolith surfaces. Although the effects of packing density and coherent backscattering are currently neglected, they can also be incorporated. The FORTRAN implementation of the technique is available on the World Wide Web, and can be applied to a wide range of remote sensing problems. BRF computations for undulated (macroscopically rough) surfaces are more complicated and often rely on time consuming Monte Carlo procedures. This approach is especially inefficient for optically thick, weakly absorbing media (e.g., snow and desert surfaces at visible wavelengths since a photon may undergo many internal scattering events before it exists the medium or is absorbed. However, undulated surfaces can often be represented as collections of locally flat tilted facets characterized by the BRF found from the traditional plane parallel RTE. In this way the MOnte Carlo procedure could be used only to evaluate the effects of surface shadowing and multiple surface reflections, thereby bypassing the time-consuming ray tracing inside the medium and providing a great savings of CPU time.
Qiu, Gongzhe
2017-01-01
Due to the symmetry of conventional periodic-permanent-magnet electromagnetic acoustic transducers (PPM EMATs), two shear (SH) waves can be generated and propagated simultaneously in opposite directions, which makes the signal recognition and interpretation complicatedly. Thus, this work presents a new SH wave PPM EMAT design, rotating the parallel line sources to realize the wave beam focusing in a single-direction. The theoretical model of distributed line sources was deduced firstly, and the effects of some parameters, such as the inner coil width, adjacent line sources spacing and the angle between parallel line sources, on SH wave focusing and directivity were studied mainly with the help of 3D FEM. Employing the proposed PPM EMATs, some experiments are carried out to verify the reliability of FEM simulation. The results indicate that rotating the parallel line sources can strength the wave on the closing side of line sources, decreasing the inner coil width and the adjacent line sources spacing can improve the amplitude and directivity of signals excited by transducers. Compared with traditional PPM EMATs, both the capacity of unidirectional excitation and directivity of the proposed PPM EMATs are improved significantly. PMID:29186790
Song, Xiaochun; Qiu, Gongzhe
2017-11-24
Due to the symmetry of conventional periodic-permanent-magnet electromagnetic acoustic transducers (PPM EMATs), two shear (SH) waves can be generated and propagated simultaneously in opposite directions, which makes the signal recognition and interpretation complicatedly. Thus, this work presents a new SH wave PPM EMAT design, rotating the parallel line sources to realize the wave beam focusing in a single-direction. The theoretical model of distributed line sources was deduced firstly, and the effects of some parameters, such as the inner coil width, adjacent line sources spacing and the angle between parallel line sources, on SH wave focusing and directivity were studied mainly with the help of 3D FEM. Employing the proposed PPM EMATs, some experiments are carried out to verify the reliability of FEM simulation. The results indicate that rotating the parallel line sources can strength the wave on the closing side of line sources, decreasing the inner coil width and the adjacent line sources spacing can improve the amplitude and directivity of signals excited by transducers. Compared with traditional PPM EMATs, both the capacity of unidirectional excitation and directivity of the proposed PPM EMATs are improved significantly.
Some fast elliptic solvers on parallel architectures and their complexities
NASA Technical Reports Server (NTRS)
Gallopoulos, E.; Saad, Y.
1989-01-01
The discretization of separable elliptic partial differential equations leads to linear systems with special block tridiagonal matrices. Several methods are known to solve these systems, the most general of which is the Block Cyclic Reduction (BCR) algorithm which handles equations with nonconstant coefficients. A method was recently proposed to parallelize and vectorize BCR. In this paper, the mapping of BCR on distributed memory architectures is discussed, and its complexity is compared with that of other approaches including the Alternating-Direction method. A fast parallel solver is also described, based on an explicit formula for the solution, which has parallel computational compelxity lower than that of parallel BCR.
Some fast elliptic solvers on parallel architectures and their complexities
NASA Technical Reports Server (NTRS)
Gallopoulos, E.; Saad, Youcef
1989-01-01
The discretization of separable elliptic partial differential equations leads to linear systems with special block triangular matrices. Several methods are known to solve these systems, the most general of which is the Block Cyclic Reduction (BCR) algorithm which handles equations with nonconsistant coefficients. A method was recently proposed to parallelize and vectorize BCR. Here, the mapping of BCR on distributed memory architectures is discussed, and its complexity is compared with that of other approaches, including the Alternating-Direction method. A fast parallel solver is also described, based on an explicit formula for the solution, which has parallel computational complexity lower than that of parallel BCR.
NASA Technical Reports Server (NTRS)
Campbell, David; Wysong, Ingrid; Kaplan, Carolyn; Mott, David; Wadsworth, Dean; VanGilder, Douglas
2000-01-01
An AFRL/NRL team has recently been selected to develop a scalable, parallel, reacting, multidimensional (SUPREM) Direct Simulation Monte Carlo (DSMC) code for the DoD user community under the High Performance Computing Modernization Office (HPCMO) Common High Performance Computing Software Support Initiative (CHSSI). This paper will introduce the JANNAF Exhaust Plume community to this three-year development effort and present the overall goals, schedule, and current status of this new code.
Choi, Yun Ho; Yoo, Sung Jin
2017-03-28
A minimal-approximation-based distributed adaptive consensus tracking approach is presented for strict-feedback multiagent systems with unknown heterogeneous nonlinearities and control directions under a directed network. Existing approximation-based consensus results for uncertain nonlinear multiagent systems in lower-triangular form have used multiple function approximators in each local controller to approximate unmatched nonlinearities of each follower. Thus, as the follower's order increases, the number of the approximators used in its local controller increases. However, the proposed approach employs only one function approximator to construct the local controller of each follower regardless of the order of the follower. The recursive design methodology using a new error transformation is derived for the proposed minimal-approximation-based design. Furthermore, a bounding lemma on parameters of Nussbaum functions is presented to handle the unknown control direction problem in the minimal-approximation-based distributed consensus tracking framework and the stability of the overall closed-loop system is rigorously analyzed in the Lyapunov sense.
Dusty Cloud Acceleration by Radiation Pressure in Rapidly Star-forming Galaxies
NASA Astrophysics Data System (ADS)
Zhang, Dong; Davis, Shane W.; Jiang, Yan-Fei; Stone, James M.
2018-02-01
We perform two-dimensional and three-dimensional radiation hydrodynamic simulations to study cold clouds accelerated by radiation pressure on dust in the environment of rapidly star-forming galaxies dominated by infrared flux. We utilize the reduced speed of light approximation to solve the frequency-averaged, time-dependent radiative transfer equation. We find that radiation pressure is capable of accelerating the clouds to hundreds of kilometers per second while remaining dense and cold, consistent with observations. We compare these results to simulations where acceleration is provided by entrainment in a hot wind, where the momentum injection of the hot flow is comparable to the momentum in the radiation field. We find that the survival time of the cloud accelerated by the radiation field is significantly longer than that of a cloud entrained in a hot outflow. We show that the dynamics of the irradiated cloud depends on the initial optical depth, temperature of the cloud, and intensity of the flux. Additionally, gas pressure from the background may limit cloud acceleration if the density ratio between the cloud and background is ≲ {10}2. In general, a 10 pc-scale optically thin cloud forms a pancake structure elongated perpendicular to the direction of motion, while optically thick clouds form a filamentary structure elongated parallel to the direction of motion. The details of accelerated cloud morphology and geometry can also be affected by other factors, such as the cloud lengthscale, reduced speed of light approximation, spatial resolution, initial cloud structure, and dimensionality of the run, but these have relatively little affect on the cloud velocity or survival time.
Testing New Programming Paradigms with NAS Parallel Benchmarks
NASA Technical Reports Server (NTRS)
Jin, H.; Frumkin, M.; Schultz, M.; Yan, J.
2000-01-01
Over the past decade, high performance computing has evolved rapidly, not only in hardware architectures but also with increasing complexity of real applications. Technologies have been developing to aim at scaling up to thousands of processors on both distributed and shared memory systems. Development of parallel programs on these computers is always a challenging task. Today, writing parallel programs with message passing (e.g. MPI) is the most popular way of achieving scalability and high performance. However, writing message passing programs is difficult and error prone. Recent years new effort has been made in defining new parallel programming paradigms. The best examples are: HPF (based on data parallelism) and OpenMP (based on shared memory parallelism). Both provide simple and clear extensions to sequential programs, thus greatly simplify the tedious tasks encountered in writing message passing programs. HPF is independent of memory hierarchy, however, due to the immaturity of compiler technology its performance is still questionable. Although use of parallel compiler directives is not new, OpenMP offers a portable solution in the shared-memory domain. Another important development involves the tremendous progress in the internet and its associated technology. Although still in its infancy, Java promisses portability in a heterogeneous environment and offers possibility to "compile once and run anywhere." In light of testing these new technologies, we implemented new parallel versions of the NAS Parallel Benchmarks (NPBs) with HPF and OpenMP directives, and extended the work with Java and Java-threads. The purpose of this study is to examine the effectiveness of alternative programming paradigms. NPBs consist of five kernels and three simulated applications that mimic the computation and data movement of large scale computational fluid dynamics (CFD) applications. We started with the serial version included in NPB2.3. Optimization of memory and cache usage was applied to several benchmarks, noticeably BT and SP, resulting in better sequential performance. In order to overcome the lack of an HPF performance model and guide the development of the HPF codes, we employed an empirical performance model for several primitives found in the benchmarks. We encountered a few limitations of HPF, such as lack of supporting the "REDISTRIBUTION" directive and no easy way to handle irregular computation. The parallelization with OpenMP directives was done at the outer-most loop level to achieve the largest granularity. The performance of six HPF and OpenMP benchmarks is compared with their MPI counterparts for the Class-A problem size in the figure in next page. These results were obtained on an SGI Origin2000 (195MHz) with MIPSpro-f77 compiler 7.2.1 for OpenMP and MPI codes and PGI pghpf-2.4.3 compiler with MPI interface for HPF programs.
NASA Astrophysics Data System (ADS)
Ying, Jia-ju; Chen, Yu-dan; Liu, Jie; Wu, Dong-sheng; Lu, Jun
2016-10-01
The maladjustment of photoelectric instrument binocular optical axis parallelism will affect the observe effect directly. A binocular optical axis parallelism digital calibration system is designed. On the basis of the principle of optical axis binocular photoelectric instrument calibration, the scheme of system is designed, and the binocular optical axis parallelism digital calibration system is realized, which include four modules: multiband parallel light tube, optical axis translation, image acquisition system and software system. According to the different characteristics of thermal infrared imager and low-light-level night viewer, different algorithms is used to localize the center of the cross reticle. And the binocular optical axis parallelism calibration is realized for calibrating low-light-level night viewer and thermal infrared imager.
Exponential series approaches for nonparametric graphical models
NASA Astrophysics Data System (ADS)
Janofsky, Eric
Markov Random Fields (MRFs) or undirected graphical models are parsimonious representations of joint probability distributions. This thesis studies high-dimensional, continuous-valued pairwise Markov Random Fields. We are particularly interested in approximating pairwise densities whose logarithm belongs to a Sobolev space. For this problem we propose the method of exponential series which approximates the log density by a finite-dimensional exponential family with the number of sufficient statistics increasing with the sample size. We consider two approaches to estimating these models. The first is regularized maximum likelihood. This involves optimizing the sum of the log-likelihood of the data and a sparsity-inducing regularizer. We then propose a variational approximation to the likelihood based on tree-reweighted, nonparametric message passing. This approximation allows for upper bounds on risk estimates, leverages parallelization and is scalable to densities on hundreds of nodes. We show how the regularized variational MLE may be estimated using a proximal gradient algorithm. We then consider estimation using regularized score matching. This approach uses an alternative scoring rule to the log-likelihood, which obviates the need to compute the normalizing constant of the distribution. For general continuous-valued exponential families, we provide parameter and edge consistency results. As a special case we detail a new approach to sparse precision matrix estimation which has statistical performance competitive with the graphical lasso and computational performance competitive with the state-of-the-art glasso algorithm. We then describe results for model selection in the nonparametric pairwise model using exponential series. The regularized score matching problem is shown to be a convex program; we provide scalable algorithms based on consensus alternating direction method of multipliers (ADMM) and coordinate-wise descent. We use simulations to compare our method to others in the literature as well as the aforementioned TRW estimator.
Droplet impact on regular micro-grooved surfaces
NASA Astrophysics Data System (ADS)
Hu, Hai-Bao; Huang, Su-He; Chen, Li-Bin
2013-08-01
We have investigated experimentally the process of a droplet impact on a regular micro-grooved surface. The target surfaces are patterned such that micro-scale spokes radiate from the center, concentric circles, and parallel lines on the polishing copper plate, using Quasi-LIGA molding technology. The dynamic behavior of water droplets impacting on these structured surfaces is examined using a high-speed camera, including the drop impact processes, the maximum spreading diameters, and the lengths and numbers of fingers at different values of Weber number. Experimental results validate that the spreading processes are arrested on all target surfaces at low velocity. Also, the experimental results at higher impact velocity demonstrate that the spreading process is conducted on the surface parallel to the micro-grooves, but is arrested in the direction perpendicular to the micro-grooves. Besides, the lengths of fingers increase observably, even when they are ejected out as tiny droplets along the groove direction, at the same time the drop recoil velocity is reduced by micro-grooves which are parallel to the spreading direction, but not by micro-grooves which are vertical to the spreading direction.
NASA Astrophysics Data System (ADS)
Olano, C. A.
2009-11-01
Context: Using certain simplifications, Kompaneets derived a partial differential equation that states the local geometrical and kinematical conditions that each surface element of a shock wave, created by a point blast in a stratified gaseous medium, must satisfy. Kompaneets could solve his equation analytically for the case of a wave propagating in an exponentially stratified medium, obtaining the form of the shock front at progressive evolutionary stages. Complete analytical solutions of the Kompaneets equation for shock wave motion in further plane-parallel stratified media were not found, except for radially stratified media. Aims: We aim to analytically solve the Kompaneets equation for the motion of a shock wave in different plane-parallel stratified media that can reflect a wide variety of astrophysical contexts. We were particularly interested in solving the Kompaneets equation for a strong explosion in the interstellar medium of the Galactic disk, in which, due to intense winds and explosions of stars, gigantic gaseous structures known as superbubbles and supershells are formed. Methods: Using the Kompaneets approximation, we derived a pair of equations that we call adapted Kompaneets equations, that govern the propagation of a shock wave in a stratified medium and that permit us to obtain solutions in parametric form. The solutions provided by the system of adapted Kompaneets equations are equivalent to those of the Kompaneets equation. We solved the adapted Kompaneets equations for shock wave propagation in a generic stratified medium by means of a power-series method. Results: Using the series solution for a shock wave in a generic medium, we obtained the series solutions for four specific media whose respective density distributions in the direction perpendicular to the stratification plane are of an exponential, power-law type (one with exponent k=-1 and the other with k =-2) and a quadratic hyperbolic-secant. From these series solutions, we deduced exact solutions for the four media in terms of elemental functions. The exact solution for shock wave propagation in a medium of quadratic hyperbolic-secant density distribution is very appropriate to describe the growth of superbubbles in the Galactic disk. Member of the Carrera del Investigador Científico del CONICET, Argentina.
Lu, Zhao; Sun, Jing; Butts, Kenneth
2014-05-01
Support vector regression for approximating nonlinear dynamic systems is more delicate than the approximation of indicator functions in support vector classification, particularly for systems that involve multitudes of time scales in their sampled data. The kernel used for support vector learning determines the class of functions from which a support vector machine can draw its solution, and the choice of kernel significantly influences the performance of a support vector machine. In this paper, to bridge the gap between wavelet multiresolution analysis and kernel learning, the closed-form orthogonal wavelet is exploited to construct new multiscale asymmetric orthogonal wavelet kernels for linear programming support vector learning. The closed-form multiscale orthogonal wavelet kernel provides a systematic framework to implement multiscale kernel learning via dyadic dilations and also enables us to represent complex nonlinear dynamics effectively. To demonstrate the superiority of the proposed multiscale wavelet kernel in identifying complex nonlinear dynamic systems, two case studies are presented that aim at building parallel models on benchmark datasets. The development of parallel models that address the long-term/mid-term prediction issue is more intricate and challenging than the identification of series-parallel models where only one-step ahead prediction is required. Simulation results illustrate the effectiveness of the proposed multiscale kernel learning.
Multiple grid problems on concurrent-processing computers
NASA Technical Reports Server (NTRS)
Eberhardt, D. S.; Baganoff, D.
1986-01-01
Three computer codes were studied which make use of concurrent processing computer architectures in computational fluid dynamics (CFD). The three parallel codes were tested on a two processor multiple-instruction/multiple-data (MIMD) facility at NASA Ames Research Center, and are suggested for efficient parallel computations. The first code is a well-known program which makes use of the Beam and Warming, implicit, approximate factored algorithm. This study demonstrates the parallelism found in a well-known scheme and it achieved speedups exceeding 1.9 on the two processor MIMD test facility. The second code studied made use of an embedded grid scheme which is used to solve problems having complex geometries. The particular application for this study considered an airfoil/flap geometry in an incompressible flow. The scheme eliminates some of the inherent difficulties found in adapting approximate factorization techniques onto MIMD machines and allows the use of chaotic relaxation and asynchronous iteration techniques. The third code studied is an application of overset grids to a supersonic blunt body problem. The code addresses the difficulties encountered when using embedded grids on a compressible, and therefore nonlinear, problem. The complex numerical boundary system associated with overset grids is discussed and several boundary schemes are suggested. A boundary scheme based on the method of characteristics achieved the best results.
Multi-threading: A new dimension to massively parallel scientific computation
NASA Astrophysics Data System (ADS)
Nielsen, Ida M. B.; Janssen, Curtis L.
2000-06-01
Multi-threading is becoming widely available for Unix-like operating systems, and the application of multi-threading opens new ways for performing parallel computations with greater efficiency. We here briefly discuss the principles of multi-threading and illustrate the application of multi-threading for a massively parallel direct four-index transformation of electron repulsion integrals. Finally, other potential applications of multi-threading in scientific computing are outlined.
Dip and anisotropy effects on flow using a vertically skewed model grid.
Hoaglund, John R; Pollard, David
2003-01-01
Darcy flow equations relating vertical and bedding-parallel flow to vertical and bedding-parallel gradient components are derived for a skewed Cartesian grid in a vertical plane, correcting for structural dip given the principal hydraulic conductivities in bedding-parallel and bedding-orthogonal directions. Incorrect-minus-correct flow error results are presented for ranges of structural dip (0 < or = theta < or = 90) and gradient directions (0 < or = phi < or = 360). The equations can be coded into ground water models (e.g., MODFLOW) that can use a skewed Cartesian coordinate system to simulate flow in structural terrain with deformed bedding planes. Models modified with these equations will require input arrays of strike and dip, and a solver that can handle off-diagonal hydraulic conductivity terms.
Hydrogen storage in engineered carbon nanospaces.
Burress, Jacob; Kraus, Michael; Beckner, Matt; Cepel, Raina; Suppes, Galen; Wexler, Carlos; Pfeifer, Peter
2009-05-20
It is shown how appropriately engineered nanoporous carbons provide materials for reversible hydrogen storage, based on physisorption, with exceptional storage capacities (approximately 80 g H2/kg carbon, approximately 50 g H2/liter carbon, at 50 bar and 77 K). Nanopores generate high storage capacities (a) by having high surface area to volume ratios, and (b) by hosting deep potential wells through overlapping substrate potentials from opposite pore walls, giving rise to a binding energy nearly twice the binding energy in wide pores. Experimental case studies are presented with surface areas as high as 3100 m(2) g(-1), in which 40% of all surface sites reside in pores of width approximately 0.7 nm and binding energy approximately 9 kJ mol(-1), and 60% of sites in pores of width>1.0 nm and binding energy approximately 5 kJ mol(-1). The findings, including the prevalence of just two distinct binding energies, are in excellent agreement with results from molecular dynamics simulations. It is also shown, from statistical mechanical models, that one can experimentally distinguish between the situation in which molecules do (mobile adsorption) and do not (localized adsorption) move parallel to the surface, how such lateral dynamics affects the hydrogen storage capacity, and how the two situations are controlled by the vibrational frequencies of adsorbed hydrogen molecules parallel and perpendicular to the surface: in the samples presented, adsorption is mobile at 293 K, and localized at 77 K. These findings make a strong case for it being possible to significantly increase hydrogen storage capacities in nanoporous carbons by suitable engineering of the nanopore space.
Radiative transfer in spherical shell atmospheres. I - Rayleigh scattering
NASA Technical Reports Server (NTRS)
Adams, C. N.; Kattawar, G. W.
1978-01-01
The plane-parallel approximation and the more realistic spherical shell approximation for the radiance reflected from a planetary atmosphere are compared and are applied to the study of a planet the size of the earth with a homogeneous conservative Rayleigh scattering atmosphere extending to a height of 100 km. Inadequacies of the approximations are considered. Radiance versus height distributions for both single and multiple scattering are presented, as are results for the fractional radiance from altitudes in the atmosphere which contribute to the total unidirectional reflected radiance at the top of the atmosphere. The data can be used for remote sensing applications and planetary spectroscopy.
NASA Astrophysics Data System (ADS)
Reber, J. E.; Schmalholz, S. M.; Lechmann, S. M.
2009-04-01
We present field data and numerical modeling results which show the evolution of stress and strain patterns during 3D folding resulting in an orthogonal fracture system. The field area is located near Almograve, SW Portugal. The area is part of the Mira Formation which itself is part of the South Portuguese Zone (SPZ). The structural development of the SPZ is characterized by southwest vergent folding and thrust displacement. The metamorphism in the SPZ increases from diagenetic conditions in the southwest to greenschist-facies conditions to the northeast. The Mira Formation is composed of turbiditic layers of Carboniferous age with low sandstone to shale ratio. The data was gathered at three outcrops which show structures similar to chocolate tablet structures in the folded sandstone layers. Chocolate tablet structures are generated under simultaneous extension in two directions and show two fracture systems of the same age which are perpendicular to each other. However, the Mira Formation is located in a convergent area. Also, the outcrops near Almograve show two fracture systems of different age. The fractures orthogonal to the fold axis and the bedding are crosscut by fractures parallel to the fold axis and orthogonal to the bedding. Our hypothesis for the evolution of the observed fracture systems is as follows; the older fractures which are now orthogonal to the fold axis and to the bedding plane were generated during compression while the layers were still approximately horizontal. They are parallel to σ1(i.e. mode 1 fractures). The second and younger fracture family was generated in a phase where there is local extension in the fold limbs. These fractures are orthogonal to the far-field σ1, parallel to the fold axis and perpendicular to the bedding. The shortening direction is constant during the entire folding process. We test our hypothesis with numerical modeling. We use 2D and 3D finite element codes with a mixed formulation for incompressible flow and a viscous rheology. The stress and strain tensor components are calculated at each numerical nodal point. The stress and strain fields are visualized through ellipses and ellipsoids which are calculated using the eigenvalues of the respective tensors. The shortest main axis represents the direction of the smallest stress σ3 and the longest main axis represents the direction of the largest stress σ1. To generate two orthogonal fracture systems in the fold limbs we expect a relatively rapid change of the stress field in the fold limbs during folding. With a relatively slow change of the stress field we would expect to see more than two fracture systems with a wide range of fracture orientation which we did not observe in the field. The preliminary 2D results show, as expected, a sudden flip of the main axes of the stress ellipse which corresponds to a change from limb-parallel compression to extension. For the 3D model we expect similar results and we will investigate the impact of different deformation boundary conditions on the evolution of the 3D stress and strain fields.
NASA Technical Reports Server (NTRS)
Roth, R. J.
1973-01-01
The distribution function of ion energy parallel to the magnetic field of a modified Penning discharge has been measured with a retarding potential energy analyzer. These ions escaped through one of the throats of the magnetic mirror geometry. Simultaneous measurements of the ion energy distribution function perpendicular to the magnetic field have been made with a charge exchange neutral detector. The ion energy distribution functions are approximately Maxwellian, and the parallel and perpendicular kinetic temperatures are equal within experimental error. These results suggest that turbulent processes previously observed in this discharge Maxwellianize the velocity distribution along a radius in velocity space and cause an isotropic energy distribution. When the distributions depart from Maxwellian, they are enhanced above the Maxwellian tail.
Soft-output decoding algorithms in iterative decoding of turbo codes
NASA Technical Reports Server (NTRS)
Benedetto, S.; Montorsi, G.; Divsalar, D.; Pollara, F.
1996-01-01
In this article, we present two versions of a simplified maximum a posteriori decoding algorithm. The algorithms work in a sliding window form, like the Viterbi algorithm, and can thus be used to decode continuously transmitted sequences obtained by parallel concatenated codes, without requiring code trellis termination. A heuristic explanation is also given of how to embed the maximum a posteriori algorithms into the iterative decoding of parallel concatenated codes (turbo codes). The performances of the two algorithms are compared on the basis of a powerful rate 1/3 parallel concatenated code. Basic circuits to implement the simplified a posteriori decoding algorithm using lookup tables, and two further approximations (linear and threshold), with a very small penalty, to eliminate the need for lookup tables are proposed.
Parallel computation of level set method for 500 Hz visual servo control
NASA Astrophysics Data System (ADS)
Fei, Xianfeng; Igarashi, Yasunobu; Hashimoto, Koichi
2008-11-01
We propose a 2D microorganism tracking system using a parallel level set method and a column parallel vision system (CPV). This system keeps a single microorganism in the middle of the visual field under a microscope by visual servoing an automated stage. We propose a new energy function for the level set method. This function constrains an amount of light intensity inside the detected object contour to control the number of the detected objects. This algorithm is implemented in CPV system and computational time for each frame is 2 [ms], approximately. A tracking experiment for about 25 s is demonstrated. Also we demonstrate a single paramecium can be kept tracking even if other paramecia appear in the visual field and contact with the tracked paramecium.
NASA Astrophysics Data System (ADS)
Lee, J.; Kim, K.
A Very Large Scale Integration (VLSI) architecture for robot direct kinematic computation suitable for industrial robot manipulators was investigated. The Denavit-Hartenberg transformations are reviewed to exploit a proper processing element, namely an augmented CORDIC. Specifically, two distinct implementations are elaborated on, such as the bit-serial and parallel. Performance of each scheme is analyzed with respect to the time to compute one location of the end-effector of a 6-links manipulator, and the number of transistors required.
Shahinpoor, Mohsen
1995-01-01
A device for electromagnetically accelerating projectiles. The invention features two parallel conducting circular plates, a plurality of electrode connections to both upper and lower plates, a support base, and a projectile magazine. A projectile is spring-loaded into a firing position concentrically located between the parallel plates. A voltage source is applied to the plates to cause current to flow in directions defined by selectable, discrete electrode connections on both upper and lower plates. Repulsive Lorentz forces are generated to eject the projectile in a 360 degree range of fire.
NASA Technical Reports Server (NTRS)
Lee, J.; Kim, K.
1991-01-01
A Very Large Scale Integration (VLSI) architecture for robot direct kinematic computation suitable for industrial robot manipulators was investigated. The Denavit-Hartenberg transformations are reviewed to exploit a proper processing element, namely an augmented CORDIC. Specifically, two distinct implementations are elaborated on, such as the bit-serial and parallel. Performance of each scheme is analyzed with respect to the time to compute one location of the end-effector of a 6-links manipulator, and the number of transistors required.
Temporal Planning for Compilation of Quantum Approximate Optimization Algorithm Circuits
NASA Technical Reports Server (NTRS)
Venturelli, Davide; Do, Minh Binh; Rieffel, Eleanor Gilbert; Frank, Jeremy David
2017-01-01
We investigate the application of temporal planners to the problem of compiling quantum circuits to newly emerging quantum hardware. While our approach is general, we focus our initial experiments on Quantum Approximate Optimization Algorithm (QAOA) circuits that have few ordering constraints and allow highly parallel plans. We report on experiments using several temporal planners to compile circuits of various sizes to a realistic hardware. This early empirical evaluation suggests that temporal planning is a viable approach to quantum circuit compilation.
Introduction to Numerical Methods
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schoonover, Joseph A.
2016-06-14
These are slides for a lecture for the Parallel Computing Summer Research Internship at the National Security Education Center. This gives an introduction to numerical methods. Repetitive algorithms are used to obtain approximate solutions to mathematical problems, using sorting, searching, root finding, optimization, interpolation, extrapolation, least squares regresion, Eigenvalue problems, ordinary differential equations, and partial differential equations. Many equations are shown. Discretizations allow us to approximate solutions to mathematical models of physical systems using a repetitive algorithm and introduce errors that can lead to numerical instabilities if we are not careful.
NASA Astrophysics Data System (ADS)
Jannati, Mojtaba; Valadan Zoej, Mohammad Javad; Mokhtarzade, Mehdi
2018-03-01
This paper presents a novel approach to epipolar resampling of cross-track linear pushbroom imagery using orbital parameters model (OPM). The backbone of the proposed method relies on modification of attitude parameters of linear array stereo imagery in such a way to parallelize the approximate conjugate epipolar lines (ACELs) with the instantaneous base line (IBL) of the conjugate image points (CIPs). Afterward, a complementary rotation is applied in order to parallelize all the ACELs throughout the stereo imagery. The new estimated attitude parameters are evaluated based on the direction of the IBL and the ACELs. Due to the spatial and temporal variability of the IBL (respectively changes in column and row numbers of the CIPs) and nonparallel nature of the epipolar lines in the stereo linear images, some polynomials in the both column and row numbers of the CIPs are used to model new attitude parameters. As the instantaneous position of sensors remains fix, the digital elevation model (DEM) of the area of interest is not required in the resampling process. According to the experimental results obtained from two pairs of SPOT and RapidEye stereo imagery with a high elevation relief, the average absolute values of remained vertical parallaxes of CIPs in the normalized images were obtained 0.19 and 0.28 pixels respectively, which confirm the high accuracy and applicability of the proposed method.
NASA Astrophysics Data System (ADS)
Calderín, L.; Karasiev, V. V.; Trickey, S. B.
2017-12-01
As the foundation for a new computational implementation, we survey the calculation of the complex electrical conductivity tensor based on the Kubo-Greenwood (KG) formalism (Kubo, 1957; Greenwood, 1958), with emphasis on derivations and technical aspects pertinent to use of projector augmented wave datasets with plane wave basis sets (Blöchl, 1994). New analytical results and a full implementation of the KG approach in an open-source Fortran 90 post-processing code for use with Quantum Espresso (Giannozzi et al., 2009) are presented. Named KGEC ([K]ubo [G]reenwood [E]lectronic [C]onductivity), the code calculates the full complex conductivity tensor (not just the average trace). It supports use of either the original KG formula or the popular one approximated in terms of a Dirac delta function. It provides both Gaussian and Lorentzian representations of the Dirac delta function (though the Lorentzian is preferable on basic grounds). KGEC provides decomposition of the conductivity into intra- and inter-band contributions as well as degenerate state contributions. It calculates the dc conductivity tensor directly. It is MPI parallelized over k-points, bands, and plane waves, with an option to recover the plane wave processes for their use in band parallelization as well. It is designed to provide rapid convergence with respect to k-point density. Examples of its use are given.
Towards a Lithium Radiative / Vapor-Box Divertor
NASA Astrophysics Data System (ADS)
Goldston, Robert; Constantin, Marius; Jaworski, Michael; Myers, Rachel; Ono, Masayuki; Schwartz, Jacob; Scotti, Filippo; Qu, Zhaonan
2014-10-01
Recent research has indicated that the peak perpendicular heat flux on reactor divertor targets will be hundreds of MW/m2 in the absence of dissipation and/or spatial spreading. Thus we are attracted to both enhanced radiative cooling and continuous vapor shielding. Lithium particle lifetimes <=100 micro-sec enhance radiation efficiency at T < 10 eV, while lithium charge-exchange with neutral hydrogen may enhance radiative efficiency for T > 10 eV and n0/ni > 0.1. We are examining if the latter mechanism plays a role in the narrowing of the heat-flux footprint in lithiated NSTX discharges. In parallel we are investigating the possibility of immersing a reactor divertor leg in a channel of lithium vapor. If we approximate the vapor channel as in local equilibrium with lithium-wetted walls ranging from 300 oC at the entrance point to 950 oC 10m downstream in the parallel direction, we find that the vapor can both balance reactor levels of upstream plasma pressure and stop energetic ions and electrons with energies up to at least 25 keV, as might be produced in ELMs. Each 10 l/sec of lithium evaporated deep in the channel and recondensed in cooler regions spreads 100 MW over a much wider area than the original strike point. This work supported by US DOE Contract DE-AC02-09CH11466.
Molecular architecture of human prion protein amyloid: a parallel, in-register beta-structure.
Cobb, Nathan J; Sönnichsen, Frank D; McHaourab, Hassane; Surewicz, Witold K
2007-11-27
Transmissible spongiform encephalopathies (TSEs) represent a group of fatal neurodegenerative diseases that are associated with conformational conversion of the normally monomeric and alpha-helical prion protein, PrP(C), to the beta-sheet-rich PrP(Sc). This latter conformer is believed to constitute the main component of the infectious TSE agent. In contrast to high-resolution data for the PrP(C) monomer, structures of the pathogenic PrP(Sc) or synthetic PrP(Sc)-like aggregates remain elusive. Here we have used site-directed spin labeling and EPR spectroscopy to probe the molecular architecture of the recombinant PrP amyloid, a misfolded form recently reported to induce transmissible disease in mice overexpressing an N-terminally truncated form of PrP(C). Our data show that, in contrast to earlier, largely theoretical models, the con formational conversion of PrP(C) involves major refolding of the C-terminal alpha-helical region. The core of the amyloid maps to C-terminal residues from approximately 160-220, and these residues form single-molecule layers that stack on top of one another with parallel, in-register alignment of beta-strands. This structural insight has important implications for understanding the molecular basis of prion propagation, as well as hereditary prion diseases, most of which are associated with point mutations in the region found to undergo a refolding to beta-structure.
Bi-directional series-parallel elastic actuator and overlap of the actuation layers.
Furnémont, Raphaël; Mathijssen, Glenn; Verstraten, Tom; Lefeber, Dirk; Vanderborght, Bram
2016-01-27
Several robotics applications require high torque-to-weight ratio and energy efficient actuators. Progress in that direction was made by introducing compliant elements into the actuation. A large variety of actuators were developed such as series elastic actuators (SEAs), variable stiffness actuators and parallel elastic actuators (PEAs). SEAs can reduce the peak power while PEAs can reduce the torque requirement on the motor. Nonetheless, these actuators still cannot meet performances close to humans. To combine both advantages, the series parallel elastic actuator (SPEA) was developed. The principle is inspired from biological muscles. Muscles are composed of motor units, placed in parallel, which are variably recruited as the required effort increases. This biological principle is exploited in the SPEA, where springs (layers), placed in parallel, can be recruited one by one. This recruitment is performed by an intermittent mechanism. This paper presents the development of a SPEA using the MACCEPA principle with a self-closing mechanism. This actuator can deliver a bi-directional output torque, variable stiffness and reduced friction. The load on the motor can also be reduced, leading to a lower power consumption. The variable recruitment of the parallel springs can also be tuned in order to further decrease the consumption of the actuator for a given task. First, an explanation of the concept and a brief description of the prior work done will be given. Next, the design and the model of one of the layers will be presented. The working principle of the full actuator will then be given. At the end of this paper, experiments showing the electric consumption of the actuator will display the advantage of the SPEA over an equivalent stiff actuator.
Water liquid-vapor interface subjected to various electric fields: A molecular dynamics study.
Nikzad, Mohammadreza; Azimian, Ahmad Reza; Rezaei, Majid; Nikzad, Safoora
2017-11-28
Investigation of the effects of E-fields on the liquid-vapor interface is essential for the study of floating water bridge and wetting phenomena. The present study employs the molecular dynamics method to investigate the effects of parallel and perpendicular E-fields on the water liquid-vapor interface. For this purpose, density distribution, number of hydrogen bonds, molecular orientation, and surface tension are examined to gain a better understanding of the interface structure. Results indicate enhancements in parallel E-field decrease the interface width and number of hydrogen bonds, while the opposite holds true in the case of perpendicular E-fields. Moreover, perpendicular fields disturb the water structure at the interface. Given that water molecules tend to be parallel to the interface plane, it is observed that perpendicular E-fields fail to realign water molecules in the field direction while the parallel ones easily do so. It is also shown that surface tension rises with increasing strength of parallel E-fields, while it reduces in the case of perpendicular E-fields. Enhancement of surface tension in the parallel field direction demonstrates how the floating water bridge forms between the beakers. Finally, it is found that application of external E-fields to the liquid-vapor interface does not lead to uniform changes in surface tension and that the liquid-vapor interfacial tension term in Young's equation should be calculated near the triple-line of the droplet. This is attributed to the multi-directional nature of the droplet surface, indicating that no constant value can be assigned to a droplet's surface tension in the presence of large electric fields.
NASA Astrophysics Data System (ADS)
Bellerby, Tim
2014-05-01
Model Integration System (MIST) is open-source environmental modelling programming language that directly incorporates data parallelism. The language is designed to enable straightforward programming structures, such as nested loops and conditional statements to be directly translated into sequences of whole-array (or more generally whole data-structure) operations. MIST thus enables the programmer to use well-understood constructs, directly relating to the mathematical structure of the model, without having to explicitly vectorize code or worry about details of parallelization. A range of common modelling operations are supported by dedicated language structures operating on cell neighbourhoods rather than individual cells (e.g.: the 3x3 local neighbourhood needed to implement an averaging image filter can be simply accessed from within a simple loop traversing all image pixels). This facility hides details of inter-process communication behind more mathematically relevant descriptions of model dynamics. The MIST automatic vectorization/parallelization process serves both to distribute work among available nodes and separately to control storage requirements for intermediate expressions - enabling operations on very large domains for which memory availability may be an issue. MIST is designed to facilitate efficient interpreter based implementations. A prototype open source interpreter is available, coded in standard FORTRAN 95, with tools to rapidly integrate existing FORTRAN 77 or 95 code libraries. The language is formally specified and thus not limited to FORTRAN implementation or to an interpreter-based approach. A MIST to FORTRAN compiler is under development and volunteers are sought to create an ANSI-C implementation. Parallel processing is currently implemented using OpenMP. However, parallelization code is fully modularised and could be replaced with implementations using other libraries. GPU implementation is potentially possible.
NASA Astrophysics Data System (ADS)
Dove, P. M.; Davis, K. J.; De Yoreo, J. J.; Orme, C. A.
2001-12-01
Deciphering the complex strategies by which organisms produce nanocrystalline materials with exquisite morphologies is central to understanding biomineralizing systems. One control on the morphology of biogenic nanoparticles is the specific interactions of their surfaces with the organic functional groups provided by the organism and the various inorganic species present in the ambient environment. It is now possible to directly probe the microscopic structural controls on crystal morphology by making quantitative measurements of the dynamic processes occurring at the mineral-water interface. These observations can provide crucial information concerning the actual mechanisms of growth that is otherwise unobtainable through macroscopic techniques. Here we use in situ molecular-scale observations of step dynamics and growth hillock morphology to directly resolve roles of principal impurities in regulating calcite surface morphologies. We show that the interactions of certain inorganic as well as organic impurities with the calcite surface are dependent upon the molecular-scale structures of step-edges. These interactions can assume a primary role in directing crystal morphology. In calcite growth experiments containing magnesium, we show that growth hillock structures become modified owing to the preferential inhibition of step motion along directions approximately parallel to the [010]. Compositional analyses have shown that Mg incorporates at different levels into the two types of nonequivalent steps, which meet at the hillock corner parallel to [010]. A simple calculation of the strain caused by this difference indicates that we should expect a significant retardation at this corner, in agreement with the observed development of [010] steps. If the low-energy step-risers produced by these [010] steps is perpendicular to the c-axis as seems likely from crystallographic considerations, this effect provides a plausible mechanism for the elongated calcite crystal habits found in natural environments that contain magnesium. In a separate study, step-specific interactions are also found between chiral aspartate molecules and the calcite surface. The L and D- aspartate enantiomers exhibit structure preferences for the different types of step-risers on the calcite surface. These site-specific interactions result in the transfer of asymmetry from the organic molecule to the crystal surface through the formation of chiral growth hillocks and surface morphologies. These studies yield direct experimental insight into the molecular-scale structural controls on nanocrystal morphology in biomineralizing systems.
Tools for Analysis and Visualization of Large Time-Varying CFD Data Sets
NASA Technical Reports Server (NTRS)
Wilhelms, Jane; VanGelder, Allen
1997-01-01
In the second year, we continued to built upon and improve our scanline-based direct volume renderer that we developed in the first year of this grant. This extremely general rendering approach can handle regular or irregular grids, including overlapping multiple grids, and polygon mesh surfaces. It runs in parallel on multi-processors. It can also be used in conjunction with a k-d tree hierarchy, where approximate models and error terms are stored in the nodes of the tree, and approximate fast renderings can be created. We have extended our software to handle time-varying data where the data changes but the grid does not. We are now working on extending it to handle more general time-varying data. We have also developed a new extension of our direct volume renderer that uses automatic decimation of the 3D grid, as opposed to an explicit hierarchy. We explored this alternative approach as being more appropriate for very large data sets, where the extra expense of a tree may be unacceptable. We also describe a new approach to direct volume rendering using hardware 3D textures and incorporates lighting effects. Volume rendering using hardware 3D textures is extremely fast, and machines capable of using this technique are becoming more moderately priced. While this technique, at present, is limited to use with regular grids, we are pursuing possible algorithms extending the approach to more general grid types. We have also begun to explore a new method for determining the accuracy of approximate models based on the light field method described at ACM SIGGRAPH '96. In our initial implementation, we automatically image the volume from 32 equi-distant positions on the surface of an enclosing tessellated sphere. We then calculate differences between these images under different conditions of volume approximation or decimation. We are studying whether this will give a quantitative measure of the effects of approximation. We have created new tools for exploring the differences between images produced by various rendering methods. Images created by our software can be stored in the SGI RGB format. Our idtools software reads in pair of images and compares them using various metrics. The differences of the images using the RGB, HSV, and HSL color models can be calculated and shown. We can also calculate the auto-correlation function and the Fourier transform of the image and image differences. We will explore how these image differences compare in order to find useful metrics for quantifying the success of various visualization approaches. In general, progress was consistent with our research plan for the second year of the grant.
NASA Astrophysics Data System (ADS)
Denolle, M.; Dunham, E. M.; Prieto, G.; Beroza, G. C.
2013-05-01
There is no clearer example of the increase in hazard due to prolonged and amplified shaking in sedimentary, than the case of Mexico City in the 1985 Michoacan earthquake. It is critically important to identify what other cities might be susceptible to similar basin amplification effects. Physics-based simulations in 3D crustal structure can be used to model and anticipate those effects, but they rely on our knowledge of the complexity of the medium. We propose a parallel approach to validate ground motion simulations using the ambient seismic field. We compute the Earth's impulse response combining the ambient seismic field and coda-wave enforcing causality and symmetry constraints. We correct the surface impulse responses to account for the source depth, mechanism and duration using a 1D approximation of the local surface-wave excitation. We call the new responses virtual earthquakes. We validate the ground motion predicted from the virtual earthquakes against moderate earthquakes in southern California. We then combine temporary seismic stations on the southern San Andreas Fault and extend the point source approximation of the Virtual Earthquake Approach to model finite kinematic ruptures. We confirm the coupling between source directivity and amplification in downtown Los Angeles seen in simulations.
Schwarz, G; Savko, P
1982-01-01
Dielectric constant and loss of the membrane-active peptide alamethicin in octanol/dioxane mixtures have been measured at frequencies between 5 kHz and 50 MHz. On the basis of a rotational mechanism of dipolar orientation, the observed dispersion provides information regarding size, shape, and dipole moment of the structural entities which the solute may assume in media of diverse lipophilicity. Particularly detailed results are obtained in a pure octanol solvent where an apparent molecular weight of alamethicin could be determined. It turns out that in this quite lipophilic medium most of the peptide material exists as a monomer particle that has approximate length and diameter of 35 and 13 A, respectively. It carries a dipole moment of approximately 75 Debye units (directed nearly parallel to the long axis). At our concentrations of a few milligrams per milliliters, appreciable formation of dimers by head-to-tail linkage is indicated. When the octanol content is reduced by adding greater amounts of dioxane, larger particles are encountered. This is accompanied by a decrease of the effective polarity. The inherent increase of hydrophilicity in the dioxane-enriched solvent apparently favors another monomer conformation that has a low dipole moment and easily aggregates to some kind of micelle. PMID:7115881
Negre, Christian F. A; Mniszewski, Susan M.; Cawkwell, Marc Jon; ...
2016-06-06
We present a reduced complexity algorithm to compute the inverse overlap factors required to solve the generalized eigenvalue problem in a quantum-based molecular dynamics (MD) simulation. Our method is based on the recursive iterative re nement of an initial guess Z of the inverse overlap matrix S. The initial guess of Z is obtained beforehand either by using an approximate divide and conquer technique or dynamically, propagated within an extended Lagrangian dynamics from previous MD time steps. With this formulation, we achieve long-term stability and energy conservation even under incomplete approximate iterative re nement of Z. Linear scaling performance ismore » obtained using numerically thresholded sparse matrix algebra based on the ELLPACK-R sparse matrix data format, which also enables e cient shared memory parallelization. As we show in this article using selfconsistent density functional based tight-binding MD, our approach is faster than conventional methods based on the direct diagonalization of the overlap matrix S for systems as small as a few hundred atoms, substantially accelerating quantum-based simulations even for molecular structures of intermediate size. For a 4,158 atom water-solvated polyalanine system we nd an average speedup factor of 122 for the computation of Z in each MD step.« less
A microfluidic direct formate fuel cell on paper.
Copenhaver, Thomas S; Purohit, Krutarth H; Domalaon, Kryls; Pham, Linda; Burgess, Brianna J; Manorothkul, Natalie; Galvan, Vicente; Sotez, Samantha; Gomez, Frank A; Haan, John L
2015-08-01
We describe the first direct formate fuel cell on a paper microfluidic platform. In traditional membrane-less microfluidic fuel cells (MFCs), external pumping consumes power produced by the fuel cell in order to maintain co-laminar flow of the anode stream and oxidant stream to prevent mixing. However, in paper microfluidics, capillary action drives flow while minimizing stream mixing. In this work, we demonstrate a paper MFC that uses formate and hydrogen peroxide as the anode fuel and cathode oxidant, respectively. Using these materials we achieve a maximum power density of nearly 2.5 mW/mg Pd. In a series configuration, our MFC achieves an open circuit voltage just over 1 V, and in a parallel configuration, short circuit of 20 mA absolute current. We also demonstrate that the MFC does not require continuous flow of fuel and oxidant to produce power. We found that we can pre-saturate the materials on the paper, stop the electrolyte flow, and still produce approximately 0.5 V for 15 min. This type of paper MFC has potential applications in point-of-care diagnostic devices and other electrochemical sensors. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
NASA Technical Reports Server (NTRS)
Liu, C. C. (Principal Investigator); Rodrigues, J. E.
1984-01-01
Examination of LANDSAT and SLAR images in southern Bahia reveals numerous linear features, which are grouped in five sets, based on their trends: N65 degrees E, N70 degrees W, N45 degrees E and NS/N15 degrees E. Owing to their topographic expressions, distributive patterns, spacing between individual lineaments and their mutual relationships, the lineament sets of N65 degrees E and N70 degrees W, as well as the sets of N40 degrees E and N45 degrees W, are considered as two groups of conjugate shear fractures and the former is older and is always cut by the latter. Their conjugate shear angles are 45 degrees and 85 degrees and their bisector lines are approximately in east-west and north-south directions, respectively. According to Badgeley's argumentation on the conjugate shear angles, the former conjugate shear fractures would be caused by: (1) vertical movements, and the bisector of their conjugate angle would be parallel to the long axis of horsting or folding, or (2) by a compressive force in the east-west direction and under a condition of low confining pressure and temperature.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
We present preliminary hypocenter determinations for 52 earthquakes recorded by a large multiinstitutional network of ocean bottom seismometers and ocean bottom hydrophones in the Orozco Fracture Zone in the eastern Pacific during late February to mid-March 1979. The network was deployed as part of the Rivera Ocean Seismic Experiment, also known as Project ROSE. The Orozco Fracture Zone is Physiographically complex, and the pattern of microearthquake hypocenters at least partly reflects this complexity. All of the well-located epicenters lie within the active transform fault segment of the fracture zone. About half of the recorded earthquakes were aligned along a narrowmore » trough that extends eastward from the northern rise crest intersection in the approximate direction of the Cocos-Pacific relative plate motion; these events appear to be characterized by strike-slip faulting. The second major group of activity occurred in the central portion of the transform fault; the microearthquakes in this group do not display a preferred alignment parallel to the direction of spreading, and several are not obviously associated with distinct topographic features. Hypocentral depth was well resolved for many of the earthquakes reported here. Nominal depths range from 0 to 17 km below the seafloor.« less
Ion componsition of zipper events
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaye, S.M.; Shelley, E.G.; Sharp, R.D.
1981-05-01
A class of ion distributions has recently been identified by Fennell et al. (this issue). The distributions are composed of two components, a low-energy component with peak fluxes directed along the field line and a high-energy component with peak fluxes in the perpendicular direction. The transiton between the two components occur over a very narrow range of energies but can occur anywhere between approximately several hundred electron volts and 20 keV. Because of the appearance of this distribution on an energy versus time spectrogram, the ion events have been called zippers. The purpose of this report is to examine themore » mass composition of the zipper events. We find that the low-energy and parallel component is composed primarily of O/sup +/, with, to a lesser degree, H/sup +/ and a trace of He/sup +/. The high-energy and perpendicular component is predominantly H/sup +/, with the relative abundances of O/sup +/ and He/sup +/ down from those of the low-energy component by a factor of approx.10. These results suggest that whereas the low-energy component is probably ionospheric in origin, the source of the high-energy components is most probably the plsamasheet.« less
NASA Technical Reports Server (NTRS)
Woronowicz, Michael
2016-01-01
Analytical expressions for column number density (CND) are developed for optical line of sight paths through a variety of steady free molecule point source models including directionally-constrained effusion (Mach number M = 0) and flow from a sonic orifice (M = 1). Sonic orifice solutions are approximate, developed using a fair simulacrum fitted to the free molecule solution. Expressions are also developed for a spherically-symmetric thermal expansion (M = 0). CND solutions are found for the most general paths relative to these sources and briefly explored. It is determined that the maximum CND from a distant location through directed effusion and sonic orifice cases occurs along the path parallel to the source plane that intersects the plume axis. For the effusive case this value is exactly twice the CND found along the ray originating from that point of intersection and extending to infinity along the plume's axis. For sonic plumes this ratio is reduced to about 4/3. For high Mach number cases the maximum CND will be found along the axial centerline path. Keywords: column number density, plume flows, outgassing, free molecule flow.
Solution of a tridiagonal system of equations on the finite element machine
NASA Technical Reports Server (NTRS)
Bostic, S. W.
1984-01-01
Two parallel algorithms for the solution of tridiagonal systems of equations were implemented on the Finite Element Machine. The Accelerated Parallel Gauss method, an iterative method, and the Buneman algorithm, a direct method, are discussed and execution statistics are presented.
Moving-Article X-Ray Imaging System and Method for 3-D Image Generation
NASA Technical Reports Server (NTRS)
Fernandez, Kenneth R. (Inventor)
2012-01-01
An x-ray imaging system and method for a moving article are provided for an article moved along a linear direction of travel while the article is exposed to non-overlapping x-ray beams. A plurality of parallel linear sensor arrays are disposed in the x-ray beams after they pass through the article. More specifically, a first half of the plurality are disposed in a first of the x-ray beams while a second half of the plurality are disposed in a second of the x-ray beams. Each of the parallel linear sensor arrays is oriented perpendicular to the linear direction of travel. Each of the parallel linear sensor arrays in the first half is matched to a corresponding one of the parallel linear sensor arrays in the second half in terms of an angular position in the first of the x-ray beams and the second of the x-ray beams, respectively.
NASA Astrophysics Data System (ADS)
Marx, Alain; Lütjens, Hinrich
2017-03-01
A hybrid MPI/OpenMP parallel version of the XTOR-2F code [Lütjens and Luciani, J. Comput. Phys. 229 (2010) 8130] solving the two-fluid MHD equations in full tokamak geometry by means of an iterative Newton-Krylov matrix-free method has been developed. The present work shows that the code has been parallelized significantly despite the numerical profile of the problem solved by XTOR-2F, i.e. a discretization with pseudo-spectral representations in all angular directions, the stiffness of the two-fluid stability problem in tokamaks, and the use of a direct LU decomposition to invert the physical pre-conditioner at every Krylov iteration of the solver. The execution time of the parallelized version is an order of magnitude smaller than the sequential one for low resolution cases, with an increasing speedup when the discretization mesh is refined. Moreover, it allows to perform simulations with higher resolutions, previously forbidden because of memory limitations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Blocksome, Michael A.; Mamidala, Amith R.
2013-09-03
Fencing direct memory access (`DMA`) data transfers in a parallel active messaging interface (`PAMI`) of a parallel computer, the PAMI including data communications endpoints, each endpoint including specifications of a client, a context, and a task, the endpoints coupled for data communications through the PAMI and through DMA controllers operatively coupled to segments of shared random access memory through which the DMA controllers deliver data communications deterministically, including initiating execution through the PAMI of an ordered sequence of active DMA instructions for DMA data transfers between two endpoints, effecting deterministic DMA data transfers through a DMA controller and a segmentmore » of shared memory; and executing through the PAMI, with no FENCE accounting for DMA data transfers, an active FENCE instruction, the FENCE instruction completing execution only after completion of all DMA instructions initiated prior to execution of the FENCE instruction for DMA data transfers between the two endpoints.« less
Parallelized implicit propagators for the finite-difference Schrödinger equation
NASA Astrophysics Data System (ADS)
Parker, Jonathan; Taylor, K. T.
1995-08-01
We describe the application of block Gauss-Seidel and block Jacobi iterative methods to the design of implicit propagators for finite-difference models of the time-dependent Schrödinger equation. The block-wise iterative methods discussed here are mixed direct-iterative methods for solving simultaneous equations, in the sense that direct methods (e.g. LU decomposition) are used to invert certain block sub-matrices, and iterative methods are used to complete the solution. We describe parallel variants of the basic algorithm that are well suited to the medium- to coarse-grained parallelism of work-station clusters, and MIMD supercomputers, and we show that under a wide range of conditions, fine-grained parallelism of the computation can be achieved. Numerical tests are conducted on a typical one-electron atom Hamiltonian. The methods converge robustly to machine precision (15 significant figures), in some cases in as few as 6 or 7 iterations. The rate of convergence is nearly independent of the finite-difference grid-point separations.
Directionally solidified article with weld repair
NASA Technical Reports Server (NTRS)
Smashey, Russell W. (Inventor); Snyder, John H. (Inventor); Borne, Bruce L. (Inventor)
2003-01-01
A directionally solidified nickel-base superalloy article has a defect therein extending parallel to the solidification direction. The article is repaired by removing any foreign matter present in the defect, and then heating the article to a repair temperature of from about 60 to about 98 percent of the solidus temperature of the base material in a chamber containing a protective gas that inhibits oxidation of the base material. The defect is filled with a filler metal while maintaining the article at the repair temperature. The filling is accomplished by providing a source of the filler metal of substantially the same composition as the base material of the directionally solidified article, and melting the filler metal into the defect progressively while moving the source of the filler metal relative to the article in a direction parallel to the solidification direction. Optionally, additional artificial heat extraction is accomplished in a heat-flow direction that is within about 45 degrees of the solidification direction, as the filler metal solidifies within the defect. The article may thereafter be heat treated.
Weld repair of directionally solidified articles
NASA Technical Reports Server (NTRS)
Smashey, Russell W. (Inventor); Snyder, John H. (Inventor); Borne, Bruce L. (Inventor)
2002-01-01
A directionally solidified nickel-base superalloy article has a defect therein extending parallel to the solidification direction. The article is repaired by removing any foreign matter present in the defect, and then heating the article to a repair temperature of from about 60 to about 98 percent of the solidus temperature of the base material in a chamber containing a protective gas that inhibits oxidation of the base material. The defect is filled with a filler metal while maintaining the article at the repair temperature. The filling is accomplished by providing a source of the filler metal of substantially the same composition as the base material of the directionally solidified article, and melting the filler metal into the defect progressively while moving the source of the filler metal relative to the article in a direction parallel to the solidification direction. Optionally, additional artificial heat extraction is accomplished in a heat-flow direction that is within about 45 degrees of the solidification direction, as the filler metal solidifies within the defect. The article may thereafter be heat treated.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reimberg, Paulo; Bernardeau, Francis; Pitrou, Cyril, E-mail: paulo.flose-reimberg@cea.fr, E-mail: francis.bernardeau@cea.fr, E-mail: pitrou@iap.fr
Redshift-space distortions are generally considered in the plane parallel limit, where the angular separation between the two sources can be neglected. Given that galaxy catalogues now cover large fractions of the sky, it becomes necessary to consider them in a formalism which takes into account the wide angle separations. In this article we derive an operational formula for the matter correlators in the Newtonian limit to be used in actual data sets. In order to describe the geometrical nature of the wide angle RSD effect on Fourier space, we extend the formalism developed in configuration space to Fourier space withoutmore » relying on a plane-parallel approximation, but under the extra assumption of no bias evolution. We then recover the plane-parallel limit not only in configuration space where the geometry is simpler, but also in Fourier space, and we exhibit the first corrections that should be included in large surveys as a perturbative expansion over the plane-parallel results. We finally compare our results to existing literature, and show explicitly how they are related.« less
Trojak, Benoit; Soudry-Faure, Agnès; Abello, Nicolas; Carpentier, Maud; Jonval, Lysiane; Allard, Coralie; Sabsevari, Foroogh; Blaise, Emilie; Ponavoy, Eddy; Bonin, Bernard; Meille, Vincent; Chauvet-Gelinier, Jean-Christophe
2016-05-17
Approximately 15 million persons in the European Union and 10 million persons in the USA are alcohol-dependent. The global burden of disease and injury attributable to alcohol is considerable: worldwide, approximately one in 25 deaths in 2004 was caused by alcohol. At the same time, alcohol use disorders remain seriously undertreated. In this context, alternative or adjunctive therapies such as brain stimulation may play a prominent role. The early results of studies using transcranial direct current stimulation found that stimulations delivered to the dorsolateral prefrontal cortex result in a significant reduction of craving and an improvement of the decision-making processes in various additive disorders. We, therefore, hypothesize that transcranial direct current stimulation can lead to a decrease in alcohol consumption in patients suffering from alcohol use disorders. We report the protocol of a randomized, double-blind, placebo-controlled, parallel-group trial, to evaluate the efficacy of transcranial direct current stimulation on alcohol reduction in patients with an alcohol use disorder. The study will be conducted in 14 centers in France and Monaco. Altogether, 340 subjects over 18 years of age and diagnosed with an alcohol use disorder will be randomized to receive five consecutive twice-daily sessions of either active or placebo transcranial direct current stimulation. One session consists in delivering a current flow continuously (anode F4; cathode F3) twice for 13 minutes, with treatments separated by a rest interval of 20 min. Efficacy will be evaluated using the change from baseline (alcohol consumption during the 4 weeks before randomization) to 24 weeks in the total alcohol consumption and number of heavy drinking days. Secondary outcome measures will include alcohol craving, clinical and biological improvements, and the effects on mood and quality of life, as well as cognitive and safety assessments, and, for smokers, an assessment of the effects of transcranial direct current stimulation on tobacco consumption. Several studies have reported a beneficial effect of transcranial direct current stimulation on substance use disorders by reducing craving, impulsivity, and risk-taking behavior, and suggest that transcranial direct current stimulation may be a promising treatment in addiction. However, to date, no studies have included sufficiently large samples and sufficient follow-up to confirm the hypothesis. Results from this large randomized controlled trial will give a better overview of the therapeutic potential of transcranial direct current stimulation in alcohol use disorders. Clinical Trials Gov, NCT02505126 (registration date: July 15 2015).
NASA Astrophysics Data System (ADS)
Hong, Ie-Hong; Hsu, Hsin-Zan
2018-03-01
The layered antiferromagnetism of parallel nanowire (NW) arrays self-assembled on Si(110) have been observed at room temperature by direct imaging of both the topographies and magnetic domains using spin-polarized scanning tunneling microscopy/spectroscopy (SP-STM/STS). The topographic STM images reveal that the self-assembled unidirectional and parallel NiSi NWs grow into the Si(110) substrate along the [\\bar{1}10] direction (i.e. the endotaxial growth) and exhibit multiple-layer growth. The spatially-resolved SP-STS maps show that these parallel NiSi NWs of different heights produce two opposite magnetic domains, depending on the heights of either even or odd layers in the layer stack of the NiSi NWs. This layer-wise antiferromagnetic structure can be attributed to an antiferromagnetic interlayer exchange coupling between the adjacent layers in the multiple-layer NiSi NW with a B2 (CsCl-type) crystal structure. Such an endotaxial heterostructure of parallel magnetic NiSi NW arrays with a layered antiferromagnetic ordering in Si(110) provides a new and important perspective for the development of novel Si-based spintronic nanodevices.
Characterization of robotics parallel algorithms and mapping onto a reconfigurable SIMD machine
NASA Technical Reports Server (NTRS)
Lee, C. S. G.; Lin, C. T.
1989-01-01
The kinematics, dynamics, Jacobian, and their corresponding inverse computations are six essential problems in the control of robot manipulators. Efficient parallel algorithms for these computations are discussed and analyzed. Their characteristics are identified and a scheme on the mapping of these algorithms to a reconfigurable parallel architecture is presented. Based on the characteristics including type of parallelism, degree of parallelism, uniformity of the operations, fundamental operations, data dependencies, and communication requirement, it is shown that most of the algorithms for robotic computations possess highly regular properties and some common structures, especially the linear recursive structure. Moreover, they are well-suited to be implemented on a single-instruction-stream multiple-data-stream (SIMD) computer with reconfigurable interconnection network. The model of a reconfigurable dual network SIMD machine with internal direct feedback is introduced. A systematic procedure internal direct feedback is introduced. A systematic procedure to map these computations to the proposed machine is presented. A new scheduling problem for SIMD machines is investigated and a heuristic algorithm, called neighborhood scheduling, that reorders the processing sequence of subtasks to reduce the communication time is described. Mapping results of a benchmark algorithm are illustrated and discussed.
Utilizing GPUs to Accelerate Turbomachinery CFD Codes
NASA Technical Reports Server (NTRS)
MacCalla, Weylin; Kulkarni, Sameer
2016-01-01
GPU computing has established itself as a way to accelerate parallel codes in the high performance computing world. This work focuses on speeding up APNASA, a legacy CFD code used at NASA Glenn Research Center, while also drawing conclusions about the nature of GPU computing and the requirements to make GPGPU worthwhile on legacy codes. Rewriting and restructuring of the source code was avoided to limit the introduction of new bugs. The code was profiled and investigated for parallelization potential, then OpenACC directives were used to indicate parallel parts of the code. The use of OpenACC directives was not able to reduce the runtime of APNASA on either the NVIDIA Tesla discrete graphics card, or the AMD accelerated processing unit. Additionally, it was found that in order to justify the use of GPGPU, the amount of parallel work being done within a kernel would have to greatly exceed the work being done by any one portion of the APNASA code. It was determined that in order for an application like APNASA to be accelerated on the GPU, it should not be modular in nature, and the parallel portions of the code must contain a large portion of the code's computation time.
GRADSPMHD: A parallel MHD code based on the SPH formalism
NASA Astrophysics Data System (ADS)
Vanaverbeke, S.; Keppens, R.; Poedts, S.
2014-03-01
We present GRADSPMHD, a completely Lagrangian parallel magnetohydrodynamics code based on the SPH formalism. The implementation of the equations of SPMHD in the “GRAD-h” formalism assembles known results, including the derivation of the discretized MHD equations from a variational principle, the inclusion of time-dependent artificial viscosity, resistivity and conductivity terms, as well as the inclusion of a mixed hyperbolic/parabolic correction scheme for satisfying the ∇ṡB→ constraint on the magnetic field. The code uses a tree-based formalism for neighbor finding and can optionally use the tree code for computing the self-gravity of the plasma. The structure of the code closely follows the framework of our parallel GRADSPH FORTRAN 90 code which we added previously to the CPC program library. We demonstrate the capabilities of GRADSPMHD by running 1, 2, and 3 dimensional standard benchmark tests and we find good agreement with previous work done by other researchers. The code is also applied to the problem of simulating the magnetorotational instability in 2.5D shearing box tests as well as in global simulations of magnetized accretion disks. We find good agreement with available results on this subject in the literature. Finally, we discuss the performance of the code on a parallel supercomputer with distributed memory architecture. Catalogue identifier: AERP_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AERP_v1_0.html Program obtainable from: CPC Program Library, Queen’s University, Belfast, N. Ireland Licensing provisions: Standard CPC licence, http://cpc.cs.qub.ac.uk/licence/licence.html No. of lines in distributed program, including test data, etc.: 620503 No. of bytes in distributed program, including test data, etc.: 19837671 Distribution format: tar.gz Programming language: FORTRAN 90/MPI. Computer: HPC cluster. Operating system: Unix. Has the code been vectorized or parallelized?: Yes, parallelized using MPI. RAM: ˜30 MB for a Sedov test including 15625 particles on a single CPU. Classification: 12. Nature of problem: Evolution of a plasma in the ideal MHD approximation. Solution method: The equations of magnetohydrodynamics are solved using the SPH method. Running time: The test provided takes approximately 20 min using 4 processors.
NASA Technical Reports Server (NTRS)
Heflinger, L. O.
1970-01-01
In holographic interferometry a small movement of apparatus between exposures causes the background of the reconstructed scene to be covered with interference fringes approximately parallel to each other. The three-dimensional quality of the holographic image is allowable since a mathematical model will give the location of the fringes.
Time-frequency model for echo-delay resolution in wideband biosonar.
Neretti, Nicola; Sanderson, Mark I; Intrator, Nathan; Simmons, James A
2003-04-01
A time/frequency model of the bat's auditory system was developed to examine the basis for the fine (approximately 2 micros) echo-delay resolution of big brown bats (Eptesicus fuscus), and its performance at resolving closely spaced FM sonar echoes in the bat's 20-100-kHz band at different signal-to-noise ratios was computed. The model uses parallel bandpass filters spaced over this band to generate envelopes that individually can have much lower bandwidth than the bat's ultrasonic sonar sounds and still achieve fine delay resolution. Because fine delay separations are inside the integration time of the model's filters (approximately 250-300 micros), resolving them means using interference patterns along the frequency dimension (spectral peaks and notches). The low bandwidth content of the filter outputs is suitable for relay of information to higher auditory areas that have intrinsically poor temporal response properties. If implemented in fully parallel analog-digital hardware, the model is computationally extremely efficient and would improve resolution in military and industrial sonar receivers.
Stochastic bifurcations in the nonlinear parallel Ising model.
Bagnoli, Franco; Rechtman, Raúl
2016-11-01
We investigate the phase transitions of a nonlinear, parallel version of the Ising model, characterized by an antiferromagnetic linear coupling and ferromagnetic nonlinear one. This model arises in problems of opinion formation. The mean-field approximation shows chaotic oscillations, by changing the couplings or the connectivity. The spatial model shows bifurcations in the average magnetization, similar to that seen in the mean-field approximation, induced by the change of the topology, after rewiring short-range to long-range connection, as predicted by the small-world effect. These coherent periodic and chaotic oscillations of the magnetization reflect a certain degree of synchronization of the spins, induced by long-range couplings. Similar bifurcations may be induced in the randomly connected model by changing the couplings or the connectivity and also the dilution (degree of asynchronism) of the updating. We also examined the effects of inhomogeneity, mixing ferromagnetic and antiferromagnetic coupling, which induces an unexpected bifurcation diagram with a "bubbling" behavior, as also happens for dilution.
NASA Astrophysics Data System (ADS)
Hadjidoukas, P. E.; Angelikopoulos, P.; Papadimitriou, C.; Koumoutsakos, P.
2015-03-01
We present Π4U, an extensible framework, for non-intrusive Bayesian Uncertainty Quantification and Propagation (UQ+P) of complex and computationally demanding physical models, that can exploit massively parallel computer architectures. The framework incorporates Laplace asymptotic approximations as well as stochastic algorithms, along with distributed numerical differentiation and task-based parallelism for heterogeneous clusters. Sampling is based on the Transitional Markov Chain Monte Carlo (TMCMC) algorithm and its variants. The optimization tasks associated with the asymptotic approximations are treated via the Covariance Matrix Adaptation Evolution Strategy (CMA-ES). A modified subset simulation method is used for posterior reliability measurements of rare events. The framework accommodates scheduling of multiple physical model evaluations based on an adaptive load balancing library and shows excellent scalability. In addition to the software framework, we also provide guidelines as to the applicability and efficiency of Bayesian tools when applied to computationally demanding physical models. Theoretical and computational developments are demonstrated with applications drawn from molecular dynamics, structural dynamics and granular flow.
A design procedure for the phase-controlled parallel-loaded resonant inverter
NASA Technical Reports Server (NTRS)
King, Roger J.
1989-01-01
High-frequency-link power conversion and distribution based on a resonant inverter (RI) has been recently proposed. The design of several topologies is reviewed, and a simple approximate design procedure is developed for the phase-controlled parallel-loaded RI. This design procedure seeks to ensure the benefits of resonant conversion and is verified by data from a laboratory 2.5 kVA, 20-kHz converter. A simple phasor analysis is introduced as a useful approximation for design purposes. The load is considered to be a linear impedance (or an ac current sink). The design procedure is verified using a 2.5-kVA 20-kHz RI. Also obtained are predictable worst-case ratings for each component of the resonant tank circuit and the inverter switches. For a given load VA requirement, below-resonance operation is found to result in a significantly lower tank VA requirement. Under transient conditions such as load short-circuit, a reversal of the expected commutation sequence is possible.
Parallel Preconditioning for CFD Problems on the CM-5
NASA Technical Reports Server (NTRS)
Simon, Horst D.; Kremenetsky, Mark D.; Richardson, John; Lasinski, T. A. (Technical Monitor)
1994-01-01
Up to today, preconditioning methods on massively parallel systems have faced a major difficulty. The most successful preconditioning methods in terms of accelerating the convergence of the iterative solver such as incomplete LU factorizations are notoriously difficult to implement on parallel machines for two reasons: (1) the actual computation of the preconditioner is not very floating-point intensive, but requires a large amount of unstructured communication, and (2) the application of the preconditioning matrix in the iteration phase (i.e. triangular solves) are difficult to parallelize because of the recursive nature of the computation. Here we present a new approach to preconditioning for very large, sparse, unsymmetric, linear systems, which avoids both difficulties. We explicitly compute an approximate inverse to our original matrix. This new preconditioning matrix can be applied most efficiently for iterative methods on massively parallel machines, since the preconditioning phase involves only a matrix-vector multiplication, with possibly a dense matrix. Furthermore the actual computation of the preconditioning matrix has natural parallelism. For a problem of size n, the preconditioning matrix can be computed by solving n independent small least squares problems. The algorithm and its implementation on the Connection Machine CM-5 are discussed in detail and supported by extensive timings obtained from real problem data.
Efficient partitioning and assignment on programs for multiprocessor execution
NASA Technical Reports Server (NTRS)
Standley, Hilda M.
1993-01-01
The general problem studied is that of segmenting or partitioning programs for distribution across a multiprocessor system. Efficient partitioning and the assignment of program elements are of great importance since the time consumed in this overhead activity may easily dominate the computation, effectively eliminating any gains made by the use of the parallelism. In this study, the partitioning of sequentially structured programs (written in FORTRAN) is evaluated. Heuristics, developed for similar applications are examined. Finally, a model for queueing networks with finite queues is developed which may be used to analyze multiprocessor system architectures with a shared memory approach to the problem of partitioning. The properties of sequentially written programs form obstacles to large scale (at the procedure or subroutine level) parallelization. Data dependencies of even the minutest nature, reflecting the sequential development of the program, severely limit parallelism. The design of heuristic algorithms is tied to the experience gained in the parallel splitting. Parallelism obtained through the physical separation of data has seen some success, especially at the data element level. Data parallelism on a grander scale requires models that accurately reflect the effects of blocking caused by finite queues. A model for the approximation of the performance of finite queueing networks is developed. This model makes use of the decomposition approach combined with the efficiency of product form solutions.
NASA Technical Reports Server (NTRS)
Landis, W. J.; Song, M. J.; Leith, A.; McEwen, L.; McEwen, B. F.
1993-01-01
To define the ultrastructural accommodation of mineral crystals by collagen fibrils and other organic matrix components during vertebrate calcification, electron microscopic 3-D reconstructions were generated from the normally mineralizing leg tendons from the domestic turkey, Meleagris gallopavo. Embedded specimens containing initial collagen mineralizing sites were cut into 0.5-micron-thick sections and viewed and photographed at 1.0 MV in the Albany AEI-EM7 high-voltage electron microscope. Tomographic 3-D reconstructions were computed from a 2 degree tilt series of micrographs taken over a minimum angular range of +/- 60 degrees. Reconstructions of longitudinal tendon profiles confirm the presence of irregularly shaped mineral platelets, whose crystallographic c-axes are oriented generally parallel to one another and directed along the collagen long axes. The reconstructions also corroborate observations of a variable crystal length (up to 170 nm measured along crystallographic c-axes), the presence of crystals initially in either the hole or overlap zones of collagen, and crystal growth in the c-axis direction beyond these zones into adjacent overlap and other hole regions. Tomography shows for the first time that crystal width varies (30-45 nm) but crystal thickness is uniform (approximately 4-6 nm at the resolution limit of tomography); more crystals are located in the collagen hole zones than in the overlap regions at the earliest stages of tendon mineralization; the crystallographic c-axes of the platelets lie within +/- 15-20 degrees of one another rather than being perfectly parallel; adjacent platelets are spatially separated by a minimum of 4.2 +/- 1.0 nm; crystals apparently fuse in coplanar alignment to form larger platelets; development of crystals in width occurs to dimensions beyond single collagen hole zones; and a thin envelope of organic origin may be present along or just beneath the surfaces of individual mineral platelets. Implicit in the results is that the formation of crystals occurs at different sites and times by independent nucleation events in local regions of collagen. These data provide the first direct visual evidence from 3-D imaging describing the size, shape, orientation, and growth of mineral crystals in association with collagen of a normally mineralizing vertebrate tissue. They support concepts that c-axial crystal growth is unhindered by collage hole zone dimensions, that crystals are organized in the tendon in a series of generally parallel platelets, and that crystal growth in width across collagen fibrils may follow channels or grooves formed by adjacent hole zones in register.
Multitasking the three-dimensional transport code TORT on CRAY platforms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Azmy, Y.Y.; Barnett, D.A.; Burre, C.A.
1996-04-01
The multitasking options in the three-dimensional neutral particle transport code TORT originally implemented for Cray`s CTSS operating system are revived and extended to run on Cray Y/MP and C90 computers using the UNICOS operating system. These include two coarse-grained domain decompositions; across octants, and across directions within an octant, termed Octant Parallel (OP), and Direction Parallel (DP), respectively. Parallel performance of the DP is significantly enhanced by increasing the task grain size and reducing load imbalance via dynamic scheduling of the discrete angles among the participating tasks. Substantial Wall Clock speedup factors, approaching 4.5 using 8 tasks, have been measuredmore » in a time-sharing environment, and generally depend on the test problem specifications, number of tasks, and machine loading during execution.« less
Measures of three-dimensional anisotropy and intermittency in strong Alfvénic turbulence
NASA Astrophysics Data System (ADS)
Mallet, A.; Schekochihin, A. A.; Chandran, B. D. G.; Chen, C. H. K.; Horbury, T. S.; Wicks, R. T.; Greenan, C. C.
2016-06-01
We measure the local anisotropy of numerically simulated strong Alfvénic turbulence with respect to two local, physically relevant directions: along the local mean magnetic field and along the local direction of one of the fluctuating Elsasser fields. We find significant scaling anisotropy with respect to both these directions: the fluctuations are `ribbon-like' - statistically, they are elongated along both the mean magnetic field and the fluctuating field. The latter form of anisotropy is due to scale-dependent alignment of the fluctuating fields. The intermittent scalings of the nth-order conditional structure functions in the direction perpendicular to both the local mean field and the fluctuations agree well with the theory of Chandran, Schekochihin & Mallet, while the parallel scalings are consistent with those implied by the critical-balance conjecture. We quantify the relationship between the perpendicular scalings and those in the fluctuation and parallel directions, and find that the scaling exponent of the perpendicular anisotropy (I.e. of the aspect ratio of the Alfvénic structures in the plane perpendicular to the mean magnetic field) depends on the amplitude of the fluctuations. This is shown to be equivalent to the anticorrelation of fluctuation amplitude and alignment at each scale. The dependence of the anisotropy on amplitude is shown to be more significant for the anisotropy between the perpendicular and fluctuation-direction scales than it is between the perpendicular and parallel scales.
NASA Astrophysics Data System (ADS)
Puzyrev, Vladimir; Torres-Verdín, Carlos; Calo, Victor
2018-05-01
The interpretation of resistivity measurements acquired in high-angle and horizontal wells is a critical technical problem in formation evaluation. We develop an efficient parallel 3-D inversion method to estimate the spatial distribution of electrical resistivity in the neighbourhood of a well from deep directional electromagnetic induction measurements. The methodology places no restriction on the spatial distribution of the electrical resistivity around arbitrary well trajectories. The fast forward modelling of triaxial induction measurements performed with multiple transmitter-receiver configurations employs a parallel direct solver. The inversion uses a pre-conditioned gradient-based method whose accuracy is improved using the Wolfe conditions to estimate optimal step lengths at each iteration. The large transmitter-receiver offsets, used in the latest generation of commercial directional resistivity tools, improve the depth of investigation to over 30 m from the wellbore. Several challenging synthetic examples confirm the feasibility of the full 3-D inversion-based interpretations for these distances, hence enabling the integration of resistivity measurements with seismic amplitude data to improve the forecast of the petrophysical and fluid properties. Employing parallel direct solvers for the triaxial induction problems allows for large reductions in computational effort, thereby opening the possibility to invert multiposition 3-D data in practical CPU times.
NASA Technical Reports Server (NTRS)
Morgan, Philip E.
2004-01-01
This final report contains reports of research related to the tasks "Scalable High Performance Computing: Direct and Lark-Eddy Turbulent FLow Simulations Using Massively Parallel Computers" and "Devleop High-Performance Time-Domain Computational Electromagnetics Capability for RCS Prediction, Wave Propagation in Dispersive Media, and Dual-Use Applications. The discussion of Scalable High Performance Computing reports on three objectives: validate, access scalability, and apply two parallel flow solvers for three-dimensional Navier-Stokes flows; develop and validate a high-order parallel solver for Direct Numerical Simulations (DNS) and Large Eddy Simulation (LES) problems; and Investigate and develop a high-order Reynolds averaged Navier-Stokes turbulence model. The discussion of High-Performance Time-Domain Computational Electromagnetics reports on five objectives: enhancement of an electromagnetics code (CHARGE) to be able to effectively model antenna problems; utilize lessons learned in high-order/spectral solution of swirling 3D jets to apply to solving electromagnetics project; transition a high-order fluids code, FDL3DI, to be able to solve Maxwell's Equations using compact-differencing; develop and demonstrate improved radiation absorbing boundary conditions for high-order CEM; and extend high-order CEM solver to address variable material properties. The report also contains a review of work done by the systems engineer.
A fast pulse design for parallel excitation with gridding conjugate gradient.
Feng, Shuo; Ji, Jim
2013-01-01
Parallel excitation (pTx) is recognized as a crucial technique in high field MRI to address the transmit field inhomogeneity problem. However, it can be time consuming to design pTx pulses which is not desirable. In this work, we propose a pulse design with gridding conjugate gradient (CG) based on the small-tip-angle approximation. The two major time consuming matrix-vector multiplications are substituted by two operators which involves with FFT and gridding only. Simulation results have shown that the proposed method is 3 times faster than conventional method and the memory cost is reduced by 1000 times.
NASA Technical Reports Server (NTRS)
Kirk, R. G.; Nicholas, J. C.; Donald, G. H.; Murphy, R. C.
1980-01-01
The summary of a complete analytical design evaluation of an existing parallel flow compressor is presented and a field vibration problem that manifested itself as a subsynchronous vibration that tracked at approximately 2/3 of compressor speed is reviewed. The comparison of predicted and observed peak response speeds, frequency spectrum content, and the performance of the bearing-seal systems are presented as the events of the field problem are reviewed. Conclusions and recommendations are made as to the degree of accuracy of the analytical techniques used to evaluate the compressor design.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lasuik, J.; Shalchi, A., E-mail: andreasm4@yahoo.com
Recently, a new theory for the transport of energetic particles across a mean magnetic field was presented. Compared to other nonlinear theories the new approach has the advantage that it provides a full time-dependent description of the transport. Furthermore, a diffusion approximation is no longer part of that theory. The purpose of this paper is to combine this new approach with a time-dependent model for parallel transport and different turbulence configurations in order to explore the parameter regimes for which we get ballistic transport, compound subdiffusion, and normal Markovian diffusion.
Parallel inhomogeneity and the Alfven resonance. 1: Open field lines
NASA Technical Reports Server (NTRS)
Hansen, P. J.; Harrold, B. G.
1994-01-01
In light of a recent demonstration of the general nonexistence of a singularity at the Alfven resonance in cold, ideal, linearized magnetohydrodynamics, we examine the effect of a small density gradient parallel to uniform, open ambient magnetic field lines. To lowest order, energy deposition is quantitatively unaffected but occurs continuously over a thickened layer. This effect is illustrated in a numerical analysis of a plasma sheet boundary layer model with perfectly absorbing boundary conditions. Consequences of the results are discussed, both for the open field line approximation and for the ensuing closed field line analysis.
Approximate Computing Techniques for Iterative Graph Algorithms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Panyala, Ajay R.; Subasi, Omer; Halappanavar, Mahantesh
Approximate computing enables processing of large-scale graphs by trading off quality for performance. Approximate computing techniques have become critical not only due to the emergence of parallel architectures but also the availability of large scale datasets enabling data-driven discovery. Using two prototypical graph algorithms, PageRank and community detection, we present several approximate computing heuristics to scale the performance with minimal loss of accuracy. We present several heuristics including loop perforation, data caching, incomplete graph coloring and synchronization, and evaluate their efficiency. We demonstrate performance improvements of up to 83% for PageRank and up to 450x for community detection, with lowmore » impact of accuracy for both the algorithms. We expect the proposed approximate techniques will enable scalable graph analytics on data of importance to several applications in science and their subsequent adoption to scale similar graph algorithms.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nelson, Andrew F.; Wetzstein, M.; Naab, T.
2009-10-01
We continue our presentation of VINE. In this paper, we begin with a description of relevant architectural properties of the serial and shared memory parallel computers on which VINE is intended to run, and describe their influences on the design of the code itself. We continue with a detailed description of a number of optimizations made to the layout of the particle data in memory and to our implementation of a binary tree used to access that data for use in gravitational force calculations and searches for smoothed particle hydrodynamics (SPH) neighbor particles. We describe the modifications to the codemore » necessary to obtain forces efficiently from special purpose 'GRAPE' hardware, the interfaces required to allow transparent substitution of those forces in the code instead of those obtained from the tree, and the modifications necessary to use both tree and GRAPE together as a fused GRAPE/tree combination. We conclude with an extensive series of performance tests, which demonstrate that the code can be run efficiently and without modification in serial on small workstations or in parallel using the OpenMP compiler directives on large-scale, shared memory parallel machines. We analyze the effects of the code optimizations and estimate that they improve its overall performance by more than an order of magnitude over that obtained by many other tree codes. Scaled parallel performance of the gravity and SPH calculations, together the most costly components of most simulations, is nearly linear up to at least 120 processors on moderate sized test problems using the Origin 3000 architecture, and to the maximum machine sizes available to us on several other architectures. At similar accuracy, performance of VINE, used in GRAPE-tree mode, is approximately a factor 2 slower than that of VINE, used in host-only mode. Further optimizations of the GRAPE/host communications could improve the speed by as much as a factor of 3, but have not yet been implemented in VINE. Finally, we find that although parallel performance on small problems may reach a plateau beyond which more processors bring no additional speedup, performance never decreases, a factor important for running large simulations on many processors with individual time steps, where only a small fraction of the total particles require updates at any given moment.« less
Integrated parallel reception, excitation, and shimming (iPRES).
Han, Hui; Song, Allen W; Truong, Trong-Kha
2013-07-01
To develop a new concept for a hardware platform that enables integrated parallel reception, excitation, and shimming. This concept uses a single coil array rather than separate arrays for parallel excitation/reception and B0 shimming. It relies on a novel design that allows a radiofrequency current (for excitation/reception) and a direct current (for B0 shimming) to coexist independently in the same coil. Proof-of-concept B0 shimming experiments were performed with a two-coil array in a phantom, whereas B0 shimming simulations were performed with a 48-coil array in the human brain. Our experiments show that individually optimized direct currents applied in each coil can reduce the B0 root-mean-square error by 62-81% and minimize distortions in echo-planar images. The simulations show that dynamic shimming with the 48-coil integrated parallel reception, excitation, and shimming array can reduce the B0 root-mean-square error in the prefrontal and temporal regions by 66-79% as compared with static second-order spherical harmonic shimming and by 12-23% as compared with dynamic shimming with a 48-coil conventional shim array. Our results demonstrate the feasibility of the integrated parallel reception, excitation, and shimming concept to perform parallel excitation/reception and B0 shimming with a unified coil system as well as its promise for in vivo applications. Copyright © 2013 Wiley Periodicals, Inc.
Crustal origin of trench-parallel shear-wave fast polarizations in the Central Andes
NASA Astrophysics Data System (ADS)
Wölbern, I.; Löbl, U.; Rümpker, G.
2014-04-01
In this study, SKS and local S phases are analyzed to investigate variations of shear-wave splitting parameters along two dense seismic profiles across the central Andean Altiplano and Puna plateaus. In contrast to previous observations, the vast majority of the measurements reveal fast polarizations sub-parallel to the subduction direction of the Nazca plate with delay times between 0.3 and 1.2 s. Local phases show larger variations of fast polarizations and exhibit delay times ranging between 0.1 and 1.1 s. Two 70 km and 100 km wide sections along the Altiplano profile exhibit larger delay times and are characterized by fast polarizations oriented sub-parallel to major fault zones. Based on finite-difference wavefield calculations for anisotropic subduction zone models we demonstrate that the observations are best explained by fossil slab anisotropy with fast symmetry axes oriented sub-parallel to the slab movement in combination with a significant component of crustal anisotropy of nearly trench-parallel fast-axis orientation. From the modeling we exclude a sub-lithospheric origin of the observed strong anomalies due to the short-scale variations of the fast polarizations. Instead, our results indicate that anisotropy in the Central Andes generally reflects the direction of plate motion while the observed trench-parallel fast polarizations likely originate in the continental crust above the subducting slab.
Charge Transport in Metal Oxides: A Theoretical Study of Hematite α-Fe2O3
DOE Office of Scientific and Technical Information (OSTI.GOV)
Iordanova, Nellie I.; Dupuis, Michel; Rosso, Kevin M.
2005-04-08
Transport of conduction electrons and holes through the lattice of ??Fe2O3 (hematite) is modeled as a valence alternation of iron cations using ab initio electronic structure calculations and electron transfer theory. Experimental studies have shown that the conductivity along the (001) basal plane is four orders of magnitude larger than the conductivity along the [001] direction. In the context of the small polaron model, a cluster approach was used to compute quantities controlling the mobility of localized electrons and holes, i.e. the reorganization energy and the electronic coupling matrix element that enter Marcus? theory. The calculation of the electronic couplingmore » followed the Generalized Mulliken-Hush approach using the complete active space self-consistent field (CASSCF) method. Our findings demonstrate an approximately three orders of magnitude anisotropy in both electron and hole mobility between directions perpendicular and parallel to the c-axis, in good accord with experimental data. The anisotropy arises from the slowness of both electron and hole mobility across basal oxygen planes relative to that within iron bi-layers between basal oxygen planes. Interestingly, for elementary reaction steps along either of the directions considered, there is only approximately one order of magnitude difference in mobility between electrons and holes, in contrast to accepted classical arguments. Our findings indicate that the most important quantity underlying mobility differences is the electronic coupling, albeit the reorganization energy contributes as well. The large values computed for the electronic coupling suggest that charge transport reactions in hematite are adiabatic in nature. The electronic coupling is found to depend on both the superexchange interaction through the bridging oxygen atoms and the d-shell electron spin coupling within the Fe?Fe donor-acceptor pair, while the reorganization energy is essentially independent of the electron spin coupling.« less