Sample records for parallel pde-based simulations

  1. Multithreaded Model for Dynamic Load Balancing Parallel Adaptive PDE Computations

    NASA Technical Reports Server (NTRS)

    Chrisochoides, Nikos

    1995-01-01

    We present a multithreaded model for the dynamic load-balancing of numerical, adaptive computations required for the solution of Partial Differential Equations (PDEs) on multiprocessors. Multithreading is used as a means of exploring concurrency at the processor level in order to tolerate synchronization costs inherent to traditional (non-threaded) parallel adaptive PDE solvers. Our preliminary analysis for parallel, adaptive PDE solvers indicates that multithreading can be used as a mechanism to mask overheads required for the dynamic balancing of processor workloads with computations required for the actual numerical solution of the PDEs. Also, multithreading can simplify the implementation of dynamic load-balancing algorithms, a task that is very difficult for traditional data parallel adaptive PDE computations. Unfortunately, multithreading does not always simplify program complexity, often makes code reuse difficult, and increases software complexity.

  2. Parallel hyperbolic PDE simulation on clusters: Cell versus GPU

    NASA Astrophysics Data System (ADS)

    Rostrup, Scott; De Sterck, Hans

    2010-12-01

    Increasingly, high-performance computing is looking towards data-parallel computational devices to enhance computational performance. Two technologies that have received significant attention are IBM's Cell Processor and NVIDIA's CUDA programming model for graphics processing unit (GPU) computing. In this paper we investigate the acceleration of parallel hyperbolic partial differential equation simulation on structured grids with explicit time integration on clusters with Cell and GPU backends. The message passing interface (MPI) is used for communication between nodes at the coarsest level of parallelism. Optimizations of the simulation code at the several finer levels of parallelism that the data-parallel devices provide are described in terms of data layout, data flow and data-parallel instructions. Optimized Cell and GPU performance are compared with reference code performance on a single x86 central processing unit (CPU) core in single and double precision. We further compare the CPU, Cell and GPU platforms on a chip-to-chip basis, and compare performance on single cluster nodes with two CPUs, two Cell processors or two GPUs in a shared memory configuration (without MPI). We finally compare performance on clusters with 32 CPUs, 32 Cell processors, and 32 GPUs using MPI. Our GPU cluster results use NVIDIA Tesla GPUs with GT200 architecture, but some preliminary results on recently introduced NVIDIA GPUs with the next-generation Fermi architecture are also included. This paper provides computational scientists and engineers who are considering porting their codes to accelerator environments with insight into how structured grid based explicit algorithms can be optimized for clusters with Cell and GPU accelerators. It also provides insight into the speed-up that may be gained on current and future accelerator architectures for this class of applications. Program summary. Program title: SWsolver. Catalogue identifier: AEGY_v1_0.
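
    As an illustration of the kind of structured-grid explicit scheme this abstract refers to, the following is a minimal sketch of a Lax-Friedrichs step for the 1D shallow water equations on a periodic grid. It is not the SWsolver code: the scheme, the initial condition, and all names (`flux`, `initial_depth`, `lax_friedrichs_step`, `simulate`) are assumptions chosen for brevity. Each cell update reads only its two neighbours, which is the property that makes such schemes map naturally onto data-parallel devices.

```python
import math

G = 9.81  # gravitational acceleration (m/s^2)


def flux(h, hu):
    """Physical flux of the 1D shallow water equations for the state (h, hu)."""
    u = hu / h
    return hu, hu * u + 0.5 * G * h * h


def initial_depth(n, length):
    """Hypothetical initial condition: a Gaussian bump on still water of depth 1."""
    dx = length / n
    return [1.0 + 0.2 * math.exp(-((i + 0.5) * dx - length / 2) ** 2) for i in range(n)]


def lax_friedrichs_step(h, hu, dx, dt):
    """Advance one explicit time step on a periodic structured grid."""
    n = len(h)
    new_h, new_hu = [0.0] * n, [0.0] * n
    for i in range(n):  # every cell is independent of the others in this loop
        left, right = (i - 1) % n, (i + 1) % n
        f_left = flux(h[left], hu[left])
        f_right = flux(h[right], hu[right])
        new_h[i] = 0.5 * (h[left] + h[right]) - dt / (2 * dx) * (f_right[0] - f_left[0])
        new_hu[i] = 0.5 * (hu[left] + hu[right]) - dt / (2 * dx) * (f_right[1] - f_left[1])
    return new_h, new_hu


def simulate(n=200, length=10.0, t_end=0.5, cfl=0.4):
    """Integrate to t_end, choosing dt from the CFL condition at every step."""
    dx = length / n
    h = initial_depth(n, length)
    hu = [0.0] * n
    t = 0.0
    while t_end - t > 1e-12:
        speed = max(abs(m / d) + math.sqrt(G * d) for d, m in zip(h, hu))
        dt = min(cfl * dx / speed, t_end - t)
        h, hu = lax_friedrichs_step(h, hu, dx, dt)
        t += dt
    return h, hu, dx
```

    On a periodic grid the flux differences telescope, so the total water volume `sum(h) * dx` is conserved up to rounding, which gives a quick sanity check for any port of the inner loop to an accelerator.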

  3. Using CLIPS in the domain of knowledge-based massively parallel programming

    NASA Technical Reports Server (NTRS)

    Dvorak, Jiri J.

    1994-01-01

    The Program Development Environment (PDE) is a tool for massively parallel programming of distributed-memory architectures. Adopting a knowledge-based approach, the PDE eliminates the complexity introduced by parallel hardware with distributed memory and offers complete transparency in respect of parallelism exploitation. The knowledge-based part of the PDE is realized in CLIPS. Its principal task is to find an efficient parallel realization of the application specified by the user in a comfortable, abstract, domain-oriented formalism. A large collection of fine-grain parallel algorithmic skeletons, represented as COOL objects in a tree hierarchy, contains the algorithmic knowledge. A hybrid knowledge base with rule modules and procedural parts, encoding expertise about application domain, parallel programming, software engineering, and parallel hardware, enables a high degree of automation in the software development process. In this paper, important aspects of the implementation of the PDE using CLIPS and COOL are shown, including the embedding of CLIPS with C++-based parts of the PDE. The appropriateness of the chosen approach and of the CLIPS language for knowledge-based software engineering are discussed.

  4. Program Code Generator for Cardiac Electrophysiology Simulation with Automatic PDE Boundary Condition Handling

    PubMed Central

    Punzalan, Florencio Rusty; Kunieda, Yoshitoshi; Amano, Akira

    2015-01-01

    Clinical and experimental studies involving human hearts can have certain limitations. Methods such as computer simulations can be an important alternative or supplemental tool. Physiological simulation at the tissue or organ level typically involves the handling of partial differential equations (PDEs). Boundary conditions and distributed parameters, such as those used in pharmacokinetics simulation, add to the complexity of the PDE solution. These factors can tailor PDE solutions and their corresponding program code to specific problems. Boundary condition and parameter changes in the customized code are usually error-prone and time-consuming. We propose a general approach for handling PDEs and boundary conditions in computational models using a replacement scheme for discretization. This study is an extension of a program generator that we introduced in a previous publication. The program generator can generate code for multi-cell simulations of cardiac electrophysiology. Improvements to the system allow it to handle simultaneous equations in the biological function model as well as implicit PDE numerical schemes. The replacement scheme involves substituting all partial differential terms with numerical solution equations. Once the model and boundary equations are discretized with the numerical solution scheme, instances of the equations are generated to undergo dependency analysis. The result of the dependency analysis is then used to generate the program code. The resulting program code is in the Java or C programming language. To validate the automatic handling of boundary conditions in the program code generator, we generated simulation code using the FHN, Luo-Rudy 1, and Hund-Rudy cell models and ran cell-to-cell coupling and action potential propagation simulations. One of the simulations is based on a published experiment and simulation results are compared with the experimental data. We conclude that the proposed program code generator can be used to

  5. Molecular Bases of PDE4D Inhibition by Memory-Enhancing GEBR Library Compounds.

    PubMed

    Prosdocimi, Tommaso; Mollica, Luca; Donini, Stefano; Semrau, Marta S; Lucarelli, Anna Paola; Aiolfi, Egidio; Cavalli, Andrea; Storici, Paola; Alfei, Silvana; Brullo, Chiara; Bruno, Olga; Parisini, Emilio

    2018-05-01

    Selected members of the large rolipram-related GEBR family of type 4 phosphodiesterase (PDE4) inhibitors have been shown to facilitate long-term potentiation and to improve memory functions without causing emetic-like behavior in rodents. Despite their micromolar-range binding affinities and their promising pharmacological and toxicological profiles, few if any structure-activity relationship studies have been performed to elucidate the molecular bases of their action. Here, we report the crystal structure of a number of GEBR library compounds in complex with the catalytic domain of PDE4D as well as their inhibitory profiles for both the long PDE4D3 isoform and the catalytic domain alone. Furthermore, we assessed the stability of the observed ligand conformations in the context of the intact enzyme using molecular dynamics simulations. The longer and more flexible ligands appear to be capable of forming contacts with the regulatory portion of the enzyme, thus possibly allowing some degree of selectivity between the different PDE4 isoforms.

  6. Scalable hierarchical PDE sampler for generating spatially correlated random fields using nonmatching meshes

    DOE PAGES

    Osborn, Sarah; Zulian, Patrick; Benson, Thomas; ...

    2018-01-30

    This work describes a domain embedding technique between two nonmatching meshes used for generating realizations of spatially correlated random fields with applications to large-scale sampling-based uncertainty quantification. The goal is to apply the multilevel Monte Carlo (MLMC) method for the quantification of output uncertainties of PDEs with random input coefficients on general and unstructured computational domains. We propose a highly scalable, hierarchical sampling method to generate realizations of a Gaussian random field on a given unstructured mesh by solving a reaction–diffusion PDE with a stochastic right-hand side. The stochastic PDE is discretized using the mixed finite element method on an embedded domain with a structured mesh, and then, the solution is projected onto the unstructured mesh. This work describes implementation details on how to efficiently transfer data from the structured and unstructured meshes at coarse levels, assuming that this can be done efficiently on the finest level. We investigate the efficiency and parallel scalability of the technique for the scalable generation of Gaussian random fields in three dimensions. An application of the MLMC method is presented for quantifying uncertainties of subsurface flow problems. Here, we demonstrate the scalability of the sampling method with nonmatching mesh embedding, coupled with a parallel forward model problem solver, for large-scale 3D MLMC simulations with up to 1.9·10^9 unknowns.
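
    The idea of drawing a Gaussian random field by solving a reaction–diffusion PDE with a stochastic right-hand side can be sketched in one dimension. The example below is only an illustration under stated assumptions: it swaps in a second-order finite-difference discretization with homogeneous Dirichlet boundaries in place of the mixed finite element method of the abstract, and the model equation (kappa^2 - d^2/dx^2) u = W, the parameter `kappa`, and all function names are hypothetical choices, not taken from the paper.

```python
import random


def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm for a tridiagonal system (inputs are not modified)."""
    n = len(diag)
    c = [0.0] * n
    d = [0.0] * n
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def sample_field(n=200, length=1.0, kappa=20.0, seed=0):
    """Draw one realization u solving (kappa^2 - d^2/dx^2) u = white noise."""
    rng = random.Random(seed)
    dx = length / (n + 1)          # n interior nodes, u = 0 at both ends
    off = -1.0 / dx**2
    diag = [kappa**2 + 2.0 / dx**2] * n
    lower = [off] * n              # lower[0] is unused by the solver
    upper = [off] * n              # upper[n - 1] is unused by the solver
    # Discrete white noise: independent normals scaled by 1/sqrt(dx).
    noise = [rng.gauss(0.0, 1.0) / dx**0.5 for _ in range(n)]
    return solve_tridiagonal(lower, diag, upper, noise), noise, dx


def max_residual(u, noise, dx, kappa):
    """Largest entry of |(kappa^2 - d^2/dx^2) u - noise| on the grid."""
    n = len(u)
    worst = 0.0
    for i in range(n):
        left = u[i - 1] if i > 0 else 0.0
        right = u[i + 1] if i < n - 1 else 0.0
        r = kappa**2 * u[i] + (2.0 * u[i] - left - right) / dx**2 - noise[i]
        worst = max(worst, abs(r))
    return worst
```

    Each sample costs one sparse linear solve, and the correlation length of the resulting field is controlled by `1 / kappa` in this sketch; that solve is the step a scalable sampler has to parallelize.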

  7. Scalable hierarchical PDE sampler for generating spatially correlated random fields using nonmatching meshes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Osborn, Sarah; Zulian, Patrick; Benson, Thomas

    This work describes a domain embedding technique between two nonmatching meshes used for generating realizations of spatially correlated random fields with applications to large-scale sampling-based uncertainty quantification. The goal is to apply the multilevel Monte Carlo (MLMC) method for the quantification of output uncertainties of PDEs with random input coefficients on general and unstructured computational domains. We propose a highly scalable, hierarchical sampling method to generate realizations of a Gaussian random field on a given unstructured mesh by solving a reaction–diffusion PDE with a stochastic right-hand side. The stochastic PDE is discretized using the mixed finite element method on an embedded domain with a structured mesh, and then, the solution is projected onto the unstructured mesh. This work describes implementation details on how to efficiently transfer data from the structured and unstructured meshes at coarse levels, assuming that this can be done efficiently on the finest level. We investigate the efficiency and parallel scalability of the technique for the scalable generation of Gaussian random fields in three dimensions. An application of the MLMC method is presented for quantifying uncertainties of subsurface flow problems. Here, we demonstrate the scalability of the sampling method with nonmatching mesh embedding, coupled with a parallel forward model problem solver, for large-scale 3D MLMC simulations with up to 1.9·10^9 unknowns.

  8. Simulation of Stochastic Processes by Coupled ODE-PDE

    NASA Technical Reports Server (NTRS)

    Zak, Michail

    2008-01-01

    A document discusses the emergence of randomness in solutions of coupled, fully deterministic ODE-PDE (ordinary differential equations-partial differential equations) due to failure of the Lipschitz condition as a new phenomenon. It is possible to exploit the special properties of ordinary differential equations (represented by an arbitrarily chosen, dynamical system) coupled with the corresponding Liouville equations (used to describe the evolution of initial uncertainties in terms of joint probability distribution) in order to simulate stochastic processes with the prescribed probability distributions. The important advantage of the proposed approach is that the simulation does not require a random-number generator.

  9. XRF map identification problems based on a PDE electrodeposition model

    NASA Astrophysics Data System (ADS)

    Sgura, Ivonne; Bozzini, Benedetto

    2017-04-01

    In this paper we focus on the following map identification problem (MIP): given a morphochemical reaction-diffusion (RD) PDE system modeling an electrodeposition process, we look for a time $t^*$, belonging to the transient dynamics, and a set of parameters $\mathbf{p}$, such that the PDE solution, for the morphology $h(x,y,t^*;\mathbf{p})$ and for the chemistry $\theta(x,y,t^*;\mathbf{p})$, approximates a given experimental map $M^*$. Towards this aim, we introduce a numerical algorithm using singular value decomposition (SVD) and Frobenius norm to give a measure of error distance between experimental maps for $h$ and $\theta$ and simulated solutions of the RD-PDE system on a fixed time integration interval. The technique proposed allows quantitative use of microspectroscopy images, such as XRF maps. Specifically, in this work we have modelled the morphology and manganese distributions of nanostructured components of innovative batteries and we have followed their changes resulting from ageing under operating conditions. The availability of quantitative information on space-time evolution of active materials in terms of model parameters will allow dramatic improvements in knowledge-based optimization of battery fabrication and operation.
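
    The time-identification part of such a problem can be sketched as a search over stored simulation snapshots for the one closest to the experimental map in the Frobenius norm. This is a simplified illustration, not the authors' algorithm: it omits the SVD step entirely, treats the parameter set as fixed, and the names `frobenius_distance` and `best_match` as well as the snapshot format (a list of `(time, 2D list)` pairs) are assumptions.

```python
import math


def frobenius_distance(a, b):
    """Relative Frobenius-norm distance ||a - b||_F / ||b||_F between two maps."""
    num = sum((x - y) ** 2 for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))
    den = sum(y ** 2 for row_b in b for y in row_b)
    return math.sqrt(num / den)


def best_match(snapshots, experimental_map):
    """Return (time, distance) of the simulated snapshot closest to the map."""
    return min(
        ((t, frobenius_distance(field, experimental_map)) for t, field in snapshots),
        key=lambda pair: pair[1],
    )
```

    A call such as `best_match(snapshots, measured)` returns the candidate time and its error; repeating the search over a grid of parameter values would extend the sketch towards the joint identification of time and parameters described above.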

  10. Pharmacophore Based Virtual Screening Approach to Identify Selective PDE4B Inhibitors

    PubMed Central

    Gaurav, Anand; Gautam, Vertika

    2017-01-01

    Phosphodiesterase 4 (PDE4) has been established as a promising target in asthma and chronic obstructive pulmonary disease. PDE4B subtype selective inhibitors are known to reduce the dose limiting adverse effect associated with non-selective PDE4B inhibitors. This makes the development of PDE4B subtype selective inhibitors a desirable research goal. To achieve this goal, a ligand based pharmacophore modeling approach is employed. Separate pharmacophore hypotheses for PDE4B and PDE4D inhibitors were generated using the HypoGen algorithm and 106 PDE4 inhibitors from the literature having thiopyrano[3,2-d]pyrimidine, 2-arylpyrimidine, and triazine skeletons. Suitable training and test sets were created using the molecules as per the guidelines available for the HypoGen program. The training set was used for hypothesis development while the test set was used for validation. Fisher validation was also used to test the significance of the developed hypothesis. The validated pharmacophore hypotheses for PDE4B and PDE4D inhibitors were used in sequential virtual screening of the ZINC database of drug-like molecules to identify selective PDE4B inhibitors. The hits were screened for their estimated activity and fit value. The top hit was subjected to docking into the active sites of PDE4B and PDE4D to confirm its selectivity for PDE4B. The hits are proposed to be evaluated further using in-vitro assays. PMID:29201082

  11. Multibus-based parallel processor for simulation

    NASA Technical Reports Server (NTRS)

    Ogrady, E. P.; Wang, C.-H.

    1983-01-01

    A Multibus-based parallel processor simulation system is described. The system is intended to serve as a vehicle for gaining hands-on experience, testing system and application software, and evaluating parallel processor performance during development of a larger system based on the horizontal/vertical-bus interprocessor communication mechanism. The prototype system consists of up to seven Intel iSBC 86/12A single-board computers which serve as processing elements, a multiple transmission controller (MTC) designed to support system operation, and an Intel Model 225 Microcomputer Development System which serves as the user interface and input/output processor. All components are interconnected by a Multibus/IEEE 796 bus. An important characteristic of the system is that it provides a mechanism for a processing element to broadcast data to other selected processing elements. This parallel transfer capability is provided through the design of the MTC and a minor modification to the iSBC 86/12A board. The operation of the MTC, the basic hardware-level operation of the system, and pertinent details about the iSBC 86/12A and the Multibus are described.

  12. Parallels between control PDE's (Partial Differential Equations) and systems of ODE's (Ordinary Differential Equations)

    NASA Technical Reports Server (NTRS)

    Hunt, L. R.; Villarreal, Ramiro

    1987-01-01

    System theorists understand that the same mathematical objects which determine controllability for nonlinear control systems of ordinary differential equations (ODEs) also determine hypoellipticity for linear partial differential equations (PDEs). Moreover, almost any study of ODE systems begins with linear systems. It is remarkable that Hormander's paper on hypoellipticity of second order linear p.d.e.'s starts with equations due to Kolmogorov, which are shown to be analogous to the linear PDEs. Eigenvalue placement by state feedback for a controllable linear system can be paralleled for a Kolmogorov equation if an appropriate type of feedback is introduced. Results concerning transformations of nonlinear systems to linear systems are similar to results for transforming a linear PDE to a Kolmogorov equation.

  13. Xyce Parallel Electronic Simulator : users' guide, version 2.0.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoekstra, Robert John; Waters, Lon J.; Rankin, Eric Lamont

    2004-06-01

    The Xyce Parallel Electronic Simulator is designed to support a variety of device model inputs. These input formats include standard analytical models, behavioral models, look-up tables, and mesh-level PDE device models. Combined with this flexible interface is an architectural design that greatly simplifies the addition of circuit models. One of the most important features of Xyce is that it provides a platform for computational research and development aimed specifically at the needs of the Laboratory. With Xyce, Sandia now has an 'in-house' capability with which both new electrical (e.g., device model development) and algorithmic (e.g., faster time-integration methods) research and development can be performed. Ultimately, these capabilities are migrated to end users.

  14. A scalable parallel black oil simulator on distributed memory parallel computers

    NASA Astrophysics Data System (ADS)

    Wang, Kun; Liu, Hui; Chen, Zhangxin

    2015-11-01

    This paper presents our work on developing a parallel black oil simulator for distributed memory computers based on our in-house parallel platform. The parallel simulator is designed to overcome the performance issues of common simulators that are implemented for personal computers and workstations. The finite difference method is applied to discretize the black oil model. In addition, some advanced techniques are employed to strengthen the robustness and parallel scalability of the simulator, including an inexact Newton method, matrix decoupling methods, and algebraic multigrid methods. A new multi-stage preconditioner is proposed to accelerate the solution of linear systems from the Newton methods. Numerical experiments show that our simulator is scalable and efficient, and is capable of simulating extremely large-scale black oil problems with tens of millions of grid blocks using thousands of MPI processes on parallel computers.

  15. Parallel simulation today

    NASA Technical Reports Server (NTRS)

    Nicol, David; Fujimoto, Richard

    1992-01-01

    This paper surveys topics that presently define the state of the art in parallel simulation. Included in the tutorial are discussions on new protocols, mathematical performance analysis, time parallelism, hardware support for parallel simulation, load balancing algorithms, and dynamic memory management for optimistic synchronization.

  16. Biomolecular surface construction by PDE transform.

    PubMed

    Zheng, Qiong; Yang, Siyang; Wei, Guo-Wei

    2012-03-01

    This work proposes a new framework for the surface generation based on the partial differential equation (PDE) transform. The PDE transform has recently been introduced as a general approach for the mode decomposition of images, signals, and data. It relies on the use of arbitrarily high-order PDEs to achieve the time-frequency localization, control the spectral distribution, and regulate the spatial resolution. The present work provides a new variational derivation of high-order PDE transforms. The fast Fourier transform is utilized to accomplish the PDE transform so as to avoid stringent stability constraints in solving high-order PDEs. As a consequence, the time integration of high-order PDEs can be done efficiently with the fast Fourier transform. The present approach is validated with a variety of test examples in two-dimensional and three-dimensional settings. We explore the impact of the PDE transform parameters, such as the PDE order and propagation time, on the quality of resulting surfaces. Additionally, we utilize a set of 10 proteins to compare the computational efficiency of the present surface generation method and a standard approach in Cartesian meshes. Moreover, we analyze the present method by examining some benchmark indicators of biomolecular surface, that is, surface area, surface-enclosed volume, solvation free energy, and surface electrostatic potential. A test set of 13 protein molecules is used in the present investigation. The electrostatic analysis is carried out via the Poisson-Boltzmann equation model. To further demonstrate the utility of the present PDE transform-based surface method, we solve the Poisson-Nernst-Planck equations with a PDE transform surface of a protein. Second-order convergence is observed for the electrostatic potential and concentrations. Finally, to test the capability and efficiency of the present PDE transform-based surface generation method, we apply it to the construction of an excessively large biomolecule, a
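
    Why a Fourier transform sidesteps the stability limits of high-order PDEs can be seen on a linear model problem. The sketch below evolves u_t = -(-d^2/dx^2)^m u on a periodic 1D grid by multiplying each Fourier mode by exp(-k^(2m) t), so a single evaluation reaches any time t with no time-step restriction. This model equation is an assumption made for illustration and is not the PDE transform of the paper; a naive O(N^2) discrete Fourier transform stands in for a fast Fourier transform to keep the example dependency-free, and the names `dft` and `evolve` are hypothetical.

```python
import cmath
import math


def dft(values, inverse=False):
    """Naive discrete Fourier transform (forward unnormalized, inverse scaled by 1/n)."""
    n = len(values)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        s = sum(values[j] * cmath.exp(sign * 2j * math.pi * k * j / n) for j in range(n))
        out.append(s / n if inverse else s)
    return out


def evolve(u0, t, order, length=2 * math.pi):
    """Evolve u_t = -(-d^2/dx^2)^order u exactly in Fourier space up to time t."""
    n = len(u0)
    u_hat = dft(u0)
    for k in range(n):
        freq = k if k <= n // 2 else k - n          # signed integer mode number
        wavenumber = 2 * math.pi * freq / length
        # Each mode decays independently; the exponent is never positive.
        u_hat[k] *= math.exp(-(wavenumber ** (2 * order)) * t)
    return [z.real for z in dft(u_hat, inverse=True)]
```

    An explicit finite-difference scheme for the same equation would need a time step proportional to dx^(2m), which becomes prohibitive as the order m grows; the spectral update above has no such constraint.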

  17. Biomolecular surface construction by PDE transform

    PubMed Central

    Zheng, Qiong; Yang, Siyang; Wei, Guo-Wei

    2011-01-01

    This work proposes a new framework for the surface generation based on the partial differential equation (PDE) transform. The PDE transform has recently been introduced as a general approach for the mode decomposition of images, signals, and data. It relies on the use of arbitrarily high order PDEs to achieve the time-frequency localization, control the spectral distribution, and regulate the spatial resolution. The present work provides a new variational derivation of high order PDE transforms. The fast Fourier transform is utilized to accomplish the PDE transform so as to avoid stringent stability constraints in solving high order PDEs. As a consequence, the time integration of high order PDEs can be done efficiently with the fast Fourier transform. The present approach is validated with a variety of test examples in two and three-dimensional settings. We explore the impact of the PDE transform parameters, such as the PDE order and propagation time, on the quality of resulting surfaces. Additionally, we utilize a set of 10 proteins to compare the computational efficiency of the present surface generation method and the MSMS approach in Cartesian meshes. Moreover, we analyze the present method by examining some benchmark indicators of biomolecular surface, i.e., surface area, surface enclosed volume, solvation free energy and surface electrostatic potential. A test set of 13 protein molecules is used in the present investigation. The electrostatic analysis is carried out via the Poisson-Boltzmann equation model. To further demonstrate the utility of the present PDE transform based surface method, we solve the Poisson-Nernst-Planck (PNP) equations with a PDE transform surface of a protein. Second order convergence is observed for the electrostatic potential and concentrations. Finally, to test the capability and efficiency of the present PDE transform based surface generation method, we apply it to the construction of an excessively large biomolecule, a virus

  18. Output Feedback-Based Boundary Control of Uncertain Coupled Semilinear Parabolic PDE Using Neurodynamic Programming.

    PubMed

    Talaei, Behzad; Jagannathan, Sarangapani; Singler, John

    2018-04-01

    In this paper, neurodynamic programming-based output feedback boundary control of distributed parameter systems governed by uncertain coupled semilinear parabolic partial differential equations (PDEs) under Neumann or Dirichlet boundary control conditions is introduced. First, Hamilton-Jacobi-Bellman (HJB) equation is formulated in the original PDE domain and the optimal control policy is derived using the value functional as the solution of the HJB equation. Subsequently, a novel observer is developed to estimate the system states given the uncertain nonlinearity in PDE dynamics and measured outputs. Consequently, the suboptimal boundary control policy is obtained by forward-in-time estimation of the value functional using a neural network (NN)-based online approximator and estimated state vector obtained from the NN observer. Novel adaptive tuning laws in continuous time are proposed for learning the value functional online to satisfy the HJB equation along system trajectories while ensuring the closed-loop stability. Local uniformly ultimate boundedness of the closed-loop system is verified by using Lyapunov theory. The performance of the proposed controller is verified via simulation on an unstable coupled diffusion reaction process.

  19. Discovery of a novel orally active PDE-4 inhibitor effective in an ovalbumin-induced asthma murine model.

    PubMed

    Kwak, Hyun Jeong; Nam, Ji Yeon; Song, Jin Sook; No, Zaesung; Yang, Sung Don; Cheon, Hyae Gyeong

    2012-06-15

    Phosphodiesterase-4 (PDE-4) is responsible for metabolizing adenosine 3',5'-cyclic monophosphate that reduces the activation of a wide range of inflammatory cells including eosinophils. PDE-4 inhibitors are under development for the treatment of respiratory diseases such as asthma and chronic obstructive pulmonary disease. Herein, we report a novel PDE-4 inhibitor, PDE-423 (3-[1-(3-cyclopropylmethoxy-4-difluoromethoxybenzyl)-1H-pyrazol-3-yl]-benzoic acid), which shows good in vitro and in vivo oral activities. PDE-423 exhibited in vitro IC50 values of 140 nM and 550 nM in enzyme assay and cell-based assay, respectively. In vivo study using ovalbumin-induced asthmatic mice revealed that PDE-423 reduced methacholine-stimulated airway hyperreactivity in a dose-dependent manner by once daily oral administration (ED50 = 18.3 mg/kg), in parallel with decreased eosinophil peroxidase activity and improved lung histology. In addition, PDE-423 was effective in diminishing lipopolysaccharide-induced neutrophilia in vivo as well as in vitro. Oral administration of PDE-423 (100 mg/kg) had no effect on the duration of xylazine/ketamine-induced anesthesia and did not induce vomiting incidence in ferrets up to the dose of 1000 mg/kg. The present study indicates that a novel PDE-4 inhibitor, PDE-423, has good pharmacological profiles implicating this as a potential candidate for the development of a new anti-asthmatic drug.

  20. A fast ultrasonic simulation tool based on massively parallel implementations

    NASA Astrophysics Data System (ADS)

    Lambert, Jason; Rougeron, Gilles; Lacassagne, Lionel; Chatillon, Sylvain

    2014-02-01

    This paper presents a CIVA optimized ultrasonic inspection simulation tool, which takes advantage of the power of massively parallel architectures: graphical processing units (GPU) and multi-core general purpose processors (GPP). This tool is based on the classical approach used in CIVA: the interaction model is based on Kirchhoff, and the ultrasonic field around the defect is computed by the pencil method. The model has been adapted and parallelized for both architectures. At this stage, the configurations addressed by the tool are: multi- and mono-element probes, planar specimens made of simple isotropic materials, planar rectangular defects or side drilled holes of small diameter. Validations of the model accuracy and performance measurements are presented.

  1. Parallelized direct execution simulation of message-passing parallel programs

    NASA Technical Reports Server (NTRS)

    Dickens, Phillip M.; Heidelberger, Philip; Nicol, David M.

    1994-01-01

    As massively parallel computers proliferate, there is growing interest in finding ways by which performance of massively parallel codes can be efficiently predicted. This problem arises in diverse contexts such as parallelizing compilers, parallel performance monitoring, and parallel algorithm development. In this paper we describe one solution where one directly executes the application code, but uses a discrete-event simulator to model details of the presumed parallel machine such as operating system and communication network behavior. Because this approach is computationally expensive, we are interested in its own parallelization, specifically the parallelization of the discrete-event simulator. We describe methods suitable for parallelized direct execution simulation of message-passing parallel programs, and report on the performance of such a system, Large Application Parallel Simulation Environment (LAPSE), we have built on the Intel Paragon. On all codes measured to date, LAPSE predicts performance well, typically within 10 percent relative error. Depending on the nature of the application code, we have observed low slowdowns (relative to natively executing code) and high relative speedups using up to 64 processors.

  2. In silico design of novel hERG-neutral sildenafil-like PDE5 inhibitors.

    PubMed

    Kayık, Gülru; Tüzün, Nurcan Ş; Durdagi, Serdar

    2017-10-01

    Cyclic nucleotide phosphodiesterase enzymes (PDEs) have functions in regulating the levels of intracellular second messengers, 3',5'-cyclic adenosine monophosphate (cAMP) and 3',5'-cyclic guanosine monophosphate (cGMP), via hydrolysis and decomposing mechanisms in cells. They take essential roles in modulating various cellular activities such as memory and smooth muscle functions. PDE type 5 (PDE5) inhibitors enhance the vasodilatory effects of cGMP in the corpus cavernosum and they are used to treat erectile dysfunction. Patch clamp experiments showed that the IC50 values of the human ether-à-go-go-related gene (hERG1) potassium (K) ion channel blocking affinity of the PDE5 inhibitors sildenafil, vardenafil, and tadalafil are 33, 12, and 100 μM, respectively. The hERG1 channel is responsible for the regulation of the action potential of the human ventricular myocyte by contributing the rapid component of the delayed rectifier K+ current (IKr) of the cardiac action potential. In this work, interaction patterns and binding affinity predictions of selected PDE5 inhibitors against the hERG1 channel are studied. It is attempted to develop PDE5 inhibitor analogs with lower binding affinity to the hERG1 ion channel while keeping their pharmacological activity against their principal target PDE5 using in silico methods. Based on detailed analyses of docking poses and predicted interaction energies, novel analogs of PDE5 inhibitors with lower predicted binding affinity to hERG1 channels without losing their principal target activity were proposed. Moreover, molecular dynamics (MD) simulations and post-processing MD analyses (i.e. Molecular Mechanics/Generalized Born Surface Area calculations) were performed. Detailed analysis of molecular simulations helped us to better understand the PDE5 inhibitor-target binding interactions at the atomic level. Results of this study can be useful for designing novel and safe PDE5 inhibitors with enhanced activity and other tailored

  3. Fragment-Based Discovery of Pyrimido[1,2-b]indazole PDE10A Inhibitors.

    PubMed

    Chino, Ayaka; Seo, Ryushi; Amano, Yasushi; Namatame, Ichiji; Hamaguchi, Wataru; Honbou, Kazuya; Mihara, Takuma; Yamazaki, Mayako; Tomishima, Masaki; Masuda, Naoyuki

    2018-01-01

    In this study, we report the identification of potent pyrimidoindazoles as phosphodiesterase 10A (PDE10A) inhibitors by fragment-based drug discovery (FBDD). X-ray co-crystal structure analysis identified the pyrazolopyridine derivative 2 as a fragment hit that occupies part of the binding site of the PDE10A enzyme. On the basis of the crystal structure of compound 2 with the PDE10A protein, a number of compounds were synthesized and evaluated in structure-activity relationship (SAR) studies, which culminated in the discovery of a novel pyrimidoindazole derivative 13 having good physicochemical properties.

  4. Global magnetosphere simulations using constrained-transport Hall-MHD with CWENO reconstruction

    NASA Astrophysics Data System (ADS)

    Lin, L.; Germaschewski, K.; Maynard, K. M.; Abbott, S.; Bhattacharjee, A.; Raeder, J.

    2013-12-01

    We present a new CWENO (Centrally-Weighted Essentially Non-Oscillatory) reconstruction based MHD solver for the OpenGGCM global magnetosphere code. The solver was built using libMRC, a library for creating efficient parallel PDE solvers on structured grids. The use of libMRC gives us access to its core functionality of providing an automated code generation framework which takes a user provided PDE right hand side in symbolic form to generate an efficient, computer architecture specific, parallel code. libMRC also supports block-structured adaptive mesh refinement and implicit-time stepping through integration with the PETSc library. We validate the new CWENO Hall-MHD solver against existing solvers both in standard test problems as well as in global magnetosphere simulations.

  5. Geometry of PDE's. IV

    NASA Astrophysics Data System (ADS)

    Prástaro, Agostino

    2008-02-01

    Following our previous results on this subject [R.P. Agarwal, A. Prástaro, Geometry of PDE's. III(I): Webs on PDE's and integral bordism groups. The general theory, Adv. Math. Sci. Appl. 17 (2007) 239-266; R.P. Agarwal, A. Prástaro, Geometry of PDE's. III(II): Webs on PDE's and integral bordism groups. Applications to Riemannian geometry PDE's, Adv. Math. Sci. Appl. 17 (2007) 267-285; A. Prástaro, Geometry of PDE's and Mechanics, World Scientific, Singapore, 1996; A. Prástaro, Quantum and integral (co)bordism in partial differential equations, Acta Appl. Math. (5) (3) (1998) 243-302; A. Prástaro, (Co)bordism groups in PDE's, Acta Appl. Math. 59 (2) (1999) 111-201; A. Prástaro, Quantized Partial Differential Equations, World Scientific Publishing Co, Singapore, 2004, 500 pp.; A. Prástaro, Geometry of PDE's. I: Integral bordism groups in PDE's, J. Math. Anal. Appl. 319 (2006) 547-566; A. Prástaro, Geometry of PDE's. II: Variational PDE's and integral bordism groups, J. Math. Anal. Appl. 321 (2006) 930-948; A. Prástaro, Th.M. Rassias, Ulam stability in geometry of PDE's, Nonlinear Funct. Anal. Appl. 8 (2) (2003) 259-278; I. Stakgold, Boundary Value Problems of Mathematical Physics, I, The MacMillan Company, New York, 1967; I. Stakgold, Boundary Value Problems of Mathematical Physics, II, Collier-MacMillan, Canada, Ltd, Toronto, Ontario, 1968], integral bordism groups of the Navier-Stokes equation are calculated for smooth, singular and weak solutions, respectively. Then a characterization of global solutions is made on this ground. Sufficient conditions to ensure existence of global smooth solutions are given and related to nullity of integral characteristic numbers of the boundaries. Stability of global solutions is related to some characteristic numbers of the space-like Cauchy data. Global solutions of variational problems constrained by (NS) are classified by means of suitable integral bordism groups too.

  6. Parallel Allostery by cAMP and PDE Coordinates Activation and Termination Phases in cAMP Signaling.

    PubMed

    Krishnamurthy, Srinath; Tulsian, Nikhil Kumar; Chandramohan, Arun; Anand, Ganesh S

    2015-09-15

    The second messenger molecule cAMP regulates the activation phase of the cAMP signaling pathway through high-affinity interactions with the cytosolic cAMP receptor, the protein kinase A regulatory subunit (PKAR). Phosphodiesterases (PDEs) are enzymes responsible for catalyzing hydrolysis of cAMP to 5' AMP. It was recently shown that PDEs interact with PKAR to initiate the termination phase of the cAMP signaling pathway. While the steps in the activation phase are well understood, steps in the termination pathway are unknown. Specifically, the binding and allosteric networks that regulate the dynamic interplay between PKAR, PDE, and cAMP are unclear. In this study, PKAR and PDE from Dictyostelium discoideum (RD and RegA, respectively) were used as a model system to monitor complex formation in the presence and absence of cAMP. Amide hydrogen/deuterium exchange mass spectrometry was used to monitor slow conformational transitions in RD, using disordered regions as conformational probes. Our results reveal that RD regulates its interactions with cAMP and RegA at distinct loci by undergoing slow conformational transitions between two metastable states. In the presence of cAMP, RD and RegA form a stable ternary complex, while in the absence of cAMP they maintain transient interactions. RegA and cAMP each bind at orthogonal sites on RD with resultant contrasting effects on its dynamics through parallel allosteric relays at multiple important loci. RD thus serves as an integrative node in cAMP termination by coordinating multiple allosteric relays and governing the output signal response. Copyright © 2015 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  7. Boron-based phosphodiesterase inhibitors show novel binding of boron to PDE4 bimetal center.

    PubMed

    Freund, Yvonne R; Akama, Tsutomu; Alley, M R K; Antunes, Joana; Dong, Chen; Jarnagin, Kurt; Kimura, Richard; Nieman, James A; Maples, Kirk R; Plattner, Jacob J; Rock, Fernando; Sharma, Rashmi; Singh, Rajeshwar; Sanders, Virginia; Zhou, Yasheen

    2012-09-21

    We have used boron-based molecules to create novel, competitive, reversible inhibitors of phosphodiesterase 4 (PDE4). The co-crystal structure reveals a binding configuration which is unique compared to classical catechol PDE4 inhibitors, with boron binding to the activated water in the bimetal center. These phenoxybenzoxaboroles can be optimized to generate submicromolar potency enzyme inhibitors, which inhibit TNF-α, IL-2, IFN-γ, IL-5 and IL-10 activities in vitro and show safety and efficacy for topical treatment of human psoriasis. They provide a valuable new route for creating novel potent anti-PDE4 inhibitors. Copyright © 2012 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  8. An FPGA-Based Massively Parallel Neuromorphic Cortex Simulator

    PubMed Central

    Wang, Runchun M.; Thakur, Chetan S.; van Schaik, André

    2018-01-01

    This paper presents a massively parallel and scalable neuromorphic cortex simulator designed for simulating large and structurally connected spiking neural networks, such as complex models of various areas of the cortex. The main novelty of this work is the abstraction of a neuromorphic architecture into clusters represented by minicolumns and hypercolumns, analogously to the fundamental structural units observed in neurobiology. Without this approach, simulating large-scale fully connected networks needs prohibitively large memory to store look-up tables for point-to-point connections. Instead, we use a novel architecture, based on the structural connectivity in the neocortex, such that all the required parameters and connections can be stored in on-chip memory. The cortex simulator can be easily reconfigured for simulating different neural networks without any change in hardware structure by programming the memory. A hierarchical communication scheme allows one neuron to have a fan-out of up to 200 k neurons. As a proof-of-concept, an implementation on one Altera Stratix V FPGA was able to simulate 20 million to 2.6 billion leaky-integrate-and-fire (LIF) neurons in real time. We verified the system by emulating a simplified auditory cortex (with 100 million neurons). This cortex simulator achieved a low power dissipation of 1.62 μW per neuron. With the advent of commercially available FPGA boards, our system offers an accessible and scalable tool for the design, real-time simulation, and analysis of large-scale spiking neural networks. PMID:29692702

  9. An FPGA-Based Massively Parallel Neuromorphic Cortex Simulator.

    PubMed

    Wang, Runchun M; Thakur, Chetan S; van Schaik, André

    2018-01-01

    This paper presents a massively parallel and scalable neuromorphic cortex simulator designed for simulating large and structurally connected spiking neural networks, such as complex models of various areas of the cortex. The main novelty of this work is the abstraction of a neuromorphic architecture into clusters represented by minicolumns and hypercolumns, analogously to the fundamental structural units observed in neurobiology. Without this approach, simulating large-scale fully connected networks needs prohibitively large memory to store look-up tables for point-to-point connections. Instead, we use a novel architecture, based on the structural connectivity in the neocortex, such that all the required parameters and connections can be stored in on-chip memory. The cortex simulator can be easily reconfigured for simulating different neural networks without any change in hardware structure by programming the memory. A hierarchical communication scheme allows one neuron to have a fan-out of up to 200 k neurons. As a proof-of-concept, an implementation on one Altera Stratix V FPGA was able to simulate 20 million to 2.6 billion leaky-integrate-and-fire (LIF) neurons in real time. We verified the system by emulating a simplified auditory cortex (with 100 million neurons). This cortex simulator achieved a low power dissipation of 1.62 μW per neuron. With the advent of commercially available FPGA boards, our system offers an accessible and scalable tool for the design, real-time simulation, and analysis of large-scale spiking neural networks.

  10. Parallelization and automatic data distribution for nuclear reactor simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liebrock, L.M.

    1997-07-01

    Detailed attempts at realistic nuclear reactor simulations currently take many times real time to execute on high performance workstations. Even the fastest sequential machine cannot run these simulations fast enough to ensure that the best corrective measure is used during a nuclear accident to prevent a minor malfunction from becoming a major catastrophe. Since sequential computers have nearly reached the speed of light barrier, these simulations will have to be run in parallel to make significant improvements in speed. In physical reactor plants, parallelism abounds. Fluids flow, controls change, and reactions occur in parallel with only adjacent components directly affecting each other. These do not occur in the sequentialized manner, with global instantaneous effects, that is often used in simulators. Development of parallel algorithms that more closely approximate the real-world operation of a reactor may, in addition to speeding up the simulations, actually improve the accuracy and reliability of the predictions generated. Three types of parallel architecture (shared memory machines, distributed memory multicomputers, and distributed networks) are briefly reviewed as targets for parallelization of nuclear reactor simulation. Various parallelization models (loop-based model, shared memory model, functional model, data parallel model, and a combined functional and data parallel model) are discussed along with their advantages and disadvantages for nuclear reactor simulation. A variety of tools are introduced for each of the models. Emphasis is placed on the data parallel model as the primary focus for two-phase flow simulation. Tools to support data parallel programming for multiple component applications and special parallelization considerations are also discussed.

  11. A parallel algorithm for switch-level timing simulation on a hypercube multiprocessor

    NASA Technical Reports Server (NTRS)

    Rao, Hariprasad Nannapaneni

    1989-01-01

    The parallel approach to speeding up simulation is studied, specifically the simulation of digital LSI MOS circuitry on the Intel iPSC/2 hypercube. The simulation algorithm is based on RSIM, an event driven switch-level simulator that incorporates a linear transistor model for simulating digital MOS circuits. Parallel processing techniques based on the concepts of Virtual Time and rollback are utilized so that portions of the circuit may be simulated on separate processors, in parallel for as large an increase in speed as possible. A partitioning algorithm is also developed in order to subdivide the circuit for parallel processing.

  12. Massively parallel multicanonical simulations

    NASA Astrophysics Data System (ADS)

    Gross, Jonathan; Zierenberg, Johannes; Weigel, Martin; Janke, Wolfhard

    2018-03-01

    Generalized-ensemble Monte Carlo simulations such as the multicanonical method and similar techniques are among the most efficient approaches for simulations of systems undergoing discontinuous phase transitions or with rugged free-energy landscapes. As Markov chain methods, they are inherently serial computationally. It was demonstrated recently, however, that a combination of independent simulations that communicate weight updates at variable intervals allows for the efficient utilization of parallel computational resources for multicanonical simulations. Implementing this approach for the many-thread architecture provided by current generations of graphics processing units (GPUs), we show how it can be efficiently employed with on the order of 10^4 parallel walkers and beyond, thus constituting a versatile tool for Monte Carlo simulations in the era of massively parallel computing. We provide the fully documented source code for the approach applied to the paradigmatic example of the two-dimensional Ising model as a starting point and reference for practitioners in the field.
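
    The scheme described in this abstract (independent walkers that share one set of multicanonical weights and periodically merge their histograms) can be illustrated with a minimal sketch. This is not the authors' published code: the function names, the tiny 4x4 lattice, the fixed update interval, and the simple recursion W(E) <- W(E)/H(E) are assumptions chosen for brevity, and the walkers run one after another here where a GPU would run them concurrently.

```python
import numpy as np

L = 4                                        # lattice side (tiny, illustration only)
N = L * L
E_VALUES = np.arange(-2 * N, 2 * N + 1, 4)   # energy bins of the periodic 2D Ising model


def energy(spins):
    """Nearest-neighbour Ising energy (J = 1) with periodic boundaries."""
    return -int(np.sum(spins * (np.roll(spins, 1, 0) + np.roll(spins, 1, 1))))


def e_index(E):
    """Index of energy E in the histogram and weight arrays."""
    return (E + 2 * N) // 4


def sweep(spins, E, log_w, hist, rng, n_steps):
    """Single-spin-flip updates for one walker with fixed weights log_w."""
    for _ in range(n_steps):
        i, j = rng.integers(0, L, size=2)
        nb = (int(spins[(i + 1) % L, j]) + int(spins[(i - 1) % L, j])
              + int(spins[i, (j + 1) % L]) + int(spins[i, (j - 1) % L]))
        dE = 2 * int(spins[i, j]) * nb
        # Multicanonical acceptance: min(1, W(E_new) / W(E_old)).
        diff = log_w[e_index(E + dE)] - log_w[e_index(E)]
        if rng.random() < np.exp(min(0.0, diff)):
            spins[i, j] *= -1
            E += dE
        hist[e_index(E)] += 1
    return E


def run_muca(n_walkers=8, n_iterations=4, steps_per_walker=1000, seed=0):
    """Iterate the weights; returns (log weights, last merged histogram)."""
    # Assumption: one shared random stream; a real parallel code would give
    # every walker its own independent stream.
    rng = np.random.default_rng(seed)
    log_w = np.zeros(len(E_VALUES))
    values = np.array([-1, 1], dtype=np.int8)
    walkers = [rng.choice(values, size=(L, L)) for _ in range(n_walkers)]
    energies = [energy(s) for s in walkers]
    total = np.zeros(len(E_VALUES), dtype=np.int64)
    for _ in range(n_iterations):
        hists = np.zeros((n_walkers, len(E_VALUES)), dtype=np.int64)
        for w in range(n_walkers):           # independent work, parallelisable
            energies[w] = sweep(walkers[w], energies[w], log_w,
                                hists[w], rng, steps_per_walker)
        total = hists.sum(axis=0)            # merge step: the only communication
        visited = total > 0
        log_w[visited] -= np.log(total[visited])   # W(E) <- W(E) / H(E)
        log_w -= log_w.max()                 # normalise for numerical stability
    return log_w, total
```

    Between merge steps the walkers never exchange information, which is why the approach maps well onto many-thread hardware; only the histogram sum and the weight update require communication.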

  13. Array-based Hierarchical Mesh Generation in Parallel

    DOE PAGES

    Ray, Navamita; Grindeanu, Iulian; Zhao, Xinglin; ...

    2015-11-03

    In this paper, we describe an array-based hierarchical mesh generation capability through uniform refinement of unstructured meshes for efficient solution of PDE's using finite element methods and multigrid solvers. A multi-degree, multi-dimensional and multi-level framework is designed to generate the nested hierarchies from an initial mesh that can be used for a number of purposes, ranging from multi-level methods to the generation of large meshes. The capability is developed under the parallel mesh framework “Mesh Oriented dAtaBase”, a.k.a. MOAB. We describe the underlying data structures and algorithms to generate such hierarchies and present numerical results for computational efficiency and mesh quality. In conclusion, we also present results to demonstrate the applicability of the developed capability to a multigrid finite-element solver.

  14. Parallel programming with Easy Java Simulations

    NASA Astrophysics Data System (ADS)

    Esquembre, F.; Christian, W.; Belloni, M.

    2018-01-01

    Nearly all of today's processors are multicore, and ideally programming and algorithm development utilizing the entire processor should be introduced early in the computational physics curriculum. Parallel programming is often not introduced because it requires a new programming environment and uses constructs that are unfamiliar to many teachers. We describe how we decrease the barrier to parallel programming by using a Java-based programming environment to treat problems in the usual undergraduate curriculum. We use the Easy Java Simulations programming and authoring tool to create the program's graphical user interface together with objects based on those developed by Kaminsky [Building Parallel Programs (Course Technology, Boston, 2010)] to handle common parallel programming tasks. Shared-memory parallel implementations of physics problems, such as time evolution of the Schrödinger equation, are available as source code and as ready-to-run programs from the AAPT-ComPADRE digital library.

  15. Synchronization Of Parallel Discrete Event Simulations

    NASA Technical Reports Server (NTRS)

    Steinman, Jeffrey S.

    1992-01-01

    Adaptive, parallel, discrete-event-simulation-synchronization algorithm, Breathing Time Buckets, developed in Synchronous Parallel Environment for Emulation and Discrete Event Simulation (SPEEDES) operating system. Algorithm allows parallel simulations to process events optimistically in fluctuating time cycles that naturally adapt while simulation in progress. Combines best of optimistic and conservative synchronization strategies while avoiding major disadvantages. Algorithm processes events optimistically in time cycles adapting while simulation in progress. Well suited for modeling communication networks, for large-scale war games, for simulated flights of aircraft, for simulations of computer equipment, for mathematical modeling, for interactive engineering simulations, and for depictions of flows of information.

  16. Simulation Exploration through Immersive Parallel Planes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brunhart-Lupo, Nicholas J; Bush, Brian W; Gruchalla, Kenny M

    We present a visualization-driven simulation system that tightly couples systems dynamics simulations with an immersive virtual environment to allow analysts to rapidly develop and test hypotheses in a high-dimensional parameter space. To accomplish this, we generalize the two-dimensional parallel-coordinates statistical graphic as an immersive 'parallel-planes' visualization for multivariate time series emitted by simulations running in parallel with the visualization. In contrast to traditional parallel coordinates, which map the multivariate dimensions onto coordinate axes represented by a series of parallel lines, we map pairs of the multivariate dimensions onto a series of parallel rectangles. As in the case of parallel coordinates, each individual observation in the dataset is mapped to a polyline whose vertices coincide with its coordinate values. Regions of the rectangles can be 'brushed' to highlight and select observations of interest: a 'slider' control allows the user to filter the observations by their time coordinate. In an immersive virtual environment, users interact with the parallel planes using a joystick that can select regions on the planes, manipulate selection, and filter time. The brushing and selection actions are used both to explore existing data and to launch additional simulations corresponding to the visually selected portions of the input parameter space. As soon as the new simulations complete, their resulting observations are displayed in the virtual environment. This tight feedback loop between simulation and immersive analytics accelerates users' realization of insights about the simulation and its output.
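
    The parallel-planes mapping described above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' implementation: the function names, the min/max normalization to the unit square, and the use of the plane index as the depth coordinate are all hypothetical choices.

```python
from typing import List, Sequence, Tuple

Vertex = Tuple[float, float, float]   # (x on plane, y on plane, plane index)


def normalize(value: float, lo: float, hi: float) -> float:
    """Scale value into [0, 1]; a degenerate range maps to the midpoint."""
    if hi == lo:
        return 0.5
    return (value - lo) / (hi - lo)


def to_polyline(obs: Sequence[float],
                bounds: Sequence[Tuple[float, float]]) -> List[Vertex]:
    """Map one observation with 2k variables to k vertices, one per plane.

    Variables (0, 1) go to plane 0, (2, 3) to plane 1, and so on.
    """
    if len(obs) % 2 != 0 or len(obs) != len(bounds):
        raise ValueError("need an even number of variables and matching bounds")
    vertices = []
    for plane in range(len(obs) // 2):
        i = 2 * plane
        x = normalize(obs[i], *bounds[i])
        y = normalize(obs[i + 1], *bounds[i + 1])
        vertices.append((x, y, float(plane)))
    return vertices


def brush(observations: Sequence[Sequence[float]],
          bounds: Sequence[Tuple[float, float]],
          plane: int,
          x_range: Tuple[float, float],
          y_range: Tuple[float, float]) -> List[int]:
    """Indices of observations whose vertex on `plane` lies in the rectangle."""
    selected = []
    for idx, obs in enumerate(observations):
        x, y, _ = to_polyline(obs, bounds)[plane]
        if x_range[0] <= x <= x_range[1] and y_range[0] <= y <= y_range[1]:
            selected.append(idx)
    return selected
```

    In this sketch a brushed rectangle on one plane returns the indices of the matching observations, which is the information a system of this kind would need either to highlight their polylines or to choose parameters for further runs.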

  17. A hybrid algorithm for parallel molecular dynamics simulations

    NASA Astrophysics Data System (ADS)

    Mangiardi, Chris M.; Meyer, R.

    2017-10-01

    This article describes algorithms for the hybrid parallelization and SIMD vectorization of molecular dynamics simulations with short-range forces. The parallelization method combines domain decomposition with a thread-based parallelization approach. The goal of the work is to enable efficient simulations of very large (tens of millions of atoms) and inhomogeneous systems on many-core processors with hundreds or thousands of cores and SIMD units with large vector sizes. In order to test the efficiency of the method, simulations of a variety of configurations with up to 74 million atoms have been performed. Results are shown that were obtained on multi-core systems with Sandy Bridge and Haswell processors as well as systems with Xeon Phi many-core processors.

  18. Parallel Signal Processing and System Simulation using aCe

    NASA Technical Reports Server (NTRS)

    Dorband, John E.; Aburdene, Maurice F.

    2003-01-01

    Recently, networked and cluster computation have become very popular for both signal processing and system simulation. A new language is ideally suited for parallel signal processing applications and system simulation since it allows the programmer to explicitly express the computations that can be performed concurrently. In addition, the new C-based parallel language (aCe C) for architecture-adaptive programming allows programmers to implement algorithms and system simulation applications on parallel architectures by providing them with the assurance that future parallel architectures will be able to run their applications with a minimum of modification. In this paper, we will focus on some fundamental features of aCe C and present a signal processing application (FFT).

  19. Parallel Agent-Based Simulations on Clusters of GPUs and Multi-Core Processors

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aaby, Brandon G; Perumalla, Kalyan S; Seal, Sudip K

    2010-01-01

    An effective latency-hiding mechanism is presented in the parallelization of agent-based model simulations (ABMS) with millions of agents. The mechanism is designed to accommodate the hierarchical organization as well as heterogeneity of current state-of-the-art parallel computing platforms. We use it to explore the computation vs. communication trade-off continuum available with the deep computational and memory hierarchies of extant platforms and present a novel analytical model of the trade-off. We describe our implementation and report preliminary performance results on two distinct parallel platforms suitable for ABMS: CUDA threads on multiple, networked graphical processing units (GPUs), and pthreads on multi-core processors. Message Passing Interface (MPI) is used for inter-GPU as well as inter-socket communication on a cluster of multiple GPUs and multi-core processors. Results indicate the benefits of our latency-hiding scheme, delivering more than a 100-fold improvement in runtime for certain benchmark ABMS application scenarios with several million agents. This speed improvement is obtained on our system that is already two to three orders of magnitude faster on one GPU than an equivalent CPU-based execution in a popular simulator in Java. Thus, the overall execution of our current work is over four orders of magnitude faster when executed on multiple GPUs.

  20. n-body simulations using message passing parallel computers.

    NASA Astrophysics Data System (ADS)

    Grama, A. Y.; Kumar, V.; Sameh, A.

    The authors present new parallel formulations of the Barnes-Hut method for n-body simulations on message passing computers. These parallel formulations partition the domain efficiently incurring minimal communication overhead. This is in contrast to existing schemes that are based on sorting a large number of keys or on the use of global data structures. The new formulations are augmented by alternate communication strategies which serve to minimize communication overhead. The impact of these communication strategies is experimentally studied. The authors report on experimental results obtained from an astrophysical simulation on an nCUBE2 parallel computer.
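
    For orientation, the serial Barnes-Hut kernel that such parallel formulations distribute can be sketched as follows. This is a generic two-dimensional illustration and not the authors' parallel scheme: the class and function names, the opening parameter theta, the softening length eps, and the choice G = 1 are assumptions, and the sketch expects positive masses at distinct positions inside the root square.

```python
import math


class Node:
    """Square cell of a quadtree; stores total mass and mass-weighted position."""

    def __init__(self, cx, cy, half):
        self.cx, self.cy, self.half = cx, cy, half   # centre and half-width
        self.mass = 0.0
        self.mx = 0.0          # sum of m * x over the bodies in this cell
        self.my = 0.0          # sum of m * y over the bodies in this cell
        self.body = None       # (x, y, m) while the cell holds exactly one body
        self.children = None

    def _child_for(self, x, y):
        return self.children[(1 if x >= self.cx else 0) + (2 if y >= self.cy else 0)]

    def insert(self, x, y, m):
        if self.mass == 0.0 and self.children is None:
            self.body = (x, y, m)                    # empty leaf: store the body
        else:
            if self.children is None:                # occupied leaf: subdivide
                h = self.half / 2
                self.children = [Node(self.cx + dx * h, self.cy + dy * h, h)
                                 for dy in (-1, 1) for dx in (-1, 1)]
                bx, by, bm = self.body
                self.body = None
                self._child_for(bx, by).insert(bx, by, bm)
            self._child_for(x, y).insert(x, y, m)
        self.mass += m
        self.mx += m * x
        self.my += m * y


def build_tree(bodies, cx=0.5, cy=0.5, half=0.5):
    """Insert (x, y, m) tuples into a quadtree covering the given square."""
    root = Node(cx, cy, half)
    for x, y, m in bodies:
        root.insert(x, y, m)
    return root


def accel(node, x, y, theta=0.5, eps=1e-3):
    """Acceleration (G = 1) at (x, y) from all bodies under `node`."""
    if node.mass == 0.0:
        return 0.0, 0.0
    dx = node.mx / node.mass - x
    dy = node.my / node.mass - y
    dist = math.sqrt(dx * dx + dy * dy)
    # Treat the cell as one point mass if it is a leaf or looks small enough.
    if node.children is None or (2 * node.half) / (dist + eps) < theta:
        if dist == 0.0:
            return 0.0, 0.0                          # skip self-interaction
        inv = node.mass / (dist * dist + eps * eps) ** 1.5
        return dx * inv, dy * inv
    ax = ay = 0.0
    for child in node.children:
        cax, cay = accel(child, x, y, theta, eps)
        ax += cax
        ay += cay
    return ax, ay
```

    With theta = 0 every cell is opened down to its leaves, so the result matches a direct pairwise sum; larger theta trades accuracy for fewer interactions. Parallel formulations such as those in the abstract then have to decide how tree cells and bodies are partitioned across processors.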

  1. Parallel-distributed mobile robot simulator

    NASA Astrophysics Data System (ADS)

    Okada, Hiroyuki; Sekiguchi, Minoru; Watanabe, Nobuo

    1996-06-01

    The aim of this project is to achieve an autonomous learning and growth function based on active interaction with the real world. It should also be able to autonomously acquire knowledge about the context in which jobs take place, and how the jobs are executed. This article describes a parallel distributed movable robot system simulator with an autonomous learning and growth function. The autonomous learning and growth function which we are proposing is characterized by its ability to learn and grow through interaction with the real world. When the movable robot interacts with the real world, the system compares the virtual environment simulation with the interaction result in the real world. The system then improves the virtual environment to match the real-world result more closely. Thus the system learns and grows. It is very important that such a simulation is time-realistic. The parallel distributed movable robot simulator was developed to simulate the space of a movable robot system with an autonomous learning and growth function. The simulator constructs a virtual space faithful to the real world and also integrates the interfaces between the user, the actual movable robot and the virtual movable robot. Using an ultrafast CG (computer graphics) system (FUJITSU AG series), time-realistic 3D CG is displayed.

  2. PDE4 and PDE5 regulate cyclic nucleotide contents and relaxing effects on carbachol-induced contraction in the bovine abomasum.

    PubMed

    Kaneda, Takeharu; Kido, Yuuki; Tajima, Tsuyoshi; Urakawa, Norimoto; Shimizu, Kazumasa

    2015-01-01

    The effects of various selective phosphodiesterase (PDE) inhibitors on carbachol (CCh)-induced contraction in the bovine abomasum were investigated. Various selective PDE inhibitors, vinpocetine (type 1), erythro-9-(2-hydroxy-3-nonyl) adenine (EHNA, type 2), milrinone (type 3), Ro20-1724 (type 4), vardenafil (type 5), BRL-50481 (type 7) and BAY73-6691 (type 9), inhibited CCh-induced contractions in a concentration-dependent manner. Among the PDE inhibitors, Ro20-1724 and vardenafil induced more relaxation than the other inhibitors based on the data for the IC50 or maximum relaxation. In smooth muscle of the bovine abomasum, we showed the expression of PDE4B, 4C, 4D and 5 by RT-PCR analysis. In the presence of CCh, Ro20-1724 increased the cAMP content, but not the cGMP content. By contrast, vardenafil increased the cGMP content, but not the cAMP content. These results suggest that Ro20-1724-induced relaxation was correlated with cAMP and that vardenafil-induced relaxation was correlated with cGMP in the bovine abomasum. In conclusion, PDE4 and PDE5 are the enzymes involved in regulation of the relaxation associated with cAMP and cGMP, respectively, in the bovine abomasum.

  3. Direct interaction of the inhibitory gamma-subunit of Rod cGMP phosphodiesterase (PDE6) with the PDE6 GAFa domains.

    PubMed

    Muradov, Khakim G; Granovsky, Alexey E; Schey, Kevin L; Artemyev, Nikolai O

    2002-03-26

    Retinal rod and cone cGMP phosphodiesterases (PDE6 family) function as the effector enzyme in the vertebrate visual transduction cascade. The activity of PDE6 catalytic subunits is controlled by the Pgamma-subunits. In addition to the inhibition of cGMP hydrolysis at the catalytic sites, Pgamma is known to stimulate a noncatalytic binding of cGMP to the regulatory GAFa-GAFb domains of PDE6. The latter role of Pgamma has been attributed to its polycationic region. To elucidate the structural basis for the regulation of cGMP binding to the GAF domains of PDE6, a photoexcitable peptide probe corresponding to the polycationic region of Pgamma, Pgamma-21-45, was specifically cross-linked to rod PDE6alphabeta. The site of Pgamma-21-45 cross-linking was localized to Met138Gly139 within the PDE6alpha GAFa domain using mass spectrometric analysis. Chimeras between PDE5 and cone PDE6alpha', containing GAFa and/or GAFb domains of PDE6alpha' have been generated to probe a potential role of the GAFb domains in binding to Pgamma. Analysis of the inhibition of the PDE5/PDE6alpha' chimeras by Pgamma supported the role of PDE6 GAFa but not GAFb domains in the interaction with Pgamma. Our results suggest that a direct binding of the polycationic region of Pgamma to the GAFa domains of PDE6 may lead to a stabilization of the noncatalytic cGMP-binding sites.

  4. Parallel Simulation of Unsteady Turbulent Flames

    NASA Technical Reports Server (NTRS)

    Menon, Suresh

    1996-01-01

    Time-accurate simulation of turbulent flames in high Reynolds number flows is a challenging task since both fluid dynamics and combustion must be modeled accurately. To numerically simulate this phenomenon, very large computer resources (both time and memory) are required. Although current vector supercomputers are capable of providing adequate resources for simulations of this nature, their high cost and limited availability make practical use of such machines less than satisfactory. At the same time, the explicit time integration algorithms used in unsteady flow simulations often possess a very high degree of parallelism, making them very amenable to efficient implementation on large-scale parallel computers. Under these circumstances, distributed memory parallel computers offer an excellent near-term solution for greatly increased computational speed and memory, at a cost that may render the unsteady simulations of the type discussed above more feasible and affordable. This paper discusses the study of unsteady turbulent flames using a simulation algorithm that is capable of retaining high parallel efficiency on distributed memory parallel architectures. Numerical studies are carried out using large-eddy simulation (LES). In LES, the scales larger than the grid are computed using a time- and space-accurate scheme, while the unresolved small scales are modeled using eddy viscosity based subgrid models. This is acceptable for the momentum/energy closure since the small scales primarily provide a dissipative mechanism for the energy transferred from the large scales. However, for combustion to occur, the species must first undergo mixing at the small scales and then come into molecular contact. Therefore, global models cannot be used. Recently, a new model for turbulent combustion was developed, in which the combustion is modeled within the subgrid (small scales) using a methodology that simulates the mixing and the molecular transport and the chemical kinetics
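
    The abstract does not say which eddy-viscosity subgrid model was used. Purely as an illustration of what such a closure looks like, the classical Smagorinsky model (an assumption here, not necessarily the author's choice) relates the deviatoric subgrid stress to the resolved strain rate, with filter width \Delta and model constant C_s:

```latex
\tau_{ij} - \tfrac{1}{3}\,\tau_{kk}\,\delta_{ij} = -2\,\nu_t\,\bar{S}_{ij},
\qquad
\nu_t = (C_s \Delta)^2\,|\bar{S}|,
\qquad
|\bar{S}| = \sqrt{2\,\bar{S}_{ij}\bar{S}_{ij}},
\qquad
\bar{S}_{ij} = \tfrac{1}{2}\left(\frac{\partial \bar{u}_i}{\partial x_j}
             + \frac{\partial \bar{u}_j}{\partial x_i}\right)
```

    A closure of this form only drains energy from the resolved scales, which is consistent with the abstract's point that it is adequate for momentum and energy but cannot by itself represent small-scale mixing and reaction.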

  5. Structure-Based Design, Synthesis, Biological Evaluation, and Molecular Docking of Novel PDE10 Inhibitors with Antioxidant Activities

    NASA Astrophysics Data System (ADS)

    Li, Jinxuan; Chen, Jing-Yi; Deng, Ya-Lin; Zhou, Qian; Wu, Yinuo; Wu, Deyan; Luo, Hai-Bin

    2018-05-01

    Phosphodiesterase 10 is a promising target for the treatment of a series of central nervous system (CNS) diseases. Imbalance between oxidative stress and antioxidant defense systems, a universal condition in neurodegenerative disorders, is widely studied as a potential therapeutic target for CNS diseases such as Alzheimer’s disease (AD), Parkinson’s disease (PD) and amyotrophic lateral sclerosis (ALS). To discover multifunctional pharmaceuticals as a treatment for neurodegenerative diseases, a series of quinazoline-based derivatives with PDE10 inhibitory activities and antioxidant activities were designed and synthesized. Nine out of thirteen designed compounds showed good PDE10 inhibition at the concentration of 1.0 μM. Among these compounds, eight exhibited moderate to excellent antioxidant activity with ORAC (oxygen radical absorbance capacity) value above 1.0. Molecular docking was performed for better understanding of the binding patterns of these compounds with PDE10. Compound 11e, which showed remarkable inhibitory activity against PDE10 and antioxidant activity, may serve as a lead for further modification.

  6. Simulation Exploration through Immersive Parallel Planes: Preprint

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brunhart-Lupo, Nicholas; Bush, Brian W.; Gruchalla, Kenny

    We present a visualization-driven simulation system that tightly couples systems dynamics simulations with an immersive virtual environment to allow analysts to rapidly develop and test hypotheses in a high-dimensional parameter space. To accomplish this, we generalize the two-dimensional parallel-coordinates statistical graphic as an immersive 'parallel-planes' visualization for multivariate time series emitted by simulations running in parallel with the visualization. In contrast to traditional parallel coordinates, which map the multivariate dimensions onto coordinate axes represented by a series of parallel lines, we map pairs of the multivariate dimensions onto a series of parallel rectangles. As in the case of parallel coordinates, each individual observation in the dataset is mapped to a polyline whose vertices coincide with its coordinate values. Regions of the rectangles can be 'brushed' to highlight and select observations of interest; a 'slider' control allows the user to filter the observations by their time coordinate. In an immersive virtual environment, users interact with the parallel planes using a joystick that can select regions on the planes, manipulate selection, and filter time. The brushing and selection actions are used both to explore existing data and to launch additional simulations corresponding to the visually selected portions of the input parameter space. As soon as the new simulations complete, their resulting observations are displayed in the virtual environment. This tight feedback loop between simulation and immersive analytics accelerates users' realization of insights about the simulation and its output.

  7. Implementation of Parallel Dynamic Simulation on Shared-Memory vs. Distributed-Memory Environments

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jin, Shuangshuang; Chen, Yousu; Wu, Di

    2015-12-09

    Power system dynamic simulation computes the system response to a sequence of large disturbances, such as sudden changes in generation or load, or a network short circuit followed by protective branch switching operations. It consists of a large set of differential and algebraic equations, which is computationally intensive and challenging to solve using a single-processor based dynamic simulation solution. High-performance computing (HPC) based parallel computing is a very promising technology to speed up the computation and facilitate the simulation process. This paper presents two different parallel implementations of power grid dynamic simulation using Open Multi-processing (OpenMP) on a shared-memory platform, and Message Passing Interface (MPI) on distributed-memory clusters, respectively. The differences between the parallel simulation algorithms and architectures of the two HPC technologies are illustrated, and their performance for running parallel dynamic simulation is compared and demonstrated.

  8. Real-time electron dynamics for massively parallel excited-state simulations

    NASA Astrophysics Data System (ADS)

    Andrade, Xavier

    The simulation of the real-time dynamics of electrons, based on time dependent density functional theory (TDDFT), is a powerful approach to study electronic excited states in molecular and crystalline systems. What makes the method attractive is its flexibility to simulate different kinds of phenomena beyond the linear-response regime, including strongly-perturbed electronic systems and non-adiabatic electron-ion dynamics. Electron-dynamics simulations are also attractive from a computational point of view. They can run efficiently on massively parallel architectures due to the low communication requirements. Our implementations of electron dynamics, based on the codes Octopus (real-space) and Qball (plane-waves), allow us to simulate systems composed of thousands of atoms and to obtain good parallel scaling up to 1.6 million processor cores. Due to the versatility of real-time electron dynamics and its parallel performance, we expect it to become the method of choice to apply the capabilities of exascale supercomputers for the simulation of electronic excited states.

  9. Parallel discrete-event simulation of FCFS stochastic queueing networks

    NASA Technical Reports Server (NTRS)

    Nicol, David M.

    1988-01-01

    Physical systems are inherently parallel. Intuition suggests that simulations of these systems may be amenable to parallel execution. The parallel execution of a discrete-event simulation requires careful synchronization of processes in order to ensure the execution's correctness; this synchronization can degrade performance. Largely negative results were recently reported in a study which used a well-known synchronization method on queueing network simulations. Discussed here is a synchronization method (appointments), which has proven itself to be effective on simulations of FCFS queueing networks. The key concept behind appointments is the provision of lookahead. Lookahead is a prediction of a processor's future behavior, based on an analysis of the processor's simulation state. We show how lookahead can be computed for FCFS queueing network simulations, give performance data that demonstrate the method's effectiveness under moderate to heavy loads, and discuss performance tradeoffs between the quality of lookahead and the cost of computing lookahead.
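
    The lookahead idea in this abstract can be illustrated with a minimal sketch. It assumes service times are sampled before the job that will use them arrives, so an FCFS server can promise a lower bound on its next departure. The class `FcfsServer`, its method names, and the exponential service-time model are hypothetical choices for illustration, not the appointment protocol of the report.

```python
import random


class FcfsServer:
    """Single FCFS server whose next service time is drawn ahead of time.

    Hypothetical illustration: pre-sampling the service time is what makes
    a lookahead bound available before the next job has even arrived.
    """

    def __init__(self, mean_service, seed=0):
        self._rng = random.Random(seed)
        self._mean = mean_service
        self._busy_until = 0.0            # departure time of the last accepted job
        self._next_service = self._draw() # service time reserved for the next arrival

    def _draw(self):
        return self._rng.expovariate(1.0 / self._mean)

    def lookahead_bound(self, now):
        """Lower bound on the departure time of any job arriving at or after `now`."""
        return max(now, self._busy_until) + self._next_service

    def arrive(self, t):
        """Accept a job arriving at time t and return its departure time."""
        start = max(t, self._busy_until)  # FCFS: wait for earlier jobs to finish
        self._busy_until = start + self._next_service
        self._next_service = self._draw()
        return self._busy_until


server = FcfsServer(mean_service=2.0, seed=42)
promised = server.lookahead_bound(now=1.0)  # what a neighbour could be told at time 1.0
actual = server.arrive(3.0)                 # the job actually shows up later
print(f"promised no departure before {promised:.3f}, actual departure {actual:.3f}")
```

    A downstream process that receives `promised` can safely simulate up to that time without waiting for further messages, which is the kind of tradeoff between lookahead quality and its computation cost that the abstract mentions.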

  10. Xyce parallel electronic simulator : users' guide.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mei, Ting; Rankin, Eric Lamont; Thornquist, Heidi K.

    2011-05-01

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: (1) Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). Note that this includes support for most popular parallel and serial computers; (2) Improved performance for all numerical kernels (e.g., time integrator, nonlinear and linear solvers) through state-of-the-art algorithms and novel techniques; (3) Device models which are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only); and (4) Object-oriented code design and implementation using modern coding practices that ensure that the Xyce Parallel Electronic Simulator will be maintainable and extensible far into the future. Xyce is a parallel code in the most general sense of the phrase - a message passing parallel implementation - which allows it to run efficiently on the widest possible number of computing platforms. These include serial, shared-memory and distributed-memory parallel as well as heterogeneous platforms. Careful attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. The development of Xyce provides a platform for computational research and development aimed specifically at the needs of the Laboratory. With Xyce, Sandia has an 'in-house' capability with which both new electrical (e.g., device model development) and algorithmic (e.g., faster time-integration methods, parallel solver algorithms) research and development can be performed. As a result, Xyce is a

  11. Domain decomposition in time for PDE-constrained optimization

    DOE PAGES

    Barker, Andrew T.; Stoll, Martin

    2015-08-28

    Here, PDE-constrained optimization problems have a wide range of applications, but they lead to very large and ill-conditioned linear systems, especially if the problems are time dependent. In this paper we outline an approach for dealing with such problems by decomposing them in time and applying an additive Schwarz preconditioner in time, so that we can take advantage of parallel computers to deal with the very large linear systems. We then illustrate the performance of our method on a variety of problems.
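
    As a rough illustration of decomposing in time, the sketch below applies a non-overlapping additive Schwarz preconditioner (equivalent to block Jacobi) to a toy all-at-once system: implicit Euler for u' = -lam*u with u(0) = 1. This is an assumed stand-in problem, not the KKT system of a PDE-constrained optimization problem, and all function names are hypothetical.

```python
def residual(u, b, a):
    """r = b - A u for the lower-bidiagonal system a*u[k] - u[k-1] = b[k]."""
    return [b[k] - (a * u[k] - (u[k - 1] if k > 0 else 0.0)) for k in range(len(u))]


def local_solve(r, start, stop, a):
    """Solve A restricted to time steps start..stop-1 by forward substitution."""
    z = [0.0] * (stop - start)
    for j in range(stop - start):
        prev = z[j - 1] if j > 0 else 0.0
        z[j] = (r[start + j] + prev) / a
    return z


def additive_schwarz(r, blocks, a):
    """Sum of independent local solves; each block could run on its own processor."""
    z = [0.0] * len(r)
    for start, stop in blocks:
        z[start:stop] = local_solve(r, start, stop, a)
    return z


n_steps, lam, dt, n_blocks = 16, 2.0, 0.1, 4
a = 1.0 + lam * dt                      # implicit Euler diagonal entry
b = [1.0] + [0.0] * (n_steps - 1)       # initial condition u(0) = 1 enters step 1
size = n_steps // n_blocks
blocks = [(i * size, (i + 1) * size) for i in range(n_blocks)]

# Preconditioned Richardson iteration: u <- u + M^{-1} (b - A u).
u = [0.0] * n_steps
for _ in range(n_blocks):
    z = additive_schwarz(residual(u, b, a), blocks, a)
    u = [uk + zk for uk, zk in zip(u, z)]

exact = [a ** -(k + 1) for k in range(n_steps)]
print(max(abs(uk - ek) for uk, ek in zip(u, exact)))
```

    For this purely forward-in-time toy problem, information crosses one time block per iteration, so the iteration needs as many sweeps as there are blocks. Coupled forward/adjoint optimality systems behave differently, which is where a Krylov method with such a preconditioner becomes relevant.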

  12. Terascale Optimal PDE Simulations (TOPS) Center

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Professor Olof B. Widlund

    2007-07-09

    Our work has focused on the development and analysis of domain decomposition algorithms for a variety of problems arising in continuum mechanics modeling. In particular, we have extended and analyzed FETI-DP and BDDC algorithms; these iterative solvers were first introduced and studied by Charbel Farhat and his collaborators, see [11, 45, 12], and by Clark Dohrmann of SANDIA, Albuquerque, see [43, 2, 1], respectively. These two closely related families of methods are of particular interest since they are used more extensively than other iterative substructuring methods to solve very large and difficult problems. Thus, the FETI algorithms are part of the SALINAS system developed by the SANDIA National Laboratories for very large scale computations, and as already noted, BDDC was first developed by a SANDIA scientist, Dr. Clark Dohrmann. The FETI algorithms are also making inroads in commercial engineering software systems. We also note that the analysis of these algorithms poses very real mathematical challenges. The success in developing this theory has, in several instances, led to significant improvements in the performance of these algorithms. A very desirable feature of these iterative substructuring and other domain decomposition algorithms is that they respect the memory hierarchy of modern parallel and distributed computing systems, which is essential for approaching peak floating point performance. The development of improved methods, together with more powerful computer systems, is making it possible to carry out simulations in three dimensions, with quite high resolution, relatively easily. This work is supported by high quality software systems, such as Argonne's PETSc library, which facilitates code development as well as the access to a variety of parallel and distributed computer systems. The success in finding scalable and robust domain decomposition algorithms for very large numbers of processors and very large finite element problems is, e

  13. Parallelizing Timed Petri Net simulations

    NASA Technical Reports Server (NTRS)

    Nicol, David M.

    1993-01-01

    The possibility of using parallel processing to accelerate the simulation of Timed Petri Nets (TPN's) was studied. It was recognized that complex system development tools often transform system descriptions into TPN's or TPN-like models, which are then simulated to obtain information about system behavior. Viewed this way, it was important that the parallelization of TPN's be as automatic as possible, to admit the possibility of the parallelization being embedded in the system design tool. Later years of the grant were devoted to examining the problem of joint performance and reliability analysis, to explore whether both types of analysis could be accomplished within a single framework. In this final report, the results of our studies are summarized. We believe that the problem of parallelizing TPN's automatically for MIMD architectures has been almost completely solved for a large and important class of problems. Our initial investigations into joint performance/reliability analysis are two-fold: it was shown that Monte Carlo simulation, with importance sampling, offers promise of joint analysis in the context of a single tool, and methods for the parallel simulation of general Continuous Time Markov Chains, a model framework within which joint performance/reliability models can be cast, were developed. However, much more work is needed to determine the scope and generality of these approaches. The results obtained in our two studies, future directions for this type of work, and a list of publications are included.

  14. Data parallel sorting for particle simulation

    NASA Technical Reports Server (NTRS)

    Dagum, Leonardo

    1992-01-01

    Sorting on a parallel architecture is a communications intensive event which can incur a high penalty in applications where it is required. In the case of particle simulation, only integer sorting is necessary, and sequential implementations easily attain the minimum performance bound of O(N) for N particles. Parallel implementations, however, have to cope with the parallel sorting problem which, in addition to incurring a heavy communications cost, can make the minimum performance bound difficult to attain. This paper demonstrates how the sorting problem in a particle simulation can be reduced to a merging problem, and describes an efficient data parallel algorithm to solve this merging problem in a particle simulation. The new algorithm is shown to be optimal under conditions usual for particle simulation, and its fieldwise implementation on the Connection Machine is analyzed in detail. The new algorithm is about four times faster than a fieldwise implementation of radix sort on the Connection Machine.
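
    The O(N) sequential bound mentioned above can be made concrete with a counting sort on integer cell indices. The sketch below is a generic sequential illustration under the assumption that each particle carries a cell index in the range 0..n_cells-1; it is not the data parallel merging algorithm of the paper, and the function name is hypothetical.

```python
def sort_particles_by_cell(cell_ids, n_cells):
    """Return particle indices ordered by cell, in O(N + n_cells) time.

    The sort is stable: particles in the same cell keep their relative order.
    """
    # Histogram: counts[c + 1] holds the number of particles in cell c.
    counts = [0] * (n_cells + 1)
    for c in cell_ids:
        counts[c + 1] += 1

    # Exclusive prefix sum: counts[c] becomes the first output slot of cell c.
    for c in range(n_cells):
        counts[c + 1] += counts[c]

    # Scatter each particle index into the next free slot of its cell.
    order = [0] * len(cell_ids)
    next_slot = counts[:]
    for particle, c in enumerate(cell_ids):
        order[next_slot[c]] = particle
        next_slot[c] += 1
    return order


cell_ids = [2, 0, 1, 2, 0]
order = sort_particles_by_cell(cell_ids, n_cells=3)
print(order)                          # particle indices grouped by cell
print([cell_ids[p] for p in order])   # cell ids are now non-decreasing
```

    No comparisons between particles are made, which is why the cost grows linearly with the particle count rather than as N log N.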

  15. Random number generators for large-scale parallel Monte Carlo simulations on FPGA

    NASA Astrophysics Data System (ADS)

    Lin, Y.; Wang, F.; Liu, B.

    2018-05-01

    Through parallelization, field programmable gate arrays (FPGAs) can achieve unprecedented speeds in large-scale parallel Monte Carlo (LPMC) simulations. FPGAs present both new constraints and new opportunities for the implementation of random number generators (RNGs), which are key elements of any Monte Carlo (MC) simulation system. Using empirical and application based tests, this study evaluates all four RNGs used in previous FPGA based MC studies and newly proposed FPGA implementations for two well-known high-quality RNGs that are suitable for LPMC studies on FPGA. One of the newly proposed FPGA implementations, a parallel version of the additive lagged Fibonacci generator (Parallel ALFG), is found to be the best among the evaluated RNGs in fulfilling the needs of LPMC simulations on FPGA.
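
    For readers unfamiliar with the generator named above, an additive lagged Fibonacci generator follows the recurrence x_n = (x_{n-j} + x_{n-k}) mod 2^m. The sketch below is a plain sequential software version; the lags (5, 17), the 32-bit word size, and seeding from Python's `random` module are assumptions made for illustration, and nothing here reflects the parallel FPGA design evaluated in the study.

```python
import random
from collections import deque


class ALFG:
    """Sequential additive lagged Fibonacci generator (illustrative sketch)."""

    def __init__(self, seed, short_lag=5, long_lag=17, bits=32):
        self.short_lag = short_lag
        self.mask = (1 << bits) - 1
        filler = random.Random(seed)  # assumed seeding scheme, for convenience only
        words = [filler.getrandbits(bits) for _ in range(long_lag)]
        words[0] |= 1                 # at least one odd word is needed for a long period
        # The deque always holds the last `long_lag` outputs, oldest first.
        self.state = deque(words, maxlen=long_lag)

    def next(self):
        # state[0] is x_{n-long_lag}; state[-short_lag] is x_{n-short_lag}.
        value = (self.state[0] + self.state[-self.short_lag]) & self.mask
        self.state.append(value)      # maxlen drops the oldest word automatically
        return value


gen = ALFG(seed=2018)
history = list(gen.state)
history += [gen.next() for _ in range(100)]
print(history[17:22])
```

    Each new word needs only one addition and two reads from a small circular buffer, which is one reason this family of generators is often considered for hardware implementations.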

  16. A Component-Based Extension Framework for Large-Scale Parallel Simulations in NEURON

    PubMed Central

    King, James G.; Hines, Michael; Hill, Sean; Goodman, Philip H.; Markram, Henry; Schürmann, Felix

    2008-01-01

    As neuronal simulations approach larger scales with increasing levels of detail, the neurosimulator software represents only a part of a chain of tools ranging from setup, simulation, interaction with virtual environments to analysis and visualizations. Previously published approaches to abstracting simulator engines have not received wide-spread acceptance, which in part may be due to the fact that they tried to address the challenge of solving the model specification problem. Here, we present an approach that uses a neurosimulator, in this case NEURON, to describe and instantiate the network model in the simulator's native model language but then replaces the main integration loop with its own. Existing parallel network models are easily adapted to run in the presented framework. The presented approach is thus an extension to NEURON but uses a component-based architecture to allow for replaceable spike exchange components and pluggable components for monitoring, analysis, or control that can run in this framework alongside the simulation. PMID:19430597

  17. A hybrid parallel framework for the cellular Potts model simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jiang, Yi; He, Kejing; Dong, Shoubin

    2009-01-01

    The Cellular Potts Model (CPM) has been widely used for biological simulations. However, most current implementations are either sequential or approximated, which can't be used for large scale complex 3D simulation. In this paper we present a hybrid parallel framework for CPM simulations. The time-consuming PDE solving, cell division, and cell reaction operation are distributed to clusters using the Message Passing Interface (MPI). The Monte Carlo lattice update is parallelized on shared-memory SMP system using OpenMP. Because the Monte Carlo lattice update is much faster than the PDE solving and SMP systems are more and more common, this hybrid approach achieves good performance and high accuracy at the same time. Based on the parallel Cellular Potts Model, we studied the avascular tumor growth using a multiscale model. The application and performance analysis show that the hybrid parallel framework is quite efficient. The hybrid parallel CPM can be used for the large scale simulation (~10^8 sites) of complex collective behavior of numerous cells (~10^6).

  18. OpenMP Parallelization and Optimization of Graph-Based Machine Learning Algorithms

    DOE PAGES

    Meng, Zhaoyi; Koniges, Alice; He, Yun Helen; ...

    2016-09-21

    In this paper, we investigate the OpenMP parallelization and optimization of two novel data classification algorithms. The new algorithms are based on graph and PDE solution techniques and provide significant accuracy and performance advantages over traditional data classification algorithms in serial mode. The methods leverage the Nystrom extension to calculate eigenvalue/eigenvectors of the graph Laplacian and this is a self-contained module that can be used in conjunction with other graph-Laplacian based methods such as spectral clustering. We use performance tools to collect the hotspots and memory access of the serial codes and use OpenMP as the parallelization language to parallelize the most time-consuming parts. Where possible, we also use library routines. We then optimize the OpenMP implementations and detail the performance on traditional supercomputer nodes (in our case a Cray XC30), and test the optimization steps on emerging testbed systems based on Intel’s Knights Corner and Knights Landing processors. We show both performance improvement and strong scaling behavior. Finally, a large number of optimization techniques and analyses are necessary before the algorithm reaches almost ideal scaling.

  19. Array-based, parallel hierarchical mesh refinement algorithms for unstructured meshes

    DOE PAGES

    Ray, Navamita; Grindeanu, Iulian; Zhao, Xinglin; ...

    2016-08-18

    In this paper, we describe an array-based hierarchical mesh refinement capability through uniform refinement of unstructured meshes for efficient solution of PDE's using finite element methods and multigrid solvers. A multi-degree, multi-dimensional and multi-level framework is designed to generate the nested hierarchies from an initial coarse mesh that can be used for a variety of purposes such as in multigrid solvers/preconditioners, to do solution convergence and verification studies and to improve overall parallel efficiency by decreasing I/O bandwidth requirements (by loading smaller meshes and in memory refinement). We also describe a high-order boundary reconstruction capability that can be used to project the new points after refinement using high-order approximations instead of linear projection in order to minimize and provide more control on geometrical errors introduced by curved boundaries. The capability is developed under the parallel unstructured mesh framework "Mesh Oriented dAtaBase" (MOAB; Tautges et al. (2004)). We describe the underlying data structures and algorithms to generate such hierarchies in parallel and present numerical results for computational efficiency and effect on mesh quality. Furthermore, we also present results to demonstrate the applicability of the developed capability to study convergence properties of different point projection schemes for various mesh hierarchies and to a multigrid finite-element solver for elliptic problems.

  20. Massively parallel simulator of optical coherence tomography of inhomogeneous turbid media.

    PubMed

    Malektaji, Siavash; Lima, Ivan T; Escobar I, Mauricio R; Sherif, Sherif S

    2017-10-01

    An accurate and practical simulator for Optical Coherence Tomography (OCT) could be an important tool to study the underlying physical phenomena in OCT such as multiple light scattering. Recently, many researchers have investigated simulation of OCT of turbid media, e.g., tissue, using Monte Carlo methods. The main drawback of these earlier simulators is the long computational time required to produce accurate results. We developed a massively parallel simulator of OCT of inhomogeneous turbid media that obtains both Class I diffusive reflectivity, due to ballistic and quasi-ballistic scattered photons, and Class II diffusive reflectivity due to multiply scattered photons. This Monte Carlo-based simulator is implemented on graphics processing units (GPUs), using the Compute Unified Device Architecture (CUDA) platform and programming model, to exploit the parallel nature of propagation of photons in tissue. It models an arbitrary shaped sample medium as a tetrahedron-based mesh and uses an advanced importance sampling scheme. This new simulator speeds up simulations of OCT of inhomogeneous turbid media by about two orders of magnitude. To demonstrate this result, we have compared the computation times of our new parallel simulator and its serial counterpart using two samples of inhomogeneous turbid media. We have shown that our parallel implementation reduced simulation time of OCT of the first sample medium from 407 min to 92 min by using a single GPU card, to 12 min by using 8 GPU cards and to 7 min by using 16 GPU cards. For the second sample medium, the OCT simulation time was reduced from 209 h to 35.6 h by using a single GPU card, to 4.65 h by using 8 GPU cards, and to only 2 h by using 16 GPU cards. Therefore, our new parallel simulator is considerably more practical to use than its central processing unit (CPU)-based counterpart. Our new parallel OCT simulator could be a practical tool to study the different physical phenomena underlying OCT
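
    The timings quoted in this abstract can be turned into speedup and multi-GPU scaling figures with a few lines of arithmetic. The sketch below uses only the numbers stated above; the dictionary layout and function names are hypothetical, and the 2 h figure is taken at face value although it is probably rounded.

```python
# Wall-clock times from the abstract: CPU baseline, then 1, 8 and 16 GPU cards.
TIMES = {
    "sample 1 (min)": {"cpu": 407.0, 1: 92.0, 8: 12.0, 16: 7.0},
    "sample 2 (h)": {"cpu": 209.0, 1: 35.6, 8: 4.65, 16: 2.0},
}


def speedups(times):
    """Speedup over the serial CPU run for each GPU count."""
    cpu = times["cpu"]
    return {n: cpu / t for n, t in times.items() if n != "cpu"}


def gpu_scaling_efficiency(times):
    """Efficiency relative to one GPU: T(1) / (n * T(n))."""
    single = times[1]
    return {n: single / (n * t) for n, t in times.items() if n != "cpu"}


for label, times in TIMES.items():
    s = speedups(times)
    e = gpu_scaling_efficiency(times)
    for n in (1, 8, 16):
        print(f"{label}: {n:2d} GPU(s) speedup {s[n]:6.1f}x, efficiency {e[n]:.2f}")
```

    With 16 cards the speedup is about 58x for the first sample and about 104x for the second, so the "two orders of magnitude" statement corresponds to the larger case.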

  1. A Comparison of PETSC Library and HPF Implementations of an Archetypal PDE Computation

    NASA Technical Reports Server (NTRS)

    Hayder, M. Ehtesham; Keyes, David E.; Mehrotra, Piyush

    1997-01-01

    Two paradigms for distributed-memory parallel computation that free the application programmer from the details of message passing are compared for an archetypal structured scientific computation (a nonlinear, structured-grid partial differential equation boundary value problem) using the same algorithm on the same hardware. Both paradigms, parallel libraries represented by Argonne's PETSC, and parallel languages represented by the Portland Group's HPF, are found to be easy to use for this problem class, and both are reasonably effective in exploiting concurrency after a short learning curve. The level of involvement required by the application programmer under either paradigm includes specification of the data partitioning (corresponding to a geometrically simple decomposition of the domain of the PDE). Programming in SPMD style for the PETSC library requires writing the routines that discretize the PDE and its Jacobian, managing subdomain-to-processor mappings (affine global-to-local index mappings), and interfacing to library solver routines. Programming for HPF requires a complete sequential implementation of the same algorithm, introducing concurrency through subdomain blocking (an effort similar to the index mapping), and modest experimentation with rewriting loops to elucidate to the compiler the latent concurrency. Correctness and scalability are cross-validated on up to 32 nodes of an IBM SP2.
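
    The affine global-to-local index mapping mentioned above can be sketched for a one-dimensional block decomposition. This is a generic illustration with hypothetical function names; it does not use or reproduce the PETSC or HPF interfaces.

```python
def block_range(n_points, n_ranks, rank):
    """Half-open range [start, stop) of global indices owned by `rank`."""
    return rank * n_points // n_ranks, (rank + 1) * n_points // n_ranks


def owner(i, n_points, n_ranks):
    """Rank that owns global index i under the block_range partition."""
    return (n_ranks * (i + 1) - 1) // n_points


def global_to_local(i, n_points, n_ranks):
    """Affine map: local index = global index minus the owner's start offset."""
    rank = owner(i, n_points, n_ranks)
    start, _ = block_range(n_points, n_ranks, rank)
    return rank, i - start


def local_to_global(rank, j, n_points, n_ranks):
    """Inverse affine map back to the global numbering."""
    start, _ = block_range(n_points, n_ranks, rank)
    return start + j


n_points, n_ranks = 10, 3
for r in range(n_ranks):
    print(r, block_range(n_points, n_ranks, r))
print(global_to_local(7, n_points, n_ranks))
```

    On a structured grid the same offset arithmetic is applied per coordinate direction, which is why the mapping stays affine for geometrically simple decompositions.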

  2. Computer Science Techniques Applied to Parallel Atomistic Simulation

    NASA Astrophysics Data System (ADS)

    Nakano, Aiichiro

    1998-03-01

    Recent developments in parallel processing technology and multiresolution numerical algorithms have established large-scale molecular dynamics (MD) simulations as a new research mode for studying materials phenomena such as fracture. However, this requires large system sizes and long simulated times. We have developed: i) space-time multiresolution schemes; ii) fuzzy-clustering approach to hierarchical dynamics; iii) wavelet-based adaptive curvilinear-coordinate load balancing; iv) multilevel preconditioned conjugate gradient method; and v) spacefilling-curve-based data compression for parallel I/O. Using these techniques, million-atom parallel MD simulations are performed for the oxidation dynamics of nanocrystalline Al. The simulations take into account the effect of dynamic charge transfer between Al and O using the electronegativity equalization scheme. The resulting long-range Coulomb interaction is calculated efficiently with the fast multipole method. Results for temperature and charge distributions, residual stresses, bond lengths and bond angles, and diffusivities of Al and O will be presented. The oxidation of nanocrystalline Al is elucidated through immersive visualization in virtual environments. A unique dual-degree education program at Louisiana State University will also be discussed in which students can obtain a Ph.D. in Physics & Astronomy and an M.S. from the Department of Computer Science in five years. This program fosters interdisciplinary research activities for interfacing High Performance Computing and Communications with large-scale atomistic simulations of advanced materials. This work was supported by NSF (CAREER Program), ARO, PRF, and Louisiana LEQSF.

  3. Massively parallel quantum computer simulator

    NASA Astrophysics Data System (ADS)

    De Raedt, K.; Michielsen, K.; De Raedt, H.; Trieu, B.; Arnold, G.; Richter, M.; Lippert, Th.; Watanabe, H.; Ito, N.

    2007-01-01

    We describe portable software to simulate universal quantum computers on massively parallel computers. We illustrate the use of the simulation software by running various quantum algorithms on different computer architectures, such as an IBM BlueGene/L, an IBM Regatta p690+, a Hitachi SR11000/J1, a Cray X1E, an SGI Altix 3700 and clusters of PCs running Windows XP. We study the performance of the software by simulating quantum computers containing up to 36 qubits, using up to 4096 processors and up to 1 TB of memory. Our results demonstrate that the simulator exhibits nearly ideal scaling as a function of the number of processors and suggest that the simulation software described in this paper may also serve as a benchmark for testing high-end parallel computers.
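
    The link between 36 qubits and 1 TB of memory follows from the size of the state vector. The sketch below assumes a full state-vector simulation with one double-precision complex amplitude (16 bytes) per basis state, which is an assumption not stated in the abstract; the function names and the tiny gate routine are hypothetical illustrations rather than the authors' software.

```python
import math


def state_vector_bytes(n_qubits, bytes_per_amplitude=16):
    """Memory for 2**n complex amplitudes (16 bytes each is an assumption)."""
    return (1 << n_qubits) * bytes_per_amplitude


def apply_single_qubit_gate(state, gate, target):
    """Apply a 2x2 gate to qubit `target` (qubit 0 is the least significant bit)."""
    stride = 1 << target
    new_state = list(state)
    for base in range(0, len(state), 2 * stride):
        for offset in range(stride):
            i0 = base + offset        # basis state with target bit = 0
            i1 = i0 + stride          # same basis state with target bit = 1
            a0, a1 = state[i0], state[i1]
            new_state[i0] = gate[0][0] * a0 + gate[0][1] * a1
            new_state[i1] = gate[1][0] * a0 + gate[1][1] * a1
    return new_state


print(state_vector_bytes(36) / 2**40, "TiB for 36 qubits")

h = 1.0 / math.sqrt(2.0)
HADAMARD = [[h, h], [h, -h]]
state = [1.0 + 0j, 0j, 0j, 0j]        # two qubits in |00>
for qubit in (0, 1):
    state = apply_single_qubit_gate(state, HADAMARD, qubit)
print(state)                           # uniform superposition over four basis states
```

    Under that assumption, each additional qubit doubles the memory, so the amplitude pairs touched by a gate end up spread across processors as the qubit count grows.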

  4. pWeb: A High-Performance, Parallel-Computing Framework for Web-Browser-Based Medical Simulation.

    PubMed

    Halic, Tansel; Ahn, Woojin; De, Suvranu

    2014-01-01

    This work presents pWeb, a new language and compiler for parallelization of client-side compute-intensive web applications such as surgical simulations. The recently introduced HTML5 standard has enabled creating unprecedented applications on the web. Low web-browser performance, however, remains the bottleneck for computationally intensive applications, including visualization of complex scenes, real-time physical simulations and image processing, compared to native ones. The new proposed language is built upon web workers for multithreaded programming in HTML5. The language provides fundamental functionalities of parallel programming languages as well as the fork/join parallel model, which is not supported by web workers. The language compiler automatically generates an equivalent parallel script that complies with the HTML5 standard. A case study on realistic rendering for surgical simulations demonstrates enhanced performance with a compact set of instructions.

  5. Tutorial: Parallel Computing of Simulation Models for Risk Analysis.

    PubMed

    Reilly, Allison C; Staid, Andrea; Gao, Michael; Guikema, Seth D

    2016-10-01

    Simulation models are widely used in risk analysis to study the effects of uncertainties on outcomes of interest in complex problems. Often, these models are computationally complex and time consuming to run. This latter point may be at odds with time-sensitive evaluations or may limit the number of parameters that are considered. In this article, we give an introductory tutorial focused on parallelizing simulation code to better leverage modern computing hardware, enabling risk analysts to better utilize simulation-based methods for quantifying uncertainty in practice. This article is aimed primarily at risk analysts who use simulation methods but do not yet utilize parallelization to decrease the computational burden of these models. The discussion is focused on conceptual aspects of embarrassingly parallel computer code and software considerations. Two complementary examples are shown using the languages MATLAB and R. A brief discussion of hardware considerations is located in the Appendix. © 2016 Society for Risk Analysis.

  6. PDE4 as a target for cognition enhancement

    PubMed Central

    Richter, Wito; Menniti, Frank S.; Zhang, Han-Ting; Conti, Marco

    2014-01-01

    Introduction: The second messengers cAMP and cGMP mediate fundamental aspects of brain function relevant to memory, learning and cognitive functions. Consequently, cyclic nucleotide phosphodiesterases (PDEs), the enzymes that inactivate the cyclic nucleotides, are promising targets for the development of cognition-enhancing drugs. Areas covered: PDE4 is the largest of the eleven mammalian PDE families. This review covers the properties and functions of the PDE4 family, highlighting procognitive and memory-enhancing effects associated with their inactivation. Expert opinion: Pan-selective PDE4 inhibitors exert a number of memory- and cognition-enhancing effects and have neuroprotective and neuroregenerative properties in preclinical models. The major hurdle for their clinical application is to target inhibitors to specific PDE4 isoforms relevant to particular cognitive disorders to realize the therapeutic potential while avoiding side effects, in particular emesis and nausea. The PDE4 family comprises four genes, PDE4A-D, each expressed as multiple variants. Progress to date stems from characterization of rodent models with selective ablation of individual PDE4 subtypes, revealing that individual subtypes exert unique and non-redundant functions in the brain. Thus, targeting specific PDE4 subtypes, as well as splicing variants or conformational states, represents a promising strategy to separate the therapeutic benefits from the side effects of pan-PDE4 inhibitors. PMID:23883342

  7. New findings on phosphodiesterases, MoPdeH and MoPdeL, in Magnaporthe oryzae revealed by structural analysis.

    PubMed

    Yang, Li-Na; Yin, Ziyi; Zhang, Xi; Feng, Wanzhen; Xiao, Yuhan; Zhang, Haifeng; Zheng, Xiaobo; Zhang, Zhengguang

    2018-05-01

    The cyclic adenosine monophosphate (cAMP) signalling pathway mediates signal communication and sensing during infection-related morphogenesis in eukaryotes. Many studies have implicated cAMP as a critical mediator of appressorium development in the rice blast fungus, Magnaporthe oryzae. The cAMP phosphodiesterases, MoPdeH and MoPdeL, as key regulators of intracellular cAMP levels, play pleiotropic roles in cell wall integrity, cellular morphology, appressorium formation and infectious growth in M. oryzae. Here, we analysed the roles of domains of MoPdeH and MoPdeL separately or in chimeras. The results indicated that the HD and EAL domains of MoPdeH are indispensable for its phosphodiesterase activity and function. Replacement of the MoPdeH HD domain with the L1 and L2 domains of MoPdeL, either singly or together, resulted in decreased cAMP hydrolysis activity of MoPdeH. All of the transformants exhibited phenotypes similar to that of the ΔMopdeH mutant, but also revealed that EAL and L1 play additional roles in conidiation, and that L1 is involved in infectious growth. We further found that the intracellular cAMP level is important for surface signal recognition and hyphal autolysis. The intracellular cAMP level negatively regulates Mps1-MAPK and positively regulates Pmk1-MAPK in the rice blast fungus. Our results provide new information to better understand the cAMP signalling pathway in the development, differentiation and plant infection of the fungus. © 2017 BSPP AND JOHN WILEY & SONS LTD.

  8. Bivariate spline solution of time dependent nonlinear PDE for a population density over irregular domains.

    PubMed

    Gutierrez, Juan B; Lai, Ming-Jun; Slavov, George

    2015-12-01

    We study a time dependent partial differential equation (PDE) which arises from classic models in ecology involving logistic growth with Allee effect by introducing a discrete weak solution. Existence, uniqueness and stability of the discrete weak solutions are discussed. We use bivariate splines to approximate the discrete weak solution of the nonlinear PDE. A computational algorithm is designed to solve this PDE. A convergence analysis of the algorithm is presented. We present some simulations of population development over some irregular domains. Finally, we discuss applications in epidemiology and other ecological problems. Copyright © 2015 Elsevier Inc. All rights reserved.

  9. Dependability analysis of parallel systems using a simulation-based approach. M.S. Thesis

    NASA Technical Reports Server (NTRS)

    Sawyer, Darren Charles

    1994-01-01

    The analysis of dependability in large, complex, parallel systems executing real applications or workloads is examined in this thesis. To effectively demonstrate the wide range of dependability problems that can be analyzed through simulation, the analysis of three case studies is presented. For each case, the organization of the simulation model used is outlined, and the results from simulated fault injection experiments are explained, showing the usefulness of this method in dependability modeling of large parallel systems. The simulation models are constructed using DEPEND and C++. Where possible, methods to increase dependability are derived from the experimental results. Another interesting facet of all three cases is the presence of some kind of workload or application executing in the simulation while faults are injected. This provides a completely new dimension to this type of study, not possible to model accurately with analytical approaches.

  10. The rational search for PDE10A inhibitors from Sophora flavescens roots using pharmacophore‑ and docking‑based virtual screening.

    PubMed

    Fan, Han-Tian; Guo, Jun-Fang; Zhang, Yu-Xin; Gu, Yu-Xi; Ning, Zhong-Qi; Qiao, Yan-Jiang; Wang, Xing

    2018-01-01

    Phosphodiesterase 10A (PDE10A) has been confirmed to be an important target for the treatment of central nervous system (CNS) disorders. The purpose of the present study was to identify PDE10A inhibitors from herbs used in traditional Chinese medicine. Pharmacophore and molecular docking techniques were used to virtually screen the chemical molecule database of Sophora flavescens, a well‑known Chinese herb that has been used for improving mental health and regulating the CNS. The pharmacophore model generated recognized the common functional groups of known PDE10A inhibitors. In addition, molecular docking was used to calculate the binding affinity of ligand‑PDE10A interactions and to investigate the possible binding pattern. Virtual screening based on the pharmacophore model and molecular docking was performed to identify potential PDE10A inhibitors from S. flavescens. The results demonstrated that nine hits from S. flavescens were potential PDE10A inhibitors, and their biological activity was further validated using literature mining. A total of two compounds were reported to inhibit cyclic adenosine monophosphate phosphodiesterase, and one protected against glutamate‑induced oxidative stress in the CNS. The remaining six compounds require further bioactivity validation. The results of the present study demonstrated that this method was a time‑ and cost‑saving strategy for the identification of bioactive compounds from traditional Chinese medicine.

  11. New Therapeutic Applications of Phosphodiesterase 5 Inhibitors (PDE5-Is).

    PubMed

    Ribaudo, Giovanni; Pagano, Mario Angelo; Bova, Sergio; Zagotto, Giuseppe

    2016-01-01

    Phosphodiesterase 5 inhibitors (PDE5-Is) sildenafil, vardenafil, tadalafil and the recently approved avanafil represent the first-line choice for both on-demand and chronic treatment of erectile dysfunction (ED). In addition, sildenafil and tadalafil have also been approved for the treatment of pulmonary arterial hypertension. Due to its expression and localization in many tissues, PDE5 and its regulation have been reported to be involved in several other diseases. We aim to provide an updated overview of the emerging therapeutic applications of PDE5-Is besides ED, taking into account the latest ongoing research reports. We searched online databases (Pubmed, Reaxys, Scopus) to lay the basis for an accurate, quality criteria-based literature update. We focused our attention on the most recent research reports, in particular when supported by pre-clinical and clinical data. The regulation of PDE5 may influence pathological conditions such as, among others, heart failure, cystic fibrosis, cancer, CNS-related diseases, diabetes and dysfunctions affecting the male urinary/reproductive system. Sildenafil, vardenafil, tadalafil and the other chemical entities considered PDE5-Is showed overall positive results and significant improvements in the studied diseases, although some discordant results, in particular when comparing pre-clinical and clinical data, have to be pointed out, suggesting that further insights are needed, especially to assess the exact underlying molecular pathways.

  12. Identification of cancer cytotoxic modulators of PDE3A by predictive chemogenomics

    PubMed Central

    de Waal, Luc; Lewis, Timothy A.; Rees, Matthew G.; Tsherniak, Aviad; Wu, Xiaoyun; Choi, Peter S.; Gechijian, Lara; Hartigan, Christina; Faloon, Patrick W.; Hickey, Mark J.; Tolliday, Nicola; Carr, Steven A.; Clemons, Paul A.; Munoz, Benito; Wagner, Bridget K.; Shamji, Alykhan F.; Koehler, Angela N.; Schenone, Monica; Burgin, Alex B.; Schreiber, Stuart L.; Greulich, Heidi; Meyerson, Matthew

    2015-01-01

    High cancer death rates indicate the need for new anti-cancer therapeutic agents. Approaches to discover new cancer drugs include target-based drug discovery and phenotypic screening. Here, we identified phosphodiesterase 3A modulators as cell-selective cancer cytotoxic compounds by phenotypic compound library screening and target deconvolution by predictive chemogenomics. We found that sensitivity to 6-(4-(diethylamino)-3-nitrophenyl)-5-methyl-4,5-dihydropyridazin-3(2H)-one, or DNMDP, across 766 cancer cell lines correlates with expression of the phosphodiesterase 3A gene, PDE3A. Like DNMDP, a subset of known PDE3A inhibitors kill selected cancer cells while others do not. Furthermore, PDE3A depletion leads to DNMDP resistance. We demonstrated that DNMDP binding to PDE3A promotes an interaction between PDE3A and Schlafen 12 (SLFN12), suggesting a neomorphic activity. Co-expression of SLFN12 with PDE3A correlates with DNMDP sensitivity, while depletion of SLFN12 results in decreased DNMDP sensitivity. Our results implicate PDE3A modulators as candidate cancer therapeutic agents and demonstrate the power of predictive chemogenomics in small-molecule discovery. PMID:26656089

  13. Parallel implementation of the particle simulation method with dynamic load balancing: Toward realistic geodynamical simulation

    NASA Astrophysics Data System (ADS)

    Furuichi, M.; Nishiura, D.

    2015-12-01

    Fully Lagrangian methods such as Smoothed Particle Hydrodynamics (SPH) and the Discrete Element Method (DEM) have been widely used to solve continuum and particle motions in the computational geodynamics field. These mesh-free methods are suitable for problems with complex geometry and boundaries. In addition, their Lagrangian nature allows non-diffusive advection, which is useful for tracking history-dependent properties (e.g. rheology) of the material. These potential advantages over mesh-based methods offer effective numerical applications to geophysical flow and tectonic processes, for example, tsunami with free surface and floating body, magma intrusion with fracture of rock, and shear zone pattern generation of granular deformation. In order to investigate such geodynamical problems with particle-based methods, millions to billions of particles are required for a realistic simulation. Parallel computing is therefore important for handling such a huge computational cost. An efficient parallel implementation of SPH and DEM methods is, however, known to be difficult, especially for distributed-memory architectures. Lagrangian methods inherently show a workload imbalance problem when parallelized with a fixed domain in space, because particles move around and workloads change during the simulation. Dynamic load balancing is therefore a key technique for performing large-scale SPH and DEM simulations. In this work, we present a parallel implementation technique for the SPH and DEM methods utilizing dynamic load balancing algorithms, aimed at high-resolution simulation over large domains using a massively parallel supercomputer system. Our method treats the imbalances in the execution time of each MPI process as the nonlinear term of parallel domain decomposition and minimizes them with a Newton-like iteration method. In order to perform flexible domain decomposition in space, the slice-grid algorithm is used. Numerical tests show that our

  14. HEAT.PRO - THERMAL IMBALANCE FORCE SIMULATION AND ANALYSIS USING PDE2D

    NASA Technical Reports Server (NTRS)

    Vigue, Y.

    1994-01-01

    HEAT.PRO calculates the thermal imbalance force resulting from satellite surface heating. The heated body of a satellite re-radiates energy at a rate that is proportional to its temperature, losing the energy in the form of photons. By conservation of momentum, this momentum flux out of the body creates a reaction force against the radiation surface, and the net thermal force can be observed as a small perturbation that affects long term orbital behavior of the satellite. HEAT.PRO calculates this thermal imbalance force and then determines its effects on satellite orbits, especially where the Earth's shadowing of an orbiting satellite causes periodic changes in the spacecraft's thermal environment. HEAT.PRO implements a finite element method routine called PDE2D which incorporates material properties to determine the solar panel surface temperatures. The nodal temperatures are computed at specified time steps and are used to determine the magnitude and direction of the thermal force on the spacecraft. These calculations are based on the solar panel orientation and satellite's position with respect to the earth and sun. It is necessary to have accurate, current knowledge of surface emissivity, thermal conductivity, heat capacity, and material density. These parameters, which may change due to degradation of materials in the environment of space, influence the nodal temperatures that are computed and thus the thermal force calculations. HEAT.PRO was written in FORTRAN 77 for Cray series computers running UNICOS. The source code contains directives for and is used as input to the required partial differential equation solver, PDE2D. HEAT.PRO is available on a 9-track 1600 BPI magnetic tape in UNIX tar format (standard distribution medium) or a .25 inch streaming magnetic tape cartridge in UNIX tar format. An electronic copy of the documentation in Macintosh Microsoft Word format is included on the distribution tape. HEAT.PRO was developed in 1991. 
Cray and UNICOS are

  15. The UPSF code: a metaprogramming-based high-performance automatically parallelized plasma simulation framework

    NASA Astrophysics Data System (ADS)

    Gao, Xiatian; Wang, Xiaogang; Jiang, Binhao

    2017-10-01

    UPSF (Universal Plasma Simulation Framework) is a new plasma simulation code designed for maximum flexibility by using cutting-edge techniques supported by the C++17 standard. Through the use of metaprogramming, UPSF provides arbitrary dimensional data structures and methods to support various kinds of plasma simulation models, such as Vlasov, particle in cell (PIC), fluid, Fokker-Planck, and their variants and hybrid methods. Through C++ metaprogramming, a single code can be applied to arbitrary dimensional systems with no loss of performance. UPSF can also automatically parallelize the distributed data structure and accelerate matrix and tensor operations with BLAS. A three-dimensional particle in cell code is developed based on UPSF. Two test cases, Landau damping and Weibel instability, for the electrostatic and electromagnetic situations respectively, are presented to show the validation and performance of the UPSF code.

  16. Applying Parallel Processing Techniques to Tether Dynamics Simulation

    NASA Technical Reports Server (NTRS)

    Wells, B. Earl

    1996-01-01

    The focus of this research has been to determine the effectiveness of applying parallel processing techniques to a sizable real-world problem, the simulation of the dynamics associated with a tether which connects two objects in low earth orbit, and to explore the degree to which the parallelization process can be automated through the creation of new software tools. The goal has been to utilize this specific application problem as a base to develop more generally applicable techniques.

  17. In permanent atrial fibrillation, PDE3 reduces force responses to 5‐HT, but PDE3 and PDE4 do not cause the blunting of atrial arrhythmias

    PubMed Central

    Schwarz, Simon; Ravens, Ursula; Knaut, Michael

    2016-01-01

    Abstract Background and Purpose 5‐HT increases force and L‐type Ca2+ current (ICa,L) and causes arrhythmias through 5‐HT4 receptors in human atrium. In permanent atrial fibrillation (peAF), atrial force responses to 5‐HT are blunted, arrhythmias abolished but ICa,L responses only moderately attenuated. We investigated whether, in peAF, this could be due to an increased function of PDE3 and/or PDE4, using the inhibitors cilostamide (300 nM) and rolipram (1 μM) respectively. Experimental Approach Contractile force, arrhythmic contractions and ICa,L were assessed in right atrial trabeculae and myocytes, obtained from patients with sinus rhythm (SR), paroxysmal atrial fibrillation (pAF) and peAF. Key Results Maximum force responses to 5‐HT were reduced to 15% in peAF, but not in pAF. Cilostamide, but not rolipram, increased both the blunted force responses to 5‐HT in peAF and the inotropic potency of 5‐HT fourfold to sevenfold in trabeculae of patients with SR, pAF and peAF. Lusitropic responses to 5‐HT were not decreased in peAF. Responses of ICa,L to 5‐HT did not differ and were unaffected by cilostamide or rolipram in myocytes from patients with SR or peAF. Concurrent cilostamide and rolipram increased 5‐HT's propensity to elicit arrhythmias in trabeculae from patients with SR, but not with peAF. Conclusions and Implications PDE3, but not PDE4, reduced inotropic responses to 5‐HT in peAF, independently of lusitropy and ICa,L, but PDE3 activity was the same as that in patients with SR and pAF. Atrial remodelling in peAF abolished the facilitation of 5‐HT to induce arrhythmias by inhibition of PDE3 plus PDE4. PMID:27238373

  18. PDE-based geophysical modelling using finite elements: examples from 3D resistivity and 2D magnetotellurics

    NASA Astrophysics Data System (ADS)

    Schaa, R.; Gross, L.; du Plessis, J.

    2016-04-01

    We present a general finite-element solver, escript, tailored to solve geophysical forward and inverse modeling problems in terms of partial differential equations (PDEs) with suitable boundary conditions. Escript’s abstract interface allows geoscientists to focus on solving the actual problem without being experts in numerical modeling. General-purpose finite element solvers have found wide use especially in engineering fields and find increasing application in the geophysical disciplines as these offer a single interface to tackle different geophysical problems. These solvers are useful for data interpretation and for research, but can also be a useful tool in educational settings. This paper serves as an introduction into PDE-based modeling with escript where we demonstrate in detail how escript is used to solve two different forward modeling problems from applied geophysics (3D DC resistivity and 2D magnetotellurics). Based on these two different cases, other geophysical modeling work can easily be realized. The escript package is implemented as a Python library and allows the solution of coupled, linear or non-linear, time-dependent PDEs. Parallel execution for both shared and distributed memory architectures is supported and can be used without modifications to the scripts.

  19. Parallel computing method for simulating hydrological processes of large rivers under climate change

    NASA Astrophysics Data System (ADS)

    Wang, H.; Chen, Y.

    2016-12-01

    Climate change is one of the best-known global environmental problems. It has altered the distribution of watershed hydrological processes in time and space, especially in the world's large rivers. Watershed hydrological process simulation based on a physically based distributed hydrological model can give better results than lumped models. However, such simulation involves a large amount of calculation, especially for large rivers, and thus needs huge computing resources that may not be steadily available to researchers or may be available only at high expense; this has seriously restricted research and application. To solve this problem, current parallel methods mostly parallelize the computation in the space and time dimensions. They calculate the natural features in order, grid by grid (unit, a basin), from upstream to downstream, based on the distributed hydrological model. This article proposes a high-performance computing method for hydrological process simulation with a high speedup ratio and parallel efficiency. It combines the temporal and spatial runoff characteristics of the distributed hydrological model with methods adopting distributed data storage, memory database, distributed computing, and parallel computing based on computing power unit. The method has strong adaptability and extensibility, which means it can make full use of the computing and storage resources under the condition of limited computing resources, and the computing efficiency can be improved linearly with the increase of computing resources. This method can satisfy the parallel computing requirements of hydrological process simulation in small, medium and large rivers.

  20. Parallel simulation of tsunami inundation on a large-scale supercomputer

    NASA Astrophysics Data System (ADS)

    Oishi, Y.; Imamura, F.; Sugawara, D.

    2013-12-01

    An accurate prediction of tsunami inundation is important for disaster mitigation purposes. One approach is to approximate the tsunami wave source through an instant inversion analysis using real-time observation data (e.g., Tsushima et al., 2009) and then use the resulting wave source data in an instant tsunami inundation simulation. However, a bottleneck of this approach is the large computational cost of the non-linear inundation simulation and the computational power of recent massively parallel supercomputers is helpful to enable faster than real-time execution of a tsunami inundation simulation. Parallel computers have become approximately 1000 times faster in 10 years (www.top500.org), and so it is expected that very fast parallel computers will be more and more prevalent in the near future. Therefore, it is important to investigate how to efficiently conduct a tsunami simulation on parallel computers. In this study, we are targeting very fast tsunami inundation simulations on the K computer, currently the fastest Japanese supercomputer, which has a theoretical peak performance of 11.2 PFLOPS. One computing node of the K computer consists of 1 CPU with 8 cores that share memory, and the nodes are connected through a high-performance torus-mesh network. The K computer is designed for distributed-memory parallel computation, so we have developed a parallel tsunami model. Our model is based on TUNAMI-N2 model of Tohoku University, which is based on a leap-frog finite difference method. A grid nesting scheme is employed to apply high-resolution grids only at the coastal regions. To balance the computation load of each CPU in the parallelization, CPUs are first allocated to each nested layer in proportion to the number of grid points of the nested layer. Using CPUs allocated to each layer, 1-D domain decomposition is performed on each layer. 
In the parallel computation, three types of communication are necessary: (1) communication to adjacent neighbours for the

  1. A New Parallel Boundary Condition for Turbulence Simulations in Stellarators

    NASA Astrophysics Data System (ADS)

    Martin, Mike F.; Landreman, Matt; Dorland, William; Xanthopoulos, Pavlos

    2017-10-01

    For gyrokinetic simulations of core turbulence, the "twist-and-shift" parallel boundary condition (Beer et al., PoP, 1995), which involves a shift in radial wavenumber proportional to the global shear and a quantization of the simulation domain's aspect ratio, is the standard choice. But as this condition was derived under the assumption of axisymmetry, "twist-and-shift" as it stands is formally incorrect for turbulence simulations in stellarators. Moreover, for low-shear stellarators like W7X and HSX, the use of a global shear in the traditional boundary condition places an inflexible constraint on the aspect ratio of the domain, requiring more grid points to fully resolve its extent. Here, we present a parallel boundary condition for "stellarator-symmetric" simulations that relies on the local shear along a field line. This boundary condition is similar to "twist-and-shift", but has an added flexibility in choosing the parallel length of the domain based on local shear consideration in order to optimize certain parameters such as the aspect ratio of the simulation domain.

  2. Effects of PDE4 Pathway Inhibition in Rat Experimental Stroke

    PubMed Central

    Yang, Fan; Sumbria, Rachita K.; Xue, Dong; Yu, Chuanhui; He, Dan; Liu, Shuo; Paganini-Hill, Annlia; Fisher, Mark J.

    2015-01-01

    PURPOSE The first genomewide association study indicated that variations in the phosphodiesterase 4D (PDE4D) gene confer risk for ischemic stroke. However, inconsistencies among the studies designed to replicate the findings indicated the need for further investigation to elucidate the role of the PDE4 pathway in stroke pathogenesis. Hence, we studied the effect of global inhibition of the PDE4 pathway in two rat experimental stroke models, using the PDE4 inhibitor rolipram. Further, the specific role of the PDE4D isoform in ischemic stroke pathogenesis was studied using PDE4D knockout rats in experimental stroke. METHODS Rats were subjected to either the ligation or embolic stroke model and treated with rolipram (3mg/kg; i.p.) prior to the ischemic insult. Similarly, the PDE4D knockout rats were subjected to experimental stroke using the embolic model. RESULTS Global inhibition of the PDE4 pathway using rolipram produced infarcts that were 225% (p<0.01) and 138% (p<0.05) of control in the ligation and embolic models, respectively. PDE4D knockout rats subjected to embolic stroke showed no change in infarct size compared to wild-type control. CONCLUSIONS Despite increase in infarct size after global inhibition of the PDE4 pathway with rolipram, specific inhibition of the PDE4D isoform had no effect on experimental stroke. These findings support a role for the PDE4 pathway, independent of the PDE4D isoform, in ischemic stroke pathogenesis. PMID:25224348

  3. Parallel STEPS: Large Scale Stochastic Spatial Reaction-Diffusion Simulation with High Performance Computers.

    PubMed

    Chen, Weiliang; De Schutter, Erik

    2017-01-01

    Stochastic, spatial reaction-diffusion simulations have been widely used in systems biology and computational neuroscience. However, the increasing scale and complexity of models and morphologies have exceeded the capacity of any serial implementation. This led to the development of parallel solutions that benefit from the boost in performance of modern supercomputers. In this paper, we describe an MPI-based, parallel operator-splitting implementation for stochastic spatial reaction-diffusion simulations with irregular tetrahedral meshes. The performance of our implementation is first examined and analyzed with simulations of a simple model. We then demonstrate its application to real-world research by simulating the reaction-diffusion components of a published calcium burst model in both Purkinje neuron sub-branch and full dendrite morphologies. Simulation results indicate that our implementation is capable of achieving super-linear speedup for balanced loading simulations with reasonable molecule density and mesh quality. In the best scenario, a parallel simulation with 2,000 processes runs more than 3,600 times faster than its serial SSA counterpart, and achieves more than 20-fold speedup relative to parallel simulation with 100 processes. In a more realistic scenario with dynamic calcium influx and data recording, the parallel simulation with 1,000 processes and no load balancing is still 500 times faster than the conventional serial SSA simulation.
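
    The speedup figures quoted in this abstract can be related to parallel efficiency with a little arithmetic. The sketch below is illustrative only: the function name is hypothetical, and the inputs are the lower bounds stated in the abstract ("more than 3,600 times", "500 times"), so the results are bounds rather than measured values.

```python
def parallel_efficiency(speedup, processes):
    """Speedup over the serial run divided by the process count.

    A value above 1.0 means super-linear speedup.
    """
    return speedup / processes


# Figures taken from the abstract (lower bounds, not exact measurements).
eff_best = parallel_efficiency(3600, 2000)       # best scenario, 2,000 processes
eff_realistic = parallel_efficiency(500, 1000)   # 1,000 processes, no load balancing

# Going from 100 to 2,000 processes, ideal (linear) scaling gives a 20-fold gain,
# so the reported "more than 20-fold" relative speedup is above linear in that range.
ideal_relative_gain = 2000 / 100

print(f"best-scenario efficiency      >= {eff_best:.2f}")
print(f"realistic-scenario efficiency ~= {eff_realistic:.2f}")
print(f"ideal gain from 100 to 2,000 processes = {ideal_relative_gain:.0f}x")
```

    Under these assumptions the best scenario corresponds to an efficiency of at least 1.8 (super-linear), while the realistic scenario without load balancing corresponds to roughly 0.5.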

  4. On extending parallelism to serial simulators

    NASA Technical Reports Server (NTRS)

    Nicol, David; Heidelberger, Philip

    1994-01-01

    This paper describes an approach to discrete event simulation modeling that appears to be effective for developing portable and efficient parallel execution of models of large distributed systems and communication networks. In this approach, the modeler develops submodels using an existing sequential simulation modeling tool, using the full expressive power of the tool. A set of modeling language extensions permit automatically synchronized communication between submodels; however, the automation requires that any such communication must take a nonzero amount of simulation time. Within this modeling paradigm, a variety of conservative synchronization protocols can transparently support conservative execution of submodels on potentially different processors. A specific implementation of this approach, U.P.S. (Utilitarian Parallel Simulator), is described, along with performance results on the Intel Paragon.

  5. Cloning and characterization of a cAMP-specific phosphodiesterase (TbPDE2B) from Trypanosoma brucei

    PubMed Central

    Rascón, Ana; Soderling, Scott H.; Schaefer, Jonathan B.; Beavo, Joseph A.

    2002-01-01

    Here we report the cloning, expression, and characterization of a cAMP-specific phosphodiesterase (PDE) from Trypanosoma brucei (TbPDE2B). Using a bioinformatic approach, two different expressed sequence tag clones were identified and used to isolate the complete sequence of two identical PDE genes arranged in tandem. Each gene consists of 2,793 bases that predict a protein of 930 aa with a molecular mass of 103.2 kDa. Two GAF (for cGMP binding and stimulated PDEs, Anabaena adenylyl cyclases, and Escherichia coli FhlA) domains, similar to those contained in many signaling molecules including mammalian PDE2, PDE5, PDE6, PDE10, and PDE11, were located N-terminal to a consensus PDE catalytic domain. The catalytic domain is homologous to the catalytic domain of all 11 mammalian PDEs, the Dictyostelium discoideum RegA, and a probable PDE from Caenorhabditis elegans. It is most similar to the T. brucei PDE2A (89% identity). TbPDE2B has substrate specificity for cAMP with a Km of 2.4 μM. cGMP is not hydrolyzed by TbPDE2B nor does this cyclic nucleotide modulate cAMP PDE activity. The nonselective PDE inhibitors 3-isobutyl-1-methylxanthine, papaverine and pentoxifyline are poor inhibitors of TbPDE2B. Similarly, PDE inhibitors selective for the mammalian PDE families 2, 3, 5, and 6 (erythro-9-[3-(2-hydroxynonyl)]-adenine, enoximone, zaprinast, and sildenafil) were also unable to inhibit this enzyme. However, dipyridamole was a reasonably good inhibitor of this enzyme with an IC50 of 27 μM. cAMP plays key roles in cell growth and differentiation in this parasite, and PDEs are responsible for the hydrolysis of this important second messenger. Therefore, parasite PDEs, including this one, have the potential to be attractive targets for selective drug design. PMID:11930017

  6. A compositional reservoir simulator on distributed memory parallel computers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rame, M.; Delshad, M.

    1995-12-31

    This paper presents the application of distributed memory parallel computers to field scale reservoir simulations using a parallel version of UTCHEM, The University of Texas Chemical Flooding Simulator. The model is a general purpose highly vectorized chemical compositional simulator that can simulate a wide range of displacement processes at both field and laboratory scales. The original simulator was modified to run on both distributed memory parallel machines (Intel iPSC/960 and Delta, Connection Machine 5, Kendall Square 1 and 2, and CRAY T3D) and a cluster of workstations. A domain decomposition approach has been taken towards parallelization of the code. A portion of the discrete reservoir model is assigned to each processor by a set-up routine that attempts a data layout as even as possible from the load-balance standpoint. Each of these subdomains is extended so that data can be shared between adjacent processors for stencil computation. The added routines that make parallel execution possible are written in a modular fashion that makes porting to new parallel platforms straightforward. Results of the distributed memory computing performance of the parallel simulator are presented for field scale applications such as tracer flood and polymer flood. A comparison of the wall-clock times for the same problems on a vector supercomputer is also presented.
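
    The decomposition idea described in this abstract (an even split of the grid, with each subdomain extended by overlap cells for stencil computation) can be illustrated with a minimal one-dimensional sketch. This is not UTCHEM code: the function names, the 3-point averaging stencil, and the halo width of one cell are all assumptions chosen for illustration, and the "processors" are simulated by a plain loop.

```python
def split_evenly(n_cells, n_procs):
    """Return (start, stop) ranges whose sizes differ by at most one cell."""
    base, extra = divmod(n_cells, n_procs)
    ranges, start = [], 0
    for rank in range(n_procs):
        size = base + (1 if rank < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges


def smooth(values):
    """Serial 3-point average on interior cells; the two end cells are held fixed."""
    out = list(values)
    for i in range(1, len(values) - 1):
        out[i] = (values[i - 1] + values[i] + values[i + 1]) / 3.0
    return out


def smooth_decomposed(values, n_procs, halo=1):
    """Apply the same stencil subdomain by subdomain, using halo (ghost) cells."""
    n = len(values)
    result = [0.0] * n
    for start, stop in split_evenly(n, n_procs):
        # Extend the owned range by the halo so neighbour data is available.
        lo, hi = max(start - halo, 0), min(stop + halo, n)
        local_out = smooth(values[lo:hi])
        # Copy back only the cells this subdomain owns; halo cells are discarded.
        for i in range(start, stop):
            result[i] = local_out[i - lo]
    return result


field = [float(i * i) for i in range(10)]
print(split_evenly(len(field), 3))
print(smooth_decomposed(field, 3) == smooth(field))
```

    In this sketch the decomposed result matches the serial one because every owned cell sees the same neighbours it would see on the full grid; in a real distributed-memory code the halo slices would be filled by message passing between adjacent processors rather than by slicing a shared list.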

  7. A path-level exact parallelization strategy for sequential simulation

    NASA Astrophysics Data System (ADS)

    Peredo, Oscar F.; Baeza, Daniel; Ortiz, Julián M.; Herrero, José R.

    2018-01-01

    Sequential Simulation is a well known method in geostatistical modelling. Following the Bayesian approach for simulation of conditionally dependent random events, Sequential Indicator Simulation (SIS) method draws simulated values for K categories (categorical case) or classes defined by K different thresholds (continuous case). Similarly, Sequential Gaussian Simulation (SGS) method draws simulated values from a multivariate Gaussian field. In this work, a path-level approach to parallelize SIS and SGS methods is presented. A first stage of re-arrangement of the simulation path is performed, followed by a second stage of parallel simulation for non-conflicting nodes. A key advantage of the proposed parallelization method is to generate identical realizations as with the original non-parallelized methods. Case studies are presented using two sequential simulation codes from GSLIB: SISIM and SGSIM. Execution time and speedup results are shown for large-scale domains, with many categories and maximum kriging neighbours in each case, achieving high speedup results in the best scenarios using 16 threads of execution in a single machine.

  8. Parallel STEPS: Large Scale Stochastic Spatial Reaction-Diffusion Simulation with High Performance Computers

    PubMed Central

    Chen, Weiliang; De Schutter, Erik

    2017-01-01

    Stochastic, spatial reaction-diffusion simulations have been widely used in systems biology and computational neuroscience. However, the increasing scale and complexity of models and morphologies have exceeded the capacity of any serial implementation. This led to the development of parallel solutions that benefit from the boost in performance of modern supercomputers. In this paper, we describe an MPI-based, parallel operator-splitting implementation for stochastic spatial reaction-diffusion simulations with irregular tetrahedral meshes. The performance of our implementation is first examined and analyzed with simulations of a simple model. We then demonstrate its application to real-world research by simulating the reaction-diffusion components of a published calcium burst model in both Purkinje neuron sub-branch and full dendrite morphologies. Simulation results indicate that our implementation is capable of achieving super-linear speedup for balanced loading simulations with reasonable molecule density and mesh quality. In the best scenario, a parallel simulation with 2,000 processes runs more than 3,600 times faster than its serial SSA counterpart, and achieves more than 20-fold speedup relative to parallel simulation with 100 processes. In a more realistic scenario with dynamic calcium influx and data recording, the parallel simulation with 1,000 processes and no load balancing is still 500 times faster than the conventional serial SSA simulation. PMID:28239346

  9. A pathophysiological role of PDE3 in allergic airway inflammation

    PubMed Central

    Beute, Jan; Lukkes, Melanie; Koekoek, Ewout P.; Nastiti, Hedwika; Ganesh, Keerthana; de Bruijn, Marjolein J.W.; Hockman, Steve; van Nimwegen, Menno; Braunstahl, Gert-Jan; Boon, Louis; Lambrecht, Bart N.; Manganiello, Vince C.; Hendriks, Rudi W.

    2018-01-01

    Phosphodiesterase 3 (PDE3) and PDE4 regulate levels of cyclic AMP, which are critical in various cell types involved in allergic airway inflammation. Although PDE4 inhibition attenuates allergic airway inflammation, reported side effects preclude its application as an antiasthma drug in humans. Case reports showed that enoximone, which is a smooth muscle relaxant that inhibits PDE3, is beneficial and lifesaving in status asthmaticus and is well tolerated. However, clinical observations also showed antiinflammatory effects of PDE3 inhibition. In this study, we investigated the role of PDE3 in a house dust mite–driven (HDM-driven) allergic airway inflammation (AAI) model that is characterized by T helper 2 cell activation, eosinophilia, and reduced mucosal barrier function. Compared with wild-type (WT) littermates, mice with a targeted deletion of the PDE3A or PDE3B gene showed significantly reduced HDM-driven AAI. Therapeutic intervention in WT mice showed that all hallmarks of HDM-driven AAI were abrogated by the PDE3 inhibitors enoximone and milrinone. Importantly, we found that enoximone also reduced the upregulation of the CD11b integrin on mouse and human eosinophils in vitro, which is crucial for their recruitment during allergic inflammation. This study provides evidence for a hitherto unknown antiinflammatory role of PDE3 inhibition in allergic airway inflammation and offers a potentially novel treatment approach. PMID:29367458

  10. Efficient parallel simulation of CO2 geologic sequestration in saline aquifers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhang, Keni; Doughty, Christine; Wu, Yu-Shu

    2007-01-01

    An efficient parallel simulator for large-scale, long-term CO2 geologic sequestration in saline aquifers has been developed. The parallel simulator is a three-dimensional, fully implicit model that solves large, sparse linear systems arising from discretization of the partial differential equations for mass and energy balance in porous and fractured media. The simulator is based on the ECO2N module of the TOUGH2 code and inherits all the process capabilities of the single-CPU TOUGH2 code, including a comprehensive description of the thermodynamics and thermophysical properties of H2O-NaCl-CO2 mixtures, modeling single and/or two-phase isothermal or non-isothermal flow processes, two-phase mixtures, fluid phases appearing or disappearing, as well as salt precipitation or dissolution. The new parallel simulator uses MPI for parallel implementation, the METIS software package for simulation domain partitioning, and the iterative parallel linear solver package Aztec for solving linear equations by multiple processors. In addition, the parallel simulator has been implemented with an efficient communication scheme. Test examples show that a linear or super-linear speedup can be obtained on Linux clusters as well as on supercomputers. Because of the significant improvement in both simulation time and memory requirement, the new simulator provides a powerful tool for tackling larger scale and more complex problems than can be solved by single-CPU codes. A high-resolution simulation example is presented that models buoyant convection, induced by a small increase in brine density caused by dissolution of CO2.

  11. PDE1C deficiency antagonizes pathological cardiac remodeling and dysfunction

    PubMed Central

    Knight, Walter E.; Chen, Si; Zhang, Yishuai; Oikawa, Masayoshi; Wu, Meiping; Zhou, Qian; Miller, Clint L.; Cai, Yujun; Mickelsen, Deanne M.; Moravec, Christine; Small, Eric M.; Abe, Junichi; Yan, Chen

    2016-01-01

    Cyclic nucleotide phosphodiesterase 1C (PDE1C) represents a major phosphodiesterase activity in human myocardium, but its function in the heart remains unknown. Using genetic and pharmacological approaches, we studied the expression, regulation, function, and underlying mechanisms of PDE1C in the pathogenesis of cardiac remodeling and dysfunction. PDE1C expression is up-regulated in mouse and human failing hearts and is highly expressed in cardiac myocytes but not in fibroblasts. In adult mouse cardiac myocytes, PDE1C deficiency or inhibition attenuated myocyte death and apoptosis, which was largely dependent on cyclic AMP/PKA and PI3K/AKT signaling. PDE1C deficiency also attenuated cardiac myocyte hypertrophy in a PKA-dependent manner. Conditioned medium taken from PDE1C-deficient cardiac myocytes attenuated TGF-β–stimulated cardiac fibroblast activation through a mechanism involving the crosstalk between cardiac myocytes and fibroblasts. In vivo, cardiac remodeling and dysfunction induced by transverse aortic constriction, including myocardial hypertrophy, apoptosis, cardiac fibrosis, and loss of contractile function, were significantly attenuated in PDE1C-knockout mice relative to wild-type mice. These results indicate that PDE1C activation plays a causative role in pathological cardiac remodeling and dysfunction. Given the continued development of highly specific PDE1 inhibitors and the high expression level of PDE1C in the human heart, our findings could have considerable therapeutic significance. PMID:27791092

  12. CUBE: Information-optimized parallel cosmological N-body simulation code

    NASA Astrophysics Data System (ADS)

    Yu, Hao-Ran; Pen, Ue-Li; Wang, Xin

    2018-05-01

    CUBE, written in Coarray Fortran, is a particle-mesh based parallel cosmological N-body simulation code. The memory usage of CUBE can approach as low as 6 bytes per particle. Particle pairwise (PP) force, cosmological neutrinos, spherical overdensity (SO) halofinder are included.

  13. PDE and cognitive processing: beyond the memory domain.

    PubMed

    Heckman, P R A; Blokland, A; Ramaekers, J; Prickaerts, J

    2015-03-01

    Phosphodiesterase inhibitors (PDE-Is) enhance cAMP and/or cGMP signaling via reducing the degradation of these cyclic nucleotides. Both cAMP and cGMP signaling are essential for a variety of cellular functions and exert their effects both pre- and post-synaptically. Either of these second messengers relays and amplifies incoming signals at receptors on the cell surface making them important elements in signal transduction cascades and essential in cellular signaling in a variety of cell functions including neurotransmitter release and neuroprotection. Consequently, these processes can be influenced by PDE-Is as they increase cAMP and/or cGMP concentrations. PDE-Is have been considered as possible therapeutic agents to treat impaired memory function linked to several brain disorders, including depression, schizophrenia and Alzheimer's disease (AD). This review will, however, focus on the possible role of phosphodiesterases (PDEs) in cognitive decline beyond the memory domain. Here we will discuss the involvement of PDEs on three related domains: attention, information filtering (sensory- and sensorimotor gating) and response inhibition (drug-induced hyperlocomotion). Currently, these are emerging cognitive domains in the field of PDE research. Here we discuss experimental studies and the potential beneficial effects of PDE-I drugs on these cognitive domains, as effects of PDE-Is on these domains could potentially influence effects on memory performance. Overall, PDE4 seems to be the most promising target for all domains discussed in this review. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. Parallelization of sequential Gaussian, indicator and direct simulation algorithms

    NASA Astrophysics Data System (ADS)

    Nunes, Ruben; Almeida, José A.

    2010-08-01

    Improving the performance and robustness of algorithms on new high-performance parallel computing architectures is a key issue in efficiently performing 2D and 3D studies with large amount of data. In geostatistics, sequential simulation algorithms are good candidates for parallelization. When compared with other computational applications in geosciences (such as fluid flow simulators), sequential simulation software is not extremely computationally intensive, but parallelization can make it more efficient and creates alternatives for its integration in inverse modelling approaches. This paper describes the implementation and benchmarking of a parallel version of the three classic sequential simulation algorithms: direct sequential simulation (DSS), sequential indicator simulation (SIS) and sequential Gaussian simulation (SGS). For this purpose, the source used was GSLIB, but the entire code was extensively modified to take into account the parallelization approach and was also rewritten in the C programming language. The paper also explains in detail the parallelization strategy and the main modifications. Regarding the integration of secondary information, the DSS algorithm is able to perform simple kriging with local means, kriging with an external drift and collocated cokriging with both local and global correlations. SIS includes a local correction of probabilities. Finally, a brief comparison is presented of simulation results using one, two and four processors. All performance tests were carried out on 2D soil data samples. The source code is completely open source and easy to read. It should be noted that the code is only fully compatible with Microsoft Visual C and should be adapted for other systems/compilers.

  15. Xyce parallel electronic simulator : reference guide.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mei, Ting; Rankin, Eric Lamont; Thornquist, Heidi K.

    2011-05-01

    This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users Guide. The focus of this document is to list exhaustively (to the extent possible) the device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users Guide. The Xyce Parallel Electronic Simulator has been written to support, in a rigorous manner, the simulation needs of the Sandia National Laboratories electrical designers. It is targeted specifically to run on large-scale parallel computing platforms but also runs well on a variety of architectures including single processor workstations. It also aims to support a variety of devices and models specific to Sandia needs. This document is intended to complement the Xyce Users Guide. It contains comprehensive, detailed information about a number of topics pertinent to the usage of Xyce. Included in this document is a netlist reference for the input-file commands and elements supported within Xyce; a command line reference, which describes the available command line arguments for Xyce; and quick-references for users of other circuit codes, such as Orcad's PSpice and Sandia's ChileSPICE.

  16. Phosphodiesterase-1b (Pde1b) knockout mice are resistant to forced swim and tail suspension induced immobility and show upregulation of Pde10a.

    PubMed

    Hufgard, Jillian R; Williams, Michael T; Skelton, Matthew R; Grubisha, Olivera; Ferreira, Filipa M; Sanger, Helen; Wright, Mary E; Reed-Kessler, Tracy M; Rasmussen, Kurt; Duman, Ronald S; Vorhees, Charles V

    2017-06-01

    Major depressive disorder is a leading cause of suicide and disability. Despite this, current antidepressants provide insufficient efficacy in more than 60% of patients. Most current antidepressants are presynaptic reuptake inhibitors; postsynaptic signal regulation has not received as much attention as potential treatment targets. We examined the effects of disruption of the postsynaptic cyclic nucleotide hydrolyzing enzyme, phosphodiesterase (PDE) 1b, on depressive-like behavior and the effects on PDE1B protein in wild-type (WT) mice following stress. Littermate knockout (KO) and WT mice were tested in locomotor activity, tail suspension (TST), and forced swim tests (FST). FST was also used to compare the effects of two antidepressants, fluoxetine and bupropion, in KO versus WT mice. Messenger RNA (mRNA) expression changes were also determined. WT mice underwent acute or chronic stress and markers of stress and PDE1B expression were examined. Pde1b KO mice exhibited decreased TST and FST immobility. When treated with antidepressants, both WT and KO mice showed decreased FST immobility and the effect was additive in KO mice. Mice lacking Pde1b had increased striatal Pde10a mRNA expression. In WT mice, acute and chronic stress upregulated PDE1B expression while PDE10A expression was downregulated after chronic but not acute stress. PDE1B is a potential therapeutic target for depression treatment because of the antidepressant-like phenotype seen in Pde1b KO mice.

  17. PDE Nozzle Optimization Using a Genetic Algorithm

    NASA Technical Reports Server (NTRS)

    Billings, Dana; Turner, James E. (Technical Monitor)

    2000-01-01

    Genetic algorithms, which simulate evolution in natural systems, have been used to find solutions to optimization problems that seem intractable to standard approaches. In this study, the feasibility of using a GA to find an optimum, fixed profile nozzle for a pulse detonation engine (PDE) is demonstrated. The objective was to maximize impulse during the detonation wave passage and blow-down phases of operation. Impulse of each profile variant was obtained by using the CFD code Mozart/2.0 to simulate the transient flow. After 7 generations, the method has identified a nozzle profile that certainly is a candidate for optimum solution. The constraints on the generality of this possible solution remain to be clarified.
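
    As a rough illustration of the kind of loop a genetic algorithm runs, the sketch below evolves a short vector of "profile" values for 7 generations, the generation count quoted above. Everything else is an assumption: `toy_impulse` is a hypothetical stand-in for the CFD evaluation (it is not Mozart/2.0), and the selection, crossover and mutation choices are generic ones rather than those of the study.

```python
import random


def toy_impulse(profile):
    """Hypothetical objective standing in for a CFD run; best when all values are 0.5."""
    return -sum((x - 0.5) ** 2 for x in profile)


def evolve(fitness, n_genes=6, pop_size=20, generations=7, seed=0):
    """Minimal elitist GA: keep the better half, refill with mutated crossovers."""
    rng = random.Random(seed)
    pop = [[rng.random() for _ in range(n_genes)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        history.append(fitness(pop[0]))          # best fitness before breeding
        parents = pop[: pop_size // 2]           # truncation selection (elitism)
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_genes)      # one-point crossover
            child = a[:cut] + b[cut:]
            i = rng.randrange(n_genes)           # mutate one gene, clamp to [0, 1]
            child[i] = min(1.0, max(0.0, child[i] + rng.gauss(0.0, 0.1)))
            children.append(child)
        pop = parents + children
    best = max(pop, key=fitness)
    history.append(fitness(best))
    return best, history


best_profile, fitness_history = evolve(toy_impulse)
print(best_profile, fitness_history[-1])
```

    Because the better half of each generation is carried over unchanged, the best fitness never decreases from one generation to the next; in a real study each fitness call would be a full transient flow simulation, which is why the number of generations is kept small.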

  18. PDE5 inhibition alleviates functional muscle ischemia in boys with Duchenne muscular dystrophy.

    PubMed

    Nelson, Michael D; Rader, Florian; Tang, Xiu; Tavyev, Jane; Nelson, Stanley F; Miceli, M Carrie; Elashoff, Robert M; Sweeney, H Lee; Victor, Ronald G

    2014-06-10

    To determine whether phosphodiesterase type 5 (PDE5) inhibition can alleviate exercise-induced skeletal muscle ischemia in boys with Duchenne muscular dystrophy (DMD). In 10 boys with DMD and 10 healthy age-matched male controls, we assessed exercise-induced attenuation of reflex sympathetic vasoconstriction, i.e., functional sympatholysis, a protective mechanism that matches oxygen delivery to metabolic demand. Reflex vasoconstriction was induced by simulated orthostatic stress, measured as the decrease in forearm muscle oxygenation with near-infrared spectroscopy, and performed when the forearm muscles were rested or lightly exercised with rhythmic handgrip exercise. Then, the patients underwent an open-label, dose-escalation, crossover trial with single oral doses of tadalafil or sildenafil. The major new findings are 2-fold: first, sympatholysis is impaired in boys with DMD, producing functional muscle ischemia, despite contemporary background therapy with corticosteroids alone or in combination with cardioprotective medication. Second, PDE5 inhibition with standard clinical doses of either tadalafil or sildenafil alleviates this ischemia in a dose-dependent manner. Furthermore, PDE5 inhibition also normalizes the exercise-induced increase in skeletal muscle blood flow (measured by Doppler ultrasound), which is markedly blunted in boys with DMD. These data provide in-human proof of concept for PDE5 inhibition as a putative new therapeutic strategy for DMD. This study provides Class IV evidence that in patients with DMD, PDE5 inhibition restores functional sympatholysis. © 2014 American Academy of Neurology.

  19. Acoustic simulation in architecture with parallel algorithm

    NASA Astrophysics Data System (ADS)

    Li, Xiaohong; Zhang, Xinrong; Li, Dan

    2004-03-01

    To address the complexity of architectural environments and the need for real-time simulation of architectural acoustics, a parallel radiosity algorithm was developed. The distribution of sound energy in a scene is solved with this method. The impulse responses between sources and receivers in each frequency band, which are calculated by multiple processes, are then combined into the whole frequency response. The numerical experiment shows that the parallel algorithm can improve the efficiency of acoustic simulation for complex scenes.

  20. Pricing index-based catastrophe bonds: Part 1: Formulation and discretization issues using a numerical PDE approach

    NASA Astrophysics Data System (ADS)

    Unger, André J. A.

    2010-02-01

    This work is the first installment in a two-part series, and focuses on the development of a numerical PDE approach to price components of a Bermudan-style callable catastrophe (CAT) bond. The bond is based on two underlying stochastic variables; the PCS index which posts quarterly estimates of industry-wide hurricane losses as well as a single-factor CIR interest rate model for the three-month LIBOR. The aggregate PCS index is analogous to losses claimed under traditional reinsurance in that it is used to specify a reinsurance layer. The proposed CAT bond model contains a Bermudan-style call feature designed to allow the reinsurer to minimize their interest rate risk exposure on making substantial fixed coupon payments using capital from the reinsurance premium. Numerical PDE methods are the fundamental strategy for pricing early-exercise constraints, such as the Bermudan-style call feature, into contingent claim models. Therefore, the objective and unique contribution of this first installment in the two-part series is to develop a formulation and discretization strategy for the proposed CAT bond model utilizing a numerical PDE approach. Object-oriented code design is fundamental to the numerical methods used to aggregate the PCS index, and implement the call feature. Therefore, object-oriented design issues that relate specifically to the development of a numerical PDE approach for the component of the proposed CAT bond model that depends on the PCS index and LIBOR are described here. Formulation, numerical methods and code design issues that relate to aggregating the PCS index and introducing the call option are the subject of the companion paper.

  1. Program For Parallel Discrete-Event Simulation

    NASA Technical Reports Server (NTRS)

    Beckman, Brian C.; Blume, Leo R.; Geiselman, John S.; Presley, Matthew T.; Wedel, John J., Jr.; Bellenot, Steven F.; Diloreto, Michael; Hontalas, Philip J.; Reiher, Peter L.; Weiland, Frederick P.

    1991-01-01

    User does not have to add any special logic to aid in synchronization. Time Warp Operating System (TWOS) computer program is special-purpose operating system designed to support parallel discrete-event simulation. Complete implementation of Time Warp mechanism. Supports only simulations and other computations designed for virtual time. Time Warp Simulator (TWSIM) subdirectory contains sequential simulation engine interface-compatible with TWOS. TWOS and TWSIM written in, and support simulations in, C programming language.

  2. Visualization and Tracking of Parallel CFD Simulations

    NASA Technical Reports Server (NTRS)

    Vaziri, Arsi; Kremenetsky, Mark

    1995-01-01

    We describe a system for interactive visualization and tracking of a 3-D unsteady computational fluid dynamics (CFD) simulation on a parallel computer. CM/AVS, a distributed, parallel implementation of a visualization environment (AVS) runs on the CM-5 parallel supercomputer. A CFD solver is run as a CM/AVS module on the CM-5. Data communication between the solver, other parallel visualization modules, and a graphics workstation, which is running AVS, is handled by CM/AVS. Partitioning of the visualization task, between CM-5 and the workstation, can be done interactively in the visual programming environment provided by AVS. Flow solver parameters can also be altered by programmable interactive widgets. This system partially removes the requirement of storing large solution files at frequent time steps, a characteristic of the traditional 'simulate → store → visualize' post-processing approach.

  3. Comparison of the Pharmacological Profiles of Selective PDE4B and PDE4D Inhibitors in the Central Nervous System

    PubMed Central

    Zhang, Chong; Xu, Ying; Zhang, Han-Ting; Gurney, Mark E.; O’Donnell, James M.

    2017-01-01

    Inhibition of cyclic AMP (cAMP)-specific phosphodiesterase 4 (PDE4) has been proposed as a potential treatment for a series of neuropsychological conditions such as depression, anxiety and memory loss. However, the specific involvement of each of the PDE4 subtypes (PDE4A, 4B and 4C) in different categories of behavior has yet to be elucidated. In the present study, we compared the possible pharmacological effects of PDE4B and PDE4D selective inhibitors, A-33 and D159687, in mediating neurological function in mice. Both compounds were equally potent in stimulating cAMP signaling in the mouse hippocampal cell line HT-22 leading to an increase in CREB phosphorylation. In contrast, A-33 and D159687 displayed distinct neuropharmacological effects in mouse behavioral tests. A-33 has an antidepressant-like profile as indicated by reduced immobility time in the forced swim and tail suspension tasks, as well as reduced latency to feed in the novelty suppressed feeding test. D159687, on the other hand, had a procognitive profile as it improved memory in the novel object recognition test but had no antidepressant or anxiolytic benefit. The present data suggests that inhibitors targeting specific subtypes of PDE4 may exhibit differential pharmacological effects and aid a more efficient pharmacotherapy towards neuropsychological conditions. PMID:28054669

  4. cellGPU: Massively parallel simulations of dynamic vertex models

    NASA Astrophysics Data System (ADS)

    Sussman, Daniel M.

    2017-10-01

    Vertex models represent confluent tissue by polygonal or polyhedral tilings of space, with the individual cells interacting via force laws that depend on both the geometry of the cells and the topology of the tessellation. This dependence on the connectivity of the cellular network introduces several complications to performing molecular-dynamics-like simulations of vertex models, and in particular makes parallelizing the simulations difficult. cellGPU addresses this difficulty and lays the foundation for massively parallelized, GPU-based simulations of these models. This article discusses its implementation for a pair of two-dimensional models, and compares the typical performance that can be expected between running cellGPU entirely on the CPU versus its performance when running on a range of commercial and server-grade graphics cards. By implementing the calculation of topological changes and forces on cells in a highly parallelizable fashion, cellGPU enables researchers to simulate time- and length-scales previously inaccessible via existing single-threaded CPU implementations. Program Files doi:http://dx.doi.org/10.17632/6j2cj29t3r.1 Licensing provisions: MIT Programming language: CUDA/C++ Nature of problem: Simulations of off-lattice "vertex models" of cells, in which the interaction forces depend on both the geometry and the topology of the cellular aggregate. Solution method: Highly parallelized GPU-accelerated dynamical simulations in which the force calculations and the topological features can be handled on either the CPU or GPU. Additional comments: The code is hosted at https://gitlab.com/dmsussman/cellGPU, with documentation additionally maintained at http://dmsussman.gitlab.io/cellGPUdocumentation

  5. Optimized Hypervisor Scheduler for Parallel Discrete Event Simulations on Virtual Machine Platforms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yoginath, Srikanth B; Perumalla, Kalyan S

    2013-01-01

    With the advent of virtual machine (VM)-based platforms for parallel computing, it is now possible to execute parallel discrete event simulations (PDES) over multiple virtual machines, in contrast to executing in native mode directly over hardware as has traditionally been done over the past decades. While mature VM-based parallel systems now offer new, compelling benefits such as serviceability, dynamic reconfigurability and overall cost effectiveness, the runtime performance of parallel applications can be significantly affected. In particular, most VM-based platforms are optimized for general workloads, but PDES execution exhibits unique dynamics significantly different from other workloads. Here we first present results from experiments that highlight the gross deterioration of the runtime performance of VM-based PDES simulations when executed using traditional VM schedulers, quantitatively showing the bad scaling properties of the scheduler as the number of VMs is increased. The mismatch is fundamental in nature in the sense that any fairness-based VM scheduler implementation would exhibit this mismatch with PDES runs. We also present a new scheduler optimized specifically for PDES applications, and describe its design and implementation. Experimental results obtained from running PDES benchmarks (PHOLD and vehicular traffic simulations) over VMs show over an order of magnitude improvement in the run time of the PDES-optimized scheduler relative to the regular VM scheduler, with over 20× reduction in run time of simulations using up to 64 VMs. The observations and results are timely in the context of emerging systems such as cloud platforms and VM-based high performance computing installations, highlighting to the community the need for PDES-specific support, and the feasibility of significantly reducing the runtime overhead for scalable PDES on VM platforms.

  6. Parallel discrete event simulation: A shared memory approach

    NASA Technical Reports Server (NTRS)

    Reed, Daniel A.; Malony, Allen D.; Mccredie, Bradley D.

    1987-01-01

    With traditional event list techniques, evaluating a detailed discrete event simulation model can often require hours or even days of computation time. Parallel simulation mimics the interacting servers and queues of a real system by assigning each simulated entity to a processor. By eliminating the event list and maintaining only sufficient synchronization to insure causality, parallel simulation can potentially provide speedups that are linear in the number of processors. A set of shared memory experiments is presented using the Chandy-Misra distributed simulation algorithm to simulate networks of queues. Parameters include queueing network topology and routing probabilities, number of processors, and assignment of network nodes to processors. These experiments show that Chandy-Misra distributed simulation is a questionable alternative to sequential simulation of most queueing network models.

  7. A local PDE model of aggregation formation in bacterial colonies

    NASA Astrophysics Data System (ADS)

    Chavy-Waddy, Paul-Christopher; Kolokolnikov, Theodore

    2016-10-01

    We study pattern formation in a model of cyanobacteria motion recently proposed by Galante, Wisen, Bhaya and Levy. By taking a continuum limit of their model, we derive a novel fourth-order nonlinear parabolic PDE equation that governs the behaviour of the model. This PDE is $u_t = -u_{xx} - u_{xxxx} + \alpha \left( \frac{u_x u_{xx}}{u} \right)_x$. We then derive the instability thresholds for the onset of pattern formation. We also compute analytically the spatial profiles of the steady state aggregation density. These profiles are shown to be of the form $\mathrm{sech}^p$ where the exponent p is related to the parameters of the model. Full numerical simulations give a favorable comparison between the continuum and the underlying discrete system, and show that the aggregation profiles are stable above the critical threshold.
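
    For orientation, the PDE quoted in the abstract can be linearized about a constant state. The short calculation below is a direct consequence of the equation as printed, under the assumption of a uniform positive state $u_0$ and a small perturbation with wavenumber $k$; it is not a reproduction of the paper's own threshold analysis.

```latex
\begin{align*}
u_t &= -u_{xx} - u_{xxxx} + \alpha\left(\frac{u_x u_{xx}}{u}\right)_x, \\
u(x,t) &= u_0 + \varepsilon\, e^{\lambda t + i k x}, \qquad 0 < \varepsilon \ll 1, \\
\lambda(k) &= k^2 - k^4 \quad \text{(to leading order in } \varepsilon\text{)}.
\end{align*}
```

    The $\alpha$ term is quadratic in the perturbation and so does not enter the linear growth rate. Modes with $0 < |k| < 1$ grow, and the growth rate is largest, $\lambda = 1/4$, at $k^2 = 1/2$.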

  8. Suppressing correlations in massively parallel simulations of lattice models

    NASA Astrophysics Data System (ADS)

    Kelling, Jeffrey; Ódor, Géza; Gemming, Sibylle

    2017-11-01

    For lattice Monte Carlo simulations parallelization is crucial to make studies of large systems and long simulation time feasible, while sequential simulations remain the gold-standard for correlation-free dynamics. Here, various domain decomposition schemes are compared, concluding with one which delivers virtually correlation-free simulations on GPUs. Extensive simulations of the octahedron model for 2 + 1 dimensional Kardar-Parisi-Zhang surface growth, which is very sensitive to correlation in the site-selection dynamics, were performed to show self-consistency of the parallel runs and agreement with the sequential algorithm. We present a GPU implementation providing a speedup of about 30 × over a parallel CPU implementation on a single socket and at least 180 × with respect to the sequential reference.

  9. Methods of parallel computation applied on granular simulations

    NASA Astrophysics Data System (ADS)

    Martins, Gustavo H. B.; Atman, Allbens P. F.

    2017-06-01

    Every year, parallel computing becomes cheaper and more accessible. As a consequence, its applications have spread across all research areas. Granular materials are a promising area for parallel computing. To support this statement, we study the impact of parallel computing on simulations of the BNE (Brazil Nut Effect). This effect is the remarkable rise of an intruder confined in a granular medium when it is shaken vertically against gravity. By means of DEM (Discrete Element Method) simulations, we study the code performance, testing different methods to improve clock time. A comparison between serial and parallel algorithms, using OpenMP®, is also shown. The best improvement was obtained by optimizing the function that finds contacts using Verlet's cells.
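
    The contact-finding step singled out above is commonly accelerated by binning particles into cells so that only neighbouring cells are compared. The sketch below shows that general cell-list idea for equal-radius discs in 2-D; it is an illustrative assumption, not the authors' implementation, and the function name `find_contacts` and the sample coordinates are hypothetical.

```python
from collections import defaultdict
from itertools import product


def find_contacts(positions, radius):
    """Return sorted pairs (i, j), i < j, of equal-radius discs that overlap.

    Particles are binned into square cells whose side equals the contact
    distance (2 * radius), so any overlapping pair lies in the same cell or
    in one of the 8 surrounding cells.
    """
    cutoff = 2.0 * radius
    cells = defaultdict(list)
    for idx, (x, y) in enumerate(positions):
        cells[(int(x // cutoff), int(y // cutoff))].append(idx)

    contacts = set()
    for (cx, cy), members in cells.items():
        for dx, dy in product((-1, 0, 1), repeat=2):
            # .get() avoids creating empty cells while iterating.
            for j in cells.get((cx + dx, cy + dy), ()):
                for i in members:
                    if i < j:
                        xi, yi = positions[i]
                        xj, yj = positions[j]
                        if (xi - xj) ** 2 + (yi - yj) ** 2 < cutoff ** 2:
                            contacts.add((i, j))
    return sorted(contacts)


sample_positions = [(0.0, 0.0), (0.9, 0.0), (5.0, 5.0), (5.0, 5.8), (2.0, 0.0)]
sample_contacts = find_contacts(sample_positions, radius=0.5)
print(sample_contacts)
```

    Compared with testing every pair, the cost grows roughly linearly with the number of particles when the packing density is bounded, which is why this routine is a natural target for optimization.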

  10. A substrate selectivity and inhibitor design lesson from the PDE10-cAMP crystal structure: a computational study.

    PubMed

    Lau, Justin Kai-Chi; Li, Xiao-Bo; Cheng, Yuen-Kit

    2010-04-22

    Phosphodiesterases (PDEs) catalyze the hydrolysis of second messengers cAMP and cGMP in regulating many important cellular signals and have been recognized as important drug targets. Experimentally, a range of specificity/selectivity toward cAMP and cGMP is well-known for the individual PDE families. The study reported here reveals that PDEs might also exhibit selectivity toward conformations of the endogenous substrates cAMP and cGMP. Molecular dynamics simulations and free energy study have been applied to study the binding of the cAMP torsional conformers about the glycosyl bond in PDE10A2. The computational results elucidated that PDE10A2 is energetically more favorable in complex with the syn cAMP conformer (as reported in the crystal structure) and the binding of anti cAMP to PDE10A2 would lead to either a nonreactive configuration or significant perturbation on the catalytic pocket of the enzyme. This experimentally inaccessible information provides important molecular insights for the development of effective PDE10 ligands.

  11. Estimating the magnitude of near-membrane PDE4 activity in living cells.

    PubMed

    Xin, Wenkuan; Feinstein, Wei P; Britain, Andrea L; Ochoa, Cristhiaan D; Zhu, Bing; Richter, Wito; Leavesley, Silas J; Rich, Thomas C

    2015-09-15

    Recent studies have demonstrated that functionally discrete pools of phosphodiesterase (PDE) activity regulate distinct cellular functions. While the importance of localized pools of enzyme activity has become apparent, few studies have estimated enzyme activity within discrete subcellular compartments. Here we present an approach to estimate near-membrane PDE activity. First, total PDE activity is measured using traditional PDE activity assays. Second, known cAMP concentrations are dialyzed into single cells and the spatial spread of cAMP is monitored using cyclic nucleotide-gated channels. Third, mathematical models are used to estimate the spatial distribution of PDE activity within cells. Using this three-tiered approach, we observed two pharmacologically distinct pools of PDE activity, a rolipram-sensitive pool and an 8-methoxymethyl IBMX (8MM-IBMX)-sensitive pool. We observed that the rolipram-sensitive PDE (PDE4) was primarily responsible for cAMP hydrolysis near the plasma membrane. Finally, we observed that PDE4 was capable of blunting cAMP levels near the plasma membrane even when 100 μM cAMP were introduced into the cell via a patch pipette. Two compartment models predict that PDE activity near the plasma membrane, near cyclic nucleotide-gated channels, was significantly lower than total cellular PDE activity and that a slow spatial spread of cAMP allowed PDE activity to effectively hydrolyze near-membrane cAMP. These results imply that cAMP levels near the plasma membrane are distinct from those in other subcellular compartments; PDE activity is not uniform within cells; and localized pools of AC and PDE activities are responsible for controlling cAMP levels within distinct subcellular compartments. Copyright © 2015 the American Physiological Society.

  12. Estimating the magnitude of near-membrane PDE4 activity in living cells

    PubMed Central

    Xin, Wenkuan; Feinstein, Wei P.; Britain, Andrea L.; Ochoa, Cristhiaan D.; Zhu, Bing; Richter, Wito; Leavesley, Silas J.

    2015-01-01

    Recent studies have demonstrated that functionally discrete pools of phosphodiesterase (PDE) activity regulate distinct cellular functions. While the importance of localized pools of enzyme activity has become apparent, few studies have estimated enzyme activity within discrete subcellular compartments. Here we present an approach to estimate near-membrane PDE activity. First, total PDE activity is measured using traditional PDE activity assays. Second, known cAMP concentrations are dialyzed into single cells and the spatial spread of cAMP is monitored using cyclic nucleotide-gated channels. Third, mathematical models are used to estimate the spatial distribution of PDE activity within cells. Using this three-tiered approach, we observed two pharmacologically distinct pools of PDE activity, a rolipram-sensitive pool and an 8-methoxymethyl IBMX (8MM-IBMX)-sensitive pool. We observed that the rolipram-sensitive PDE (PDE4) was primarily responsible for cAMP hydrolysis near the plasma membrane. Finally, we observed that PDE4 was capable of blunting cAMP levels near the plasma membrane even when 100 μM cAMP were introduced into the cell via a patch pipette. Two compartment models predict that PDE activity near the plasma membrane, near cyclic nucleotide-gated channels, was significantly lower than total cellular PDE activity and that a slow spatial spread of cAMP allowed PDE activity to effectively hydrolyze near-membrane cAMP. These results imply that cAMP levels near the plasma membrane are distinct from those in other subcellular compartments; PDE activity is not uniform within cells; and localized pools of AC and PDE activities are responsible for controlling cAMP levels within distinct subcellular compartments. PMID:26201952

  13. Parallel Discrete Molecular Dynamics Simulation With Speculation and In-Order Commitment.

    PubMed

    Khan, Md Ashfaquzzaman; Herbordt, Martin C

    2011-07-20

    Discrete molecular dynamics simulation (DMD) uses simplified and discretized models enabling simulations to advance by event rather than by timestep. DMD is an instance of discrete event simulation and so is difficult to scale: even in this multi-core era, all reported DMD codes are serial. In this paper we discuss the inherent difficulties of scaling DMD and present our method of parallelizing DMD through event-based decomposition. Our method is microarchitecture inspired: speculative processing of events exposes parallelism, while in-order commitment ensures correctness. We analyze the potential of this parallelization method for shared-memory multiprocessors. Achieving scalability required extensive experimentation with scheduling and synchronization methods to mitigate serialization. The speed-up achieved for a variety of system sizes and complexities is nearly 6× on an 8-core and over 9× on a 12-core processor. We present and verify analytical models that account for the achieved performance as a function of available concurrency and architectural limitations.

  14. Xyce parallel electronic simulator users guide, version 6.1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R; Mei, Ting; Russo, Thomas V.

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase -- a message passing parallel implementation -- which allows it to run efficiently on a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows.

  15. Xyce Parallel Electronic Simulator Users' Guide Version 6.8

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Aadithya, Karthik Venkatraman; Mei, Ting

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase -- a message passing parallel implementation -- which allows it to run efficiently on a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows.

  16. Xyce parallel electronic simulator users guide, version 6.0.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R; Mei, Ting; Russo, Thomas V.

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase -- a message passing parallel implementation -- which allows it to run efficiently on a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows.

  17. A direct-execution parallel architecture for the Advanced Continuous Simulation Language (ACSL)

    NASA Technical Reports Server (NTRS)

    Carroll, Chester C.; Owen, Jeffrey E.

    1988-01-01

    A direct-execution parallel architecture for the Advanced Continuous Simulation Language (ACSL) is presented which overcomes the traditional disadvantages of simulations executed on a digital computer. The incorporation of parallel processing allows the mapping of simulations into a digital computer to be done in the same inherently parallel manner as they are currently mapped onto an analog computer. The direct-execution format maximizes the efficiency of the executed code since the need for a high level language compiler is eliminated. Resolution is greatly increased over that which is available with an analog computer without the sacrifice in execution speed normally expected with digital computer simulations. Although this report covers all aspects of the new architecture, key emphasis is placed on the processing element configuration and the microprogramming of the ACSL constructs. The execution times for all ACSL constructs are computed using a model of a processing element based on the AMD 29000 CPU and the AMD 29027 FPU. The increase in execution speed provided by parallel processing is exemplified by comparing the derived execution times of two ACSL programs with the execution times for the same programs executed on a similar sequential architecture.

  18. AKAP3 Selectively Binds PDE4A Isoforms in Bovine Spermatozoa

    PubMed Central

    Bajpai, Malini; Fiedler, Sarah E.; Huang, Zaohua; Vijayaraghavan, Srinivasan; Olson, Gary E.; Livera, Gabriel; Conti, Marco; Carr, Daniel W.

    2006-01-01

    Cyclic AMP plays an important role in regulating sperm motility and acrosome reaction through activation of cAMP-dependent protein kinase A (PKA). Phosphodiesterases (PDEs) modulate the levels of cyclic nucleotides by catalyzing their degradation. Although PDE inhibitors specific to PDE1 and PDE4 are known to alter sperm motility and capacitation in humans, little is known about the role or subcellular distribution of PDEs in spermatozoa. The localization of PKA is regulated by A-kinase anchoring proteins (AKAPs), which may also control the intracellular distribution of PDE. The present study was undertaken to investigate the role and localization of PDE4 during sperm capacitation. Addition of Rolipram or RS25344, PDE4-specific inhibitors, significantly increased the progressive motility of bovine spermatozoa. Immunolocalization techniques detected both PDE4A and AKAP3 (formerly known as AKAP110) in the principal piece of bovine spermatozoa. The PDE4A5 isoform was detected primarily in the Triton X-100-soluble fraction of caudal epididymal spermatozoa. However, in ejaculated spermatozoa it was seen primarily in the SDS-soluble fraction, indicating a shift in PDE4A5 localization into insoluble organelles during sperm capacitation. AKAP3 was detected only in the SDS-soluble fraction of both caudal and ejaculated sperm. Immunoprecipitation experiments using COS cells cotransfected with AKAP3 and either Pde4a5 or Pde4d provide evidence that PDE4A5 but not PDE4D interacts with AKAP3. Pulldown assays using sperm cell lysates confirm this interaction in vitro. These data suggest that AKAP3 binds both PKA and PDE4A and functions as a scaffolding protein in spermatozoa to regulate local cAMP concentrations and modulate sperm functions. PMID:16177223

  19. GENESIS: a hybrid-parallel and multi-scale molecular dynamics simulator with enhanced sampling algorithms for biomolecular and cellular simulations

    PubMed Central

    Jung, Jaewoon; Mori, Takaharu; Kobayashi, Chigusa; Matsunaga, Yasuhiro; Yoda, Takao; Feig, Michael; Sugita, Yuji

    2015-01-01

    GENESIS (Generalized-Ensemble Simulation System) is a new software package for molecular dynamics (MD) simulations of macromolecules. It has two MD simulators, called ATDYN and SPDYN. ATDYN is parallelized based on an atomic decomposition algorithm for the simulations of all-atom force-field models as well as coarse-grained Go-like models. SPDYN is highly parallelized based on a domain decomposition scheme, allowing large-scale MD simulations on supercomputers. Hybrid schemes combining OpenMP and MPI are used in both simulators to target modern multicore computer architectures. Key advantages of GENESIS are (1) the highly parallel performance of SPDYN for very large biological systems consisting of more than one million atoms and (2) the availability of various REMD algorithms (T-REMD, REUS, multi-dimensional REMD for both all-atom and Go-like models under the NVT, NPT, NPAT, and NPγT ensembles). The former is achieved by a combination of the midpoint cell method and the efficient three-dimensional Fast Fourier Transform algorithm, where the domain decomposition space is shared in real-space and reciprocal-space calculations. Other features in SPDYN, such as avoiding concurrent memory access, reducing communication times, and usage of parallel input/output files, also contribute to the performance. We show the REMD simulation results of a mixed (POPC/DMPC) lipid bilayer as a real application using GENESIS. GENESIS is released as free software under the GPLv2 licence and can be easily modified for the development of new algorithms and molecular models. WIREs Comput Mol Sci 2015, 5:310–323. doi: 10.1002/wcms.1220 PMID:26753008

  20. GENESIS: a hybrid-parallel and multi-scale molecular dynamics simulator with enhanced sampling algorithms for biomolecular and cellular simulations.

    PubMed

    Jung, Jaewoon; Mori, Takaharu; Kobayashi, Chigusa; Matsunaga, Yasuhiro; Yoda, Takao; Feig, Michael; Sugita, Yuji

    2015-07-01

    GENESIS (Generalized-Ensemble Simulation System) is a new software package for molecular dynamics (MD) simulations of macromolecules. It has two MD simulators, called ATDYN and SPDYN. ATDYN is parallelized based on an atomic decomposition algorithm for the simulations of all-atom force-field models as well as coarse-grained Go-like models. SPDYN is highly parallelized based on a domain decomposition scheme, allowing large-scale MD simulations on supercomputers. Hybrid schemes combining OpenMP and MPI are used in both simulators to target modern multicore computer architectures. Key advantages of GENESIS are (1) the highly parallel performance of SPDYN for very large biological systems consisting of more than one million atoms and (2) the availability of various REMD algorithms (T-REMD, REUS, multi-dimensional REMD for both all-atom and Go-like models under the NVT, NPT, NPAT, and NPγT ensembles). The former is achieved by a combination of the midpoint cell method and the efficient three-dimensional Fast Fourier Transform algorithm, where the domain decomposition space is shared in real-space and reciprocal-space calculations. Other features in SPDYN, such as avoiding concurrent memory access, reducing communication times, and usage of parallel input/output files, also contribute to the performance. We show the REMD simulation results of a mixed (POPC/DMPC) lipid bilayer as a real application using GENESIS. GENESIS is released as free software under the GPLv2 licence and can be easily modified for the development of new algorithms and molecular models. WIREs Comput Mol Sci 2015, 5:310-323. doi: 10.1002/wcms.1220.

  1. [A study of PDE6B gene mutation and phenotype in Chinese cases with retinitis pigmentosa].

    PubMed

    Cui, Yun; Zhao, Kan-xing; Wang, Li; Wang, Qing; Zhang, Wei; Chen, Wei-ying; Wang, Li-ming

    2003-01-01

    To identify the mutation spectrum of the phosphodiesterase beta subunit (PDE6B) gene, its incidence in Chinese patients with retinitis pigmentosa (RP), and the associated clinical phenotypic characteristics. Screening of mutations within the PDE6B gene was performed using polymerase chain reaction-heteroduplex-single strand conformation polymorphism (PCR-SSCP) and DNA sequencing in 35 autosomal recessive (AR) RP and 55 sporadic RP cases. The phenotypes of the patients with the gene mutation were examined and analyzed. Novel complex heterozygous variants of the PDE6B gene were found in a sporadic case: a T to C transversion in codon 323 resulting in the substitution of Gly by Ser, and a 2 base pair (bp: G and T) insertion between the 27th and 28th bp upstream of the 5'-end of exon 10, both present in the same isolated RP case. They were not found in 100 unrelated healthy individuals. Ocular findings showed diffuse pigmentary retinal degeneration in the midperipheral and peripheral fundi, optic atrophy and vessel attenuation. Multi-focal ERG indicated that the rod function was more severely deteriorated. A mutation was found in a case with RP in an ARRP family: a G to A transversion at the 19th base upstream of the 5'-end of exon 11 (within intron 10) of the PDE6B gene. A sporadic RP case carried a sequence variant of the PDE6B gene, a G to C transition, at the 15th base adjacent to the 3'-end of exon 18. In another isolated case with RP, a 2 bp (GT) insertion was found between the 31st and 32nd bases upstream of the 5'-end of exon 4 (in intron 3) of the PDE6B gene. There are novel complex heterozygous mutations of the PDE6B gene responsible for a sporadic RP patient in China. This gene mutation is associated with rod deterioration and RP. Several DNA variants were found in introns of the PDE6B gene in the national population.

  2. Symplectic molecular dynamics simulations on specially designed parallel computers.

    PubMed

    Borstnik, Urban; Janezic, Dusanka

    2005-01-01

    We have developed a computer program for molecular dynamics (MD) simulation that implements the Split Integration Symplectic Method (SISM) and is designed to run on specialized parallel computers. The MD integration is performed by the SISM, which analytically treats high-frequency vibrational motion and thus enables the use of longer simulation time steps. The low-frequency motion is treated numerically on specially designed parallel computers, which decreases the computational time of each simulation time step. The combination of these approaches means that less time is required and fewer steps are needed and so enables fast MD simulations. We study the computational performance of MD simulation of molecular systems on specialized computers and provide a comparison to standard personal computers. The combination of the SISM with two specialized parallel computers is an effective way to increase the speed of MD simulations up to 16-fold over a single PC processor.

  3. Cigarette Smoke Upregulates PDE3 and PDE4 to Decrease cAMP in Airway Cells.

    PubMed

    Zuo, Haoxiao; Han, Bing; Poppinga, Wilfred J; Ringnalda, Lennard; Kistemaker, Loes E M; Halayko, Andrew J; Gosens, Reinoud; Nikolaev, Viacheslav O; Schmidt, Martina

    2018-05-03

    3', 5'-cyclic adenosine monophosphate (cAMP) is a central second messenger that broadly regulates cell function and can underpin pathophysiology. In chronic obstructive pulmonary disease (COPD), a lung disease primarily provoked by cigarette smoke (CS), the induction of cAMP-dependent pathways, via inhibition of hydrolyzing phosphodiesterases (PDEs), is a prime therapeutic strategy. Mechanisms that disrupt cAMP signaling in airway cells, in particular regulation of endogenous PDEs, are poorly understood. We used a novel Förster resonance energy transfer (FRET) based cAMP biosensor in mouse in vivo, ex vivo precision cut lung slices (PCLS), and in human in vitro cell models to track the effects of CS exposure. Under fenoterol stimulated conditions, FRET responses to cilostamide were significantly increased in in vivo, ex vivo PCLS exposed to CS and in human airway smooth muscle cells exposed to CS extract. FRET signals to rolipram were only increased in the in vivo CS model. Under basal conditions, FRET responses to cilostamide and rolipram were significantly increased in in vivo, ex vivo PCLS exposed to CS. Elevated FRET signals to rolipram correlated with a protein upregulation of PDE4 subtypes. In ex vivo PCLS exposed to CS extract, rolipram reversed downregulation of ciliary beating frequency, whereas only cilostamide significantly increased airway relaxation of methacholine pre-contracted airways. We show that CS upregulates expression and activity of both PDE3 and PDE4, which regulate real-time cAMP dynamics. These mechanisms determine the availability of cAMP and can contribute to CS-induced pulmonary pathophysiology.

  4. Long-time atomistic simulations with the Parallel Replica Dynamics method

    NASA Astrophysics Data System (ADS)

    Perez, Danny

    Molecular Dynamics (MD) -- the numerical integration of atomistic equations of motion -- is a workhorse of computational materials science. Indeed, MD can in principle be used to obtain any thermodynamic or kinetic quantity, without introducing any approximation or assumptions beyond the adequacy of the interaction potential. It is therefore an extremely powerful and flexible tool to study materials with atomistic spatio-temporal resolution. These enviable qualities however come at a steep computational price, hence limiting the system sizes and simulation times that can be achieved in practice. While the size limitation can be efficiently addressed with massively parallel implementations of MD based on spatial decomposition strategies, allowing for the simulation of trillions of atoms, the same approach usually cannot extend the timescales much beyond microseconds. In this article, we discuss an alternative parallel-in-time approach, the Parallel Replica Dynamics (ParRep) method, that aims at addressing the timescale limitation of MD for systems that evolve through rare state-to-state transitions. We review the formal underpinnings of the method and demonstrate that it can provide arbitrarily accurate results for any definition of the states. When an adequate definition of the states is available, ParRep can simulate trajectories with a parallel speedup approaching the number of replicas used. We demonstrate the usefulness of ParRep by presenting different examples of materials simulations where access to long timescales was essential to access the physical regime of interest and discuss practical considerations that must be addressed to carry out these simulations. Work supported by the United States Department of Energy (U.S. DOE), Office of Science, Office of Basic Energy Sciences, Materials Sciences and Engineering Division.
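
    The speedup claim in this abstract can be illustrated with a toy model. The sketch below is not the ParRep implementation; it assumes a single state whose escape time is exponentially distributed with a hypothetical rate, and it omits the dephasing and decorrelation stages of the real method. Under that assumption, the first escape among N independent replicas arrives after about 1/N of the single-replica wall-clock time, while the time summed over all replicas keeps the original escape-time statistics. The names parrep_escape and estimate are illustrative only.

```python
import random


def parrep_escape(rate, n_replicas, rng):
    """Toy parallel stage: N independent replicas of one state (assumed model)."""
    # Each replica draws its own exponentially distributed escape time.
    times = [rng.expovariate(rate) for _ in range(n_replicas)]
    wall = min(times)               # wall-clock time until the first escape
    simulated = n_replicas * wall   # simulation time accumulated by all replicas
    return wall, simulated


def estimate(rate=1.0, n_replicas=8, trials=20000, seed=1):
    """Average wall-clock and accumulated times over many toy escapes."""
    rng = random.Random(seed)
    wall_sum = 0.0
    sim_sum = 0.0
    for _ in range(trials):
        wall, simulated = parrep_escape(rate, n_replicas, rng)
        wall_sum += wall
        sim_sum += simulated
    return wall_sum / trials, sim_sum / trials


if __name__ == "__main__":
    mean_wall, mean_sim = estimate()
    print("mean wall-clock time per escape:", mean_wall)
    print("mean accumulated simulation time:", mean_sim)
```

    In this idealized setting the mean accumulated time stays near 1/rate while the mean wall-clock time shrinks to roughly 1/(N * rate), which is the sense in which the speedup approaches the number of replicas.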

  5. Xyce Parallel Electronic Simulator Users Guide Version 6.2.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Mei, Ting; Russo, Thomas V.

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase -- a message passing parallel implementation -- which allows it to run efficiently a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. Trademarks: The information herein is subject to change without notice. Copyright © 2002-2014 Sandia Corporation. All rights reserved. Xyce™ Electronic Simulator and Xyce™ are trademarks of Sandia Corporation. Portions of the Xyce™ code are: Copyright © 2002, The Regents of the University of California. Produced at the Lawrence Livermore National Laboratory. Written by Alan Hindmarsh, Allan Taylor, Radu Serban. UCRL-CODE-2002-59 All rights reserved. Orcad, Orcad Capture, PSpice and Probe are

  6. Xyce Parallel Electronic Simulator Users Guide Version 6.4

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Mei, Ting; Russo, Thomas V.

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase -- a message passing parallel implementation -- which allows it to run efficiently a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. Trademarks: The information herein is subject to change without notice. Copyright © 2002-2015 Sandia Corporation. All rights reserved. Xyce™ Electronic Simulator and Xyce™ are trademarks of Sandia Corporation. Portions of the Xyce™ code are: Copyright © 2002, The Regents of the University of California. Produced at the Lawrence Livermore National Laboratory. Written by Alan Hindmarsh, Allan Taylor, Radu Serban. UCRL-CODE-2002-59 All rights reserved. Orcad, Orcad Capture, PSpice and Probe are

  7. Xyce™ Parallel Electronic Simulator Users' Guide, Version 6.5.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Aadithya, Karthik V.; Mei, Ting

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase -- a message passing parallel implementation -- which allows it to run efficiently a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. The information herein is subject to change without notice. Copyright © 2002-2016 Sandia Corporation. All rights reserved.

  8. A parallel simulated annealing algorithm for standard cell placement on a hypercube computer

    NASA Technical Reports Server (NTRS)

    Jones, Mark Howard

    1987-01-01

    A parallel version of a simulated annealing algorithm is presented which is targeted to run on a hypercube computer. A strategy for mapping the cells in a two dimensional area of a chip onto processors in an n-dimensional hypercube is proposed such that both small and large distance moves can be applied. Two types of moves are allowed: cell exchanges and cell displacements. The computation of the cost function in parallel among all the processors in the hypercube is described along with a distributed data structure that needs to be stored in the hypercube to support parallel cost evaluation. A novel tree broadcasting strategy is used extensively in the algorithm for updating cell locations in the parallel environment. Studies on the performance of the algorithm on example industrial circuits show that it is faster and gives better final placement results than the uniprocessor simulated annealing algorithms. An improved uniprocessor algorithm is proposed which is based on the improved results obtained from parallelization of the simulated annealing algorithm.
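
    The two move types named in this abstract, cell exchanges and cell displacements, can be sketched in a serial annealer. The code below is an illustrative assumption, not the algorithm from the report: it does not reproduce the hypercube mapping, the distributed data structure, or the tree broadcasting strategy. The cost function (Manhattan wirelength over two-pin nets), the cooling schedule, and the names wirelength, apply_move and anneal are all hypothetical choices made for this example.

```python
import math
import random


def wirelength(pos, nets):
    """Sum of Manhattan distances over two-pin nets (assumed cost function)."""
    return sum(abs(pos[a][0] - pos[b][0]) + abs(pos[a][1] - pos[b][1])
               for a, b in nets)


def apply_move(pos, free, move):
    """Apply a move in place; both move types undo themselves when reapplied."""
    kind, a, b = move
    if kind == "exchange":
        pos[a], pos[b] = pos[b], pos[a]      # swap the slots of two cells
    else:
        pos[a], free[b] = free[b], pos[a]    # move cell a into an empty slot


def anneal(n_cells, nets, width, height,
           t0=5.0, cooling=0.95, sweeps=200, seed=0):
    rng = random.Random(seed)
    slots = [(x, y) for x in range(width) for y in range(height)]
    rng.shuffle(slots)
    pos = slots[:n_cells]     # pos[c] is the grid slot of cell c
    free = slots[n_cells:]    # currently empty slots
    cost = wirelength(pos, nets)
    best_pos, best_cost = list(pos), cost
    t = t0
    for _ in range(sweeps):
        for _ in range(n_cells):
            a = rng.randrange(n_cells)
            if free and rng.random() < 0.5:
                move = ("displace", a, rng.randrange(len(free)))
            else:
                move = ("exchange", a, rng.randrange(n_cells))
            apply_move(pos, free, move)
            # Full recomputation for clarity; a real placer would update incrementally.
            new_cost = wirelength(pos, nets)
            delta = new_cost - cost
            # Metropolis rule: always accept improvements, sometimes accept worse.
            if delta <= 0 or rng.random() < math.exp(-delta / t):
                cost = new_cost
                if cost < best_cost:
                    best_pos, best_cost = list(pos), cost
            else:
                apply_move(pos, free, move)  # reject: reapply to undo
        t *= cooling                         # geometric cooling per sweep
    return best_pos, best_cost


if __name__ == "__main__":
    chain = [(i, i + 1) for i in range(5)]   # six cells wired in a chain
    placement, cost = anneal(6, chain, 4, 4)
    print(placement, cost)
```

    A parallel version along the lines of the abstract would additionally have to partition the cells among processors and keep cell locations consistent between them, which this sketch does not attempt.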

  9. Petascale turbulence simulation using a highly parallel fast multipole method on GPUs

    NASA Astrophysics Data System (ADS)

    Yokota, Rio; Barba, L. A.; Narumi, Tetsu; Yasuoka, Kenji

    2013-03-01

    This paper reports large-scale direct numerical simulations of homogeneous-isotropic fluid turbulence, achieving sustained performance of 1.08 petaflop/s on GPU hardware using single precision. The simulations use a vortex particle method to solve the Navier-Stokes equations, with a highly parallel fast multipole method (FMM) as numerical engine, and match the current record in mesh size for this application, a cube of 4096^3 computational points solved with a spectral method. The standard numerical approach used in this field is the pseudo-spectral method, relying on the FFT algorithm as the numerical engine. The particle-based simulations presented in this paper quantitatively match the kinetic energy spectrum obtained with a pseudo-spectral method, using a trusted code. In terms of parallel performance, weak scaling results show the FMM-based vortex method achieving 74% parallel efficiency on 4096 processes (one GPU per MPI process, 3 GPUs per node of the TSUBAME-2.0 system). The FFT-based spectral method is able to achieve just 14% parallel efficiency on the same number of MPI processes (using only CPU cores), due to the all-to-all communication pattern of the FFT algorithm. The calculation time for one time step was 108 s for the vortex method and 154 s for the spectral method, under these conditions. Computing with 69 billion particles, this work exceeds by an order of magnitude the largest vortex-method calculations to date.

  10. Parallelization of Program to Optimize Simulated Trajectories (POST3D)

    NASA Technical Reports Server (NTRS)

    Hammond, Dana P.; Korte, John J. (Technical Monitor)

    2001-01-01

    This paper describes the parallelization of the Program to Optimize Simulated Trajectories (POST3D). POST3D uses a gradient-based optimization algorithm that reaches an optimum design point by moving from one design point to the next. The gradient calculations required to complete the optimization process dominate the computational time and have been parallelized using a Single Program Multiple Data (SPMD) approach on a distributed memory NUMA (non-uniform memory access) architecture. The Origin2000 was used for the tests presented.
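
    To illustrate why gradient evaluation parallelizes well, the Python sketch below computes each partial derivative of a central finite-difference gradient as an independent task. It is an assumption-laden stand-in, not POST3D's implementation: the objective function is a hypothetical placeholder for a trajectory simulation, and a thread pool replaces the SPMD processes of the paper.

```python
from concurrent.futures import ThreadPoolExecutor


def objective(x):
    """Hypothetical stand-in for an expensive trajectory simulation."""
    return sum((xi - i) ** 2 for i, xi in enumerate(x))


def partial_derivative(task):
    """Central-difference estimate of df/dx_i; independent of other i."""
    f, x, i, h = task
    x_plus = list(x)
    x_minus = list(x)
    x_plus[i] += h
    x_minus[i] -= h
    return (f(x_plus) - f(x_minus)) / (2.0 * h)


def parallel_gradient(f, x, h=1e-6, workers=4):
    """Farm out one task per design variable and gather the results in order."""
    tasks = [(f, x, i, h) for i in range(len(x))]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(partial_derivative, tasks))


grad = parallel_gradient(objective, [0.0, 0.0, 0.0])
```

    Each task needs only the current design point, so no communication occurs until the results are gathered. In CPython a thread pool will not speed up CPU-bound work; a process pool or message passing would be needed for that.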

  11. Expression of phosphodiesterase 6 (PDE6) in human breast cancer cells.

    PubMed

    Dong, Hongli; Claffey, Kevin P; Brocke, Stefan; Epstein, Paul M

    2013-01-01

    Considerable epidemiological evidence demonstrates a positive association between artificial light at night (LAN) levels and incidence rates of breast cancer, suggesting that exposure to LAN is a risk factor for breast cancer. There is a 30-50% higher risk of breast cancer in the highest LAN exposed countries compared to the lowest LAN countries, and studies showing higher incidence of breast cancer among shift workers exposed to more LAN have led the International Agency for Research on Cancer to classify shift work as a probable human carcinogen. Nevertheless, the means by which light can affect breast cancer is still unknown. In this study we examined established human breast cancer cell lines and patients' primary breast cancer tissues for expression of genetic components of phosphodiesterase 6 (PDE6), a cGMP-specific PDE involved in transduction of the light signal, and previously thought to be selectively expressed in photoreceptors. By microarray analysis we find highly significant expression of mRNA for the PDE6B, PDE6C, and PDE6D genes in both the cell lines and patients' tissues, minimal expression of PDE6A and PDE6G and no expression of PDE6H. Using antibody specific for PDE6β, we find expression of PDE6B protein in a wide range of patients' tissues by immunohistochemistry, and in MCF-7 breast cancer cells by immunofluorescence and Western blot analysis. Considerable expression of key circadian genes, PERIOD 2, CLOCK, TIMELESS, CRYPTOCHROME 1, and CRYPTOCHROME 2 was also seen in all breast cancer cell lines and all patients' breast cancer tissues. These studies indicate that genes for PDE6 and control of circadian rhythm are expressed in human breast cancer cells and tissues and may play a role in transducing the effects of light on breast cancer.

  12. Parallel-Processing Test Bed For Simulation Software

    NASA Technical Reports Server (NTRS)

    Blech, Richard; Cole, Gary; Townsend, Scott

    1996-01-01

    Second-generation Hypercluster computing system is multiprocessor test bed for research on parallel algorithms for simulation in fluid dynamics, electromagnetics, chemistry, and other fields with large computational requirements but relatively low input/output requirements. Built from standard, off-the-shelf hardware readily upgraded as improved technology becomes available. System used for experiments with such parallel-processing concepts as message-passing algorithms, debugging software tools, and computational steering. First-generation Hypercluster system described in "Hypercluster Parallel Processor" (LEW-15283).

  13. The high-affinity phosphodiesterase PdeH regulates development and aflatoxin biosynthesis in Aspergillus flavus.

    PubMed

    Yang, Kunlong; Liu, Yinghang; Liang, Linlin; Li, Zhenguo; Qin, Qiuping; Nie, Xinyi; Wang, Shihua

    2017-04-01

    Cyclic AMP signaling controls a range of physiological processes in response to extracellular stimuli in organisms. Among the signaling cascades, cAMP, as a second messenger, is orchestrated by adenylate cyclase (biosynthesis) and cAMP phosphodiesterases (PDEs) (hydrolysis). In this study, we investigated the function of the high-affinity (PdeH) and low-affinity (PdeL) cAMP phosphodiesterases from the carcinogenic aflatoxin-producing fungus Aspergillus flavus, and found that inactivation of PdeH, but not of PdeL, reduced conidiation and sclerotia formation. However, the ΔpdeL/ΔpdeH mutant exhibited enhanced phenotypic defects, similar to those of the wild-type strain treated with exogenous cAMP. The activation of PKA activity was inhibited in the ΔpdeH or ΔpdeL/ΔpdeH mutant, both of which exhibited increased AF production. Further analysis by qRT-PCR revealed that pdeH had a high transcriptional level compared to pdeL in the wild-type strain, and affected pdeL transcription. Green fluorescent protein tagging at the C-terminus of PDEs showed that PdeH-GFP is broadly compartmentalized in the cytosol, while PdeL-GFP localized mainly to the nucleus. Overall, our results indicated that PdeH plays a major role, but has overlapping function with PdeL, in vegetative growth, development and AF biosynthesis in A. flavus. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  14. Boundary Control of Linear Uncertain 1-D Parabolic PDE Using Approximate Dynamic Programming.

    PubMed

    Talaei, Behzad; Jagannathan, Sarangapani; Singler, John

    2018-04-01

    This paper develops a near optimal boundary control method for distributed parameter systems governed by uncertain linear 1-D parabolic partial differential equations (PDE) by using approximate dynamic programming. A quadratic surface integral is proposed to express the optimal cost functional for the infinite-dimensional state space. Accordingly, the Hamilton-Jacobi-Bellman (HJB) equation is formulated in the infinite-dimensional domain without using any model reduction. Subsequently, a neural network identifier is developed to estimate the unknown spatially varying coefficient in PDE dynamics. A novel tuning law is proposed to guarantee the boundedness of identifier approximation error in the PDE domain. A radial basis network (RBN) is subsequently proposed to generate an approximate solution for the optimal surface kernel function online. The tuning law for near optimal RBN weights is created, such that the HJB equation error is minimized while the dynamics are identified and the closed-loop system remains stable. Ultimate boundedness (UB) of the closed-loop system is verified by using the Lyapunov theory. The performance of the proposed controller is successfully confirmed by simulation on an unstable diffusion-reaction process.

  15. Longtime dynamics of the PDE model for the motion toward light of bacterial colonies

    NASA Astrophysics Data System (ADS)

    Taranets, R.; Chugunova, M.

    2018-03-01

    We study stationary solutions and longtime dynamics of the PDE model for cyanobacteria motion, which was recently proposed by Chavy-Waddy and Kolokolnikov (2016 Nonlinearity 29 3174). For different values of the parameter α, which controls the extent of the aggregate, we analyse a family of corresponding steady states and their stability (considering symmetric and non-symmetric cases separately). We derive the rate of convergence toward steady states, show existence of weak nonnegative solutions, and we also discover that the value α = 3 is a special case for this PDE model. Using numerical simulations we compare different regimes and illustrate convergence toward steady states.

  16. SPEEDES - A multiple-synchronization environment for parallel discrete-event simulation

    NASA Technical Reports Server (NTRS)

    Steinman, Jeff S.

    1992-01-01

    Synchronous Parallel Environment for Emulation and Discrete-Event Simulation (SPEEDES) is a unified parallel simulation environment. It supports multiple-synchronization protocols without requiring users to recompile their code. When a SPEEDES simulation runs on one node, all the extra parallel overhead is removed automatically at run time. When the same executable runs in parallel, the user preselects the synchronization algorithm from a list of options. SPEEDES currently runs on UNIX networks and on the California Institute of Technology/Jet Propulsion Laboratory Mark III Hypercube. SPEEDES also supports interactive simulations. Featured in the SPEEDES environment is a new parallel synchronization approach called Breathing Time Buckets. This algorithm uses some of the conservative techniques found in Time Bucket synchronization, along with the optimism that characterizes the Time Warp approach. A mathematical model derived from first principles predicts the performance of Breathing Time Buckets. Along with the Breathing Time Buckets algorithm, this paper discusses the rules for processing events in SPEEDES, describes the implementation of various other synchronization protocols supported by SPEEDES, describes some new ones for the future, discusses interactive simulations, and then gives some performance results.

  17. A hybrid parallel architecture for electrostatic interactions in the simulation of dissipative particle dynamics

    NASA Astrophysics Data System (ADS)

    Yang, Sheng-Chun; Lu, Zhong-Yuan; Qian, Hu-Jun; Wang, Yong-Lei; Han, Jie-Ping

    2017-11-01

    In this work, we upgraded the electrostatic interaction method of CU-ENUF (Yang, et al., 2016) which first applied CUNFFT (nonequispaced Fourier transforms based on CUDA) to the reciprocal-space electrostatic computation and made the computation of electrostatic interaction done thoroughly in GPU. The upgraded edition of CU-ENUF runs concurrently in a hybrid parallel way that enables the computation parallelizing on multiple computer nodes firstly, then further on the installed GPU in each computer. By this parallel strategy, the size of simulation system will be never restricted to the throughput of a single CPU or GPU. The most critical technical problem is how to parallelize a CUNFFT in the parallel strategy, which is conquered effectively by deep-seated research of basic principles and some algorithm skills. Furthermore, the upgraded method is capable of computing electrostatic interactions for both the atomistic molecular dynamics (MD) and the dissipative particle dynamics (DPD). Finally, the benchmarks conducted for validation and performance indicate that the upgraded method is able to not only present a good precision when setting suitable parameters, but also give an efficient way to compute electrostatic interactions for huge simulation systems. Program Files doi:http://dx.doi.org/10.17632/zncf24fhpv.1 Licensing provisions: GNU General Public License 3 (GPL) Programming language: C, C++, and CUDA C Supplementary material: The program is designed for effective electrostatic interactions of large-scale simulation systems, which runs on particular computers equipped with NVIDIA GPUs. It has been tested on (a) single computer node with Intel(R) Core(TM) i7-3770@ 3.40 GHz (CPU) and GTX 980 Ti (GPU), and (b) MPI parallel computer nodes with the same configurations. 
Nature of problem: For molecular dynamics simulation, the electrostatic interaction is the most time-consuming computation because of its long-range feature and slow convergence in simulation space

  18. Compartmentalized PDE4A5 Signaling Impairs Hippocampal Synaptic Plasticity and Long-Term Memory.

    PubMed

    Havekes, Robbert; Park, Alan J; Tolentino, Rosa E; Bruinenberg, Vibeke M; Tudor, Jennifer C; Lee, Yool; Hansen, Rolf T; Guercio, Leonardo A; Linton, Edward; Neves-Zaph, Susana R; Meerlo, Peter; Baillie, George S; Houslay, Miles D; Abel, Ted

    2016-08-24

    Alterations in cAMP signaling are thought to contribute to neurocognitive and neuropsychiatric disorders. Members of the cAMP-specific phosphodiesterase 4 (PDE4) family, which contains >25 different isoforms, play a key role in determining spatial cAMP degradation so as to orchestrate compartmentalized cAMP signaling in cells. Each isoform binds to a different set of protein complexes through its unique N-terminal domain, thereby leading to targeted degradation of cAMP in specific intracellular compartments. However, the functional role of specific compartmentalized PDE4 isoforms has not been examined in vivo. Here, we show that increasing protein levels of the PDE4A5 isoform in mouse hippocampal excitatory neurons impairs a long-lasting form of hippocampal synaptic plasticity and attenuates hippocampus-dependent long-term memories without affecting anxiety. In contrast, viral expression of a truncated version of PDE4A5, which lacks the unique N-terminal targeting domain, does not affect long-term memory. Further, overexpression of the PDE4A1 isoform, which targets a different subset of signalosomes, leaves memory undisturbed. Fluorescence resonance energy transfer sensor-based cAMP measurements reveal that the full-length PDE4A5, in contrast to the truncated form, hampers forskolin-mediated increases in neuronal cAMP levels. Our study indicates that the unique N-terminal localization domain of PDE4A5 is essential for the targeting of specific cAMP-dependent signaling underlying synaptic plasticity and memory. The development of compounds to disrupt the compartmentalization of individual PDE4 isoforms by targeting their unique N-terminal domains may provide a fruitful approach to prevent cognitive deficits in neuropsychiatric and neurocognitive disorders that are associated with alterations in cAMP signaling. Neurons exhibit localized signaling processes that enable biochemical cascades to be activated selectively in specific subcellular compartments. The

  19. Identification of the gamma subunit-interacting residues on photoreceptor cGMP phosphodiesterase, PDE6alpha'.

    PubMed

    Granovsky, A E; Artemyev, N O

    2000-12-29

    Photoreceptor cGMP phosphodiesterase (PDE6) is the effector enzyme in the G protein-mediated visual transduction cascade. In the dark, the activity of PDE6 is shut off by the inhibitory gamma subunit (Pgamma). Chimeric proteins between cone PDE6alpha' and cGMP-binding and cGMP-specific PDE (PDE5) have been constructed and expressed in Sf9 cells to study the mechanism of inhibition of PDE6 catalytic activity by Pgamma. Substitution of the segment PDE5-(773-820) by the corresponding PDE6alpha'-(737-784) sequence in the wild-type PDE5 or in a PDE5/PDE6alpha' chimera containing the catalytic domain of PDE5 results in chimeric enzymes capable of inhibitory interaction with Pgamma. The catalytic properties of the chimeric PDEs remained similar to those of PDE5. Ala-scanning mutational analysis of the Pgamma-binding region, PDE6alpha'-(750-760), revealed PDE6alpha' residues essential for the interaction. The M758A mutation markedly impaired and the Q752A mutation moderately impaired the inhibition of chimeric PDE by Pgamma. The analysis of the catalytic properties of mutant PDEs and a model of the PDE6 catalytic domain suggest that residues Met(758) and Gln(752) directly bind Pgamma. A model of the PDE6 catalytic site shows that PDE6alpha'-(750-760) forms a loop at the entrance to the cGMP-binding pocket. Binding of Pgamma to Met(758) would effectively block access of cGMP to the catalytic cavity, providing a structural basis for the mechanism of PDE6 inhibition.

  20. Parallel Discrete Molecular Dynamics Simulation With Speculation and In-Order Commitment

    PubMed Central

    Khan, Md. Ashfaquzzaman; Herbordt, Martin C.

    2011-01-01

    Discrete molecular dynamics simulation (DMD) uses simplified and discretized models enabling simulations to advance by event rather than by timestep. DMD is an instance of discrete event simulation and so is difficult to scale: even in this multi-core era, all reported DMD codes are serial. In this paper we discuss the inherent difficulties of scaling DMD and present our method of parallelizing DMD through event-based decomposition. Our method is microarchitecture inspired: speculative processing of events exposes parallelism, while in-order commitment ensures correctness. We analyze the potential of this parallelization method for shared-memory multiprocessors. Achieving scalability required extensive experimentation with scheduling and synchronization methods to mitigate serialization. The speed-up achieved for a variety of system sizes and complexities is nearly 6× on an 8-core and over 9× on a 12-core processor. We present and verify analytical models that account for the achieved performance as a function of available concurrency and architectural limitations. PMID:21822327

  1. Xyce parallel electronic simulator users' guide, Version 6.0.1.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R; Mei, Ting; Russo, Thomas V.

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase: a message-passing parallel implementation, which allows it to run efficiently on a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows.

  2. Investigation of Thrust Augmentation and Acoustic Performance by Ejectors on PDE

    NASA Astrophysics Data System (ADS)

    Xu, Gui-yang; Weng, Chun-sheng; Li, Ning; Huang, Xiao-long

    2016-04-01

    Thrust augmentation and acoustic performance of a Pulse Detonation Engine (PDE) with ejector system is experimentally investigated. For these tests the L_Ejector/D_Ejector is varied from 1.18 to 4 and the axial placement of the ejector relative to the PDE exhaust is varied from an x/D_PDE of -3 to 3. Results from the tests show that the optimum L_Ejector/D_Ejector based on thrust augmentation and Overall Sound Pressure Level (OASPL) is found to be 2.61. The divergent ejector performed the best based on thrust augmentation, while the reduction effect for OASPL and Peak Sound Pressure Level (PSPL) at 60° is most prominent for the convergent ejector. The optimum axial position based on thrust augmentation is determined to be x/D_PDE = 2, while it is x/D_PDE = 0 based on OASPL and PSPL.

  3. Exploring the structure determinants of pyrazinone derivatives as PDE5 3HC8 inhibitors: an in silico analysis.

    PubMed

    Li, Yan; Wu, Wenzhao; Ren, Hong; Wang, Jinghui; Zhang, Shuwei; Li, Guohui; Yang, Ling

    2012-09-01

    Phosphodiesterase type 5 (PDE5) inhibitors are clinically indicated for the treatment of erectile dysfunction, pulmonary hypertension and various other diseases. In this work, both ligand- and receptor-based three-dimensional quantitative structure-activity relationship (3D-QSAR) studies were carried out using comparative molecular field analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA) techniques on 122 pyrazinone derivatives as PDE inhibitors. The resultant optimum 3D-QSAR model exhibits a proper predictive ability as indicated by the statistical results of Q² of 0.584, R(ncv)² of 0.884 and R(pre)² of 0.817, respectively. In addition, docking analysis and molecular dynamics (MD) simulation were also applied to elucidate the probable binding modes of these inhibitors. Our main findings are: (1) Introduction of bulky, electropositive and hydrophobic substituents at 12- and 19-positions can increase the biological activities. (2) N atom at 8-position is detrimental to the inhibitor activity, and the effect of N atoms at 5- and 6-positions on compound activity is co-determined by both the hydrophobic force and the π-π stacking interaction. (3) Bulky and hydrophilic substitutions are favored at the 27-position of ring D. (4) Electronegative and hydrophilic substitutions around 5- and 6-positions increase the inhibitory activity. (5) Hydrophobic forces and π-π stacking interaction with Phe786 and Phe820 are crucial in determining the binding of pyrazinone derivatives to PDE5. (6) Bulky substitutions around ring C favor selectivity against PDE11, while bulky groups near the 21-position disfavor the selectivity. The information obtained from this work can be utilized to accurately predict the binding affinity of related analogues and also facilitate future rational designs of novel PDE5 inhibitors with improved activity and selectivity. Copyright © 2012 Elsevier Inc. All rights reserved.

  4. Immunoprecipitation of PDE2 phosphorylated and inactivated by an associated protein kinase.

    PubMed

    Bentley, J Kelley

    2005-01-01

    A PDE2A2-associated protein kinase phosphorylates PDE2A2 in vivo and in vitro to inhibit its catalytic activity. Rat brain PDE2A2 may be solubilized using nona (ethylene glycol) mono dodecyl ether (Lubrol 12A9). PDE2A2 exists in a complex with a protein kinase regulating its activity in an adenosine triphosphate-dependent manner. When native or recombinant PDE2 is immunoprecipitated from PC12 cells using an antibody to the amino terminus in a buffer containing Lubrol 12A9, protease inhibitors, and phosphatase inhibitors, a coimmunoprecipitating nerve growth factor-stimulated protein kinase acts to phosphorylate it. PDE2A2 phosphorylation occurs optimally at pH 6.5 in a sodium 2-(4-morpholino)-ethane sulfonate buffer with 5 mM MgCl2 and 1 mM Na3VO4. I describe protocols for producing an antibody to an amino-terminal bacterial fusion protein encoding amino acids 1-251 of PDE2A2 as well as the use of this antibody in immunoprecipitating a PDE2: tyrosine protein-kinase complex from rat brain or PC12 cells.

  5. A boundary PDE feedback control approach for the stabilization of mortgage price dynamics

    NASA Astrophysics Data System (ADS)

    Rigatos, G.; Siano, P.; Sarno, D.

    2017-11-01

    Several transactions taking place in financial markets are dependent on the pricing of mortgages (loans for the purchase of residences, land or farms). In this article, a method for stabilization of mortgage price dynamics is developed. It is considered that mortgage prices follow a PDE model which is equivalent to a multi-asset Black-Scholes PDE. Actually it is a diffusion process evolving in a 2D assets space, where the first asset is the house price and the second asset is the interest rate. By applying semi-discretization and a finite differences scheme this multi-asset PDE is transformed into a state-space model consisting of ordinary nonlinear differential equations. For the local subsystems, into which the mortgage PDE is decomposed, it becomes possible to apply boundary-based feedback control. The controller design proceeds by showing that the state-space model of the mortgage price PDE stands for a differentially flat system. Next, for each subsystem which is related to a nonlinear ODE, a virtual control input is computed, that can invert the subsystem's dynamics and can eliminate the subsystem's tracking error. From the last row of the state-space description, the control input (boundary condition) that is actually applied to the multi-factor mortgage price PDE system is found. This control input contains recursively all virtual control inputs which were computed for the individual ODE subsystems associated with the previous rows of the state-space equation. Thus, by tracing the rows of the state-space model backwards, at each iteration of the control algorithm, one can finally obtain the control input that should be applied to the mortgage price PDE system so as to assure that all its state variables will converge to the desirable setpoints. By showing the feasibility of such a control method it is also proven that through selected modification of the PDE boundary conditions the price of the mortgage can be made to converge and stabilize at specific

  6. Co-possession of phosphodiesterase type-5 inhibitors (PDE5-I) with nitrates.

    PubMed

    Chang, Li-Ling; Ma, Mark; Allmen, Heather von; Henderson, Scott C; Harper, Kristine; Hornbuckle, Kenneth

    2010-06-01

    Estimate the proportion of phosphodiesterase type-5 inhibitor (PDE5-I) patients who co-possess nitrates and compare the proportion of tadalafil patients dispensed nitrates to a matched control group. Secondarily, examine the percentage of co-possession of PDE5-Is and nitrates where the products were dispensed on the same day or written by the same prescriber. Male patients aged 18+ years filling PDE5-I prescriptions between December 2003 and March 2006 were identified using a U.S. longitudinal prescription database (IMS Health LRx). Similar patients not dispensed a PDE5-I during this period were matched to the tadalafil-dispensed cohort using a propensity score approach. Co-possession, as a proxy for concurrent use, was defined as an overlap in time on therapy for a PDE5-I and nitrate and was compared for the three PDE5-Is and for tadalafil to the matched control group. Among 601,063 tadalafil patients, 3.31% were dispensed a nitrate during the study period, compared to 6.18% in control patients (n = 601,063). When co-possessed prescriptions were defined by overlapping exposure periods, the proportion of PDE5-I patients with co-possessed nitrates ranged from 1.44% (tadalafil) to 1.72% (vardenafil) and 2.13% (sildenafil). Co-possession percentages of PDE5-I prescriptions were 0.83% for tadalafil and 1.07% for sildenafil and vardenafil. The majority (54.29%) of co-possessed PDE5-I and nitrate prescriptions had the nitrate dispensed prior to the PDE5-I prescription identified in the study cohort. Keeping in mind the limitations of observational studies, these results suggest that co-dispensing of nitrates and PDE5-Is is low. Compared to control patients, the proportion of nitrate co-possession was lowest for patients filling tadalafil. Tadalafil patients also had the lowest co-possessed proportion among the three PDE5-I cohorts. 
While the majority of co-possessed drug pairs were prescribed by different providers, the highest percentage of co-prescribing from the same

  7. A conservative approach to parallelizing the Sharks World simulation

    NASA Technical Reports Server (NTRS)

    Nicol, David M.; Riffe, Scott E.

    1990-01-01

    Parallelizing a benchmark problem for parallel simulation, the Sharks World, is described. The described solution is conservative, in the sense that no state information is saved, and no 'rollbacks' occur. The used approach illustrates both the principal advantage and principal disadvantage of conservative parallel simulation. The advantage is that by exploiting lookahead an approach was found that dramatically improves the serial execution time, and also achieves excellent speedups. The disadvantage is that if the model rules are changed in such a way that the lookahead is destroyed, it is difficult to modify the solution to accommodate the changes.

  8. A parallel computational model for GATE simulations.

    PubMed

    Rannou, F R; Vega-Acevedo, N; El Bitar, Z

    2013-12-01

    GATE/Geant4 Monte Carlo simulations are computationally demanding applications, requiring thousands of processor hours to produce realistic results. The classical strategy of distributing the simulation of individual events does not apply efficiently for Positron Emission Tomography (PET) experiments, because it requires a centralized coincidence processing and large communication overheads. We propose a parallel computational model for GATE that handles event generation and coincidence processing in a simple and efficient way by decentralizing event generation and processing but maintaining a centralized event and time coordinator. The model is implemented with the inclusion of a new set of factory classes that can run the same executable in sequential or parallel mode. A Mann-Whitney test shows that the output produced by this parallel model in terms of number of tallies is equivalent (but not equal) to its sequential counterpart. Computational performance evaluation shows that the software is scalable and well balanced. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  9. Is PDE4 too difficult a drug target?

    PubMed

    Higgs, Gerry

    2010-05-01

    The search for selective inhibitors of PDE4 as novel anti-inflammatory drugs has continued for more than 30 years. Although several compounds have demonstrated therapeutic effects in diseases such as asthma, COPD, atopic dermatitis and psoriasis, none have reached the market. A persistent challenge in the development of PDE4 inhibitors has been drug-induced gastrointestinal adverse effects, such as nausea. However, extensive clinical trials with well-tolerated doses of roflumilast (Daxas; Nycomed/Mitsubishi Tanabe Pharma Corp/Forest Laboratories Inc) in COPD, a disease that is generally unresponsive to existing therapies, have demonstrated significant therapeutic improvements. In addition, GlaxoSmithKline plc is developing 256066, an inhaled formulation of a PDE4 inhibitor that has demonstrated efficacy in trials in asthma, and apremilast from Celgene Corp has been reported to be effective for the treatment of psoriasis. Despite the challenges and complications that have been encountered during the development of PDE4 inhibitors, these drugs may provide a genuinely novel class of anti-inflammatory agents, and there are several compounds in development that could fulfill that promise.

  10. Parallel-In-Time For Moving Meshes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Falgout, R. D.; Manteuffel, T. A.; Southworth, B.

    2016-02-04

    With steadily growing computational resources available, scientists must develop effective ways to utilize the increased resources. High performance, highly parallel software has become a standard. However, until recent years parallelism has focused primarily on the spatial domain. When solving a space-time partial differential equation (PDE), this leads to a sequential bottleneck in the temporal dimension, particularly when taking a large number of time steps. The XBraid parallel-in-time library was developed as a practical way to add temporal parallelism to existing sequential codes with only minor modifications. In this work, a rezoning-type moving mesh is applied to a diffusion problem and formulated in a parallel-in-time framework. Tests and scaling studies are run using XBraid and demonstrate excellent results for the simple model problem considered herein.

  11. Massively Parallel Processing for Fast and Accurate Stamping Simulations

    NASA Astrophysics Data System (ADS)

    Gress, Jeffrey J.; Xu, Siguang; Joshi, Ramesh; Wang, Chuan-tao; Paul, Sabu

    2005-08-01

    The competitive automotive market drives automotive manufacturers to speed up vehicle development cycles and reduce lead time. Fast tooling development is one of the key areas supporting fast and short vehicle development programs (VDP). In the past ten years, stamping simulation has become the most effective validation tool for predicting and resolving all potential formability and quality problems before the dies are physically made. Stamping simulation and formability analysis have become a critical business segment in the GM math-based die engineering process. As simulation becomes one of the major production tools in the engineering factory, simulation speed and accuracy are two of the most important measures of stamping simulation technology. The speed and time-in-system of forming analysis become even more critical to supporting fast VDP and tooling readiness. Since 1997, General Motors Die Center has been working jointly with our software vendor to develop and implement a parallel version of simulation software for mass production analysis applications. By 2001, this technology had matured in the form of distributed memory processing (DMP) of draw die simulations in a networked distributed memory computing environment. In 2004, this technology was refined to massively parallel processing (MPP) and extended to line die forming analysis (draw, trim, flange, and associated spring-back) running on a dedicated computing environment. The evolution of this technology and the insight gained through the implementation of DMP/MPP technology as well as performance benchmarks are discussed in this publication.

  12. Mesh Algorithms for PDE with Sieve I: Mesh Distribution

    DOE PAGES

    Knepley, Matthew G.; Karpeev, Dmitry A.

    2009-01-01

    We have developed a new programming framework, called Sieve, to support parallel numerical partial differential equation(s) (PDE) algorithms operating over distributed meshes. We have also developed a reference implementation of Sieve in C++ as a library of generic algorithms operating on distributed containers conforming to the Sieve interface. Sieve makes instances of the incidence relation, or arrows, the conceptual first-class objects represented in the containers. Further, generic algorithms acting on this arrow container are systematically used to provide natural geometric operations on the topology and also, through duality, on the data. Finally, coverings and duality are used to encode not only individual meshes, but all types of hierarchies underlying PDE data structures, including multigrid and mesh partitions. In order to demonstrate the usefulness of the framework, we show how the mesh partition data can be represented and manipulated using the same fundamental mechanisms used to represent meshes. We present the complete description of an algorithm to encode a mesh partition and then distribute a mesh, which is independent of the mesh dimension, element shape, or embedding. Moreover, data associated with the mesh can be similarly distributed with exactly the same algorithm. The use of a high level of abstraction within the Sieve leads to several benefits in terms of code reuse, simplicity, and extensibility. We discuss these benefits and compare our approach to other existing mesh libraries.

  13. Xyce Parallel Electronic Simulator - Users' Guide Version 2.1.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hutchinson, Scott A; Hoekstra, Robert J.; Russo, Thomas V.

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). Note that this includes support for most popular parallel and serial computers. Improved performance for all numerical kernels (e.g., time integrator, nonlinear and linear solvers) through state-of-the-art algorithms and novel techniques. Device models which are specifically tailored to meet Sandia's needs, including many radiation-aware devices. Object-oriented code design and implementation using modern coding practices that ensure that the Xyce Parallel Electronic Simulator will be maintainable and extensible far into the future. Xyce is a parallel code in the most general sense of the phrase - a message passing parallel implementation - which allows it to run efficiently on the widest possible number of computing platforms. These include serial, shared-memory and distributed-memory parallel as well as heterogeneous platforms. Careful attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. The development of Xyce provides a platform for computational research and development aimed specifically at the needs of the Laboratory. With Xyce, Sandia has an "in-house" capability with which both new electrical (e.g., device model development) and algorithmic (e.g., faster time-integration methods, parallel solver algorithms) research and development can be performed. As a result, Xyce is a unique electrical simulation capability

  14. cAMP-specific PDE4 Phosphodiesterases and AIP in the Pathogenesis of Pituitary Tumors

    PubMed Central

    Bolger, Graeme B.; Bizzi, Mariana Ferreira; Brant Pinheiro, Sergio Veloso; Trivellin, Giampaolo; Smoot, Lisa; Accavitti, Mary-Ann; Korbonits, Márta; Ribeiro-Oliveira, Antonio

    2016-01-01

    PDE4 cyclic nucleotide phosphodiesterases regulate cAMP abundance in cells and thereby regulate numerous processes, including cell growth and differentiation. The rat PDE4A5 isoform (human homologue PDE4A4) interacts with the AIP protein (also called XAP2 or ARA-9). Germline mutations in AIP occur in approximately 20% of patients with Familial Isolated Pituitary Adenoma (FIPA) and 20% of childhood-onset simplex somatotroph adenomas. We therefore examined the protein expression of PDE4A4 and the closely-related isoform PDE4A8 in normal human pituitary tissue and in pituitary adenomas. PDE4A4 had low expression in normal pituitary, but was significantly over-expressed in somatotroph, lactotroph, corticotroph and clinically non-functioning gonadotroph adenomas (P<0.0001 for all subtypes). Likewise, PDE4A8 was expressed in normal pituitary and was also significantly over-expressed in the adenoma subtypes (P<0.0001 for all). Among the different adenoma subtypes, corticotroph and lactotroph adenomas were the highest and lowest expressed for PDE4A4, respectively, whereas the opposite was observed for PDE4A8. Naturally occurring oncogenic variants in AIP were shown by a two-hybrid assay to disrupt the ability of AIP to interact with PDE4A5. A reverse-two-hybrid screen identified numerous additional variants in the TPR region of AIP that also disrupted its ability to interact with PDE4A5. The expression of PDE4A4 and PDE4A8 in normal pituitary, their increased expression in adenomatous pituitary cells where AIP is meant to participate, and the disruption of the PDE4A4-AIP interaction by AIP mutants may play a role in pituitary tumorigenesis. PMID:27267386

  15. A derivation and scalable implementation of the synchronous parallel kinetic Monte Carlo method for simulating long-time dynamics

    NASA Astrophysics Data System (ADS)

    Byun, Hye Suk; El-Naggar, Mohamed Y.; Kalia, Rajiv K.; Nakano, Aiichiro; Vashishta, Priya

    2017-10-01

    Kinetic Monte Carlo (KMC) simulations are used to study long-time dynamics of a wide variety of systems. Unfortunately, the conventional KMC algorithm is not scalable to larger systems, since its time scale is inversely proportional to the simulated system size. A promising approach to resolving this issue is the synchronous parallel KMC (SPKMC) algorithm, which makes the time scale size-independent. This paper introduces a formal derivation of the SPKMC algorithm based on local transition-state and time-dependent Hartree approximations, as well as its scalable parallel implementation based on a dual linked-list cell method. The resulting algorithm has achieved a weak-scaling parallel efficiency of 0.935 on 1024 Intel Xeon processors for simulating biological electron transfer dynamics in a 4.2 billion-heme system, as well as decent strong-scaling parallel efficiency. The parallel code has been used to simulate a lattice of cytochrome complexes on a bacterial-membrane nanowire, and it is broadly applicable to other problems such as computational synthesis of new materials.
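
    The scaling problem mentioned in this abstract can be illustrated with a minimal sketch of one step of a conventional, serial, rejection-free KMC loop. This is a textbook-style illustration, not the SPKMC algorithm of the paper; the function names `kmc_step` and `mean_time_step` and the uniform per-site rate are assumptions made for the example.

```python
import math
import random


def kmc_step(rates, rng):
    """One step of conventional serial KMC; returns (event index, time increment)."""
    total = sum(rates)
    # Select event i with probability rates[i] / total.
    target = rng.random() * total
    cumulative = 0.0
    chosen = len(rates) - 1  # fallback guards against floating-point round-off
    for i, r in enumerate(rates):
        cumulative += r
        if target < cumulative:
            chosen = i
            break
    # Waiting time is exponential with mean 1 / total.
    dt = -math.log(1.0 - rng.random()) / total
    return chosen, dt


def mean_time_step(n_sites, rate_per_site=1.0, samples=5000, seed=0):
    """Average time increment when every site carries the same (assumed) rate."""
    rng = random.Random(seed)
    rates = [rate_per_site] * n_sites
    return sum(kmc_step(rates, rng)[1] for _ in range(samples)) / samples
```

    Because the total rate grows with the number of sites, the average time increment per event shrinks in proportion: `mean_time_step(100)` comes out roughly ten times smaller than `mean_time_step(10)`, which is the inverse dependence on system size that the abstract describes.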

  16. Xyce parallel electronic simulator design.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thornquist, Heidi K.; Rankin, Eric Lamont; Mei, Ting

    2010-09-01

    This document is the Xyce Circuit Simulator developer guide. Xyce has been designed from the 'ground up' to be a SPICE-compatible, distributed memory parallel circuit simulator. While it is in many respects a research code, Xyce is intended to be a production simulator. As such, having software quality engineering (SQE) procedures in place to ensure a high level of code quality and robustness is essential. Version control, issue tracking, customer support, C++ style guidelines and the Xyce release process are all described. The Xyce Parallel Electronic Simulator has been under development at Sandia since 1999. Historically, Xyce has mostly been funded by ASC, and the original focus of Xyce development has primarily been related to circuits for nuclear weapons. However, this has not been the only focus and it is expected that the project will diversify. Like many ASC projects, Xyce is a group development effort, which involves a number of researchers, engineers, scientists, mathematicians and computer scientists. In addition to diversity of background, it is to be expected on long term projects for there to be a certain amount of staff turnover, as people move on to different projects. As a result, it is very important that the project maintain high software quality standards. The point of this document is to formally document a number of the software quality practices followed by the Xyce team in one place. Also, it is hoped that this document will be a good source of information for new developers.

  17. Inflated speedups in parallel simulations via malloc()

    NASA Technical Reports Server (NTRS)

    Nicol, David M.

    1990-01-01

    Discrete-event simulation programs make heavy use of dynamic memory allocation in order to support simulation's very dynamic space requirements. When programming in C one is likely to use the malloc() routine. However, a parallel simulation which uses the standard Unix System V malloc() implementation may achieve an overly optimistic speedup, possibly superlinear. An alternate implementation provided on some (but not all) systems can avoid the speedup anomaly, but at the price of significantly reduced available free space. This is especially severe on most parallel architectures, which tend not to support virtual memory. It is shown how a simply implemented user-constructed interface to malloc() can both avoid artificially inflated speedups, and make efficient use of the dynamic memory space. The interface simply caches blocks on the basis of their size. The problem is demonstrated empirically, and the effectiveness of the solution is shown both empirically and analytically.
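
    The idea of a user-level layer that keeps freed blocks grouped by size can be sketched as follows. The abstract does not give the interface itself, so the class name `SizeCachedAllocator`, its methods, and the use of `bytearray` as a stand-in for malloc() are all assumptions made for illustration.

```python
from collections import defaultdict


class SizeCachedAllocator:
    """Hypothetical wrapper that recycles freed blocks by exact size."""

    def __init__(self, raw_alloc=bytearray):
        self._raw_alloc = raw_alloc      # stand-in for the system malloc()
        self._free = defaultdict(list)   # size -> stack of freed blocks
        self.raw_calls = 0               # number of calls to the underlying allocator

    def alloc(self, size):
        bucket = self._free[size]
        if bucket:
            # Reuse a cached block; contents are not cleared, as with malloc().
            return bucket.pop()
        self.raw_calls += 1
        return self._raw_alloc(size)

    def free(self, block):
        # Keep the block for later requests of the same size.
        self._free[len(block)].append(block)
```

    With such a layer, the cost of a request for a previously seen size no longer depends on the state of the underlying allocator's free list, which is one plausible way to keep allocator search time from distorting serial versus parallel timings.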

  18. Ion and Electron Energization in Guide Field Reconnection Outflows with Kinetic Riemann Simulations and Parallel Shock Simulations

    NASA Astrophysics Data System (ADS)

    Zhang, Q.; Drake, J. F.; Swisdak, M.

    2017-12-01

    How ions and electrons are energized in magnetic reconnection outflows is an essential topic throughout the heliosphere. Here we carry out guide field PIC Riemann simulations to explore the ion and electron energization mechanisms far downstream of the x-line. Riemann simulations, with their simple magnetic geometry, facilitate the study of the reconnection outflow far downstream of the x-line in much more detail than is possible with conventional reconnection simulations. We find that the ions get accelerated at rotational discontinuities, counter stream, and give rise to two slow shocks. We demonstrate that the energization mechanism at the slow shocks is essentially the same as that of parallel electrostatic shocks. Also, the electron confining electric potential at the slow shocks is driven by the counterstreaming beams, which tend to break the quasi-neutrality. Based on this picture, we build a kinetic model to self-consistently predict the downstream ion and electron temperatures. Additional explorations using parallel shock simulations also imply that in a very low beta (0.001-0.01 for a modest guide field) regime, electron energization will be insignificant compared to the ion energization. Our model and the parallel shock simulations might be used as simple tools to understand and estimate the energization of ions and electrons and the energy partition far downstream of the x-line.

  19. Relation of Parallel Discrete Event Simulation algorithms with physical models

    NASA Astrophysics Data System (ADS)

    Shchur, L. N.; Shchur, L. V.

    2015-09-01

    We extend the concept of local simulation times in parallel discrete event simulation (PDES) in order to take into account the architecture of current hardware and software in high-performance computing. We briefly review previous research on the mapping of PDES onto physical problems, and emphasise how physical results may help to predict the behaviour of parallel algorithms.

  20. Parallel algorithms for simulating continuous time Markov chains

    NASA Technical Reports Server (NTRS)

    Nicol, David M.; Heidelberger, Philip

    1992-01-01

    We have previously shown that the mathematical technique of uniformization can serve as the basis of synchronization for the parallel simulation of continuous-time Markov chains. This paper reviews the basic method and compares five different methods based on uniformization, evaluating their strengths and weaknesses as a function of problem characteristics. The methods vary in their use of optimism, logical aggregation, communication management, and adaptivity. Performance evaluation is conducted on the Intel Touchstone Delta multiprocessor, using up to 256 processors.
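
    For readers unfamiliar with uniformization, a minimal serial sketch of the basic technique follows. It is not one of the five parallel methods compared in the paper; the function name `uniformized_path`, the rate-matrix layout, and the choice of the largest exit rate as the uniformization rate are assumptions for illustration, and at least one rate is assumed to be positive.

```python
import random


def uniformized_path(rates, start, t_end, seed=0):
    """Simulate a continuous-time Markov chain by uniformization.

    rates[i][j] is the transition rate from state i to state j (i != j).
    Returns (path, pseudo): the (time, state) pairs at real jumps and the
    number of pseudo-events (self-loops) that were generated.
    """
    rng = random.Random(seed)
    n = len(rates)
    exit_rates = [sum(rates[i][j] for j in range(n) if j != i) for i in range(n)]
    lam = max(exit_rates)  # any value >= the largest exit rate would work
    t, state = 0.0, start
    path, pseudo = [(0.0, start)], 0
    while True:
        # Event times form a Poisson process whose rate ignores the state.
        t += rng.expovariate(lam)
        if t >= t_end:
            return path, pseudo
        u = rng.random() * lam
        cumulative = 0.0
        nxt = state  # stays a self-loop if u falls beyond the real rates
        for j in range(n):
            if j == state:
                continue
            cumulative += rates[state][j]
            if u < cumulative:
                nxt = j
                break
        if nxt == state:
            pseudo += 1
        else:
            state = nxt
            path.append((t, state))
```

    The point relevant to synchronization is that the candidate event times are drawn independently of the chain's state, so they can in principle be generated ahead of time; only the decision between a real jump and a pseudo-event depends on the current state.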

  1. Xyce Parallel Electronic Simulator Users' Guide Version 6.7.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Aadithya, Karthik Venkatraman; Mei, Ting

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase -- a message passing parallel implementation -- which allows it to run efficiently on a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. The information herein is subject to change without notice. Copyright (c) 2002-2017 Sandia Corporation. All rights reserved. Trademarks: Xyce TM Electronic Simulator and Xyce TM are trademarks of Sandia Corporation. Orcad, Orcad Capture, PSpice and Probe are registered trademarks of Cadence Design Systems, Inc. Microsoft, Windows and Windows 7 are registered trademarks of Microsoft Corporation. Medici, DaVinci and Taurus are registered trademarks of Synopsys Corporation. Amtec and TecPlot are trademarks

  2. ANNarchy: a code generation approach to neural simulations on parallel hardware

    PubMed Central

    Vitay, Julien; Dinkelbach, Helge Ü.; Hamker, Fred H.

    2015-01-01

    Many modern neural simulators focus on the simulation of networks of spiking neurons on parallel hardware. Another important framework in computational neuroscience, rate-coded neural networks, is mostly difficult or impossible to implement using these simulators. We present here the ANNarchy (Artificial Neural Networks architect) neural simulator, which allows users to easily define and simulate rate-coded and spiking networks, as well as combinations of both. The interface in Python has been designed to be close to the PyNN interface, while the definition of neuron and synapse models can be specified using an equation-oriented mathematical description similar to the Brian neural simulator. This information is used to generate C++ code that will efficiently perform the simulation on the chosen parallel hardware (multi-core system or graphical processing unit). Several numerical methods are available to transform ordinary differential equations into efficient C++ code. We compare the parallel performance of the simulator to existing solutions. PMID:26283957

  3. A sweep algorithm for massively parallel simulation of circuit-switched networks

    NASA Technical Reports Server (NTRS)

    Gaujal, Bruno; Greenberg, Albert G.; Nicol, David M.

    1992-01-01

    A new massively parallel algorithm is presented for simulating large asymmetric circuit-switched networks, controlled by a randomized-routing policy that includes trunk-reservation. A single instruction multiple data (SIMD) implementation is described, and corresponding experiments on a 16384-processor MasPar parallel computer are reported. A multiple instruction multiple data (MIMD) implementation is also described, and corresponding experiments on an Intel iPSC/860 parallel computer, using 16 processors, are reported. By exploiting parallelism, our algorithm increases the possible execution rate of such complex simulations by as much as an order of magnitude.

  4. Near-realtime simulations of biolelectric activity in small mammalian hearts using graphical processing units

    PubMed Central

    Vigmond, Edward J.; Boyle, Patrick M.; Leon, L. Joshua; Plank, Gernot

    2014-01-01

    Simulations of cardiac bioelectric phenomena remain a significant challenge despite continual advancements in computational machinery. Spanning large temporal and spatial ranges demands millions of nodes to accurately depict geometry, and a comparable number of timesteps to capture dynamics. This study explores a new hardware computing paradigm, the graphics processing unit (GPU), to accelerate cardiac models, and analyzes results in the context of simulating a small mammalian heart in real time. The ODEs associated with membrane ionic flow were computed on traditional CPU and compared to GPU performance, for one to four parallel processing units. The scalability of solving the PDE responsible for tissue coupling was examined on a cluster using up to 128 cores. Results indicate that the GPU implementation was between 9 and 17 times faster than the CPU implementation and scaled similarly. Solving the PDE was still 160 times slower than real time. PMID:19964295

  5. A tool for simulating parallel branch-and-bound methods

    NASA Astrophysics Data System (ADS)

    Golubeva, Yana; Orlov, Yury; Posypkin, Mikhail

    2016-01-01

    The Branch-and-Bound (B&B) method is known as one of the most powerful but very resource-consuming global optimization methods. Parallel and distributed computing can efficiently cope with this issue. The major difficulty in the parallel B&B method is the need for dynamic load redistribution. Therefore the design and study of load balancing algorithms is a separate and very important research topic. This paper presents a tool for simulating the parallel Branch-and-Bound method. The simulator allows one to run load balancing algorithms with various numbers of processors, sizes of the search tree, and characteristics of the supercomputer's interconnect, thereby fostering deep study of load distribution strategies. The process of resolution of the optimization problem by the B&B method is replaced by a stochastic branching process. Data exchanges are modeled using the concept of logical time. The user-friendly graphical interface to the simulator provides efficient visualization and convenient performance analysis.

  6. AZTEC: A parallel iterative package for the solving linear systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hutchinson, S.A.; Shadid, J.N.; Tuminaro, R.S.

    1996-12-31

    We describe a parallel linear system package, AZTEC. The package incorporates a number of parallel iterative methods (e.g. GMRES, biCGSTAB, CGS, TFQMR) and preconditioners (e.g. Jacobi, Gauss-Seidel, polynomial, domain decomposition with LU or ILU within subdomains). Additionally, AZTEC allows for the reuse of previous preconditioning factorizations within Newton schemes for nonlinear methods. Currently, a number of different users are using this package to solve a variety of PDE applications.

  7. PDE3, but not PDE4, reduces β1- and β2-adrenoceptor-mediated inotropic and lusitropic effects in failing ventricle from metoprolol-treated patients

    PubMed Central

    Molenaar, Peter; Christ, Torsten; Hussain, Rizwan I; Engel, Andreas; Berk, Emanuel; Gillette, Katherine T; Chen, Lu; Galindo-Tovar, Alejandro; Krobert, Kurt A; Ravens, Ursula; Levy, Finn Olav; Kaumann, Alberto J

    2013-01-01

    Background and Purpose: PDE3 and/or PDE4 control ventricular effects of catecholamines in several species but their relative effects in failing human ventricle are unknown. We investigated whether the PDE3-selective inhibitor cilostamide (0.3–1 μM) or PDE4 inhibitor rolipram (1–10 μM) modified the positive inotropic and lusitropic effects of catecholamines in human failing myocardium. Experimental Approach: Right and left ventricular trabeculae from freshly explanted hearts of 5 non-β-blocker-treated and 15 metoprolol-treated patients with terminal heart failure were paced to contract at 1 Hz. The effects of (-)-noradrenaline, mediated through β1 adrenoceptors (β2 adrenoceptors blocked with ICI118551), and (-)-adrenaline, mediated through β2 adrenoceptors (β1 adrenoceptors blocked with CGP20712A), were assessed in the absence and presence of PDE inhibitors. Catecholamine potencies were estimated from –logEC50s. Key Results: Cilostamide did not significantly potentiate the inotropic effects of the catecholamines in non-β-blocker-treated patients. Cilostamide caused greater potentiation (P = 0.037) of the positive inotropic effects of (-)-adrenaline (0.78 ± 0.12 log units) than (-)-noradrenaline (0.47 ± 0.12 log units) in metoprolol-treated patients. Lusitropic effects of the catecholamines were also potentiated by cilostamide. Rolipram did not affect the inotropic and lusitropic potencies of (-)-noradrenaline or (-)-adrenaline on right and left ventricular trabeculae from metoprolol-treated patients. Conclusions and Implications: Metoprolol induces a control by PDE3 of ventricular effects mediated through both β1 and β2 adrenoceptors, thereby further reducing sympathetic cardiostimulation in patients with terminal heart failure. Concurrent therapy with a PDE3 blocker and metoprolol could conceivably facilitate cardiostimulation evoked by adrenaline through β2 adrenoceptors. PDE4 does not appear to reduce inotropic and lusitropic effects of

  8. PRATHAM: Parallel Thermal Hydraulics Simulations using Advanced Mesoscopic Methods

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Joshi, Abhijit S; Jain, Prashant K; Mudrich, Jaime A

    2012-01-01

    At the Oak Ridge National Laboratory, efforts are under way to develop a 3D, parallel LBM code called PRATHAM (PaRAllel Thermal Hydraulic simulations using Advanced Mesoscopic Methods) to demonstrate the accuracy and scalability of LBM for turbulent flow simulations in nuclear applications. The code has been developed using FORTRAN-90, and parallelized using the message passing interface (MPI) library. The Silo library is used to compact and write the data files, and VisIt visualization software is used to post-process the simulation data in parallel. Both the single relaxation time (SRT) and multi relaxation time (MRT) LBM schemes have been implemented in PRATHAM. To capture turbulence without prohibitively increasing the grid resolution requirements, an LES approach [5] is adopted allowing large scale eddies to be numerically resolved while modeling the smaller (subgrid) eddies. In this work, a Smagorinsky model has been used, which modifies the fluid viscosity by an additional eddy viscosity depending on the magnitude of the rate-of-strain tensor. In LBM, this is achieved by locally varying the relaxation time of the fluid.
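
    The link between eddy viscosity and a locally varying relaxation time can be sketched in a few lines. The sketch assumes the common lattice-unit relation between viscosity and relaxation time, nu = c_s^2 (tau - 1/2) with c_s^2 = 1/3 and unit time step, and the standard Smagorinsky closure nu_t = (C * Delta)^2 |S|. The function name, the default constant 0.1, and the convention that the caller supplies |S| are assumptions; the abstract does not state the values or formulation used in PRATHAM.

```python
def effective_relaxation_time(tau0, strain_rate_mag, c_smag=0.1, delta=1.0):
    """Relaxation time after adding a Smagorinsky eddy viscosity (lattice units)."""
    cs2 = 1.0 / 3.0                                   # lattice speed of sound squared
    nu0 = cs2 * (tau0 - 0.5)                          # molecular viscosity implied by tau0
    nu_t = (c_smag * delta) ** 2 * strain_rate_mag    # eddy viscosity from |S|
    return (nu0 + nu_t) / cs2 + 0.5                   # total viscosity mapped back to tau
```

    Where the local strain rate vanishes the function returns `tau0` unchanged, and it grows with |S|, so strongly sheared regions relax more slowly toward equilibrium, which corresponds to a locally higher viscosity.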

  9. Parallel discrete event simulation using shared memory

    NASA Technical Reports Server (NTRS)

    Reed, Daniel A.; Malony, Allen D.; Mccredie, Bradley D.

    1988-01-01

    With traditional event-list techniques, evaluating a detailed discrete-event simulation-model can often require hours or even days of computation time. By eliminating the event list and maintaining only sufficient synchronization to ensure causality, parallel simulation can potentially provide speedups that are linear in the numbers of processors. A set of shared-memory experiments, using the Chandy-Misra distributed-simulation algorithm, to simulate networks of queues is presented. Parameters of the study include queueing network topology and routing probabilities, number of processors, and assignment of network nodes to processors. These experiments show that Chandy-Misra distributed simulation is a questionable alternative to sequential-simulation of most queueing network models.

  10. A cAMP-specific phosphodiesterase (PDE8B) that is mutated in adrenal hyperplasia is expressed widely in human and mouse tissues: a novel PDE8B isoform in human adrenal cortex

    PubMed Central

    Horvath, Anelia; Giatzakis, Christoforos; Tsang, Kitman; Greene, Elizabeth; Osorio, Paulo; Boikos, Sosipatros; Libè, Rossella; Patronas, Yianna; Robinson-White, Audrey; Remmers, Elaine; Bertherat, Jerôme; Nesterova, Maria; Stratakis, Constantine A.

    2009-01-01

    Bilateral adrenocortical hyperplasia (BAH) is the second most common cause of corticotropin-independent Cushing syndrome (CS). Genetic forms of BAH have been associated with complex syndromes such as Carney Complex and McCune Albright syndrome or may present as isolated micronodular adrenocortical disease (iMAD) usually in children and young adults with CS. A genome-wide association study identified inactivating phosphodiesterase (PDE) 11A (PDE11A) sequencing defects as low-penetrance predisposing factors for iMAD and related abnormalities; we also described a mutation (c.914A>C/H305P) in cAMP-specific PDE8B, in a patient with iMAD. In this study we further characterize this mutation; we also found a novel PDE8B isoform, highly expressed in the adrenal gland. This mutation is shown to significantly affect the ability of the protein to degrade cAMP in vitro. Tumor tissues from patients with iMAD and no mutations in the coding PDE8B sequence or any other related genes (PRKAR1A, PDE11A) showed down-regulated PDE8B expression (compared to normal adrenal cortex). Pde8b is detectable in the adrenal gland of newborn mice and is widely expressed in other mouse tissues. We conclude that PDE8B is another PDE gene linked to iMAD; it is a candidate causative gene for other adrenocortical lesions linked to the cAMP-signaling pathway, and possibly for tumors in other tissues. PMID:18431404

  11. Development of a Scintillation Proximity Assay (SPA) Based, High Throughput Screening Feasible Method for the Identification of PDE12 Activity Modulators.

    PubMed

    Mang, Samuel; Bucher, Hannes; Nickolaus, Peter

    2016-01-01

    The scintillation proximity assay (SPA) technology has been widely used to establish high throughput screens (HTS) for a range of targets in the pharmaceutical industry. PDE12 (also known as 2'-phosphodiesterase) has been reported to participate in the degradation of oligoadenylates that are involved in the establishment of an antiviral state via the activation of ribonuclease L (RNAse-L). Degradation of oligoadenylates by PDE12 terminates these antiviral activities, leading to decreased resistance of cells to a variety of viral pathogens. Therefore inhibitors of PDE12 are discussed as antiviral therapy. Here we describe the use of the yttrium silicate SPA bead technology to assess inhibitory activity of compounds against PDE12 in a homogeneous, robust, HTS-feasible assay using tritiated adenosine-P-adenylate ([3H]ApA) as substrate. We found that the [3H]ApA substrate was not able to bind to SPA beads, whereas the product [3H]AMP, as previously known, was able to bind to SPA beads. This enables the measurement of PDE12 activity on [3H]ApA as a substrate using a Wallac MicroBeta counter. The method provides a robust, high-throughput-capable format in terms of specificity, commonly used compound solvents, ease of detection and assay matrices. The method could facilitate the search for PDE12 inhibitors as antiviral compounds.

  12. Research on odor interaction between aldehyde compounds via a partial differential equation (PDE) model.

    PubMed

    Yan, Luchun; Liu, Jiemin; Qu, Chen; Gu, Xingye; Zhao, Xia

    2015-01-28

    In order to explore the odor interaction of binary odor mixtures, a series of odor intensity evaluation tests were performed using both individual components and binary mixtures of aldehydes. Based on the linear relation between the logarithm of odor activity value and odor intensity of individual substances, the relationship between concentrations of individual constituents and their joint odor intensity was investigated by employing a partial differential equation (PDE) model. The obtained results showed that the binary odor interaction was mainly influenced by the mixing ratio of two constituents, but not the concentration level of an odor sample. Besides, an extended PDE model was also proposed on the basis of the above experiments. Through a series of odor intensity matching tests for several different binary odor mixtures, the extended PDE model was proved effective at odor intensity prediction. Furthermore, odorants of the same chemical group and similar odor type exhibited similar characteristics in the binary odor interaction. The overall results suggested that the PDE model is a more interpretable way of demonstrating the odor interactions of binary odor mixtures.

  13. Final Report: Subcontract B623868 Algebraic Multigrid solvers for coupled PDE systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brannick, J.

    The Pennsylvania State University (“Subcontractor”) continued to work on the design of algebraic multigrid solvers for coupled systems of partial differential equations (PDEs) arising in numerical modeling of various applications, with a main focus on solving the Dirac equation arising in Quantum Chromodynamics (QCD). The goal of the proposed work was to develop combined geometric and algebraic multilevel solvers that are robust and lend themselves to efficient implementation on massively parallel heterogeneous computers for these QCD systems. The research in these areas built on previous works, focusing on the following three topics: (1) the development of parallel full-multigrid (PFMG) and non-Galerkin coarsening techniques in this framework for solving the Wilson Dirac system; (2) the use of these same Wilson MG solvers for preconditioning the Overlap and Domain Wall formulations of the Dirac equation; and (3) the design and analysis of algebraic coarsening algorithms for coupled PDE systems including Stokes equation, Maxwell equation and linear elasticity.

  14. Experimental confirmation of a PDE-based approach to design of feedback controls

    NASA Technical Reports Server (NTRS)

    Banks, H. T.; Smith, Ralph C.; Brown, D. E.; Silcox, R. J.; Metcalf, Vern L.

    1995-01-01

    Issues regarding the experimental implementation of partial differential equation based controllers are discussed in this work. While the motivating application involves the reduction of vibration levels for a circular plate through excitation of surface-mounted piezoceramic patches, the general techniques described here will extend to a variety of applications. The initial step is the development of a PDE model which accurately captures the physics of the underlying process. This model is then discretized to yield a vector-valued initial value problem. Optimal control theory is used to determine continuous-time voltages to the patches, and the approximations needed to facilitate discrete time implementation are addressed. Finally, experimental results demonstrating the control of both transient and steady state vibrations through these techniques are presented.

  15. Massively parallel algorithms for trace-driven cache simulations

    NASA Technical Reports Server (NTRS)

    Nicol, David M.; Greenberg, Albert G.; Lubachevsky, Boris D.

    1991-01-01

    Trace-driven cache simulation is central to computer design. A trace is a very long sequence of reference lines from main memory. At the t-th instant, reference x_t is hashed into a set of cache locations, the contents of which are then compared with x_t. If at the t-th instant x_t is not present in the cache, then it is said to be a miss, and is loaded into the cache set, possibly forcing the replacement of some other memory line, and making x_t present for the (t+1)-st instant. The problem of parallel simulation of a subtrace of N references directed to a C line cache set is considered, with the aim of determining which references are misses and related statistics. A simulation method is presented for the Least Recently Used (LRU) policy, which regardless of the set size C runs in time O(log N) using N processors on the exclusive read, exclusive write (EREW) parallel model. A simpler LRU simulation algorithm is given that runs in O(C log N) time using N/log N processors. Timings are presented of the second algorithm's implementation on the MasPar MP-1, a machine with 16384 processors. A broad class of reference-based line replacement policies is considered, which includes LRU as well as the Least Frequently Used and Random replacement policies. A simulation method is presented for any such policy that on any trace of length N directed to a C line set runs in O(C log N) time with high probability using N processors on the EREW model. The algorithms are simple, have very little space overhead, and are well suited for SIMD implementation.
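
    The miss-detection problem described above can be illustrated with a minimal serial sketch. This is a plain sequential reference simulation of one C-line LRU set, not the parallel EREW algorithm of the paper; the function name `lru_misses` and the toy trace are hypothetical.

```python
from collections import OrderedDict


def lru_misses(trace, c):
    """Flag each reference in `trace` as a miss (True) or hit (False)
    for a single cache set holding at most `c` lines under LRU."""
    cache = OrderedDict()  # keys kept in order: least recently used first
    misses = []
    for x in trace:
        if x in cache:
            cache.move_to_end(x)  # hit: x becomes most recently used
            misses.append(False)
        else:
            misses.append(True)
            if len(cache) >= c:
                cache.popitem(last=False)  # evict the least recently used line
            cache[x] = None  # load x into the set
    return misses


# Toy trace directed to a 3-line set (illustrative data only).
flags = lru_misses(["a", "b", "c", "a", "d", "b"], 3)
print(sum(flags), "misses out of", len(flags))
```

    A serial pass like this takes O(N) steps for N references and can serve as a correctness check for a parallel implementation.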

  16. Parallel Performance Optimizations on Unstructured Mesh-based Simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sarje, Abhinav; Song, Sukhyun; Jacobsen, Douglas

    2015-01-01

    © The Authors. Published by Elsevier B.V. This paper addresses two key parallelization challenges in the unstructured mesh-based ocean modeling code, MPAS-Ocean, which uses a mesh based on Voronoi tessellations: (1) load imbalance across processes, and (2) unstructured data access patterns that inhibit intra- and inter-node performance. Our work analyzes the load imbalance due to naive partitioning of the mesh, and develops methods to generate mesh partitioning with better load balance and reduced communication. Furthermore, we present methods that minimize both inter- and intranode data movement and maximize data reuse. Our techniques include predictive ordering of data elements for higher cache efficiency, as well as communication reduction approaches. We present detailed performance data when running on thousands of cores using the Cray XC30 supercomputer and show that our optimization strategies can exceed the original performance by over 2×. Additionally, many of these solutions can be broadly applied to a wide variety of unstructured grid-based computations.
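
    Load imbalance across processes is often summarized by the ratio of the largest per-process workload to the mean workload. The sketch below uses that common metric as an assumption; the abstract does not say which measure the authors used, and the function name and cell counts are hypothetical.

```python
def load_imbalance(work):
    """Return max(work) / mean(work); 1.0 means a perfectly balanced partition."""
    mean = sum(work) / len(work)
    return max(work) / mean


# Hypothetical mesh-cell counts per process for two partitionings.
naive = [30, 10, 10, 10]
balanced = [15, 15, 15, 15]
print(load_imbalance(naive), load_imbalance(balanced))
```

    Under this metric, the slowest process in the naive partition carries twice the average load, so every other process waits on it at each synchronization point.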

  17. Parallel performance optimizations on unstructured mesh-based simulations

    DOE PAGES

    Sarje, Abhinav; Song, Sukhyun; Jacobsen, Douglas; ...

    2015-06-01

    This paper addresses two key parallelization challenges in the unstructured mesh-based ocean modeling code, MPAS-Ocean, which uses a mesh based on Voronoi tessellations: (1) load imbalance across processes, and (2) unstructured data access patterns that inhibit intra- and inter-node performance. Our work analyzes the load imbalance due to naive partitioning of the mesh, and develops methods to generate mesh partitioning with better load balance and reduced communication. Furthermore, we present methods that minimize both inter- and intranode data movement and maximize data reuse. Our techniques include predictive ordering of data elements for higher cache efficiency, as well as communication reduction approaches. We present detailed performance data when running on thousands of cores using the Cray XC30 supercomputer and show that our optimization strategies can exceed the original performance by over 2×. Additionally, many of these solutions can be broadly applied to a wide variety of unstructured grid-based computations.

  18. Adaptive wavelet collocation methods for initial value boundary problems of nonlinear PDE's

    NASA Technical Reports Server (NTRS)

    Cai, Wei; Wang, Jian-Zhong

    1993-01-01

    We have designed a cubic spline wavelet decomposition for the Sobolev space H^2_0(I) where I is a bounded interval. Based on a special 'point-wise orthogonality' of the wavelet basis functions, a fast Discrete Wavelet Transform (DWT) is constructed. This DWT transform will map discrete samples of a function to its wavelet expansion coefficients in O(N log N) operations. Using this transform, we propose a collocation method for the initial value boundary problem of nonlinear PDE's. Then, we test the efficiency of the DWT transform and apply the collocation method to solve linear and nonlinear PDE's.

  19. A Parallel, Finite-Volume Algorithm for Large-Eddy Simulation of Turbulent Flows

    NASA Technical Reports Server (NTRS)

    Bui, Trong T.

    1999-01-01

    A parallel, finite-volume algorithm has been developed for large-eddy simulation (LES) of compressible turbulent flows. This algorithm includes piecewise linear least-square reconstruction, trilinear finite-element interpolation, Roe flux-difference splitting, and second-order MacCormack time marching. Parallel implementation is done using the message-passing programming model. In this paper, the numerical algorithm is described. To validate the numerical method for turbulence simulation, LES of fully developed turbulent flow in a square duct is performed for a Reynolds number of 320 based on the average friction velocity and the hydraulic diameter of the duct. Direct numerical simulation (DNS) results are available for this test case, and the accuracy of this algorithm for turbulence simulations can be ascertained by comparing the LES solutions with the DNS results. The effects of grid resolution, upwind numerical dissipation, and subgrid-scale dissipation on the accuracy of the LES are examined. Comparison with DNS results shows that the standard Roe flux-difference splitting dissipation adversely affects the accuracy of the turbulence simulation. For accurate turbulence simulations, only 3-5 percent of the standard Roe flux-difference splitting dissipation is needed.
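
    The second-order MacCormack time marching named above is a predictor-corrector scheme. The sketch below applies it to the 1D linear advection equation u_t + a u_x = 0 on a periodic grid, which is a deliberately simplified stand-in for the compressible-flow solver of the paper; the function name and the Courant number `c = a*dt/dx` are illustrative assumptions.

```python
def maccormack_step(u, c):
    """Advance periodic samples `u` one time step for u_t + a u_x = 0,
    where c = a*dt/dx is the Courant number (stable for |c| <= 1)."""
    n = len(u)
    # Predictor: forward difference in space.
    pred = [u[i] - c * (u[(i + 1) % n] - u[i]) for i in range(n)]
    # Corrector: backward difference on the predicted values, then average.
    # pred[i - 1] wraps around at i = 0, which gives the periodic boundary.
    return [0.5 * (u[i] + pred[i] - c * (pred[i] - pred[i - 1])) for i in range(n)]


# With c = 1 the scheme shifts the profile by exactly one cell per step.
u0 = [0.0, 1.0, 0.0, 0.0]
u1 = maccormack_step(u0, 1.0)
print(u1)
```

    For linear advection this scheme coincides with Lax-Wendroff, which is why the unit-Courant-number case reproduces the exact shift.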

  20. The cost of conservative synchronization in parallel discrete event simulations

    NASA Technical Reports Server (NTRS)

    Nicol, David M.

    1990-01-01

    The performance of a synchronous conservative parallel discrete-event simulation protocol is analyzed. The class of simulation models considered is oriented around a physical domain and possesses a limited ability to predict future behavior. A stochastic model is used to show that as the volume of simulation activity in the model increases relative to a fixed architecture, the complexity of the average per-event overhead due to synchronization, event list manipulation, lookahead calculations, and processor idle time approaches the complexity of the average per-event overhead of a serial simulation. The method is therefore within a constant factor of optimal. The analysis demonstrates that on large problems--those for which parallel processing is ideally suited--there is often enough parallel workload so that processors are not usually idle. The viability of the method is also demonstrated empirically, showing how good performance is achieved on large problems using a thirty-two node Intel iPSC/2 distributed memory multiprocessor.

  1. Combination of 2D/3D ligand-based similarity search in rapid virtual screening from multimillion compound repositories. Selection and biological evaluation of potential PDE4 and PDE5 inhibitors.

    PubMed

    Dobi, Krisztina; Hajdú, István; Flachner, Beáta; Fabó, Gabriella; Szaszkó, Mária; Bognár, Melinda; Magyar, Csaba; Simon, István; Szisz, Dániel; Lőrincz, Zsolt; Cseh, Sándor; Dormán, György

    2014-05-28

    Rapid in silico selection of target focused libraries from commercial repositories is an attractive and cost effective approach. If structures of active compounds are available, rapid 2D similarity search can be performed on multimillion compound databases but the generated library requires further focusing by various 2D/3D chemoinformatics tools. We report here a combination of the 2D approach with a ligand-based 3D method (Screen3D) which applies flexible matching to align reference and target compounds in a dynamic manner and thus to assess their structural and conformational similarity. In the first case study we compared the 2D and 3D similarity scores on an existing dataset derived from the biological evaluation of a PDE5 focused library. Based on the obtained similarity metrics a fusion score was proposed. The fusion score was applied to refine the 2D similarity search in a second case study where we aimed at selecting and evaluating a PDE4B focused library. The application of this fused 2D/3D similarity measure led to an increase of the hit rate from 8.5% (1st round, 47% inhibition at 10 µM) to 28.5% (2nd round at 50% inhibition at 10 µM) and the best two hits had 53 nM inhibitory activities.

  2. A novel thermoregulatory role for PDE10A in mouse and human adipocytes.

    PubMed

    Hankir, Mohammed K; Kranz, Mathias; Gnad, Thorsten; Weiner, Juliane; Wagner, Sally; Deuther-Conrad, Winnie; Bronisch, Felix; Steinhoff, Karen; Luthardt, Julia; Klöting, Nora; Hesse, Swen; Seibyl, John P; Sabri, Osama; Heiker, John T; Blüher, Matthias; Pfeifer, Alexander; Brust, Peter; Fenske, Wiebke K

    2016-07-01

    Phosphodiesterase type 10A (PDE10A) is highly enriched in striatum and is under evaluation as a drug target for several psychiatric/neurodegenerative diseases. Preclinical studies implicate PDE10A in the regulation of energy homeostasis, but the mechanisms remain unclear. By utilizing small-animal PET/MRI and the novel radioligand [(18)F]-AQ28A, we found marked levels of PDE10A in interscapular brown adipose tissue (BAT) of mice. Pharmacological inactivation of PDE10A with the highly selective inhibitor MP-10 recruited BAT and potentiated thermogenesis in vivo. In diet-induced obese mice, chronic administration of MP-10 caused weight loss associated with increased energy expenditure, browning of white adipose tissue, and improved insulin sensitivity. Analysis of human PET data further revealed marked levels of PDE10A in the supraclavicular region where brown/beige adipocytes are clustered in adults. Finally, the inhibition of PDE10A with MP-10 stimulated thermogenic gene expression in human brown adipocytes and induced browning of human white adipocytes. Collectively, our findings highlight a novel thermoregulatory role for PDE10A in mouse and human adipocytes and promote PDE10A inhibitors as promising candidates for the treatment of obesity and diabetes. © 2016 The Authors. Published under the terms of the CC BY 4.0 license.

  3. Implementation and performance of FDPS: a framework for developing parallel particle simulation codes

    NASA Astrophysics Data System (ADS)

    Iwasawa, Masaki; Tanikawa, Ataru; Hosono, Natsuki; Nitadori, Keigo; Muranushi, Takayuki; Makino, Junichiro

    2016-08-01

    We present the basic idea, implementation, measured performance, and performance model of FDPS (Framework for Developing Particle Simulators). FDPS is an application-development framework which helps researchers to develop simulation programs using particle methods for large-scale distributed-memory parallel supercomputers. A particle-based simulation program for distributed-memory parallel computers needs to perform domain decomposition, exchange of particles which are not in the domain of each computing node, and gathering of the particle information in other nodes which are necessary for interaction calculation. Also, even if distributed-memory parallel computers are not used, in order to reduce the amount of computation, algorithms such as the Barnes-Hut tree algorithm or the Fast Multipole Method should be used in the case of long-range interactions. For short-range interactions, some methods to limit the calculation to neighbor particles are required. FDPS provides all of these functions which are necessary for efficient parallel execution of particle-based simulations as "templates," which are independent of the actual data structure of particles and the functional form of the particle-particle interaction. By using FDPS, researchers can write their programs with the amount of work necessary to write a simple, sequential and unoptimized program of O(N^2) calculation cost, and yet the program, once compiled with FDPS, will run efficiently on large-scale parallel supercomputers. A simple gravitational N-body program can be written in around 120 lines. We report the actual performance of these programs and the performance model. The weak scaling performance is very good, and almost linear speed-up was obtained for up to the full system of the K computer. The minimum calculation time per timestep is in the range of 30 ms (N = 10^7) to 300 ms (N = 10^9). These are currently limited by the time for the calculation of the domain decomposition and communication
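
    For orientation, the "simple, sequential and unoptimized program of O(N^2) calculation cost" mentioned above can be sketched as a direct-summation gravity kernel. This is plain Python written for illustration, not FDPS code and not its API; the function name, the gravitational constant `g`, and the softening length `eps` are assumptions.

```python
import math


def accelerations(pos, mass, g=1.0, eps=0.0):
    """Direct O(N^2) gravitational accelerations.
    pos: list of [x, y, z]; mass: list of floats; eps: softening length."""
    n = len(pos)
    acc = [[0.0, 0.0, 0.0] for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue  # a particle exerts no force on itself
            d = [pos[j][k] - pos[i][k] for k in range(3)]
            r2 = d[0] * d[0] + d[1] * d[1] + d[2] * d[2] + eps * eps
            inv_r3 = 1.0 / (r2 * math.sqrt(r2))
            for k in range(3):
                acc[i][k] += g * mass[j] * d[k] * inv_r3
    return acc


# Two unit masses one unit apart attract each other with equal and opposite pull.
print(accelerations([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]], [1.0, 1.0]))
```

    The double loop visits every ordered pair, which is the quadratic cost that tree algorithms such as Barnes-Hut are meant to avoid for large N.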

  4. DISC1, PDE4B, and NDE1 at the centrosome and synapse

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bradshaw, Nicholas J.; Ogawa, Fumiaki; Antolin-Fontes, Beatriz

    Disrupted-In-Schizophrenia 1 (DISC1) is a risk factor for schizophrenia and other major mental illnesses. Its protein binding partners include the Nuclear Distribution Factor E Homologs (NDE1 and NDEL1), LIS1, and phosphodiesterases 4B and 4D (PDE4B and PDE4D). We demonstrate that NDE1, NDEL1 and LIS1, together with their binding partner dynein, associate with DISC1, PDE4B and PDE4D within the cell, and provide evidence that this complex is present at the centrosome. LIS1 and NDEL1 have been previously suggested to be synaptic, and we now demonstrate localisation of DISC1, NDE1, and PDE4B at synapses in cultured neurons. NDE1 is phosphorylated by cAMP-dependent Protein Kinase A (PKA), whose activity is, in turn, regulated by the cAMP hydrolysis activity of phosphodiesterases, including PDE4. We propose that DISC1 acts as an assembly scaffold for all of these proteins and that the NDE1/NDEL1/LIS1/dynein complex is modulated by cAMP levels via PKA and PDE4.

  5. A Novel Access to Arylated and Heteroarylated Beta-Carboline Based PDE5 Inhibitors

    PubMed Central

    Ahmed, Nermin S.; Gary, Bernard D.; Piazza, Gary A.; Tinsley, Heather N.; Laufer, Stefan; Abadi, Ashraf H.

    2016-01-01

    Starting from a previously reported lead compound GR30040X (a hydantoin tetrahydro-β-carboline derivative with a 4-pyridinyl ring at C-5), a series of structurally related tetrahydro-β-carboline derivatives were prepared. The tetrahydro-β-carboline skeleton was fused either to a hydantoin or to a piperazindione ring; the pendant aryl group attached to C-5 or C-6 was changed to a 3,4-dimethoxyphenyl or a 3-pyridinyl ring; different N-substituents on the terminal ring were introduced: a straight chain ethyl group, a branched tert-butyl group and a p-chlorophenyl group rather than the n-butyl group of the lead compound. All four possible diastereomers of target tetrahydro-β-carboline derivatives were prepared and separated by column chromatography, and the significance of these stereochemical manipulations was studied. Synthesized compounds were evaluated for their inhibitory effect versus PDE5. Seven hits were obtained with appreciable inhibitory activity versus PDE5 with IC50s 0.14 - 4.99 μM. PMID:21054274

  6. Global Magnetohydrodynamic Simulation Using High Performance FORTRAN on Parallel Computers

    NASA Astrophysics Data System (ADS)

    Ogino, T.

    High Performance Fortran (HPF) is one of the modern and common techniques for achieving high-performance parallel computation. We have translated a 3-dimensional magnetohydrodynamic (MHD) simulation code of the Earth's magnetosphere from VPP Fortran to HPF/JA on the Fujitsu VPP5000/56 vector-parallel supercomputer; the MHD code was fully vectorized and fully parallelized in VPP Fortran. The entire performance and capability of the HPF MHD code could be shown to be almost comparable to that of VPP Fortran. A 3-dimensional global MHD simulation of the Earth's magnetosphere was performed at a speed of over 400 Gflops with an efficiency of 76.5% on the VPP5000/56 in vector and parallel computation that permitted comparison with catalog values. We have concluded that fluid and MHD codes that are fully vectorized and fully parallelized in VPP Fortran can be translated with relative ease to HPF/JA, and a code in HPF/JA may be expected to perform comparably to the same code written in VPP Fortran.

  7. Using parallel computing for the display and simulation of the space debris environment

    NASA Astrophysics Data System (ADS)

    Möckel, M.; Wiedemann, C.; Flegel, S.; Gelhaus, J.; Vörsmann, P.; Klinkrad, H.; Krag, H.

    2011-07-01

    Parallelism is becoming the leading paradigm in today's computer architectures. In order to take full advantage of this development, new algorithms have to be specifically designed for parallel execution while many old ones have to be upgraded accordingly. One field in which parallel computing has been firmly established for many years is computer graphics. Calculating and displaying three-dimensional computer generated imagery in real time requires complex numerical operations to be performed at high speed on a large number of objects. Since most of these objects can be processed independently, parallel computing is applicable in this field. Modern graphics processing units (GPUs) have become capable of performing millions of matrix and vector operations per second on multiple objects simultaneously. As a side project, a software tool is currently being developed at the Institute of Aerospace Systems that provides an animated, three-dimensional visualization of both actual and simulated space debris objects. Due to the nature of these objects it is possible to process them individually and independently from each other. Therefore, an analytical orbit propagation algorithm has been implemented to run on a GPU. By taking advantage of all its processing power a huge performance increase, compared to its CPU-based counterpart, could be achieved. For several years efforts have been made to harness this computing power for applications other than computer graphics. Software tools for the simulation of space debris are among those that could profit from embracing parallelism. With recently emerged software development tools such as OpenCL it is possible to transfer the new algorithms used in the visualization outside the field of computer graphics and implement them, for example, into the space debris simulation environment. This way they can make use of parallel hardware such as GPUs and Multi-Core-CPUs for faster computation. In this paper the visualization software

  8. Using parallel computing for the display and simulation of the space debris environment

    NASA Astrophysics Data System (ADS)

    Moeckel, Marek; Wiedemann, Carsten; Flegel, Sven Kevin; Gelhaus, Johannes; Klinkrad, Heiner; Krag, Holger; Voersmann, Peter

    Parallelism is becoming the leading paradigm in today's computer architectures. In order to take full advantage of this development, new algorithms have to be specifically designed for parallel execution while many old ones have to be upgraded accordingly. One field in which parallel computing has been firmly established for many years is computer graphics. Calculating and displaying three-dimensional computer generated imagery in real time requires complex numerical operations to be performed at high speed on a large number of objects. Since most of these objects can be processed independently, parallel computing is applicable in this field. Modern graphics processing units (GPUs) have become capable of performing millions of matrix and vector operations per second on multiple objects simultaneously. As a side project, a software tool is currently being developed at the Institute of Aerospace Systems that provides an animated, three-dimensional visualization of both actual and simulated space debris objects. Due to the nature of these objects it is possible to process them individually and independently from each other. Therefore, an analytical orbit propagation algorithm has been implemented to run on a GPU. By taking advantage of all its processing power a huge performance increase, compared to its CPU-based counterpart, could be achieved. For several years efforts have been made to harness this computing power for applications other than computer graphics. Software tools for the simulation of space debris are among those that could profit from embracing parallelism. With recently emerged software development tools such as OpenCL it is possible to transfer the new algorithms used in the visualization outside the field of computer graphics and implement them, for example, into the space debris simulation environment. This way they can make use of parallel hardware such as GPUs and Multi-Core-CPUs for faster computation. In this paper the visualization software

  9. Reusable Component Model Development Approach for Parallel and Distributed Simulation

    PubMed Central

    Zhu, Feng; Yao, Yiping; Chen, Huilong; Yao, Feng

    2014-01-01

    Model reuse is a key issue to be resolved in parallel and distributed simulation at present. However, component models built by different domain experts usually have diversiform interfaces, couple tightly, and bind with simulation platforms closely. As a result, they are difficult to be reused across different simulation platforms and applications. To address the problem, this paper first proposed a reusable component model framework. Based on this framework, then our reusable model development approach is elaborated, which contains two phases: (1) domain experts create simulation computational modules observing three principles to achieve their independence; (2) model developer encapsulates these simulation computational modules with six standard service interfaces to improve their reusability. The case study of a radar model indicates that the model developed using our approach has good reusability and it is easy to be used in different simulation platforms and applications. PMID:24729751

  10. A special case of the Poisson PDE formulated for Earth's surface and its capability to approximate the terrain mass density employing land-based gravity data, a case study in the south of Iran

    NASA Astrophysics Data System (ADS)

    AllahTavakoli, Yahya; Safari, Abdolreza; Vaníček, Petr

    2016-12-01

    This paper resurrects a version of Poisson's Partial Differential Equation (PDE) associated with the gravitational field at the Earth's surface and illustrates how the PDE possesses a capability to extract the mass density of Earth's topography from land-based gravity data. Herein, first we propound a theorem which mathematically introduces this version of Poisson's PDE adapted for the Earth's surface and then we use this PDE to develop a method of approximating the terrain mass density. Also, we carry out a real case study showing how the proposed approach is able to be applied to a set of land-based gravity data. In the case study, the method is summarized by an algorithm and applied to a set of gravity stations located along a part of the north coast of the Persian Gulf in the south of Iran. The results were numerically validated via rock-samplings as well as a geological map. Also, the method was compared with two conventional methods of mass density reduction. The numerical experiments indicate that the Poisson PDE at the Earth's surface has the capability to extract the mass density from land-based gravity data and is able to provide an alternative and somewhat more precise method of estimating the terrain mass density.

  11. Parallel Simulation of Subsonic Fluid Dynamics on a Cluster of Workstations.

    DTIC Science & Technology

    1994-11-01

    inside wind musical instruments. Typical simulations achieve 80% parallel efficiency (speedup/processors) using 20 HP-Apollo workstations. Detailed... Subject terms: AI, MIT, Artificial Intelligence, Distributed Computing, Workstation Cluster, Network, Fluid Dynamics, Musical Instruments... for example, the flow of air inside wind musical instruments. Typical simulations achieve 80% parallel efficiency (speedup/processors) using 20 HP
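
    The definition of parallel efficiency used in this record (speedup divided by number of processors) can be turned around to recover the implied speedup. The short sketch below does that arithmetic; the function names are hypothetical.

```python
def parallel_efficiency(speedup, processors):
    """Efficiency as defined in the record: speedup / processors."""
    return speedup / processors


def implied_speedup(efficiency, processors):
    """Invert the definition: speedup = efficiency * processors."""
    return efficiency * processors


# 80% efficiency on 20 workstations corresponds to a speedup of about 16.
print(implied_speedup(0.80, 20))
```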

  12. PDE5 inhibitors as therapeutics for heart disease, diabetes and cancer.

    PubMed

    Das, Anindita; Durrant, David; Salloum, Fadi N; Xi, Lei; Kukreja, Rakesh C

    2015-03-01

    The phosphodiesterase 5 (PDE5) inhibitors, including sildenafil (Viagra™), vardenafil (Levitra™), and tadalafil (Cialis™) have been developed for treatment of erectile dysfunction. Moreover, sildenafil and tadalafil are used for the management of pulmonary arterial hypertension in patients. Since our first report showing the cardioprotective effect of sildenafil in 2002, there has been tremendous growth of preclinical and clinical studies on the use of PDE5 inhibitors for cardiovascular diseases and cancer. Numerous animal studies have demonstrated that PDE5 inhibitors have a powerful protective effect against myocardial ischemia/reperfusion (I/R) injury, doxorubicin cardiotoxicity, ischemic and diabetic cardiomyopathy, cardiac hypertrophy, Duchenne muscular dystrophy and the improvement of stem cell efficacy for myocardial repair. Mechanistically, PDE5 inhibitors protect the heart against I/R injury through increased expression of nitric oxide synthases, activation of protein kinase G (PKG), PKG-dependent hydrogen sulfide generation, and phosphorylation of glycogen synthase kinase-3β - a master switch immediately proximal to mitochondrial permeability transition pore and the end effector of cardioprotection. In addition, PDE5 inhibitors enhance the sensitivity of certain types of cancer to standard chemotherapeutic drugs, including doxorubicin. Many clinical trials with PDE5 inhibitors have focused on the potential cardiovascular and anti-cancer benefits. Despite mixed results of these clinical trials, there is a continuing strong interest by basic scientists and clinical investigators in exploring their new clinical uses. It is our hope that future new mechanistic investigations and carefully designed clinical trials would help in reaping additional benefits of PDE5 inhibitors for cardiovascular disease and cancer in patients. Copyright © 2014 Elsevier Inc. All rights reserved.

  13. The estimation of material and patch parameters in a PDE-based circular plate model

    NASA Technical Reports Server (NTRS)

    Banks, H. T.; Smith, Ralph C.; Brown, D. E.; Metcalf, Vern L.; Silcox, R. J.

    1995-01-01

    The estimation of material and patch parameters for a system involving a circular plate, to which piezoceramic patches are bonded, is considered. A partial differential equation (PDE) model for the thin circular plate is used with the passive and active contributions from the patches included in the internal and external bending moments. This model contains piecewise constant parameters describing the density, flexural rigidity, Poisson ratio, and Kelvin-Voigt damping for the system as well as patch constants and a coefficient for viscous air damping. Examples demonstrating the estimation of these parameters with experimental acceleration data and a variety of inputs to the experimental plate are presented. By using a physically-derived PDE model to describe the system, parameter sets consistent across experiments are obtained, even when phenomena such as damping due to electric circuits affect the system dynamics.

  14. Research on Parallel Three Phase PWM Converters base on RTDS

    NASA Astrophysics Data System (ADS)

    Xia, Yan; Zou, Jianxiao; Li, Kai; Liu, Jingbo; Tian, Jun

    2018-01-01

    Parallel operation of converters can increase the capacity of a system, but it may lead to zero-sequence circulating current, so the control of circulating current is an important goal in the design of parallel inverters. In this paper, the Real Time Digital Simulator (RTDS) is used to model the parallel converter system in real time and to study the suppression of circulating current. The equivalent model of two parallel converters and the zero-sequence circulating current (ZSCC) were established and analyzed, and a strategy using variable zero vector control was then proposed to suppress the circulating current. For two parallel modular converters, a hardware-in-the-loop (HIL) study based on RTDS and a practical experiment were carried out; the results show that the proposed control strategy is feasible and effective.

  15. A conformational switch in the inhibitory gamma-subunit of PDE6 upon enzyme activation by transducin.

    PubMed

    Granovsky, A E; Artemyev, N O

    2001-11-06

    In response to light, a photoreceptor G protein, transducin, activates cGMP-phosphodiesterase (PDE6) by displacing the inhibitory gamma-subunits (Pgamma) from the enzyme's catalytic sites. Evidence suggests that the activation of PDE6 involves a conformational change of the key inhibitory C-terminal domain of Pgamma. In this study, the C-terminal region of Pgamma, Pgamma-73-85, has been targeted for Ala-scanning mutagenesis to identify the point-to-point interactions between Pgamma and the PDE6 catalytic subunits and to probe the nature of the conformational change. Pgamma mutants were tested for their ability to inhibit PDE6 and a chimeric PDE5-conePDE6 enzyme containing the Pgamma C-terminus-binding site of cone PDE. This analysis has revealed that in addition to previously characterized Ile86 and Ile87, important inhibitory contact residues of Pgamma include Asn74, His75, and Leu78. The patterns of mutant PDE5-conePDE6 enzyme inhibition suggest the interaction between the PgammaAsn74/His75 sequence and Met758 of the cone PDE6alpha' catalytic subunit. This interaction, and the interaction between the PgammaIle86/Ile87 and PDE6alpha'Phe777/Phe781 residues, is most consistent with an alpha-helical structure of the Pgamma C-terminus. The analysis of activation of PDE6 enzymes containing Pgamma mutants with Ala-substituted transducin-contact residues demonstrated the critical role of PgammaLeu76. Accordingly, we hypothesize that the initial step in PDE6 activation involves an interaction of transducin-alpha with PgammaLeu76. This interaction introduces a bend into the alpha-helical structure of the Pgamma C-terminus, allowing transducin-alpha to further twist the C-terminus thereby uncovering the catalytic pocket of PDE6.

  16. Phosphodiesterase (PDE5) inhibition assay for rapid detection of erectile dysfunction drugs and analogs in sexual enhancement products.

    PubMed

    Santillo, Michael F; Mapa, Mapa S T

    2018-02-28

    Products marketed as dietary supplements for sexual enhancement are frequently adulterated with phosphodiesterase-5 (PDE5) inhibitors, which are erectile dysfunction drugs or their analogs that can cause adverse health effects. Due to widespread adulteration, a rapid screening assay was developed to detect PDE5 inhibitors in adulterated products. The assay employs fluorescence detection and is based on measuring inhibition of PDE5 activity, the pharmacological mechanism shared among the adulterants. Initially, the assay reaction scheme was established and characterized, followed by analysis of 9 representative PDE5 inhibitors (IC50, 0.4-4.0 ng mL^-1), demonstrating sensitive detection in matrix-free solutions. Next, dietary supplements serving as matrix blanks (n = 25) were analyzed to determine matrix interference and establish a threshold value; there were no false positives. Finally, matrix blanks were spiked with 9 individual PDE5 inhibitors, along with several mixtures. All 9 adulterants were successfully detected (≤ 5% false negative rate; n = 20) at a concentration of 1.00 mg g^-1, which is over 5 times lower than concentrations commonly encountered in adulterated products. A major distinction of the PDE5 inhibition assay is the ability to detect adulterants without prior knowledge of their chemical structures, demonstrating a broad-based detection capability that can address a continuously evolving threat of new adulterants. The PDE5 inhibition assay can analyze over 40 samples simultaneously within 15 minutes and involves a single incubation step and simple data analysis, all of which are advantageous for combating the widespread adulteration of sex-enhancement products. Published 2018. This article is a U.S. Government work and is in the public domain in the USA.

  17. Parallel multiscale simulations of a brain aneurysm

    PubMed Central

    Grinberg, Leopold; Fedosov, Dmitry A.; Karniadakis, George Em

    2012-01-01

    Cardiovascular pathologies, such as a brain aneurysm, are affected by the global blood circulation as well as by the local microrheology. Hence, developing computational models for such cases requires the coupling of disparate spatial and temporal scales often governed by diverse mathematical descriptions, e.g., by partial differential equations (continuum) and ordinary differential equations for discrete particles (atomistic). However, interfacing atomistic-based with continuum-based domain discretizations is a challenging problem that requires both mathematical and computational advances. We present here a hybrid methodology that enabled us to perform the first multi-scale simulations of platelet depositions on the wall of a brain aneurysm. The large scale flow features in the intracranial network are accurately resolved by using the high-order spectral element Navier-Stokes solver NεκTαr. The blood rheology inside the aneurysm is modeled using a coarse-grained stochastic molecular dynamics approach (the dissipative particle dynamics method) implemented in the parallel code LAMMPS. The continuum and atomistic domains overlap with interface conditions provided by effective forces computed adaptively to ensure continuity of states across the interface boundary. A two-way interaction is allowed with the time-evolving boundary of the (deposited) platelet clusters tracked by an immersed boundary method. The corresponding heterogeneous solvers (NεκTαr and LAMMPS) are linked together by a computational multilevel message passing interface that facilitates modularity and high parallel efficiency. Results of multiscale simulations of clot formation inside the aneurysm in a patient-specific arterial tree are presented. We also discuss the computational challenges involved and present scalability results of our coupled solver on up to 300K computer processors. Validation of such coupled atomistic-continuum models is a main open issue that has to be addressed in future

  18. Parallel multiscale simulations of a brain aneurysm.

    PubMed

    Grinberg, Leopold; Fedosov, Dmitry A; Karniadakis, George Em

    2013-07-01

    Cardiovascular pathologies, such as a brain aneurysm, are affected by the global blood circulation as well as by the local microrheology. Hence, developing computational models for such cases requires the coupling of disparate spatial and temporal scales often governed by diverse mathematical descriptions, e.g., by partial differential equations (continuum) and ordinary differential equations for discrete particles (atomistic). However, interfacing atomistic-based with continuum-based domain discretizations is a challenging problem that requires both mathematical and computational advances. We present here a hybrid methodology that enabled us to perform the first multi-scale simulations of platelet depositions on the wall of a brain aneurysm. The large scale flow features in the intracranial network are accurately resolved by using the high-order spectral element Navier-Stokes solver NεκTαr. The blood rheology inside the aneurysm is modeled using a coarse-grained stochastic molecular dynamics approach (the dissipative particle dynamics method) implemented in the parallel code LAMMPS. The continuum and atomistic domains overlap with interface conditions provided by effective forces computed adaptively to ensure continuity of states across the interface boundary. A two-way interaction is allowed with the time-evolving boundary of the (deposited) platelet clusters tracked by an immersed boundary method. The corresponding heterogeneous solvers (NεκTαr and LAMMPS) are linked together by a computational multilevel message passing interface that facilitates modularity and high parallel efficiency. Results of multiscale simulations of clot formation inside the aneurysm in a patient-specific arterial tree are presented. We also discuss the computational challenges involved and present scalability results of our coupled solver on up to 300K computer processors. Validation of such coupled atomistic-continuum models is a main open issue that has to be addressed in future

  19. Parallel multiscale simulations of a brain aneurysm

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Grinberg, Leopold; Fedosov, Dmitry A.; Karniadakis, George Em, E-mail: george_karniadakis@brown.edu

    2013-07-01

    Cardiovascular pathologies, such as a brain aneurysm, are affected by the global blood circulation as well as by the local microrheology. Hence, developing computational models for such cases requires the coupling of disparate spatial and temporal scales often governed by diverse mathematical descriptions, e.g., by partial differential equations (continuum) and ordinary differential equations for discrete particles (atomistic). However, interfacing atomistic-based with continuum-based domain discretizations is a challenging problem that requires both mathematical and computational advances. We present here a hybrid methodology that enabled us to perform the first multiscale simulations of platelet depositions on the wall of a brain aneurysm. The large scale flow features in the intracranial network are accurately resolved by using the high-order spectral element Navier–Stokes solver NεκTαr. The blood rheology inside the aneurysm is modeled using a coarse-grained stochastic molecular dynamics approach (the dissipative particle dynamics method) implemented in the parallel code LAMMPS. The continuum and atomistic domains overlap with interface conditions provided by effective forces computed adaptively to ensure continuity of states across the interface boundary. A two-way interaction is allowed with the time-evolving boundary of the (deposited) platelet clusters tracked by an immersed boundary method. The corresponding heterogeneous solvers (NεκTαr and LAMMPS) are linked together by a computational multilevel message passing interface that facilitates modularity and high parallel efficiency. Results of multiscale simulations of clot formation inside the aneurysm in a patient-specific arterial tree are presented. We also discuss the computational challenges involved and present scalability results of our coupled solver on up to 300 K computer processors. Validation of such coupled atomistic-continuum models is a main open issue that has to be addressed

  20. A "Reverse-Schur" Approach to Optimization With Linear PDE Constraints: Application to Biomolecule Analysis and Design.

    PubMed

    Bardhan, Jaydeep P; Altman, Michael D; Tidor, B; White, Jacob K

    2009-01-01

    We present a partial-differential-equation (PDE)-constrained approach for optimizing a molecule's electrostatic interactions with a target molecule. The approach, which we call reverse-Schur co-optimization, can be more than two orders of magnitude faster than the traditional approach to electrostatic optimization. The efficiency of the co-optimization approach may enhance the value of electrostatic optimization for ligand-design efforts-in such projects, it is often desirable to screen many candidate ligands for their viability, and the optimization of electrostatic interactions can improve ligand binding affinity and specificity. The theoretical basis for electrostatic optimization derives from linear-response theory, most commonly continuum models, and simple assumptions about molecular binding processes. Although the theory has been used successfully to study a wide variety of molecular binding events, its implications have not yet been fully explored, in part due to the computational expense associated with the optimization. The co-optimization algorithm achieves improved performance by solving the optimization and electrostatic simulation problems simultaneously, and is applicable to both unconstrained and constrained optimization problems. Reverse-Schur co-optimization resembles other well-known techniques for solving optimization problems with PDE constraints. Model problems as well as realistic examples validate the reverse-Schur method, and demonstrate that our technique and alternative PDE-constrained methods scale very favorably compared to the standard approach. Regularization, which ordinarily requires an explicit representation of the objective function, can be included using an approximate Hessian calculated using the new BIBEE/P (boundary-integral-based electrostatics estimation by preconditioning) method.

  1. Streaming parallel GPU acceleration of large-scale filter-based spiking neural networks.

    PubMed

    Slażyński, Leszek; Bohte, Sander

    2012-01-01

    The arrival of graphics processing (GPU) cards suitable for massively parallel computing promises affordable large-scale neural network simulation previously only available at supercomputing facilities. While the raw numbers suggest that GPUs may outperform CPUs by at least an order of magnitude, the challenge is to develop fine-grained parallel algorithms to fully exploit the particulars of GPUs. Computation in a neural network is inherently parallel and thus a natural match for GPU architectures: given inputs, the internal state for each neuron can be updated in parallel. We show that for filter-based spiking neurons, like the Spike Response Model, the additive nature of membrane potential dynamics enables additional update parallelism. This also reduces the accumulation of numerical errors when using single precision computation, the native precision of GPUs. We further show that optimizing simulation algorithms and data structures to the GPU's architecture has a large pay-off: for example, matching iterative neural updating to the memory architecture of the GPU speeds up this simulation step by a factor of three to five. With such optimizations, we can simulate in better-than-realtime plausible spiking neural networks of up to 50 000 neurons, processing over 35 million spiking events per second.

  2. Homology modeling, docking studies and molecular dynamic simulations using graphical processing unit architecture to probe the type-11 phosphodiesterase catalytic site: a computational approach for the rational design of selective inhibitors.

    PubMed

    Cichero, Elena; D'Ursi, Pasqualina; Moscatelli, Marco; Bruno, Olga; Orro, Alessandro; Rotolo, Chiara; Milanesi, Luciano; Fossa, Paola

    2013-12-01

    Phosphodiesterase 11 (PDE11) is the latest isoform of the PDEs family to be identified, acting on both cyclic adenosine monophosphate and cyclic guanosine monophosphate. The initial reports of PDE11 found evidence for PDE11 expression in skeletal muscle, prostate, testis, and salivary glands; however, the tissue distribution of PDE11 still remains a topic of active study and some controversy. Given the sequence similarity between PDE11 and PDE5, several PDE5 inhibitors have been shown to cross-react with PDE11. Accordingly, many non-selective inhibitors, such as IBMX, zaprinast, sildenafil, and dipyridamole, have been documented to inhibit PDE11. Only recently, a series of dihydrothieno[3,2-d]pyrimidin-4(3H)-one derivatives proved to be selective toward the PDE11 isoform. In the absence of experimental data about PDE11 X-ray structures, we found it interesting to gain a better understanding of the enzyme-inhibitor interactions using in silico simulations. In this work, we describe a computational approach based on homology modeling, docking, and molecular dynamics simulation to derive a predictive 3D model of PDE11. Using a Graphical Processing Unit architecture, it is possible to perform long simulations, find stable interactions involved in the complex, and finally to suggest guidelines for the identification and synthesis of potent and selective inhibitors. © 2013 John Wiley & Sons A/S.

  3. On the utility of graphics cards to perform massively parallel simulation of advanced Monte Carlo methods

    PubMed Central

    Lee, Anthony; Yau, Christopher; Giles, Michael B.; Doucet, Arnaud; Holmes, Christopher C.

    2011-01-01

    We present a case-study on the utility of graphics cards to perform massively parallel simulation of advanced Monte Carlo methods. Graphics cards, containing multiple Graphics Processing Units (GPUs), are self-contained parallel computational devices that can be housed in conventional desktop and laptop computers and can be thought of as prototypes of the next generation of many-core processors. For certain classes of population-based Monte Carlo algorithms they offer massively parallel simulation, with the added advantage over conventional distributed multi-core processors that they are cheap, easily accessible, easy to maintain, easy to code, dedicated local devices with low power consumption. On a canonical set of stochastic simulation examples including population-based Markov chain Monte Carlo methods and Sequential Monte Carlo methods, we find speedups from 35 to 500 fold over conventional single-threaded computer code. Our findings suggest that GPUs have the potential to facilitate the growth of statistical modelling into complex data rich domains through the availability of cheap and accessible many-core computation. We believe the speedup we observe should motivate wider use of parallelizable simulation methods and greater methodological attention to their design. PMID:22003276

  4. Engineered stabilization and structural analysis of the autoinhibited conformation of PDE4

    DOE PAGES

    Cedervall, Peder; Aulabaugh, Ann; Geoghegan, Kieran F.; ...

    2015-03-09

    Phosphodiesterase 4 (PDE4) is an essential contributor to intracellular signaling and an important drug target. The four members of this enzyme family (PDE4A to -D) are functional dimers in which each subunit contains two upstream conserved regions (UCR), UCR1 and -2, which precede the C-terminal catalytic domain. Alternative promoters, transcriptional start sites, and mRNA splicing lead to the existence of over 25 variants of PDE4, broadly classified as long, short, and supershort forms. We report the X-ray crystal structure of long form PDE4B containing UCR1, UCR2, and the catalytic domain, crystallized as a dimer in which a disulfide bond cross-links cysteines engineered into UCR2 and the catalytic domain. Biochemical and mass spectrometric analyses showed that the UCR2-catalytic domain interaction occurs in trans, and established that this interaction regulates the catalytic activity of PDE4. By elucidating the key structural determinants of dimerization, we show that only long forms of PDE4 can be regulated by this mechanism. The results also provide a structural basis for the long-standing observation of high- and low-affinity binding sites for the prototypic inhibitor rolipram.

  5. Molecular modeling study of binding to the catalytic site of PDE4 enzymes by a novel class of inhibitors

    NASA Astrophysics Data System (ADS)

    Lawrenz, Morgan E.; Salter, E. A.; Wierzbicki, Andrzej; Thompson, W. J.

    Cyclic nucleotide phosphodiesterases (PDEs) comprise a superfamily of enzymes that hydrolyze the second messengers adenosine and guanosine 3',5'-cyclic monophosphate (cAMP and cGMP) to their noncyclic nucleotides (5'-AMP and 5'-GMP). Selective inhibitors of all 11 gene families of PDEs are being sought based on the different biochemical properties of the different isoforms, including their substrate specificities. The PDE4 gene family consists of cAMP-specific isoforms; selective PDE4 inhibitors such as rolipram have been developed, and related agents are used clinically as anti-inflammatory agents for asthma and COPD. The known crystal structures of PDE4 bound with rolipram and IBMX have allowed us to define plausible binding orientations for a novel class of benzylpyridazinone-based PDE4 inhibitors represented by EMD 94360 and EMD 95832 that are structurally distinct from rolipram. Molecular mechanics modeling with autodocking is used to explore energetically favorable binding orientations within the PDE4 catalytic site. We present two putative orientations for EMD 94360/95832 inhibitor binding. Our estimated interaction energies for rolipram, IBMX, EMD 94360, and EMD 95832 are consistent with the experimental data for their IC50 values. Key binding residues and interactions in these orientations are identified and compared with known binding motifs proposed for rolipram. The experimentally observed improved strength of inhibition exhibited by this novel class of PDE4 inhibitors is explained by the molecular modeling reported here.

  6. On Parallelizing Single Dynamic Simulation Using HPC Techniques and APIs of Commercial Software

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Diao, Ruisheng; Jin, Shuangshuang; Howell, Frederic

    Time-domain simulations are heavily used in today’s planning and operation practices to assess power system transient stability and post-transient voltage/frequency profiles following severe contingencies to comply with industry standards. Because of the increased modeling complexity, it is several times slower than real time for state-of-the-art commercial packages to complete a dynamic simulation for a large-scale model. With the growing stochastic behavior introduced by emerging technologies, power industry has seen a growing need for performing security assessment in real time. This paper presents a parallel implementation framework to speed up a single dynamic simulation by leveraging the existing stability model library in commercial tools through their application programming interfaces (APIs). Several high performance computing (HPC) techniques are explored such as parallelizing the calculation of generator current injection, identifying fast linear solvers for network solution, and parallelizing data outputs when interacting with APIs in the commercial package, TSAT. The proposed method has been tested on a WECC planning base case with detailed synchronous generator models and exhibits outstanding scalable performance with sufficient accuracy.

  7. Parallel network simulations with NEURON.

    PubMed

    Migliore, M; Cannia, C; Lytton, W W; Markram, Henry; Hines, M L

    2006-10-01

    The NEURON simulation environment has been extended to support parallel network simulations. Each processor integrates the equations for its subnet over an interval equal to the minimum (interprocessor) presynaptic spike generation to postsynaptic spike delivery connection delay. The performance of three published network models with very different spike patterns exhibits superlinear speedup on Beowulf clusters and demonstrates that spike communication overhead is often less than the benefit of an increased fraction of the entire problem fitting into high speed cache. On the EPFL IBM Blue Gene, almost linear speedup was obtained up to 100 processors. Increasing one model from 500 to 40,000 realistic cells exhibited almost linear speedup on 2,000 processors, with an integration time of 9.8 seconds and communication time of 1.3 seconds. The potential for speed-ups of several orders of magnitude makes practical the running of large network simulations that could otherwise not be explored.
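
    The minimum-delay windowing described in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not NEURON's API: the function name `integration_windows`, its arguments, and the use of milliseconds are all hypothetical.

```python
def integration_windows(interprocessor_delays_ms, t_stop_ms):
    """Split [0, t_stop_ms] into windows no longer than the minimum delay.

    A spike generated on another processor inside a window cannot be
    delivered before the window ends, so each processor can integrate
    its own subnet over the window without communicating.
    """
    window = min(interprocessor_delays_ms)
    if window <= 0:
        raise ValueError("interprocessor delays must be positive")
    windows = []
    start = 0.0
    while start < t_stop_ms:
        end = min(start + window, t_stop_ms)
        # Integrate locally over (start, end), then exchange spikes at `end`.
        windows.append((start, end))
        start = end
    return windows


# Hypothetical delays of 2.0, 1.0 and 3.5 ms give 1.0 ms windows.
schedule = integration_windows([2.0, 1.0, 3.5], t_stop_ms=3.5)
```

    A shorter minimum delay means more spike exchanges per simulated second, which is one reason the communication overhead depends on the model's connectivity.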

  8. Parallel Network Simulations with NEURON

    PubMed Central

    Migliore, M.; Cannia, C.; Lytton, W.W; Markram, Henry; Hines, M. L.

    2009-01-01

    The NEURON simulation environment has been extended to support parallel network simulations. Each processor integrates the equations for its subnet over an interval equal to the minimum (interprocessor) presynaptic spike generation to postsynaptic spike delivery connection delay. The performance of three published network models with very different spike patterns exhibits superlinear speedup on Beowulf clusters and demonstrates that spike communication overhead is often less than the benefit of an increased fraction of the entire problem fitting into high speed cache. On the EPFL IBM Blue Gene, almost linear speedup was obtained up to 100 processors. Increasing one model from 500 to 40,000 realistic cells exhibited almost linear speedup on 2000 processors, with an integration time of 9.8 seconds and communication time of 1.3 seconds. The potential for speed-ups of several orders of magnitude makes practical the running of large network simulations that could otherwise not be explored. PMID:16732488

  9. Modularized Parallel Neutron Instrument Simulation on the TeraGrid

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Meili; Cobb, John W; Hagen, Mark E

    2007-01-01

    In order to build a bridge between the TeraGrid (TG), a national scale cyberinfrastructure resource, and neutron science, the Neutron Science TeraGrid Gateway (NSTG) is focused on introducing productive HPC usage to the neutron science community, primarily the Spallation Neutron Source (SNS) at Oak Ridge National Laboratory (ORNL). Monte Carlo simulations are used as a powerful tool for instrument design and optimization at SNS. One of the successful efforts of a collaboration team composed of NSTG HPC experts and SNS instrument scientists is the development of a software facility named PSoNI, Parallelizing Simulations of Neutron Instruments. Parallelizing the traditional serial instrument simulation on TeraGrid resources, PSoNI quickly computes full instrument simulation at sufficient statistical levels in instrument design. Upon SNS successful commissioning, to the end of 2007, three out of five commissioned instruments in SNS target station will be available for initial users. Advanced instrument study, proposal feasibility evaluation, and experiment planning are on the immediate schedule of SNS, which pose further requirements such as flexibility and high runtime efficiency on fast instrument simulation. PSoNI has been redesigned to meet the new challenges and a preliminary version is developed on TeraGrid. This paper explores the motivation and goals of the new design, and the improved software structure. Further, it describes the realized new features seen from MPI parallelized McStas running high resolution design simulations of the SEQUOIA and BSS instruments at SNS. A discussion regarding future work, which is targeted to do fast simulation for automated experiment adjustment and comparing models to data in analysis, is also presented.

  10. PDE3A mutations cause autosomal dominant hypertension with brachydactyly.

    PubMed

    Maass, Philipp G; Aydin, Atakan; Luft, Friedrich C; Schächterle, Carolin; Weise, Anja; Stricker, Sigmar; Lindschau, Carsten; Vaegler, Martin; Qadri, Fatimunnisa; Toka, Hakan R; Schulz, Herbert; Krawitz, Peter M; Parkhomchuk, Dmitri; Hecht, Jochen; Hollfinger, Irene; Wefeld-Neuenfeld, Yvette; Bartels-Klein, Eireen; Mühl, Astrid; Kann, Martin; Schuster, Herbert; Chitayat, David; Bialer, Martin G; Wienker, Thomas F; Ott, Jürg; Rittscher, Katharina; Liehr, Thomas; Jordan, Jens; Plessis, Ghislaine; Tank, Jens; Mai, Knut; Naraghi, Ramin; Hodge, Russell; Hopp, Maxwell; Hattenbach, Lars O; Busjahn, Andreas; Rauch, Anita; Vandeput, Fabrice; Gong, Maolian; Rüschendorf, Franz; Hübner, Norbert; Haller, Hermann; Mundlos, Stefan; Bilginturan, Nihat; Movsesian, Matthew A; Klussmann, Enno; Toka, Okan; Bähring, Sylvia

    2015-06-01

    Cardiovascular disease is the most common cause of death worldwide, and hypertension is the major risk factor. Mendelian hypertension elucidates mechanisms of blood pressure regulation. Here we report six missense mutations in PDE3A (encoding phosphodiesterase 3A) in six unrelated families with mendelian hypertension and brachydactyly type E (HTNB). The syndrome features brachydactyly type E (BDE), severe salt-independent but age-dependent hypertension, an increased fibroblast growth rate, neurovascular contact at the rostral-ventrolateral medulla, altered baroreflex blood pressure regulation and death from stroke before age 50 years when untreated. In vitro analyses of mesenchymal stem cell-derived vascular smooth muscle cells (VSMCs) and chondrocytes provided insights into molecular pathogenesis. The mutations increased protein kinase A-mediated PDE3A phosphorylation and resulted in gain of function, with increased cAMP-hydrolytic activity and enhanced cell proliferation. Levels of phosphorylated VASP were diminished, and PTHrP levels were dysregulated. We suggest that the identified PDE3A mutations cause the syndrome. VSMC-expressed PDE3A deserves scrutiny as a therapeutic target for the treatment of hypertension.

  11. Parallelization of a Monte Carlo particle transport simulation code

    NASA Astrophysics Data System (ADS)

    Hadjidoukas, P.; Bousis, C.; Emfietzoglou, D.

    2010-05-01

    We have developed a high performance version of the Monte Carlo particle transport simulation code MC4. The original application code, developed in Visual Basic for Applications (VBA) for Microsoft Excel, was first rewritten in the C programming language for improving code portability. Several pseudo-random number generators have also been integrated and studied. The new MC4 version was then parallelized for shared and distributed-memory multiprocessor systems using the Message Passing Interface. Two parallel pseudo-random number generator libraries (SPRNG and DCMT) have been seamlessly integrated. The performance speedup of parallel MC4 has been studied on a variety of parallel computing architectures including an Intel Xeon server with 4 dual-core processors, a Sun cluster consisting of 16 nodes of 2 dual-core AMD Opteron processors and a 200 dual-processor HP cluster. For large problem size, which is limited only by the physical memory of the multiprocessor server, the speedup results are almost linear on all systems. We have validated the parallel implementation against the serial VBA and C implementations using the same random number generator. Our experimental results on the transport and energy loss of electrons in a water medium show that the serial and parallel codes are equivalent in accuracy. The present improvements allow for studying of higher particle energies with the use of more accurate physical models, and improve statistics as more particle tracks can be simulated in low response time.
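
    The idea of validating a parallel Monte Carlo run against a serial run that uses the same generator can be sketched as below. This is not the MC4 code: it swaps in Python's standard `random` module and a thread pool in place of SPRNG/DCMT and MPI, the exponential step-length model is a placeholder, and every name (`simulate_stream`, `base_seed`, `mean_free_path`) is hypothetical. Seeding each stream with `base_seed + stream_id` keeps runs reproducible but does not carry the statistical guarantees of a dedicated parallel generator library.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def simulate_stream(stream_id, n_particles, mean_free_path=1.0, base_seed=12345):
    """Tally the total sampled path length for one independent particle stream."""
    # Assumption: one private generator per stream, seeded from the stream id.
    rng = random.Random(base_seed + stream_id)
    return sum(rng.expovariate(1.0 / mean_free_path) for _ in range(n_particles))


def run_serial(n_streams, n_particles):
    """Process the streams one after another."""
    return [simulate_stream(i, n_particles) for i in range(n_streams)]


def run_parallel(n_streams, n_particles, workers=4):
    """Process the same streams concurrently; map() preserves stream order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda i: simulate_stream(i, n_particles),
                             range(n_streams)))


serial_tallies = run_serial(8, 1000)
parallel_tallies = run_parallel(8, 1000)
```

    Because each stream owns its generator, the per-stream tallies do not depend on how streams are scheduled, so the two result lists can be compared directly. Threads are used here only to keep the sketch short; they illustrate reproducibility rather than speedup.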

  12. A PDE Pricing Framework for Cross-Currency Interest Rate Derivatives with Target Redemption Features

    NASA Astrophysics Data System (ADS)

    Christara, Christina C.; Minh Dang, Duy; Jackson, Kenneth R.; Lakhany, Asif

    2010-09-01

    We propose a general framework for efficient pricing via a partial differential equation (PDE) approach for exotic cross-currency interest rate (IR) derivatives, with strong emphasis on long-dated foreign exchange (FX) IR hybrids, namely Power Reverse Dual Currency (PRDC) swaps with a FX Target Redemption (FX-TARN) provision. The FX-TARN provision provides a cap on the FX-linked PRDC coupon amounts, and once the accumulated coupon amount reaches this cap, the underlying PRDC swap terminates. Our PDE pricing framework is based on an auxiliary state variable to keep track of the total accumulated PRDC coupon amount. Finite differences on uniform grids and the Alternating Direction Implicit (ADI) method are used for the spatial and time discretizations, respectively, of the model-dependent PDE corresponding to each discretized value of the auxiliary variable. Numerical examples illustrating the convergence properties of the numerical methods are provided.

  13. Inhibition of PDE4B suppresses inflammation by increasing expression of the deubiquitinase CYLD

    PubMed Central

    Komatsu, Kensei; Lee, Ji-Yun; Miyata, Masanori; Hyang Lim, Jae; Jono, Hirofumi; Koga, Tomoaki; Xu, Haidong; Yan, Chen; Kai, Hirofumi; Li, Jian-Dong

    2013-01-01

    The deubiquitinase CYLD acts as a key negative regulator to tightly control overactive inflammation. Most anti-inflammatory strategies have focused on directly targeting the positive regulator, which often results in significant side effects such as suppression of the host defence response. Here, we show that inhibition of phosphodiesterase 4B (PDE4B) markedly enhances upregulation of CYLD expression in response to bacteria, thereby suggesting that PDE4B acts as a negative regulator for CYLD. Interestingly, in Cyld-deficient mice, inhibition of PDE4B no longer suppresses inflammation. Moreover, PDE4B negatively regulates CYLD via specific activation of JNK2 but not JNK1. Importantly, ototopical post-inoculation administration of a PDE4 inhibitor suppresses inflammation in this animal model, thus demonstrating the therapeutic potential of targeting PDE4. These studies provide insights into how inflammation is tightly regulated via the inhibition of its negative regulator and may also lead to the development of new anti-inflammatory therapeutics that upregulate CYLD expression. PMID:23575688

  14. The Distributed Diagonal Force Decomposition Method for Parallelizing Molecular Dynamics Simulations

    PubMed Central

    Boršnik, Urban; Miller, Benjamin T.; Brooks, Bernard R.; Janežič, Dušanka

    2011-01-01

    Parallelization is an effective way to reduce the computational time needed for molecular dynamics simulations. We describe a new parallelization method, the distributed-diagonal force decomposition method, with which we extend and improve the existing force decomposition methods. Our new method requires less data communication during molecular dynamics simulations than replicated data and current force decomposition methods, increasing the parallel efficiency. It also dynamically load-balances the processors' computational load throughout the simulation. The method is readily implemented in existing molecular dynamics codes and it has been incorporated into the CHARMM program, allowing its immediate use in conjunction with the many molecular dynamics simulation techniques that are already present in the program. We also present the design of the Force Decomposition Machine, a cluster of personal computers and networks that is tailored to running molecular dynamics simulations using the distributed diagonal force decomposition method. The design is expandable and provides various degrees of fault resilience. This approach is easily adaptable to computers with Graphics Processing Units because it is independent of the processor type being used. PMID:21793007

  15. Synchronous parallel system for emulation and discrete event simulation

    NASA Technical Reports Server (NTRS)

    Steinman, Jeffrey S. (Inventor)

    1992-01-01

    A synchronous parallel system for emulation and discrete event simulation having parallel nodes responds to received messages at each node by generating event objects having individual time stamps, stores only the changes to state variables of the simulation object attributable to the event object, and produces corresponding messages. The system refrains from transmitting the messages and changing the state variables while it determines whether the changes are superseded, and then stores the unchanged state variables in the event object for later restoral to the simulation object if called for. This determination preferably includes sensing the time stamp of each new event object and determining which new event object has the earliest time stamp as the local event horizon, determining the earliest local event horizon of the nodes as the global event horizon, and ignoring the events whose time stamps are less than the global event horizon. Host processing between the system and external terminals enables such a terminal to query, monitor, command or participate with a simulation object during the simulation process.
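
    The event-horizon computation named in this abstract can be sketched as follows. The sketch covers only the two minima and a partition of pending events around the resulting horizon; what the system then does with each group is left to the abstract's description. All function names are hypothetical, and time stamps are assumed to be plain numbers.

```python
import math


def local_event_horizon(new_event_times):
    """Earliest time stamp among the new events generated on one node."""
    return min(new_event_times, default=math.inf)


def global_event_horizon(new_event_times_per_node):
    """Earliest local event horizon over all nodes."""
    return min(
        (local_event_horizon(times) for times in new_event_times_per_node),
        default=math.inf,
    )


def split_by_horizon(pending_event_times, horizon):
    """Partition pending events into those before the horizon and the rest."""
    before = [t for t in pending_event_times if t < horizon]
    at_or_after = [t for t in pending_event_times if t >= horizon]
    return before, at_or_after


# Three hypothetical nodes; the third generated no new events this cycle.
horizon = global_event_horizon([[5.0, 7.5], [4.2], []])
before, at_or_after = split_by_horizon([1.0, 4.2, 3.9, 6.0], horizon)
```

    A node with no new events contributes an infinite local horizon, so it never constrains the global minimum.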

  16. Synchronous Parallel System for Emulation and Discrete Event Simulation

    NASA Technical Reports Server (NTRS)

    Steinman, Jeffrey S. (Inventor)

    2001-01-01

    A synchronous parallel system for emulation and discrete event simulation having parallel nodes responds to received messages at each node by generating event objects having individual time stamps, stores only the changes to the state variables of the simulation object attributable to the event object and produces corresponding messages. The system refrains from transmitting the messages and changing the state variables while it determines whether the changes are superseded, and then stores the unchanged state variables in the event object for later restoral to the simulation object if called for. This determination preferably includes sensing the time stamp of each new event object and determining which new event object has the earliest time stamp as the local event horizon, determining the earliest local event horizon of the nodes as the global event horizon, and ignoring events whose time stamps are less than the global event horizon. Host processing between the system and external terminals enables such a terminal to query, monitor, command or participate with a simulation object during the simulation process.

  17. A Parallel Sliding Region Algorithm to Make Agent-Based Modeling Possible for a Large-Scale Simulation: Modeling Hepatitis C Epidemics in Canada.

    PubMed

    Wong, William W L; Feng, Zeny Z; Thein, Hla-Hla

    2016-11-01

    Agent-based models (ABMs) are computer simulation models that define interactions among agents and simulate emergent behaviors that arise from the ensemble of local decisions. ABMs have been increasingly used to examine trends in infectious disease epidemiology. However, the main limitation of ABMs is the high computational cost for a large-scale simulation. To improve the computational efficiency for large-scale ABM simulations, we built a parallelizable sliding region algorithm (SRA) for ABM and compared it to a nonparallelizable ABM. We developed a complex agent network and performed two simulations to model hepatitis C epidemics based on the real demographic data from Saskatchewan, Canada. The first simulation used the SRA, which processed each postal code subregion in sequence. The second simulation processed the entire population simultaneously. It was concluded that the parallelizable SRA showed computational time saving with comparable results in a province-wide simulation. Using the same method, SRA can be generalized for performing a country-wide simulation. Thus, this parallel algorithm enables the possibility of using ABM for large-scale simulation with limited computational resources.

  18. Generating performance portable geoscientific simulation code with Firedrake (Invited)

    NASA Astrophysics Data System (ADS)

    Ham, D. A.; Bercea, G.; Cotter, C. J.; Kelly, P. H.; Loriant, N.; Luporini, F.; McRae, A. T.; Mitchell, L.; Rathgeber, F.

    2013-12-01

    This presentation will demonstrate how a change in simulation programming paradigm can be exploited to deliver sophisticated simulation capability which is far easier to programme than are conventional models, is capable of exploiting different emerging parallel hardware, and is tailored to the specific needs of geoscientific simulation. Geoscientific simulation represents a grand challenge computational task: many of the largest computers in the world are tasked with this field, and the requirements of resolution and complexity of scientists in this field are far from being sated. However, single thread performance has stalled, even sometimes decreased, over the last decade, and has been replaced by ever more parallel systems: both as conventional multicore CPUs and in the emerging world of accelerators. At the same time, the needs of scientists to couple ever-more complex dynamics and parametrisations into their models makes the model development task vastly more complex. The conventional approach of writing code in low level languages such as Fortran or C/C++ and then hand-coding parallelism for different platforms by adding library calls and directives forces the intermingling of the numerical code with its implementation. This results in an almost impossible set of skill requirements for developers, who must simultaneously be domain science experts, numericists, software engineers and parallelisation specialists. Even more critically, it requires code to be essentially rewritten for each emerging hardware platform. Since new platforms are emerging constantly, and since code owners do not usually control the procurement of the supercomputers on which they must run, this represents an unsustainable development load. The Firedrake system, conversely, offers the developer the opportunity to write PDE discretisations in the high-level mathematical language UFL from the FEniCS project (http://fenicsproject.org). Non-PDE model components, such as parametrisations

  19. STOCHSIMGPU: parallel stochastic simulation for the Systems Biology Toolbox 2 for MATLAB.

    PubMed

    Klingbeil, Guido; Erban, Radek; Giles, Mike; Maini, Philip K

    2011-04-15

    The importance of stochasticity in biological systems is becoming increasingly recognized and the computational cost of biologically realistic stochastic simulations urgently requires development of efficient software. We present a new software tool STOCHSIMGPU that exploits graphics processing units (GPUs) for parallel stochastic simulations of biological/chemical reaction systems and show that significant gains in efficiency can be made. It is integrated into MATLAB and works with the Systems Biology Toolbox 2 (SBTOOLBOX2) for MATLAB. The GPU-based parallel implementation of the Gillespie stochastic simulation algorithm (SSA), the logarithmic direct method (LDM) and the next reaction method (NRM) is approximately 85 times faster than the sequential implementation of the NRM on a central processing unit (CPU). Using our software does not require any changes to the user's models, since it acts as a direct replacement of the stochastic simulation software of the SBTOOLBOX2. The software is open source under the GPL v3 and available at http://www.maths.ox.ac.uk/cmb/STOCHSIMGPU. The web site also contains supplementary information. Contact: klingbeil@maths.ox.ac.uk. Supplementary data are available at Bioinformatics online.
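
    For readers unfamiliar with the Gillespie SSA named in this abstract, the following is a minimal sequential sketch of the direct method in Python. It is not STOCHSIMGPU code and does not use the SBTOOLBOX2 interface: the function names, argument layout and the single decay reaction A -> B are assumptions chosen for illustration only.

```python
import math
import random


def gillespie_direct(x0, stoich, propensities, t_end, rng):
    """Run one trajectory of the Gillespie direct method.

    x0           -- initial copy numbers, one entry per species
    stoich       -- stoich[j] is the state change caused by reaction j
    propensities -- function mapping a state to a list of reaction propensities
    t_end        -- simulation stops before the first event later than t_end
    rng          -- a random.Random instance
    """
    t, x = 0.0, list(x0)
    while True:
        a = propensities(x)
        a0 = sum(a)
        if a0 <= 0.0:  # no reaction can fire any more
            break
        # Exponentially distributed waiting time with rate a0.
        tau = -math.log(1.0 - rng.random()) / a0
        if t + tau > t_end:
            break
        t += tau
        # Pick reaction j with probability a[j] / a0.
        r = rng.random() * a0
        j, acc = 0, a[0]
        while acc <= r and j < len(a) - 1:
            j += 1
            acc += a[j]
        x = [xi + d for xi, d in zip(x, stoich[j])]
    return t, x


# Hypothetical model: A -> B with rate constant k = 1.0.
k = 1.0
final_t, final_x = gillespie_direct(
    x0=[10, 0],
    stoich=[[-1, 1]],
    propensities=lambda x: [k * x[0]],
    t_end=1e9,
    rng=random.Random(0),
)
```

    Independent trajectories of this kind share no state, which is what makes running many of them concurrently on a GPU attractive.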

  20. Relationship of phosphodiesterase 4D (PDE4D) gene polymorphisms with risk of ischemic stroke: a hospital based case-control study.

    PubMed

    Kumar, Amit; Misra, Shubham; Kumar, Pradeep; Sagar, Ram; Gulati, Arti; Prasad, Kameshwar

    2017-08-01

    Stroke remains a leading cause of death and disability worldwide. Ischemic stroke (IS) accounts for around 80-85% of total stroke and is a complex polygenic multi-factorial disorder which is affected by a complex combination of vascular, environmental, and genetic factors. The study was conducted with an aim to examine the relationship of single nucleotide polymorphisms (SNPs) of PDE4D (T83C, C87T, and C45T) gene with increasing risk of IS in patients in North Indian population. In this hospital-based case-control study, 250 IS subjects and 250 age- and sex-matched control subjects were enrolled from the Neurosciences Centre, A.I.I.M.S., New Delhi, India. Deoxyribonucleic acids (DNAs) were extracted using the conventional Phenol-Chloroform isolation method. Different genotypes were determined by Polymerase chain reaction-Restriction fragment length polymorphism method. Odds ratio (OR) and 95% Confidence Interval (CI) of relationship of polymorphisms with risk of IS were calculated by conditional multivariable regression analysis. High blood pressure, low socioeconomic status, dyslipidemia, diabetes, and family history of stroke were observed to be statistically significant risk factors for IS. Multivariable adjusted analysis demonstrated a statistically significant relationship between SNP 83 of PDE4D gene polymorphism and increasing odds of IS under the dominant model of inheritance (OR, 1.59; 95% CI, 1.02 to 2.50; p value = 0.04) after adjustment of potential confounding variables. Stratified analysis on the basis of TOAST classification demonstrated a statistically significant association for increasing 2.73 times odds for developing large vessel disease stroke as compared to controls (OR, 2.73; 95% CI, 1.16 to 0.02; p value = 0.02). We did not find any significant association of SNPs (C87T and C45T) of the PDE4D gene with the risk of IS. SNP 83 of PDE4D gene may increase the risk for developing IS whereas SNP 87 and SNP45 of PDE4D may not be associated with

  1. A “Reverse-Schur” Approach to Optimization With Linear PDE Constraints: Application to Biomolecule Analysis and Design

    PubMed Central

    Bardhan, Jaydeep P.; Altman, Michael D.

    2009-01-01

    We present a partial-differential-equation (PDE)-constrained approach for optimizing a molecule’s electrostatic interactions with a target molecule. The approach, which we call reverse-Schur co-optimization, can be more than two orders of magnitude faster than the traditional approach to electrostatic optimization. The efficiency of the co-optimization approach may enhance the value of electrostatic optimization for ligand-design efforts–in such projects, it is often desirable to screen many candidate ligands for their viability, and the optimization of electrostatic interactions can improve ligand binding affinity and specificity. The theoretical basis for electrostatic optimization derives from linear-response theory, most commonly continuum models, and simple assumptions about molecular binding processes. Although the theory has been used successfully to study a wide variety of molecular binding events, its implications have not yet been fully explored, in part due to the computational expense associated with the optimization. The co-optimization algorithm achieves improved performance by solving the optimization and electrostatic simulation problems simultaneously, and is applicable to both unconstrained and constrained optimization problems. Reverse-Schur co-optimization resembles other well-known techniques for solving optimization problems with PDE constraints. Model problems as well as realistic examples validate the reverse-Schur method, and demonstrate that our technique and alternative PDE-constrained methods scale very favorably compared to the standard approach. Regularization, which ordinarily requires an explicit representation of the objective function, can be included using an approximate Hessian calculated using the new BIBEE/P (boundary-integral-based electrostatics estimation by preconditioning) method. PMID:23055839

  2. Parallel Harmony Search Based Distributed Energy Resource Optimization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ceylan, Oguzhan; Liu, Guodong; Tomsovic, Kevin

    2015-01-01

    This paper presents a harmony search based parallel optimization algorithm to minimize voltage deviations in three phase unbalanced electrical distribution systems and to maximize active power outputs of distributed energy resources (DR). The main contribution is to reduce the adverse impacts on voltage profile during a day as photovoltaics (PVs) output or electric vehicles (EVs) charging changes throughout a day. The IEEE 123-bus distribution test system is modified by adding DRs and EVs under different load profiles. The simulation results show that by using parallel computing techniques, heuristic methods may be used as an alternative optimization tool in electrical power distribution systems operation.

  3. Roles of PDE1 in Pathological Cardiac Remodeling and Dysfunction.

    PubMed

    Chen, Si; Knight, Walter E; Yan, Chen

    2018-04-23

    Pathological cardiac hypertrophy and dysfunction is a response to various stress stimuli and can result in reduced cardiac output and heart failure. Cyclic nucleotide signaling regulates several cardiac functions including contractility, remodeling, and fibrosis. Cyclic nucleotide phosphodiesterases (PDEs), by catalyzing the hydrolysis of cyclic nucleotides, are critical in the homeostasis of intracellular cyclic nucleotide signaling and hold great therapeutic potential as drug targets. Recent studies have revealed that the inhibition of the PDE family member PDE1 plays a protective role in pathological cardiac remodeling and dysfunction by the modulation of distinct cyclic nucleotide signaling pathways. This review summarizes recent key findings regarding the roles of PDE1 in the cardiac system that can lead to a better understanding of its therapeutic potential.

  4. Rip3 knockdown rescues photoreceptor cell death in blind pde6c zebrafish.

    PubMed

    Viringipurampeer, I A; Shan, X; Gregory-Evans, K; Zhang, J P; Mohammadi, Z; Gregory-Evans, C Y

    2014-05-01

    Achromatopsia is a progressive autosomal recessive retinal disease characterized by early loss of cone photoreceptors and later rod photoreceptor loss. In most cases, mutations have been identified in CNGA3, CNGB3, GNAT2, PDE6C or PDE6H genes. Owing to this genetic heterogeneity, mutation-independent therapeutic schemes aimed at preventing cone cell death are very attractive treatment strategies. In pde6c(w59) mutant zebrafish, cone photoreceptors expressed high levels of receptor-interacting protein kinase 1 (RIP1) and receptor-interacting protein kinase 3 (RIP3) kinases, key regulators of necroptotic cell death. In contrast, rod photoreceptor cells were alternatively immunopositive for caspase-3 indicating activation of caspase-dependent apoptosis in these cells. Morpholino gene knockdown of rip3 in pde6c(w59) embryos rescued the dying cone photoreceptors by inhibiting the formation of reactive oxygen species and by inhibiting second-order neuron remodelling in the inner retina. In rip3 morphant larvae, visual function was restored in the cones by upregulation of the rod phosphodiesterase genes (pde6a and pde6b), compensating for the lack of cone pde6c suggesting that cones are able to adapt to their local environment. Furthermore, we demonstrated through pharmacological inhibition of RIP1 and RIP3 activity that cone cell death was also delayed. Collectively, these results demonstrate that the underlying mechanism of cone cell death in the pde6c(w59) mutant retina is through necroptosis, whereas rod photoreceptor bystander death occurs through a caspase-dependent mechanism. This suggests that targeting the RIP kinase signalling pathway could be an effective therapeutic intervention in retinal degeneration patients. As bystander cell death is an important feature of many retinal diseases, combinatorial approaches targeting different cell death pathways may evolve as an important general principle in treatment.

  5. Behavioral and neurochemical characterization of mice deficient in the phosphodiesterase-1B (PDE1B) enzyme.

    PubMed

    Siuciak, J A; McCarthy, S A; Chapin, D S; Reed, T M; Vorhees, C V; Repaske, D R

    2007-07-01

    PDE1B is a calcium-dependent cyclic nucleotide phosphodiesterase that is highly expressed in the striatum. In order to investigate the physiological role of PDE1B in the central nervous system, PDE1B knockout mice (C57BL/6N background) were assessed in behavioral tests and their brains were assayed for monoamine content. In a variety of well-characterized behavioral tasks, including the elevated plus maze (anxiety-like behavior), forced swim test (depression-like behavior), hot plate (nociception) and two cognition models (passive avoidance and acquisition of conditioned avoidance responding), PDE1B knockout mice performed similarly to wild-type mice. PDE1B knockout mice showed increased baseline exploratory activity when compared to wild-type mice. When challenged with amphetamine (AMPH) and methamphetamine (METH), male and female PDE1B knockout mice showed an exaggerated locomotor response. Male PDE1B knockout mice also showed increased locomotor responses to higher doses of phencyclidine (PCP) and MK-801; however, this effect was not consistently observed in female knockout mice. In the striatum, increased dopamine turnover (DOPAC/DA and HVA/DA ratios) was found in both male and female PDE1B knockout mice. Striatal serotonin (5-HT) levels were also decreased in PDE1B knockout mice, although levels of the metabolite, 5HIAA, were unchanged. The present studies demonstrate increased striatal dopamine turnover in PDE1B knockout mice associated with increased baseline motor activity and an exaggerated locomotor response to dopaminergic stimulants such as methamphetamine and amphetamine. These data further support a role for PDE1B in striatal function.

  6. GPU-based Space Situational Awareness Simulation utilising Parallelism for Enhanced Multi-sensor Management

    NASA Astrophysics Data System (ADS)

    Hobson, T.; Clarkson, V.

    2012-09-01

    As a result of continual space activity since the 1950s, there are now a large number of man-made Resident Space Objects (RSOs) orbiting the Earth. Because of the large number of items and their relative speeds, the possibility of destructive collisions involving important space assets is now of significant concern to users and operators of space-borne technologies. As a result, a growing number of international agencies are researching methods for improving techniques to maintain Space Situational Awareness (SSA). Computer simulation is a method commonly used by many countries to validate competing methodologies prior to full scale adoption. The use of supercomputing and/or reduced scale testing is often necessary to effectively simulate such a complex problem on today's computers. Recently the authors presented a simulation aimed at reducing the computational burden by selecting the minimum level of fidelity necessary for contrasting methodologies and by utilising multi-core CPU parallelism for increased computational efficiency. The resulting simulation runs on a single PC while maintaining the ability to effectively evaluate competing methodologies. Nonetheless, the ability to control the scale and expand upon the computational demands of the sensor management system is limited. In this paper, we examine the advantages of increasing the parallelism of the simulation by means of General Purpose computing on Graphics Processing Units (GPGPU). As many sub-processes pertaining to SSA management are independent, we demonstrate how parallelisation via GPGPU has the potential to significantly enhance not only research into techniques for maintaining SSA, but also to enhance the level of sophistication of existing space surveillance sensors and sensor management systems. Nonetheless, the use of GPGPU imposes certain limitations and adds to the implementation complexity, both of which require consideration to achieve an effective system. We discuss these challenges and

  7. PDE5 inhibitors blunt inflammation in human BPH: a potential mechanism of action for PDE5 inhibitors in LUTS.

    PubMed

    Vignozzi, Linda; Gacci, Mauro; Cellai, Ilaria; Morelli, Annamaria; Maneschi, Elena; Comeglio, Paolo; Santi, Raffaella; Filippi, Sandra; Sebastianelli, Arcangelo; Nesi, Gabriella; Serni, Sergio; Carini, Marco; Maggi, Mario

    2013-09-01

    Metabolic syndrome (MetS) and benign prostate hyperplasia (BPH)/lower urinary tract symptoms (LUTS) are often comorbid. Chronic inflammation is one of the putative links between these diseases. Phosphodiesterase type 5 inhibitors (PDE5i) are recognized as an effective treatment of BPH-related LUTS. One proposed mechanism of action of PDE5i is the inhibition of intraprostatic inflammation. In this study we investigate whether PDE5i could blunt inflammation in the human prostate. We evaluated the effect of tadalafil and vardenafil on secretion of interleukin 8 (IL-8, a surrogate marker of prostate inflammation) by human myofibroblast prostatic cells (hBPH) exposed to different inflammatory stimuli. We also performed a preliminary evaluation of histological features of prostatic inflammatory infiltrates in BPH patients enrolled in a randomized, double-blind, placebo-controlled study aimed at investigating the efficacy of vardenafil (10 mg/day, for 12 weeks) on BPH/LUTS. In vitro treatment with tadalafil or vardenafil on hBPH reduced IL-8 secretion induced by either TNFα or metabolic factors, including oxidized low-density lipoprotein, oxLDL, to the same extent as a PDE5-insensitive PKG agonist Sp-8-Br-PET-cGMP. These effects were reverted by the PKG inhibitor KT5823, suggesting a cGMP/PKG-dependency. Treatment with tadalafil or vardenafil significantly suppressed oxLDL receptor (LOX-1) expression. Histological evaluation of anti-CD45 staining (CD45 score) in prostatectomy specimens of BPH patients showed a positive association with MetS severity. Reduced HDL-cholesterol and elevated triglycerides were the only MetS factors significantly associated with CD45 score. In the MetS cohort there was a significantly lower CD45 score in the vardenafil arm versus the placebo arm. © 2013 Wiley Periodicals, Inc.

  8. Smoldyn on graphics processing units: massively parallel Brownian dynamics simulations.

    PubMed

    Dematté, Lorenzo

    2012-01-01

    Space is a very important aspect in the simulation of biochemical systems; recently, the need for simulation algorithms able to cope with space is becoming more and more compelling. Complex and detailed models of biochemical systems need to deal with the movement of single molecules and particles, taking into consideration localized fluctuations, transportation phenomena, and diffusion. A common drawback of spatial models lies in their complexity: models can become very large, and their simulation could be time consuming, especially if we want to capture the system's behavior in a reliable way using stochastic methods in conjunction with a high spatial resolution. In order to deliver on the promise made by systems biology to be able to understand a system as a whole, we need to scale up the size of models we are able to simulate, moving from sequential to parallel simulation algorithms. In this paper, we analyze Smoldyn, a widely used algorithm for stochastic simulation of chemical reactions with spatial resolution and single molecule detail, and we propose an alternative, innovative implementation that exploits the parallelism of Graphics Processing Units (GPUs). The implementation executes the most computationally demanding steps (computation of diffusion, unimolecular, and bimolecular reaction, as well as the most common cases of molecule-surface interaction) on the GPU, computing them in parallel on each molecule of the system. The implementation offers good speed-ups and real time, high quality graphics output

  9. PDE 5 inhibitor improves insulin sensitivity by enhancing mitochondrial function in adipocytes.

    PubMed

    Yu, Hea Min; Chung, Hyo Kyun; Kim, Koon Soon; Lee, Jae Min; Hong, Jun Hwa; Park, Kang Seo

    2017-11-04

    Adipocytes are involved in many metabolic disorders. It was recently reported that phosphodiesterase type 5 (PDE5) is expressed in human adipose tissue. In addition, PDE5 inhibitors have been shown to improve insulin sensitivity in humans. However, the mechanism underlying the role of PDE5 inhibitors as an insulin sensitizer remains largely unknown. The present study was undertaken to investigate the role of the PDE5 inhibitor udenafil in insulin signaling in adipocytes and whether this is mediated through the regulation of mitochondrial function. To study the mechanism underlying the insulin sensitizing action of PDE5 inhibitors, we evaluated quantitative changes in protein or mRNA levels of mitochondrial oxidative phosphorylation (OxPhos) complex, oxygen consumption rate (OCR), and fatty acid oxidation with varying udenafil concentrations in 3T3-L1 cells. Our cell study suggested that udenafil enhanced the insulin signaling pathway in 3T3-L1 cells. Following udenafil treatment, basal mitochondrial OCR, maximal OxPhos capacity, and OxPhos gene expression significantly increased. Finally, we examined whether udenafil can affect the fatty acid oxidation process. Treatment of 3T3-L1 cells with udenafil (10 and 20 μM) significantly increased fatty acid oxidation rate in a dose-dependent manner. In addition, the expression of peroxisome proliferator-activated receptor gamma coactivator 1-alpha (PGC-1α) significantly increased. We demonstrated that the PDE5 inhibitor udenafil enhances insulin sensitivity by improving mitochondrial function in 3T3-L1 cells. This might be the mechanism underlying the PDE5 inhibitor-enhanced insulin signaling in adipocytes. This also suggests that udenafil may provide benefit in the treatment of type 2 diabetes and other related cardiovascular diseases. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Enabling parallel simulation of large-scale HPC network systems

    DOE PAGES

    Mubarak, Misbah; Carothers, Christopher D.; Ross, Robert B.; ...

    2016-04-07

    Here, with the increasing complexity of today’s high-performance computing (HPC) architectures, simulation has become an indispensable tool for exploring the design space of HPC systems, in particular networks. In order to make effective design decisions, simulations of these systems must possess the following properties: (1) have high accuracy and fidelity, (2) produce results in a timely manner, and (3) be able to analyze a broad range of network workloads. Most state-of-the-art HPC network simulation frameworks, however, are constrained in one or more of these areas. In this work, we present a simulation framework for modeling two important classes of networks used in today’s IBM and Cray supercomputers: torus and dragonfly networks. We use the Co-Design of Multi-layer Exascale Storage Architecture (CODES) simulation framework to simulate these network topologies at a flit-level detail using the Rensselaer Optimistic Simulation System (ROSS) for parallel discrete-event simulation. Our simulation framework meets all the requirements of a practical network simulation and can assist network designers in design space exploration. First, it uses validated and detailed flit-level network models to provide an accurate and high-fidelity network simulation. Second, instead of relying on serial time-stepped or traditional conservative discrete-event simulations that limit simulation scalability and efficiency, we use the optimistic event-scheduling capability of ROSS to achieve efficient and scalable HPC network simulations on today’s high-performance cluster systems. Third, our models give network designers a choice in simulating a broad range of network workloads, including HPC application workloads using detailed network traces, an ability that is rarely offered in parallel with high-fidelity network simulations.

  11. Enabling parallel simulation of large-scale HPC network systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mubarak, Misbah; Carothers, Christopher D.; Ross, Robert B.

    Here, with the increasing complexity of today’s high-performance computing (HPC) architectures, simulation has become an indispensable tool for exploring the design space of HPC systems, in particular networks. In order to make effective design decisions, simulations of these systems must possess the following properties: (1) have high accuracy and fidelity, (2) produce results in a timely manner, and (3) be able to analyze a broad range of network workloads. Most state-of-the-art HPC network simulation frameworks, however, are constrained in one or more of these areas. In this work, we present a simulation framework for modeling two important classes of networks used in today’s IBM and Cray supercomputers: torus and dragonfly networks. We use the Co-Design of Multi-layer Exascale Storage Architecture (CODES) simulation framework to simulate these network topologies at a flit-level detail using the Rensselaer Optimistic Simulation System (ROSS) for parallel discrete-event simulation. Our simulation framework meets all the requirements of a practical network simulation and can assist network designers in design space exploration. First, it uses validated and detailed flit-level network models to provide an accurate and high-fidelity network simulation. Second, instead of relying on serial time-stepped or traditional conservative discrete-event simulations that limit simulation scalability and efficiency, we use the optimistic event-scheduling capability of ROSS to achieve efficient and scalable HPC network simulations on today’s high-performance cluster systems. Third, our models give network designers a choice in simulating a broad range of network workloads, including HPC application workloads using detailed network traces, an ability that is rarely offered in parallel with high-fidelity network simulations.

  12. Scalability study of parallel spatial direct numerical simulation code on IBM SP1 parallel supercomputer

    NASA Technical Reports Server (NTRS)

    Hanebutte, Ulf R.; Joslin, Ronald D.; Zubair, Mohammad

    1994-01-01

    The implementation and the performance of a parallel spatial direct numerical simulation (PSDNS) code are reported for the IBM SP1 supercomputer. The spatially evolving disturbances that are associated with laminar-to-turbulent transition in three-dimensional boundary-layer flows are computed with the PSDNS code. By remapping the distributed data structure during the course of the calculation, optimized serial library routines can be utilized that substantially increase the computational performance. Although the remapping incurs a high communication penalty, the parallel efficiency of the code remains above 40% for all performed calculations. By using appropriate compile options and optimized library routines, the serial code achieves 52-56 Mflops on a single node of the SP1 (45% of theoretical peak performance). The actual performance of the PSDNS code on the SP1 is evaluated with a 'real world' simulation that consists of 1.7 million grid points. One time step of this simulation is calculated on eight nodes of the SP1 in the same time as required by a Cray Y/MP for the same simulation. The scalability information provides estimated computational costs that match the actual costs relative to changes in the number of grid points.
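
    The performance figures quoted in this abstract can be related with a little arithmetic. The sketch below assumes the conventional definition of parallel efficiency, E = T_serial / (p * T_parallel); the abstract does not state which definition the authors used, and the timings in the example are hypothetical placeholders, not measurements from the report.

```python
def implied_peak_mflops(sustained_mflops, fraction_of_peak):
    """Peak rate implied by a sustained rate and its stated fraction of peak."""
    return sustained_mflops / fraction_of_peak


def parallel_efficiency(t_serial, t_parallel, n_procs):
    """Conventional parallel efficiency: speedup divided by processor count."""
    return t_serial / (n_procs * t_parallel)


# Figures from the abstract: 52-56 Mflops sustained, stated as 45% of peak.
peak_low = implied_peak_mflops(52.0, 0.45)
peak_high = implied_peak_mflops(56.0, 0.45)

# Hypothetical timings (assumed for illustration): 100 s serial, 25 s on 8 nodes.
example_efficiency = parallel_efficiency(100.0, 25.0, 8)
```

    With the abstract's numbers, the implied single-node peak falls roughly between 116 and 124 Mflops.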

  13. Inactivation of Pde8b enhances memory, motor performance, and protects against age-induced motor coordination decay

    PubMed Central

    Tsai, Li-Chun Lisa; Chan, Guy Chiu-Kai; Nangle, Shannon N.; Shimizu-Albergine, Masami; Jones, Graham; Storm, Daniel R.; Beavo, Joseph A.; Zweifel, Larry S.

    2012-01-01

    Phosphodiesterases (PDEs) are critical regulatory enzymes in cyclic nucleotide signaling. PDEs have diverse expression patterns within the central nervous system (CNS), show differing affinities for cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP), and regulate a vast array of behaviors. Here, we investigated the expression profile of the PDE8 gene family members Pde8a and Pde8b in the mouse brain. We find that Pde8a expression is largely absent in the CNS; by contrast, Pde8b is expressed in select regions of the hippocampus, ventral striatum, and cerebellum. Behavioral analysis of mice with Pde8b gene inactivation (PDE8B KO) demonstrates an enhancement in contextual fear, spatial memory, performance in an appetitive instrumental conditioning task, and motor coordination, as well as an attenuation of age-induced motor coordination decline. In addition to improvements observed in select behaviors, we find basal anxiety levels to be increased in PDE8B KO mice. These findings indicate that selective antagonism of PDE8B may be an attractive target for enhancement of cognitive and motor functions; however, possible alterations in affective state will need to be weighed against potential therapeutic value. PMID:22925203

  14. Regulation of ecto-apyrase CD39 (ENTPD1) expression by phosphodiesterase III (PDE3)

    PubMed Central

    Baek, Amy E.; Kanthi, Yogendra; Sutton, Nadia R.; Liao, Hui; Pinsky, David J.

    2013-01-01

    The ectoenzyme CD39 suppresses thrombosis and inflammation by hydrolyzing ATP and ADP to AMP. However, mechanisms of CD39 transcriptional and post-translational regulation are not well known. Here we show that CD39 levels are modulated by inhibition of phosphodiesterase 3 (PDE3). RAW macrophages and human umbilical vein endothelial cells (HUVECs) were treated with the PDE3 inhibitors cilostazol and milrinone, then analyzed using qRT-PCR, immunoprecipitation/Western blot, immunofluorescent staining, radio-thin-layer chromatography, a malachite green assay, and ELISA. HUVECs expressed elevated CD39 protein (2-fold [P<0.05] for cilostazol and 2.5-fold [P<0.01] for milrinone), while macrophage CD39 mRNA and protein were both elevated after PDE3 inhibition. HUVEC ATPase activity increased by 25% with cilostazol and milrinone treatment (P<0.05 and P<0.01, respectively), as did ADPase activity (47% and 61%, P<0.001). There was also a dose-dependent elevation of soluble CD39 after treatment with 8-Br-cAMP, with maximal elevation of 60% more CD39 present compared to controls (1 mM, P<0.001). Protein harvested after 8-Br-cAMP treatment showed that ubiquitination of CD39 was decreased by 43% compared to controls. A DMSO or PBS vehicle control was included for each experiment based on solubility of cilostazol, milrinone, and 8-Br-cAMP. These results indicate that PDE3 inhibition regulates endothelial CD39 at a post-translational level. PMID:23901069

  15. Parallel Simulation of Three-Dimensional Free Surface Fluid Flow Problems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    BAER,THOMAS A.; SACKINGER,PHILIP A.; SUBIA,SAMUEL R.

    1999-10-14

    Simulation of viscous three-dimensional fluid flow typically involves a large number of unknowns. When free surfaces are included, the number of unknowns increases dramatically. Consequently, this class of problem is an obvious application of parallel high performance computing. We describe parallel computation of viscous, incompressible, free surface, Newtonian fluid flow problems that include dynamic contact lines. The Galerkin finite element method was used to discretize the fully-coupled governing conservation equations and a "pseudo-solid" mesh mapping approach was used to determine the shape of the free surface. In this approach, the finite element mesh is allowed to deform to satisfy quasi-static solid mechanics equations subject to geometric or kinematic constraints on the boundaries. As a result, nodal displacements must be included in the set of unknowns. Other issues discussed are the proper constraints appearing along the dynamic contact line in three dimensions. Issues affecting efficient parallel simulations include problem decomposition to equally distribute computational work across an SPMD computer and determination of robust, scalable preconditioners for the distributed matrix systems that must be solved. Solution continuation strategies important for serial simulations have an enhanced relevance in a parallel computing environment due to the difficulty of solving large scale systems. Parallel computations will be demonstrated on an example taken from the coating flow industry: flow in the vicinity of a slot coater edge. This is a three dimensional free surface problem possessing a contact line that advances at the web speed in one region but transitions to static behavior in another region. As such, a significant fraction of the computational time is devoted to processing boundary data. Discussion focuses on parallel speed ups for fixed problem size, a class of problems of immediate practical importance.

  16. Simulating electron wave dynamics in graphene superlattices exploiting parallel processing advantages

    NASA Astrophysics Data System (ADS)

    Rodrigues, Manuel J.; Fernandes, David E.; Silveirinha, Mário G.; Falcão, Gabriel

    2018-01-01

    This work introduces a parallel computing framework to characterize the propagation of electron waves in graphene-based nanostructures. The electron wave dynamics is modeled using both "microscopic" and effective medium formalisms and the numerical solution of the two-dimensional massless Dirac equation is determined using a Finite-Difference Time-Domain scheme. The propagation of electron waves in graphene superlattices with localized scattering centers is studied, and the role of the symmetry of the microscopic potential in the electron velocity is discussed. The computational methodologies target the parallel capabilities of heterogeneous multi-core CPU and multi-GPU environments and are built with the OpenCL parallel programming framework which provides a portable, vendor agnostic and high throughput-performance solution. The proposed heterogeneous multi-GPU implementation achieves speedup ratios up to 75x when compared to multi-thread and multi-core CPU execution, reducing simulation times from several hours to a couple of minutes.

  17. Parallelization of Rocket Engine Simulator Software (PRESS)

    NASA Technical Reports Server (NTRS)

    Cezzar, Ruknet

    1997-01-01

    The Parallelization of Rocket Engine System Software (PRESS) project is part of a collaborative effort with Southern University at Baton Rouge (SUBR), University of West Florida (UWF), and Jackson State University (JSU). The second-year funding, which supports two graduate students enrolled in our new Master's program in Computer Science at Hampton University and the principal investigator, has been obtained for the period from October 19, 1996 through October 18, 1997. The key part of the interim report was new directions for the second year funding. This came about from discussions during the Rocket Engine Numeric Simulator (RENS) project meeting in Pensacola on January 17-18, 1997. At that time, a software agreement between Hampton University and NASA Lewis Research Center had already been concluded. That agreement concerns off-NASA-site experimentation with PUMPDES/TURBDES software. Before this agreement, during the first year of the project, another large-scale FORTRAN-based software package, Two-Dimensional Kinetics (TDK), was being used for translation to an object-oriented language and parallelization experiments. However, that package proved to be too complex and too poorly documented for an effective translation to object-oriented C++ source code. The focus, this time with the better documented and more manageable PUMPDES/TURBDES package, was still on translation to C++ with design improvements. At the RENS Meeting, however, the impetus for the RENS projects in general, and PRESS in particular, shifted in two important ways. One was closer alignment with the work on Numerical Propulsion System Simulator (NPSS) through cooperation and collaboration with LERC ACLU organization. The other was to see whether and how NASA's various rocket design software can be run over local networks and intranets without any radical efforts for redesign and translation into object-oriented source code. There were also suggestions that the Fortran based code be

  18. Particle simulation of plasmas on the massively parallel processor

    NASA Technical Reports Server (NTRS)

    Gledhill, I. M. A.; Storey, L. R. O.

    1987-01-01

    Particle simulations, in which collective phenomena in plasmas are studied by following the self consistent motions of many discrete particles, involve several highly repetitive sets of calculations that are readily adaptable to SIMD parallel processing. A fully electromagnetic, relativistic plasma simulation for the massively parallel processor is described. The particle motions are followed in 2 1/2 dimensions on a 128 x 128 grid, with periodic boundary conditions. The two dimensional simulation space is mapped directly onto the processor network; a Fast Fourier Transform is used to solve the field equations. Particle data are stored according to an Eulerian scheme, i.e., the information associated with each particle is moved from one local memory to another as the particle moves across the spatial grid. The method is applied to the study of the nonlinear development of the whistler instability in a magnetospheric plasma model, with an anisotropic electron temperature. The wave distribution function is included as a new diagnostic to allow simulation results to be compared with satellite observations.

  19. Novel PDE4 Inhibitors Derived from Chinese Medicine Forsythia

    PubMed Central

    Coon, Tiffany A.; McKelvey, Alison C.; Weathington, Nate M.; Birru, Rahel L.; Lear, Travis; Leikauf, George D.; Chen, Bill B.

    2014-01-01

    Cyclic adenosine monophosphate (cAMP) is a crucial intracellular second messenger molecule that converts extracellular molecules to intracellular signal transduction pathways generating cell- and stimulus-specific effects. Importantly, specific phosphodiesterase (PDE) subtypes control the amplitude and duration of cAMP-induced physiological processes and are therefore a prominent pharmacological target currently used in a variety of fields. Here we tested the extracts from a traditional Chinese medicine, Forsythia suspensa seeds, which have been used for more than 2000 years to relieve respiratory symptoms. Using structural-functional analysis we found that its major lignan, Forsythin, acted as an immunosuppressant by inhibiting PDE4 in inflammatory and immune cells. Moreover, several novel, selective small molecule derivatives of Forsythin were tested in vitro and in murine models of viral and bacterial pneumonia, sepsis and cytokine-driven systemic inflammation. Thus, pharmacological targeting of PDE4 may be a promising strategy for immune-related disorders characterized by amplified host inflammatory response. PMID:25549252

  20. Parallel Performance of a Combustion Chemistry Simulation

    DOE PAGES

    Skinner, Gregg; Eigenmann, Rudolf

    1995-01-01

    We used a description of a combustion simulation's mathematical and computational methods to develop a version for parallel execution. The result was a reasonable performance improvement on small numbers of processors. We applied several important programming techniques, which we describe, in optimizing the application. This work has implications for programming languages, compiler design, and software engineering.

  1. Parallel Numerical Simulations of Water Reservoirs

    NASA Astrophysics Data System (ADS)

    Torres, Pedro; Mangiavacchi, Norberto

    2010-11-01

    The study of the water flow and scalar transport in water reservoirs is important for the determination of the water quality during the initial stages of the reservoir filling and during the life of the reservoir. For this purpose, a parallel 2D finite element code for solving the incompressible Navier-Stokes equations coupled with scalar transport was implemented using the message-passing programming model, in order to perform simulations of hydropower water reservoirs in a computer cluster environment. The spatial discretization is based on the MINI element that satisfies the Babuska-Brezzi (BB) condition, which provides sufficient conditions for a stable mixed formulation. All the distributed data structures needed in the different stages of the code, such as preprocessing, solving and post processing, were implemented using the PETSc library. The resulting linear systems for the velocity and the pressure fields were solved using the projection method, implemented by an approximate block LU factorization. In order to increase the parallel performance in the solution of the linear systems, we employ the static condensation method for solving the intermediate velocity at vertex and centroid nodes separately. We compare performance results of the static condensation method with the approach of solving the complete system. In our tests the static condensation method shows better performance for large problems, at the cost of an increased memory usage. Performance results for other intensive parts of the code in a computer cluster are also presented.

  2. Synthesis and biological evaluation of 5-carbamoyl-2-phenylpyrimidine derivatives as novel and potent PDE4 inhibitors.

    PubMed

    Goto, Taiji; Shiina, Akiko; Yoshino, Toshiharu; Mizukami, Kiyoshi; Hirahara, Kazuki; Suzuki, Osamu; Sogawa, Yoshitaka; Takahashi, Tomoko; Mikkaichi, Tsuyoshi; Nakao, Naoki; Takahashi, Mizuki; Hasegawa, Masashi; Sasaki, Shigeki

    2013-11-15

    5-Carbamoyl-2-phenylpyrimidine derivative 2 has been identified as a phosphodiesterase 4 (PDE4) inhibitor with moderate PDE4B inhibitory activity (IC50=200 nM). Modification of the carboxylic acid moiety of 2 gave N-neopentylacetamide derivative 10f, which had high in vitro PDE4B inhibitory activity (IC50=8.3 nM) and in vivo efficacy against lipopolysaccharide (LPS)-induced pulmonary neutrophilia in mice (ID50=16 mg/kg, ip). Furthermore, based on the X-ray crystallography of 10f bound to the human PDE4B catalytic domain, we designed 7,8-dihydro-6H-pyrido[4,3-d]pyrimidin-5-one derivative 39 which has a fused bicyclic lactam scaffold. Compound 39 exhibited excellent inhibitory activity against LPS-induced tumor necrosis factor alpha (TNF-α) production in mouse splenocytes (IC50=0.21 nM) and in vivo anti-inflammatory activity against LPS-induced pulmonary neutrophilia in mice (41% inhibition at a dose of 1.0 mg/kg, i.t.). Copyright © 2013 Elsevier Ltd. All rights reserved.

  3. Mutation in rod PDE6 linked to congenital stationary night blindness impairs the enzyme inhibition by its gamma-subunit.

    PubMed

    Muradov, Khakim G; Granovsky, Alexey E; Artemyev, Nikolai O

    2003-03-25

    Photoreceptor cGMP phosphodiesterase (PDE6) is the effector enzyme in the vertebrate visual transduction cascade. The activity of rod PDE6 catalytic alpha- and beta-subunits is blocked in the dark by two inhibitory Pgamma-subunits. The inhibition is released upon light-stimulation of photoreceptor cells. Mutation H258N in PDE6beta has been linked to congenital stationary night blindness (CSNB) in a large Danish family (Rambusch pedigree) (Gal, A., Orth, U., Baehr, W., Schwinger, E., and Rosenberg, T. (1994) Nat. Genet. 7, 64-67). We have analyzed the consequences of this mutation for PDE6 function using a Pgamma-sensitive PDE6alpha'/PDE5 chimera, Chi16. Biochemical analysis of the H257N mutant, an equivalent of PDE6betaH258N, demonstrates that this substitution does not alter the ability of chimeric PDE to dimerize or the enzyme's catalytic properties. The sensitivity of H257N to a competitive inhibitor zaprinast was also unaffected. However, the mutant displayed a significant impairment in the inhibitory interaction with Pgamma, which was apparent from an approximately 20-fold increase in the K(i) value (46 nM) and incomplete maximal inhibition. The inhibitory defect of H257N is not due to perturbation of noncatalytic cGMP binding to the PDE6alpha' GAF domains. The noncatalytic cGMP-binding characteristics of the H257N mutant were similar to those of the parent PDE6alpha'/PDE5 chimera. Since rod PDE6 in the Rambusch CSNB is a catalytic heterodimer of the wild-type PDE6alpha and mutant PDE6beta, Chi16 and H257N were coexpressed, and a heterodimeric PDE, Chi16/H257N, was isolated. It displayed two Pgamma inhibitory sites with the K(i) values of 5 and 57 nM. Our results support the hypothesis that mutation H258N in PDE6beta causes CSNB through incomplete inhibition of PDE6 activity by Pgamma, which leads to desensitization of rod photoreceptors.

  4. [Update of PDE5 inhibitors as treatment of ED].

    PubMed

    Lu, Yong-ning; Chen, Bin

    2005-07-01

    Erectile dysfunction (ED) is a common ailment in middle-aged and older men. The management of ED has entered a new stage since sildenafil was first used to treat ED in 1998. Sildenafil became the first-line treatment because of its efficacy and safety. In recent years, two new PDE5 inhibitors, vardenafil and tadalafil, came onto the market in succession, providing more options for oral therapy. This review covers the development of preclinical and clinical research on the three PDE5 inhibitors and provides information for clinical choices.

  5. MaMiCo: Software design for parallel molecular-continuum flow simulations

    NASA Astrophysics Data System (ADS)

    Neumann, Philipp; Flohr, Hanno; Arora, Rahul; Jarmatz, Piet; Tchipev, Nikola; Bungartz, Hans-Joachim

    2016-03-01

    The macro-micro-coupling tool (MaMiCo) was developed to ease the development of and modularize molecular-continuum simulations, retaining sequential and parallel performance. We demonstrate the functionality and performance of MaMiCo by coupling the spatially adaptive Lattice Boltzmann framework waLBerla with four molecular dynamics (MD) codes: the light-weight Lennard-Jones-based implementation SimpleMD, the node-level optimized software ls1 mardyn, and the community codes ESPResSo and LAMMPS. We detail interface implementations to connect each solver with MaMiCo. The coupling for each waLBerla-MD setup is validated in three-dimensional channel flow simulations which are solved by means of a state-based coupling method. We provide sequential and strong scaling measurements for the four molecular-continuum simulations. The overhead of MaMiCo is found to come at 10%-20% of the total (MD) runtime. The measurements further show that scalability of the hybrid simulations is reached on up to 500 Intel SandyBridge, and more than 1000 AMD Bulldozer compute cores.

  6. Genome-wide Association Analysis Identifies PDE4D as an Asthma-Susceptibility Gene

    PubMed Central

    Himes, Blanca E.; Hunninghake, Gary M.; Baurley, James W.; Rafaels, Nicholas M.; Sleiman, Patrick; Strachan, David P.; Wilk, Jemma B.; Willis-Owen, Saffron A.G.; Klanderman, Barbara; Lasky-Su, Jessica; Lazarus, Ross; Murphy, Amy J.; Soto-Quiros, Manuel E.; Avila, Lydiana; Beaty, Terri; Mathias, Rasika A.; Ruczinski, Ingo; Barnes, Kathleen C.; Celedón, Juan C.; Cookson, William O.C.; Gauderman, W. James; Gilliland, Frank D.; Hakonarson, Hakon; Lange, Christoph; Moffatt, Miriam F.; O'Connor, George T.; Raby, Benjamin A.; Silverman, Edwin K.; Weiss, Scott T.

    2009-01-01

    Asthma, a chronic airway disease with known heritability, affects more than 300 million people around the world. A genome-wide association (GWA) study of asthma with 359 cases from the Childhood Asthma Management Program (CAMP) and 846 genetically matched controls from the Illumina ICONdb public resource was performed. The strongest region of association seen was on chromosome 5q12 in PDE4D. The phosphodiesterase 4D, cAMP-specific (phosphodiesterase E3 dunce homolog, Drosophila) gene (PDE4D) is a regulator of airway smooth-muscle contractility, and PDE4 inhibitors have been developed as medications for asthma. Allelic p values for top SNPs in this region were 4.3 × 10^-7 for rs1588265 and 9.7 × 10^-7 for rs1544791. Replications were investigated in ten independent populations with different ethnicities, study designs, and definitions of asthma. In seven white and Hispanic replication populations, two PDE4D SNPs had significant results with p values less than 0.05, and five had results in the same direction as the original population but had p values greater than 0.05. Combined p values for 18,891 white and Hispanic individuals (4,342 cases) in our replication populations were 4.1 × 10^-4 for rs1588265 and 9.2 × 10^-4 for rs1544791. In three black replication populations, which had different linkage disequilibrium patterns than the other populations, original findings were not replicated. Further study of PDE4D variants might lead to improved understanding of the role of PDE4D in asthma pathophysiology and the efficacy of PDE4 inhibitor medications. PMID:19426955

  7. Effect of Operating Frequency on PDE Driven Ejector Thrust Performance

    NASA Technical Reports Server (NTRS)

    Santoro, Robert J.; Pal, Sibtosh; Landry, K.; Shehadeh, R.; Bouvet, N.; Lee, S.-Y.

    2005-01-01

    Results of an on-going study of pulse detonation engine driven ejectors are presented and discussed. The experiments were conducted using a pulse detonation engine (PDE) designed to operate at frequencies up to 50 Hz. The PDE used in these experiments utilizes an equi-molar mixture of oxygen and nitrogen as the oxidizer, and ethylene (C2H4) as the fuel, with the propellant mixture having an equivalence ratio of one. A line of sight laser absorption technique was used to determine the time needed for proper filling of the tube. Thrust measurements were made using an integrated spring damper system coupled with a linear variable displacement transducer. The baseline thrust of the PDE was first measured at each desired frequency and agrees with experimental and modeling results found in the literature. Thrust augmentation measurements were then made for constant diameter ejectors. The ejectors had varying lengths, and two different inlet geometries were tested for each ejector configuration. The parameter space for the study included PDE operation frequency, ejector length, overlap distance and the radius of curvature for the ejector inlets. For the studied experimental matrix, the results showed a maximum thrust augmentation of 106% at an operational frequency of 30 Hz.

  8. Parallel Grand Canonical Monte Carlo (ParaGrandMC) Simulation Code

    NASA Technical Reports Server (NTRS)

    Yamakov, Vesselin I.

    2016-01-01

    This report provides an overview of the Parallel Grand Canonical Monte Carlo (ParaGrandMC) simulation code. This is a highly scalable parallel FORTRAN code for simulating the thermodynamic evolution of metal alloy systems at the atomic level, and predicting the thermodynamic state, phase diagram, chemical composition and mechanical properties. The code is designed to simulate multi-component alloy systems, predict solid-state phase transformations such as austenite-martensite transformations, precipitate formation, recrystallization, capillary effects at interfaces, surface absorption, etc., which can aid the design of novel metallic alloys. While the software is mainly tailored for modeling metal alloys, it can also be used for other types of solid-state systems, and to some degree for liquid or gaseous systems, including multiphase systems forming solid-liquid-gas interfaces.

  9. Effect of Operating Frequency and Fill Time on PDE-Ejector Thrust Performance

    NASA Technical Reports Server (NTRS)

    Landry, K.; Santoro, Robert J.; Pal, Sibtosh; Shehadeh, R.; Bouvet, N.; Lee, S.-Y.

    2005-01-01

    Thrust measurements for a pulse detonation engine (PDE)-ejector system were determined for a range of operating frequencies. Tubular ejectors of various lengths were utilized. The results were compared to the measurements of the thrust output of the PDE alone to determine the enhancement provided by each ejector configuration at the specified frequencies. Ethylene was chosen as the fuel, with an equi-molar mixture of nitrogen and oxygen acting as the oxidizer. The propellant was kept at an equivalence ratio of one during all the experiments. The system was operated for frequencies between 20 and 50 Hz. The parameter space of the study included PDE operation frequency, ejector length, overlap percentage, the radius of curvature for the ejector inlets, and duration of the time allowed between cycles. The results of the experiments showed a maximum thrust augmentation of 120% for a PDE-ejector configuration at a frequency of 40 Hz with a fill time of 10 ms.

  10. Traffic Simulations on Parallel Computers Using Domain Decomposition Techniques

    DOT National Transportation Integrated Search

    1995-01-01

    Large scale simulations of Intelligent Transportation Systems (ITS) can only be achieved by using the computing resources offered by parallel computing architectures. Domain decomposition techniques are proposed which allow the performance of traffic...

  11. Selective Effects of PDE10A Inhibitors on Striatopallidal Neurons Require Phosphatase Inhibition by DARPP-32

    PubMed Central

    Polito, Marina; Guiot, Elvire; Gangarossa, Giuseppe; Longueville, Sophie; Doulazmi, Mohamed; Valjent, Emmanuel; Hervé, Denis; Girault, Jean-Antoine

    2015-01-01

    Type 10A phosphodiesterase (PDE10A) is highly expressed in the striatum, in striatonigral and striatopallidal medium-sized spiny neurons (MSNs), which express D1 and D2 dopamine receptors, respectively. PDE10A inhibitors have pharmacological and behavioral effects suggesting an antipsychotic profile, but the cellular bases of these effects are unclear. We analyzed the effects of PDE10A inhibition in vivo by immunohistochemistry, and imaged cAMP, cAMP-dependent protein kinase A (PKA), and cGMP signals with biosensors in mouse brain slices. PDE10A inhibition in mouse striatal slices produced a steady-state increase in intracellular cAMP concentration in D1 and D2 MSNs, demonstrating that PDE10A regulates basal cAMP levels. Surprisingly, the PKA-dependent AKAR3 phosphorylation signal was strong in D2 MSNs, whereas D1 MSNs remained unresponsive. This effect was also observed in adult mice in vivo since PDE10A inhibition increased phospho-histone H3 immunoreactivity selectively in D2 MSNs in the dorsomedial striatum. The PKA-dependent effects in D2 MSNs were prevented in brain slices and in vivo by mutation of the PKA-regulated phosphorylation site of 32 kDa dopamine- and cAMP-regulated phosphoprotein (DARPP-32), which is required for protein phosphatase-1 inhibition. These data highlight differences in the integration of the cAMP signal in D1 and D2 MSNs, resulting from stronger inhibition of protein phosphatase-1 by DARPP-32 in D2 MSNs than in D1 MSNs. This study shows that PDE10A inhibitors share with antipsychotic medications the property of activating preferentially PKA-dependent signaling in D2 MSNs. PMID:26465004

  12. Xyce Parallel Electronic Simulator Users' Guide Version 6.6.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Aadithya, Karthik Venkatraman; Mei, Ting

    This manual describes the use of the Xyce Parallel Electronic Simulator. Xyce has been designed as a SPICE-compatible, high-performance analog circuit simulator, and has been written to support the simulation needs of the Sandia National Laboratories electrical designers. This development has focused on improving capability over the current state-of-the-art in the following areas: (1) Capability to solve extremely large circuit problems by supporting large-scale parallel computing platforms (up to thousands of processors). This includes support for most popular parallel and serial computers. (2) A differential-algebraic-equation (DAE) formulation, which better isolates the device model package from solver algorithms. This allows one to develop new types of analysis without requiring the implementation of analysis-specific device models. (3) Device models that are specifically tailored to meet Sandia's needs, including some radiation-aware devices (for Sandia users only). (4) Object-oriented code design and implementation using modern coding practices. Xyce is a parallel code in the most general sense of the phrase -- a message passing parallel implementation -- which allows it to run efficiently on a wide range of computing platforms. These include serial, shared-memory and distributed-memory parallel platforms. Attention has been paid to the specific nature of circuit-simulation problems to ensure that optimal parallel efficiency is achieved as the number of processors grows. The information herein is subject to change without notice. Copyright © 2002-2016 Sandia Corporation. All rights reserved. Acknowledgements: The BSIM Group at the University of California, Berkeley developed the BSIM3, BSIM4, BSIM6, BSIM-CMG and BSIM-SOI models. The BSIM3 is Copyright © 1999, Regents of the University of California. The BSIM4 is Copyright © 2006, Regents of the University of California. The BSIM6 is Copyright © 2015, Regents of the University of California. The BSIM-CMG is Copyright

  13. A new method for extracting near-surface mass-density anomalies from land-based gravity data, based on a special case of Poisson's PDE at the Earth's surface: A case study of salt diapirs in the south of Iran

    NASA Astrophysics Data System (ADS)

    AllahTavakoli, Y.; Safari, A.; Ardalan, A.; Bahroudi, A.

    2015-12-01

    The current research provides a method for tracking near-surface mass-density anomalies using only land-based gravity data, based on a special version of Poisson's Partial Differential Equation (PDE) of the gravitational field at the Earth's surface. The research demonstrates how Poisson's PDE makes it possible to extract the near-surface mass-density anomalies from land-based gravity data. This version of Poisson's PDE is mathematically introduced at the Earth's surface and then used to develop the new method for approximating the mass-density via derivatives of the Earth's gravitational field (i.e. via the gradient tensor). The author believes that the PDE can give new knowledge about the behavior of the Earth's gravitational field at the Earth's surface, which can be useful for developing new methods of determining the Earth's mass-density. In a case study, the proposed method is applied to a set of gravity stations located in the south of Iran. The results were numerically validated via certain knowledge about the geological structures in the area of the case study. Also, the method was compared with two standard methods of mass-density determination. All the numerical experiments show that the proposed approach is well-suited for tracking near-surface mass-density anomalies using only the gravity data. Finally, the approach is also applied to some petroleum exploration studies of salt diapirs in the south of Iran.

  14. Advantages of multigrid methods for certifying the accuracy of PDE modeling

    NASA Technical Reports Server (NTRS)

    Forester, C. K.

    1981-01-01

    Numerical techniques for assessing and certifying the accuracy of the modeling of partial differential equations (PDE) to the user's specifications are analyzed. Examples of the certification process with conventional techniques are summarized for the three dimensional steady state full potential and the two dimensional steady Navier-Stokes equations using fixed grid methods (FG). The advantages of the Full Approximation Storage (FAS) scheme of the multigrid technique of A. Brandt compared with the conventional certification process of modeling PDE are illustrated in one dimension with the transformed potential equation. Inferences are drawn for how MG will improve the certification process of the numerical modeling of two and three dimensional PDE systems. Elements of the error assessment process that are common to FG and MG are analyzed.

  15. Efficient parallelization of analytic bond-order potentials for large-scale atomistic simulations

    NASA Astrophysics Data System (ADS)

    Teijeiro, C.; Hammerschmidt, T.; Drautz, R.; Sutmann, G.

    2016-07-01

    Analytic bond-order potentials (BOPs) provide a way to compute atomistic properties with controllable accuracy. For large-scale computations of heterogeneous compounds at the atomistic level, both the computational efficiency and memory demand of BOP implementations have to be optimized. Since the evaluation of BOPs is a local operation within a finite environment, the parallelization concepts known from short-range interacting particle simulations can be applied to improve the performance of these simulations. In this work, several efficient parallelization methods for BOPs that use three-dimensional domain decomposition schemes are described. The schemes are implemented into the bond-order potential code BOPfox, and their performance is measured in a series of benchmarks. Systems of up to several millions of atoms are simulated on a high performance computing system, and parallel scaling is demonstrated for up to thousands of processors.
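
    The three-dimensional domain decomposition mentioned in the record above can be illustrated by mapping each atom to the subdomain (and hence processor rank) that contains it. This is a minimal sketch under stated assumptions: a periodic orthorhombic box, a regular grid of subdomains, and row-major rank numbering. The names `owner_rank` and `decompose` are hypothetical, and the code does not reproduce BOPfox's actual schemes.

```python
def owner_rank(position, box, grid):
    """Return the rank of the subdomain containing `position`.

    position: (x, y, z) coordinates of one atom
    box:      edge lengths of the (assumed periodic) simulation box
    grid:     number of subdomains (px, py, pz) along each axis
    Ranks are numbered in row-major order (x slowest, z fastest).
    """
    cell = []
    for x, length, parts in zip(position, box, grid):
        frac = (x % length) / length  # wrap the coordinate into the box
        # min() guards against rounding that lands exactly on the upper edge
        cell.append(min(int(frac * parts), parts - 1))
    ix, iy, iz = cell
    return (ix * grid[1] + iy) * grid[2] + iz


def decompose(positions, box, grid):
    """Group atom indices by the rank that owns them."""
    owners = {}
    for index, pos in enumerate(positions):
        owners.setdefault(owner_rank(pos, box, grid), []).append(index)
    return owners


# Hypothetical example: four atoms in a 10 x 10 x 10 box split into 2 x 2 x 2.
atoms = [(1.0, 1.0, 1.0), (6.0, 1.0, 1.0), (1.0, 6.0, 6.0), (-1.0, 1.0, 1.0)]
print(decompose(atoms, (10.0, 10.0, 10.0), (2, 2, 2)))
```

    Because each potential evaluation is local, a rank additionally needs copies of the atoms within the interaction range of its subdomain boundary; that halo exchange is omitted here.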

  16. Alternating Direction Implicit (ADI) schemes for a PDE-based image osmosis model

    NASA Astrophysics Data System (ADS)

    Calatroni, L.; Estatico, C.; Garibaldi, N.; Parisotto, S.

    2017-10-01

    We consider Alternating Direction Implicit (ADI) splitting schemes to compute efficiently the numerical solution of the PDE osmosis model considered by Weickert et al. in [10] for several imaging applications. The discretised scheme is shown to preserve analogous properties to the continuous model. Numerically, the dimensional splitting strategy translates into the solution of simple tridiagonal systems for which standard matrix factorisation techniques can be used to improve upon the performance of classical implicit methods, even for large time steps. Applications to the shadow removal problem are presented.
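
    The tridiagonal solves mentioned in this record can be illustrated with the Thomas algorithm. The sketch below is not the authors' scheme: it applies one implicit step of plain diffusion (no osmosis drift term) along a single grid line, and the function names, the zero-flux boundary rows and the parameter r = dt/dx^2 are assumptions made for illustration. An ADI step would repeat such line solves along rows and then along columns.

```python
def solve_tridiagonal(lower, diag, upper, rhs):
    """Solve a tridiagonal system with the Thomas algorithm.

    lower[i] multiplies x[i-1] in row i (lower[0] is unused),
    upper[i] multiplies x[i+1] in row i (upper[-1] is unused).
    No pivoting: assumes a diagonally dominant matrix.
    """
    n = len(diag)
    c = [0.0] * n  # modified super-diagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    # Forward elimination.
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    # Back substitution.
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def implicit_diffusion_step_1d(u, r):
    """One backward-Euler step of u_t = u_xx along a single grid line.

    r = dt / dx**2; zero-flux ends (a hypothetical boundary choice).
    """
    n = len(u)
    lower = [0.0] + [-r] * (n - 1)
    upper = [-r] * (n - 1) + [0.0]
    diag = [1.0 + 2.0 * r] * n
    diag[0] = diag[-1] = 1.0 + r  # zero-flux boundary rows
    return solve_tridiagonal(lower, diag, upper, u)
```

    Each line solve costs O(n) operations, which is why the implicit step remains cheap even when r is large.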

  17. Massively Parallel Simulations of Diffusion in Dense Polymeric Structures

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Faulon, Jean-Loup; Wilcox, R.T.

    1997-11-01

    An original computational technique to generate close-to-equilibrium dense polymeric structures is proposed. Diffusion of small gases is studied on the equilibrated structures using massively parallel molecular dynamics simulations running on the Intel Teraflops (9216 Pentium Pro processors) and Intel Paragon (1840 processors). Compared to the current state-of-the-art equilibration methods, this new technique appears to be faster by some orders of magnitude. The main advantage of the technique is that one can circumvent the bottlenecks in configuration space that inhibit relaxation in molecular dynamics simulations. The technique is based on the fact that tetravalent atoms (such as carbon and silicon) fit in the center of a regular tetrahedron and that regular tetrahedrons can be used to mesh the three-dimensional space. Thus, the problem of polymer equilibration described by continuous equations in molecular dynamics is reduced to a discrete problem where solutions are approximated by simple algorithms. Practical modeling applications include the construction of butyl rubber and ethylene-propylene-diene-monomer (EPDM) models for oxygen and water diffusion calculations. Butyl and EPDM are used in O-ring systems and serve as sealing joints in many manufactured objects. Diffusion coefficients of small gases have been measured experimentally on both polymeric systems, and in general the diffusion coefficients in EPDM are an order of magnitude larger than in butyl. In order to better understand the diffusion phenomena, 10,000-atom models were generated and equilibrated for butyl and EPDM. The models were submitted to a massively parallel molecular dynamics simulation to monitor the trajectories of the diffusing species.

  18. A parallel implementation of an off-lattice individual-based model of multicellular populations

    NASA Astrophysics Data System (ADS)

    Harvey, Daniel G.; Fletcher, Alexander G.; Osborne, James M.; Pitt-Francis, Joe

    2015-07-01

    As computational models of multicellular populations include ever more detailed descriptions of biophysical and biochemical processes, the computational cost of simulating such models limits their ability to generate novel scientific hypotheses and testable predictions. While developments in microchip technology continue to increase the power of individual processors, parallel computing offers an immediate increase in available processing power. To make full use of parallel computing technology, it is necessary to develop specialised algorithms. To this end, we present a parallel algorithm for a class of off-lattice individual-based models of multicellular populations. The algorithm divides the spatial domain between computing processes and comprises communication routines that ensure the model is correctly simulated on multiple processors. The parallel algorithm is shown to accurately reproduce the results of a deterministic simulation performed using a pre-existing serial implementation. We test the scaling of computation time, memory use and load balancing as more processes are used to simulate a cell population of fixed size. We find approximate linear scaling of both speed-up and memory consumption on up to 32 processor cores. Dynamic load balancing is shown to provide speed-up for non-regular spatial distributions of cells in the case of a growing population.
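
    The idea of dividing the spatial domain between processes can be sketched as follows. This is a minimal serial illustration, not the algorithm of the record: the strip layout along x, the function name and the halo_width parameter (assumed to be no larger than one strip width) are hypothetical. It only computes which cells each process would own and which neighbouring cells it would need to receive.

```python
def decompose_strips(positions, num_procs, x_min, x_max, halo_width):
    """Assign cells to vertical strips and list halo copies.

    positions: list of (x, y) cell centres.
    Returns (owned, halo): owned[p] holds indices of cells inside strip p,
    halo[p] holds indices owned by a neighbour but within halo_width of
    strip p, i.e. the cells whose data p would need to receive.
    """
    width = (x_max - x_min) / num_procs
    owned = [[] for _ in range(num_procs)]
    halo = [[] for _ in range(num_procs)]
    for idx, (x, _y) in enumerate(positions):
        # Clamp so that a cell exactly at x_max lands in the last strip.
        p = min(int((x - x_min) / width), num_procs - 1)
        owned[p].append(idx)
        left_edge = x_min + p * width
        right_edge = left_edge + width
        if p > 0 and x - left_edge < halo_width:
            halo[p - 1].append(idx)  # left neighbour needs a copy
        if p < num_procs - 1 and right_edge - x < halo_width:
            halo[p + 1].append(idx)  # right neighbour needs a copy
    return owned, halo
```

    In a distributed run, the halo lists would drive the communication step, and uneven lengths of the owned lists would signal the need for load balancing.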

  19. In Silico Investigations of Chemical Constituents of Clerodendrum colebrookianum in the Anti-Hypertensive Drug Targets: ROCK, ACE, and PDE5.

    PubMed

    Arya, Hemant; Syed, Safiulla Basha; Singh, Sorokhaibam Sureshkumar; Ampasala, Dinakar R; Coumar, Mohane Selvaraj

    2017-06-16

    Understanding the molecular mode of action of natural products is a key step for developing drugs from them. In this regard, this study aims to understand the molecular-level interactions of chemical constituents of Clerodendrum colebrookianum Walp. with anti-hypertensive drug targets using computational approaches. The plant has ethno-medicinal importance for the treatment of hypertension and is reported to show activity against the anti-hypertensive drug targets Rho-associated coiled-coil protein kinase (ROCK), angiotensin-converting enzyme (ACE), and phosphodiesterase 5 (PDE5). Docking studies showed that three chemical constituents (acteoside, martinoside, and osmanthuside β6) out of 21 reported from the plant interact with the anti-hypertensive drug targets with good glide scores. In addition, they formed H-bond interactions with the key residues Met156/Met157 of ROCK I/ROCK II and Gln817 of PDE5. Further, molecular dynamics (MD) simulations of protein-ligand complexes suggest that H-bond interactions between acteoside/osmanthuside β6 and Met156/Met157 (ROCK I/ROCK II), and between acteoside and Gln817 (PDE5), were stable. The present investigation suggests that the anti-hypertensive activity of the plant is due to the interaction of acteoside and osmanthuside β6 with ROCK and PDE5 drug targets. The identified molecular mode of binding of the plant constituents could help to design new drugs to treat hypertension.

  20. A parallel finite element procedure for contact-impact problems using edge-based smooth triangular element and GPU

    NASA Astrophysics Data System (ADS)

    Cai, Yong; Cui, Xiangyang; Li, Guangyao; Liu, Wenyang

    2018-04-01

    The edge-smooth finite element method (ES-FEM) can improve the computational accuracy of triangular shell elements and the mesh partition efficiency of complex models. In this paper, an approach is developed to perform explicit finite element simulations of contact-impact problems with a graphical processing unit (GPU) using a special edge-smooth triangular shell element based on ES-FEM. Of critical importance for this problem is achieving finer-grained parallelism to enable efficient data loading and to minimize communication between the device and host. Four kinds of parallel strategies are then developed to efficiently solve these ES-FEM based shell element formulas, and various optimization methods are adopted to ensure aligned memory access. Special focus is dedicated to developing an approach for the parallel construction of edge systems. A parallel hierarchy-territory contact-searching algorithm (HITA) and a parallel penalty function calculation method are embedded in this parallel explicit algorithm. Finally, the program flow is well designed, and a GPU-based simulation system is developed, using Nvidia's CUDA. Several numerical examples are presented to illustrate the high quality of the results obtained with the proposed methods. In addition, the GPU-based parallel computation is shown to significantly reduce the computing time.

  1. Parallelized Three-Dimensional Resistivity Inversion Using Finite Elements And Adjoint State Methods

    NASA Astrophysics Data System (ADS)

    Schaa, Ralf; Gross, Lutz; Du Plessis, Jaco

    2015-04-01

    The resistivity method is one of the oldest geophysical exploration methods, which employs one pair of electrodes to inject current into the ground and one or more pairs of electrodes to measure the electrical potential difference. The potential difference is a non-linear function of the subsurface resistivity distribution described by an elliptic partial differential equation (PDE) of the Poisson type. Inversion of measured potentials solves for the subsurface resistivity represented by PDE coefficients. With increasing advances in multichannel resistivity acquisition systems (systems with more than 60 channels and full waveform recording are now emerging), inversion software requires efficient storage and solver algorithms. We developed the finite element solver Escript, which provides a user-friendly programming environment in Python to solve large-scale PDE-based problems (see https://launchpad.net/escript-finley). Using finite elements, highly irregular shaped geology and topography can readily be taken into account. For the 3D resistivity problem, we have implemented the secondary potential approach, where the PDE is decomposed into a primary potential caused by the source current and the secondary potential caused by changes in subsurface resistivity. The primary potential is calculated analytically, and the boundary value problem for the secondary potential is solved using nodal finite elements. This approach removes the singularity caused by the source currents and provides more accurate 3D resistivity models. To solve the inversion problem we apply a 'first optimize then discretize' approach using the quasi-Newton scheme in the form of the limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) method (see Gross & Kemp 2013). The evaluation of the cost function requires the solution of the secondary potential PDE for each source current and the solution of the corresponding adjoint-state PDE for the cost function gradients with respect to the subsurface

  2. Synthesis of Fluorine-Containing Phosphodiesterase 10A (PDE10A) Inhibitors and the In Vivo Evaluation of F-18 Labeled PDE10A PET Tracers in Rodent and Nonhuman Primate

    PubMed Central

    Li, Junfeng; Zhang, Xiang; Jin, Hongjun; Fan, Jinda; Flores, Hubert; Perlmutter, Joel S.; Tu, Zhude

    2015-01-01

    A series of fluorine-containing PDE10A inhibitors were designed and synthesized to improve the metabolic stability of [11C]MP-10. Twenty of the 22 new analogues had high potency and selectivity for PDE10A: 18a–j, 19d–j, 20a–b, and 21b had IC50 values <5 nM for PDE10A. Seven F-18 labeled compounds, [18F]18a–e, [18F]18g, and [18F]20a, were radiosynthesized by 18F-introduction onto the quinoline rather than the pyrazole moiety of the MP-10 pharmacophore and evaluated in vivo. Biodistribution studies in rats showed ~2-fold higher activity in the PDE10A-enriched striatum than nontarget brain regions; this ratio increased from 5 to 30 min postinjection, particularly for [18F]18a–d and [18F]20a. Micro-PET studies of [18F]18d and [18F]20a in nonhuman primates provided clear visualization of striatum with suitable equilibrium kinetics and favorable metabolic stability. These results suggest this strategy may identify an 18F-labeled PET tracer for quantifying the levels of PDE10A in patients with CNS disorders including Huntington’s disease and schizophrenia. PMID:26430878

  3. PDE5 Inhibitors Enhance Celecoxib Killing in Multiple Tumor Types

    PubMed Central

    BOOTH, LAURENCE; ROBERTS, JANE L.; CRUICKSHANKS, NICHOLA; TAVALLAI, SEYEDMEHRAD; WEBB, TIMOTHY; SAMUEL, PETER; CONLEY, ADAM; BINION, BRITTANY; YOUNG, HAROLD F.; POKLEPOVIC, ANDREW; SPIEGEL, SARAH; DENT, PAUL

    2015-01-01

    The present studies determined whether clinically relevant phosphodiesterase 5 (PDE5) inhibitors interacted with a clinically relevant NSAID, celecoxib, to kill tumor cells. Celecoxib and PDE5 inhibitors interacted in a greater than additive fashion to kill multiple tumor cell types. Celecoxib and sildenafil killed ex vivo primary human glioma cells as well as their associated activated microglia. Knock down of PDE5 recapitulated the effects of PDE5 inhibitor treatment; the nitric oxide synthase inhibitor L-NAME suppressed drug combination toxicity. The effects of celecoxib were COX2 independent. Over-expression of c-FLIP-s or knock down of CD95/FADD significantly reduced killing by the drug combination. CD95 activation was dependent on nitric oxide and ceramide signaling. CD95 signaling activated the JNK pathway and inhibition of JNK suppressed cell killing. The drug combination inactivated mTOR and increased the levels of autophagy and knock down of Beclin1 or ATG5 strongly suppressed killing by the drug combination. The drug combination caused an ER stress response; knock down of IRE1α/XBP1 enhanced killing whereas knock down of eIF2α/ATF4/CHOP suppressed killing. Sildenafil and celecoxib treatment suppressed the growth of mammary tumors in vivo. Collectively our data demonstrate that clinically achievable concentrations of celecoxib and sildenafil have the potential to be a new therapeutic approach for cancer. PMID:25303541

  4. Parallel computing of physical maps--a comparative study in SIMD and MIMD parallelism.

    PubMed

    Bhandarkar, S M; Chirravuri, S; Arnold, J

    1996-01-01

    Ordering clones from a genomic library into physical maps of whole chromosomes presents a central computational problem in genetics. Chromosome reconstruction via clone ordering is usually isomorphic to the NP-complete Optimal Linear Arrangement problem. Parallel SIMD and MIMD algorithms for simulated annealing based on Markov chain distribution are proposed and applied to the problem of chromosome reconstruction via clone ordering. Perturbation methods and problem-specific annealing heuristics are proposed and described. The SIMD algorithms are implemented on a 2048 processor MasPar MP-2 system which is an SIMD 2-D toroidal mesh architecture whereas the MIMD algorithms are implemented on an 8 processor Intel iPSC/860 which is an MIMD hypercube architecture. A comparative analysis of the various SIMD and MIMD algorithms is presented in which the convergence, speedup, and scalability characteristics of the various algorithms are analyzed and discussed. On a fine-grained, massively parallel SIMD architecture with a low synchronization overhead such as the MasPar MP-2, a parallel simulated annealing algorithm based on multiple periodically interacting searches performs the best. For a coarse-grained MIMD architecture with high synchronization overhead such as the Intel iPSC/860, a parallel simulated annealing algorithm based on multiple independent searches yields the best results. In either case, distribution of clonal data across multiple processors is shown to exacerbate the tendency of the parallel simulated annealing algorithm to get trapped in a local optimum.
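
    The "multiple independent searches" strategy named in this record can be sketched on a toy linear arrangement problem. The code below is an illustration under stated assumptions, not the authors' implementation: the swap perturbation, the geometric cooling schedule and all parameter values are hypothetical, and the searches run one after another here, whereas on a MIMD machine each seed would be given to its own processor.

```python
import math
import random


def arrangement_cost(order, edges):
    """Sum of |position(u) - position(v)| over all edges."""
    pos = {node: i for i, node in enumerate(order)}
    return sum(abs(pos[u] - pos[v]) for u, v in edges)


def anneal(nodes, edges, seed, steps=2000, t_start=2.0, cooling=0.995):
    """One annealing search; returns (best_cost, best_order)."""
    rng = random.Random(seed)
    order = list(nodes)
    rng.shuffle(order)
    cost = arrangement_cost(order, edges)
    best_cost, best_order = cost, order[:]
    temp = t_start
    for _ in range(steps):
        i, j = rng.sample(range(len(order)), 2)
        order[i], order[j] = order[j], order[i]      # perturb: swap two nodes
        new_cost = arrangement_cost(order, edges)
        delta = new_cost - cost
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            cost = new_cost                          # accept the move
            if cost < best_cost:
                best_cost, best_order = cost, order[:]
        else:
            order[i], order[j] = order[j], order[i]  # reject: undo the swap
        temp *= cooling
    return best_cost, best_order


def independent_searches(nodes, edges, seeds):
    """Run one search per seed and keep the best result."""
    results = [anneal(nodes, edges, seed) for seed in seeds]
    return min(results, key=lambda r: r[0])
```

    Because the searches never communicate, this variant has no synchronization cost, which matches the record's observation that it suits a coarse-grained architecture with high synchronization overhead.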

  5. Partitioning and packing mathematical simulation models for calculation on parallel computers

    NASA Technical Reports Server (NTRS)

    Arpasi, D. J.; Milner, E. J.

    1986-01-01

    The development of multiprocessor simulations from a serial set of ordinary differential equations describing a physical system is described. Degrees of parallelism (i.e., coupling between the equations) and their impact on parallel processing are discussed. The problem of identifying computational parallelism within sets of closely coupled equations that require the exchange of current values of variables is described. A technique is presented for identifying this parallelism and for partitioning the equations for parallel solution on a multiprocessor. An algorithm which packs the equations into a minimum number of processors is also described. The results of the packing algorithm when applied to a turbojet engine model are presented in terms of processor utilization.
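
    The record does not describe its packing algorithm, so the sketch below substitutes a standard first-fit-decreasing bin-packing heuristic to show the general idea of packing equations into few processors. The task names, the per-step time budget (capacity) and both function names are hypothetical, and the coupling between equations, which the original technique takes into account, is ignored here.

```python
def pack_first_fit_decreasing(costs, capacity):
    """Pack tasks into processors of a given per-step time budget.

    costs: mapping of task name -> compute time.
    Returns a list of processors, each a list of task names.
    """
    processors = []  # each entry: [remaining_capacity, [task names]]
    for name, cost in sorted(costs.items(), key=lambda kv: -kv[1]):
        if cost > capacity:
            raise ValueError(f"task {name!r} exceeds the per-processor budget")
        for proc in processors:
            if proc[0] >= cost:
                proc[0] -= cost
                proc[1].append(name)
                break
        else:
            # No existing processor has room: open a new one.
            processors.append([capacity - cost, [name]])
    return [tasks for _, tasks in processors]


def utilization(costs, packing, capacity):
    """Fraction of each processor's budget that is used."""
    return [sum(costs[t] for t in tasks) / capacity for tasks in packing]
```

    First-fit decreasing is a heuristic and does not always reach the true minimum number of processors; the utilization figures give a quick view of how much slack each processor retains.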

  6. Parallel simulations of Grover's algorithm for closest match search in neutron monitor data

    NASA Astrophysics Data System (ADS)

    Kussainov, Arman; White, Yelena

    We are studying the parallel implementations of Grover's closest match search algorithm for neutron monitor data analysis. This includes data formatting, and matching quantum parameters to a conventional structure of a chosen programming language and selected experimental data type. We have employed several workload distribution models based on acquired data and search parameters. As a result of these simulations, we have an understanding of potential problems that may arise during configuration of real quantum computational devices and the way they could run tasks in parallel. The work was supported by the Science Committee of the Ministry of Science and Education of the Republic of Kazakhstan Grant #2532/GF3.

  7. A parallel finite element simulator for ion transport through three-dimensional ion channel systems.

    PubMed

    Tu, Bin; Chen, Minxin; Xie, Yan; Zhang, Linbo; Eisenberg, Bob; Lu, Benzhuo

    2013-09-15

    A parallel finite element simulator, ichannel, is developed for ion transport through three-dimensional ion channel systems that consist of protein and membrane. The coordinates of heavy atoms of the protein are taken from the Protein Data Bank and the membrane is represented as a slab. The simulator contains two components: a parallel adaptive finite element solver for a set of Poisson-Nernst-Planck (PNP) equations that describe the electrodiffusion process of ion transport, and a mesh generation tool chain for ion channel systems, which is an essential component for the finite element computations. The finite element method has advantages in modeling irregular geometries and complex boundary conditions. We have built a tool chain to get the surface and volume mesh for ion channel systems, which consists of a set of mesh generation tools. The adaptive finite element solver in our simulator is implemented using the parallel adaptive finite element package Parallel Hierarchical Grid (PHG) developed by one of the authors, which provides the capability of doing large scale parallel computations with high parallel efficiency and the flexibility of choosing high order elements to achieve high order accuracy. The simulator is applied to a real transmembrane protein, the gramicidin A (gA) channel protein, to calculate the electrostatic potential, ion concentrations and I-V curve, with which both primitive and transformed PNP equations are studied and their numerical performances are compared. To further validate the method, we also apply the simulator to two other ion channel systems, the voltage dependent anion channel (VDAC) and α-Hemolysin (α-HL). The simulation results agree well with Brownian dynamics (BD) simulation results and experimental results. Moreover, because ionic finite size effects can now be included in the PNP model, we also perform simulations using a size-modified PNP (SMPNP) model on VDAC and α-HL. It is shown that the size effects in SMPNP can

  8. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit

    PubMed Central

    Pronk, Sander; Páll, Szilárd; Schulz, Roland; Larsson, Per; Bjelkmar, Pär; Apostolov, Rossen; Shirts, Michael R.; Smith, Jeremy C.; Kasson, Peter M.; van der Spoel, David; Hess, Berk; Lindahl, Erik

    2013-01-01

    Motivation: Molecular simulation has historically been a low-throughput technique, but faster computers and increasing amounts of genomic and structural data are changing this by enabling large-scale automated simulation of, for instance, many conformers or mutants of biomolecules with or without a range of ligands. At the same time, advances in performance and scaling now make it possible to model complex biomolecular interaction and function in a manner directly testable by experiment. These applications share a need for fast and efficient software that can be deployed on a massive scale in clusters, web servers, distributed computing or cloud resources. Results: Here, we present a range of new simulation algorithms and features developed during the past 4 years, leading up to the GROMACS 4.5 software package. The software now automatically handles wide classes of biomolecules, such as proteins, nucleic acids and lipids, and comes with all commonly used force fields for these molecules built-in. GROMACS supports several implicit solvent models, as well as new free-energy algorithms, and the software now uses multithreading for efficient parallelization even on low-end systems, including Windows-based workstations. Together with hand-tuned assembly kernels and state-of-the-art parallelization, this provides extremely high performance and cost efficiency for high-throughput as well as massively parallel simulations. Availability: GROMACS is an open source and free software available from http://www.gromacs.org. Contact: erik.lindahl@scilifelab.se Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23407358

  9. On efficiency of fire simulation realization: parallelization with greater number of computational meshes

    NASA Astrophysics Data System (ADS)

    Valasek, Lukas; Glasa, Jan

    2017-12-01

    Current fire simulation systems are capable of exploiting the high-performance computing (HPC) platforms available and of modelling fires efficiently in parallel. In this paper, the efficiency of a corridor fire simulation on an HPC cluster is discussed. The parallel MPI version of Fire Dynamics Simulator is used to test the efficiency of selected strategies for allocating the computational resources of the cluster when a greater number of computational cores is used. Simulation results indicate that if the number of cores used is not a multiple of the total number of cores per cluster node, there are allocation strategies which provide more efficient calculations.
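
    The record does not list the allocation strategies it compares. As a purely hypothetical illustration of why the choice matters when the process count is not a multiple of the node size, the sketch below contrasts two simple ways of placing MPI processes on nodes; both function names are invented for this example and num_procs is assumed to be at least 1.

```python
def fill_first(num_procs, cores_per_node):
    """Fill each node completely before using the next one."""
    counts = []
    remaining = num_procs
    while remaining > 0:
        used = min(remaining, cores_per_node)
        counts.append(used)
        remaining -= used
    return counts


def spread_evenly(num_procs, cores_per_node):
    """Use the same number of nodes as fill_first, but balance the load."""
    num_nodes = -(-num_procs // cores_per_node)  # ceiling division
    base, extra = divmod(num_procs, num_nodes)
    return [base + 1 if i < extra else base for i in range(num_nodes)]
```

    For 20 processes on 12-core nodes, the first function gives 12 and 8 processes per node and the second gives 10 and 10; when the count is a multiple of the node size, both give the same layout. Which layout runs faster depends on the machine and the simulation and has to be measured.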

  10. Human PDE4D isoform composition is deregulated in primary prostate cancer and indicative for disease progression and development of distant metastases

    PubMed Central

    Böttcher, René; Dulla, Kalyan; van Strijp, Dianne; Dits, Natasja; Verhoef, Esther I.; Baillie, George S.; van Leenders, Geert J.L.H.; Houslay, Miles D.; Jenster, Guido; Hoffmann, Ralf

    2016-01-01

    Phosphodiesterase 4D7 was recently shown to be specifically over-expressed in localized prostate cancer, raising the question as to which regulatory mechanisms are involved and whether other isoforms of this gene family (PDE4D) are affected under the same conditions. We investigated PDE4D isoform composition in prostatic tissues using a total of seven independent expression datasets and also included data on DNA methylation, copy number and AR and ERG binding in PDE4D promoters to gain insight into their effect on PDE4D transcription. We show that expression of PDE4D isoforms is consistently altered in primary human prostate cancer compared to benign tissue, with PDE4D7 being up-regulated while PDE4D5 and PDE4D9 are down-regulated. Disease progression is marked by an overall down-regulation of long PDE4D isoforms, while short isoforms (PDE4D1/2) appear to be relatively unaffected. While these alterations seem to be independent of copy number alterations in the PDE4D locus and driven by AR and ERG binding, we also observed increased DNA methylation in the promoter region of PDE4D5, indicating a long lasting alteration of the isoform composition in prostate cancer tissues. We propose two independent metrics that may serve as diagnostic and prognostic markers for prostate disease: (PDE4D7 - PDE4D5) provides an effective means for distinguishing PCa from normal adjacent prostate, whereas PDE4D1/2 - (PDE4D5 + PDE4D7 + PDE4D9) offers strong prognostic potential to detect aggressive forms of PCa and is associated with metastasis free survival. Overall, our findings highlight the relevance of PDE4D as prostate cancer biomarker and potential drug target. PMID:27683107

  11. Xyce Parallel Electronic Simulator : reference guide, version 2.0.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoekstra, Robert John; Waters, Lon J.; Rankin, Eric Lamont

    This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users' Guide. The focus of this document is to list, as exhaustively as possible, device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users' Guide.

  12. Xyce™ Parallel Electronic Simulator Reference Guide Version 6.8

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Aadithya, Karthik Venkatraman; Mei, Ting

    This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users' Guide. The focus of this document is to list, as exhaustively as possible, device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users' Guide.

  13. Facilitating arrhythmia simulation: the method of quantitative cellular automata modeling and parallel running

    PubMed Central

    Zhu, Hao; Sun, Yan; Rajagopal, Gunaretnam; Mondry, Adrian; Dhar, Pawan

    2004-01-01

    Background: Many arrhythmias are triggered by abnormal electrical activity at the ionic channel and cell level, and then evolve spatio-temporally within the heart. To understand arrhythmias better and to diagnose them more precisely by their ECG waveforms, a whole-heart model is required to explore the association between the massively parallel activities at the channel/cell level and the integrative electrophysiological phenomena at organ level. Methods: We have developed a method to build large-scale electrophysiological models by using extended cellular automata, and to run such models on a cluster of shared memory machines. We describe here the method, including the extension of a language-based cellular automaton to implement quantitative computing, the building of a whole-heart model with Visible Human Project data, the parallelization of the model on a cluster of shared memory computers with OpenMP and MPI hybrid programming, and a simulation algorithm that links cellular activity with the ECG. Results: We demonstrate that electrical activities at channel, cell, and organ levels can be traced and captured conveniently in our extended cellular automaton system. Examples of some ECG waveforms simulated with a 2-D slice are given to support the ECG simulation algorithm. A performance evaluation of the 3-D model on a four-node cluster is also given. Conclusions: Quantitative multicellular modeling with extended cellular automata is a highly efficient and widely applicable method to weave experimental data at different levels into computational models. This process can be used to investigate complex and collective biological activities that can be described neither by their governing differential equations nor by discrete parallel computation. Transparent cluster computing is a convenient and effective method to make time-consuming simulation feasible. Arrhythmias, as a typical case, can be effectively simulated with the methods described. PMID:15339335

  14. β2-Agonist Induced cAMP Is Decreased in Asthmatic Airway Smooth Muscle Due to Increased PDE4D

    PubMed Central

    Trian, Thomas; Burgess, Janette K.; Niimi, Kyoko; Moir, Lyn M.; Ge, Qi; Berger, Patrick; Liggett, Stephen B.; Black, Judith L.; Oliver, Brian G.

    2011-01-01

    Background and Objective: Asthma is associated with airway narrowing in response to bronchoconstricting stimuli and increased airway smooth muscle (ASM) mass. In addition, some studies have suggested impaired β-agonist induced ASM relaxation in asthmatics, but the mechanism is not known. Objective: To characterize the potential defect in β-agonist induced cAMP in ASM derived from asthmatic in comparison to non-asthmatic subjects and to investigate its mechanism. Methods: We examined β2-adrenergic (β2AR) receptor expression and basal β-agonist and forskolin (direct activator of adenylyl cyclase) stimulated cAMP production in asthmatic cultured ASM (n = 15) and non-asthmatic ASM (n = 22). Based on these results, PDE activity, PDE4D expression and cell proliferation were determined. Results: In the presence of IBMX, a pan PDE inhibitor, asthmatic ASM had ∼50% lower cAMP production in response to isoproterenol, albuterol, formoterol, and forskolin compared to non-asthmatic ASM. However, when PDE4 was specifically inhibited, cAMP production by the agonists and forskolin was normalized in asthmatic ASM. We then measured the amount and activity of PDE4, and found ∼2-fold greater expression and activity in asthmatic ASM compared to non-asthmatic ASM. Furthermore, inhibition of PDE4 reduced asthmatic ASM proliferation but not that of non-asthmatic ASM. Conclusion: Decreased β-agonist induced cAMP in ASM from asthmatics results from enhanced degradation due to increased PDE4D expression. Clinical manifestations of this dysregulation would be suboptimal β-agonist-mediated bronchodilation and possibly reduced control over increasing ASM mass. These phenotypes appear to be “hard-wired” into ASM from asthmatics, as they do not require an inflammatory environment in culture to be observed. PMID:21611147

  15. Parallel 3D Multi-Stage Simulation of a Turbofan Engine

    NASA Technical Reports Server (NTRS)

    Turner, Mark G.; Topp, David A.

    1998-01-01

    A 3D multistage simulation of each component of a modern GE Turbofan engine has been made. An axisymmetric view of this engine is presented in the document. This includes a fan, booster rig, high pressure compressor rig, high pressure turbine rig and a low pressure turbine rig. In the near future, all components will be run in a single calculation for a solution of 49 blade rows. The simulation exploits the use of parallel computations by using two levels of parallelism. Each blade row is run in parallel and each blade row grid is decomposed into several domains and run in parallel. 20 processors are used for the 4 blade row analysis. The average passage approach developed by John Adamczyk at NASA Lewis Research Center has been further developed and parallelized. This is APNASA Version A. It is a Navier-Stokes solver using a 4-stage explicit Runge-Kutta time marching scheme with variable time steps and residual smoothing for convergence acceleration. It has an implicit K-E turbulence model which uses an ADI solver to factor the matrix. Between 50 and 100 explicit time steps are solved before a blade row body force is calculated and exchanged with the other blade rows. This outer iteration has been coined a "flip." Efforts have been made to make the solver linearly scalable with the number of blade rows. Enough flips are run (between 50 and 200) so the solution in the entire machine is not changing. The K-E equations are generally solved every other explicit time step. One of the key requirements in the development of the parallel code was to make the parallel solution exactly (bit for bit) match the serial solution. This has helped isolate many small parallel bugs and guarantee the parallelization was done correctly. The domain decomposition is done only in the axial direction since the number of points axially is much larger than the other two directions. This code uses MPI for message passing. The parallel speed up of the solver portion (no I/O or body force
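
    A 4-stage explicit Runge-Kutta time-marching step of the kind mentioned in this record can be sketched in its common low-storage form, where every stage restarts from the state at the beginning of the step. The stage coefficients 1/4, 1/3, 1/2, 1 are a widely used textbook choice and are an assumption here; the record does not state the coefficients used in APNASA, and variable time steps and residual smoothing are omitted.

```python
def multistage_step(u, residual, dt, alphas=(0.25, 1.0 / 3.0, 0.5, 1.0)):
    """Advance du/dt = residual(u) by one step of a low-storage scheme.

    Each stage restarts from the state at the beginning of the step:
        u_k = u_0 + alpha_k * dt * residual(u_{k-1})
    The default coefficients are a common textbook choice (an assumption,
    not taken from the APNASA record).
    """
    u0 = list(u)
    stage = list(u)
    for alpha in alphas:
        r = residual(stage)
        stage = [a + alpha * dt * b for a, b in zip(u0, r)]
    return stage
```

    For a linear residual, these coefficients reproduce the Taylor expansion of the exact solution up to fourth order in dt, and only the initial state and the current stage need to be stored.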

  16. Xyce

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thornquist, Heidi K.; Fixel, Deborah A.; Fett, David Brian

    The Xyce Parallel Electronic Simulator simulates electronic circuit behavior in DC, AC, HB, MPDE and transient mode using standard analog (DAE) and/or device (PDE) models, including several age- and radiation-aware devices. It supports a variety of computing platforms, both serial and parallel. Lastly, it uses a variety of modern solution algorithms, including dynamic parallel load-balancing and iterative solvers.

  17. Parallel Stochastic discrete event simulation of calcium dynamics in neuron.

    PubMed

    Ishlam Patoary, Mohammad Nazrul; Tropper, Carl; McDougal, Robert A; Zhongwei, Lin; Lytton, William W

    2017-09-26

    The intracellular calcium signaling pathways of a neuron depend on both biochemical reactions and diffusion. Some quasi-isolated compartments (e.g. spines) are so small and calcium concentrations are so low that one extra molecule diffusing in by chance can make a nontrivial difference in its concentration (percentage-wise). These rare events can affect dynamics discretely in such a way that they cannot be evaluated by a deterministic simulation. Stochastic models of such a system provide a more detailed understanding of these systems than existing deterministic models because they capture their behavior at a molecular level. Our research focuses on the development of a high performance parallel discrete event simulation environment, Neuron Time Warp (NTW), which is intended for use in the parallel simulation of stochastic reaction-diffusion systems such as intracellular calcium signaling. NTW is integrated with NEURON, a simulator which is widely used within the neuroscience community. We simulate two models, a calcium buffer and a calcium wave model. The calcium buffer model is employed in order to verify the correctness and performance of NTW by comparing it to a serial deterministic simulation in NEURON. We also derived a discrete event calcium wave model from a deterministic model using the stochastic IP3R structure.

  18. Progress on the Multiphysics Capabilities of the Parallel Electromagnetic ACE3P Simulation Suite

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kononenko, Oleksiy

    2015-03-26

    ACE3P is a 3D parallel simulation suite that is being developed at SLAC National Accelerator Laboratory. Effectively utilizing supercomputer resources, ACE3P has become a key tool for the coupled electromagnetic, thermal and mechanical research and design of particle accelerators. Based on the existing finite-element infrastructure, a massively parallel eigensolver is developed for modal analysis of mechanical structures. It complements a set of the multiphysics tools in ACE3P and, in particular, can be used for the comprehensive study of microphonics in accelerating cavities ensuring the operational reliability of a particle accelerator.

  19. Implementation of Shifted Periodic Boundary Conditions in the Large-Scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) Software

    DTIC Science & Technology

    2015-08-01

    Implementation of Shifted Periodic Boundary Conditions in the Large-Scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) Software, by N Scott Weingarten (Weapons and Materials Research Directorate, ARL) and James P Larentzos (Engility).

  20. A Role for Phosphodiesterase 11A (PDE11A) in the Formation of Social Memories and the Stabilization of Mood

    PubMed Central

    Kelly, Michy P.

    2017-01-01

    The most recently discovered 3′,5′-cyclic nucleotide phosphodiesterase family is the Phosphodiesterase 11 (PDE11) family, which is encoded by a single gene, PDE11A. PDE11A is a dual-specific PDE, breaking down both cAMP and cGMP. There are four PDE11A splice variants (PDE11A1–4) with distinct tissue expression profiles and unique N-terminal regulatory regions, suggesting that each isoform could be individually targeted with a small molecule or biologic. PDE11A4 is the PDE11A isoform expressed in brain and is found in the hippocampal formation of humans and rodents. Studies in rodents show that PDE11A4 mRNA expression in brain is, in fact, restricted to the hippocampal formation (CA1, possibly CA2, subiculum, and the adjacently connected amygdalohippocampal area). Within the hippocampal formation of rodents, PDE11A4 protein is expressed in neurons but not astrocytes, with a distribution across nuclear, cytoplasmic, and membrane compartments. This subcellular localization of PDE11A4 is altered in response to social experience in mice, and in vitro studies show the compartmentalization of PDE11A4 is controlled, at least in part, by homodimerization and N-terminal phosphorylation. PDE11A4 expression increases dramatically with age in the rodent hippocampus, from early postnatal life to late aging, suggesting PDE11A4 function may evolve across the lifespan. Interestingly, PDE11A4 protein shows a 3–10-fold enrichment in the rodent ventral hippocampal formation (VHIPP; a.k.a. anterior in primates) versus dorsal hippocampal formation (DHIPP). Consistent with this enrichment in VHIPP, studies in knockout mice show that PDE11A regulates the formation of social memories and the stabilization of mood and is a critical mechanism by which social experience feeds back to modify the brain and subsequent social behaviors. PDE11A4 likely controls behavior by regulating hippocampal glutamatergic, oxytocin, and cytokine signaling, as well as protein

  1. Parallelized computation for computer simulation of electrocardiograms using personal computers with multi-core CPU and general-purpose GPU.

    PubMed

    Shen, Wenfeng; Wei, Daming; Xu, Weimin; Zhu, Xin; Yuan, Shizhong

    2010-10-01

    Biological computations like electrocardiological modelling and simulation usually require high-performance computing environments. This paper introduces an implementation of parallel computation for computer simulation of electrocardiograms (ECGs) in a personal computer environment with an Intel Core(TM) 2 Quad Q6600 CPU and a GeForce 8800GT GPU, with software support by OpenMP and CUDA. It was tested in three parallelization device setups: (a) a four-core CPU without a general-purpose GPU, (b) a general-purpose GPU plus 1 core of CPU, and (c) a four-core CPU plus a general-purpose GPU. To effectively take advantage of a multi-core CPU and a general-purpose GPU, an algorithm based on load-prediction dynamic scheduling was developed and applied to setting (c). In the simulation with 1600 time steps, the speedup of the parallel computation as compared to the serial computation was 3.9 in setting (a), 16.8 in setting (b), and 20.0 in setting (c). This study demonstrates that a current PC with a multi-core CPU and a general-purpose GPU provides a good environment for parallel computations in biological modelling and simulation studies. Copyright 2010 Elsevier Ireland Ltd. All rights reserved.

  2. PCTDSE: A parallel Cartesian-grid-based TDSE solver for modeling laser-atom interactions

    NASA Astrophysics Data System (ADS)

    Fu, Yongsheng; Zeng, Jiaolong; Yuan, Jianmin

    2017-01-01

    We present a parallel Cartesian-grid-based time-dependent Schrödinger equation (TDSE) solver for modeling laser-atom interactions. It can simulate the single-electron dynamics of atoms in arbitrary time-dependent vector potentials. We use a split-operator method combined with fast Fourier transforms (FFT), on a three-dimensional (3D) Cartesian grid. Parallelization is realized using a 2D decomposition strategy based on the Message Passing Interface (MPI) library, which results in a good parallel scaling on modern supercomputers. We give simple applications for the hydrogen atom using the benchmark problems coming from the references and obtain repeatable results. The extensions to other laser-atom systems are straightforward with minimal modifications of the source code.
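
    The split-operator idea named in this abstract can be sketched in a few lines. The following is only an illustration under loud assumptions, not the PCTDSE code: it is serial and one-dimensional, uses a scalar harmonic potential in atomic units instead of a time-dependent vector potential, and substitutes a naive O(N^2) discrete Fourier transform for an FFT so that it needs nothing beyond the Python standard library. The names dft, split_step, norm and demo are hypothetical.

```python
import cmath
import math


def dft(values, sign):
    """Naive O(N^2) Fourier transform: sign=-1 forward, sign=+1 inverse (scaled by 1/N)."""
    n = len(values)
    out = []
    for q in range(n):
        acc = 0j
        for m in range(n):
            acc += values[m] * cmath.exp(sign * 2j * math.pi * q * m / n)
        out.append(acc)
    return out if sign < 0 else [v / n for v in out]


def split_step(psi, potential, length, dt):
    """One step exp(-iV dt/2) exp(-iT dt) exp(-iV dt/2) on a periodic grid (atomic units)."""
    n = len(psi)
    half_v = [cmath.exp(-0.5j * dt * v) for v in potential]
    psi = [h * p for h, p in zip(half_v, psi)]            # half step in the potential
    phi = dft(psi, -1)                                    # to momentum space
    for q in range(n):
        k = 2.0 * math.pi * (q if q < n // 2 else q - n) / length
        phi[q] *= cmath.exp(-0.5j * dt * k * k)           # full kinetic step, T = k^2 / 2
    psi = dft(phi, +1)                                    # back to position space
    return [h * p for h, p in zip(half_v, psi)]           # second half step in the potential


def norm(psi, dx):
    """Discrete approximation of the integral of |psi|^2."""
    return sum(abs(p) ** 2 for p in psi) * dx


def demo(n=32, length=20.0, dt=0.01, steps=5):
    """Propagate a normalized Gaussian in V(x) = x^2 / 2 and return the final norm."""
    dx = length / n
    xs = [-0.5 * length + i * dx for i in range(n)]
    potential = [0.5 * x * x for x in xs]
    psi = [complex(math.exp(-0.5 * x * x)) for x in xs]
    scale = math.sqrt(norm(psi, dx))
    psi = [p / scale for p in psi]
    for _ in range(steps):
        psi = split_step(psi, potential, length, dt)
    return norm(psi, dx)
```

    Each factor applied in split_step is a pure phase or a unitary transform, so the norm returned by demo() should stay at 1 up to rounding. A 3D solver of the kind described would replace dft with a distributed FFT, which is where a 2D domain decomposition would come in.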

  3. Xyce parallel electronic simulator reference guide, version 6.0.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R; Mei, Ting; Russo, Thomas V.

    2013-08-01

    This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users Guide [1]. The focus of this document is to list exhaustively (to the extent possible) device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users Guide [1].

  4. A numerical investigation into the ability of the Poisson PDE to extract the mass-density from land-based gravity data: A case study of salt diapirs in the north coast of the Persian Gulf

    NASA Astrophysics Data System (ADS)

    AllahTavakoli, Yahya; Safari, Abdolreza

    2017-08-01

    This paper presents a numerical investigation into the capability of Poisson's Partial Differential Equation (PDE) at Earth's surface to extract the near-surface mass-density from land-based gravity data. For this purpose, it first focuses on approximating the gradient tensor of Earth's gravitational potential by means of land-based gravity data. Then, based on the concepts of both the gradient tensor and Poisson's PDE at the Earth's surface, certain formulae are proposed for the mass-density determination. Furthermore, this paper shows how the generalized Tikhonov regularization strategy can be used for enhancing the efficiency of the proposed approach. Finally, in a real case study, the formulae are applied to 6350 gravity stations located within a part of the north coast of the Persian Gulf. The case study numerically indicates that the proposed formulae, provided by Poisson's PDE, are able to convert land-based gravity data into the terrain mass-density, which has been used for depicting areas of salt diapirs in the region of the case study.

  5. Simulated parallel annealing within a neighborhood for optimization of biomechanical systems.

    PubMed

    Higginson, J S; Neptune, R R; Anderson, F C

    2005-09-01

    Optimization problems for biomechanical systems have become extremely complex. Simulated annealing (SA) algorithms have performed well in a variety of test problems and biomechanical applications; however, despite advances in computer speed, convergence to optimal solutions for systems of even moderate complexity has remained prohibitive. The objective of this study was to develop a portable parallel version of a SA algorithm for solving optimization problems in biomechanics. The algorithm for simulated parallel annealing within a neighborhood (SPAN) was designed to minimize interprocessor communication time and closely retain the heuristics of the serial SA algorithm. The computational speed of the SPAN algorithm scaled linearly with the number of processors on different computer platforms for a simple quadratic test problem and for a more complex forward dynamic simulation of human pedaling.

  6. Long-range interactions and parallel scalability in molecular simulations

    NASA Astrophysics Data System (ADS)

    Patra, Michael; Hyvönen, Marja T.; Falck, Emma; Sabouri-Ghomi, Mohsen; Vattulainen, Ilpo; Karttunen, Mikko

    2007-01-01

    Typical biomolecular systems such as cellular membranes, DNA, and protein complexes are highly charged. Thus, efficient and accurate treatment of electrostatic interactions is of great importance in computational modeling of such systems. We have employed the GROMACS simulation package to perform extensive benchmarking of different commonly used electrostatic schemes on a range of computer architectures (Pentium-4, IBM Power 4, and Apple/IBM G5) for single processor and parallel performance up to 8 nodes—we have also tested the scalability on four different networks, namely Infiniband, GigaBit Ethernet, Fast Ethernet, and nearly uniform memory architecture, i.e. communication between CPUs is possible by directly reading from or writing to other CPUs' local memory. It turns out that the particle-mesh Ewald method (PME) performs surprisingly well and offers competitive performance unless parallel runs on PC hardware with older network infrastructure are needed. Lipid bilayers of sizes 128, 512 and 2048 lipid molecules were used as the test systems representing typical cases encountered in biomolecular simulations. Our results enable an accurate prediction of computational speed on most current computing systems, both for serial and parallel runs. These results should be helpful in, for example, choosing the most suitable configuration for a small departmental computer cluster.

  7. Kaempferia parviflora, a plant used in traditional medicine to enhance sexual performance contains large amounts of low affinity PDE5 inhibitors

    PubMed Central

    Temkitthawon, Prapapan; Hinds, Thomas R.; Beavo, Joseph A.; Viyoch, Jarupa; Suwanborirux, Khanit; Pongamornkul, Wittaya; Sawasdee, Pattara; Ingkaninan, Kornkanok

    2014-01-01

    Aim of the study: A number of medicinal plants are used in traditional medicine to treat erectile dysfunction. Since cyclic nucleotide PDE inhibitors underlie several current treatments for this condition, we sought to show whether these plants might contain substantial amounts of PDE5 inhibitors. Materials and methods: Forty-one plant extracts and eight 7-methoxyflavones from Kaempferia parviflora Wall. ex Baker were screened for PDE5 and PDE6 inhibitory activities using the two-step radioactive assay. The PDE5 and PDE6 were prepared from mouse lung and chicken retinas, respectively. All plant extracts were tested at 50 μg/ml whereas the pure compounds were tested at 10 μM. Results: Of the forty-one plant extracts tested, four showed a PDE5 inhibitory effect. The chemical constituents isolated from rhizomes of Kaempferia parviflora were further investigated for inhibitory activity against PDE5 and PDE6. The results showed that 7-methoxyflavones from this plant inhibited both enzymes. The most potent PDE5 inhibitor was 5,7-dimethoxyflavone (IC50 = 10.64 ± 2.09 μM, selectivity on PDE5 over PDE6 = 3.71). The structure-activity relationship showed that the methoxyl group at the C-5 position of 7-methoxyflavones was necessary for PDE5 inhibition. Conclusions: Kaempferia parviflora rhizome extract and its 7-methoxyflavone constituents had moderate inhibitory activity against PDE5. This finding provides an explanation for enhancing sexual performance in the traditional use of Kaempferia parviflora. Moreover, 5,7-dimethoxyflavone should make a useful lead compound to further develop clinically efficacious PDE5 inhibitors. PMID:21884777

  8. A PDE Sensitivity Equation Method for Optimal Aerodynamic Design

    NASA Technical Reports Server (NTRS)

    Borggaard, Jeff; Burns, John

    1996-01-01

    The use of gradient based optimization algorithms in inverse design is well established as a practical approach to aerodynamic design. A typical procedure uses a simulation scheme to evaluate the objective function (from the approximate states) and its gradient, then passes this information to an optimization algorithm. Once the simulation scheme (CFD flow solver) has been selected and used to provide approximate function evaluations, there are several possible approaches to the problem of computing gradients. One popular method is to differentiate the simulation scheme and compute design sensitivities that are then used to obtain gradients. Although this black-box approach has many advantages in shape optimization problems, one must compute mesh sensitivities in order to compute the design sensitivity. In this paper, we present an alternative approach using the PDE sensitivity equation to develop algorithms for computing gradients. This approach has the advantage that mesh sensitivities need not be computed. Moreover, when it is possible to use the CFD scheme for both the forward problem and the sensitivity equation, then there are computational advantages. An apparent disadvantage of this approach is that it does not always produce consistent derivatives. However, for a proper combination of discretization schemes, one can show asymptotic consistency under mesh refinement, which is often sufficient to guarantee convergence of the optimal design algorithm. In particular, we show that when asymptotically consistent schemes are combined with a trust-region optimization algorithm, the resulting optimal design method converges. We denote this approach as the sensitivity equation method. The sensitivity equation method is presented, convergence results are given and the approach is illustrated on two optimal design problems involving shocks.

  9. Use of Parallel Micro-Platform for the Simulation the Space Exploration

    NASA Astrophysics Data System (ADS)

    Velasco Herrera, Victor Manuel; Velasco Herrera, Graciela; Rosano, Felipe Lara; Rodriguez Lozano, Salvador; Lucero Roldan Serrato, Karen

    The purpose of this work is to create a parallel micro-platform that simulates the virtual movements of a space exploration in 3D. One of the innovations presented in this design is the application of a lever mechanism for the transmission of movement. The development of such a robot is a challenging task, very different from that of industrial manipulators because of a totally different set of target requirements. This work presents the computer-aided study and simulation of the movement of this parallel manipulator. The model was developed using the computer-aided design platform Unigraphics, in which we carried out the geometric modeling of each component and of the final assembly (CAD), the generation of files for the computer-aided manufacture (CAM) of each piece, and the kinematic simulation of the system under different driving schemes. We used the MATLAB aerospace toolbox and created an adaptive control module to simulate the system.

  10. Xyce parallel electronic simulator reference guide, version 6.1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R; Mei, Ting; Russo, Thomas V.

    2014-03-01

    This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users Guide [1]. The focus of this document is to list exhaustively (to the extent possible) device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users Guide [1].

  11. Targeted Ablation of the Pde6h Gene in Mice Reveals Cross-species Differences in Cone and Rod Phototransduction Protein Isoform Inventory*

    PubMed Central

    Brennenstuhl, Christina; Tanimoto, Naoyuki; Burkard, Markus; Wagner, Rebecca; Bolz, Sylvia; Trifunovic, Dragana; Kabagema-Bilan, Clement; Paquet-Durand, Francois; Beck, Susanne C.; Huber, Gesine; Seeliger, Mathias W.; Ruth, Peter; Wissinger, Bernd; Lukowski, Robert

    2015-01-01

    Phosphodiesterase-6 (PDE6) is a multisubunit enzyme that plays a key role in the visual transduction cascade in rod and cone photoreceptors. Each type of photoreceptor utilizes discrete catalytic and inhibitory PDE6 subunits to fulfill its physiological tasks, i.e. the degradation of cyclic guanosine-3′,5′-monophosphate at specifically tuned rates and kinetics. Recently, the human PDE6H gene was identified as a novel locus for autosomal recessive (incomplete) color blindness. However, the three different classes of cones were not affected to the same extent. Short wave cone function was more preserved than middle and long wave cone function, indicating that some basic regulation of the PDE6 multisubunit enzyme was maintained, albeit by an unknown mechanism. To study normal and disease-related functions of cone Pde6h in vivo, we generated Pde6h knock-out (Pde6h−/−) mice. Expression of PDE6H in murine eyes was restricted to both outer segments and synaptic terminals of short and long/middle cone photoreceptors, whereas Pde6h−/− retinae remained PDE6H-negative. Combined in vivo assessment of retinal morphology with histomorphological analyses revealed a normal overall integrity of the retinal organization and an unaltered distribution of the different cone photoreceptor subtypes upon Pde6h ablation. In contrast to human patients, our electroretinographic examinations of Pde6h−/− mice suggest no defects in cone/rod-driven retinal signaling and therefore preserved visual functions. To this end, we were able to demonstrate the presence of rod PDE6G in cones indicating functional substitution of PDE6. The disparities between human and murine phenotypes caused by mutant Pde6h/PDE6H suggest species-to-species differences in the vulnerability of biochemical and neurosensory pathways of the visual signal transduction system. PMID:25739440

  12. A fast parallel clustering algorithm for molecular simulation trajectories.

    PubMed

    Zhao, Yutong; Sheong, Fu Kit; Sun, Jian; Sander, Pedro; Huang, Xuhui

    2013-01-15

    We implemented a GPU-powered parallel k-centers algorithm to perform clustering on the conformations of molecular dynamics (MD) simulations. The algorithm is up to two orders of magnitude faster than the CPU implementation. We tested our algorithm on four protein MD simulation datasets ranging from the small Alanine Dipeptide to a 370-residue Maltose Binding Protein (MBP). It is capable of grouping 250,000 conformations of the MBP into 4000 clusters within 40 seconds. To achieve this, we effectively parallelized the code on the GPU and utilized the triangle inequality of metric spaces. Furthermore, the algorithm's running time is linear with respect to the number of cluster centers. In addition, we found the triangle inequality to be less effective in higher dimensions and provide a mathematical rationale. Finally, using Alanine Dipeptide as an example, we show a strong correlation between cluster populations resulting from the k-centers algorithm and the underlying density. Copyright © 2012 Wiley Periodicals, Inc.
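
    How the triangle inequality can save distance evaluations in k-centers clustering may be easier to see in a small serial sketch. It assumes the standard greedy farthest-first form of k-centers and Euclidean distance on toy 2D points, whereas the paper clusters MD conformations on a GPU. The function name k_centers and the specific pruning test are illustrative assumptions, not the authors' implementation.

```python
import math


def k_centers(points, k, dist=math.dist):
    """Greedy farthest-first k-centers with triangle-inequality pruning.

    Returns (center indices, assignment of each point to a center slot,
    covering radius, number of distance evaluations skipped).
    """
    n = len(points)
    centers = [0]                                   # arbitrary first center
    assign = [0] * n                                # slot in `centers` for each point
    d_near = [dist(p, points[0]) for p in points]   # distance to the assigned center
    skipped = 0
    while len(centers) < k:
        new = max(range(n), key=d_near.__getitem__)  # farthest point becomes a center
        new_slot = len(centers)
        # Distances from the new center to every existing center.
        d_cc = [dist(points[new], points[c]) for c in centers]
        centers.append(new)
        for i in range(n):
            # If d(a, c) >= 2 d(p, a), then d(p, c) >= d(a, c) - d(p, a) >= d(p, a),
            # so the new center c cannot be closer than the current center a.
            if d_cc[assign[i]] >= 2.0 * d_near[i]:
                skipped += 1
                continue
            d = dist(points[i], points[new])
            if d < d_near[i]:
                d_near[i] = d
                assign[i] = new_slot
    return centers, assign, max(d_near), skipped
```

    The per-point loop is the part that would map onto GPU threads. The pruning only ever skips evaluations that could not change an assignment, so the result matches the unpruned greedy algorithm.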

  13. Acceleration of Radiance for Lighting Simulation by Using Parallel Computing with OpenCL

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zuo, Wangda; McNeil, Andrew; Wetter, Michael

    2011-09-06

    We report on the acceleration of annual daylighting simulations for fenestration systems in the Radiance ray-tracing program. The algorithm was optimized to reduce both the redundant data input/output operations and the floating-point operations. To further accelerate the simulation speed, the calculation for matrix multiplications was implemented using parallel computing on a graphics processing unit. We used OpenCL, which is a cross-platform parallel programming language. Numerical experiments show that the combination of the above measures can speed up the annual daylighting simulations 101.7 times or 28.6 times when the sky vector has 146 or 2306 elements, respectively.

  14. Frequent phosphodiesterase 11A gene (PDE11A) defects in patients with Carney complex (CNC) caused by PRKAR1A mutations: PDE11A may contribute to adrenal and testicular tumors in CNC as a modifier of the phenotype.

    PubMed

    Libé, Rossella; Horvath, Anelia; Vezzosi, Delphine; Fratticci, Amato; Coste, Joel; Perlemoine, Karine; Ragazzon, Bruno; Guillaud-Bataille, Marine; Groussin, Lionel; Clauser, Eric; Raffin-Sanson, Marie-Laure; Siegel, Jennifer; Moran, Jason; Drori-Herishanu, Limor; Faucz, Fabio Rueda; Lodish, Maya; Nesterova, Maria; Bertagna, Xavier; Bertherat, Jerome; Stratakis, Constantine A

    2011-01-01

    Carney complex (CNC) is an autosomal dominant multiple neoplasia, caused mostly by inactivating mutations of the regulatory subunit 1A of the protein kinase A (PRKAR1A). Primary pigmented nodular adrenocortical disease (PPNAD) is the most frequent endocrine manifestation of CNC with a great inter-individual variability. Germline, protein-truncating mutations of phosphodiesterase type 11A (PDE11A) have been described to predispose to a variety of endocrine tumors, including adrenal and testicular tumors. Our objective was to investigate the role of PDE11A as a possible gene modifier of the phenotype in a series of 150 patients with CNC. A higher frequency of PDE11A variants in patients with CNC compared with healthy controls was found (25.3 vs. 6.8%, P < 0.0001). Among CNC patients, those with PPNAD were significantly more frequently carriers of PDE11A variants compared with patients without PPNAD (30.8 vs. 13%, P = 0.025). Furthermore, men with PPNAD were significantly more frequently carriers of PDE11A sequence variants (40.7%) than women with PPNAD (27.3%) (P < 0.001). A higher frequency of PDE11A sequence variants was also found in patients with large-cell calcifying Sertoli cell tumors (LCCSCT) compared with those without LCCSCT (50 vs. 10%, P = 0.0056). PDE11A variants were significantly associated with the copresence of PPNAD and LCCSCT in men (81 vs. 20%, P < 0.004). The simultaneous inactivation of PRKAR1A and PDE11A by small inhibitory RNA led to an increase in cAMP-regulatory element-mediated transcriptional activity under basal conditions and after stimulation by forskolin. We demonstrate, in a large cohort of CNC patients, a high frequency of PDE11A variants, suggesting that PDE11A is a genetic modifying factor for the development of testicular and adrenal tumors in patients with germline PRKAR1A mutation.

  15. Parallel conjugate gradient algorithms for manipulator dynamic simulation

    NASA Technical Reports Server (NTRS)

    Fijany, Amir; Scheld, Robert E.

    1989-01-01

    Parallel conjugate gradient algorithms for the computation of multibody dynamics are developed for the specialized case of a robot manipulator. For an n-dimensional positive-definite linear system, the Classical Conjugate Gradient (CCG) algorithms are guaranteed to converge in n iterations, each with a computation cost of O(n); this leads to a total computational cost of O(n^2) on a serial processor. Conjugate gradient algorithms are presented that provide greater efficiency by using a preconditioner, which reduces the number of iterations required, and by exploiting parallelism, which reduces the cost of each iteration. Two Preconditioned Conjugate Gradient (PCG) algorithms are proposed which respectively use a diagonal and a tridiagonal matrix, composed of the diagonal and tridiagonal elements of the mass matrix, as preconditioners. Parallel algorithms are developed to compute the preconditioners and their inversions in O(log_2 n) steps using n processors. A parallel algorithm is also presented which, on the same architecture, achieves the computational time of O(log_2 n) for each iteration. Simulation results for a seven degree-of-freedom manipulator are presented. Variants of the proposed algorithms are also developed which can be efficiently implemented on the Robot Mathematics Processor (RMP).
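
    The diagonally preconditioned variant described above corresponds to the textbook Jacobi-preconditioned conjugate gradient iteration, sketched here for a generic symmetric positive-definite system. This is a serial illustration only: it uses a dense matrix-vector product, so it does not reproduce the O(n) per-iteration cost or the O(log_2 n) parallel steps claimed for the manipulator mass matrix, and the name pcg_diagonal and the small test matrix in the usage note are hypothetical.

```python
import math


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(a, x):
    return [dot(row, x) for row in a]


def pcg_diagonal(a, b, tol=1e-10, max_iter=None):
    """Solve a x = b for symmetric positive-definite a, preconditioned by diag(a)."""
    n = len(b)
    if max_iter is None:
        max_iter = 10 * n
    m_inv = [1.0 / a[i][i] for i in range(n)]   # inverse of the diagonal preconditioner
    x = [0.0] * n
    r = list(b)                                  # residual b - a x for x = 0
    if math.sqrt(dot(r, r)) < tol:
        return x, 0
    z = [mi * ri for mi, ri in zip(m_inv, r)]    # preconditioned residual
    p = list(z)                                  # first search direction
    rz = dot(r, z)
    for it in range(1, max_iter + 1):
        ap = matvec(a, p)
        alpha = rz / dot(p, ap)                  # step length along p
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, ap)]
        if math.sqrt(dot(r, r)) < tol:
            return x, it
        z = [mi * ri for mi, ri in zip(m_inv, r)]
        rz_new = dot(r, z)
        beta = rz_new / rz                       # keeps directions a-conjugate
        p = [zi + beta * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter
```

    For example, pcg_diagonal([[4.0, 1.0, 0.0], [1.0, 3.0, 1.0], [0.0, 1.0, 2.0]], [1.0, 2.0, 3.0]) returns the solution vector together with the number of iterations used. The dot products and the element-wise vector updates are the operations a parallel version would distribute across processors.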

  16. Parallel processing for nonlinear dynamics simulations of structures including rotating bladed-disk assemblies

    NASA Technical Reports Server (NTRS)

    Hsieh, Shang-Hsien

    1993-01-01

    The principal objective of this research is to develop, test, and implement coarse-grained, parallel-processing strategies for nonlinear dynamic simulations of practical structural problems. There are contributions to four main areas: finite element modeling and analysis of rotational dynamics, numerical algorithms for parallel nonlinear solutions, automatic partitioning techniques to effect load-balancing among processors, and an integrated parallel analysis system.

  17. Parallel Adaptive High-Order CFD Simulations Characterizing SOFIA Cavity Acoustics

    NASA Technical Reports Server (NTRS)

    Barad, Michael F.; Brehm, Christoph; Kiris, Cetin C.; Biswas, Rupak

    2016-01-01

    This paper presents large-scale MPI-parallel computational fluid dynamics simulations for the Stratospheric Observatory for Infrared Astronomy (SOFIA). SOFIA is an airborne, 2.5-meter infrared telescope mounted in an open cavity in the aft fuselage of a Boeing 747SP. These simulations focus on how the unsteady flow field inside and over the cavity interferes with the optical path and mounting structure of the telescope. A temporally fourth-order accurate Runge-Kutta scheme and a spatially fifth-order accurate WENO-5Z scheme were used to perform implicit large eddy simulations. An immersed boundary method provides automated gridding for complex geometries and natural coupling to a block-structured Cartesian adaptive mesh refinement framework. Strong scaling studies using NASA's Pleiades supercomputer with up to 32k CPU cores and 4 billion computational cells show excellent scaling. Dynamic load balancing based on execution time on individual AMR blocks addresses irregular numerical cost associated with blocks containing boundaries. Limits to scaling beyond 32k cores are identified, and targeted code optimizations are discussed.

  18. Parallel Adaptive High-Order CFD Simulations Characterizing SOFIA Cavity Acoustics

    NASA Technical Reports Server (NTRS)

    Barad, Michael F.; Brehm, Christoph; Kiris, Cetin C.; Biswas, Rupak

    2015-01-01

    This paper presents large-scale MPI-parallel computational fluid dynamics simulations for the Stratospheric Observatory for Infrared Astronomy (SOFIA). SOFIA is an airborne, 2.5-meter infrared telescope mounted in an open cavity in the aft fuselage of a Boeing 747SP. These simulations focus on how the unsteady flow field inside and over the cavity interferes with the optical path and mounting structure of the telescope. A temporally fourth-order accurate Runge-Kutta scheme and a spatially fifth-order accurate WENO-5Z scheme were used to perform implicit large eddy simulations. An immersed boundary method provides automated gridding for complex geometries and natural coupling to a block-structured Cartesian adaptive mesh refinement framework. Strong scaling studies using NASA's Pleiades supercomputer with up to 32k CPU cores and 4 billion computational cells show excellent scaling. Dynamic load balancing based on execution time on individual AMR blocks addresses irregular numerical cost associated with blocks containing boundaries. Limits to scaling beyond 32k cores are identified, and targeted code optimizations are discussed.

  19. Extending molecular simulation time scales: Parallel in time integrations for high-level quantum chemistry and complex force representations

    NASA Astrophysics Data System (ADS)

    Bylaska, Eric J.; Weare, Jonathan Q.; Weare, John H.

    2013-08-01

    Parallel in time simulation algorithms are presented and applied to conventional molecular dynamics (MD) and ab initio molecular dynamics (AIMD) models of realistic complexity. Assuming that a forward time integrator, $f$ (e.g., Verlet algorithm), is available to propagate the system from time $t_i$ (trajectory positions and velocities $x_i = (r_i, v_i)$) to time $t_{i+1}$ ($x_{i+1}$) by $x_{i+1} = f(x_i)$, the dynamics problem spanning an interval from $t_0 \ldots t_M$ can be transformed into a root finding problem, $F(X) = [x_i - f(x_{i-1})]_{i=1,\ldots,M} = 0$, for the trajectory variables. The root finding problem is solved using a variety of root finding techniques, including quasi-Newton and preconditioned quasi-Newton schemes that are all unconditionally convergent. The algorithms are parallelized by assigning a processor to each time-step entry in the columns of F(X). The relation of this approach to other recently proposed parallel in time methods is discussed, and the effectiveness of various approaches to solving the root finding problem is tested. We demonstrate that more efficient dynamical models based on simplified interactions or coarsening time-steps provide preconditioners for the root finding problem. However, for MD and AIMD simulations, such preconditioners are not required to obtain reasonable convergence and their cost must be considered in the performance of the algorithm. The parallel in time algorithms developed are tested by applying them to MD and AIMD simulations of size and complexity similar to those encountered in present day applications. These include a 1000 Si atom MD simulation using Stillinger-Weber potentials, and a HCl + 4H2O AIMD simulation at the MP2 level. The maximum speedup (serial execution time/parallel execution time) obtained by parallelizing the Stillinger-Weber MD simulation was nearly 3.0. For the AIMD MP2 simulations, the algorithms achieved speedups of up to 14.3. The parallel in time algorithms can be implemented in a distributed computing
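
    The reformulation of time stepping as a root finding problem can be made concrete with a toy sketch. It assumes a one-dimensional harmonic oscillator propagated by velocity Verlet and solves F(X) = 0 with the plainest possible fixed-point sweep, in which every entry x_i is updated from the previous iterate's x_(i-1). This is deliberately not the quasi-Newton machinery of the abstract: the naive sweep needs up to M iterations and so gives no speedup by itself, but it shows that all M updates within a sweep are independent and could each be assigned to a processor. The names verlet_step, residual and solve_trajectory are hypothetical.

```python
def verlet_step(state, dt=0.05, k=1.0):
    """One velocity Verlet step for a unit-mass harmonic oscillator, state = (r, v)."""
    r, v = state
    v_half = v + 0.5 * dt * (-k * r)
    r_new = r + dt * v_half
    v_new = v_half + 0.5 * dt * (-k * r_new)
    return (r_new, v_new)


def residual(traj, f):
    """Largest component of F(X) = [x_i - f(x_(i-1))] over i = 1..M."""
    return max(
        max(abs(a - b) for a, b in zip(traj[i], f(traj[i - 1])))
        for i in range(1, len(traj))
    )


def solve_trajectory(x0, m, f, tol=1e-12, max_sweeps=None):
    """Solve F(X) = 0 by fixed-point sweeps; returns (trajectory, sweeps used)."""
    if max_sweeps is None:
        max_sweeps = m
    traj = [x0] * (m + 1)              # crude initial guess: the system never moves
    sweeps = 0
    while sweeps < max_sweeps and residual(traj, f) > tol:
        # Every entry depends only on the previous iterate, so the m
        # evaluations of f in this line are independent of each other.
        traj = [x0] + [f(traj[i - 1]) for i in range(1, m + 1)]
        sweeps += 1
    return traj, sweeps
```

    After j sweeps the first j entries already equal the serial trajectory, which is why the naive iteration terminates in at most M sweeps. Faster root finders aim to reach the same fixed point in far fewer iterations, and that is where any parallel speedup would come from.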

  20. Parallel spatial direct numerical simulations on the Intel iPSC/860 hypercube

    NASA Technical Reports Server (NTRS)

    Joslin, Ronald D.; Zubair, Mohammad

    1993-01-01

    The implementation and performance of a parallel spatial direct numerical simulation (PSDNS) approach on the Intel iPSC/860 hypercube are documented. The direct numerical simulation approach is used to compute spatially evolving disturbances associated with the laminar-to-turbulent transition in boundary-layer flows. The feasibility of using the PSDNS on the hypercube to perform transition studies is examined. The results indicate that the direct numerical simulation approach can effectively be parallelized on a distributed-memory parallel machine. By increasing the number of processors, nearly ideal linear speedups are achieved with nonoptimized routines; slower than linear speedups are achieved with optimized (machine-dependent library) routines. This slower than linear speedup results because the Fast Fourier Transform (FFT) routine dominates the computational cost and because the routine exhibits less than ideal speedups. However, with the machine-dependent routines the total computational cost decreases by a factor of 4 to 5 compared with standard FORTRAN routines. The computational cost increases linearly with spanwise, wall-normal, and streamwise grid refinements. The hypercube with 32 processors was estimated to require approximately twice the amount of Cray supercomputer single processor time to complete a comparable simulation; however, it is estimated that a subgrid-scale model, which reduces the required number of grid points and becomes a large-eddy simulation (PSLES), would reduce the computational cost and memory requirements by a factor of 10 over the PSDNS. This PSLES implementation would enable transition simulations on the hypercube at a reasonable computational cost.

  1. Interaction between integrin α5 and PDE4D regulates endothelial inflammatory signalling

    PubMed Central

    Yun, Sanguk; Budatha, Madhusudhan; Dahlman, James E.; Coon, Brian G.; Cameron, Ryan T.; Langer, Robert; Anderson, Daniel G.; Baillie, George; Schwartz, Martin A.

    2016-01-01

    Atherosclerosis is primarily a disease of lipid metabolism and inflammation; however, it is also closely associated with endothelial extracellular matrix (ECM) remodelling, with fibronectin accumulating in the laminin–collagen basement membrane. To investigate how fibronectin modulates inflammation in arteries, we replaced the cytoplasmic tail of the fibronectin receptor integrin α5 with that of the collagen/laminin receptor integrin α2. This chimaera suppressed inflammatory signalling in endothelial cells on fibronectin and in knock-in mice. Fibronectin promoted inflammation by suppressing anti-inflammatory cAMP. cAMP was activated through endothelial prostacyclin secretion; however, this was ECM-independent. Instead, cells on fibronectin suppressed cAMP via enhanced phosphodiesterase (PDE) activity, through direct binding of integrin α5 to phosphodiesterase-4D5 (PDE4D5), which induced PP2A-dependent dephosphorylation of PDE4D5 on the inhibitory site Ser651. In vivo knockdown of PDE4D5 inhibited inflammation at athero-prone sites. These data elucidate a molecular mechanism linking ECM remodelling and inflammation, thereby identifying a new class of therapeutic targets. PMID:27595237

  2. Address tracing for parallel machines

    NASA Technical Reports Server (NTRS)

    Stunkel, Craig B.; Janssens, Bob; Fuchs, W. Kent

    1991-01-01

    Recently implemented parallel system address-tracing methods based on several metrics are surveyed. The issues specific to collection of traces for both shared and distributed memory parallel computers are highlighted. Five general categories of address-trace collection methods are examined: hardware-captured, interrupt-based, simulation-based, altered microcode-based, and instrumented program-based traces. The problems unique to shared memory and distributed memory multiprocessors are examined separately.

  3. A scalable PC-based parallel computer for lattice QCD

    NASA Astrophysics Data System (ADS)

    Fodor, Z.; Katz, S. D.; Pappa, G.

    2003-05-01

    A PC-based parallel computer for medium/large scale lattice QCD simulations is suggested. The Eötvös Univ., Inst. Theor. Phys. cluster consists of 137 Intel P4-1.7GHz nodes. Gigabit Ethernet cards are used for nearest neighbor communication in a two-dimensional mesh. The sustained performance for dynamical staggered (Wilson) quarks on large lattices is around 70(110) GFlops. The exceptional price/performance ratio is below $1/Mflop.

  4. Rutin inhibits B[a]PDE-induced cyclooxygenase-2 expression by targeting EGFR kinase activity.

    PubMed

    Choi, Seunghwan; Lim, Tae-Gyu; Hwang, Mun Kyung; Kim, Yoon-A; Kim, Jiyoung; Kang, Nam Joo; Jang, Tae Su; Park, Jun-Seong; Yeom, Myeong Hun; Lee, Ki Won

    2013-11-15

    Rutin is a well-known flavonoid that exists in various natural sources. Accumulating studies have reported the biological effects of rutin, such as anti-oxidative and anti-inflammatory effects. However, the underlying mechanisms of rutin and its direct targets are not understood. We investigated whether rutin reduced B[a]PDE-induced COX-2 expression. The transactivation of AP-1 and NF-κB was inhibited by rutin. Rutin also attenuated B[a]PDE-induced Raf/MEK/ERK and Akt activation, but had no effect on the phosphorylation of EGFR. An in vitro kinase assay revealed that rutin suppressed EGFR kinase activity. We also confirmed direct binding between rutin and EGFR, and found that the binding was regressed by ATP. The EGFR inhibitor also inhibited the B[a]PDE-induced MEK/ERK and Akt signaling pathways and subsequently suppressed COX-2 expression and promoter activity, in addition to suppressing the transactivation of AP-1 and NF-κB. In EGFR(-/-) mouse embryonic fibroblast cells, B[a]PDE-induced COX-2 expression was also diminished. Collectively, rutin inhibits B[a]PDE-induced COX-2 expression by suppressing the Raf/MEK/ERK and Akt signaling pathways. EGFR appeared to be the direct target of rutin. Copyright © 2013 Elsevier Inc. All rights reserved.

  5. Modified current follower-based immittance function simulators

    NASA Astrophysics Data System (ADS)

    Alpaslan, Halil; Yuce, Erkan

    2017-12-01

    In this paper, four immittance function simulators consisting of a single modified current follower with single Z- terminal and a minimum number of passive components are proposed. The first proposed circuit can provide +L parallel with +R and the second proposed one can realise -L parallel with -R. The third proposed structure can provide +L series with +R and the fourth proposed one can realise -L series with -R. However, all the proposed immittance function simulators need a single resistive matching constraint. Parasitic impedance effects on all the proposed immittance function simulators are investigated. A second-order current-mode (CM) high-pass filter derived from the first proposed immittance function simulator is given as an application example. Also, a second-order CM low-pass filter derived from the third proposed immittance function simulator is given as an application example. A number of simulation results based on SPICE programme and an experimental test result are given to verify the theory.
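
    The four functions named in the abstract (+L parallel with +R, -L parallel with -R, +L series with +R, -L series with -R) can be illustrated numerically. The sketch below assumes ideal lumped elements and ignores the parasitic impedances and the resistive matching constraint mentioned above; the function names and the component values are hypothetical and are not taken from the paper.

```python
# Ideal impedance of the R-L combinations that the proposed simulators emulate.
# Assumption: ideal elements at angular frequency omega; negative R and L
# values model the "-L with -R" cases.

def series_rl(R, L, omega):
    """Impedance of R in series with L: Z = R + j*omega*L."""
    return complex(R, omega * L)

def parallel_rl(R, L, omega):
    """Impedance of R in parallel with L: Z = (R * j*omega*L) / (R + j*omega*L)."""
    z_r = complex(R, 0.0)
    z_l = complex(0.0, omega * L)
    return z_r * z_l / (z_r + z_l)

if __name__ == "__main__":
    omega = 1.0  # rad/s, illustrative value only
    print("+L series +R  :", series_rl(1.0, 1.0, omega))
    print("-L series -R  :", series_rl(-1.0, -1.0, omega))
    print("+L parallel +R:", parallel_rl(1.0, 1.0, omega))
    print("-L parallel -R:", parallel_rl(-1.0, -1.0, omega))
```

    With R = 1, L = 1 and omega = 1 the negative variants are simply the sign-reversed impedances of the positive ones, which is what makes them useful for cancelling parasitic elements.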

  6. Reasons and predictive factors for discontinuation of PDE-5 inhibitors despite successful intercourse in erectile dysfunction patients

    PubMed Central

    Kim, S-C; Lee, Y-S; Seo, K-K; Jung, G-W; Kim, T-H

    2014-01-01

    This study aimed to identify characteristics of ED patients who discontinued PDE5i despite successful intercourse. Data were collected using a questionnaire from 34 urologic clinics regardless of the effect (success or failure) of PDE5i treatment by visiting the clinics (717), e-mail (64) or post (101) for 882 ED patients who had previously taken any kind of PDE5i on demand four or more times. Discontinuation of PDE5i was defined as not having taken PDE5i during the previous year despite successful intercourse. Of the 882 patients, 485 were included in the final analysis. Differences in the socio-demographic, ED- and partner-related data between the continuation and discontinuation groups and factors influencing discontinuation of the PDE5i were analyzed. Among 485 respondents (mean age, 53.6), 116 (23.9%) had discontinued PDE5i use despite successful intercourse. The most common reasons for the discontinuation were ‘reluctant medication-dependent intercourse’ (31.0%), ‘spontaneous recovery of erectile function without further treatment’ (30.2%), and ‘high cost’ (26.7%). In multiple logistic regression analysis, independent factors influencing discontinuation of the drug were cause of ED (psychogenic), short duration of ED, low education (⩽ middle school), and religion (Catholic). In partner-related compliance, only partner's religion (Catholic) was a significant factor. PMID:24305610

  7. Rational rates of uniform decay for strong solutions to a fluid-structure PDE system

    NASA Astrophysics Data System (ADS)

    Avalos, George; Bucci, Francesca

    2015-06-01

    In this work we investigate the uniform stability properties of solutions to a well-established partial differential equation (PDE) model for a fluid-structure interaction. The PDE system under consideration comprises a Stokes flow which evolves within a three-dimensional cavity; moreover, a Kirchhoff plate equation is invoked to describe the displacements along a (fixed) portion - say, Ω - of the cavity wall. Contact between the respective fluid and structure dynamics occurs on the boundary interface Ω. The main result in the paper is as follows: the solutions to the composite PDE system, corresponding to smooth initial data, decay at the rate of O (1 / t). Our method of proof hinges upon the appropriate invocation of a relatively recent resolvent criterion for polynomial decays of C0-semigroups. While the characterization provided by said criterion originates in the context of operator theory and functional analysis, the work entailed here is wholly within the realm of PDE.

  8. A Three-Dimensional Eulerian Code for Simulation of High-Speed Multimaterial Interactions

    DTIC Science & Technology

    2011-08-01

    PDE-based extension. The extension process is done on only the host cells on a particular processor. After extension the parallel communication is...condensation shocks, explosive debris transport, detonation in heterogeneous media and so on. In these flows complex interactions occur between the...A.22] and Ω_ij is the spin tensor. The Jaumann derivative is used to ensure objectivity of the stress tensor with respect to rotation

  9. Spontaneous Hot Flow Anomalies at Quasi-Parallel Shocks: 2. Hybrid Simulations

    NASA Technical Reports Server (NTRS)

    Omidi, N.; Zhang, H.; Sibeck, D.; Turner, D.

    2013-01-01

    Motivated by recent THEMIS observations, this paper uses 2.5-D electromagnetic hybrid simulations to investigate the formation of Spontaneous Hot Flow Anomalies (SHFA) upstream of quasi-parallel bow shocks during steady solar wind conditions and in the absence of discontinuities. The results show the formation of a large number of structures along and upstream of the quasi-parallel bow shock. Their outer edges exhibit density and magnetic field enhancements, while their cores exhibit drops in density, magnetic field, solar wind velocity and enhancements in ion temperature. Using virtual spacecraft in the simulation, we show that the signatures of these structures in the time series data are very similar to those of SHFAs seen in THEMIS data and conclude that they correspond to SHFAs. Examination of the simulation data shows that SHFAs form as the result of foreshock cavitons interacting with the bow shock. Foreshock cavitons in turn form due to the nonlinear evolution of ULF waves generated by the interaction of the solar wind with the backstreaming ions. Because foreshock cavitons are an inherent part of the shock dissipation process, the formation of SHFAs is also an inherent part of the dissipation process leading to a highly non-uniform plasma in the quasi-parallel magnetosheath including large scale density and magnetic field cavities.

  10. Towards selective phosphodiesterase 2A (PDE2A) inhibitors: a patent review (2010 - present).

    PubMed

    Trabanco, Andrés A; Buijnsters, Peter; Rombouts, Frederik J R

    2016-08-01

    The cyclic nucleotides cAMP and cGMP are ubiquitous intracellular second messengers regulating a large variety of biological processes. The intracellular concentration of these biologically relevant molecules is modulated by the activity of phosphodiesterases (PDEs), a class of enzymes that is grouped in 11 families. The expression of PDEs is tissue- and cell-specific allowing spatiotemporal integration of multiple signaling cascades. PDE2A is a dual substrate enzyme and is expressed in both the periphery and in the central nervous system, however its expression is highest in the brain, where it is mainly localized in the cortex, hippocampus, and striatum. This suggests that this enzyme may regulate intraneuronal cGMP and cAMP levels in brain areas involved in emotion, perception, concentration, learning and memory. This review covers the patent applications published between January 2010 and February 2016 on phosphodiesterase 2A inhibitors. Recent publications in the literature and in filed patent applications demonstrate the interest of pharmaceutical companies for PDE2A. This has increased the insights of its possible therapeutic role but the few clinical trials were terminated. Based on the ongoing interest in the field it is likely that new clinical trials can be expected and will unravel the therapeutic potential of PDE2A inhibition.

  11. Validation of PDE9A Gene Identified in GWAS Showing Strong Association with Milk Production Traits in Chinese Holstein.

    PubMed

    Yang, Shao-Hua; Bi, Xiao-Jun; Xie, Yan; Li, Cong; Zhang, Sheng-Li; Zhang, Qin; Sun, Dong-Xiao

    2015-11-05

    Phosphodiesterase 9A (PDE9A) is a cyclic guanosine monophosphate (cGMP)-specific enzyme widely expressed among the tissues, which is important in activating cGMP-dependent signaling pathways. In our previous genome-wide association study, a single nucleotide polymorphism (SNP) (BTA-55340-no-rs(b)) located in intron 14 of PDE9A was found to be significantly associated with protein yield. In addition, we found that PDE9A was highly expressed in mammary gland by analyzing its mRNA expression in different tissues. The objectives of this study were to identify genetic polymorphisms of PDE9A and to determine the effects of these variants on milk production traits in dairy cattle. DNA sequencing identified 11 SNPs, and six SNPs in the 5' regulatory region were genotyped for the subsequent association analyses. After Bonferroni correction for multiple testing, all these identified SNPs were statistically significant for one or more milk production traits (p < 0.0001~0.0077). Interestingly, haplotype-based association analysis revealed similar effects on milk production traits (p < 0.01). In follow-up RNA expression analyses, two SNPs (c.-1376 G>A, c.-724 A>G) were involved in the regulation of gene expression. Consequently, our findings provide confirmatory evidence for associations of PDE9A variants with milk production traits, and these identified SNPs may serve as genetic markers to accelerate the Chinese Holstein breeding program.

  12. Satisfiability Test with Synchronous Simulated Annealing on the Fujitsu AP1000 Massively-Parallel Multiprocessor

    NASA Technical Reports Server (NTRS)

    Sohn, Andrew; Biswas, Rupak

    1996-01-01

    Solving the hard Satisfiability Problem is time consuming even for modest-sized problem instances. Solving the Random L-SAT Problem is especially difficult due to the ratio of clauses to variables. This report presents a parallel synchronous simulated annealing method for solving the Random L-SAT Problem on a large-scale distributed-memory multiprocessor. In particular, we use a parallel synchronous simulated annealing procedure, called Generalized Speculative Computation, which guarantees the same decision sequence as sequential simulated annealing. To demonstrate the performance of the parallel method, we have selected problem instances varying in size from 100-variables/425-clauses to 5000-variables/21,250-clauses. Experimental results on the AP1000 multiprocessor indicate that our approach can satisfy 99.9 percent of the clauses while giving almost a 70-fold speedup on 500 processors.
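
    For orientation, the sequential baseline that the parallel method is required to reproduce can be sketched as plain simulated annealing on a random instance. This is a minimal sketch under stated assumptions: it is sequential only and does not implement Generalized Speculative Computation; clause length 3, the cooling schedule, the instance size (40 variables, 170 clauses, keeping the 4.25 clause-to-variable ratio of the instances above) and all function names are illustrative choices, not taken from the report.

```python
import math
import random

def random_lsat(n_vars, n_clauses, lits_per_clause, rng):
    """Random instance: each clause is a list of (variable, wanted_value) literals."""
    return [[(v, rng.random() < 0.5)
             for v in rng.sample(range(n_vars), lits_per_clause)]
            for _ in range(n_clauses)]

def count_satisfied(clauses, assign):
    """Number of clauses with at least one literal matching the assignment."""
    return sum(any(assign[v] == want for v, want in clause) for clause in clauses)

def anneal(clauses, n_vars, rng, t_start=2.0, t_end=0.05, cooling=0.95,
           moves_per_temp=100):
    """Sequential simulated annealing; returns (initial, best) satisfied counts."""
    assign = [rng.random() < 0.5 for _ in range(n_vars)]
    current = count_satisfied(clauses, assign)
    initial = best = current
    t = t_start
    while t > t_end and best < len(clauses):
        for _ in range(moves_per_temp):
            v = rng.randrange(n_vars)
            assign[v] = not assign[v]              # propose flipping one variable
            candidate = count_satisfied(clauses, assign)
            delta = candidate - current
            # Metropolis rule: always accept improvements, sometimes accept losses.
            if delta >= 0 or rng.random() < math.exp(delta / t):
                current = candidate
                best = max(best, current)
            else:
                assign[v] = not assign[v]          # reject: undo the flip
        t *= cooling
    return initial, best

if __name__ == "__main__":
    rng = random.Random(0)
    clauses = random_lsat(40, 170, 3, rng)
    initial, best = anneal(clauses, 40, rng)
    print(f"satisfied clauses: {initial} initially, {best} of {len(clauses)} at best")
```

    Each accept/reject decision depends on the state left by the previous one, which is the sequential dependency a synchronous parallel scheme has to preserve in order to guarantee the same decision sequence.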

  13. Development of a parallel FE simulator for modeling the whole trans-scale failure process of rock from meso- to engineering-scale

    NASA Astrophysics Data System (ADS)

    Li, Gen; Tang, Chun-An; Liang, Zheng-Zhao

    2017-01-01

    Multi-scale high-resolution modeling of rock failure process is a powerful means in modern rock mechanics studies to reveal the complex failure mechanism and to evaluate engineering risks. However, multi-scale continuous modeling of rock, from deformation, damage to failure, has raised high requirements on the design, implementation scheme and computation capacity of the numerical software system. This study is aimed at developing the parallel finite element procedure, a parallel rock failure process analysis (RFPA) simulator that is capable of modeling the whole trans-scale failure process of rock. Based on the statistical meso-damage mechanical method, the RFPA simulator is able to construct heterogeneous rock models with multiple mechanical properties, deal with and represent the trans-scale propagation of cracks, in which the stress and strain fields are solved for the damage evolution analysis of representative volume element by the parallel finite element method (FEM) solver. This paper describes the theoretical basis of the approach and provides the details of the parallel implementation on a Windows - Linux interactive platform. A numerical model is built to test the parallel performance of FEM solver. Numerical simulations are then carried out on a laboratory-scale uniaxial compression test, and field-scale net fracture spacing and engineering-scale rock slope examples, respectively. The simulation results indicate that relatively high speedup and computation efficiency can be achieved by the parallel FEM solver with a reasonable boot process. In laboratory-scale simulation, the well-known physical phenomena, such as the macroscopic fracture pattern and stress-strain responses, can be reproduced. In field-scale simulation, the formation process of net fracture spacing from initiation, propagation to saturation can be revealed completely. In engineering-scale simulation, the whole progressive failure process of the rock slope can be well modeled. It is

  14. An Empirical Development of Parallelization Guidelines for Time-Driven Simulation

    DTIC Science & Technology

    1989-12-01

    wives, who though not Cub fans, put on a good show during our trip to watch some games. I would also like to recognize the help of my professors at...program parallelization. In this research effort a Ballistic Missile Defense (BMD) time driven simulation program, developed by DESE Research and...continuously, or continuously with discrete changes superimposed. The distinguishing feature of these simulations is the interaction between discretely

  15. Modeling of fatigue crack induced nonlinear ultrasonics using a highly parallelized explicit local interaction simulation approach

    NASA Astrophysics Data System (ADS)

    Shen, Yanfeng; Cesnik, Carlos E. S.

    2016-04-01

    This paper presents a parallelized modeling technique for the efficient simulation of nonlinear ultrasonics introduced by the wave interaction with fatigue cracks. The elastodynamic wave equations with contact effects are formulated using an explicit Local Interaction Simulation Approach (LISA). The LISA formulation is extended to capture the contact-impact phenomena during the wave damage interaction based on the penalty method. A Coulomb friction model is integrated into the computation procedure to capture the stick-slip contact shear motion. The LISA procedure is coded using the Compute Unified Device Architecture (CUDA), which enables highly parallelized supercomputing on powerful graphics cards. Both the explicit contact formulation and the parallel feature facilitate LISA's superb computational efficiency over the conventional finite element method (FEM). The theoretical formulations based on the penalty method are introduced and a guideline for the proper choice of the contact stiffness is given. The convergence behavior of the solution under various contact stiffness values is examined. A numerical benchmark problem is used to investigate the new LISA formulation and results are compared with a conventional contact finite element solution. Various nonlinear ultrasonic phenomena are successfully captured using this contact LISA formulation, including the generation of nonlinear higher harmonic responses. Nonlinear mode conversion of guided waves at fatigue cracks is also studied.

  16. Anchored PDE4 regulates chloride conductance in wild-type and ΔF508-CFTR human airway epithelia

    PubMed Central

    Blanchard, Elise; Zlock, Lorna; Lao, Anna; Mika, Delphine; Namkung, Wan; Xie, Moses; Scheitrum, Colleen; Gruenert, Dieter C.; Verkman, Alan S.; Finkbeiner, Walter E.; Conti, Marco; Richter, Wito

    2014-01-01

    Cystic fibrosis (CF) is caused by mutations in the gene encoding the cystic fibrosis transmembrane conductance regulator (CFTR) that impair its expression and/or chloride channel function. Here, we provide evidence that type 4 cyclic nucleotide phosphodiesterases (PDE4s) are critical regulators of the cAMP/PKA-dependent activation of CFTR in primary human bronchial epithelial cells. In non-CF cells, PDE4 inhibition increased CFTR activity under basal conditions (ΔISC 7.1 μA/cm2) and after isoproterenol stimulation (increased ΔISC from 13.9 to 21.0 μA/cm2) and slowed the return of stimulated CFTR activity to basal levels by >3-fold. In cells homozygous for ΔF508-CFTR, the most common mutation found in CF, PDE4 inhibition alone produced minimal channel activation. However, PDE4 inhibition strongly amplified the effects of CFTR correctors, drugs that increase expression and membrane localization of CFTR, and/or CFTR potentiators, drugs that increase channel gating, to reach ∼25% of the chloride conductance observed in non-CF cells. Biochemical studies indicate that PDE4s are anchored to CFTR and mediate a local regulation of channel function. Taken together, our results implicate PDE4 as an important determinant of CFTR activity in airway epithelia, and support the use of PDE4 inhibitors to potentiate the therapeutic benefits of CFTR correctors and potentiators.—Blanchard, E., Zlock, L., Lao, A., Mika, D., Namkung, W., Xie, M., Scheitrum, C., Gruenert, D.C., Verkman, A.S., Finkbeiner, W.E., Conti, M., Richter, W. Anchored PDE4 regulates chloride conductance in wild type and ΔF508-CFTR human airway epithelia. PMID:24200884

  17. Mapping a battlefield simulation onto message-passing parallel architectures

    NASA Technical Reports Server (NTRS)

    Nicol, David M.

    1987-01-01

    Perhaps the most critical problem in distributed simulation is that of mapping: without an effective mapping of workload to processors the speedup potential of parallel processing cannot be realized. Mapping a simulation onto a message-passing architecture is especially difficult when the computational workload dynamically changes as a function of time and space; this is exactly the situation faced by battlefield simulations. This paper studies an approach where the simulated battlefield domain is first partitioned into many regions of equal size; typically there are more regions than processors. The regions are then assigned to processors; a processor is responsible for performing all simulation activity associated with the regions. The assignment algorithm is quite simple and attempts to balance load by exploiting locality of workload intensity. The performance of this technique is studied on a simple battlefield simulation implemented on the Flex/32 multiprocessor. Measurements show that the proposed method achieves reasonable processor efficiencies. Furthermore, the method shows promise for use in dynamic remapping of the simulation.
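
    The idea of balancing load by exploiting locality of workload intensity can be sketched as follows. The abstract does not spell out the assignment algorithm, so the cyclic ("wrapped") assignment below is an assumption chosen for illustration: if neighbouring regions carry similar workloads, dealing neighbours out to different processors tends to even out the totals. The function names and the workload values are hypothetical.

```python
# Compare two ways of assigning equal-sized regions to processors when the
# workload is spatially clustered (neighbouring regions have similar load).

def block_assignment(n_regions, n_procs):
    """Contiguous blocks: processor p owns a run of adjacent regions."""
    per_proc = -(-n_regions // n_procs)  # ceiling division
    return [r // per_proc for r in range(n_regions)]

def cyclic_assignment(n_regions, n_procs):
    """Wrapped assignment: adjacent regions go to different processors."""
    return [r % n_procs for r in range(n_regions)]

def processor_loads(workload, owner, n_procs):
    """Total workload per processor for a given region-to-processor map."""
    loads = [0] * n_procs
    for work, proc in zip(workload, owner):
        loads[proc] += work
    return loads

def efficiency(loads):
    """Average load divided by maximum load (1.0 means perfectly balanced)."""
    return sum(loads) / (len(loads) * max(loads))

if __name__ == "__main__":
    workload = [1, 1, 2, 2, 8, 8, 9, 9]  # activity concentrated at one end
    for name, assign in (("block", block_assignment), ("cyclic", cyclic_assignment)):
        loads = processor_loads(workload, assign(len(workload), 2), 2)
        print(name, loads, round(efficiency(loads), 3))
```

    For this clustered workload on two processors the block map gives loads of 6 and 34, while the cyclic map gives 20 and 20. Having more regions than processors is what makes such interleaving possible.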

  18. The Programming Language Python In Earth System Simulations

    NASA Astrophysics Data System (ADS)

    Gross, L.; Imranullah, A.; Mora, P.; Saez, E.; Smillie, J.; Wang, C.

    2004-12-01

    Mathematical models in earth sciences are based on the solution of systems of coupled, non-linear, time-dependent partial differential equations (PDEs). The spatial and time scales vary from a planetary scale and millions of years for convection problems to 100 km and 10 years for fault systems simulations. Various techniques are in use to deal with the time dependency (e.g. Crank-Nicolson), with the non-linearity (e.g. Newton-Raphson) and weakly coupled equations (e.g. non-linear Gauss-Seidel). Besides these high-level solution algorithms, discretization methods (e.g. finite element method (FEM), boundary element method (BEM)) are used to deal with spatial derivatives. Typically, large-scale, three-dimensional meshes are required to resolve geometrical complexity (e.g. in the case of fault systems) or features in the solution (e.g. in mantle convection simulations). The modelling environment escript allows the rapid implementation of new physics as required for the development of simulation codes in earth sciences. Its main objective is to provide a programming language in which the user can define new models and rapidly develop high-level solution algorithms. The current implementation is linked with the finite element package finley as a PDE solver. However, the design is open and other discretization technologies such as finite differences and boundary element methods could be included. escript is implemented as an extension of the interactive programming environment Python (see www.python.org). Key concepts introduced are Data objects, which hold values on nodes or elements of the finite element mesh, and linearPDE objects, which define linear partial differential equations to be solved by the underlying discretization technology. In this paper we will show the basic concepts of escript and will show how escript is used to implement a simulation code for interacting fault systems. We will show some results of large-scale, parallel simulations on an SGI Altix

  19. Extending molecular simulation time scales: Parallel in time integrations for high-level quantum chemistry and complex force representations.

    PubMed

    Bylaska, Eric J; Weare, Jonathan Q; Weare, John H

    2013-08-21

    Parallel in time simulation algorithms are presented and applied to conventional molecular dynamics (MD) and ab initio molecular dynamics (AIMD) models of realistic complexity. Assuming that a forward time integrator, f (e.g., Verlet algorithm), is available to propagate the system from time t_i (trajectory positions and velocities x_i = (r_i, v_i)) to time t_{i+1} (x_{i+1}) by x_{i+1} = f_i(x_i), the dynamics problem spanning an interval from t_0…t_M can be transformed into a root finding problem, F(X) = [x_i - f(x_{i-1})]_{i=1,M} = 0, for the trajectory variables. The root finding problem is solved using a variety of root finding techniques, including quasi-Newton and preconditioned quasi-Newton schemes that are all unconditionally convergent. The algorithms are parallelized by assigning a processor to each time-step entry in the columns of F(X). The relation of this approach to other recently proposed parallel in time methods is discussed, and the effectiveness of various approaches to solving the root finding problem is tested. We demonstrate that more efficient dynamical models based on simplified interactions or coarsening time-steps provide preconditioners for the root finding problem. However, for MD and AIMD simulations, such preconditioners are not required to obtain reasonable convergence and their cost must be considered in the performance of the algorithm. The parallel in time algorithms developed are tested by applying them to MD and AIMD simulations of size and complexity similar to those encountered in present day applications. These include a 1000 Si atom MD simulation using Stillinger-Weber potentials, and a HCl + 4H2O AIMD simulation at the MP2 level. The maximum speedup (serial execution time/parallel execution time) obtained by parallelizing the Stillinger-Weber MD simulation was nearly 3.0. For the AIMD MP2 simulations, the algorithms achieved speedups of up to 14.3. The parallel in time algorithms can be implemented in a
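
    The root finding formulation can be made concrete with a toy problem. The sketch below is an illustration under stated assumptions, not the authors' method: it uses a 1-D harmonic oscillator with a velocity-Verlet step as the integrator f, and a plain fixed-point sweep (every x_i replaced by f applied to the previous iterate's x_{i-1}) in place of the quasi-Newton schemes described above. All names and parameter values are hypothetical.

```python
# Trajectory as the root of F(X) = [x_i - f(x_{i-1})], i = 1..M.
# Within one sweep every entry depends only on the previous iterate, so the
# M integrator calls are independent and could each run on its own processor.

def verlet_step(state, dt=0.1, k=1.0, m=1.0):
    """One velocity-Verlet step for a harmonic oscillator; state = (r, v)."""
    r, v = state
    a = -k * r / m
    r_new = r + v * dt + 0.5 * a * dt * dt
    a_new = -k * r_new / m
    v_new = v + 0.5 * (a + a_new) * dt
    return (r_new, v_new)

def serial_trajectory(x0, n_steps):
    """Reference solution: apply the integrator one step after another."""
    traj = [x0]
    for _ in range(n_steps):
        traj.append(verlet_step(traj[-1]))
    return traj

def residual(traj):
    """Largest component of |x_i - f(x_{i-1})| over all time steps."""
    worst = 0.0
    for i in range(1, len(traj)):
        pred = verlet_step(traj[i - 1])
        worst = max(worst, abs(traj[i][0] - pred[0]), abs(traj[i][1] - pred[1]))
    return worst

def fixed_point_sweep(traj):
    """Update all entries from the previous iterate (the parallelizable part)."""
    return [traj[0]] + [verlet_step(traj[i - 1]) for i in range(1, len(traj))]

def solve_parallel_in_time(x0, n_steps, tol=1e-12):
    """Iterate sweeps until F(X) is numerically zero; returns (trajectory, sweeps)."""
    traj = [x0] * (n_steps + 1)          # crude initial guess: constant trajectory
    sweeps = 0
    while residual(traj) > tol and sweeps < n_steps:
        traj = fixed_point_sweep(traj)
        sweeps += 1
    return traj, sweeps

if __name__ == "__main__":
    x0 = (1.0, 0.0)
    traj, sweeps = solve_parallel_in_time(x0, 20)
    print("sweeps:", sweeps, "residual:", residual(traj))
```

    The plain sweep fixes one additional time step per iteration, so it needs up to M sweeps and gives no speedup by itself. It only shows the structure of F(X); faster convergence is what the quasi-Newton and preconditioned schemes in the abstract are for.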

  20. Extending molecular simulation time scales: Parallel in time integrations for high-level quantum chemistry and complex force representations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bylaska, Eric J.; Weare, Jonathan Q.; Weare, John H.

    2013-08-21

    Parallel in time simulation algorithms are presented and applied to conventional molecular dynamics (MD) and ab initio molecular dynamics (AIMD) models of realistic complexity. Assuming that a forward time integrator, f (e.g., Verlet algorithm), is available to propagate the system from time t_i (trajectory positions and velocities x_i = (r_i, v_i)) to time t_{i+1} (x_{i+1}) by x_{i+1} = f_i(x_i), the dynamics problem spanning an interval from t_0…t_M can be transformed into a root finding problem, F(X) = [x_i - f(x_{i-1})]_{i=1,M} = 0, for the trajectory variables. The root finding problem is solved using a variety of optimization techniques, including quasi-Newton and preconditioned quasi-Newton optimization schemes that are all unconditionally convergent. The algorithms are parallelized by assigning a processor to each time-step entry in the columns of F(X). The relation of this approach to other recently proposed parallel in time methods is discussed and the effectiveness of various approaches to solving the root finding problem is tested. We demonstrate that more efficient dynamical models based on simplified interactions or coarsening time-steps provide preconditioners for the root finding problem. However, for MD and AIMD simulations such preconditioners are not required to obtain reasonable convergence and their cost must be considered in the performance of the algorithm. The parallel in time algorithms developed are tested by applying them to MD and AIMD simulations of size and complexity similar to those encountered in present day applications. These include a 1000 Si atom MD simulation using Stillinger-Weber potentials, and a HCl + 4H2O AIMD simulation at the MP2 level. The maximum speedup obtained by parallelizing the Stillinger-Weber MD simulation was nearly 3.0. For the AIMD MP2 simulations the algorithms achieved speedups of up to 14.3. The parallel in time algorithms can be implemented in a distributed computing environment using very slow TCP/IP networks

  1. A PDE-based methodology for modeling, parameter estimation and feedback control in structural and structural acoustic systems

    NASA Technical Reports Server (NTRS)

    Banks, H. T.; Brown, D. E.; Metcalf, Vern L.; Silcox, R. J.; Smith, Ralph C.; Wang, Yun

    1994-01-01

    A problem of continued interest concerns the control of vibrations in a flexible structure and the related problem of reducing structure-borne noise in structural acoustic systems. In both cases, piezoceramic patches bonded to the structures have been successfully used as control actuators. Through the application of a controlling voltage, the patches can be used to reduce structural vibrations which in turn lead to methods for reducing structure-borne noise. A PDE-based methodology for modeling, estimating physical parameters, and implementing a feedback control scheme for problems of this type is discussed. While the illustrating example is a circular plate, the methodology is sufficiently general so as to be applicable in a variety of structural and structural acoustic systems.

  2. Extending molecular simulation time scales: Parallel in time integrations for high-level quantum chemistry and complex force representations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bylaska, Eric J., E-mail: Eric.Bylaska@pnnl.gov; Weare, Jonathan Q., E-mail: weare@uchicago.edu; Weare, John H., E-mail: jweare@ucsd.edu

    2013-08-21

    Parallel in time simulation algorithms are presented and applied to conventional molecular dynamics (MD) and ab initio molecular dynamics (AIMD) models of realistic complexity. Assuming that a forward time integrator, $f$ (e.g., Verlet algorithm), is available to propagate the system from time $t_i$ (trajectory positions and velocities $x_i = (r_i, v_i)$) to time $t_{i+1}$ ($x_{i+1}$) by $x_{i+1} = f_i(x_i)$, the dynamics problem spanning an interval from $t_0 \ldots t_M$ can be transformed into a root finding problem, $F(X) = [x_i - f(x_{i-1})]_{i=1,M} = 0$, for the trajectory variables. The root finding problem is solved using a variety of root finding techniques, including quasi-Newton and preconditioned quasi-Newton schemes that are all unconditionally convergent. The algorithms are parallelized by assigning a processor to each time-step entry in the columns of F(X). The relation of this approach to other recently proposed parallel in time methods is discussed, and the effectiveness of various approaches to solving the root finding problem is tested. We demonstrate that more efficient dynamical models based on simplified interactions or coarsening time-steps provide preconditioners for the root finding problem. However, for MD and AIMD simulations, such preconditioners are not required to obtain reasonable convergence and their cost must be considered in the performance of the algorithm. The parallel in time algorithms developed are tested by applying them to MD and AIMD simulations of size and complexity similar to those encountered in present day applications. These include a 1000 Si atom MD simulation using Stillinger-Weber potentials, and a HCl + 4H$_2$O AIMD simulation at the MP2 level. The maximum speedup ((serial execution time)/(parallel execution time)) obtained by parallelizing the Stillinger-Weber MD simulation was nearly 3.0. For the AIMD MP2 simulations, the algorithms achieved speedups
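
    The root-finding formulation in this abstract, F(X) = [x_i - f(x_(i-1))] = 0 for i = 1..M, can be illustrated with a small sketch. It is not the authors' quasi-Newton scheme: it swaps in a plain Jacobi-style fixed-point sweep, and the harmonic-oscillator integrator `verlet_step`, the step size, and all function names are assumptions made for illustration. Every row of a sweep depends only on the previous iterate, which is the property that lets one processor be assigned to each time step.

```python
import numpy as np

def verlet_step(x, dt=0.1, k=1.0):
    """One velocity-Verlet step for a unit-mass harmonic oscillator (illustrative f)."""
    r, v = x
    a = -k * r
    r_new = r + v * dt + 0.5 * a * dt * dt
    a_new = -k * r_new
    v_new = v + 0.5 * (a + a_new) * dt
    return np.array([r_new, v_new])

def residual(X, x0, f):
    """F(X)_i = x_i - f(x_{i-1}) for i = 1..M, with the initial state x0 held fixed."""
    prev = np.vstack([x0, X[:-1]])
    return X - np.array([f(p) for p in prev])

def jacobi_sweeps(x0, M, f, tol=1e-12):
    """Solve F(X) = 0 by repeated sweeps X_i <- f(X_{i-1}) over all i at once."""
    X = np.tile(x0, (M, 1))  # initial guess: every time step equals x0
    for sweep in range(1, M + 1):
        prev = np.vstack([x0, X[:-1]])
        # Each row is computed independently of the others, so this loop
        # could be distributed with one time step per processor.
        X = np.array([f(p) for p in prev])
        if np.max(np.abs(residual(X, x0, f))) < tol:
            return X, sweep
    return X, M

x0 = np.array([1.0, 0.0])
M = 8
X, sweeps = jacobi_sweeps(x0, M, verlet_step)
```

    In this sketch the sweep only makes one more time step exact per iteration, so it needs M sweeps and gives no speedup by itself. That limitation is consistent with the abstract's use of quasi-Newton updates and preconditioners to converge in fewer iterations.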

  3. Concurrent simulation of a parallel jaw end effector

    NASA Technical Reports Server (NTRS)

    Bynum, Bill

    1985-01-01

    A system of programs developed to aid in the design and development of the command/response protocol between a parallel jaw end effector and the strategic planner program controlling it is presented. The system executes concurrently with the LISP controlling program to generate a graphical image of the end effector that moves in approximately real time in response to commands sent from the controlling program. Concurrent execution of the simulation program is useful for revealing flaws in the communication command structure arising from the asynchronous nature of the message traffic between the end effector and the strategic planner. Software simulation helps to minimize the number of hardware changes necessary to the microprocessor driving the end effector because of changes in the communication protocol. The simulation of other actuator devices can be easily incorporated into the system of programs by using the underlying support that was developed for the concurrent execution of the simulation process and the communication between it and the controlling program.

  4. Xyce™ Parallel Electronic Simulator Reference Guide, Version 6.5

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Aadithya, Karthik V.; Mei, Ting

    2016-06-01

    This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users’ Guide. The focus of this document is to list, as exhaustively as possible, device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users’ Guide. The information herein is subject to change without notice. Copyright © 2002-2016 Sandia Corporation. All rights reserved.

  5. Existence and Optimality Conditions for Risk-Averse PDE-Constrained Optimization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kouri, Drew Philip; Surowiec, Thomas M.

    Uncertainty is ubiquitous in virtually all engineering applications, and, for such problems, it is inadequate to simulate the underlying physics without quantifying the uncertainty in unknown or random inputs, boundary and initial conditions, and modeling assumptions. In this paper, we introduce a general framework for analyzing risk-averse optimization problems constrained by partial differential equations (PDEs). In particular, we postulate conditions on the random variable objective function as well as the PDE solution that guarantee existence of minimizers. Furthermore, we derive optimality conditions and apply our results to the control of an environmental contaminant. Lastly, we introduce a new risk measure, called the conditional entropic risk, that fuses desirable properties from both the conditional value-at-risk and the entropic risk measures.

  6. Existence and Optimality Conditions for Risk-Averse PDE-Constrained Optimization

    DOE PAGES

    Kouri, Drew Philip; Surowiec, Thomas M.

    2018-06-05

    Uncertainty is ubiquitous in virtually all engineering applications, and, for such problems, it is inadequate to simulate the underlying physics without quantifying the uncertainty in unknown or random inputs, boundary and initial conditions, and modeling assumptions. In this paper, we introduce a general framework for analyzing risk-averse optimization problems constrained by partial differential equations (PDEs). In particular, we postulate conditions on the random variable objective function as well as the PDE solution that guarantee existence of minimizers. Furthermore, we derive optimality conditions and apply our results to the control of an environmental contaminant. Lastly, we introduce a new risk measure, called the conditional entropic risk, that fuses desirable properties from both the conditional value-at-risk and the entropic risk measures.

  7. Modeling and Simulation of High Dimensional Stochastic Multiscale PDE Systems at the Exascale

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kevrekidis, Ioannis

    2017-03-22

    The thrust of the proposal was to exploit modern data-mining tools in a way that will create a systematic, computer-assisted approach to the representation of random media, and also to the representation of the solutions of an array of important physicochemical processes that take place in/on such media. A parsimonious representation/parametrization of the random media links directly (via uncertainty quantification tools) to good sampling of the distribution of random media realizations. It also links directly to modern multiscale computational algorithms (like the equation-free approach that has been developed in our group) and plays a crucial role in accelerating the scientific computation of solutions of nonlinear PDE models (deterministic or stochastic) in such media: both solutions in particular realizations of the random media, and estimation of the statistics of the solutions over multiple realizations (e.g., expectations).

  8. On the Interface of Probabilistic and PDE Methods in a Multifactor Term Structure Theory

    ERIC Educational Resources Information Center

    Mamon, Rogemar S.

    2004-01-01

    Within the general framework of a multifactor term structure model, the fundamental partial differential equation (PDE) satisfied by a default-free zero-coupon bond price is derived via a martingale-oriented approach. Using this PDE, a result characterizing a model belonging to an exponential affine class is established using only a system of…

  9. Xyce parallel electronic simulator reference guide, Version 6.0.1.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R; Mei, Ting; Russo, Thomas V.

    2014-01-01

    This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users Guide [1]. The focus of this document is to list, as exhaustively as possible, device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users Guide [1].

  10. Design of a real-time wind turbine simulator using a custom parallel architecture

    NASA Technical Reports Server (NTRS)

    Hoffman, John A.; Gluck, R.; Sridhar, S.

    1995-01-01

    The design of a new parallel-processing digital simulator is described. The new simulator has been developed specifically for analysis of wind energy systems in real time. The new processor has been named the Wind Energy System Time-domain simulator, version 3 (WEST-3). Like previous WEST versions, WEST-3 performs many computations in parallel. The modules in WEST-3 are pure digital processors, however. These digital processors can be programmed individually and operated in concert to achieve real-time simulation of wind turbine systems. Because of this programmability, WEST-3 is much more flexible and general than its two predecessors. The design features of WEST-3 are described to show how the system produces high-speed solutions of nonlinear time-domain equations. WEST-3 has two very fast Computational Units (CU's) that use minicomputer technology plus special architectural features that make them many times faster than a microcomputer. These CU's are needed to perform the complex computations associated with the wind turbine rotor system in real time. The parallel architecture of the CU causes several tasks to be done in each cycle, including an IO operation and the combination of a multiply, add, and store. The WEST-3 simulator can be expanded at any time for additional computational power. This is possible because the CU's are interfaced to each other and to other portions of the simulation using special serial buses. These buses can be 'patched' together in essentially any configuration (in a manner very similar to the programming methods used in analog computation) to balance the input/output requirements. CU's can be added in any number to share a given computational load. This flexible bus feature is very different from many other parallel processors, which usually have a throughput limit because of rigid bus architecture.

  11. Effect of phosphodiesterase 4 (PDE4) inhibitors on eotaxin expression in human bronchial epithelial cells.

    PubMed

    Paplinska, M; Chazan, R; Grubek-Jaworska, H

    2011-06-01

    An increasing number of eosinophils in the bronchoalveolar space is observed during noninfectious inflammatory lung diseases. Eotaxins (eotaxin-1/CCL11, eotaxin-2/CCL24, eotaxin-3/CCL26) are the strongest chemotactic agents for eosinophils. Inhibitors of phosphodiesterase 4 (PDE4), the enzyme decomposing cAMP, are anti-inflammatory agents which act through cAMP elevation and inhibit numerous steps of allergic inflammation. The effect of PDE4 inhibitors on eotaxin expression is not known in detail. The aim of our study was to evaluate the influence of the PDE4 inhibitors rolipram and RO-20-1724 on expression of eotaxins in the bronchial epithelial cell line BEAS-2B. Cells were preincubated with PDE4 inhibitors or dexamethasone for 1 hour and then stimulated with IL-4 or IL-13 alone or in combination with TNF-α. After 48 hours, eotaxin protein level was measured by ELISA and mRNA level by real-time PCR. PDE4 inhibitors decreased CCL11 and CCL26 expression only in cultures co-stimulated with TNF-α. In cultures stimulated with IL-4 and TNF-α, rolipram and RO-20-1724 diminished CCL11 mRNA expression by 34 and 37%, respectively, and CCL26 by 43 and 47%. In cultures stimulated with IL-13 and TNF-α, rolipram and RO-20-1724 decreased expression of both eotaxins by about 50%. These results were confirmed at the protein level. The effect of PDE4 inhibitors on eotaxin expression in BEAS-2B cells, in our experimental conditions, depends on TNF-α contribution.

  12. How Schools and Students Respond to School Improvement Programs: The Case of Brazil's PDE

    ERIC Educational Resources Information Center

    Carnoy, Martin; Gove, Amber K.; Loeb, Susanna; Marshall, Jeffrey H.; Socias, Miguel

    2008-01-01

    This study uses rich empirical data from Brazil to assess how a government program (PDE) that decentralizes school management decisions changes what goes on in schools and how these changes affect student outcomes. It appears that the PDE resulted in some improvements in management and learning materials, but little change in other areas including…

  13. Parallel distributed, reciprocal Monte Carlo radiation in coupled, large eddy combustion simulations

    NASA Astrophysics Data System (ADS)

    Hunsaker, Isaac L.

    Radiation is the dominant mode of heat transfer in high temperature combustion environments. Radiative heat transfer affects the gas and particle phases, including all the associated combustion chemistry. The radiative properties are in turn affected by the turbulent flow field. This bi-directional coupling of radiation-turbulence interactions poses a major challenge in creating parallel-capable, high-fidelity combustion simulations. In this work, a new model was developed in which reciprocal Monte Carlo radiation was coupled with a turbulent, large-eddy simulation combustion model. A technique wherein domain patches are stitched together was implemented to allow for scalable parallelism. The combustion model runs in parallel on a decomposed domain. The radiation model runs in parallel on a recomposed domain. The recomposed domain is stored on each processor after information sharing of the decomposed domain is handled via the message passing interface. Verification and validation testing of the new radiation model were favorable. Strong scaling analyses were performed on the Ember cluster and the Titan cluster for the CPU-radiation model and GPU-radiation model, respectively. The model demonstrated strong scaling to over 1,700 and 16,000 processing cores on Ember and Titan, respectively.

  14. Automatic Generation of Directive-Based Parallel Programs for Shared Memory Parallel Systems

    NASA Technical Reports Server (NTRS)

    Jin, Hao-Qiang; Yan, Jerry; Frumkin, Michael

    2000-01-01

    The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. As great progress was made in hardware and software technologies, performance of parallel programs with compiler directives has demonstrated large improvement. The introduction of OpenMP directives, the industrial standard for shared-memory programming, has minimized the issue of portability. Due to its ease of programming and its good performance, the technique has become very popular. In this study, we have extended CAPTools, a computer-aided parallelization toolkit, to automatically generate directive-based, OpenMP, parallel programs. We outline techniques used in the implementation of the tool and present test results on the NAS parallel benchmarks and ARC3D, a CFD application. This work demonstrates the great potential of using computer-aided tools to quickly port parallel programs and also achieve good performance.

  15. Current drug therapy of patients with BPH-LUTS with the special emphasis on PDE5 inhibitors

    PubMed Central

    Govorov, Alexander; Kasyan, George; Priymak, Diana; Pushkar, Dmitry

    2016-01-01

    Introduction: Benign prostatic hyperplasia (BPH) is the most common cause of lower urinary tract symptom (LUTS) development in men [1]. The intensity of the symptoms may vary from mild to severe, significantly affecting the quality of life. Erectile dysfunction (ED) is one of the most challenging issues in modern urology that significantly influences the quality of life in men worldwide. The objective of this literature review was to analyze the current drug therapies of patients with BPH-LUTS, with the special emphasis on PDE5 inhibitors. Material and methods: The authors searched the literature for the period from 2000 until 2015 in MEDLINE and PubMed. Results: Twenty-three articles were selected based on their reliability. A detailed analysis of the selected papers was performed. Primary attention was given to articles describing the use of PDE5. Works describing the use of different groups of drugs in patients with BPH-LUTS were also selected. Conclusions: The current literature analysis suggests that the introduction of PDE5 inhibitors in clinical practice for the treatment of patients with BPH-LUTS will allow for significant expansion of the therapeutic options for the treatment of this disease. PMID:28127458

  16. Current drug therapy of patients with BPH-LUTS with the special emphasis on PDE5 inhibitors.

    PubMed

    Kolontarev, Konstantin; Govorov, Alexander; Kasyan, George; Priymak, Diana; Pushkar, Dmitry

    2016-01-01

    Benign prostatic hyperplasia (BPH) is the most common cause of lower urinary tract symptom (LUTS) development in men [1]. The intensity of the symptoms may vary from mild to severe, significantly affecting the quality of life. Erectile dysfunction (ED) is one of the most challenging issues in modern urology that significantly influences the quality of life in men worldwide. The objective of this literature review was to analyze the current drug therapies of patients with BPH-LUTS, with the special emphasis on PDE5 inhibitors. The authors searched the literature for the period from 2000 until 2015 in MEDLINE and PubMed. Twenty-three articles were selected based on their reliability. A detailed analysis of the selected papers was performed. Primary attention was given to articles describing the use of PDE5. Works describing the use of different groups of drugs in patients with BPH-LUTS were also selected. The current literature analysis suggests that the introduction of PDE5 inhibitors in clinical practice for the treatment of patients with BPH-LUTS will allow for significant expansion of the therapeutic options for the treatment of this disease.

  17. Cyanidin-3-glucoside suppresses B[a]PDE-induced cyclooxygenase-2 expression by directly inhibiting Fyn kinase activity.

    PubMed

    Lim, Tae-Gyu; Kwon, Jung Yeon; Kim, Jiyoung; Song, Nu Ry; Lee, Kyung Mi; Heo, Yong-Seok; Lee, Hyong Joo; Lee, Ki Won

    2011-07-15

    Benzo[a]pyrene-7,8-diol-9,10-epoxide (B[a]PDE) is a well-known carcinogen that is associated with skin cancer. Abnormal expression of cyclooxygenase-2 (COX-2) is an important mediator in inflammation and tumor promotion. We investigated the inhibitory effect of cyanidin-3-glucoside (C3G), an anthocyanin present in fruits, on B[a]PDE-induced COX-2 expression in mouse epidermal JB6 P+ cells. Pretreatment with C3G resulted in the reduction of B[a]PDE-induced expression of COX-2 and COX-2 promoter activity. The activation of activator protein-1 (AP-1) and nuclear factor-κB (NF-κB) induced by B[a]PDE was also attenuated by C3G. C3G attenuated the B[a]PDE-induced phosphorylation of MEK, MKK4, Akt, and mitogen-activated protein kinases (MAPKs), but had no effect on the phosphorylation of the upstream MAPK regulator Fyn. However, kinase assays demonstrated that C3G suppressed Fyn kinase activity and that C3G directly binds Fyn kinase noncompetitively with ATP. By using PP2, a pharmacological inhibitor of SFKs, we showed that Fyn kinase regulates B[a]PDE-induced COX-2 expression by activating MAPKs, AP-1 and NF-κB. These results suggest that C3G suppresses B[a]PDE-induced COX-2 expression mainly by blocking the activation of the Fyn signaling pathway, which may contribute to its chemopreventive potential. Copyright © 2011 Elsevier Inc. All rights reserved.

  18. Accelerating the Gillespie Exact Stochastic Simulation Algorithm using hybrid parallel execution on graphics processing units.

    PubMed

    Komarov, Ivan; D'Souza, Roshan M

    2012-01-01

    The Gillespie Stochastic Simulation Algorithm (GSSA) and its variants are cornerstone techniques to simulate reaction kinetics in situations where the concentration of the reactant is too low to allow deterministic techniques such as differential equations. The inherent limitations of the GSSA include the time required for executing a single run and the need for multiple runs for parameter sweep exercises due to the stochastic nature of the simulation. Even very efficient variants of GSSA are prohibitively expensive to compute and perform parameter sweeps. Here we present a novel variant of the exact GSSA that is amenable to acceleration by using graphics processing units (GPUs). We parallelize the execution of a single realization across threads in a warp (fine-grained parallelism). A warp is a collection of threads that are executed synchronously on a single multi-processor. Warps executing in parallel on different multi-processors (coarse-grained parallelism) simultaneously generate multiple trajectories. Novel data-structures and algorithms reduce memory traffic, which is the bottleneck in computing the GSSA. Our benchmarks show an 8×-120× performance gain over various state-of-the-art serial algorithms when simulating different types of models.
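
    For orientation, the serial baseline that such GPU variants accelerate can be sketched as follows. This is a minimal sketch of the classic direct-method Gillespie algorithm, not the hybrid GPU algorithm of the paper; the function name `gillespie_direct`, its parameters, and the pure-decay example are assumptions chosen for illustration.

```python
import math
import random

def gillespie_direct(x, stoich, propensity, t_end, rng):
    """Serial direct-method SSA.

    x          -- initial molecule counts, one entry per species
    stoich     -- stoich[j][s] is the change of species s when reaction j fires
    propensity -- function mapping the state to a list of reaction propensities
    """
    t = 0.0
    x = list(x)
    traj = [(t, tuple(x))]
    while True:
        a = propensity(x)
        a0 = sum(a)
        if a0 <= 0.0:  # no reaction can fire any more
            break
        # Exponentially distributed waiting time with rate a0.
        t += -math.log(1.0 - rng.random()) / a0
        if t > t_end:
            break
        # Pick reaction j with probability a[j] / a0.
        r = rng.random() * a0
        acc = 0.0
        for j, aj in enumerate(a):
            acc += aj
            if r < acc:
                break
        for s, d in enumerate(stoich[j]):
            x[s] += d
        traj.append((t, tuple(x)))
    return traj

# Illustrative model: pure decay A -> 0 with rate constant 0.5 per molecule.
traj = gillespie_direct([20], [[-1]], lambda x: [0.5 * x[0]], 1000.0, random.Random(0))
```

    Each call produces one trajectory, so a parameter sweep repeats this loop many times. That repetition is the cost that the coarse-grained (one trajectory per warp) parallelism described in the abstract targets.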

  19. A more secure parallel keyed hash function based on chaotic neural network

    NASA Astrophysics Data System (ADS)

    Huang, Zhongquan

    2011-08-01

    Although various hash functions based on chaos or chaotic neural networks have been proposed, most of them cannot work efficiently in a parallel computing environment. Recently, an algorithm for parallel keyed hash function construction based on a chaotic neural network was proposed [13]. However, this scheme has a strict limitation: its secret keys must be nonces. In other words, if a key is used more than once in this scheme, potential security flaws arise. In this paper, we analyze the cause of the vulnerability of the original scheme in detail and then propose corresponding enhancement measures, which remove the limitation on the secret keys. Theoretical analysis and computer simulation indicate that the modified hash function is more secure and practical than the original one. At the same time, it retains the parallel merit and satisfies the other performance requirements of a hash function, such as good statistical properties, high message and key sensitivity, and strong collision resistance.

  20. Performance evaluation of GPU parallelization, space-time adaptive algorithms, and their combination for simulating cardiac electrophysiology.

    PubMed

    Sachetto Oliveira, Rafael; Martins Rocha, Bernardo; Burgarelli, Denise; Meira, Wagner; Constantinides, Christakis; Weber Dos Santos, Rodrigo

    2018-02-01

    The use of computer models as a tool for the study and understanding of the complex phenomena of cardiac electrophysiology has attained increased importance. At the same time, the increased complexity of the biophysical processes translates into complex computational and mathematical models. To speed up cardiac simulations and to allow more precise and realistic uses, 2 different techniques have been traditionally exploited: parallel computing and sophisticated numerical methods. In this work, we combine a modern parallel computing technique based on multicore and graphics processing units (GPUs) and a sophisticated numerical method based on a new space-time adaptive algorithm. We evaluate each technique alone and in different combinations: multicore and GPU; multicore and GPU and space adaptivity; multicore and GPU and space adaptivity and time adaptivity. All the techniques and combinations were evaluated under different scenarios: 3D simulations on slabs, 3D simulations on a ventricular mouse mesh (i.e., complex geometry), sinus-rhythm, and arrhythmic conditions. Our results suggest that multicore and GPU accelerate the simulations by an approximate factor of 33×, whereas the speedups attained by the space-time adaptive algorithms were approximately 48×. Nevertheless, by combining all the techniques, we obtained speedups that ranged between 165× and 498×. The tested methods were able to reduce the execution time of a simulation by more than 498× for a complex cellular model in a slab geometry and by 165× in a realistic heart geometry simulating spiral waves. The proposed methods will allow faster and more realistic simulations in a feasible time with no significant loss of accuracy. Copyright © 2017 John Wiley & Sons, Ltd.

  1. Efficient Parallel Algorithm For Direct Numerical Simulation of Turbulent Flows

    NASA Technical Reports Server (NTRS)

    Moitra, Stuti; Gatski, Thomas B.

    1997-01-01

    A distributed algorithm for a high-order-accurate finite-difference approach to the direct numerical simulation (DNS) of transition and turbulence in compressible flows is described. This work has two major objectives. The first objective is to demonstrate that parallel and distributed-memory machines can be successfully and efficiently used to solve computationally intensive and input/output intensive algorithms of the DNS class. The second objective is to show that the computational complexity involved in solving the tridiagonal systems inherent in the DNS algorithm can be reduced by algorithm innovations that obviate the need to use a parallelized tridiagonal solver.

  2. [Anti-B[a]PDE-DNA formation in lymphomonocytes of humans environmentally exposed to polycyclic aromatic hydrocarbons].

    PubMed

    Pavanello, S; Pulliero, A; Lai, A; Gaiardo, A; Mastrangelo, G; Clonfero, E

    2005-01-01

    We are currently evaluating anti-benzo[a]pyrenediolepoxide-(B[a]PDE)-DNA adduct levels in lymphomonocytes of humans exposed to polycyclic aromatic hydrocarbons (PAHs) to validate this indicator of biologically effective dose in a surrogate tissue. The study protocol (October 2002-June 2005) involves: (a) a signed informed consent by each participant; (b) recruitment of 600 Padua municipal workers during visits at our outpatient clinic; (c) administration of a questionnaire regarding non-occupational sources of PAH (B[a]P) exposure; (d) collection of blood (15 ml) and urine (200 ml) samples. Anti-B[a]PDE-DNA adduct levels in lymphomonocytes are detected by HPLC-fluorescence analysis. To date, 438 subjects have been examined (age range 20-62 years; 52% males). We found that: (i) anti-B[a]PDE-DNA adduct levels are significantly lower than those we previously found in coke-oven workers (N=95) occupationally exposed to high levels of PAHs (1.51 ± 2.68 versus 4.07 ± 3.78 anti-B[a]PDE-adduct/10^8 nucleotides, p < 0.001; 37% versus 97% positive subjects with ≥1 adduct/10^8 nucleotides; p < 0.001); (ii) smokers (23%) have significantly higher adduct levels than non-smokers (p < 0.001); (iii) non-smokers who consume PAH-rich meals ≥52 times/year (142 subjects, 42%) have significantly higher adduct levels than those who consume them <52 times/year (p < 0.01). Dietary and smoking habits did not influence the occupationally-induced adduct levels in coke-oven workers. This is the first study that examines anti-B[a]PDE-DNA adduct levels in a large cohort, showing that anti-B[a]PDE-DNA adducts can be detected in humans environmentally exposed to low doses of PAH (B[a]P) and are modulated by smoking and dietary habits.

  3. Voxel based parallel post processor for void nucleation and growth analysis of atomistic simulations of material fracture.

    PubMed

    Hemani, H; Warrier, M; Sakthivel, N; Chaturvedi, S

    2014-05-01

    Molecular dynamics (MD) simulations are used in the study of void nucleation and growth in crystals that are subjected to tensile deformation. These simulations typically run for several hundred thousand time steps, depending on the problem. We output the atom positions at a required frequency for post processing to determine the void nucleation, growth and coalescence due to tensile deformation. The simulation volume is broken up into voxels of size equal to the unit cell size of the crystal. In this paper, we present the algorithm to identify the empty unit cells (voids), their connections (void size) and dynamic changes (growth and coalescence of voids) for MD simulations of large atomic systems (multi-million atoms). We discuss the parallel algorithms that were implemented and discuss their relative applicability in terms of their speedup and scalability. We also present the results on scalability of our algorithm when it is incorporated into the MD software LAMMPS. Copyright © 2014 Elsevier Inc. All rights reserved.
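
    The voxel idea described here (bin atoms into unit-cell-sized voxels, treat empty voxels as void, and join neighbouring empty voxels to measure void size) can be sketched serially. This is an illustrative reconstruction, not the authors' parallel post processor: the function name `void_sizes`, the non-periodic box, and the face-neighbour (6-connected) rule are all assumptions.

```python
from collections import deque
import numpy as np

# Face neighbours only (6-connectivity); an assumption of this sketch.
NEIGHBORS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]

def void_sizes(positions, shape, cell):
    """Return the sizes (in voxels) of connected empty regions, largest first."""
    occupied = np.zeros(shape, dtype=bool)
    idx = np.floor(np.asarray(positions, dtype=float).reshape(-1, 3) / cell).astype(int)
    idx = np.clip(idx, 0, np.array(shape) - 1)  # keep atoms on the box edge inside the grid
    occupied[idx[:, 0], idx[:, 1], idx[:, 2]] = True

    seen = occupied.copy()  # occupied voxels never need visiting
    sizes = []
    for start in zip(*np.nonzero(~occupied)):
        if seen[start]:
            continue
        # Breadth-first flood fill over connected empty voxels.
        seen[start] = True
        queue = deque([start])
        size = 0
        while queue:
            i, j, k = queue.popleft()
            size += 1
            for di, dj, dk in NEIGHBORS:
                n = (i + di, j + dj, k + dk)
                if all(0 <= c < s for c, s in zip(n, shape)) and not seen[n]:
                    seen[n] = True
                    queue.append(n)
        sizes.append(size)
    return sorted(sizes, reverse=True)

# 3x3x3 grid with one atom per voxel, except three voxels left empty.
empty = {(0, 0, 0), (1, 0, 0), (2, 2, 2)}
atoms = [(i + 0.5, j + 0.5, k + 0.5)
         for i in range(3) for j in range(3) for k in range(3)
         if (i, j, k) not in empty]
sizes = void_sizes(atoms, (3, 3, 3), 1.0)
```

    Running the same labelling on successive output frames and comparing the size lists is one simple way to follow growth and coalescence over time.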

  4. Parallel collisionless shocks forming in simulations of the LAPD experiment

    NASA Astrophysics Data System (ADS)

    Weidl, Martin S.; Jenko, Frank; Niemann, Chris; Winske, Dan

    2016-10-01

    Research on parallel collisionless shocks, most prominently occurring in the Earth's bow shock region, has so far been limited to satellite measurements and simulations. However, the formation of collisionless shocks depends on a wide range of parameters and scales, which can be accessed more easily in a laboratory experiment. Using a kJ-class laser, an ongoing experimental campaign at the Large Plasma Device (LAPD) at UCLA is expected to produce the first laboratory measurements of the formation of a parallel collisionless shock. We present hybrid kinetic/MHD simulations that show how beam instabilities in the background plasma can be driven by ablating carbon ions from a target, causing non-linear density oscillations which develop into a propagating shock front. The free-streaming carbon ions can excite both the resonant right-hand instability and the non-resonant firehose mode. We analyze their respective roles and discuss optimizing their growth rates to speed up the process of shock formation.

  5. Template based parallel checkpointing in a massively parallel computer system

    DOEpatents

    Archer, Charles Jens [Rochester, MN; Inglett, Todd Alan [Rochester, MN

    2009-01-13

    A method and apparatus for a template based parallel checkpoint save for a massively parallel super computer system using a parallel variation of the rsync protocol, and network broadcast. In preferred embodiments, the checkpoint data for each node is compared to a template checkpoint file that resides in the storage and that was previously produced. Embodiments herein greatly decrease the amount of data that must be transmitted and stored for faster checkpointing and increased efficiency of the computer system. Embodiments are directed to a parallel computer system with nodes arranged in a cluster with a high speed interconnect that can perform broadcast communication. The checkpoint contains a set of actual small data blocks with their corresponding checksums from all nodes in the system. The data blocks may be compressed using conventional non-lossy data compression algorithms to further reduce the overall checkpoint size.
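
    The core idea of comparing a node's checkpoint data against a previously produced template, and keeping only the blocks whose checksums differ, can be sketched as below. This is a simplified single-process illustration, not the patented method: it uses fixed-size blocks and SHA-256 rather than an rsync-style protocol, omits broadcast and compression, and assumes the data and the template have the same length. All function names are hypothetical.

```python
import hashlib

def block_checksums(data, block_size):
    """Checksum of every fixed-size block of a byte string."""
    return [hashlib.sha256(data[i:i + block_size]).digest()
            for i in range(0, len(data), block_size)]

def delta_against_template(data, template_sums, block_size):
    """Map block index -> block bytes for blocks that differ from the template."""
    delta = {}
    for n, i in enumerate(range(0, len(data), block_size)):
        block = data[i:i + block_size]
        if hashlib.sha256(block).digest() != template_sums[n]:
            delta[n] = block
    return delta

def restore(template, delta, block_size):
    """Rebuild the node's data from the template plus its stored delta blocks."""
    blocks = [template[i:i + block_size] for i in range(0, len(template), block_size)]
    for n, block in delta.items():
        blocks[n] = block
    return b"".join(blocks)

template = bytes(range(64))
node_data = bytearray(template)
node_data[20] = 255  # one byte differs, inside block 1 (bytes 16..31)
node_data = bytes(node_data)

sums = block_checksums(template, 16)
delta = delta_against_template(node_data, sums, 16)
restored = restore(template, delta, 16)
```

    In this example only one of the four blocks has to be stored for the node, which illustrates how a shared template can reduce the amount of checkpoint data transmitted and stored.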

  6. Empirical study of parallel LRU simulation algorithms

    NASA Technical Reports Server (NTRS)

    Carr, Eric; Nicol, David M.

    1994-01-01

    This paper reports on the performance of five parallel algorithms for simulating a fully associative cache operating under the LRU (Least-Recently-Used) replacement policy. Three of the algorithms are SIMD, and are implemented on the MasPar MP-2 architecture. Two other algorithms are parallelizations of an efficient serial algorithm on the Intel Paragon. One SIMD algorithm is quite simple, but its cost is linear in the cache size. The two other SIMD algorithms are more complex, but have costs that are independent of the cache size. Both the second and third SIMD algorithms compute all stack distances; the second SIMD algorithm is completely general, whereas the third SIMD algorithm presumes and takes advantage of bounds on the range of reference tags. Both MIMD algorithms implemented on the Paragon are general and compute all stack distances; they differ in one step that may affect their respective scalability. We assess the strengths and weaknesses of these algorithms as a function of problem size and characteristics, and compare their performance on traces derived from execution of three SPEC benchmark programs.
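
    The notion of stack distance used in this abstract can be illustrated with a short serial sketch. It is a textbook list-based version, not one of the five parallel algorithms evaluated in the paper; the names `stack_distances` and `lru_hits` are assumptions. A reference hits in a fully associative LRU cache of size C exactly when its stack distance is at most C, so one pass over the trace yields hit counts for every cache size.

```python
def stack_distances(trace):
    """LRU stack distance of each reference; None marks a first-time (cold) reference."""
    stack = []  # most recently used tag at index 0
    distances = []
    for tag in trace:
        if tag in stack:
            d = stack.index(tag) + 1  # 1-based depth in the LRU stack
            stack.remove(tag)
        else:
            d = None
        stack.insert(0, tag)
        distances.append(d)
    return distances

def lru_hits(trace, cache_size):
    """Number of hits in a fully associative LRU cache of the given size."""
    return sum(1 for d in stack_distances(trace)
               if d is not None and d <= cache_size)

trace = ["a", "b", "c", "a", "b", "d", "a"]
distances = stack_distances(trace)
```

    The list operations make this sketch linear in the stack depth per reference, which is the kind of cost that the more elaborate algorithms in the paper are designed to avoid.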

  7. Repartitioning Strategies for Massively Parallel Simulation of Reacting Flow

    NASA Astrophysics Data System (ADS)

    Pisciuneri, Patrick; Zheng, Angen; Givi, Peyman; Labrinidis, Alexandros; Chrysanthis, Panos

    2015-11-01

    The majority of parallel CFD simulators partition the domain into equal regions and assign the calculations for a particular region to a unique processor. This type of domain decomposition is vital to the efficiency of the solver. However, as the simulation develops, the workload among the partitions often becomes uneven (e.g. by adaptive mesh refinement, or chemically reacting regions) and a new partition should be considered. The process of repartitioning adjusts the current partition to evenly distribute the load again. We compare two repartitioning tools: Zoltan, an architecture-agnostic graph repartitioner developed at the Sandia National Laboratories; and Paragon, an architecture-aware graph repartitioner developed at the University of Pittsburgh. The comparative assessment is conducted via simulation of the Taylor-Green vortex flow with chemical reaction.

  8. CHBPR: Decreased cGMP level contributes to increased contraction in arteries from hypertensive rats: role of PDE1

    PubMed Central

    Giachini, Fernanda R.; Lima, Victor V.; Carneiro, Fernando S.; Tostes, Rita C.; Webb, R. Clinton

    2011-01-01

    Recent evidence suggests that angiotensin II (Ang II) upregulates phosphodiesterase (PDE)-1A expression. We hypothesized that Ang II augmented PDE1 activation, decreasing the bioavailability of cyclic guanosine 3', 5'-monophosphate (cGMP), contributing to increased vascular contractility. Male Sprague-Dawley rats received mini-osmotic pumps with Ang II (60 ng.min−1) or saline for 14 days. PE-induced contractions were increased in aorta (Emax168±8 vs. 136±4%) and small-mesenteric arteries [(SMA), Emax170±6 vs. 143±3%] from Ang II infused rats compared to control. PDE1 inhibition with vinpocetine (10µM) reduced PE-induced contraction in aortas from Ang II rats (Emax94±12%) but not in control (154±7%). Vinpocetine decreased the sensitivity to PE in SMA from Ang II rats compared to vehicle (pD2 5.1±0.1 vs. 5.9±0.06), but not in control (6.0±0.03 vs. 6.1±0.04). Sildenafil (10µM), a PDE5 inhibitor, reduced PE-induced maximal contraction similarly in Ang II and control rats. Arteries were contracted with PE (1µM) and concentration-dependent relaxation to vinpocetine and sildenafil was evaluated. Aortas from Ang II rats displayed increased relaxation to vinpocetine compared to control (Emax82±12 vs. 44±5%). SMA from Ang II rats showed greater sensitivity during vinpocetine-induced relaxation, compared to control (pD2 6.1±0.3 vs. 5.3±0.1). No differences in sildenafil-induced relaxation were observed. PDE1A and PDE1C expressions in aorta and PDE1A expression in SMA were increased in Ang II rats. cGMP production, which is decreased in arteries from Ang II rats, was restored after PDE1 blockade. We conclude that PDE1 activation reduces cGMP bioavailability in arteries from Ang II rats, contributing to increased contractile responsiveness. PMID:21282562

  9. A parallel decision tree-based method for user authentication based on keystroke patterns.

    PubMed

    Sheng, Yong; Phoha, Vir V; Rovnyak, Steven M

    2005-08-01

    We propose a Monte Carlo approach to attain sufficient training data, a splitting method to improve effectiveness, and a system composed of parallel decision trees (DTs) to authenticate users based on keystroke patterns. For each user, approximately 19 times as much simulated data was generated to complement the 387 vectors of raw data. The training set, including raw and simulated data, is split into four subsets. For each subset, wavelet transforms are performed to obtain a total of eight training subsets for each user. Eight DTs are thus trained using the eight subsets. A parallel DT is constructed for each user, which contains all eight DTs with a criterion for its output that it authenticates the user if at least three DTs do so; otherwise it rejects the user. Training and testing data were collected from 43 users who typed the exact same string of length 37 nine consecutive times to provide data for training purposes. The users typed the same string at various times over a period from November through December 2002 to provide test data. The average false reject rate was 9.62% and the average false accept rate was 0.88%.
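
    The acceptance criterion described above (authenticate if at least three of the eight trees do so) can be sketched in Python. The trees here are stand-in callables that return True or False for a feature vector; the function name and the callable interface are assumptions for illustration, not the authors' implementation.

```python
def parallel_dt_accepts(trees, sample, min_votes=3):
    """Accept the user when at least min_votes trees authenticate the sample."""
    votes = sum(1 for tree in trees if tree(sample))
    return votes >= min_votes
```

    A low vote threshold relative to the number of trees tends to favor fewer false rejections at the cost of more false acceptances; the abstract reports both rates for the threshold of three.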

  10. GRADSPMHD: A parallel MHD code based on the SPH formalism

    NASA Astrophysics Data System (ADS)

    Vanaverbeke, S.; Keppens, R.; Poedts, S.

    2014-03-01

    We present GRADSPMHD, a completely Lagrangian parallel magnetohydrodynamics code based on the SPH formalism. The implementation of the equations of SPMHD in the “GRAD-h” formalism assembles known results, including the derivation of the discretized MHD equations from a variational principle, the inclusion of time-dependent artificial viscosity, resistivity and conductivity terms, as well as the inclusion of a mixed hyperbolic/parabolic correction scheme for satisfying the ∇·B = 0 constraint on the magnetic field. The code uses a tree-based formalism for neighbor finding and can optionally use the tree code for computing the self-gravity of the plasma. The structure of the code closely follows the framework of our parallel GRADSPH FORTRAN 90 code which we added previously to the CPC program library. We demonstrate the capabilities of GRADSPMHD by running 1, 2, and 3 dimensional standard benchmark tests and we find good agreement with previous work done by other researchers. The code is also applied to the problem of simulating the magnetorotational instability in 2.5D shearing box tests as well as in global simulations of magnetized accretion disks. We find good agreement with available results on this subject in the literature. Finally, we discuss the performance of the code on a parallel supercomputer with distributed memory architecture. Catalogue identifier: AERP_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AERP_v1_0.html Program obtainable from: CPC Program Library, Queen’s University, Belfast, N. Ireland Licensing provisions: Standard CPC licence, http://cpc.cs.qub.ac.uk/licence/licence.html No. of lines in distributed program, including test data, etc.: 620503 No. of bytes in distributed program, including test data, etc.: 19837671 Distribution format: tar.gz Programming language: FORTRAN 90/MPI. Computer: HPC cluster. Operating system: Unix. Has the code been vectorized or parallelized?: Yes, parallelized using MPI. RAM: ~30 MB for a

  11. Acceleration of the matrix multiplication of Radiance three phase daylighting simulations with parallel computing on heterogeneous hardware of personal computer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zuo, Wangda; McNeil, Andrew; Wetter, Michael

    2013-05-23

    Building designers are increasingly relying on complex fenestration systems to reduce energy consumed for lighting and HVAC in low energy buildings. Radiance, a lighting simulation program, has been used to conduct daylighting simulations for complex fenestration systems. Depending on the configurations, the simulation can take hours or even days using a personal computer. This paper describes how to accelerate the matrix multiplication portion of a Radiance three-phase daylight simulation by conducting parallel computing on heterogeneous hardware of a personal computer. The algorithm was optimized and the computational part was implemented in parallel using OpenCL. The speed of the new approach was evaluated using various daylighting simulation cases on a multicore central processing unit and a graphics processing unit. Based on the measurements and analysis of the time usage for the Radiance daylighting simulation, further speedups can be achieved by using fast I/O devices and storing the data in a binary format.

  12. A PDE approach for quantifying and visualizing tumor progression and regression

    NASA Astrophysics Data System (ADS)

    Sintay, Benjamin J.; Bourland, J. Daniel

    2009-02-01

    Quantification of changes in tumor shape and size allows physicians the ability to determine the effectiveness of various treatment options, adapt treatment, predict outcome, and map potential problem sites. Conventional methods are often based on metrics such as volume, diameter, or maximum cross sectional area. This work seeks to improve the visualization and analysis of tumor changes by simultaneously analyzing changes in the entire tumor volume. This method utilizes an elliptic partial differential equation (PDE) to provide a roadmap of boundary displacement that does not suffer from the discontinuities associated with other measures such as Euclidean distance. Streamline pathways defined by Laplace's equation (a commonly used PDE) are used to track tumor progression and regression at the tumor boundary. Laplace's equation is particularly useful because it provides a smooth, continuous solution that can be evaluated with sub-pixel precision on variable grid sizes. Several metrics are demonstrated including maximum, average, and total regression and progression. This method provides many advantages over conventional means of quantifying change in tumor shape because it is observer independent, stable for highly unusual geometries, and provides an analysis of the entire three-dimensional tumor volume.
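
    To illustrate why Laplace's equation gives a smooth field between boundaries, the following Python sketch solves it on a small 2D grid by Jacobi relaxation. It is only a toy example: the grid size, the boundary values, the iteration count, and the function names are assumptions, and the abstract does not describe the authors' actual discretization or boundary setup. In an application like the one described, the fixed cells would presumably mark the two tumor boundaries, and streamlines would follow the gradient of the solved field.

```python
def solve_laplace(grid, fixed, iterations=500):
    """Jacobi relaxation: each free cell becomes the mean of its four neighbours."""
    rows, cols = len(grid), len(grid[0])
    for _ in range(iterations):
        new = [row[:] for row in grid]
        for r in range(1, rows - 1):
            for c in range(1, cols - 1):
                if not fixed[r][c]:
                    new[r][c] = 0.25 * (grid[r - 1][c] + grid[r + 1][c]
                                        + grid[r][c - 1] + grid[r][c + 1])
        grid = new
    return grid


def demo(n=5):
    """Fix the outer edge to a linear ramp in x and relax the interior."""
    def on_edge(r, c):
        return r in (0, n - 1) or c in (0, n - 1)

    grid = [[c / (n - 1) if on_edge(r, c) else 0.0 for c in range(n)]
            for r in range(n)]
    fixed = [[on_edge(r, c) for c in range(n)] for r in range(n)]
    return solve_laplace(grid, fixed)
```

    With the linear ramp on the boundary, the interior relaxes toward the same linear ramp, which is the exact harmonic solution for that boundary data.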

  13. Student's Lab Assignments in PDE Course with MAPLE.

    ERIC Educational Resources Information Center

    Ponidi, B. Alhadi

    Computer-aided software has been used intensively in many mathematics courses, especially in computational subjects, to solve initial value and boundary value problems in Partial Differential Equations (PDE). Many software packages were used in student lab assignments such as FORTRAN, PASCAL, MATLAB, MATHEMATICA, and MAPLE in order to accelerate…

  14. Parallel Multiscale Algorithms for Astrophysical Fluid Dynamics Simulations

    NASA Technical Reports Server (NTRS)

    Norman, Michael L.

    1997-01-01

    Our goal is to develop software libraries and applications for astrophysical fluid dynamics simulations in multidimensions that will enable us to resolve the large spatial and temporal variations that inevitably arise due to gravity, fronts and microphysical phenomena. The software must run efficiently on parallel computers and be general enough to allow the incorporation of a wide variety of physics. Cosmological structure formation with realistic gas physics is the primary application driver in this work. Accurate simulations of e.g. galaxy formation require a spatial dynamic range (i.e., ratio of system scale to smallest resolved feature) of 10^4 or more in three dimensions in arbitrary topologies. We take this as our technical requirement. We have achieved, and in fact, surpassed these goals.

  15. Design, optimization, and biological evaluation of novel keto-benzimidazoles as potent and selective inhibitors of phosphodiesterase 10A (PDE10A).

    PubMed

    Hu, Essa; Kunz, Roxanne K; Chen, Ning; Rumfelt, Shannon; Siegmund, Aaron; Andrews, Kristin; Chmait, Samer; Zhao, Sharon; Davis, Carl; Chen, Hang; Lester-Zeiner, Dianna; Ma, Ji; Biorn, Christopher; Shi, Jianxia; Porter, Amy; Treanor, James; Allen, Jennifer R

    2013-11-14

    Our development of PDE10A inhibitors began with an HTS screening hit (1) that exhibited both high p-glycoprotein (P-gp) efflux ratios in rat and human and poor metabolic stability. On the basis of cocrystal structure of 1 in human PDE10A enzyme, we designed a novel keto-benzimidazole 26 with comparable PDE10A potency devoid of efflux liabilities. On target in vivo coverage of PDE10A in rat brain was assessed using our previously reported LC-MS/MS receptor occupancy (RO) technology. Compound 26 achieved 55% RO of PDE10A at 30 mg/kg po and covered PDE10A receptors in rat brain in a dose-dependent manner. Cocrystal structure of 26 in PDE10A confirmed the binding mode of the novel scaffold. Further optimization resulted in the identification of keto-benzimidazole 34, which showed an increased in vivo efficacy of 57% RO in rats at 10 mg/kg po and an improved in vivo rat clearance and oral bioavailability.

  16. Performance Evaluation in Network-Based Parallel Computing

    NASA Technical Reports Server (NTRS)

    Dezhgosha, Kamyar

    1996-01-01

    Network-based parallel computing is emerging as a cost-effective alternative for solving many problems which require use of supercomputers or massively parallel computers. The primary objective of this project has been to conduct experimental research on performance evaluation for clustered parallel computing. First, a testbed was established by augmenting our existing SUNSPARCs' network with PVM (Parallel Virtual Machine), which is a software system for linking clusters of machines. Second, a set of three basic applications was selected. The applications consist of a parallel search, a parallel sort, and a parallel matrix multiplication. These application programs were implemented in the C programming language under PVM. Third, we conducted performance evaluation under various configurations and problem sizes. Alternative parallel computing models and workload allocations for application programs were explored. The performance metric was limited to elapsed time or response time, which in the context of parallel computing can be expressed in terms of speedup. The results reveal that the overhead of communication latency between processes in many cases is the restricting factor to performance. That is, coarse-grain parallelism, which requires less frequent communication between processes, will result in higher performance in network-based computing. Finally, we are in the final stages of installing an Asynchronous Transfer Mode (ATM) switch and four ATM interfaces (each 155 Mbps) which will allow us to extend our study to newer applications, performance metrics, and configurations.
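
    The link between elapsed time, speedup, and communication granularity can be sketched in Python. Speedup is the standard ratio of serial to parallel elapsed time. The cost model in modeled_parallel_time (perfectly divided computation plus a fixed latency per message) is a deliberately simple assumption for illustration and is not taken from the report, as are the function names.

```python
def speedup(serial_time, parallel_time):
    """Speedup: serial elapsed time divided by parallel elapsed time."""
    return serial_time / parallel_time


def efficiency(serial_time, parallel_time, workers):
    """Fraction of ideal linear speedup achieved with the given worker count."""
    return speedup(serial_time, parallel_time) / workers


def modeled_parallel_time(compute_time, workers, messages, latency):
    """Assumed model: evenly divided compute plus a fixed latency per message."""
    return compute_time / workers + messages * latency
```

    Under this model, a 100 s job on 4 workers with 0.01 s latency takes about 35 s with 1000 messages but about 25.1 s with 10 messages, consistent with the observation that coarse-grain parallelism performs better on a network.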

  17. Molecular-dynamics simulations of self-assembled monolayers (SAM) on parallel computers

    NASA Astrophysics Data System (ADS)

    Vemparala, Satyavani

    The purpose of this dissertation is to investigate the properties of self-assembled monolayers, particularly alkanethiols and poly (ethylene glycol) terminated alkanethiols. These simulations are based on realistic interatomic potentials and require scalable and portable multiresolution algorithms implemented on parallel computers. Large-scale molecular dynamics simulations of self-assembled alkanethiol monolayer systems have been carried out using an all-atom model involving a million atoms to investigate their structural properties as a function of temperature, lattice spacing and molecular chain-length. Results show that the alkanethiol chains tilt from the surface normal by a collective angle of 25° along the next-nearest neighbor direction at 300 K. At 350 K the system transforms to a disordered phase characterized by small tilt angle, flexible tilt direction, and random distribution of backbone planes. With increasing lattice spacing, a, the tilt angle increases rapidly from a nearly zero value at a = 4.7 Å to as high as 34° at a = 5.3 Å at 300 K. We also studied the effect of end groups on the tilt structure of SAM films. We characterized the system with respect to temperature, the alkane chain length, lattice spacing, and the length of the end group. We found that the gauche defects were predominant only in the tails, and the gauche defects increased with the temperature and number of EG units. The effect of an electric field on the structure of a poly (ethylene glycol) (PEG) terminated alkanethiol self-assembled monolayer (SAM) on gold has been studied using a parallel molecular dynamics method. An applied electric field triggers a conformational transition from all-trans to a mostly gauche conformation. The polarity of the electric field has a significant effect on the surface structure of PEG leading to a profound effect on the hydrophilicity of the surface. The electric field applied anti-parallel to the surface normal causes a reversible transition to an ordered state

  18. cuTauLeaping: A GPU-Powered Tau-Leaping Stochastic Simulator for Massive Parallel Analyses of Biological Systems

    PubMed Central

    Besozzi, Daniela; Pescini, Dario; Mauri, Giancarlo

    2014-01-01

    Tau-leaping is a stochastic simulation algorithm that efficiently reconstructs the temporal evolution of biological systems, modeled according to the stochastic formulation of chemical kinetics. The analysis of dynamical properties of these systems in physiological and perturbed conditions usually requires the execution of a large number of simulations, leading to high computational costs. Since each simulation can be executed independently from the others, a massive parallelization of tau-leaping can bring significant reductions of the overall running time. The emerging field of General Purpose Graphic Processing Units (GPGPU) provides power-efficient high-performance computing at a relatively low cost. In this work we introduce cuTauLeaping, a stochastic simulator of biological systems that makes use of GPGPU computing to execute multiple parallel tau-leaping simulations, by fully exploiting Nvidia's Fermi GPU architecture. We show how a considerable computational speedup is achieved on GPU by partitioning the execution of tau-leaping into multiple separated phases, and we describe how to avoid some implementation pitfalls related to the scarcity of memory resources on the GPU streaming multiprocessors. Our results show that cuTauLeaping largely outperforms the CPU-based tau-leaping implementation when the number of parallel simulations increases, with a break-even directly depending on the size of the biological system and on the complexity of its emergent dynamics. In particular, cuTauLeaping is exploited to investigate the probability distribution of bistable states in the Schlögl model, and to carry out a bidimensional parameter sweep analysis to study the oscillatory regimes in the Ras/cAMP/PKA pathway in S. cerevisiae. PMID:24663957

  19. LightForce Photon-Pressure Collision Avoidance: Updated Efficiency Analysis Utilizing a Highly Parallel Simulation Approach

    NASA Technical Reports Server (NTRS)

    Stupl, Jan; Faber, Nicolas; Foster, Cyrus; Yang, Fan Yang; Nelson, Bron; Aziz, Jonathan; Nuttall, Andrew; Henze, Chris; Levit, Creon

    2014-01-01

    This paper provides an updated efficiency analysis of the LightForce space debris collision avoidance scheme. LightForce aims to prevent collisions on warning by utilizing photon pressure from ground based, commercial off the shelf lasers. Past research has shown that a few ground-based systems consisting of 10 kilowatt class lasers directed by 1.5 meter telescopes with adaptive optics could lower the expected number of collisions in Low Earth Orbit (LEO) by an order of magnitude. Our simulation approach utilizes the entire Two Line Element (TLE) catalogue in LEO for a given day as initial input. Least-squares fitting of a TLE time series is used for an improved orbit estimate. We then calculate the probability of collision for all LEO objects in the catalogue for a time step of the simulation. The conjunctions that exceed a threshold probability of collision are then engaged by a simulated network of laser ground stations. After those engagements, the perturbed orbits are used to re-assess the probability of collision and evaluate the efficiency of the system. This paper describes new simulations with three updated aspects: 1) By utilizing a highly parallel simulation approach employing hundreds of processors, we have extended our analysis to a much broader dataset. The simulation time is extended to one year. 2) We analyze not only the efficiency of LightForce on conjunctions that naturally occur, but also take into account conjunctions caused by orbit perturbations due to LightForce engagements. 3) We use a new simulation approach that is regularly updating the LightForce engagement strategy, as it would be during actual operations. In this paper we present our simulation approach to parallelize the efficiency analysis, its computational performance and the resulting expected efficiency of the LightForce collision avoidance system. Results indicate that utilizing a network of four LightForce stations with 20 kilowatt lasers, 85% of all conjunctions with a

  20. Inversion of potential field data using the finite element method on parallel computers

    NASA Astrophysics Data System (ADS)

    Gross, L.; Altinay, C.; Shaw, S.

    2015-11-01

    In this paper we present a formulation of the joint inversion of potential field anomaly data as an optimization problem with partial differential equation (PDE) constraints. The problem is solved using the iterative Broyden-Fletcher-Goldfarb-Shanno (BFGS) method with the Hessian operator of the regularization and cross-gradient component of the cost function as preconditioner. We will show that each iterative step requires the solution of several PDEs namely for the potential fields, for the adjoint defects and for the application of the preconditioner. In extension to the traditional discrete formulation the BFGS method is applied to continuous descriptions of the unknown physical properties in combination with an appropriate integral form of the dot product. The PDEs can easily be solved using standard conforming finite element methods (FEMs) with potentially different resolutions. For two examples we demonstrate that the number of PDE solutions required to reach a given tolerance in the BFGS iteration is controlled by weighting regularization and cross-gradient but is independent of the resolution of PDE discretization and that as a consequence the method is weakly scalable with the number of cells on parallel computers. We also show a comparison with the UBC-GIF GRAV3D code.

  1. UCR1C is a novel activator of phosphodiesterase 4 (PDE4) long isoforms and attenuates cardiomyocyte hypertrophy.

    PubMed

    Wang, Li; Burmeister, Brian T; Johnson, Keven R; Baillie, George S; Karginov, Andrei V; Skidgel, Randal A; O'Bryan, John P; Carnegie, Graeme K

    2015-05-01

    Hypertrophy increases the risk of heart failure and arrhythmia. Prevention or reversal of the maladaptive hypertrophic phenotype has thus been proposed to treat heart failure. Chronic β-adrenergic receptor (β-AR) stimulation induces cardiomyocyte hypertrophy by elevating 3',5'-cyclic adenosine monophosphate (cAMP) levels and activating downstream effectors such as protein kinase A (PKA). Conversely, hydrolysis of cAMP by phosphodiesterases (PDEs) spatiotemporally restricts cAMP signaling. Here, we demonstrate that PDE4, but not PDE3, is critical in regulating cardiomyocyte hypertrophy, and may represent a potential target for preventing maladaptive hypertrophy. We identify a sequence within the upstream conserved region 1 of PDE4D, termed UCR1C, as a novel activator of PDE4 long isoforms. UCR1C activates PDE4 in complex with A-kinase anchoring protein (AKAP)-Lbc resulting in decreased PKA signaling facilitated by AKAP-Lbc. Expression of UCR1C in cardiomyocytes inhibits hypertrophy in response to chronic β-AR stimulation. This effect is partially due to inhibition of nuclear PKA activity, which decreases phosphorylation of the transcription factor cAMP response element-binding protein (CREB). In conclusion, PDE4 activation by UCR1C attenuates cardiomyocyte hypertrophy by specifically inhibiting nuclear PKA activity. Published by Elsevier Inc.

  2. Conservative parallel simulation of priority class queueing networks

    NASA Technical Reports Server (NTRS)

    Nicol, David

    1992-01-01

    A conservative synchronization protocol is described for the parallel simulation of queueing networks having C job priority classes, where a job's class is fixed. This problem has long vexed designers of conservative synchronization protocols because of its seemingly poor ability to compute lookahead: the time of the next departure. For, a job in service having low priority can be preempted at any time by an arrival having higher priority and an arbitrarily small service time. The solution is to skew the event generation activity so that the events for higher priority jobs are generated farther ahead in simulated time than lower priority jobs. Thus, when a lower priority job enters service for the first time, all the higher priority jobs that may preempt it are already known and the job's departure time can be exactly predicted. Finally, the protocol was analyzed and it was demonstrated that good performance can be expected on the simulation of large queueing networks.

  3. Conservative parallel simulation of priority class queueing networks

    NASA Technical Reports Server (NTRS)

    Nicol, David M.

    1990-01-01

    A conservative synchronization protocol is described for the parallel simulation of queueing networks having C job priority classes, where a job's class is fixed. This problem has long vexed designers of conservative synchronization protocols because of its seemingly poor ability to compute lookahead: the time of the next departure. For, a job in service having low priority can be preempted at any time by an arrival having higher priority and an arbitrarily small service time. The solution is to skew the event generation activity so that the events for higher priority jobs are generated farther ahead in simulated time than lower priority jobs. Thus, when a lower priority job enters service for the first time, all the higher priority jobs that may preempt it are already known and the job's departure time can be exactly predicted. Finally, the protocol was analyzed and it was demonstrated that good performance can be expected on the simulation of large queueing networks.

  4. Identification of cytosolic phosphodiesterases in the erythrocyte: A possible role for PDE5

    PubMed Central

    Adderley, Shaquria P.; Thuet, Kelly M.; Sridharan, Meera; Bowles, Elizabeth A.; Stephenson, Alan H.; Ellsworth, Mary L.; Sprague, Randy S.

    2011-01-01

    Summary Background Within erythrocytes (RBCs), cAMP levels are regulated by phosphodiesterases (PDEs). Increases in cAMP and ATP release associated with activation of β-adrenergic receptors (βARs) and prostacyclin receptors (IPRs) are regulated by PDEs 2, 4 and PDE 3, respectively. Here we establish the presence of cytosolic PDEs in RBCs and determine a role for PDE5 in regulating levels of cGMP. Material/Methods Purified cytosolic proteins were obtained from isolated human RBCs and western analysis was performed using antibodies against PDEs 3A, 4 and 5. Rabbit RBCs were incubated with dbcGMP, a cGMP analog, to determine the effect of cGMP on cAMP levels. To determine if cGMP affects receptor-mediated increases in cAMP, rabbit RBCs were incubated with dbcGMP prior to addition of isoproterenol (ISO), a βAR receptor agonist. To demonstrate that endogenous cGMP produces the same effect, rabbit and human RBCs were incubated with SpNONOate (SpNO), a nitric oxide donor, and YC1, a direct activator of soluble guanylyl cyclase (sGC), in the absence and presence of a selective PDE5 inhibitor, zaprinast (ZAP). Results Western analysis identified PDEs 3A, 4D and 5A. dbcGMP produced a concentration dependent increase in cAMP and ISO-induced increases in cAMP were potentiated by dbcGMP. In addition, incubation with YC1 and SpNO in the presence of ZAP potentiated βAR-induced increases in cAMP. Conclusions PDEs 2, 3A and 5 are present in the cytosol of human RBCs. PDE5 activity in RBCs regulates cGMP levels. Increases in intracellular cGMP augment cAMP levels. These studies suggest a novel role for PDE5 in erythrocytes. PMID:21525805

  5. MMS Observations and Hybrid Simulations of Surface Ripples at a Marginally Quasi-Parallel Shock

    NASA Astrophysics Data System (ADS)

    Gingell, Imogen; Schwartz, Steven J.; Burgess, David; Johlander, Andreas; Russell, Christopher T.; Burch, James L.; Ergun, Robert E.; Fuselier, Stephen; Gershman, Daniel J.; Giles, Barbara L.; Goodrich, Katherine A.; Khotyaintsev, Yuri V.; Lavraud, Benoit; Lindqvist, Per-Arne; Strangeway, Robert J.; Trattner, Karlheinz; Torbert, Roy B.; Wei, Hanying; Wilder, Frederick

    2017-11-01

    Simulations and observations of collisionless shocks have shown that deviations of the nominal local shock normal orientation, that is, surface waves or ripples, are expected to propagate in the ramp and overshoot of quasi-perpendicular shocks. Here we identify signatures of a surface ripple propagating during a crossing of Earth's marginally quasi-parallel (θBn ~ 45°) or quasi-parallel bow shock on 27 November 2015 06:01:44 UTC by the Magnetospheric Multiscale (MMS) mission and determine the ripple's properties using multispacecraft methods. Using two-dimensional hybrid simulations, we confirm that surface ripples are a feature of marginally quasi-parallel and quasi-parallel shocks under the observed solar wind conditions. In addition, since these marginally quasi-parallel and quasi-parallel shocks are expected to undergo a cyclic reformation of the shock front, we discuss the impact of multiple sources of nonstationarity on shock structure. Importantly, ripples are shown to be transient phenomena, developing faster than an ion gyroperiod and only during the period of the reformation cycle when a newly developed shock ramp is unaffected by turbulence in the foot. We conclude that the change in properties of the ripple observed by MMS is consistent with the reformation of the shock front over a time scale of an ion gyroperiod.

  6. Massively parallel simulations of relativistic fluid dynamics on graphics processing units with CUDA

    NASA Astrophysics Data System (ADS)

    Bazow, Dennis; Heinz, Ulrich; Strickland, Michael

    2018-04-01

    Relativistic fluid dynamics is a major component in dynamical simulations of the quark-gluon plasma created in relativistic heavy-ion collisions. Simulations of the full three-dimensional dissipative dynamics of the quark-gluon plasma with fluctuating initial conditions are computationally expensive and typically require some degree of parallelization. In this paper, we present a GPU implementation of the Kurganov-Tadmor algorithm which solves the 3 + 1d relativistic viscous hydrodynamics equations including the effects of both bulk and shear viscosities. We demonstrate that the resulting CUDA-based GPU code is approximately two orders of magnitude faster than the corresponding serial implementation of the Kurganov-Tadmor algorithm. We validate the code using (semi-)analytic tests such as the relativistic shock-tube and Gubser flow.

  7. Superelement model based parallel algorithm for vehicle dynamics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Agrawal, O.P.; Danhof, K.J.; Kumar, R.

    1994-05-01

    This paper presents a superelement model based parallel algorithm for planar vehicle dynamics. The vehicle model is made up of a chassis and two suspension systems each of which consists of an axle-wheel assembly and two trailing arms. In this model, the chassis is treated as a Cartesian element and each suspension system is treated as a superelement. The parameters associated with the superelements are computed using an inverse dynamics technique. Suspension shock absorbers and the tires are modeled by nonlinear springs and dampers. The Euler-Lagrange approach is used to develop the system equations of motion. This leads to a system of differential and algebraic equations in which the constraints internal to superelements appear only explicitly. The above formulation is implemented on a multiprocessor machine. The numerical flow chart is divided into modules and the computation of several modules is performed in parallel to gain computational efficiency. In this implementation, the master (parent processor) creates a pool of slaves (child processors) at the beginning of the program. The slaves remain in the pool until they are needed to perform certain tasks. Upon completion of a particular task, a slave returns to the pool. This improves the overall response time of the algorithm. The formulation presented is general which makes it attractive for a general purpose code development. Speedups obtained in the different modules of the dynamic analysis computation are also presented. Results show that the superelement model based parallel algorithm can significantly reduce the vehicle dynamics simulation time. 52 refs.

  8. Numerical Simulation of Flow Field Within Parallel Plate Plastometer

    NASA Technical Reports Server (NTRS)

    Antar, Basil N.

    2002-01-01

    The Parallel Plate Plastometer (PPP) is a device commonly used for measuring the viscosity of high polymers at low rates of shear in the range 10^4 to 10^9 poise. This device is being validated for use in measuring the viscosity of liquid glasses at high temperatures having similar ranges for the viscosity values. The PPP instrument consists of two similar parallel plates, each about 1 inch in diameter, with the upper plate being movable while the lower one is kept stationary. Load is applied to the upper plate by means of a beam connected to a shaft attached to the upper plate. The viscosity of the fluid is deduced from measuring the variation of the plate separation, h, as a function of time when a specified fixed load is applied on the beam. Operating plate speeds measured with the PPP are usually in the range of 10.3 cm/s or lower. The flow field within the PPP can be simulated using the equations of motion of fluid flow for this configuration. With flow speeds in the range quoted above, the flow field between the two plates is certainly incompressible and laminar. Such flows can be easily simulated using numerical modeling with computational fluid dynamics (CFD) codes. We present below the mathematical model used to simulate this flow field and also the solutions obtained for the flow using a commercially available finite element CFD code.
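
    As a worked illustration of deducing viscosity from h(t) under a fixed load, the sketch below assumes the classical Stefan squeeze-flow relation for a Newtonian fluid that fills the gap between two circular plates of radius R: 1/h^2 - 1/h0^2 = 4 F t / (3 pi eta R^4). The abstract does not say which relation the instrument uses (a constant-volume sample would need a different formula), and all numerical values are illustrative.

```python
import math


def stefan_viscosity(force, radius, h0, h, t):
    """Viscosity (Pa*s) from the Stefan squeeze-flow relation.

    Assumes a Newtonian fluid filling the gap between circular plates:
        1/h**2 - 1/h0**2 = 4*F*t / (3*pi*eta*R**4)
    """
    if not 0.0 < h < h0:
        raise ValueError("gap must shrink: expected 0 < h < h0")
    return 4.0 * force * t / (
        3.0 * math.pi * radius**4 * (1.0 / h**2 - 1.0 / h0**2)
    )


def stefan_gap(force, radius, h0, eta, t):
    """Forward model: plate separation after time t for viscosity eta."""
    inv_h2 = 1.0 / h0**2 + 4.0 * force * t / (3.0 * math.pi * eta * radius**4)
    return inv_h2 ** -0.5


# Illustrative SI values only: 10 N load, 12.7 mm plate radius (1 inch
# diameter), 5 mm initial gap, 1e6 Pa*s (1e7 poise) fluid, 10 minutes.
h_after = stefan_gap(force=10.0, radius=0.0127, h0=0.005, eta=1.0e6, t=600.0)
eta_back = stefan_viscosity(force=10.0, radius=0.0127, h0=0.005, h=h_after, t=600.0)
```

    The round trip (forward model, then inversion) recovers the assumed viscosity, which is a quick consistency check on the formula rather than a validation against the instrument.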

  9. Discovery of Phosphodiesterase 10A (PDE10A) PET Tracer AMG 580 to Support Clinical Studies.

    PubMed

    Hu, Essa; Chen, Ning; Kunz, Roxanne K; Hwang, Dah-Ren; Michelsen, Klaus; Davis, Carl; Ma, Ji; Shi, Jianxia; Lester-Zeiner, Dianna; Hungate, Randall; Treanor, James; Chen, Hang; Allen, Jennifer R

    2016-07-14

    We report the discovery of PDE10A PET tracer AMG 580 developed to support proof of concept studies with PDE10A inhibitors in the clinic. To find a tracer with higher binding potential (BPND) in NHP than our previously reported tracer 1, we implemented a surface plasmon resonance assay to measure the binding off-rate to identify candidates with slower washout rate in vivo. Five candidates (2-6) from two structurally distinct scaffolds were identified that possessed both the in vitro characteristics that would favor central penetration and the structural features necessary for PET isotope radiolabeling. Two cinnolines (2, 3) and one keto-benzimidazole (5) exhibited PDE10A target specificity and brain uptake comparable to or better than 1 in the in vivo LC-MS/MS kinetics distribution study in SD rats. In NHP PET imaging study, [(18)F]-5 produced a significantly improved BPND of 3.1 and was nominated as PDE10A PET tracer clinical candidate for further studies.

  10. Implementation of unsteady sampling procedures for the parallel direct simulation Monte Carlo method

    NASA Astrophysics Data System (ADS)

    Cave, H. M.; Tseng, K.-C.; Wu, J.-S.; Jermy, M. C.; Huang, J.-C.; Krumdieck, S. P.

    2008-06-01

    An unsteady sampling routine for a general parallel direct simulation Monte Carlo method called PDSC is introduced, allowing the simulation of time-dependent flow problems in the near continuum range. A post-processing procedure called DSMC rapid ensemble averaging method (DREAM) is developed to reduce the statistical scatter in the results while minimising both memory and simulation time. This method builds an ensemble average of repeated runs over a small number of sampling intervals prior to the sampling point of interest by restarting the flow using either a Maxwellian distribution based on macroscopic properties for near equilibrium flows (DREAM-I) or output instantaneous particle data obtained by the original unsteady sampling of PDSC for strongly non-equilibrium flows (DREAM-II). The method is validated by simulating shock tube flow and the development of simple Couette flow. Unsteady PDSC is found to accurately predict the flow field in both cases with significantly reduced run-times over single processor code, and DREAM greatly reduces the statistical scatter in the results while maintaining accurate particle velocity distributions. Simulations are then conducted of two applications involving the interaction of shocks over wedges. The results of these simulations are compared to experimental data and simulations from the literature where these are available. In general, it was found that 10 ensembled runs of DREAM processing could reduce the statistical uncertainty in the raw PDSC data by 2.5-3.3 times, based on the limited number of cases in the present study.
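
    The reported 2.5-3.3 times reduction from 10 ensembled runs is in line with the 1/sqrt(N) scaling of the scatter of a mean of N independent samples, since sqrt(10) is about 3.16. The sketch below checks that scaling on synthetic Gaussian noise. It is not DREAM itself; the function name, cell count and seed are illustrative assumptions.

```python
import math
import random
import statistics


def scatter_reduction(n_runs, n_cells=2000, sigma=1.0, seed=0):
    """Ratio of single-run scatter to ensemble-averaged scatter."""
    rng = random.Random(seed)
    # One noisy sample per cell for each run (the true value is 0 everywhere).
    runs = [
        [rng.gauss(0.0, sigma) for _ in range(n_cells)]
        for _ in range(n_runs)
    ]
    # Average the runs cell by cell to form the ensemble estimate.
    ensemble = [statistics.fmean(cell) for cell in zip(*runs)]
    return statistics.pstdev(runs[0]) / statistics.pstdev(ensemble)


ratio = scatter_reduction(10)
expected = math.sqrt(10)  # about 3.16
```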

  11. Heterologous desensitization of cardiac β-adrenergic signal via hormone-induced βAR/arrestin/PDE4 complexes

    PubMed Central

    Shi, Qian; Li, Minghui; Mika, Delphine; Fu, Qin; Kim, Sungjin; Phan, Jason; Shen, Ao; Vandecasteele, Gregoire; Xiang, Yang K.

    2017-01-01

    Aims: Cardiac β-adrenergic receptor (βAR) signalling is susceptible to heterologous desensitization by different neurohormonal stimuli in clinical conditions associated with heart failure. We aim to examine the underlying mechanism of cross talk between βARs and a set of G-protein coupled receptors (GPCRs) activated by hormones/agonists. Methods and results: Rat ventricular cardiomyocytes were used to determine heterologous phosphorylation of βARs under a series of GPCR agonists. Activation of Gs-coupled dopamine receptor, adenosine receptor, relaxin receptor and prostaglandin E2 receptor, and Gq-coupled α1 adrenergic receptor and angiotensin II type 1 receptor promotes phosphorylation of β1AR and β2AR at putative protein kinase A (PKA) phosphorylation sites; but activation of Gi-coupled α2 adrenergic receptor and activation of protease-activated receptor does not. The GPCR agonists that promote β2AR phosphorylation effectively inhibit βAR agonist isoproterenol-induced PKA phosphorylation of phospholamban and contractile function in ventricular cardiomyocytes. Heterologous GPCR stimuli have minimal to small effect on isoproterenol-induced β2AR activation and G-protein coupling for cyclic adenosine monophosphate (cAMP) production. However, these GPCR stimuli significantly promote phosphorylation of phosphodiesterase 4D (PDE4D), and recruit PDE4D to the phosphorylated β2AR in a β-arrestin 2 dependent manner without promoting β2AR endocytosis. The increased binding between β2AR and PDE4D effectively hydrolyzes cAMP signal generated by subsequent stimulation with isoproterenol. Mutation of PKA phosphorylation sites in β2AR, inhibition of PDE4, or genetic ablation of PDE4D or β-arrestin 2 abolishes this heterologous inhibitory effect. Ablation of β-arrestin 2 or PDE4D gene also rescues β-adrenergic stimuli-induced myocyte contractile function. Conclusions: These data reveal essential roles of β-arrestin 2 and PDE4D in a common mechanism for heterologous

  12. Scalable and massively parallel Monte Carlo photon transport simulations for heterogeneous computing platforms

    NASA Astrophysics Data System (ADS)

    Yu, Leiming; Nina-Paravecino, Fanny; Kaeli, David; Fang, Qianqian

    2018-01-01

    We present a highly scalable Monte Carlo (MC) three-dimensional photon transport simulation platform designed for heterogeneous computing systems. Through the development of a massively parallel MC algorithm using the Open Computing Language framework, this research extends our existing graphics processing unit (GPU)-accelerated MC technique to a highly scalable vendor-independent heterogeneous computing environment, achieving significantly improved performance and software portability. A number of parallel computing techniques are investigated to achieve portable performance over a wide range of computing hardware. Furthermore, multiple thread-level and device-level load-balancing strategies are developed to obtain efficient simulations using multiple central processing units and GPUs.

  13. Accelerating Dust Storm Simulation by Balancing Task Allocation in Parallel Computing Environment

    NASA Astrophysics Data System (ADS)

    Gui, Z.; Yang, C.; XIA, J.; Huang, Q.; YU, M.

    2013-12-01

    Dust storms have serious negative impacts on the environment, human health, and assets. Continuing global climate change has increased the frequency and intensity of dust storms in the past decades. To better understand and predict the distribution, intensity and structure of dust storms, a series of dust storm models have been developed, such as the Dust Regional Atmospheric Model (DREAM), the NMM meteorological module (NMM-dust) and the Chinese Unified Atmospheric Chemistry Environment for Dust (CUACE/Dust). The development and application of these models have contributed significantly to both scientific research and our daily life. However, dust storm simulation is a data- and computing-intensive process. Normally, a simulation of a single dust storm event may take several hours or days to run, which seriously impacts the timeliness of prediction and potential applications. To speed up the process, high performance computing is widely adopted. By partitioning a large study area into small subdomains according to their geographic location and executing them on different computing nodes in parallel, the computing performance can be significantly improved. Since spatiotemporal correlations exist in the geophysical process of dust storm simulation, each subdomain allocated to a node needs to communicate with other geographically adjacent subdomains to exchange data. Inappropriate allocations may introduce imbalanced task loads and unnecessary communication among computing nodes. Therefore, the task allocation method is the key factor that may impact the feasibility of the parallelization. The allocation algorithm needs to carefully balance the computing cost and communication cost for each computing node to minimize total execution time and reduce overall communication cost for the entire system. This presentation introduces two algorithms for such allocation and compares them with an evenly distributed allocation method. Specifically, 1) In order to get optimized solutions, a

  14. Building Blocks for Reliable Complex Nonlinear Numerical Simulations

    NASA Technical Reports Server (NTRS)

    Yee, H. C.

    2005-01-01

    This chapter describes some of the building blocks to ensure a higher level of confidence in the predictability and reliability (PAR) of numerical simulation of multiscale complex nonlinear problems. The focus is on relating PAR of numerical simulations with complex nonlinear phenomena of numerics. To isolate sources of numerical uncertainties, the possible discrepancy between the chosen partial differential equation (PDE) model and the real physics and/or experimental data is set aside. The discussion is restricted to how well numerical schemes can mimic the solution behavior of the underlying PDE model for finite time steps and grid spacings. The situation is complicated by the fact that the available theory for the understanding of nonlinear behavior of numerics is not at a stage to fully analyze the nonlinear Euler and Navier-Stokes equations. The discussion is based on the knowledge gained for nonlinear model problems with known analytical solutions to identify and explain the possible sources and remedies of numerical uncertainties in practical computations.

  15. Mutations in the PDE6B gene in autosomal recessive retinitis pigmentosa

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Danciger, M.; Blaney, J.; Gao, Y.Q.

    1995-11-01

    We have studied 24 small families with presumed autosomal recessive inheritance of retinitis pigmentosa by a combination of haplotype analysis and exon screening. Initial analysis of the families was made with a dinucleotide repeat polymorphism adjacent to the gene for rod cGMP-phosphodiesterase (PDE6B). This was followed by denaturing gradient gel electrophoresis (DGGE) and single-strand conformation polymorphism electrophoresis (SSCPE) of the 22 exons and a portion of the 5′ untranslated region of the PDE6B gene in the probands of each family in which the PDE6B locus could not be ruled out from segregating with disease. Two probands were found with compound heterozygous mutations: Gly576Asp and His620(1-bp del) mutations were present in one proband, and a Lys706X null mutation and an AG to AT splice acceptor site mutation in intron 2 were present in the other. Only the affected members of each of the two families carried both corresponding mutations. 29 refs., 3 figs., 1 tab.

  16. Solving large-scale PDE-constrained Bayesian inverse problems with Riemann manifold Hamiltonian Monte Carlo

    NASA Astrophysics Data System (ADS)

    Bui-Thanh, T.; Girolami, M.

    2014-11-01

    We consider the Riemann manifold Hamiltonian Monte Carlo (RMHMC) method for solving statistical inverse problems governed by partial differential equations (PDEs). The Bayesian framework is employed to cast the inverse problem into the task of statistical inference whose solution is the posterior distribution in infinite dimensional parameter space conditional upon observation data and Gaussian prior measure. We discretize both the likelihood and the prior using the H1-conforming finite element method together with a matrix transfer technique. The power of the RMHMC method is that it exploits the geometric structure induced by the PDE constraints of the underlying inverse problem. Consequently, each RMHMC posterior sample is almost uncorrelated with (independent of) the others, providing statistically efficient Markov chain simulation. However, this statistical efficiency comes at a computational cost. This motivates us to consider computationally more efficient strategies for RMHMC. At the heart of our construction is the fact that for Gaussian error structures the Fisher information matrix coincides with the Gauss-Newton Hessian. We exploit this fact in considering a computationally simplified RMHMC method combining state-of-the-art adjoint techniques and the superiority of the RMHMC method. Specifically, we first form the Gauss-Newton Hessian at the maximum a posteriori point and then use it as a fixed constant metric tensor throughout RMHMC simulation. This eliminates the need for the computationally costly differential geometric Christoffel symbols, which in turn greatly reduces computational effort at a corresponding loss of sampling efficiency. We further reduce the cost of forming the Fisher information matrix by using a low rank approximation via a randomized singular value decomposition technique. This is efficient since only a small number of Hessian-vector products are required. The Hessian-vector product in turn requires only two extra PDE solves using the adjoint
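
    The statement that the Fisher information matrix coincides with the Gauss-Newton Hessian for Gaussian errors can be checked in a few lines. The derivation below is a sketch for the discretized (finite-dimensional) likelihood term only, without the prior. The notation is assumed rather than taken from the abstract: parameter u, forward map F, Jacobian J = dF/du, data y, and noise covariance Gamma.

```latex
% Assumed notation: y = data, F = forward (PDE) map, J = \partial F / \partial u,
% \Gamma = noise covariance, \Phi = negative log-likelihood (data misfit).
\begin{align*}
y &= F(u) + \eta, \qquad \eta \sim \mathcal{N}(0, \Gamma), \\
\Phi(u) &= \tfrac12 \bigl(F(u) - y\bigr)^{T} \Gamma^{-1} \bigl(F(u) - y\bigr), \\
\nabla \Phi(u) &= J^{T} \Gamma^{-1} \bigl(F(u) - y\bigr) = -J^{T} \Gamma^{-1} \eta, \\
\nabla^{2} \Phi(u) &= \underbrace{J^{T} \Gamma^{-1} J}_{H_{\mathrm{GN}}}
  + \sum_{i} \bigl[\Gamma^{-1} (F(u) - y)\bigr]_{i} \, \nabla^{2} F_{i}(u), \\
\mathcal{I}(u) &= \mathbb{E}_{\eta}\bigl[\nabla \Phi \, \nabla \Phi^{T}\bigr]
  = J^{T} \Gamma^{-1} \, \mathbb{E}\bigl[\eta \eta^{T}\bigr] \, \Gamma^{-1} J
  = J^{T} \Gamma^{-1} J = H_{\mathrm{GN}}.
\end{align*}
```

    The Gauss-Newton Hessian drops the second-derivative term of the full Hessian, and the expectation over the noise produces exactly the remaining term, so the two matrices agree.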

  17. GPAW - massively parallel electronic structure calculations with Python-based software.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Enkovaara, J.; Romero, N.; Shende, S.

    2011-01-01

    Electronic structure calculations are a widely used tool in materials science and a large consumer of supercomputing resources. Traditionally, the software packages for these kinds of simulations have been implemented in compiled languages, where Fortran in its different versions has been the most popular choice. While dynamic, interpreted languages, such as Python, can increase the efficiency of the programmer, they cannot compete directly with the raw performance of compiled languages. However, by using an interpreted language together with a compiled language, it is possible to have most of the productivity enhancing features together with a good numerical performance. We have used this approach in implementing the electronic structure simulation software GPAW using the combination of Python and C programming languages. While the chosen approach works well in standard workstations and Unix environments, massively parallel supercomputing systems can present some challenges in porting, debugging and profiling the software. In this paper we describe some details of the implementation and discuss the advantages and challenges of the combined Python/C approach. We show that despite the challenges it is possible to obtain good numerical performance and good parallel scalability with Python based software.

  18. Modelling and simulation of parallel triangular triple quantum dots (TTQD) by using SIMON 2.0

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fathany, Maulana Yusuf, E-mail: myfathany@gmail.com; Fuada, Syifaul, E-mail: fsyifaul@gmail.com; Lawu, Braham Lawas, E-mail: bram-labs@rocketmail.com

    2016-04-19

    This research presents an analysis of the modeling of Parallel Triple Quantum Dots (TQD) using SIMON (SIMulation Of Nano-structures). The Single Electron Transistor (SET) is used as the basic concept of modeling. We design the Parallel TQD structure from metal with a triangular geometry model, called Triangular Triple Quantum Dots (TTQD). We simulate it under several scenarios using different parameters, such as different capacitance values, various gate voltages, and different thermal conditions.

  19. Endpoint-based parallel data processing in a parallel active messaging interface of a parallel computer

    DOEpatents

    Archer, Charles J; Blocksome, Michael E; Ratterman, Joseph D; Smith, Brian E

    2014-02-11

    Endpoint-based parallel data processing in a parallel active messaging interface ('PAMI') of a parallel computer, the PAMI composed of data communications endpoints, each endpoint including a specification of data communications parameters for a thread of execution on a compute node, including specifications of a client, a context, and a task, the compute nodes coupled for data communications through the PAMI, including establishing a data communications geometry, the geometry specifying, for tasks representing processes of execution of the parallel application, a set of endpoints that are used in collective operations of the PAMI including a plurality of endpoints for one of the tasks; receiving in endpoints of the geometry an instruction for a collective operation; and executing the instruction for a collective operation through the endpoints in dependence upon the geometry, including dividing data communications operations among the plurality of endpoints for one of the tasks.

  20. Endpoint-based parallel data processing in a parallel active messaging interface of a parallel computer

    DOEpatents

    Archer, Charles J.; Blocksome, Michael A.; Ratterman, Joseph D.; Smith, Brian E.

    2014-08-12

    Endpoint-based parallel data processing in a parallel active messaging interface ('PAMI') of a parallel computer, the PAMI composed of data communications endpoints, each endpoint including a specification of data communications parameters for a thread of execution on a compute node, including specifications of a client, a context, and a task, the compute nodes coupled for data communications through the PAMI, including establishing a data communications geometry, the geometry specifying, for tasks representing processes of execution of the parallel application, a set of endpoints that are used in collective operations of the PAMI including a plurality of endpoints for one of the tasks; receiving in endpoints of the geometry an instruction for a collective operation; and executing the instruction for a collective operation through the endpoints in dependence upon the geometry, including dividing data communications operations among the plurality of endpoints for one of the tasks.

  1. 3D multiphysics modeling of superconducting cavities with a massively parallel simulation suite

    DOE PAGES

    Kononenko, Oleksiy; Adolphsen, Chris; Li, Zenghai; ...

    2017-10-10

    Radiofrequency cavities based on superconducting technology are widely used in particle accelerators for various applications. The cavities usually have high quality factors and hence narrow bandwidths, so the field stability is sensitive to detuning from the Lorentz force and external loads, including vibrations and helium pressure variations. If not properly controlled, the detuning can result in a serious performance degradation of a superconducting accelerator, so an understanding of the underlying detuning mechanisms can be very helpful. Recent advances in the simulation suite ace3p have enabled realistic multiphysics characterization of such complex accelerator systems on supercomputers. In this paper, we present the new capabilities in ace3p for large-scale 3D multiphysics modeling of superconducting cavities, in particular, a parallel eigensolver for determining mechanical resonances, a parallel harmonic response solver to calculate the response of a cavity to external vibrations, and a numerical procedure to decompose mechanical loads, such as from the Lorentz force or piezoactuators, into the corresponding mechanical modes. These capabilities have been used to do an extensive rf-mechanical analysis of dressed TESLA-type superconducting cavities. Furthermore, the simulation results and their implications for the operational stability of the Linac Coherent Light Source-II are discussed.

  2. 3D multiphysics modeling of superconducting cavities with a massively parallel simulation suite

    NASA Astrophysics Data System (ADS)

    Kononenko, Oleksiy; Adolphsen, Chris; Li, Zenghai; Ng, Cho-Kuen; Rivetta, Claudio

    2017-10-01

    Radiofrequency cavities based on superconducting technology are widely used in particle accelerators for various applications. The cavities usually have high quality factors and hence narrow bandwidths, so the field stability is sensitive to detuning from the Lorentz force and external loads, including vibrations and helium pressure variations. If not properly controlled, the detuning can result in a serious performance degradation of a superconducting accelerator, so an understanding of the underlying detuning mechanisms can be very helpful. Recent advances in the simulation suite ace3p have enabled realistic multiphysics characterization of such complex accelerator systems on supercomputers. In this paper, we present the new capabilities in ace3p for large-scale 3D multiphysics modeling of superconducting cavities, in particular, a parallel eigensolver for determining mechanical resonances, a parallel harmonic response solver to calculate the response of a cavity to external vibrations, and a numerical procedure to decompose mechanical loads, such as from the Lorentz force or piezoactuators, into the corresponding mechanical modes. These capabilities have been used to do an extensive rf-mechanical analysis of dressed TESLA-type superconducting cavities. The simulation results and their implications for the operational stability of the Linac Coherent Light Source-II are discussed.

  3. 3D multiphysics modeling of superconducting cavities with a massively parallel simulation suite

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kononenko, Oleksiy; Adolphsen, Chris; Li, Zenghai

    Radiofrequency cavities based on superconducting technology are widely used in particle accelerators for various applications. The cavities usually have high quality factors and hence narrow bandwidths, so the field stability is sensitive to detuning from the Lorentz force and external loads, including vibrations and helium pressure variations. If not properly controlled, the detuning can result in a serious performance degradation of a superconducting accelerator, so an understanding of the underlying detuning mechanisms can be very helpful. Recent advances in the simulation suite ace3p have enabled realistic multiphysics characterization of such complex accelerator systems on supercomputers. In this paper, we present the new capabilities in ace3p for large-scale 3D multiphysics modeling of superconducting cavities, in particular, a parallel eigensolver for determining mechanical resonances, a parallel harmonic response solver to calculate the response of a cavity to external vibrations, and a numerical procedure to decompose mechanical loads, such as from the Lorentz force or piezoactuators, into the corresponding mechanical modes. These capabilities have been used to do an extensive rf-mechanical analysis of dressed TESLA-type superconducting cavities. Furthermore, the simulation results and their implications for the operational stability of the Linac Coherent Light Source-II are discussed.

  4. Parallel processing of real-time dynamic systems simulation on OSCAR (Optimally SCheduled Advanced multiprocessoR)

    NASA Technical Reports Server (NTRS)

    Kasahara, Hironori; Honda, Hiroki; Narita, Seinosuke

    1989-01-01

    Parallel processing of real-time dynamic systems simulation on a multiprocessor system named OSCAR is presented. In the simulation of dynamic systems, generally, the same calculations are repeated every time step. However, we cannot apply the Do-all or Do-across techniques for parallel processing of the simulation, since there exist data dependencies from the end of an iteration to the beginning of the next iteration and, furthermore, data input and data output are required every sampling time period. Therefore, parallelism inside the calculation required for a single time step, or a large basic block which consists of arithmetic assignment statements, must be used. In the proposed method, near fine grain tasks, each of which consists of one or more floating point operations, are generated to extract the parallelism from the calculation and assigned to processors by using optimal static scheduling at compile time in order to reduce the large run time overhead caused by the use of near fine grain tasks. The practicality of the scheme is demonstrated on OSCAR (Optimally SCheduled Advanced multiprocessoR), which has been developed to extract advantageous features of static scheduling algorithms to the maximum extent.
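
    Static scheduling means that the task-to-processor assignment is fixed before the run, so no scheduling decisions are paid for at run time. The sketch below uses a simple greedy list-scheduling heuristic on a tiny task graph. It is a generic stand-in and not OSCAR's scheduling algorithm (the abstract calls that algorithm optimal, which this heuristic is not); the task names, costs and dependencies are invented for illustration.

```python
def list_schedule(costs, deps, n_procs):
    """Greedy static list scheduling of a task graph.

    costs: {task: execution time}
    deps:  {task: set of predecessor tasks}
    Returns {task: (processor, start, finish)}.
    """
    proc_free = [0.0] * n_procs  # time at which each processor becomes idle
    placed = {}
    remaining = set(costs)
    while remaining:
        # Tasks whose predecessors have all been scheduled already.
        ready = [t for t in remaining
                 if all(d in placed for d in deps.get(t, ()))]
        if not ready:
            raise ValueError("dependency cycle in task graph")
        # Longest task first; ties broken by name for determinism.
        task = min(ready, key=lambda t: (-costs[t], t))
        # A task cannot start before all of its predecessors finish.
        earliest = max((placed[d][2] for d in deps.get(task, ())), default=0.0)
        proc = min(range(n_procs), key=lambda p: max(proc_free[p], earliest))
        start = max(proc_free[proc], earliest)
        finish = start + costs[task]
        placed[task] = (proc, start, finish)
        proc_free[proc] = finish
        remaining.remove(task)
    return placed


# Hypothetical task graph: c needs a; d needs a and b.
COSTS = {"a": 2, "b": 3, "c": 1, "d": 2}
DEPS = {"c": {"a"}, "d": {"a", "b"}}
schedule = list_schedule(COSTS, DEPS, n_procs=2)
makespan = max(finish for _, _, finish in schedule.values())
```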

  5. Parallel-plate transmission line type of EMP simulators: Systematic review and recommendations

    NASA Astrophysics Data System (ADS)

    Giri, D. V.; Liu, T. K.; Tesche, F. M.; King, R. W. P.

    1980-05-01

    This report presents various aspects of the two-parallel-plate transmission line type of EMP simulator. Much of the work is the result of research efforts conducted during the last two decades at the Air Force Weapons Laboratory, and in industries/universities as well. The principal features of individual simulator components are discussed. The report also emphasizes that it is imperative to hybridize our understanding of individual components so that we can draw meaningful conclusions of simulator performance as a whole.

  6. Bayer image parallel decoding based on GPU

    NASA Astrophysics Data System (ADS)

    Hu, Rihui; Xu, Zhiyong; Wei, Yuxing; Sun, Shaohua

    2012-11-01

    In the photoelectrical tracking system, Bayer images are traditionally decompressed on the CPU. However, this is too slow when the images become large, for example, 2K×2K×16bit. In order to accelerate Bayer image decoding, this paper introduces a parallel speedup method for NVIDIA's Graphics Processing Unit (GPU), which supports the CUDA architecture. The decoding procedure can be divided into three parts: the first is the serial part, the second is the task-parallel part, and the last is the data-parallel part, which includes inverse quantization, the inverse discrete wavelet transform (IDWT), and image post-processing. To reduce the execution time, the task-parallel part is optimized with OpenMP. The data-parallel part gains efficiency by executing on the GPU as a CUDA parallel program. The optimization techniques include instruction optimization, shared memory access optimization, coalesced memory access optimization and texture memory optimization. In particular, the IDWT can be sped up significantly by rewriting the 2D (two-dimensional) serial IDWT as a 1D parallel IDWT. In experiments with a 1K×1K×16bit Bayer image, the data-parallel part is more than 10 times faster than the CPU-based implementation. Finally, a CPU+GPU heterogeneous decompression system was designed. The experimental results show that it achieves a 3 to 5 times speed increase compared to the CPU serial method.

  7. Large-Scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) Simulations of the Molecular Crystal alphaRDX

    DTIC Science & Technology

    2013-08-01

    ...6 dispersion and electrostatic interactions. Constants for the SB potential are given in table 1 (SB potential for HMX/RDX (3, 9)). ... modeling dislocations in the energetic molecular crystal RDX using the Large-Scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) molecular

  8. Parallel grid library for rapid and flexible simulation development

    NASA Astrophysics Data System (ADS)

    Honkonen, I.; von Alfthan, S.; Sandroos, A.; Janhunen, P.; Palmroth, M.

    2013-04-01

    We present an easy to use and flexible grid library for developing highly scalable parallel simulations. The distributed cartesian cell-refinable grid (dccrg) supports adaptive mesh refinement and allows an arbitrary C++ class to be used as cell data. The amount of data in grid cells can vary both in space and time allowing dccrg to be used in very different types of simulations, for example in fluid and particle codes. Dccrg transfers the data between neighboring cells on different processes transparently and asynchronously allowing one to overlap computation and communication. This enables excellent scalability at least up to 32 k cores in magnetohydrodynamic tests depending on the problem and hardware. In the version of dccrg presented here part of the mesh metadata is replicated between MPI processes reducing the scalability of adaptive mesh refinement (AMR) to between 200 and 600 processes. Dccrg is free software that anyone can use, study and modify and is available at https://gitorious.org/dccrg. Users are also kindly requested to cite this work when publishing results obtained with dccrg. Catalogue identifier: AEOM_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AEOM_v1_0.html Program obtainable from: CPC Program Library, Queen’s University, Belfast, N. Ireland Licensing provisions: GNU Lesser General Public License version 3 No. of lines in distributed program, including test data, etc.: 54975 No. of bytes in distributed program, including test data, etc.: 974015 Distribution format: tar.gz Programming language: C++. Computer: PC, cluster, supercomputer. Operating system: POSIX. The code has been parallelized using MPI and tested with 1-32768 processes RAM: 10 MB-10 GB per process Classification: 4.12, 4.14, 6.5, 19.3, 19.10, 20. External routines: MPI-2 [1], boost [2], Zoltan [3], sfc++ [4] Nature of problem: Grid library supporting arbitrary data in grid cells, parallel adaptive mesh refinement, transparent remote neighbor data updates and

  9. Revisiting Parallel Cyclic Reduction and Parallel Prefix-Based Algorithms for Block Tridiagonal System of Equations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Seal, Sudip K; Perumalla, Kalyan S; Hirshman, Steven Paul

    2013-01-01

    Simulations that require solutions of block tridiagonal systems of equations rely on fast parallel solvers for runtime efficiency. Leading parallel solvers that are highly effective for general systems of equations, dense or sparse, are limited in scalability when applied to block tridiagonal systems. This paper presents scalability results as well as detailed analyses of two parallel solvers that exploit the special structure of block tridiagonal matrices to deliver superior performance, often by orders of magnitude. A rigorous analysis of their relative parallel runtimes is shown to reveal the existence of a critical block size that separates the parameter space spanned by the number of block rows, the block size and the processor count, into distinct regions that favor one or the other of the two solvers. Dependence of this critical block size on the above parameters as well as on machine-specific constants is established. These formal insights are supported by empirical results on up to 2,048 cores of a Cray XT4 system. To the best of our knowledge, this is the highest reported scalability for parallel block tridiagonal solvers to date.
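
    To make the idea of cyclic reduction concrete, the sketch below solves a scalar (not block) tridiagonal system by recursively eliminating the even-indexed unknowns. It is a serial illustration under simplifying assumptions, not the paper's parallel block solver: the rows updated within one reduction level are independent of each other, which is where a parallel version would distribute the work. No pivoting is done, so a diagonally dominant matrix is assumed.

```python
def cyclic_reduction(a, b, c, d):
    """Solve a[i]*x[i-1] + b[i]*x[i] + c[i]*x[i+1] = d[i] by cyclic reduction.

    a[0] and c[-1] are ignored (treated as zero).
    """
    n = len(b)
    if n == 1:
        return [d[0] / b[0]]
    a = [0.0] + list(a[1:])
    c = list(c[:-1]) + [0.0]
    ra, rb, rc, rd = [], [], [], []
    # Eliminate even-indexed unknowns from every odd-indexed row.
    # Each odd row is updated independently of the other odd rows.
    for i in range(1, n, 2):
        alpha = a[i] / b[i - 1]
        lower = -alpha * a[i - 1]
        diag = b[i] - alpha * c[i - 1]
        rhs = d[i] - alpha * d[i - 1]
        upper = 0.0
        if i + 1 < n:
            gamma = c[i] / b[i + 1]
            diag -= gamma * a[i + 1]
            rhs -= gamma * d[i + 1]
            upper = -gamma * c[i + 1]
        ra.append(lower)
        rb.append(diag)
        rc.append(upper)
        rd.append(rhs)
    x = [0.0] * n
    # The reduced system is again tridiagonal, in the odd unknowns only.
    x[1::2] = cyclic_reduction(ra, rb, rc, rd)
    # Back-substitute for the even-indexed unknowns.
    for i in range(0, n, 2):
        left = a[i] * x[i - 1] if i > 0 else 0.0
        right = c[i] * x[i + 1] if i + 1 < n else 0.0
        x[i] = (d[i] - left - right) / b[i]
    return x
```

    Each level halves the number of unknowns, so the recursion depth grows only logarithmically with the system size.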

  10. Fragment-assisted hit investigation involving integrated HTS and fragment screening: Application to the identification of phosphodiesterase 10A (PDE10A) inhibitors.

    PubMed

    Varnes, Jeffrey G; Geschwindner, Stefan; Holmquist, Christopher R; Forst, Janet; Wang, Xia; Dekker, Niek; Scott, Clay W; Tian, Gaochao; Wood, Michael W; Albert, Jeffrey S

    2016-01-01

    Fragment-based drug design (FBDD) relies on direct elaboration of fragment hits and typically requires high resolution structural information to guide optimization. In fragment-assisted drug discovery (FADD), fragments provide information to guide selection and design but do not serve as starting points for elaboration. We describe FADD and high-throughput screening (HTS) campaign strategies conducted in parallel against PDE10A where fragment hit co-crystallography was not available. The fragment screen led to prioritized fragment hits (IC50's ∼500μM), which were used to generate a hypothetical core scaffold. Application of this scaffold as a filter to HTS output afforded a 4μM hit, which, after preparation of a small number of analogs, was elaborated into a 16nM lead. This approach highlights the strength of FADD, as fragment methods were applied despite the absence of co-crystallographical information to efficiently identify a lead compound for further optimization. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. 3D Parallel Multigrid Methods for Real-Time Fluid Simulation

    NASA Astrophysics Data System (ADS)

    Wan, Feifei; Yin, Yong; Zhang, Suiyu

    2018-03-01

    The multigrid method is widely used in fluid simulation because of its strong convergence. In addition to accuracy, computational efficiency is an important factor in enabling real-time fluid simulation in computer graphics. To address this, we compare the performance of Algebraic Multigrid and Geometric Multigrid in the V-Cycle and Full-Cycle schemes, respectively, and analyze the convergence and speed of the different methods. All calculations in this paper are performed in parallel on the GPU. Finally, we experiment with 3D grids at each scale and give the exact experimental results.

  12. Wake Encounter Analysis for a Closely Spaced Parallel Runway Paired Approach Simulation

    NASA Technical Reports Server (NTRS)

    Mckissick, Burnell T.; Rico-Cusi, Fernando J.; Murdoch, Jennifer; Oseguera-Lohr, Rosa M.; Stough, Harry P, III; O'Connor, Cornelius J.; Syed, Hazari I.

    2009-01-01

    A Monte Carlo simulation of simultaneous approaches performed by two transport category aircraft from the final approach fix to a pair of closely spaced parallel runways was conducted to explore the aft boundary of the safe zone in which separation assurance and wake avoidance are provided. The simulation included variations in runway centerline separation, initial longitudinal spacing of the aircraft, crosswind speed, and aircraft speed during the approach. The data from the simulation showed that the majority of the wake encounters occurred near or over the runway and the aft boundaries of the safe zones were identified for all simulation conditions.

  13. Evolution Nonlinear Diffusion-Convection PDE Models for Spectrogram Enhancement

    NASA Astrophysics Data System (ADS)

    Dugnol, B.; Fernández, C.; Galiano, G.; Velasco, J.

    2008-09-01

    In previous works we studied the application of PDE-based image processing techniques applied to the spectrogram of audio signals in order to improve the readability of the signal. In particular we considered the implementation of the nonlinear diffusive model proposed by Álvarez, Lions and Morel [1](ALM) combined with a convective term inspired by the differential reassignment proposed by Chassandre-Mottin, Daubechies, Auger and Flandrin [2]-[3]. In this work we consider the possibility of replacing the diffusive model of ALM by diffusive terms in divergence form. In particular we implement finite element approximations of nonlinear diffusive terms studied by Chen, Levine, Rao [4] and Antontsev, Shmarev [5]-[8] with a convective term.

  14. A cable-driven parallel robots application: modelling and simulation of a dynamic cable model in Dymola

    NASA Astrophysics Data System (ADS)

    Othman, M. F.; Kurniawan, R.; Schramm, D.; Ariffin, A. K.

    2018-05-01

    Modeling a cable that dynamically varies in length, mass and stiffness in a multibody dynamics simulation tool is a challenging task. Simulation of cable-driven parallel robots (CDPR), for instance, requires a cable model that can dynamically change in length for every desired pose of the platform. Thus, in this paper, a detailed procedure for modeling and simulation of a dynamic cable model in Dymola is proposed. The approach is also applicable to other types of Modelica simulation environments. The cable is modeled using standard mechanical elements like mass, spring, damper and joint. The parameters of the cable model are based on the factsheet of the manufacturer and experimental results. Its dynamic ability is tested by applying it to a complete planar CDPR model in which the parameters are based on a prototype named CABLAR, which was developed at the Chair of Mechatronics, University of Duisburg-Essen. The prototype has been developed to demonstrate an application of CDPR as a goods storage and retrieval machine. The performance of the cable model during the simulation is analyzed and discussed.

  15. Parallel Implementation of the Discontinuous Galerkin Method

    NASA Technical Reports Server (NTRS)

    Baggag, Abdalkader; Atkins, Harold; Keyes, David

    1999-01-01

    This paper describes a parallel implementation of the discontinuous Galerkin method. Discontinuous Galerkin is a spatially compact method that retains its accuracy and robustness on non-smooth unstructured grids and is well suited for time dependent simulations. Several parallelization approaches are studied and evaluated. The most natural and symmetric of the approaches has been implemented in an object-oriented code used to simulate aeroacoustic scattering. The parallel implementation is MPI-based and has been tested on various parallel platforms such as the SGI Origin, IBM SP2, and clusters of SGI and Sun workstations. The scalability results presented for the SGI Origin show slightly superlinear speedup on a fixed-size problem due to cache effects.

  16. Identification, characterization and subcellular localization of TcPDE1, a novel cAMP-specific phosphodiesterase from Trypanosoma cruzi.

    PubMed Central

    D'Angelo, Maximiliano A; Sanguineti, Santiago; Reece, Jeffrey M; Birnbaumer, Lutz; Torres, Héctor N; Flawiá, Mirtha M

    2004-01-01

    Compartmentalization of cAMP phosphodiesterases plays a key role in the regulation of cAMP signalling in mammals. In the present paper, we report the characterization and subcellular localization of TcPDE1, the first cAMP-specific phosphodiesterase to be identified from Trypanosoma cruzi. TcPDE1 is part of a small gene family and encodes a 929-amino-acid protein that can complement a heat-shock-sensitive yeast mutant deficient in phosphodiesterase genes. Recombinant TcPDE1 strongly associates with membranes and cannot be released with NaCl or sodium cholate, suggesting that it is an integral membrane protein. This enzyme is specific for cAMP and its activity is not affected by cGMP, Ca2+, calmodulin or fenotiazinic inhibitors. TcPDE1 is sensitive to the phosphodiesterase inhibitor dipyridamole but is resistant to 3-isobutyl-1-methylxanthine, theophylline, rolipram and zaprinast. Papaverine, erythro-9-(2-hydroxy-3-nonyl)-adenine hydrochloride, and vinpocetine are poor inhibitors of this enzyme. Confocal laser scanning of T. cruzi epimastigotes showed that TcPDE1 is associated with the plasma membrane and concentrated in the flagellum of the parasite. The association of TcPDE1 with this organelle was confirmed by subcellular fractionation and cell-disruption treatments. The localization of this enzyme is a unique feature that distinguishes it from all the trypanosomatid phosphodiesterases described so far and indicates that compartmentalization of cAMP phosphodiesterases could also be important in these parasites. PMID:14556647

  17. Scalable Parallel Density-based Clustering and Applications

    NASA Astrophysics Data System (ADS)

    Patwary, Mostofa Ali

    2014-04-01

    Recently, density-based clustering algorithms (DBSCAN and OPTICS) have received significant attention from the scientific community due to their unique capability of discovering arbitrarily shaped clusters and eliminating noise data. These algorithms have several applications that require high performance computing, including finding halos and subhalos (clusters) from massive cosmology data in astrophysics, analyzing satellite images, X-ray crystallography, and anomaly detection. However, parallelization of these algorithms is extremely challenging, as they exhibit an inherently sequential data access order and an unbalanced workload, resulting in low parallel efficiency. To break the data access sequentiality and to achieve high parallelism, we develop new parallel algorithms, both for DBSCAN and OPTICS, designed using graph algorithmic techniques. For example, our parallel DBSCAN algorithm exploits the similarities between DBSCAN and computing connected components. Using datasets containing up to a billion floating point numbers, we show that our parallel density-based clustering algorithms significantly outperform the existing algorithms, achieving speedups up to 27.5 on 40 cores on shared memory architecture and speedups up to 5,765 using 8,192 cores on distributed memory architecture. In our experiments, we found that while achieving the scalability, our algorithms produce clustering results with comparable quality to the classical algorithms.
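
    The link between DBSCAN and connected components mentioned in this abstract can be illustrated with a small serial sketch. It is not the author's parallel algorithm: it assumes a brute-force O(n^2) neighbour search and a plain union-find structure, and the names `dbscan_components`, `find` and `union` are hypothetical. Core points that lie within `eps` of each other are merged into one component, border points join a neighbouring core point's component, and everything else is labelled noise (-1).

```python
def find(parent, i):
    # Union-find lookup with path halving.
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def union(parent, a, b):
    # Merge the components containing a and b.
    ra, rb = find(parent, a), find(parent, b)
    if ra != rb:
        parent[rb] = ra


def dbscan_components(points, eps, min_pts):
    """Label points by cluster; -1 marks noise (illustrative sketch only)."""
    n = len(points)

    def near(i, j):
        # Squared Euclidean distance test against eps.
        return sum((a - b) ** 2 for a, b in zip(points[i], points[j])) <= eps ** 2

    # Each neighbourhood includes the point itself.
    neighbors = [[j for j in range(n) if near(i, j)] for i in range(n)]
    core = [len(nb) >= min_pts for nb in neighbors]

    parent = list(range(n))
    # Core points within eps of each other form one connected component.
    for i in range(n):
        if core[i]:
            for j in neighbors[i]:
                if core[j]:
                    union(parent, i, j)

    labels = [-1] * n
    for i in range(n):
        if core[i]:
            labels[i] = find(parent, i)
        else:
            # Border point: adopt the component of the first core neighbour.
            for j in neighbors[i]:
                if core[j]:
                    labels[i] = find(parent, j)
                    break
    return labels
```

    For example, calling `dbscan_components` on two well-separated triples of points plus one isolated point, with `eps=1.5` and `min_pts=3`, gives two distinct cluster labels and marks the isolated point as noise. Because the union operations can be applied in any order, this formulation avoids the sequential point-by-point expansion of classical DBSCAN, which is the property a parallel version would rely on.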

  18. Heterologous desensitization of cardiac β-adrenergic signal via hormone-induced βAR/arrestin/PDE4 complexes.

    PubMed

    Shi, Qian; Li, Minghui; Mika, Delphine; Fu, Qin; Kim, Sungjin; Phan, Jason; Shen, Ao; Vandecasteele, Gregoire; Xiang, Yang K

    2017-05-01

    Cardiac β-adrenergic receptor (βAR) signalling is susceptible to heterologous desensitization by different neurohormonal stimuli in clinical conditions associated with heart failure. We aim to examine the underlying mechanism of cross talk between βARs and a set of G-protein coupled receptors (GPCRs) activated by hormones/agonists. Rat ventricular cardiomyocytes were used to determine heterologous phosphorylation of βARs under a series of GPCR agonists. Activation of Gs-coupled dopamine receptor, adenosine receptor, relaxin receptor and prostaglandin E2 receptor, and Gq-coupled α1 adrenergic receptor and angiotensin II type 1 receptor promotes phosphorylation of β1AR and β2AR at putative protein kinase A (PKA) phosphorylation sites; but activation of Gi-coupled α2 adrenergic receptor and activation of protease-activated receptor does not. The GPCR agonists that promote β2AR phosphorylation effectively inhibit βAR agonist isoproterenol-induced PKA phosphorylation of phospholamban and contractile function in ventricular cardiomyocytes. Heterologous GPCR stimuli have minimal to small effect on isoproterenol-induced β2AR activation and G-protein coupling for cyclic adenosine monophosphate (cAMP) production. However, these GPCR stimuli significantly promote phosphorylation of phosphodiesterase 4D (PDE4D), and recruit PDE4D to the phosphorylated β2AR in a β-arrestin 2 dependent manner without promoting β2AR endocytosis. The increased binding between β2AR and PDE4D effectively hydrolyzes cAMP signal generated by subsequent stimulation with isoproterenol. Mutation of PKA phosphorylation sites in β2AR, inhibition of PDE4, or genetic ablation of PDE4D or β-arrestin 2 abolishes this heterologous inhibitory effect. Ablation of β-arrestin 2 or PDE4D gene also rescues β-adrenergic stimuli-induced myocyte contractile function. These data reveal essential roles of β-arrestin 2 and PDE4D in a common mechanism for heterologous desensitization of cardiac

  19. Behavioral characterization of mice deficient in the phosphodiesterase-10A (PDE10A) enzyme on a C57/Bl6N congenic background.

    PubMed

    Siuciak, Judith A; McCarthy, Sheryl A; Chapin, Douglas S; Martin, Ashley N; Harms, John F; Schmidt, Christopher J

    2008-02-01

    The phenotype of genetically modified animals is strongly influenced by both the genetic background of the animal as well as environmental factors. We have previously reported the behavioral and neurochemical characterization of PDE10A knockout mice maintained on a DBA1LacJ (PDE10A(DBA)) genetic background. The aim of the present studies was to assess the behavioral and neurochemical phenotype of PDE10A knockout mice on an alternative congenic C57BL/6N (PDE10A(C57)) genetic background. Consistent with our previous results, PDE10A(C57) knockout mice showed a decrease in exploratory locomotor activity and a delay in the acquisition of conditioned avoidance responding. Also consistent with previous studies, the elimination of PDE10A did not alter basal levels of striatal cGMP or cAMP or affect behavior in several other well-characterized behavioral assays. PDE10A(C57) knockout mice showed a blunted response to MK-801, although to a lesser degree than previously observed in the PDE10A(DBA) knockout mice, and no differences were observed following a PCP challenge. PDE10A(C57) knockout mice showed a significant change in striatal dopamine turnover, which was accompanied by an enhanced locomotor response to AMPH. These studies demonstrate that while many of the behavioral effects of the PDE10A gene deletion appear to be independent of genetic background, the impact of the deletion on behavior can vary in magnitude. Furthermore, the effects on the dopaminergic system appear to be background-dependent, with significant effects observed only in knockout mice on the C57BL6N genetic background.

  20. A polymorphic reconfigurable emulator for parallel simulation

    NASA Technical Reports Server (NTRS)

    Parrish, E. A., Jr.; Mcvey, E. S.; Cook, G.

    1980-01-01

    Microprocessor and arithmetic support chip technology was applied to the design of a reconfigurable emulator for real time flight simulation. The system developed consists of a master control system to perform all man machine interactions and to configure the hardware to emulate a given aircraft, and numerous slave compute modules (SCM) which comprise the parallel computational units. It is shown that all parts of the state equations can be worked on simultaneously but that the algebraic equations cannot (unless they are slowly varying). Attempts to obtain algorithms that will allow parallel updates are reported. The word length and step size to be used in the SCM's are determined and the architecture of the hardware and software is described.

  1. Wakefield Simulation of CLIC PETS Structure Using Parallel 3D Finite Element Time-Domain Solver T3P

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Candel, A.; Kabel, A.; Lee, L.

    In recent years, SLAC's Advanced Computations Department (ACD) has developed the parallel 3D Finite Element electromagnetic time-domain code T3P. Higher-order Finite Element methods on conformal unstructured meshes and massively parallel processing allow unprecedented simulation accuracy for wakefield computations and simulations of transient effects in realistic accelerator structures. Applications include simulation of wakefield damping in the Compact Linear Collider (CLIC) power extraction and transfer structure (PETS).

  2. Scalable parallel communications

    NASA Technical Reports Server (NTRS)

    Maly, K.; Khanna, S.; Overstreet, C. M.; Mukkamala, R.; Zubair, M.; Sekhar, Y. S.; Foudriat, E. C.

    1992-01-01

    Coarse-grain parallelism in networking (that is, the use of multiple protocol processors running replicated software sending over several physical channels) can be used to provide gigabit communications for a single application. Since parallel network performance is highly dependent on real issues such as hardware properties (e.g., memory speeds and cache hit rates), operating system overhead (e.g., interrupt handling), and protocol performance (e.g., effect of timeouts), we have performed detailed simulation studies of both a bus-based multiprocessor workstation node (based on the Sun Galaxy MP multiprocessor) and a distributed-memory parallel computer node (based on the Touchstone DELTA) to evaluate the behavior of coarse-grain parallelism. Our results indicate: (1) coarse-grain parallelism can deliver multiple 100 Mbps with currently available hardware platforms and existing networking protocols (such as Transmission Control Protocol/Internet Protocol (TCP/IP) and parallel Fiber Distributed Data Interface (FDDI) rings); (2) scale-up is near linear in n, the number of protocol processors, and channels (for small n and up to a few hundred Mbps); and (3) since these results are based on existing hardware without specialized devices (except perhaps for some simple modifications of the FDDI boards), this is a low cost solution to providing multiple 100 Mbps on current machines. In addition, from both the performance analysis and the properties of these architectures, we conclude: (1) multiple processors providing identical services and the use of space division multiplexing for the physical channels can provide better reliability than monolithic approaches (it also provides graceful degradation and low-cost load balancing); (2) coarse-grain parallelism supports running several transport protocols in parallel to provide different types of service (for example, one TCP handles small messages for many users, other TCP's running in parallel provide high bandwidth

  3. Special purpose parallel computer architecture for real-time control and simulation in robotic applications

    NASA Technical Reports Server (NTRS)

    Fijany, Amir (Inventor); Bejczy, Antal K. (Inventor)

    1993-01-01

    This is a real-time robotic controller and simulator which is a MIMD-SIMD parallel architecture for interfacing with an external host computer and providing a high degree of parallelism in computations for robotic control and simulation. It includes a host processor for receiving instructions from the external host computer and for transmitting answers to the external host computer. There are a plurality of SIMD microprocessors, each SIMD processor being a SIMD parallel processor capable of exploiting fine grain parallelism and further being able to operate asynchronously to form a MIMD architecture. Each SIMD processor comprises a SIMD architecture capable of performing two matrix-vector operations in parallel while fully exploiting parallelism in each operation. There is a system bus connecting the host processor to the plurality of SIMD microprocessors and a common clock providing a continuous sequence of clock pulses. There is also a ring structure interconnecting the plurality of SIMD microprocessors and connected to the clock for providing the clock pulses to the SIMD microprocessors and for providing a path for the flow of data and instructions between the SIMD microprocessors. The host processor includes logic for controlling the RRCS by interpreting instructions sent by the external host computer, decomposing the instructions into a series of computations to be performed by the SIMD microprocessors, using the system bus to distribute associated data among the SIMD microprocessors, and initiating activity of the SIMD microprocessors to perform the computations on the data by procedure call.

  4. Accelerating population balance-Monte Carlo simulation for coagulation dynamics from the Markov jump model, stochastic algorithm and GPU parallel computing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xu, Zuwei; Zhao, Haibo, E-mail: klinsmannzhb@163.com; Zheng, Chuguang

    2015-01-15

    This paper proposes a comprehensive framework for accelerating population balance-Monte Carlo (PBMC) simulation of particle coagulation dynamics. By combining Markov jump model, weighted majorant kernel and GPU (graphics processing unit) parallel computing, a significant gain in computational efficiency is achieved. The Markov jump model constructs a coagulation-rule matrix of differentially-weighted simulation particles, so as to capture the time evolution of particle size distribution with low statistical noise over the full size range and as far as possible to reduce the number of time loopings. Here three coagulation rules are highlighted and it is found that constructing appropriate coagulation rule provides a route to attain the compromise between accuracy and cost of PBMC methods. Further, in order to avoid double looping over all simulation particles when considering the two-particle events (typically, particle coagulation), the weighted majorant kernel is introduced to estimate the maximum coagulation rates being used for acceptance–rejection processes by single-looping over all particles, and meanwhile the mean time-step of coagulation event is estimated by summing the coagulation kernels of rejected and accepted particle pairs. The computational load of these fast differentially-weighted PBMC simulations (based on the Markov jump model) is reduced greatly to be proportional to the number of simulation particles in a zero-dimensional system (single cell). Finally, for a spatially inhomogeneous multi-dimensional (multi-cell) simulation, the proposed fast PBMC is performed in each cell, and multiple cells are parallel processed by multi-cores on a GPU that can implement the massively threaded data-parallel tasks to obtain remarkable speedup ratio (comparing with CPU computation, the speedup ratio of GPU parallel computing is as high as 200 in a case of 100 cells with 10 000 simulation particles per cell). These accelerating approaches of PBMC

  5. A GaAs vector processor based on parallel RISC microprocessors

    NASA Astrophysics Data System (ADS)

    Misko, Tim A.; Rasset, Terry L.

    A vector processor architecture based on the development of a 32-bit microprocessor using gallium arsenide (GaAs) technology has been developed. The McDonnell Douglas vector processor (MVP) will be fabricated completely from GaAs digital integrated circuits. The MVP architecture includes a vector memory of 1 megabyte, a parallel bus architecture with eight processing elements connected in parallel, and a control processor. The processing elements consist of a reduced instruction set CPU (RISC) with four floating-point coprocessor units and necessary memory interface functions. This architecture has been simulated for several benchmark programs including complex fast Fourier transform (FFT), complex inner product, trigonometric functions, and sort-merge routine. The results of this study indicate that the MVP can process a 1024-point complex FFT at a speed of 112 microsec (389 megaflops) while consuming approximately 618 W of power in a volume of approximately 0.1 ft-cubed.

  6. Application of integration algorithms in a parallel processing environment for the simulation of jet engines

    NASA Technical Reports Server (NTRS)

    Krosel, S. M.; Milner, E. J.

    1982-01-01

    The application of predictor-corrector integration algorithms developed for the digital parallel processing environment is investigated. The algorithms are implemented and evaluated through the use of a software simulator which provides an approximate representation of the parallel processing hardware. Test cases which focus on the use of the algorithms are presented and a specific application using a linear model of a turbofan engine is considered. Results are presented showing the effects of integration step size and the number of processors on simulation accuracy. Real time performance, interprocessor communication, and algorithm startup are also discussed.
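
    As a reminder of what a predictor-corrector integrator does, and of how step size affects accuracy, here is a minimal serial sketch. It is not one of the algorithms from the report: it assumes Heun's method (an explicit Euler predictor followed by a trapezoidal corrector), and the scalar model `decay`, standing in for a linear engine model, is purely hypothetical.

```python
import math


def heun_step(f, t, x, h):
    # Predictor: explicit Euler estimate of the state at t + h.
    k1 = f(t, x)
    x_pred = x + h * k1
    # Corrector: trapezoidal rule using the predicted slope.
    k2 = f(t + h, x_pred)
    return x + 0.5 * h * (k1 + k2)


def integrate(f, x0, t0, t1, n_steps):
    # Advance from t0 to t1 with a fixed step size.
    h = (t1 - t0) / n_steps
    t, x = t0, x0
    for _ in range(n_steps):
        x = heun_step(f, t, x, h)
        t += h
    return x


def decay(t, x):
    # Hypothetical linear test model dx/dt = -2x, exact solution exp(-2t).
    return -2.0 * x


exact = math.exp(-2.0)
err_coarse = abs(integrate(decay, 1.0, 0.0, 1.0, 10) - exact)
err_fine = abs(integrate(decay, 1.0, 0.0, 1.0, 100) - exact)
```

    Since the method is second-order accurate, reducing the step size by a factor of ten should reduce the error by roughly a factor of one hundred on this smooth problem, which is the kind of step-size dependence the abstract refers to.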

  7. Photoaffinity labelling of cyclic GMP-inhibited phosphodiesterase (PDE III) in human and rat platelets and rat tissues: effects of phosphodiesterase inhibitors.

    PubMed

    Tang, K M; Jang, E K; Haslam, R J

    1994-06-15

    Ultraviolet irradiation of human platelet cytosol in the presence of 32P-labelled cyclic GMP (cGMP) can specifically label 110, 80, 55, 49 and 38 kDa proteins; the 110 kDa species is the subunit of cGMP-inhibited phosphodiesterase (PDE III) and the 80 kDa species that of cGMP-dependent protein kinase (Tang et al., 1993, Biochem. J. 294, 329). We have now shown that although photolabelling of platelet PDE III was inhibited by unlabelled cGMP, 8-bromo-cGMP and cyclic AMP (cAMP), it was not affected by phosphorothioate analogues of these cyclic nucleotides. Specific concentration-dependent inhibitions of the photolabelling of PDE III were observed with the following PDE inhibitors: trequinsin (IC50 = 13 +/- 2 nM), lixazinone (IC50 = 22 +/- 4 nM), milrinone (IC50 = 56 +/- 12 nM), cilostamide (IC50 = 70 +/- 9 nM), siguazodan (IC50 = 117 +/- 29 nM) and 3-isobutyl 1-methylxanthine (IBMX) (IC50 = 3950 +/- 22 nM). Thus, measurements of the inhibitory effects of compounds on the photolabelling of platelet PDE III provide a simple quantitative means of investigating their actions at a molecular level that avoids the need to purify the enzyme. Photolabelling of rat platelet lysate or rat heart homogenate by [32P]cGMP showed that the 110 kDa PDE III present in human material was replaced by a 115 kDa protein, labelling of which was also blocked by PDE III inhibitors. Heart and other rat tissues contained much less of this putative 115 kDa PDE III than rat platelets. In contrast, the 80 kDa protein was labelled much less in platelets than in many other rat tissue homogenates (e.g., heart, aorta, uterus and lung). Thus, comparison of the relative amounts of specific photolabelled proteins in different cells may provide an indication of different patterns of cyclic nucleotide action. We compared the abilities of phosphodiesterase inhibitors to block the photolabelling of PDE III in human platelet cytosol and to increase the iloprost-stimulated accumulation of cAMP in intact

  8. Progress in the Simulation of Steady and Time-Dependent Flows with 3D Parallel Unstructured Cartesian Methods

    NASA Technical Reports Server (NTRS)

    Aftosmis, M. J.; Berger, M. J.; Murman, S. M.; Kwak, Dochan (Technical Monitor)

    2002-01-01

    The proposed paper will present recent extensions in the development of an efficient Euler solver for adaptively-refined Cartesian meshes with embedded boundaries. The paper will focus on extensions of the basic method to include solution adaptation, time-dependent flow simulation, and arbitrary rigid domain motion. The parallel multilevel method makes use of on-the-fly parallel domain decomposition to achieve extremely good scalability on large numbers of processors, and is coupled with an automatic coarse mesh generation algorithm for efficient processing by a multigrid smoother. Numerical results are presented demonstrating parallel speed-ups of up to 435 on 512 processors. Solution-based adaptation may be keyed off truncation error estimates using tau-extrapolation or a variety of feature detection based refinement parameters. The multigrid method is extended to time-dependent flows through the use of a dual-time approach. The extension to rigid domain motion uses an Arbitrary Lagrangian-Eulerian (ALE) formulation, and results will be presented for a variety of two- and three-dimensional example problems with both simple and complex geometry.
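
    The quoted speed-up figure can be turned into a parallel efficiency with a short calculation. The sketch below is only arithmetic on the numbers stated in the abstract; the function names are hypothetical, and the Karp-Flatt serial fraction is an additional standard diagnostic that the abstract itself does not report.

```python
def parallel_efficiency(speedup, processors):
    # Fraction of ideal (linear) speed-up actually achieved.
    return speedup / processors


def karp_flatt(speedup, processors):
    # Experimentally determined serial fraction (Karp-Flatt metric).
    return (1.0 / speedup - 1.0 / processors) / (1.0 - 1.0 / processors)


efficiency = parallel_efficiency(435, 512)
serial_fraction = karp_flatt(435, 512)
```

    A speed-up of 435 on 512 processors corresponds to an efficiency of about 85 percent.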

  9. Schnek: A C++ library for the development of parallel simulation codes on regular grids

    NASA Astrophysics Data System (ADS)

    Schmitz, Holger

    2018-05-01

    A large number of algorithms across the field of computational physics are formulated on grids with a regular topology. We present Schnek, a library that enables fast development of parallel simulations on regular grids. Schnek contains a number of easy-to-use modules that greatly reduce the amount of administrative code for large-scale simulation codes. The library provides an interface for reading simulation setup files with a hierarchical structure. The structure of the setup file is translated into a hierarchy of simulation modules that the developer can specify. The reader parses and evaluates mathematical expressions and initialises variables or grid data. This enables developers to write modular and flexible simulation codes with minimal effort. Regular grids of arbitrary dimension are defined as well as mechanisms for defining physical domain sizes, grid staggering, and ghost cells on these grids. Ghost cells can be exchanged between neighbouring processes using MPI with a simple interface. The grid data can easily be written into HDF5 files using serial or parallel I/O.
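
    The ghost-cell exchange described in this abstract can be illustrated without MPI by emulating the subdomains inside one process. The sketch below is not Schnek's interface: it assumes a 1D periodic grid whose length is divisible by the number of parts, one ghost cell per side, and hypothetical helper names. Each part copies its neighbours' edge values into its ghost cells and can then apply a three-point stencil locally.

```python
def split_with_ghosts(u, n_parts):
    # Contiguous chunks, each padded with one ghost cell on either side.
    size = len(u) // n_parts
    return [[0.0] + u[p * size:(p + 1) * size] + [0.0] for p in range(n_parts)]


def exchange_ghosts(parts):
    # Fill ghost cells from the periodic left and right neighbours.
    n = len(parts)
    for p in range(n):
        left = parts[(p - 1) % n]
        right = parts[(p + 1) % n]
        parts[p][0] = left[-2]    # last interior cell of the left neighbour
        parts[p][-1] = right[1]   # first interior cell of the right neighbour


def smooth_local(part):
    # Three-point average over interior cells only.
    return [(part[i - 1] + part[i] + part[i + 1]) / 3.0
            for i in range(1, len(part) - 1)]


def smooth_global(u):
    # Reference: the same stencil on the undivided periodic grid.
    n = len(u)
    return [(u[i - 1] + u[i] + u[(i + 1) % n]) / 3.0 for i in range(n)]


u = [float(i * i) for i in range(8)]
parts = split_with_ghosts(u, 2)
exchange_ghosts(parts)
joined = [value for part in parts for value in smooth_local(part)]
```

    After the exchange, the concatenated local results match the result of applying the stencil to the whole grid. In a distributed code the two copy statements in `exchange_ghosts` would be replaced by messages between neighbouring processes.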

  10. Parallel Unsteady Turbopump Simulations for Liquid Rocket Engines

    NASA Technical Reports Server (NTRS)

    Kiris, Cetin C.; Kwak, Dochan; Chan, William

    2000-01-01

    This paper reports the progress being made towards complete turbo-pump simulation capability for liquid rocket engines. The Space Shuttle Main Engine (SSME) turbo-pump impeller is used as a test case for the performance evaluation of the MPI and hybrid MPI/OpenMP versions of the INS3D code. Then, a computational model of a turbo-pump has been developed for the shuttle upgrade program. Relative motion of the grid system for rotor-stator interaction was obtained by employing overset grid techniques. Time-accuracy of the scheme has been evaluated by using simple test cases. Unsteady computations for the SSME turbo-pump, which contains 136 zones with 35 million grid points, are currently underway on Origin 2000 systems at NASA Ames Research Center. Results from time-accurate simulations with moving boundary capability, and the performance of the parallel versions of the code will be presented in the final paper.

  11. Globalized Newton-Krylov-Schwarz Algorithms and Software for Parallel Implicit CFD

    NASA Technical Reports Server (NTRS)

    Gropp, W. D.; Keyes, D. E.; McInnes, L. C.; Tidriri, M. D.

    1998-01-01

    Implicit solution methods are important in applications modeled by PDEs with disparate temporal and spatial scales. Because such applications require high resolution with reasonable turnaround, "routine" parallelization is essential. The pseudo-transient matrix-free Newton-Krylov-Schwarz (Psi-NKS) algorithmic framework is presented as an answer. We show that, for the classical problem of three-dimensional transonic Euler flow about an M6 wing, Psi-NKS can simultaneously deliver: globalized, asymptotically rapid convergence through adaptive pseudo-transient continuation and Newton's method; reasonable parallelizability for an implicit method through deferred synchronization and favorable communication-to-computation scaling in the Krylov linear solver; and high per-processor performance through attention to distributed memory and cache locality, especially through the Schwarz preconditioner. Two discouraging features of Psi-NKS methods are their sensitivity to the coding of the underlying PDE discretization and the large number of parameters that must be selected to govern convergence. We therefore distill several recommendations from our experience and from our reading of the literature on various algorithmic components of Psi-NKS, and we describe a freely available, MPI-based portable parallel software implementation of the solver employed here.

  12. The PDE4 inhibitor CHF-6001 and LAMAs inhibit bronchoconstriction-induced remodeling in lung slices.

    PubMed

    Kistemaker, Loes E M; Oenema, Tjitske A; Baarsma, Hoeke A; Bos, I Sophie T; Schmidt, Martina; Facchinetti, Fabrizio; Civelli, Maurizio; Villetti, Gino; Gosens, Reinoud

    2017-09-01

    Combination therapy of PDE4 inhibitors and anticholinergics induces bronchoprotection in COPD. Mechanical forces that arise during bronchoconstriction may contribute to airway remodeling. Therefore, we investigated the impact of PDE4 inhibitors and anticholinergics on bronchoconstriction-induced remodeling. Because of the different mechanism of action of PDE4 inhibitors and anticholinergics, we hypothesized functional interactions of these two drug classes. Guinea pig precision-cut lung slices were preincubated with the PDE4 inhibitors CHF-6001 or roflumilast and/or the anticholinergics tiotropium or glycopyrrolate, followed by stimulation with methacholine (10 μM) or TGF-β1 (2 ng/ml) for 48 h. The inhibitory effects on airway smooth muscle remodeling, airway contraction, and TGF-β release were investigated. Methacholine-induced protein expression of smooth muscle-myosin was fully inhibited by CHF-6001 (0.3-100 nM), whereas roflumilast (1 µM) had smaller effects. Tiotropium and glycopyrrolate fully inhibited methacholine-induced airway remodeling (0.1-30 nM). The combination of CHF-6001 and tiotropium or glycopyrrolate, in concentrations partially effective by themselves, fully inhibited methacholine-induced remodeling in combination. CHF-6001 did not affect airway closure and had limited effects on TGF-β1-induced remodeling, but rather, it inhibited methacholine-induced TGF-β release. The PDE4 inhibitor CHF-6001, and to a lesser extent roflumilast, and the LAMAs tiotropium and glycopyrrolate inhibit bronchoconstriction-induced remodeling. The combination of CHF-6001 and anticholinergics was more effective than the individual compounds. This cooperativity might be explained by the distinct mechanisms of action inhibiting TGF-β release and bronchoconstriction. Copyright © 2017 the American Physiological Society.

  13. PDE1 Encodes a P-Type ATPase Involved in Appressorium-Mediated Plant Infection by the Rice Blast Fungus Magnaporthe grisea

    PubMed Central

    Balhadère, Pascale V.; Talbot, Nicholas J.

    2001-01-01

    Plant infection by the rice blast fungus Magnaporthe grisea is brought about by the action of specialized infection cells called appressoria. These infection cells generate enormous turgor pressure, which is translated into an invasive force that allows a narrow penetration hypha to breach the plant cuticle. The Magnaporthe pde1 mutant was identified previously by restriction enzyme–mediated DNA integration mutagenesis and is impaired in its ability to elaborate penetration hyphae. Here we report that the pde1 mutation is the result of an insertion into the promoter of a P-type ATPase-encoding gene. Targeted gene disruption confirmed the role of PDE1 in penetration hypha development and pathogenicity but highlighted potential differences in PDE1 regulation in different Magnaporthe strains. The predicted PDE1 gene product was most similar to members of the aminophospholipid translocase group of P-type ATPases and was shown to be a functional homolog of the yeast ATPase gene ATC8. Spatial expression studies showed that PDE1 is expressed in germinating conidia and developing appressoria. These findings implicate the action of aminophospholipid translocases in the development of penetration hyphae and the proliferation of the fungus beyond colonization of the first epidermal cell. PMID:11549759

  14. A parallel row-based algorithm with error control for standard-cell replacement on a hypercube multiprocessor

    NASA Technical Reports Server (NTRS)

    Sargent, Jeff Scott

    1988-01-01

    A new row-based parallel algorithm for standard-cell placement targeted for execution on a hypercube multiprocessor is presented. Key features of this implementation include a dynamic simulated-annealing schedule, row-partitioning of the VLSI chip image, and two novel approaches to controlling error in parallel cell-placement algorithms: Heuristic Cell-Coloring and Adaptive (Parallel Move) Sequence Control. Heuristic Cell-Coloring identifies sets of noninteracting cells that can be moved repeatedly, and in parallel, with no buildup of error in the placement cost. Adaptive Sequence Control allows multiple parallel cell moves to take place between global cell-position updates. This feedback mechanism is based on an error bound derived analytically from the traditional annealing move-acceptance profile. Placement results are presented for real industry circuits, and the performance of an implementation on the Intel iPSC/2 Hypercube is summarized. This algorithm runs 5 to 16 times faster than a previous program developed for the Hypercube, while producing equivalent quality placement. An integrated place and route program for the Intel iPSC/2 Hypercube is currently being developed.

  15. Parallel Computing for Brain Simulation.

    PubMed

    Pastur-Romay, L A; Porto-Pazos, A B; Cedron, F; Pazos, A

    2017-01-01

    The human brain is the most complex system in the known universe and is therefore one of the greatest mysteries. It provides human beings with extraordinary abilities. However, it is still not understood how and why most of these abilities are produced. For decades, researchers have been trying to make computers reproduce these abilities, focusing both on understanding the nervous system and on processing data in a more efficient way than before. Their aim is to make computers process information similarly to the brain. Important technological developments and vast multidisciplinary projects have allowed creating the first simulation with a number of neurons similar to that of a human brain. This paper presents an up-to-date review about the main research projects that are trying to simulate and/or emulate the human brain. They employ different types of computational models using parallel computing: digital models, analog models and hybrid models. This review includes the current applications of these works, as well as future trends. It is focused on various works that look for advanced progress in Neuroscience and still others which seek new discoveries in Computer Science (neuromorphic hardware, machine learning techniques). Their most outstanding characteristics are summarized and the latest advances and future plans are presented. In addition, this review points out the importance of considering not only neurons: Computational models of the brain should also include glial cells, given the proven importance of astrocytes in information processing. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  16. CHOLLA: A New Massively Parallel Hydrodynamics Code for Astrophysical Simulation

    NASA Astrophysics Data System (ADS)

    Schneider, Evan E.; Robertson, Brant E.

    2015-04-01

    We present Computational Hydrodynamics On ParaLLel Architectures (Cholla), a new three-dimensional hydrodynamics code that harnesses the power of graphics processing units (GPUs) to accelerate astrophysical simulations. Cholla models the Euler equations on a static mesh using state-of-the-art techniques, including the unsplit Corner Transport Upwind algorithm, a variety of exact and approximate Riemann solvers, and multiple spatial reconstruction techniques including the piecewise parabolic method (PPM). Using GPUs, Cholla evolves the fluid properties of thousands of cells simultaneously and can update over 10 million cells per GPU-second while using an exact Riemann solver and PPM reconstruction. Owing to the massively parallel architecture of GPUs and the design of the Cholla code, astrophysical simulations with physically interesting grid resolutions (≳256^3) can easily be computed on a single device. We use the Message Passing Interface library to extend calculations onto multiple devices and demonstrate nearly ideal scaling beyond 64 GPUs. A suite of test problems highlights the physical accuracy of our modeling and provides a useful comparison to other codes. We then use Cholla to simulate the interaction of a shock wave with a gas cloud in the interstellar medium, showing that the evolution of the cloud is highly dependent on its density structure. We reconcile the computed mixing time of a turbulent cloud with a realistic density distribution destroyed by a strong shock with the existing analytic theory for spherical cloud destruction by describing the system in terms of its median gas density.

  17. A study of the parallel algorithm for large-scale DC simulation of nonlinear systems

    NASA Astrophysics Data System (ADS)

    Cortés Udave, Diego Ernesto; Ogrodzki, Jan; Gutiérrez de Anda, Miguel Angel

    Newton-Raphson DC analysis of large-scale nonlinear circuits may be an extremely time consuming process even if sparse matrix techniques and bypassing of nonlinear models calculation are used. A slight decrease in the time required for this task may be enabled on multi-core, multithread computers if the calculation of the mathematical models for the nonlinear elements as well as the stamp management of the sparse matrix entries are managed through concurrent processes. This numerical complexity can be further reduced via the circuit decomposition and parallel solution of blocks taking as a departure point the BBD matrix structure. This block-parallel approach may give a considerable profit though it is strongly dependent on the system topology and, of course, on the processor type. This contribution presents the easy-parallelizable decomposition-based algorithm for DC simulation and provides a detailed study of its effectiveness.

  18. PDE7B is involved in nandrolone decanoate hydrolysis in liver cytosol and its transcription is up-regulated by androgens in HepG2.

    PubMed

    Strahm, Emmanuel; Rane, Anders; Ekström, Lena

    2014-01-01

    Most androgenic drugs are available as esters for a prolonged depot action. However, the enzymes involved in the hydrolysis of the esters have not been identified. There is one study indicating that PDE7B may be involved in the activation of testosterone enanthate. The aims are to identify the cellular compartments where the hydrolysis of testosterone enanthate and nandrolone decanoate occurs, and to investigate the involvement of PDE7B in the activation. We also determined if testosterone and nandrolone affect the expression of the PDE7B gene. The hydrolysis studies were performed in isolated human liver cytosolic and microsomal preparations with and without specific PDE7B inhibitor. The gene expression was studied in human hepatoma cells (HepG2) exposed to testosterone and nandrolone. We show that PDE7B serves as a catalyst of the hydrolysis of testosterone enanthate and nandrolone decanoate in liver cytosol. The gene expression of PDE7B was significantly induced 3- and 5- fold after 2 h exposure to 1 μM testosterone enanthate and nandrolone decanoate, respectively. These results show that PDE7B is involved in the activation of esterified nandrolone and testosterone and that the gene expression of PDE7B is induced by supra-physiological concentrations of androgenic drugs.

  19. N Termini of apPDE4 Isoforms Are Responsible for Targeting the Isoforms to Different Cellular Membranes

    ERIC Educational Resources Information Center

    Jang, Deok-Jin; Park, Soo-Won; Lee, Jin-A; Lee, Changhoon; Chae, Yeon-Su; Park, Hyungju; Kim, Min-Jeong; Choi, Sun-Lim; Lee, Nuribalhae; Kim, Hyoung; Kaang, Bong-Kiun

    2010-01-01

    Phosphodiesterases (PDEs) are known to play a key role in the compartmentalization of cAMP signaling; however, the molecular mechanisms underlying intracellular localization of different PDE isoforms are not understood. In this study, we have found that each of the supershort, short, and long forms of apPDE4 showed distinct localization in the…

  20. Topical otic drugs in a multi-purpose manufacturing facility: a guide on determination and application of permitted daily exposure (PDE).

    PubMed

    Wiesner, Lisa; Prause, Maarten; Lovsin Barle, Ester

    2018-03-01

    Due to the newly introduced EU GMP (Good Manufacturing Practice) guideline for Medicinal Products for Human and Veterinary Use, product-specific permitted daily exposure (PDE) values for toxicological evaluation in multi-purpose facilities are required within a documented process for risk assessment. European Medicines Agency (EMA) guidance on setting PDE limits has so far focused on systemic administration routes such as intravenous (IV), oral or inhalation. This article provides guidance on setting PDE values for risk management purposes in multi-purpose facilities for active pharmaceutical ingredients (APIs) applied as topical otic drugs to the outer ear canal. The PDE otic thus determined is used for the calculation of maximum safe carry-over (MSC) in manufacturing scenarios where a topical otic product is manufactured followed by another topical otic product.

  1. Noninvasive aortic bloodflow by Pulsed Doppler Echocardiography (PDE) compared to cardiac output by the direct Fick procedure

    NASA Technical Reports Server (NTRS)

    1980-01-01

    Left ventricular stroke volume was estimated from the systolic velocity integral in the ascending aorta by pulsed Doppler Echocardiography (PDE) and the cross sectional area of the aorta estimated by M mode echocardiography on 15 patients with coronary disease undergoing right catheterization for diagnostic purposes. Cardiac output was calculated from stroke volume and heart rate using the PDE method as well as the Fick procedure for comparison. The mean value for the cardiac output via the PDE method (4.42 L/min) was only 6% lower than for the cardiac output obtained from the Fick procedure (4.69 L/min) and the correlation between the two methods was excellent (r=0.967, p less than .01). The good agreement between the two methods demonstrates that the PDE technique offers a reliable noninvasive alternative for estimating cardiac output, requiring no active cooperation by the subject. It was concluded that the Doppler method is superior to the Fick method in that it provides beat by beat information on cardiac performance.
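
    The arithmetic behind a Doppler estimate of this kind can be sketched as follows, assuming the standard relations stroke volume = velocity integral × aortic cross-sectional area and cardiac output = stroke volume × heart rate, with a circular aortic cross section. All input values and function names are hypothetical and are not data from the study.

```python
import math


def aortic_area_cm2(diameter_cm):
    """Cross-sectional area of the aorta, assuming a circular lumen."""
    return math.pi * (diameter_cm / 2.0) ** 2


def cardiac_output_l_per_min(velocity_integral_cm, diameter_cm, heart_rate_bpm):
    """Cardiac output from the systolic velocity integral and aortic diameter."""
    # cm * cm^2 = cm^3, i.e. millilitres per beat.
    stroke_volume_ml = velocity_integral_cm * aortic_area_cm2(diameter_cm)
    # Millilitres per minute converted to litres per minute.
    return stroke_volume_ml * heart_rate_bpm / 1000.0


# Hypothetical example values: 20 cm velocity integral, 2.0 cm aortic
# diameter, 70 beats per minute.
co = cardiac_output_l_per_min(20.0, 2.0, 70.0)
```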

  2. A task-based parallelism and vectorized approach to 3D Method of Characteristics (MOC) reactor simulation for high performance computing architectures

    NASA Astrophysics Data System (ADS)

    Tramm, John R.; Gunow, Geoffrey; He, Tim; Smith, Kord S.; Forget, Benoit; Siegel, Andrew R.

    2016-05-01

    In this study we present and analyze a formulation of the 3D Method of Characteristics (MOC) technique applied to the simulation of full core nuclear reactors. Key features of the algorithm include a task-based parallelism model that allows independent MOC tracks to be assigned to threads dynamically, ensuring load balancing, and a wide vectorizable inner loop that takes advantage of modern SIMD computer architectures. The algorithm is implemented in a set of highly optimized proxy applications in order to investigate its performance characteristics on CPU, GPU, and Intel Xeon Phi architectures. Speed, power, and hardware cost efficiencies are compared. Additionally, performance bottlenecks are identified for each architecture in order to determine the prospects for continued scalability of the algorithm on next generation HPC architectures.

  3. Robust High-Resolution Cloth Using Parallelism, History-Based Collisions and Accurate Friction

    PubMed Central

    Selle, Andrew; Su, Jonathan; Irving, Geoffrey; Fedkiw, Ronald

    2015-01-01

    In this paper we simulate high resolution cloth consisting of up to 2 million triangles which allows us to achieve highly detailed folds and wrinkles. Since the level of detail is also influenced by object collision and self collision, we propose a more accurate model for cloth-object friction. We also propose a robust history-based repulsion/collision framework where repulsions are treated accurately and efficiently on a per time step basis. Distributed memory parallelism is used for both time evolution and collisions and we specifically address Gauss-Seidel ordering of repulsion/collision response. This algorithm is demonstrated by several high-resolution and high-fidelity simulations. PMID:19147895

  4. Building Blocks for Reliable Complex Nonlinear Numerical Simulations

    NASA Technical Reports Server (NTRS)

    Yee, H. C.; Mansour, Nagi N. (Technical Monitor)

    2002-01-01

    This talk describes some of the building blocks to ensure a higher level of confidence in the predictability and reliability (PAR) of numerical simulation of multiscale complex nonlinear problems. The focus is on relating PAR of numerical simulations with complex nonlinear phenomena of numerics. To isolate sources of numerical uncertainties, the possible discrepancy between the chosen partial differential equation (PDE) model and the real physics and/or experimental data is set aside. The discussion is restricted to how well numerical schemes can mimic the solution behavior of the underlying PDE model for finite time steps and grid spacings. The situation is complicated by the fact that the available theory for the understanding of nonlinear behavior of numerics is not at a stage to fully analyze the nonlinear Euler and Navier-Stokes equations. The discussion is based on the knowledge gained for nonlinear model problems with known analytical solutions to identify and explain the possible sources and remedies of numerical uncertainties in practical computations. Examples relevant to turbulent flow computations are included.

  5. MMS observations and hybrid simulations of rippled and reforming quasi-parallel shocks

    NASA Astrophysics Data System (ADS)

    Gingell, I.; Schwartz, S. J.; Burgess, D.; Johlander, A.; Russell, C. T.; Burch, J. L.; Ergun, R.; Fuselier, S. A.; Gershman, D. J.; Giles, B. L.; Goodrich, K.; Khotyaintsev, Y. V.; Lavraud, B.; Lindqvist, P. A.; Strangeway, R. J.; Trattner, K. J.; Torbert, R. B.; Wilder, F. D.

    2017-12-01

    Surface ripples, i.e. deviations in the nominal local shock orientation, are expected to propagate in the ramp and overshoot of collisionless shocks. These ripples have typically been associated with observations and simulations of quasi-perpendicular shocks. We present observations of a crossing of Earth's marginally quasi-parallel (θBn ˜ 45°) bow shock by the MMS spacecraft on 2015-11-27 06:01:44 UTC, for which we identify signatures consistent with a propagating surface ripple. In order to demonstrate the differences between ripples at quasi-perpendicular and quasi-parallel shocks, we also present two-dimensional hybrid simulations over a range of shock normal angles θBn under the observed solar wind conditions. We show that in the quasi-parallel cases surface ripples are transient phenomena modulated by the cyclic reformation of the shock front. These ripples develop faster than an ion gyroperiod and only during the period of the reformation cycle when a newly developed shock ramp is unaffected by turbulence in the foot. We conclude that the change of properties of the surface ripple observed by MMS while crossing Earth's quasi-parallel bow shock are consistent with the influence of cyclic reformation on shock structure. Given that both surface ripples and cyclic reformation are expected to affect the acceleration of electrons within the shock, the interaction of these phenomena and any other sources of shock non-stationary are important for models of particle acceleration. We therefore discuss signatures of electron heating and acceleration in several rippled shocks observed by MMS.

  6. A discontinuous Galerkin method for two-dimensional PDE models of Asian options

    NASA Astrophysics Data System (ADS)

    Hozman, J.; Tichý, T.; Cvejnová, D.

    2016-06-01

    In our previous research we have focused on the problem of plain vanilla option valuation using the discontinuous Galerkin method for numerical PDE solution. Here we extend a simple one-dimensional problem into a two-dimensional one and design a scheme for valuation of Asian options, i.e. options with payoff depending on the average of prices collected over a prespecified horizon. The algorithm is based on the approach combining the advantages of the finite element methods together with the piecewise polynomial generally discontinuous approximations. Finally, an illustrative example using DAX option market data is provided.
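
    To make the payoff definition concrete, here is a minimal sketch of an arithmetic-average Asian call payoff. The choice of an arithmetic average and a fixed strike is an assumption for illustration (the abstract does not specify the averaging type), and the prices are made-up numbers, not DAX data.

```python
def asian_call_payoff(prices, strike):
    """Payoff of a fixed-strike Asian call with arithmetic averaging."""
    average_price = sum(prices) / len(prices)
    return max(average_price - strike, 0.0)


# Hypothetical monitored prices; their average is 103, so with a strike
# of 100 the payoff is 3.
payoff = asian_call_payoff([100.0, 102.0, 104.0, 106.0], strike=100.0)
```

    The dependence on the running average is what adds a second state variable to the pricing PDE, in addition to the underlying price.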

  7. Parallel computing in enterprise modeling.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Goldsby, Michael E.; Armstrong, Robert C.; Shneider, Max S.

    2008-08-01

    This report presents the results of our efforts to apply high-performance computing to entity-based simulations with a multi-use plugin for parallel computing. We use the term 'Entity-based simulation' to describe a class of simulation which includes both discrete event simulation and agent based simulation. What simulations of this class share, and what differs from more traditional models, is that the result sought is emergent from a large number of contributing entities. Logistic, economic and social simulations are members of this class where things or people are organized or self-organize to produce a solution. Entity-based problems never have an a priori ergodic principle that will greatly simplify calculations. Because the results of entity-based simulations can only be realized at scale, scalable computing is de rigueur for large problems. Having said that, the absence of a spatial organizing principle makes the decomposition of the problem onto processors problematic. In addition, practitioners in this domain commonly use the Java programming language which presents its own problems in a high-performance setting. The plugin we have developed, called the Parallel Particle Data Model, overcomes both of these obstacles and is now being used by two Sandia frameworks: the Decision Analysis Center, and the Seldon social simulation facility. While the ability to engage U.S.-sized problems is now available to the Decision Analysis Center, this plugin is central to the success of Seldon. Because Seldon relies on computationally intensive cognitive sub-models, this work is necessary to achieve the scale necessary for realistic results. With the recent upheavals in the financial markets, and the inscrutability of terrorist activity, this simulation domain will likely need a capability with ever greater fidelity. High-performance computing will play an important part in enabling that greater fidelity.

  8. Parallel Adaptive High-Order CFD Simulations Characterizing Cavity Acoustics for the Complete SOFIA Aircraft

    NASA Technical Reports Server (NTRS)

    Barad, Michael F.; Brehm, Christoph; Kiris, Cetin C.; Biswas, Rupak

    2014-01-01

    This paper presents one-of-a-kind MPI-parallel computational fluid dynamics simulations for the Stratospheric Observatory for Infrared Astronomy (SOFIA). SOFIA is an airborne, 2.5-meter infrared telescope mounted in an open cavity in the aft of a Boeing 747SP. These simulations focus on how the unsteady flow field inside and over the cavity interferes with the optical path and mounting of the telescope. A temporally fourth-order Runge-Kutta, and spatially fifth-order WENO-5Z scheme was used to perform implicit large eddy simulations. An immersed boundary method provides automated gridding for complex geometries and natural coupling to a block-structured Cartesian adaptive mesh refinement framework. Strong scaling studies using NASA's Pleiades supercomputer with up to 32,000 cores and 4 billion cells show excellent scaling. Dynamic load balancing based on execution time on individual AMR blocks addresses irregularities caused by the highly complex geometry. Limits to scaling beyond 32K cores are identified, and targeted code optimizations are discussed.
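
    One simple way to balance work using measured per-block execution times is a greedy "largest block first" assignment. The sketch below illustrates that general idea only; it is not the load balancer used in the paper, and the block timings and function name are hypothetical.

```python
import heapq


def balance_blocks(block_times, n_ranks):
    """Assign blocks to ranks, always giving the next-largest block to the
    currently least-loaded rank (greedy longest-processing-time heuristic).

    Returns (assignment, loads): assignment[rank] lists block indices and
    loads[rank] is the summed execution time assigned to that rank.
    """
    assignment = [[] for _ in range(n_ranks)]
    loads = [0.0] * n_ranks
    heap = [(0.0, rank) for rank in range(n_ranks)]  # (current load, rank)
    heapq.heapify(heap)
    # Visit blocks in order of decreasing measured time.
    order = sorted(range(len(block_times)), key=lambda i: -block_times[i])
    for block in order:
        load, rank = heapq.heappop(heap)
        assignment[rank].append(block)
        loads[rank] = load + block_times[block]
        heapq.heappush(heap, (loads[rank], rank))
    return assignment, loads


# Hypothetical per-block timings (seconds) from a previous time step.
assignment, loads = balance_blocks([5.0, 4.0, 3.0, 3.0, 2.0, 1.0], n_ranks=2)
```

    A production AMR code would also have to weigh the cost of migrating blocks between ranks, which this sketch ignores.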

  9. Reconfigurable Analog PDE computation for Baseband and RF Computation

    DTIC Science & Technology

    2017-03-01

    waveguiding PDEs. One-dimensional ladder topologies enable linear delays, linear-phase analog filters , as well as analog beamforming, potentially at RF...performance. This discussion focuses on ODE / PDE analog computation available in SoC FPAA structures. One such computation is a ladder filter (Fig...Implementation of a one-dimensional ladder filter for computing inductor (L) and capacitor (C) lines. These components can be implemented in CABs or as

  10. Scalable and fast heterogeneous molecular simulation with predictive parallelization schemes

    NASA Astrophysics Data System (ADS)

    Guzman, Horacio V.; Junghans, Christoph; Kremer, Kurt; Stuehn, Torsten

    2017-11-01

    Multiscale and inhomogeneous molecular systems are challenging topics in the field of molecular simulation. In particular, modeling biological systems in the context of multiscale simulations and exploring material properties are driving a permanent development of new simulation methods and optimization algorithms. In computational terms, those methods require parallelization schemes that make a productive use of computational resources for each simulation and from its genesis. Here, we introduce the heterogeneous domain decomposition approach, which is a combination of an heterogeneity-sensitive spatial domain decomposition with an a priori rearrangement of subdomain walls. Within this approach, the theoretical modeling and scaling laws for the force computation time are proposed and studied as a function of the number of particles and the spatial resolution ratio. We also show the new approach capabilities, by comparing it to both static domain decomposition algorithms and dynamic load-balancing schemes. Specifically, two representative molecular systems have been simulated and compared to the heterogeneous domain decomposition proposed in this work. These two systems comprise an adaptive resolution simulation of a biomolecule solvated in water and a phase-separated binary Lennard-Jones fluid.

  11. Scalable and fast heterogeneous molecular simulation with predictive parallelization schemes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Guzman, Horacio V.; Junghans, Christoph; Kremer, Kurt

    Multiscale and inhomogeneous molecular systems are challenging topics in the field of molecular simulation. In particular, modeling biological systems in the context of multiscale simulations and exploring material properties are driving a permanent development of new simulation methods and optimization algorithms. In computational terms, those methods require parallelization schemes that make a productive use of computational resources for each simulation and from its genesis. Here, we introduce the heterogeneous domain decomposition approach, which is a combination of an heterogeneity-sensitive spatial domain decomposition with an a priori rearrangement of subdomain walls. Within this approach and paper, the theoretical modeling and scaling laws for the force computation time are proposed and studied as a function of the number of particles and the spatial resolution ratio. We also show the new approach capabilities, by comparing it to both static domain decomposition algorithms and dynamic load-balancing schemes. Specifically, two representative molecular systems have been simulated and compared to the heterogeneous domain decomposition proposed in this work. Finally, these two systems comprise an adaptive resolution simulation of a biomolecule solvated in water and a phase-separated binary Lennard-Jones fluid.

  12. Scalable and fast heterogeneous molecular simulation with predictive parallelization schemes

    DOE PAGES

    Guzman, Horacio V.; Junghans, Christoph; Kremer, Kurt; ...

    2017-11-27

    Multiscale and inhomogeneous molecular systems are challenging topics in the field of molecular simulation. In particular, modeling biological systems in the context of multiscale simulations and exploring material properties are driving a permanent development of new simulation methods and optimization algorithms. In computational terms, those methods require parallelization schemes that make a productive use of computational resources for each simulation and from its genesis. Here, we introduce the heterogeneous domain decomposition approach, which is a combination of an heterogeneity-sensitive spatial domain decomposition with an a priori rearrangement of subdomain walls. Within this approach and paper, the theoretical modeling and scaling laws for the force computation time are proposed and studied as a function of the number of particles and the spatial resolution ratio. We also show the new approach capabilities, by comparing it to both static domain decomposition algorithms and dynamic load-balancing schemes. Specifically, two representative molecular systems have been simulated and compared to the heterogeneous domain decomposition proposed in this work. Finally, these two systems comprise an adaptive resolution simulation of a biomolecule solvated in water and a phase-separated binary Lennard-Jones fluid.

  13. A dominant variant in the PDE1C gene is associated with nonsyndromic hearing loss.

    PubMed

    Wang, Li; Feng, Yong; Yan, Denise; Qin, Litao; Grati, M'hamed; Mittal, Rahul; Li, Tao; Sundhari, Abhiraami Kannan; Liu, Yalan; Chapagain, Prem; Blanton, Susan H; Liao, Shixiu; Liu, Xuezhong

    2018-06-02

    Identification of genes with variants causing non-syndromic hearing loss (NSHL) is challenging due to genetic heterogeneity. The difficulty is compounded by technical limitations that in the past prevented comprehensive gene identification. Recent advances in technology, using targeted capture and next-generation sequencing (NGS), are changing the face of gene identification and making it possible to rapidly and cost-effectively sequence the whole human exome. Here, we characterize a five-generation Chinese family with progressive, postlingual autosomal dominant nonsyndromic hearing loss (ADNSHL). By combining population-specific mutation arrays, a targeted deafness gene panel, and whole exome sequencing (WES), we identified PDE1C (Phosphodiesterase 1C) c.958G>T (p.A320S) as the disease-associated variant. Structural modeling insights into p.A320S strongly suggest that the sequence alteration will likely affect the substrate-binding pocket of PDE1C. By whole-mount immunofluorescence on postnatal day 3 mouse cochlea, we show its expression in outer (OHC) and inner (IHC) hair cells cytosol co-localizing with Lamp-1 in lysosomes. Furthermore, we provide evidence that the variant alters the PDE1C hydrolytic activity for both cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP). Collectively, our findings indicate that the c.958G>T variant in PDE1C may disrupt the cross talk between cGMP-signaling and cAMP pathways in Ca2+ homeostasis.

  14. Parallel Simulation of Three-Dimensional Free-Surface Fluid Flow Problems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    BAER,THOMAS A.; SUBIA,SAMUEL R.; SACKINGER,PHILIP A.

    2000-01-18

    We describe parallel simulations of viscous, incompressible, free surface, Newtonian fluid flow problems that include dynamic contact lines. The Galerkin finite element method was used to discretize the fully-coupled governing conservation equations and a "pseudo-solid" mesh mapping approach was used to determine the shape of the free surface. In this approach, the finite element mesh is allowed to deform to satisfy quasi-static solid mechanics equations subject to geometric or kinematic constraints on the boundaries. As a result, nodal displacements must be included in the set of problem unknowns. Issues concerning the proper constraints along the solid-fluid dynamic contact line in three dimensions are discussed. Parallel computations are carried out for an example taken from the coating flow industry, flow in the vicinity of a slot coater edge. This is a three-dimensional free-surface problem possessing a contact line that advances at the web speed in one region but transitions to static behavior in another part of the flow domain. Discussion focuses on parallel speedups for fixed problem size, a class of problems of immediate practical importance.

  15. A package of Linux scripts for the parallelization of Monte Carlo simulations

    NASA Astrophysics Data System (ADS)

    Badal, Andreu; Sempau, Josep

    2006-09-01

    sequential code.

    Program summary
    Title of program: clonEasy
    Catalogue identifier: ADYD_v1_0
    Program summary URL: http://cpc.cs.qub.ac.uk/summaries/ADYD_v1_0
    Program obtainable from: CPC Program Library, Queen's University of Belfast, Northern Ireland
    Computer for which the program is designed and others in which it is operable: Any computer with a Unix style shell (bash), support for the Secure Shell protocol and a FORTRAN compiler
    Operating systems under which the program has been tested: Linux (RedHat 8.0, SuSe 8.1, Debian Woody 3.1)
    Compilers: GNU FORTRAN g77 (Linux); g95 (Linux); Intel Fortran Compiler 7.1 (Linux)
    Programming language used: Linux shell (bash) script, FORTRAN 77
    No. of bits in a word: 32
    No. of lines in distributed program, including test data, etc.: 1916
    No. of bytes in distributed program, including test data, etc.: 18 202
    Distribution format: tar.gz
    Nature of the physical problem: There are many situations where a Monte Carlo simulation involves a huge amount of CPU time. The parallelization of such calculations is a simple way of obtaining a relatively low statistical uncertainty using a reasonable amount of time.
    Method of solution: The presented collection of Linux scripts and auxiliary FORTRAN programs implement Secure Shell-based communication between a "master" computer and a set of "clones". The aim of this communication is to execute a code that performs a Monte Carlo simulation on all the clones simultaneously. The code is unique, but each clone is fed with a different set of random seeds. Hence, clonEasy effectively permits the parallelization of the calculation.
    Restrictions on the complexity of the program: clonEasy can only be used with programs that produce statistically independent results using the same code, but with a different sequence of random numbers. Users must choose the initialization values for the random number generator on each computer and combine the output from the different executions. A FORTRAN program to combine the final results is
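
    The seed-per-clone idea described above can be sketched in a few lines. This is only an illustration, not clonEasy itself (which is a set of bash scripts and FORTRAN programs): the function names run_clone and combine, the seed values, and the choice of estimating pi are all assumptions made for the example. Each "clone" runs the same code with its own seed, and the independent results are combined afterwards.

```python
import math
import random


def run_clone(seed, n_samples):
    """Run the same Monte Carlo code on one hypothetical 'clone' with its own seed.

    Estimates pi by sampling points in the unit square and returns
    (estimate, variance of the estimate).
    """
    rng = random.Random(seed)  # independent generator per clone
    hits = 0
    for _ in range(n_samples):
        x, y = rng.random(), rng.random()
        if x * x + y * y <= 1.0:
            hits += 1
    p = hits / n_samples
    estimate = 4.0 * p
    variance = 16.0 * p * (1.0 - p) / n_samples  # binomial variance of 4 * p
    return estimate, variance


def combine(results):
    """Combine equally sized, statistically independent runs.

    The combined estimate is the plain average; its variance is the sum of
    the individual variances divided by the square of the number of runs.
    """
    k = len(results)
    mean = sum(est for est, _ in results) / k
    variance = sum(var for _, var in results) / (k * k)
    return mean, math.sqrt(variance)


# In a real setup each call would run on a different machine; here the
# clones are simulated one after another with illustrative seeds.
seeds = [101, 202, 303, 404]
results = [run_clone(seed, 50_000) for seed in seeds]
combined_mean, combined_sigma = combine(results)
single_sigma = math.sqrt(results[0][1])
print(f"combined estimate: {combined_mean:.4f} +/- {combined_sigma:.4f}")
print(f"single-clone uncertainty: {single_sigma:.4f}")
```

    With four equally sized runs the combined standard uncertainty is roughly half that of a single run, which is the benefit the abstract refers to.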

  16. Parallel filtering in global gyrokinetic simulations

    NASA Astrophysics Data System (ADS)

    Jolliet, S.; McMillan, B. F.; Villard, L.; Vernay, T.; Angelino, P.; Tran, T. M.; Brunner, S.; Bottino, A.; Idomura, Y.

    2012-02-01

    In this work, a Fourier solver [B.F. McMillan, S. Jolliet, A. Bottino, P. Angelino, T.M. Tran, L. Villard, Comp. Phys. Commun. 181 (2010) 715] is implemented in the global Eulerian gyrokinetic code GT5D [Y. Idomura, H. Urano, N. Aiba, S. Tokuda, Nucl. Fusion 49 (2009) 065029] and in the global Particle-In-Cell code ORB5 [S. Jolliet, A. Bottino, P. Angelino, R. Hatzky, T.M. Tran, B.F. McMillan, O. Sauter, K. Appert, Y. Idomura, L. Villard, Comp. Phys. Commun. 177 (2007) 409] in order to reduce the memory of the matrix associated with the field equation. This scheme is verified with linear and nonlinear simulations of turbulence. It is demonstrated that the straight-field-line angle is the coordinate that optimizes the Fourier solver, that both linear and nonlinear turbulent states are unaffected by the parallel filtering, and that the k∥ spectrum is independent of plasma size at fixed normalized poloidal wave number.

  17. Parallel Proximity Detection for Computer Simulation

    NASA Technical Reports Server (NTRS)

    Steinman, Jeffrey S. (Inventor); Wieland, Frederick P. (Inventor)

    1997-01-01

    The present invention discloses a system for performing proximity detection in computer simulations on parallel processing architectures utilizing a distribution list which includes movers and sensor coverages which check in and out of grids. Each mover maintains a list of sensors that detect the mover's motion as the mover and sensor coverages check in and out of the grids. Fuzzy grids are included by fuzzy resolution parameters to allow movers and sensor coverages to check in and out of grids without computing exact grid crossings. The movers check in and out of grids while moving sensors periodically inform the grids of their coverage. In addition, a lookahead function is also included for providing a generalized capability without making any limiting assumptions about the particular application to which it is applied. The lookahead function is initiated so that risk-free synchronization strategies never roll back grid events. The lookahead function adds fixed delays as events are scheduled for objects on other nodes.

  18. Parallel Proximity Detection for Computer Simulations

    NASA Technical Reports Server (NTRS)

    Steinman, Jeffrey S. (Inventor); Wieland, Frederick P. (Inventor)

    1998-01-01

    The present invention discloses a system for performing proximity detection in computer simulations on parallel processing architectures utilizing a distribution list which includes movers and sensor coverages which check in and out of grids. Each mover maintains a list of sensors that detect the mover's motion as the mover and sensor coverages check in and out of the grids. Fuzzy grids are included by fuzzy resolution parameters to allow movers and sensor coverages to check in and out of grids without computing exact grid crossings. The movers check in and out of grids while moving sensors periodically inform the grids of their coverage. In addition, a lookahead function is also included for providing a generalized capability without making any limiting assumptions about the particular application to which it is applied. The lookahead function is initiated so that risk-free synchronization strategies never roll back grid events. The lookahead function adds fixed delays as events are scheduled for objects on other nodes.

  19. Secure web-based invocation of large-scale plasma simulation codes

    NASA Astrophysics Data System (ADS)

    Dimitrov, D. A.; Busby, R.; Exby, J.; Bruhwiler, D. L.; Cary, J. R.

    2004-12-01

    We present our design and initial implementation of a web-based system for running, both in parallel and serial, Particle-In-Cell (PIC) codes for plasma simulations with automatic post processing and generation of visual diagnostics.

  20. The Phosphodiesterase 5-Inhibitors (PDE-5i) for ERECTILE DYSFUNCTION (ED): A Therapeutic Challenge For Psychiatrists.

    PubMed

    Koon, Chong Siew; Sidi, Hatta; Kumar, Jaya; Das, Srijit; Xi, Ong Wan; Hatta, Muhammad Hizri; Alfonso, Cesar

    2017-02-15

    Erectile function (EF) is a prerequisite for satisfactory sexual intercourse (SI) and central to male sexual functioning. Satisfactory SI eventually leads to orgasm - a biopsychophysiological state of euphoria - leading to a sense of bliss, enjoyment and positive mental well being. For a psychiatrist, treating ED is motivated by the aim of harmonizing these pleasurable experiences with the encouragement of physical wellness and sensuality. Hence, the role of PDE-5i is pivotal, and treating ED constitutes a therapeutic challenge. PDE-5i work via the dopaminergic-oxytocin-nitric oxide pathway by increasing the availability of endothelial guanosine monophosphate (GMP), immediately causing relaxation of the penile smooth muscle and an erection. The PDE-5i, like sildenafil, vardenafil and tadalafil, are effective in the treatment of ED with some benefits and disadvantages compared to other treatment modalities. Prescribed PDE-5i exclusively improve EF, fostering the male's self-confidence and self-esteem. Treatment failures are associated with factors such as absent (or insufficient) sexual stimulation, psychosexual conflicts and the co-existence of medical disorders. Managing ED requires dealing with underlying medical diseases, addressing other co-morbid sexual dysfunctions like premature ejaculation (PE), and educating the patient on healthy life-styles, besides being cautious with the potential side-effects and drug-drug interactions. Furthermore, by dealing with interpersonal dynamics within the couple and embracing adequate lifestyles (managing stress and revising one's sexual scripts), PDE-5i treatment benefits may be enhanced. In this review, we propose a holistic conceptual framework approach for psychiatric management of patients with ED. Copyright© Bentham Science Publishers.

  1. Rapid Parallel Calculation of shell Element Based On GPU

    NASA Astrophysics Data System (ADS)

    Wanga, Jian Hua; Lia, Guang Yao; Lib, Sheng; Li, Guang Yao

    2010-06-01

    Long computing times have been a bottleneck in the application of the finite element method. In this paper, an effective method to speed up FEM calculation using an existing modern graphics processing unit and a programmable colored rendering tool is put forward. The method devises a representation of the unit information in accordance with the features of the GPU, converts all the unit calculations into a film rendering process, carries out the simulation work of calculating the internal force of all the units, and overcomes the low level of parallelism seen previously when running on a single computer. Studies showed that this method could improve efficiency and greatly shorten calculation time. The results of emulation calculations for the elasticity problem of a large number of cells in sheet metal showed that the GPU parallel simulation calculation was faster than the CPU calculation. The approach is useful and efficient for solving engineering problems.

  2. Synchronous Parallel Emulation and Discrete Event Simulation System with Self-Contained Simulation Objects and Active Event Objects

    NASA Technical Reports Server (NTRS)

    Steinman, Jeffrey S. (Inventor)

    1998-01-01

    The present invention is embodied in a method of performing object-oriented simulation and a system having inter-connected processor nodes operating in parallel to simulate mutual interactions of a set of discrete simulation objects distributed among the nodes as a sequence of discrete events changing state variables of respective simulation objects so as to generate new event-defining messages addressed to respective ones of the nodes. The object-oriented simulation is performed at each one of the nodes by assigning passive self-contained simulation objects to each one of the nodes, responding to messages received at one node by generating corresponding active event objects having user-defined inherent capabilities and individual time stamps and corresponding to respective events affecting one of the passive self-contained simulation objects of the one node, restricting the respective passive self-contained simulation objects to only providing and receiving information from the respective active event objects, requesting information and changing variables within a passive self-contained simulation object by the active event object, and producing corresponding messages specifying events resulting therefrom by the active event objects.

  3. Computer-aided design of multi-target ligands at A1R, A2AR and PDE10A, key proteins in neurodegenerative diseases.

    PubMed

    Kalash, Leen; Val, Cristina; Azuaje, Jhonny; Loza, María I; Svensson, Fredrik; Zoufir, Azedine; Mervin, Lewis; Ladds, Graham; Brea, José; Glen, Robert; Sotelo, Eddy; Bender, Andreas

    2017-12-30

    Compounds designed to display polypharmacology may have utility in treating complex diseases, where activity at multiple targets is required to produce a clinical effect. In particular, suitable compounds may be useful in treating neurodegenerative diseases by promoting neuronal survival in a synergistic manner via their multi-target activity at the adenosine A1 and A2A receptors (A1R and A2AR) and phosphodiesterase 10A (PDE10A), which modulate intracellular cAMP levels. Hence, in this work we describe a computational method for the design of synthetically feasible ligands that bind to A1 and A2A receptors and inhibit phosphodiesterase 10A (PDE10A), involving a retrosynthetic approach employing in silico target prediction and docking, which may be generally applicable to multi-target compound design at several target classes. This approach has identified 2-aminopyridine-3-carbonitriles as the first multi-target ligands at A1R, A2AR and PDE10A, by showing agreement between the ligand and structure based predictions at these targets. The series were synthesized via an efficient one-pot scheme and validated pharmacologically as A1R/A2AR-PDE10A ligands, with IC50 values of 2.4-10.0 μM at PDE10A and Ki values of 34-294 nM at A1R and/or A2AR. Furthermore, selectivity profiling of the synthesized 2-amino-pyridin-3-carbonitriles against other subtypes of both protein families showed that the multi-target ligand 8 exhibited a minimum of twofold selectivity over all tested off-targets. In addition, both compounds 8 and 16 exhibited the desired multi-target profile, which could be considered for further functional efficacy assessment, analog modification for the improvement of selectivity towards A1R, A2AR and PDE10A collectively, and evaluation of their potential synergy in modulating cAMP levels.

  4. A method for data handling numerical results in parallel OpenFOAM simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anton, Alin; Muntean, Sebastian

    Parallel computational fluid dynamics simulations produce vast amounts of numerical result data. This paper introduces a method for reducing the size of the data by replaying the interprocessor traffic. The results are recovered only in certain regions of interest configured by the user. A known test case is used for several mesh partitioning scenarios using the OpenFOAM® toolkit [1]. The space savings obtained with classic algorithms remain constant for more than 60 Gb of floating point data. Our method is most efficient on large simulation meshes and is much better suited for compressing large scale simulation results than the regular algorithms.

  5. Hypergraph partitioning implementation for parallelizing matrix-vector multiplication using CUDA GPU-based parallel computing

    NASA Astrophysics Data System (ADS)

    Murni, Bustamam, A.; Ernastuti, Handhika, T.; Kerami, D.

    2017-07-01

    Calculation of matrix-vector multiplication in real-world problems often involves large matrices of arbitrary size. Therefore, parallelization is needed to speed up a calculation process that usually takes a long time. Graph partitioning techniques that have been discussed in previous studies cannot be used to complete the parallelized calculation of matrix-vector multiplication with arbitrary size. This is because graph partitioning techniques assume a square and symmetric matrix. Hypergraph partitioning techniques overcome this shortcoming of graph partitioning. This paper addresses the efficient parallelization of matrix-vector multiplication through hypergraph partitioning techniques using CUDA GPU-based parallel computing. CUDA (compute unified device architecture) is a parallel computing platform and programming model that was created by NVIDIA and implemented by the GPU (graphics processing unit).
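
    As an illustration of why a hypergraph model suits rectangular matrices, the sketch below uses the commonly described column-net model for a row-wise partition of y = Ax: every row is a vertex, every column is a net containing the rows with a nonzero in that column, and the communication volume of a partition is the sum over nets of (number of parts touched minus one). This is an assumed textbook formulation, not necessarily the one used in the paper, and it only evaluates given partitions rather than computing one; the function names and the 4x5 example matrix are hypothetical.

```python
def column_nets(rows):
    """Build the column-net hypergraph of a sparse matrix.

    rows[i] is the set of column indices holding nonzeros in row i.
    Returns a dict mapping each column (net) to the set of rows (pins).
    """
    nets = {}
    for i, cols in enumerate(rows):
        for j in cols:
            nets.setdefault(j, set()).add(i)
    return nets


def comm_volume(rows, part):
    """Connectivity-minus-one metric for a row-wise partition.

    part[i] is the processor that owns row i. A column whose rows are
    spread over k processors costs k - 1 transfers of the matching x entry,
    assuming that entry is stored on one of those k processors.
    """
    nets = column_nets(rows)
    return sum(len({part[i] for i in pins}) - 1 for pins in nets.values())


# A 4 x 5 (non-square) matrix with two nonzeros per row.
rows = [{0, 1}, {1, 2}, {2, 3}, {3, 4}]

contiguous = [0, 0, 1, 1]   # rows 0-1 on processor 0, rows 2-3 on processor 1
interleaved = [0, 1, 0, 1]  # rows alternate between the two processors

print(comm_volume(rows, contiguous))
print(comm_volume(rows, interleaved))
```

    Both partitions are perfectly balanced (two rows each), yet the contiguous one cuts a single column net while the interleaved one cuts three, so a hypergraph partitioner would prefer the former.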

  6. Scalability of Parallel Spatial Direct Numerical Simulations on Intel Hypercube and IBM SP1 and SP2

    NASA Technical Reports Server (NTRS)

    Joslin, Ronald D.; Hanebutte, Ulf R.; Zubair, Mohammad

    1995-01-01

    The implementation and performance of a parallel spatial direct numerical simulation (PSDNS) approach on the Intel iPSC/860 hypercube and IBM SP1 and SP2 parallel computers is documented. Spatially evolving disturbances associated with the laminar-to-turbulent transition in boundary-layer flows are computed with the PSDNS code. The feasibility of using the PSDNS to perform transition studies on these computers is examined. The results indicate that the PSDNS approach can effectively be parallelized on a distributed-memory parallel machine by remapping the distributed data structure during the course of the calculation. Scalability information is provided to estimate computational costs to match the actual costs relative to changes in the number of grid points. By increasing the number of processors, slower than linear speedups are achieved with optimized (machine-dependent library) routines. This slower than linear speedup results because the computational cost is dominated by the FFT routine, which yields less than ideal speedups. By using appropriate compile options and optimized library routines on the SP1, the serial code achieves 52-56 Mflops on a single node of the SP1 (45 percent of theoretical peak performance). The actual performance of the PSDNS code on the SP1 is evaluated with a "real world" simulation that consists of 1.7 million grid points. One time step of this simulation is calculated on eight nodes of the SP1 in the same time as required by a Cray Y/MP supercomputer. For the same simulation, 32 nodes of the SP1 and SP2 are required to reach the performance of a Cray C-90. A 32-node SP1 (SP2) configuration is 2.9 (4.6) times faster than a Cray Y/MP for this simulation, while the hypercube is roughly 2 times slower than the Y/MP for this application. KEY WORDS: Spatial direct numerical simulations; incompressible viscous flows; spectral methods; finite differences; parallel computing.
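
    The "slower than linear" speedup can be made concrete from the figures quoted above, under the assumption that the 8-node and 32-node SP1 statements refer to the same time-step measurement. Writing T for the time per step, with subscripts for the node count (the symbols S and E for speedup and parallel efficiency are notation introduced here, not taken from the abstract):

```latex
\begin{align*}
T_{8}^{\mathrm{SP1}} &= T_{\mathrm{Y/MP}}, \qquad
T_{32}^{\mathrm{SP1}} = \frac{T_{\mathrm{Y/MP}}}{2.9}, \\
S_{8 \to 32} &= \frac{T_{8}^{\mathrm{SP1}}}{T_{32}^{\mathrm{SP1}}} = 2.9, \\
E_{8 \to 32} &= \frac{S_{8 \to 32}}{32/8} = \frac{2.9}{4} \approx 0.73 .
\end{align*}
```

    Quadrupling the node count therefore buys a factor of about 2.9 rather than 4 on the SP1, a relative efficiency of roughly 73 percent. The same estimate cannot be made for the SP2 from the abstract alone, since no 8-node SP2 timing is quoted.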

  7. The localization and concentration of the PDE2-encoded high-affinity cAMP phosphodiesterase is regulated by cAMP-dependent protein kinase A in the yeast Saccharomyces cerevisiae.

    PubMed

    Hu, Yun; Liu, Enkai; Bai, Xiaojia; Zhang, Aili

    2010-03-01

    The genome of the yeast Saccharomyces cerevisiae encodes two cyclic AMP (cAMP) phosphodiesterases, a low-affinity one, Pde1, and a high-affinity one, Pde2. Pde1 has been ascribed a function for downregulating agonist-induced cAMP accumulation in a protein kinase A (PKA)-governed negative feedback loop, whereas Pde2 controls the basal cAMP level in the cell. Here we show that PKA regulates the localization and protein concentration of Pde2. Pde2 is accumulated in the nucleus in wild-type cells growing on glucose, or in strains with hyperactive PKA. In contrast, in derepressed wild-type cells or cells with attenuated PKA activity, Pde2 is distributed over the nucleus and cytoplasm. We also show evidence indicating that the Pde2 protein level is positively correlated with PKA activity. The increase in the Pde2 protein level in high-PKA strains and in cells growing on glucose was due to its increased half-life. These results suggest that, like its low-affinity counterpart, the high-affinity phosphodiesterase may also play an important role in the PKA-controlled feedback inhibition of intracellular cAMP.

  8. Re-forming supercritical quasi-parallel shocks. I - One- and two-dimensional simulations

    NASA Technical Reports Server (NTRS)

    Thomas, V. A.; Winske, D.; Omidi, N.

    1990-01-01

    The process of reforming supercritical quasi-parallel shocks is investigated using one-dimensional and two-dimensional hybrid (particle ion, massless fluid electron) simulations both of shocks and of simpler two-stream interactions. It is found that the supercritical quasi-parallel shock is not steady. Instead of a well-defined shock ramp between upstream and downstream states that remains at a fixed position in the flow, the ramp periodically steepens, broadens, and then reforms upstream of its former position. It is concluded that the wave generation process is localized at the shock ramp and that the reformation process proceeds in the absence of upstream perturbations intersecting the shock.

  9. PDE based scheme for multi-modal medical image watermarking.

    PubMed

    Aherrahrou, N; Tairi, H

    2015-11-25

    This work deals with copyright protection of digital images, an issue that calls for protection of intellectual property rights. It is an important issue, given the large number of medical images exchanged on the Internet every day, and ensuring the integrity and authenticity of received images is a challenging task. Digital watermarking techniques have been proposed as a valid solution to this problem. It is worth mentioning that Region Of Interest (ROI)/Region Of Non Interest (RONI) selection is a significant limitation of most ROI/RONI-based watermarking schemes, which in turn limits their applicability. Generally, the ROI/RONI is defined by a radiologist or a computer-aided selection tool. This is not efficient for an institute or health care system where a large number of images must be processed. Therefore, developing an automatic ROI/RONI selection is a challenging task. The major aim of this work is to develop an algorithm for automatic selection of the embedding region based on the so-called Partial Differential Equation (PDE) method, thus avoiding ROI/RONI selection problems including (1) computational overhead, (2) time consumption, and (3) modality-dependent selection. The algorithm is evaluated in terms of imperceptibility, robustness, tamper localization and recovery using MRI, Ultrasound, CT and X-ray grey scale medical images. From the experiments that we conducted on a database of 100 medical images of four modalities, it can be inferred that our method can achieve high imperceptibility while showing good robustness against attacks. Furthermore, the experimental results confirm the effectiveness of the proposed algorithm in detecting and recovering from various types of tampering. The highest PSNR value reached over the 100 images is 94.746 dB, while the lowest PSNR value is 60.1272 dB, which demonstrates the higher imperceptibility nature of the proposed
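
    The PSNR figures quoted above measure how little the watermark disturbs the image. A minimal sketch of the standard definition, PSNR = 10 log10(MAX^2 / MSE), is shown below for 8-bit grey-scale pixel values; the function name psnr, the flat pixel lists and the sample values are illustrative assumptions and are not taken from the paper.

```python
import math


def psnr(original, marked, max_value=255):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Both images are given as flat sequences of pixel intensities;
    max_value is the largest possible intensity (255 for 8-bit images).
    """
    if len(original) != len(marked):
        raise ValueError("images must have the same number of pixels")
    mse = sum((a - b) ** 2 for a, b in zip(original, marked)) / len(original)
    if mse == 0:
        return math.inf  # identical images: no distortion at all
    return 10.0 * math.log10(max_value ** 2 / mse)


# Tiny made-up example: two of four pixels changed by one grey level.
host = [10, 20, 30, 40]
watermarked = [10, 21, 30, 39]
value = psnr(host, watermarked)
print(f"PSNR = {value:.2f} dB")
```

    Higher values mean the watermarked image is closer to the original, which is why the abstract reads a high PSNR as evidence of imperceptibility.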

  10. Parallel Dynamics Simulation Using a Krylov-Schwarz Linear Solution Scheme

    DOE PAGES

    Abhyankar, Shrirang; Constantinescu, Emil M.; Smith, Barry F.; ...

    2016-11-07

    Fast dynamics simulation of large-scale power systems is a computational challenge because of the need to solve a large set of stiff, nonlinear differential-algebraic equations at every time step. The main bottleneck in dynamic simulations is the solution of a linear system during each nonlinear iteration of Newton’s method. In this paper, we present a parallel Krylov-Schwarz linear solution scheme that uses the Krylov subspace-based iterative linear solver GMRES with an overlapping restricted additive Schwarz preconditioner. As a result, performance tests of the proposed Krylov-Schwarz scheme for several large test cases ranging from 2,000 to 20,000 buses, including a real utility network, show good scalability on different computing architectures.

  11. Parallel Dynamics Simulation Using a Krylov-Schwarz Linear Solution Scheme

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Abhyankar, Shrirang; Constantinescu, Emil M.; Smith, Barry F.

    Fast dynamics simulation of large-scale power systems is a computational challenge because of the need to solve a large set of stiff, nonlinear differential-algebraic equations at every time step. The main bottleneck in dynamic simulations is the solution of a linear system during each nonlinear iteration of Newton’s method. In this paper, we present a parallel Krylov-Schwarz linear solution scheme that uses the Krylov subspace-based iterative linear solver GMRES with an overlapping restricted additive Schwarz preconditioner. As a result, performance tests of the proposed Krylov-Schwarz scheme for several large test cases ranging from 2,000 to 20,000 buses, including a real utility network, show good scalability on different computing architectures.

  12. Ca2+-activated K+ channel (KCa) stimulation improves relaxant capacity of PDE5 inhibitors in human penile arteries and recovers the reduced efficacy of PDE5 inhibition in diabetic erectile dysfunction.

    PubMed

    González-Corrochano, R; La Fuente, Jm; Cuevas, P; Fernández, A; Chen, Mx; Sáenz de Tejada, I; Angulo, J

    2013-05-01

    We have evaluated the influence of calcium-activated potassium channels (KCa) activation on cGMP-mediated relaxation in human penile tissues from non-diabetic and diabetic patients, and on the effects of PDE5 inhibitors on erectile responses in control and diabetic rats. Cavernosal tissues were collected from organ donors and from patients with erectile dysfunction (ED). Relaxations of corpus cavernosum strips (HCC) and penile resistance arteries (HPRA) obtained from these specimens were evaluated. Intracavernosal pressure (ICP) increases to cavernosal nerve electrical stimulation were determined in anaesthetized diabetic and non-diabetic rats. Concentration-dependent vasodilation to the PDE5 inhibitor, sildenafil, in HPRA was sensitive to endothelium removal, NO/cGMP pathway inhibition and KCa blockade. Accordingly, activation of KCa with NS-8 (10 μM) significantly potentiated sildenafil-induced relaxations in HPRA (EC50 0.49 ± 0.22 vs. 5.21 ± 0.63 μM). In HCC, sildenafil-induced relaxation was unaffected by KCa blockade or activation. Potentiating effects in HPRA were reproduced with an alternative PDE5 inhibitor (tadalafil) and KCa activator (NS1619) and prevented by removing the endothelium. Large-conductance KCa (BK) and intermediate-conductance KCa (IK) contribute to NS-8-induced effects and were immunodetected in human and rat penile arteries. NS-8 potentiated sildenafil-induced enhancement of erectile responses in rats. Activation of KCa recovered the impaired relaxation to sildenafil in diabetic HPRA while sildenafil completely reversed diabetes-induced ED in rats only when combined with KCa activation. Activation of KCa improves vasodilatory capacity of PDE5 inhibitors in diabetic and non-diabetic HPRA, resulting in the recovery of erectile function in diabetic rats. These results suggest a therapeutic potential for KCa activation in diabetic ED. © 2013 The Authors. British Journal of Pharmacology © 2013 The British Pharmacological Society.

  13. Endpoint-based parallel data processing with non-blocking collective instructions in a parallel active messaging interface of a parallel computer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Archer, Charles J; Blocksome, Michael A; Cernohous, Bob R

    Methods, apparatuses, and computer program products for endpoint-based parallel data processing with non-blocking collective instructions in a parallel active messaging interface ('PAMI') of a parallel computer are provided. Embodiments include establishing by a parallel application a data communications geometry, the geometry specifying a set of endpoints that are used in collective operations of the PAMI, including associating with the geometry a list of collective algorithms valid for use with the endpoints of the geometry. Embodiments also include registering in each endpoint in the geometry a dispatch callback function for a collective operation and executing without blocking, through a single one of the endpoints in the geometry, an instruction for the collective operation.

  14. Accurate reaction-diffusion operator splitting on tetrahedral meshes for parallel stochastic molecular simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hepburn, I.; De Schutter, E.

    Spatial stochastic molecular simulations in biology are limited by the intense computation required to track molecules in space either in a discrete time or discrete space framework, which has led to the development of parallel methods that can take advantage of the power of modern supercomputers in recent years. We systematically test suggested components of stochastic reaction-diffusion operator splitting in the literature and discuss their effects on accuracy. We introduce an operator splitting implementation for irregular meshes that enhances accuracy with minimal performance cost. We test a range of models in small-scale MPI simulations from simple diffusion models to realistic biological models and find that multi-dimensional geometry partitioning is an important consideration for optimum performance. We demonstrate performance gains of 1-3 orders of magnitude in the parallel implementation, with peak performance strongly dependent on model specification.

  15. GAPD: a GPU-accelerated atom-based polychromatic diffraction simulation code.

    PubMed

    E, J C; Wang, L; Chen, S; Zhang, Y Y; Luo, S N

    2018-03-01

    GAPD, a graphics-processing-unit (GPU)-accelerated atom-based polychromatic diffraction simulation code for direct, kinematics-based, simulations of X-ray/electron diffraction of large-scale atomic systems with mono-/polychromatic beams and arbitrary plane detector geometries, is presented. This code implements GPU parallel computation via both real- and reciprocal-space decompositions. With GAPD, direct simulations are performed of the reciprocal lattice node of ultralarge systems (∼5 billion atoms) and diffraction patterns of single-crystal and polycrystalline configurations with mono- and polychromatic X-ray beams (including synchrotron undulator sources), and validation, benchmark and application cases are presented.

  16. Advances in locally constrained k-space-based parallel MRI.

    PubMed

    Samsonov, Alexey A; Block, Walter F; Arunachalam, Arjun; Field, Aaron S

    2006-02-01

    In this article, several theoretical and methodological developments regarding k-space-based, locally constrained parallel MRI (pMRI) reconstruction are presented. A connection between Parallel MRI with Adaptive Radius in k-Space (PARS) and GRAPPA methods is demonstrated. The analysis provides a basis for unified treatment of both methods. Additionally, a weighted PARS reconstruction is proposed, which may absorb different weighting strategies for improved image reconstruction. Next, a fast and efficient method for pMRI reconstruction of data sampled on non-Cartesian trajectories is described. In the new technique, the computational burden associated with the numerous matrix inversions in the original PARS method is drastically reduced by limiting direct calculation of reconstruction coefficients to only a few reference points. The rest of the coefficients are found by interpolating between the reference sets, which is possible due to the similar configuration of points participating in reconstruction for highly symmetric trajectories, such as radial and spirals. As a result, the time requirements are drastically reduced, which makes it practical to use pMRI with non-Cartesian trajectories in many applications. The new technique was demonstrated with simulated and actual data sampled on radial trajectories. Copyright 2006 Wiley-Liss, Inc.

  17. Xyce Parallel Electronic Simulator Reference Guide Version 6.4

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Keiter, Eric R.; Mei, Ting; Russo, Thomas V.

    This document is a reference guide to the Xyce Parallel Electronic Simulator, and is a companion document to the Xyce Users' Guide [1]. The focus of this document is to list, as exhaustively as possible, device parameters, solver options, parser options, and other usage details of Xyce. This document is not intended to be a tutorial. Users who are new to circuit simulation are better served by the Xyce Users' Guide [1].

  18. High Performance Radiation Transport Simulations on TITAN

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Baker, Christopher G; Davidson, Gregory G; Evans, Thomas M

    2012-01-01

    In this paper we describe the Denovo code system. Denovo solves the six-dimensional, steady-state, linear Boltzmann transport equation, of central importance to nuclear technology applications such as reactor core analysis (neutronics), radiation shielding, nuclear forensics and radiation detection. The code features multiple spatial differencing schemes, state-of-the-art linear solvers, the Koch-Baker-Alcouffe (KBA) parallel-wavefront sweep algorithm for inverting the transport operator, a new multilevel energy decomposition method scaling to hundreds of thousands of processing cores, and a modern, novel code architecture that supports straightforward integration of new features. In this paper we discuss the performance of Denovo on the 10-20 petaflop ORNL GPU-based system, Titan. We describe algorithms and techniques used to exploit the capabilities of Titan's heterogeneous compute node architecture and the challenges of obtaining good parallel performance for this sparse hyperbolic PDE solver containing inherently sequential computations. Numerical results demonstrating Denovo performance on early Titan hardware are presented.
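    The wavefront idea behind sweep algorithms can be illustrated with a minimal sketch. It is not Denovo's KBA implementation; the function name `wavefronts` and the single sweep direction (positive along every axis) are assumptions made for illustration. Each cell depends on its three upstream neighbors, so all cells with the same index sum i + j + k are mutually independent and could be processed concurrently, while successive index sums must be processed in order.

```python
from collections import defaultdict


def wavefronts(nx, ny, nz):
    """Group the cells of an nx*ny*nz grid into diagonal wavefronts.

    Assumes a sweep direction that is positive along every axis, so cell
    (i, j, k) needs (i-1, j, k), (i, j-1, k) and (i, j, k-1) first.
    Cells sharing the same i + j + k do not depend on each other.
    """
    fronts = defaultdict(list)
    for i in range(nx):
        for j in range(ny):
            for k in range(nz):
                fronts[i + j + k].append((i, j, k))
    # Index sums run from 0 to nx + ny + nz - 3 inclusive.
    return [fronts[s] for s in range(nx + ny + nz - 2)]


# Example: a 3 x 4 x 2 grid is swept in 7 sequential stages.
stages = wavefronts(3, 4, 2)
```

    The number of sequential stages grows with nx + ny + nz rather than with the cell count, which is where the available parallelism comes from; the first and last stages contain a single cell, which illustrates the sequential portion the abstract refers to.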

  19. Use of the KlADH3 promoter for the quantitative production of the murine PDE5A isoforms in the yeast Kluyveromyces lactis.

    PubMed

    Cardarelli, Silvia; Giorgi, Mauro; Naro, Fabio; Malatesta, Francesco; Biagioni, Stefano; Saliola, Michele

    2017-09-22

    Phosphodiesterases (PDE) are a superfamily of enzymes that hydrolyse cyclic nucleotides (cAMP/cGMP), signal molecules in transduction pathways regulating crucial aspects of cell life. PDEs regulate the intensity and duration of the cyclic nucleotide signal, modulating the downstream biological effect. Due to this critical role associated with the extensive distribution and multiplicity of isozymes, the 11 mammalian families (PDE1 to PDE11) constitute key therapeutic targets. PDE5, one of these cGMP-specific hydrolysing families, is the molecular target of several well known drugs used to treat erectile dysfunction and pulmonary hypertension. Kluyveromyces lactis, one of the few yeasts capable of utilizing lactose, is an attractive host alternative to Saccharomyces cerevisiae for heterologous protein production. Here we established K. lactis as a powerful host for the quantitative production of the murine PDE5 isoforms. Using the promoter of the highly expressed KlADH3 gene, multicopy plasmids were engineered to produce the native and recombinant Mus musculus PDE5 in K. lactis. Yeast cells produced large amounts of the purified A1, A2 and A3 isoforms displaying Km, Vmax and Sildenafil inhibition values similar to those of the native murine enzymes. PDE5, whose yield was nearly 1 mg/g wet weight biomass for all three isozymes (30 mg/L culture), is well tolerated by K. lactis cells without major growth deficiencies or interference with the endogenous cAMP/cGMP signal transduction pathways. To our knowledge, this is the first time that the entire PDE5 isozyme family containing both regulatory and catalytic domains has been produced at high levels in a heterologous eukaryotic organism. K. lactis has been shown to be a very promising host platform for large scale production of mammalian PDEs for biochemical and structural studies and for the development of new specific PDE inhibitors for therapeutic applications in many pathologies.

  20. PDE-4 Inhibition Rescues Aberrant Synaptic Plasticity in Drosophila and Mouse Models of Fragile X Syndrome

    PubMed Central

    Choi, Catherine H.; Schoenfeld, Brian P.; Weisz, Eliana D.; Bell, Aaron J.; Chambers, Daniel B.; Hinchey, Joseph; Choi, Richard J.; Hinchey, Paul; Kollaros, Maria; Gertner, Michael J.; Ferrick, Neal J.; Terlizzi, Allison M.; Yohn, Nicole; Koenigsberg, Eric; Liebelt, David A.; Zukin, R. Suzanne; Woo, Newton H.; Tranfaglia, Michael R.; Louneva, Natalia; Arnold, Steven E.; Siegel, Steven J.

    2015-01-01

    Fragile X syndrome (FXS) is the leading cause of both intellectual disability and autism resulting from a single gene mutation. Previously, we characterized cognitive impairments and brain structural defects in a Drosophila model of FXS and demonstrated that these impairments were rescued by treatment with metabotropic glutamate receptor (mGluR) antagonists or lithium. A well-documented biochemical defect observed in fly and mouse FXS models and FXS patients is low cAMP levels. cAMP levels can be regulated by mGluR signaling. Herein, we demonstrate PDE-4 inhibition as a therapeutic strategy to ameliorate memory impairments and brain structural defects in the Drosophila model of fragile X. Furthermore, we examine the effects of PDE-4 inhibition by pharmacologic treatment in the fragile X mouse model. We demonstrate that acute inhibition of PDE-4 by pharmacologic treatment in hippocampal slices rescues the enhanced mGluR-dependent LTD phenotype observed in FXS mice. Additionally, we find that chronic treatment of FXS model mice, in adulthood, also restores the level of mGluR-dependent LTD to that observed in wild-type animals. Translating the findings of successful pharmacologic intervention from the Drosophila model into the mouse model of FXS is an important advance, in that this identifies and validates PDE-4 inhibition as a potential therapeutic intervention for the treatment of individuals afflicted with FXS. PMID:25568131

  1. Implementation and performance of parallel Prolog interpreter

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wei, S.; Kale, L.V.; Balkrishna, R.

    1988-01-01

    In this paper, the authors discuss the implementation of a parallel Prolog interpreter on different parallel machines. The implementation is based on the REDUCE-OR process model, which exploits both AND and OR parallelism in logic programs. It is machine independent as it runs on top of the chare-kernel, a machine-independent parallel programming system. The authors also give the performance of the interpreter running a diverse set of benchmark programs on parallel machines, including shared memory systems (an Alliant FX/8, a Sequent and a MultiMax) and a non-shared memory system (an Intel iPSC/32 hypercube), in addition to its performance on a multiprocessor simulation system.

  2. Virtual earthquake engineering laboratory with physics-based degrading materials on parallel computers

    NASA Astrophysics Data System (ADS)

    Cho, In Ho

    -scale reinforced concrete (RC) structures under cyclic loading are proposed. Quantitative comparison of state-of-the-art parallel strategies, in terms of factorization, had been carried out, leading to the problem-optimized solver, which successfully exploits the penalty method and the banded nature of the system. In particular, the penalty method employed imparts considerable smoothness to the global response, which yields a practical superiority of the parallel triangular system solver over other advanced solvers such as the parallel preconditioned conjugate gradient method. Other salient issues on parallelization are also addressed. The parallel platform established offers unprecedented access to simulations of real-scale structures, giving new understanding about the physics-based mechanisms adopted and probabilistic randomness at the entire system level. In particular, the platform enables bold simulations of real-scale RC structures exposed to cyclic loading: an H-shaped wall system and a 4-story T-shaped wall system. The simulations show the desired capability of accurate prediction of global force-displacement responses, postpeak softening behavior, and compressive buckling of longitudinal steel bars. It is fascinating to see that intrinsic randomness of the 3d interlocking model appears to cause "localized" damage of the real-scale structures, which is consistent with reported observations in different fields such as granular media. Equipped with accuracy, stability and scalability as demonstrated so far, the parallel platform is believed to serve as a fertile ground for introducing further physical mechanisms into various research fields as well as the earthquake engineering community. In the near future, it can be further expanded to run in concert with reliable FEA programs such as FRAME3d or OPENSEES. Following the central notion of "multiscale" analysis technique, actual infrastructures exposed to extreme natural hazard can be successfully tackled by this next generation analysis

  3. Building Blocks for Reliable Complex Nonlinear Numerical Simulations. Chapter 2

    NASA Technical Reports Server (NTRS)

    Yee, H. C.; Mansour, Nagi N. (Technical Monitor)

    2001-01-01

    This chapter describes some of the building blocks to ensure a higher level of confidence in the predictability and reliability (PAR) of numerical simulation of multiscale complex nonlinear problems. The focus is on relating PAR of numerical simulations with complex nonlinear phenomena of numerics. To isolate sources of numerical uncertainties, the possible discrepancy between the chosen partial differential equation (PDE) model and the real physics and/or experimental data is set aside. The discussion is restricted to how well numerical schemes can mimic the solution behavior of the underlying PDE model for finite time steps and grid spacings. The situation is complicated by the fact that the available theory for the understanding of nonlinear behavior of numerics is not at a stage to fully analyze the nonlinear Euler and Navier-Stokes equations. The discussion is based on the knowledge gained for nonlinear model problems with known analytical solutions to identify and explain the possible sources and remedies of numerical uncertainties in practical computations. Examples relevant to turbulent flow computations are included.

  4. Large-eddy simulations of compressible convection on massively parallel computers. [stellar physics]

    NASA Technical Reports Server (NTRS)

    Xie, Xin; Toomre, Juri

    1993-01-01

    We report preliminary implementation of the large-eddy simulation (LES) technique in 2D simulations of compressible convection carried out on the CM-2 massively parallel computer. The convective flow fields in our simulations possess structures similar to those found in a number of direct simulations, with roll-like flows coherent across the entire depth of the layer that spans several density scale heights. Our detailed assessment of the effects of various subgrid scale (SGS) terms reveals that they may affect the gross character of convection. Yet, somewhat surprisingly, we find that our LES solutions, and another in which the SGS terms are turned off, only show modest differences. The resulting 2D flows realized here are rather laminar in character, and achieving substantial turbulence may require stronger forcing and less dissipation.

  5. BioFVM: an efficient, parallelized diffusive transport solver for 3-D biological simulations

    PubMed Central

    Ghaffarizadeh, Ahmadreza; Friedman, Samuel H.; Macklin, Paul

    2016-01-01

    Motivation: Computational models of multicellular systems require solving systems of PDEs for release, uptake, decay and diffusion of multiple substrates in 3D, particularly when incorporating the impact of drugs, growth substrates and signaling factors on cell receptors and subcellular systems biology. Results: We introduce BioFVM, a diffusive transport solver tailored to biological problems. BioFVM can simulate release and uptake of many substrates by cell and bulk sources, diffusion and decay in large 3D domains. It has been parallelized with OpenMP, allowing efficient simulations on desktop workstations or single supercomputer nodes. The code is stable even for large time steps, with linear computational cost scalings. Solutions are first-order accurate in time and second-order accurate in space. The code can be run by itself or as part of a larger simulator. Availability and implementation: BioFVM is written in C++ with parallelization in OpenMP. It is maintained and available for download at http://BioFVM.MathCancer.org and http://BioFVM.sf.net under the Apache License (v2.0). Contact: paul.macklin@usc.edu. Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26656933
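    The properties quoted above (stability for large time steps, first-order accuracy in time, second-order accuracy in space, linear cost) are typical of an implicit finite-volume step solved with a tridiagonal solver. The following is a minimal one-dimensional sketch of such a step, not BioFVM's code: the function names, the zero-flux boundary treatment and the parameter values are assumptions chosen for illustration.

```python
import numpy as np


def thomas_solve(lower, diag, upper, rhs):
    """Solve a tridiagonal system in O(n) with the Thomas algorithm.

    lower[i] multiplies x[i-1] and upper[i] multiplies x[i+1] in row i;
    lower[0] and upper[-1] are ignored. No pivoting is performed, which
    is safe here because the matrix is diagonally dominant.
    """
    n = len(diag)
    c = np.zeros(n)
    d = np.zeros(n)
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = np.zeros(n)
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def implicit_step(u, D, decay, dx, dt):
    """One backward-Euler step of du/dt = D*u_xx - decay*u.

    Uses second-order central differences in space and zero-flux
    boundaries (an assumption of this sketch).
    """
    n = len(u)
    r = D * dt / dx**2
    lower = np.full(n, -r)
    upper = np.full(n, -r)
    diag = np.full(n, 1.0 + 2.0 * r + decay * dt)
    # Boundary cells exchange with only one neighbor.
    diag[0] = diag[-1] = 1.0 + r + decay * dt
    lower[0] = 0.0
    upper[-1] = 0.0
    return thomas_solve(lower, diag, upper, u)


# A point source diffusing with a time step far beyond the explicit limit.
u0 = np.zeros(50)
u0[25] = 1.0
u1 = implicit_step(u0, D=1.0, decay=0.0, dx=0.1, dt=10.0)
```

    With D*dt/dx^2 = 1000 an explicit scheme would be unstable, whereas the implicit step stays bounded and, without decay, conserves the total amount of substrate.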

  6. Evaluating the performance of parallel subsurface simulators: An illustrative example with PFLOTRAN

    PubMed Central

    Hammond, G E; Lichtner, P C; Mills, R T

    2014-01-01

    To better inform the subsurface scientist on the expected performance of parallel simulators, this work investigates performance of the reactive multiphase flow and multicomponent biogeochemical transport code PFLOTRAN as it is applied to several realistic modeling scenarios run on the Jaguar supercomputer. After a brief introduction to the code's parallel layout and code design, PFLOTRAN's parallel performance (measured through strong and weak scalability analyses) is evaluated in the context of conceptual model layout, software and algorithmic design, and known hardware limitations. PFLOTRAN scales well (with regard to strong scaling) for three realistic problem scenarios: (1) in situ leaching of copper from a mineral ore deposit within a 5-spot flow regime, (2) transient flow and solute transport within a regional doublet, and (3) a real-world problem involving uranium surface complexation within a heterogeneous and extremely dynamic variably saturated flow field. Weak scalability is discussed in detail for the regional doublet problem, and several difficulties with its interpretation are noted. PMID:25506097
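    For readers unfamiliar with the two scalability measures named above, the sketch below shows how they are commonly computed from wall-clock timings. The function names and the timings are hypothetical and are not PFLOTRAN measurements.

```python
def strong_scaling(times):
    """Speedup and efficiency for a fixed total problem size.

    `times` maps process count -> wall-clock seconds. The smallest
    process count present is used as the baseline.
    """
    p0 = min(times)
    t0 = times[p0]
    return {
        p: {"speedup": t0 / t, "efficiency": (t0 * p0) / (t * p)}
        for p, t in sorted(times.items())
    }


def weak_scaling(times):
    """Efficiency when the problem size per process is held fixed.

    Ideal weak scaling keeps wall-clock time constant, so efficiency is
    the baseline time divided by the measured time.
    """
    p0 = min(times)
    t0 = times[p0]
    return {p: t0 / t for p, t in sorted(times.items())}


# Hypothetical timings, for illustration only.
strong = strong_scaling({1: 100.0, 2: 50.0, 4: 30.0})
weak = weak_scaling({1: 10.0, 4: 12.5})
```

    In this made-up example the four-process run reaches a speedup of about 3.3 (strong-scaling efficiency about 0.83), and the weak-scaling efficiency at four processes is 0.8.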

  7. Evaluating the performance of parallel subsurface simulators: An illustrative example with PFLOTRAN.

    PubMed

    Hammond, G E; Lichtner, P C; Mills, R T

    2014-01-01

    To better inform the subsurface scientist on the expected performance of parallel simulators, this work investigates performance of the reactive multiphase flow and multicomponent biogeochemical transport code PFLOTRAN as it is applied to several realistic modeling scenarios run on the Jaguar supercomputer. After a brief introduction to the code's parallel layout and code design, PFLOTRAN's parallel performance (measured through strong and weak scalability analyses) is evaluated in the context of conceptual model layout, software and algorithmic design, and known hardware limitations. PFLOTRAN scales well (with regard to strong scaling) for three realistic problem scenarios: (1) in situ leaching of copper from a mineral ore deposit within a 5-spot flow regime, (2) transient flow and solute transport within a regional doublet, and (3) a real-world problem involving uranium surface complexation within a heterogeneous and extremely dynamic variably saturated flow field. Weak scalability is discussed in detail for the regional doublet problem, and several difficulties with its interpretation are noted.

  8. Scalable High Performance Computing: Direct and Large-Eddy Turbulent Flow Simulations Using Massively Parallel Computers

    NASA Technical Reports Server (NTRS)

    Morgan, Philip E.

    2004-01-01

    This final report contains reports of research related to the tasks "Scalable High Performance Computing: Direct and Large-Eddy Turbulent Flow Simulations Using Massively Parallel Computers" and "Develop High-Performance Time-Domain Computational Electromagnetics Capability for RCS Prediction, Wave Propagation in Dispersive Media, and Dual-Use Applications." The discussion of Scalable High Performance Computing reports on three objectives: validate, assess scalability, and apply two parallel flow solvers for three-dimensional Navier-Stokes flows; develop and validate a high-order parallel solver for Direct Numerical Simulations (DNS) and Large Eddy Simulation (LES) problems; and investigate and develop a high-order Reynolds averaged Navier-Stokes turbulence model. The discussion of High-Performance Time-Domain Computational Electromagnetics reports on five objectives: enhance an electromagnetics code (CHARGE) to be able to effectively model antenna problems; apply lessons learned in high-order/spectral solution of swirling 3D jets to the electromagnetics project; transition a high-order fluids code, FDL3DI, to be able to solve Maxwell's Equations using compact-differencing; develop and demonstrate improved radiation absorbing boundary conditions for high-order CEM; and extend the high-order CEM solver to address variable material properties. The report also contains a review of work done by the systems engineer.

  9. A parallel program for numerical simulation of discrete fracture network and groundwater flow

    NASA Astrophysics Data System (ADS)

    Huang, Ting-Wei; Liou, Tai-Sheng; Kalatehjari, Roohollah

    2017-04-01

    The ability to model fluid flow in a Discrete Fracture Network (DFN) is critical to various applications such as exploration of reserves in geothermal and petroleum reservoirs, geological sequestration of carbon dioxide and final disposal of spent nuclear fuels. Although several commercial or academic DFN flow simulators are already available (e.g., FracMan and DFNWORKS), challenges in terms of computational efficiency and three-dimensional visualization still remain, which motivates this study to develop a new DFN and flow simulator. A new DFN and flow simulator, DFNbox, was written in C++ under a cross-platform software development framework provided by Qt. DFNbox integrates the following capabilities into a user-friendly drop-down menu interface: DFN simulation and clipping, 3D mesh generation, fracture data analysis, connectivity analysis, flow path analysis and steady-state groundwater flow simulation. All three-dimensional visualization graphics were developed using the free OpenGL API. Similar to other DFN simulators, fractures are conceptualized as a random point process in space, with stochastic characteristics represented by orientation, size, transmissivity and aperture. Fracture meshing was implemented by Delaunay triangulation for visualization but not for flow simulation purposes. The boundary element method was used for flow simulations such that only the unknown head or flux along exterior and intersection boundaries is needed for solving the flow field in the DFN. Parallel computation was taken into account in developing DFNbox for those calculations where it is applicable. For example, the time-consuming sequential code for fracture clipping calculations has been completely replaced by a highly efficient parallel one. This can greatly enhance computational efficiency, especially on multi-thread platforms. Furthermore, DFNbox has been successfully tested on Windows and Linux systems with equally good performance.

  10. Providing a parallel and distributed capability for JMASS using SPEEDES

    NASA Astrophysics Data System (ADS)

    Valinski, Maria; Driscoll, Jonathan; McGraw, Robert M.; Meyer, Bob

    2002-07-01

    The Joint Modeling And Simulation System (JMASS) is a Tri-Service simulation environment that supports engineering and engagement-level simulations. As JMASS is expanded to support other Tri-Service domains, the current set of modeling services must be expanded for High Performance Computing (HPC) applications by adding support for advanced time-management algorithms, parallel and distributed topologies, and high speed communications. By providing support for these services, JMASS can better address modeling domains requiring parallel, computationally intense calculations such as clutter, vulnerability and lethality calculations, and underwater-based scenarios. A risk reduction effort implementing some HPC services for JMASS using the SPEEDES (Synchronous Parallel Environment for Emulation and Discrete Event Simulation) Simulation Framework has recently concluded. As an artifact of the JMASS-SPEEDES integration, not only can HPC functionality be brought to the JMASS program through SPEEDES, but an additional HLA-based capability can be demonstrated that further addresses interoperability issues. The JMASS-SPEEDES integration provided a means of adding HLA capability to preexisting JMASS scenarios through an implementation of the standard JMASS port communication mechanism that allows players to communicate.

  11. Octree-based, GPU implementation of a continuous cellular automaton for the simulation of complex, evolving surfaces

    NASA Astrophysics Data System (ADS)

    Ferrando, N.; Gosálvez, M. A.; Cerdá, J.; Gadea, R.; Sato, K.

    2011-03-01

    Presently, dynamic surface-based models are required to contain increasingly larger numbers of points and to propagate them over longer time periods. For large numbers of surface points, the octree data structure can be used as a balance between low memory occupation and relatively rapid access to the stored data. For evolution rules that depend on neighborhood states, extended simulation periods can be obtained by using simplified atomistic propagation models, such as the Cellular Automata (CA). This method, however, has an intrinsic parallel updating nature and the corresponding simulations are highly inefficient when performed on classical Central Processing Units (CPUs), which are designed for the sequential execution of tasks. In this paper, a series of guidelines is presented for the efficient adaptation of octree-based, CA simulations of complex, evolving surfaces into massively parallel computing hardware. A Graphics Processing Unit (GPU) is used as a cost-efficient example of the parallel architectures. For the actual simulations, we consider the surface propagation during anisotropic wet chemical etching of silicon as a computationally challenging process with a wide-spread use in microengineering applications. A continuous CA model that is intrinsically parallel in nature is used for the time evolution. Our study strongly indicates that parallel computations of dynamically evolving surfaces simulated using CA methods are significantly benefited by the incorporation of octrees as support data structures, substantially decreasing the overall computational time and memory usage.
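    The "intrinsic parallel updating nature" of a cellular automaton comes from the fact that every cell's new state depends only on the previous states of its neighbors. The sketch below shows a synchronous update of a continuous-valued CA on a small periodic 2D grid. The removal rule, the function name `ca_step` and all parameters are hypothetical; they are not the etching model of the paper, and no octree is used here.

```python
import numpy as np


def ca_step(occ, rate, dt):
    """One synchronous update of a continuous CA on a periodic 2D grid.

    `occ` holds cell occupancies in [0, 1]. Hypothetical rule: a cell
    loses occupancy in proportion to how empty its four neighbors are.
    Only the old state is read, so all cells can be updated in parallel.
    """
    exposed = sum(
        1.0 - np.roll(occ, shift, axis=axis)
        for axis in (0, 1)
        for shift in (1, -1)
    )
    new = occ - rate * dt * exposed  # written to a new array, not in place
    return np.clip(new, 0.0, 1.0)


# A full grid with one empty cell: only its four neighbors start to erode.
grid = np.ones((5, 5))
grid[2, 2] = 0.0
after = ca_step(grid, rate=1.0, dt=0.1)
```

    Because the update reads one array and writes another, the per-cell work maps directly onto one GPU thread per cell; a sequential CPU loop has to visit the same cells one after another.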

  12. Hybrid simulations of a parallel collisionless shock in the large plasma device

    DOE PAGES

    Weidl, Martin S.; Winske, Dan; Jenko, Frank; ...

    2016-12-01

    We present two-dimensional hybrid kinetic/magnetohydrodynamic simulations of planned laser-ablation experiments in the Large Plasma Device (LAPD). Our results, based on parameters which have been validated in previous experiments, show that a parallel collisionless shock can begin forming within the available space. Carbon-debris ions that stream along the magnetic-field direction with a blow-off speed of four times the Alfvén velocity excite strong magnetic fluctuations, eventually transferring part of their kinetic energy to the surrounding hydrogen ions. This acceleration and compression of the background plasma creates a shock front, which satisfies the Rankine-Hugoniot conditions and can therefore propagate on its own. Furthermore, we analyze the upstream turbulence and show that it is dominated by the right-hand resonant instability.

  13. Discovery of novel PDE9 inhibitors capable of inhibiting Aβ aggregation as potential candidates for the treatment of Alzheimer's disease.

    PubMed

    Su, Tao; Zhang, Tianhua; Xie, Shishun; Yan, Jun; Wu, Yinuo; Li, Xingshu; Huang, Ling; Luo, Hai-Bin

    2016-02-25

    Recently, phosphodiesterase-9 (PDE9) inhibitors and biometal-chelators have received much attention as potential therapeutics for the treatment of Alzheimer's disease (AD). Here, we designed, synthesized, and evaluated a novel series of PDE9 inhibitors with the ability to chelate metal ions. The bioassay results showed that most of these molecules strongly inhibited PDE9 activity. Compound 16 showed an IC50 of 34 nM against PDE9 and more than 55-fold selectivity against other PDEs. In addition, this compound displayed remarkable metal-chelating capacity and a considerable ability to halt copper redox cycling. Notably, in comparison to the reference compound clioquinol, it inhibited metal-induced Aβ(1-42) aggregation more effectively and promoted greater disassembly of the highly structured Aβ fibrils generated through Cu(2+)-induced Aβ aggregation. These activities of 16, together with its favorable blood-brain barrier permeability, suggest that 16 may be a promising compound for treatment of AD.

  14. GPU-based parallel algorithm for blind image restoration using midfrequency-based methods

    NASA Astrophysics Data System (ADS)

    Xie, Lang; Luo, Yi-han; Bao, Qi-liang

    2013-08-01

    GPU-based general-purpose computing is a new branch of modern parallel computing, so the study of parallel algorithms specially designed for GPU hardware architecture is of great significance. In order to solve the problem of high computational complexity and poor real-time performance in blind image restoration, the midfrequency-based algorithm for blind image restoration was analyzed and improved in this paper. Furthermore, a midfrequency-based filtering method is also used to restore the image with hardly any recursion or iteration. Combining the algorithm with data intensiveness, data parallel computing and the GPU execution model of single instruction and multiple threads, a new parallel midfrequency-based algorithm for blind image restoration is proposed in this paper, which is suitable for stream computing on the GPU. In this algorithm, the GPU is utilized to accelerate the estimation of class-G point spread functions and midfrequency-based filtering. Aiming at better management of the GPU threads, the threads in a grid are scheduled according to the decomposition of the filtering data in the frequency domain after the optimization of data access and the communication between the host and the device. The kernel parallelism structure is determined by the decomposition of the filtering data to ensure the transmission rate to get around the memory bandwidth limitation. The results show that, with the new algorithm, the operational speed is significantly increased and the real-time performance of image restoration is effectively improved, especially for high-resolution images.

  15. Efficient Parallel Kernel Solvers for Computational Fluid Dynamics Applications

    NASA Technical Reports Server (NTRS)

    Sun, Xian-He

    1997-01-01

    Distributed-memory parallel computers dominate today's parallel computing arena. These machines, such as Intel Paragon, IBM SP2, and Cray Origin2000, have successfully delivered high performance computing power for solving some of the so-called "grand-challenge" problems. Despite initial success, parallel machines have not been widely accepted in production engineering environments due to the complexity of parallel programming. On a parallel computing system, a task has to be partitioned and distributed appropriately among processors to reduce communication cost and to attain load balance. More importantly, even with careful partitioning and mapping, the performance of an algorithm may still be unsatisfactory, since conventional sequential algorithms may be serial in nature and may not be implemented efficiently on parallel machines. In many cases, new algorithms have to be introduced to increase parallel performance. In order to achieve optimal performance, in addition to partitioning and mapping, a careful performance study should be conducted for a given application to find a good algorithm-machine combination. This process, however, is usually painful and elusive. The goal of this project is to design and develop efficient parallel algorithms for highly accurate Computational Fluid Dynamics (CFD) simulations and other engineering applications. The work plan is to 1) develop highly accurate parallel numerical algorithms, 2) conduct preliminary testing to verify the effectiveness and potential of these algorithms, and 3) incorporate newly developed algorithms into actual simulation packages. The work plan has been well achieved. Two highly accurate, efficient Poisson solvers have been developed and tested based on two different approaches: (1) adopting a mathematical geometry which has a better capacity to describe the fluid, and (2) using a compact scheme to gain high order accuracy in numerical discretization. The previously developed Parallel Diagonal Dominant (PDD) algorithm

  16. A piezoelectric six-DOF vibration energy harvester based on parallel mechanism: dynamic modeling, simulation, and experiment

    NASA Astrophysics Data System (ADS)

    Yuan, G.; Wang, D. H.

    2017-03-01

    Multi-directional and multi-degree-of-freedom (multi-DOF) vibration energy harvesting has attracted increasing research interest in recent years. In this paper, the principle of a piezoelectric six-DOF vibration energy harvester based on a parallel mechanism is proposed to convert the energy of the six-DOF vibration to single-DOF vibrations of the limbs on the energy harvester and output voltages. The dynamic model of the piezoelectric six-DOF vibration energy harvester is established to estimate the vibrations of the limbs. On this basis, a Stewart-type piezoelectric six-DOF vibration energy harvester is developed and explored. In order to validate the established dynamic model and the analysis results, the simulation model of the Stewart-type piezoelectric six-DOF vibration energy harvester is built and tested with different vibration excitations by SimMechanics, and some preliminary experiments are carried out. The results show that the vibration of the limbs on the piezoelectric six-DOF vibration energy harvester can be estimated by the established dynamic model. The developed Stewart-type piezoelectric six-DOF vibration energy harvester can harvest the energy of multi-directional linear vibration and multi-axis rotating vibration with resonance frequencies of 17 Hz, 25 Hz, and 47 Hz. Moreover, the resonance frequencies of the developed piezoelectric six-DOF vibration energy harvester are not affected by changes in the direction of the vibration excitation.

  17. Applications of New Surrogate Global Optimization Algorithms including Efficient Synchronous and Asynchronous Parallelism for Calibration of Expensive Nonlinear Geophysical Simulation Models.

    NASA Astrophysics Data System (ADS)

    Shoemaker, C. A.; Pang, M.; Akhtar, T.; Bindel, D.

    2016-12-01

    New parallel surrogate global optimization algorithms are developed and applied to objective functions that are expensive simulations (possibly with multiple local minima). The algorithms can be applied to most geophysical simulations, including those with nonlinear partial differential equations. The optimization does not require simulations be parallelized. Asynchronous (and synchronous) parallel execution is available in the optimization toolbox "pySOT". The parallel algorithms are modified from serial to eliminate fine grained parallelism. The optimization is computed with open source software pySOT, a Surrogate Global Optimization Toolbox that allows user to pick the type of surrogate (or ensembles), the search procedure on surrogate, and the type of parallelism (synchronous or asynchronous). pySOT also allows the user to develop new algorithms by modifying parts of the code. In the applications here, the objective function takes up to 30 minutes for one simulation, and serial optimization can take over 200 hours. Results from Yellowstone (NSF) and NCSS (Singapore) supercomputers are given for groundwater contaminant hydrology simulations with applications to model parameter estimation and decontamination management. All results are compared with alternatives. The first results are for optimization of pumping at many wells to reduce cost for decontamination of groundwater at a superfund site. The optimization runs with up to 128 processors. Superlinear speed up is obtained for up to 16 processors, and efficiency with 64 processors is over 80%. Each evaluation of the objective function requires the solution of nonlinear partial differential equations to describe the impact of spatially distributed pumping and model parameters on model predictions for the spatial and temporal distribution of groundwater contaminants. The second application uses an asynchronous parallel global optimization for groundwater quality model calibration. 
The time for a single objective
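
    The speedup and efficiency figures quoted in the abstract above follow the usual definitions: speedup is serial time divided by parallel time, and efficiency is speedup divided by the processor count. The sketch below only illustrates those definitions. The function names and the timings in the usage lines are hypothetical and are not taken from the study.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Speedup S_p = T_1 / T_p for wall-clock times in the same unit."""
    if t_serial <= 0 or t_parallel <= 0:
        raise ValueError("times must be positive")
    return t_serial / t_parallel


def efficiency(t_serial: float, t_parallel: float, processors: int) -> float:
    """Parallel efficiency E_p = S_p / p."""
    if processors < 1:
        raise ValueError("processors must be at least 1")
    return speedup(t_serial, t_parallel) / processors


def is_superlinear(t_serial: float, t_parallel: float, processors: int) -> bool:
    """True when the speedup exceeds the number of processors."""
    return speedup(t_serial, t_parallel) > processors


# Hypothetical timings in hours, chosen only to exercise the functions.
print(efficiency(200.0, 3.75, 64))
print(is_superlinear(200.0, 10.0, 16))
```

    With these made-up numbers the 64-processor run has an efficiency a little above 0.83, and the 16-processor run counts as superlinear because its speedup of 20 exceeds 16.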

  18. Particle/Continuum Hybrid Simulation in a Parallel Computing Environment

    NASA Technical Reports Server (NTRS)

    Baganoff, Donald

    1996-01-01

    The objective of this study was to modify an existing parallel particle code based on the direct simulation Monte Carlo (DSMC) method to include a Navier-Stokes (NS) calculation so that a hybrid solution could be developed. In carrying out this work, it was determined that the following five issues had to be addressed before extensive program development of a three dimensional capability was pursued: (1) find a set of one-sided kinetic fluxes that are fully compatible with the DSMC method, (2) develop a finite volume scheme to make use of these one-sided kinetic fluxes, (3) make use of the one-sided kinetic fluxes together with DSMC type boundary conditions at a material surface so that velocity slip and temperature slip arise naturally for near-continuum conditions, (4) find a suitable sampling scheme so that the values of the one-sided fluxes predicted by the NS solution at an interface between the two domains can be converted into the correct distribution of particles to be introduced into the DSMC domain, (5) carry out a suitable number of tests to confirm that the developed concepts are valid, individually and in concert for a hybrid scheme.

  19. Simulated Wake Characteristics Data for Closely Spaced Parallel Runway Operations Analysis

    NASA Technical Reports Server (NTRS)

    Guerreiro, Nelson M.; Neitzke, Kurt W.

    2012-01-01

    A simulation experiment was performed to generate and compile wake characteristics data relevant to the evaluation and feasibility analysis of closely spaced parallel runway (CSPR) operational concepts. While the experiment in this work is not tailored to any particular operational concept, the generated data applies to the broader class of CSPR concepts, where a trailing aircraft on a CSPR approach is required to stay ahead of the wake vortices generated by a lead aircraft on an adjacent CSPR. Data for wake age, circulation strength, and wake altitude change at various lateral offset distances from the wake-generating lead aircraft approach path were compiled for a set of nine aircraft spanning the full range of FAA and ICAO wake classifications. A total of 54 scenarios were simulated to generate data related to key parameters that determine wake behavior. Of particular interest are wake age characteristics that can be used to evaluate both time- and distance-based in-trail separation concepts for all aircraft wake-class combinations. A simple first-order difference model was developed to enable the computation of wake parameter estimates for aircraft models having weight, wingspan and speed characteristics similar to those of the nine aircraft modeled in this work.

  20. PDE-4 inhibition rescues aberrant synaptic plasticity in Drosophila and mouse models of fragile X syndrome.

    PubMed

    Choi, Catherine H; Schoenfeld, Brian P; Weisz, Eliana D; Bell, Aaron J; Chambers, Daniel B; Hinchey, Joseph; Choi, Richard J; Hinchey, Paul; Kollaros, Maria; Gertner, Michael J; Ferrick, Neal J; Terlizzi, Allison M; Yohn, Nicole; Koenigsberg, Eric; Liebelt, David A; Zukin, R Suzanne; Woo, Newton H; Tranfaglia, Michael R; Louneva, Natalia; Arnold, Steven E; Siegel, Steven J; Bolduc, Francois V; McDonald, Thomas V; Jongens, Thomas A; McBride, Sean M J

    2015-01-07

    Fragile X syndrome (FXS) is the leading cause of both intellectual disability and autism resulting from a single gene mutation. Previously, we characterized cognitive impairments and brain structural defects in a Drosophila model of FXS and demonstrated that these impairments were rescued by treatment with metabotropic glutamate receptor (mGluR) antagonists or lithium. A well-documented biochemical defect observed in fly and mouse FXS models and FXS patients is low cAMP levels. cAMP levels can be regulated by mGluR signaling. Herein, we demonstrate PDE-4 inhibition as a therapeutic strategy to ameliorate memory impairments and brain structural defects in the Drosophila model of fragile X. Furthermore, we examine the effects of PDE-4 inhibition by pharmacologic treatment in the fragile X mouse model. We demonstrate that acute inhibition of PDE-4 by pharmacologic treatment in hippocampal slices rescues the enhanced mGluR-dependent LTD phenotype observed in FXS mice. Additionally, we find that chronic treatment of FXS model mice, in adulthood, also restores the level of mGluR-dependent LTD to that observed in wild-type animals. Translating the findings of successful pharmacologic intervention from the Drosophila model into the mouse model of FXS is an important advance, in that this identifies and validates PDE-4 inhibition as potential therapeutic intervention for the treatment of individuals afflicted with FXS.

  1. Computational performance of a smoothed particle hydrodynamics simulation for shared-memory parallel computing

    NASA Astrophysics Data System (ADS)

    Nishiura, Daisuke; Furuichi, Mikito; Sakaguchi, Hide

    2015-09-01

    The computational performance of a smoothed particle hydrodynamics (SPH) simulation is investigated for three types of current shared-memory parallel computer devices: many integrated core (MIC) processors, graphics processing units (GPUs), and multi-core CPUs. We are especially interested in efficient shared-memory allocation methods for each chipset, because the efficient data access patterns differ between compute unified device architecture (CUDA) programming for GPUs and OpenMP programming for MIC processors and multi-core CPUs. We first introduce several parallel implementation techniques for the SPH code, and then examine these on our target computer architectures to determine the most effective algorithms for each processor unit. In addition, we evaluate the effective computing performance and power efficiency of the SPH simulation on each architecture, as these are critical metrics for overall performance in a multi-device environment. In our benchmark test, the GPU is found to produce the best arithmetic performance as a standalone device unit, and gives the most efficient power consumption. The multi-core CPU obtains the most effective computing performance. The computational speed of the MIC processor on Xeon Phi approached that of two Xeon CPUs. This indicates that using MICs is an attractive choice for existing SPH codes on multi-core CPUs parallelized by OpenMP, as it gains computational acceleration without the need for significant changes to the source code.

  2. PCSIM: A Parallel Simulation Environment for Neural Circuits Fully Integrated with Python

    PubMed Central

    Pecevski, Dejan; Natschläger, Thomas; Schuch, Klaus

    2008-01-01

    The Parallel Circuit SIMulator (PCSIM) is a software package for simulation of neural circuits. It is primarily designed for distributed simulation of large scale networks of spiking point neurons. Although its computational core is written in C++, PCSIM's primary interface is implemented in the Python programming language, which is a powerful programming environment and allows the user to easily integrate the neural circuit simulator with data analysis and visualization tools to manage the full neural modeling life cycle. The main focus of this paper is to describe PCSIM's full integration into Python and the benefits thereof. In particular we will investigate how the automatically generated bidirectional interface and PCSIM's object-oriented modular framework enable the user to adopt a hybrid modeling approach: using and extending PCSIM's functionality either employing pure Python or C++ and thus combining the advantages of both worlds. Furthermore, we describe several supplementary PCSIM packages written in pure Python and tailored towards setting up and analyzing neural simulations. PMID:19543450

  3. Vortex-induced vibration of two parallel risers: Experimental test and numerical simulation

    NASA Astrophysics Data System (ADS)

    Huang, Weiping; Zhou, Yang; Chen, Haiming

    2016-04-01

    The vortex-induced vibration of two identical rigidly mounted risers in a parallel arrangement was studied using Ansys-CFX and model tests. The vortex shedding and force were recorded to determine the effect of spacing on the two-degree-of-freedom oscillation of the risers. CFX was used to study the single riser and two parallel risers at spacings of 2-8 D, considering the coupling effect. Because of the limited width of the water channel, only three riser spacings, 2 D, 3 D, and 4 D, were tested to validate the characteristics of the two parallel risers by comparison with the numerical simulation. The results indicate that the lift force changes significantly as the spacing increases, and that the lift force of the two parallel risers reaches its maximum at 3 D spacing. The vortex shedding of the risers at 3 D spacing shows that a variable velocity field with the same frequency as the vortex shedding is generated in the overlapped area, thus equalizing the period of the drag force to that of the lift force. It can be concluded that the interaction between the two parallel risers is significant when the distance between them is small, because the riser trajectory changes from an oval to a figure-8 curve as the spacing is increased. The phase difference of the lift force between the two risers also changes with the spacing.

  4. Turbomachinery CFD on parallel computers

    NASA Technical Reports Server (NTRS)

    Blech, Richard A.; Milner, Edward J.; Quealy, Angela; Townsend, Scott E.

    1992-01-01

    The role of multistage turbomachinery simulation in the development of propulsion system models is discussed. Particularly, the need for simulations with higher fidelity and faster turnaround time is highlighted. It is shown how such fast simulations can be used in engineering-oriented environments. The use of parallel processing to achieve the required turnaround times is discussed. Current work by several researchers in this area is summarized. Parallel turbomachinery CFD research at the NASA Lewis Research Center is then highlighted. These efforts are focused on implementing the average-passage turbomachinery model on MIMD, distributed memory parallel computers. Performance results are given for inviscid, single blade row and viscous, multistage applications on several parallel computers, including networked workstations.

  5. A CFD Heterogeneous Parallel Solver Based on Collaborating CPU and GPU

    NASA Astrophysics Data System (ADS)

    Lai, Jianqi; Tian, Zhengyu; Li, Hua; Pan, Sha

    2018-03-01

    Because graphics processing units (GPUs) offer strong floating-point performance and high memory bandwidth for data parallelism, they have been widely used in general-purpose computing areas such as molecular dynamics (MD) and computational fluid dynamics (CFD). The emergence of the compute unified device architecture (CUDA), which reduces the complexity of compiling programs, brings great opportunities to CFD. There are three modes for the parallel solution of the NS equations: a parallel solver based on the CPU, a parallel solver based on the GPU, and a heterogeneous parallel solver based on collaborating CPU and GPU. GPUs are relatively rich in compute capacity but poor in memory capacity, whereas CPUs are the opposite. To make full use of both GPUs and CPUs, a CFD heterogeneous parallel solver based on collaborating CPU and GPU has been established. Three cases are presented to analyse the solver’s computational accuracy and heterogeneous parallel efficiency. The numerical results agree well with experimental results, which demonstrates that the heterogeneous parallel solver has high computational precision. The speedup on a single GPU is more than 40 for laminar flow; it decreases for turbulent flow but can still exceed 20. Moreover, the speedup increases as the grid size becomes larger.

  6. Research in parallel computing

    NASA Technical Reports Server (NTRS)

    Ortega, James M.; Henderson, Charles

    1994-01-01

    This report summarizes work on parallel computations for NASA Grant NAG-1-1529 for the period 1 Jan. - 30 June 1994. Short summaries on highly parallel preconditioners, target-specific parallel reductions, and simulation of delta-cache protocols are provided.

  7. Automatic Thread-Level Parallelization in the Chombo AMR Library

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Christen, Matthias; Keen, Noel; Ligocki, Terry

    2011-05-26

    The increasing on-chip parallelism has some substantial implications for HPC applications. Currently, hybrid programming models (typically MPI+OpenMP) are employed for mapping software to the hardware in order to leverage the hardware's architectural features. In this paper, we present an approach that automatically introduces thread level parallelism into Chombo, a parallel adaptive mesh refinement framework for finite difference type PDE solvers. In Chombo, core algorithms are specified in the ChomboFortran, a macro language extension to F77 that is part of the Chombo framework. This domain-specific language forms an already used target language for an automatic migration of the large number of existing algorithms into a hybrid MPI+OpenMP implementation. It also provides access to the auto-tuning methodology that enables tuning certain aspects of an algorithm to hardware characteristics. Performance measurements are presented for a few of the most relevant kernels with respect to a specific application benchmark using this technique as well as benchmark results for the entire application. The kernel benchmarks show that, using auto-tuning, up to a factor of 11 in performance was gained with 4 threads with respect to the serial reference implementation.

  8. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers

    DOE PAGES

    Abraham, Mark James; Murtola, Teemu; Schulz, Roland; ...

    2015-07-15

    GROMACS is one of the most widely used open-source and free software codes in chemistry, used primarily for dynamical simulations of biomolecules. It provides a rich set of calculation types, preparation and analysis tools. Several advanced techniques for free-energy calculations are supported. In version 5, it reaches new performance heights, through several new and enhanced parallelization algorithms. These work on every level: SIMD registers inside cores, multithreading, heterogeneous CPU–GPU acceleration, state-of-the-art 3D domain decomposition, and ensemble-level parallelization through built-in replica exchange and the separate Copernicus framework. Finally, the latest best-in-class compressed trajectory storage format is supported.

  9. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Abraham, Mark James; Murtola, Teemu; Schulz, Roland

    GROMACS is one of the most widely used open-source and free software codes in chemistry, used primarily for dynamical simulations of biomolecules. It provides a rich set of calculation types, preparation and analysis tools. Several advanced techniques for free-energy calculations are supported. In version 5, it reaches new performance heights, through several new and enhanced parallelization algorithms. These work on every level: SIMD registers inside cores, multithreading, heterogeneous CPU–GPU acceleration, state-of-the-art 3D domain decomposition, and ensemble-level parallelization through built-in replica exchange and the separate Copernicus framework. Finally, the latest best-in-class compressed trajectory storage format is supported.

  10. Simulation of an array-based neural net model

    NASA Technical Reports Server (NTRS)

    Barnden, John A.

    1987-01-01

    Research in cognitive science suggests that much of cognition involves the rapid manipulation of complex data structures. However, it is very unclear how this could be realized in neural networks or connectionist systems. A core question is: how could the interconnectivity of items in an abstract-level data structure be neurally encoded? The answer appeals mainly to positional relationships between activity patterns within neural arrays, rather than directly to neural connections in the traditional way. The new method was initially devised to account for abstract symbolic data structures, but it also supports cognitively useful spatial analogue, image-like representations. As the neural model is based on massive, uniform, parallel computations over 2D arrays, the massively parallel processor is a convenient tool for simulation work, although there are complications in using the machine to the fullest advantage. An MPP Pascal simulation program for a small pilot version of the model is running.

  11. Neurite, a Finite Difference Large Scale Parallel Program for the Simulation of Electrical Signal Propagation in Neurites under Mechanical Loading

    PubMed Central

    García-Grajales, Julián A.; Rucabado, Gabriel; García-Dopico, Antonio; Peña, José-María; Jérusalem, Antoine

    2015-01-01

    With the growing body of research on traumatic brain injury and spinal cord injury, computational neuroscience has recently focused its modeling efforts on neuronal functional deficits following mechanical loading. However, in most of these efforts, cell damage is generally only characterized by purely mechanistic criteria, functions of quantities such as stress, strain or their corresponding rates. The modeling of functional deficits in neurites as a consequence of macroscopic mechanical insults has been rarely explored. In particular, a quantitative mechanically based model of electrophysiological impairment in neuronal cells, Neurite, has only very recently been proposed. In this paper, we present the implementation details of this model: a finite difference parallel program for simulating electrical signal propagation along neurites under mechanical loading. Following the application of a macroscopic strain at a given strain rate produced by a mechanical insult, Neurite is able to simulate the resulting neuronal electrical signal propagation, and thus the corresponding functional deficits. The simulation of the coupled mechanical and electrophysiological behaviors requires computationally expensive calculations that increase in complexity as the network of the simulated cells grows. The solvers implemented in Neurite (explicit and implicit) were therefore parallelized using graphics processing units in order to reduce the burden of the simulation costs of large scale scenarios. Cable Theory and Hodgkin-Huxley models were implemented to account for the electrophysiological passive and active regions of a neurite, respectively, whereas a coupled mechanical model accounting for the neurite mechanical behavior within its surrounding medium was adopted as a link between electrophysiology and mechanics. This paper provides the details of the parallel implementation of Neurite, along with three different application examples: a long myelinated axon, a segmented

  12. Visual Data-Analytics of Large-Scale Parallel Discrete-Event Simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ross, Caitlin; Carothers, Christopher D.; Mubarak, Misbah

    Parallel discrete-event simulation (PDES) is an important tool in the codesign of extreme-scale systems because PDES provides a cost-effective way to evaluate designs of high-performance computing systems. Optimistic synchronization algorithms for PDES, such as Time Warp, allow events to be processed without global synchronization among the processing elements. A rollback mechanism is provided when events are processed out of timestamp order. Although optimistic synchronization protocols enable the scalability of large-scale PDES, the performance of the simulations must be tuned to reduce the number of rollbacks and provide an improved simulation runtime. To enable efficient large-scale optimistic simulations, one has to gain insight into the factors that affect the rollback behavior and simulation performance. We developed a tool for ROSS model developers that gives them detailed metrics on the performance of their large-scale optimistic simulations at varying levels of simulation granularity. Model developers can use this information for parameter tuning of optimistic simulations in order to achieve better runtime and fewer rollbacks. In this work, we instrument the ROSS optimistic PDES framework to gather detailed statistics about the simulation engine. We have also developed an interactive visualization interface that uses the data collected by the ROSS instrumentation to understand the underlying behavior of the simulation engine. The interface connects real time to virtual time in the simulation and provides the ability to view simulation data at different granularities. We demonstrate the usefulness of our framework by performing a visual analysis of the dragonfly network topology model provided by the CODES simulation framework built on top of ROSS. The instrumentation needs to minimize overhead in order to accurately collect data about the simulation performance. To ensure that the instrumentation does not introduce unnecessary overhead, we perform

  13. Zonal methods for the parallel execution of range-limited N-body simulations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bowers, Kevin J.; Dror, Ron O.; Shaw, David E.

    2007-01-20

    Particle simulations in fields ranging from biochemistry to astrophysics require the evaluation of interactions between all pairs of particles separated by less than some fixed interaction radius. The applicability of such simulations is often limited by the time required for calculation, but the use of massive parallelism to accelerate these computations is typically limited by inter-processor communication requirements. Recently, Snir [M. Snir, A note on N-body computations with cutoffs, Theor. Comput. Syst. 37 (2004) 295-318] and Shaw [D.E. Shaw, A fast, scalable method for the parallel evaluation of distance-limited pairwise particle interactions, J. Comput. Chem. 26 (2005) 1318-1328] independently introduced two distinct methods that offer asymptotic reductions in the amount of data transferred between processors. In the present paper, we show that these schemes represent special cases of a more general class of methods, and introduce several new algorithms in this class that offer practical advantages over all previously described methods for a wide range of problem parameters. We also show that several of these algorithms approach an approximate lower bound on inter-processor data transfer.
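
    The range-limited pair problem stated at the start of the abstract above can be illustrated with a serial cell list. This is a minimal sketch and not one of the zonal methods from the paper: it assumes 3D points, no periodic boundaries, and a single process, and all function names are hypothetical. It bins particles into cubic cells whose side equals the interaction radius, so that every qualifying pair lies in the same cell or in adjacent cells.

```python
import itertools
import math
import random
from collections import defaultdict


def pairs_brute_force(points, radius):
    """Reference O(N^2) search: all index pairs (i, j), i < j, closer than radius."""
    return {
        (i, j)
        for i, j in itertools.combinations(range(len(points)), 2)
        if math.dist(points[i], points[j]) < radius
    }


def pairs_cell_list(points, radius):
    """Same pairs, found by checking only the 27 cells around each particle's cell."""
    cells = defaultdict(list)
    for idx, p in enumerate(points):
        # Cell side equals the radius, so close pairs differ by at most 1 per index.
        cells[tuple(math.floor(c / radius) for c in p)].append(idx)

    offsets = list(itertools.product((-1, 0, 1), repeat=3))
    found = set()
    for cell, members in cells.items():
        for off in offsets:
            neighbour = tuple(c + o for c, o in zip(cell, off))
            # .get avoids creating empty cells while iterating over the dict.
            for j in cells.get(neighbour, ()):
                for i in members:
                    if i < j and math.dist(points[i], points[j]) < radius:
                        found.add((i, j))
    return found


# Usage with hypothetical random points in a 5 x 5 x 5 box.
rng = random.Random(0)
pts = [tuple(rng.uniform(0.0, 5.0) for _ in range(3)) for _ in range(200)]
print(pairs_cell_list(pts, 1.0) == pairs_brute_force(pts, 1.0))
```

    In a distributed setting each processor would own a region of cells and import particle data from neighbouring regions. The volume of that import is the inter-processor communication cost the abstract refers to.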

  14. PFLOTRAN-E4D: A parallel open source PFLOTRAN module for simulating time-lapse electrical resistivity data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Johnson, Timothy C.; Hammond, Glenn E.; Chen, Xingyuan

    Time-lapse electrical resistivity tomography (ERT) is finding increased application for remotely monitoring processes occurring in the near subsurface in three dimensions (i.e. 4D monitoring). However, there are few codes capable of simulating the evolution of subsurface resistivity and corresponding tomographic measurements arising from a particular process, particularly in parallel and with an open source license. Herein we describe and demonstrate an electrical resistivity tomography module for the PFLOTRAN subsurface flow and reactive transport simulation code, named PFLOTRAN-E4D. The PFLOTRAN-E4D module operates in parallel using a dedicated set of compute cores in a master-slave configuration. At each time step, the master process receives subsurface states from PFLOTRAN, converts those states to bulk electrical conductivity, and instructs the slave processes to simulate a tomographic data set. The resulting multi-physics simulation capability enables accurate feasibility studies for ERT imaging, the identification of the ERT signatures that are unique to a given process, and facilitates the joint inversion of ERT data with hydrogeological data for subsurface characterization. PFLOTRAN-E4D is demonstrated herein using a field study of stage-driven groundwater/river water interaction ERT monitoring along the Columbia River, Washington, USA. Results demonstrate the complex nature of subsurface electrical conductivity changes, in both the saturated and unsaturated zones, arising from river stage fluctuations and associated river water intrusion into the aquifer. Furthermore, the results also demonstrate the sensitivity of surface based ERT measurements to those changes over time.

  15. PFLOTRAN-E4D: A parallel open source PFLOTRAN module for simulating time-lapse electrical resistivity data

    DOE PAGES

    Johnson, Timothy C.; Hammond, Glenn E.; Chen, Xingyuan

    2016-09-22

    Time-lapse electrical resistivity tomography (ERT) is finding increased application for remotely monitoring processes occurring in the near subsurface in three dimensions (i.e. 4D monitoring). However, there are few codes capable of simulating the evolution of subsurface resistivity and corresponding tomographic measurements arising from a particular process, particularly in parallel and with an open source license. Herein we describe and demonstrate an electrical resistivity tomography module for the PFLOTRAN subsurface flow and reactive transport simulation code, named PFLOTRAN-E4D. The PFLOTRAN-E4D module operates in parallel using a dedicated set of compute cores in a master-slave configuration. At each time step, the master process receives subsurface states from PFLOTRAN, converts those states to bulk electrical conductivity, and instructs the slave processes to simulate a tomographic data set. The resulting multi-physics simulation capability enables accurate feasibility studies for ERT imaging, the identification of the ERT signatures that are unique to a given process, and facilitates the joint inversion of ERT data with hydrogeological data for subsurface characterization. PFLOTRAN-E4D is demonstrated herein using a field study of stage-driven groundwater/river water interaction ERT monitoring along the Columbia River, Washington, USA. Results demonstrate the complex nature of subsurface electrical conductivity changes, in both the saturated and unsaturated zones, arising from river stage fluctuations and associated river water intrusion into the aquifer. Furthermore, the results also demonstrate the sensitivity of surface based ERT measurements to those changes over time.

  16. A fully coupled method for massively parallel simulation of hydraulically driven fractures in 3-dimensions

    DOE PAGES

    Settgast, Randolph R.; Fu, Pengcheng; Walsh, Stuart D. C.; ...

    2016-09-18

    This study describes a fully coupled finite element/finite volume approach for simulating field-scale hydraulically driven fractures in three dimensions, using massively parallel computing platforms. The proposed method is capable of capturing realistic representations of local heterogeneities, layering and natural fracture networks in a reservoir. A detailed description of the numerical implementation is provided, along with numerical studies comparing the model with both analytical solutions and experimental results. The results demonstrate the effectiveness of the proposed method for modeling large-scale problems involving hydraulically driven fractures in three dimensions.

  17. A fully coupled method for massively parallel simulation of hydraulically driven fractures in 3-dimensions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Settgast, Randolph R.; Fu, Pengcheng; Walsh, Stuart D. C.

    This study describes a fully coupled finite element/finite volume approach for simulating field-scale hydraulically driven fractures in three dimensions, using massively parallel computing platforms. The proposed method is capable of capturing realistic representations of local heterogeneities, layering and natural fracture networks in a reservoir. A detailed description of the numerical implementation is provided, along with numerical studies comparing the model with both analytical solutions and experimental results. The results demonstrate the effectiveness of the proposed method for modeling large-scale problems involving hydraulically driven fractures in three dimensions.

  18. Aerodynamic simulation on massively parallel systems

    NASA Technical Reports Server (NTRS)

    Haeuser, Jochem; Simon, Horst D.

    1992-01-01

    This paper briefly addresses the computational requirements for the analysis of complete configurations of aircraft and spacecraft currently under design to be used for advanced transportation in commercial applications as well as in space flight. The discussion clearly shows that massively parallel systems are the only alternative that is both cost effective and able to provide the TeraFlops needed to satisfy the narrow design margins of modern vehicles. It is assumed that the solution of the governing physical equations, i.e., the Navier-Stokes equations, which may be complemented by chemistry and turbulence models, is done on multiblock grids. This technique is situated between the fully structured approach of classical boundary fitted grids and the fully unstructured tetrahedral grids. A fully structured grid best represents the flow physics, while the unstructured grid gives the best geometrical flexibility. The multiblock grid employed is structured within a block, but completely unstructured on the block level. While a completely unstructured grid is not straightforward to parallelize, the above mentioned multiblock grid is inherently parallel, in particular for multiple instruction multiple datastream (MIMD) machines. In this paper guidelines are provided for setting up or modifying an existing sequential code so that a direct parallelization on a massively parallel system is possible. Results are presented for three parallel systems, namely the Intel hypercube, the Ncube hypercube, and the FPS 500 system. Some preliminary results for an 8K CM2 machine will also be mentioned. The code run is the two dimensional grid generation module of Grid, which is a general two dimensional and three dimensional grid generation code for complex geometries. A system of nonlinear Poisson equations is solved. This code is also a good test case for complex fluid dynamics codes, since the same data structures are used.
All systems provided good speedups, but

  19. A parallelization method for time periodic steady state in simulation of radio frequency sheath dynamics

    NASA Astrophysics Data System (ADS)

    Kwon, Deuk-Chul; Shin, Sung-Sik; Yu, Dong-Hun

    2017-10-01

    In order to reduce the computing time in simulation of radio frequency (rf) plasma sources, various numerical schemes have been developed. It is well known that the upwind, exponential, and power-law schemes can efficiently overcome the limitation on the grid size for fluid transport simulations of high density plasma discharges. Also, the semi-implicit method is a well-known numerical scheme to overcome the limitation on the simulation time step. However, despite remarkable advances in numerical techniques and computing power over the last few decades, efficient multi-dimensional modeling of low temperature plasma discharges has remained a considerable challenge. In particular, parallelization in time has been difficult for time periodic steady state problems such as capacitively coupled plasma discharges and rf sheath dynamics, because the values of the plasma parameters at the previous time step are used to calculate the new values at each time step. Therefore, we present a parallelization method for time periodic steady state problems that uses period-slices. In order to evaluate the efficiency of the developed method, one-dimensional fluid simulations are conducted to describe rf sheath dynamics. The results show that speedup can be achieved by using a multithreading method.

  20. Validating the simulation of large-scale parallel applications using statistical characteristics

    DOE PAGES

    Zhang, Deli; Wilke, Jeremiah; Hendry, Gilbert; ...

    2016-03-01

    Simulation is a widely adopted method to analyze and predict the performance of large-scale parallel applications. Validating the hardware model is highly important for complex simulations with a large number of parameters. Common practice involves calculating the percent error between the projected and the real execution time of a benchmark program. However, in a high-dimensional parameter space, this coarse-grained approach often suffers from parameter insensitivity, which may not be known a priori. Moreover, the traditional approach cannot be applied to the validation of software models, such as application skeletons used in online simulations. In this work, we present a methodology and a toolset for validating both hardware and software models by quantitatively comparing fine-grained statistical characteristics obtained from execution traces. Although statistical information has been used in tasks like performance optimization, this is the first attempt to apply it to simulation validation. Lastly, our experimental results show that the proposed evaluation approach offers significant improvement in fidelity when compared to evaluation using total execution time, and the proposed metrics serve as reliable criteria that progress toward automating the simulation tuning process.
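
    The contrast drawn in the abstract above, between a single percent error on total execution time and finer-grained trace statistics, can be shown with a toy example. This is a sketch under stated assumptions: the per-event durations are made up, and the summary statistics (mean and population standard deviation) are stand-ins chosen for illustration, not the metrics proposed in the paper.

```python
import statistics


def percent_error(projected: float, measured: float) -> float:
    """Percent error of a projected value relative to a measured one."""
    if measured == 0:
        raise ValueError("measured value must be non-zero")
    return abs(projected - measured) / abs(measured) * 100.0


def summarize(durations):
    """Total, mean and population standard deviation of per-event durations."""
    return {
        "total": sum(durations),
        "mean": statistics.fmean(durations),
        "stdev": statistics.pstdev(durations),
    }


# Hypothetical per-event durations in seconds from a real and a simulated run.
measured_trace = [1.0, 1.0, 1.0, 1.0]
simulated_trace = [0.5, 1.5, 0.5, 1.5]

real = summarize(measured_trace)
sim = summarize(simulated_trace)

# The totals agree, so the coarse-grained check reports no error ...
print(percent_error(sim["total"], real["total"]))
# ... while the spread of event durations differs between the two traces.
print(real["stdev"], sim["stdev"])
```

    Here the total-time check cannot distinguish the two traces, whereas even a simple per-event statistic exposes the mismatch.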