Sample records for multi-threaded event processing

  1. CMS event processing multi-core efficiency status

    NASA Astrophysics Data System (ADS)

    Jones, C. D.; CMS Collaboration

    2017-10-01

    In 2015, CMS was the first LHC experiment to begin using a multi-threaded framework for doing event processing. This new framework utilizes Intel’s Thread Building Block library to manage concurrency via a task based processing model. During the 2015 LHC run period, CMS only ran reconstruction jobs using multiple threads because only those jobs were sufficiently thread efficient. Recent work now allows simulation and digitization to be thread efficient. In addition, during 2015 the multi-threaded framework could run events in parallel but could only use one thread per event. Work done in 2016 now allows multiple threads to be used while processing one event. In this presentation we will show how these recent changes have improved CMS’s overall threading and memory efficiency and we will discuss work to be done to further increase those efficiencies.

  2. Multi-threaded Event Processing with DANA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    David Lawrence; Elliott Wolin

    2007-05-14

    The C++ data analysis framework DANA has been written to support the next generation of Nuclear Physics experiments at Jefferson Lab commensurate with the anticipated 12GeV upgrade. The DANA framework was designed to allow multi-threaded event processing with a minimal impact on developers of reconstruction software. This document describes how DANA implements multi-threaded event processing and compares it to simply running multiple instances of a program. Also presented are relative reconstruction rates for Pentium4, Xeon, and Opteron based machines.

  3. AthenaMT: upgrading the ATLAS software framework for the many-core world with multi-threading

    NASA Astrophysics Data System (ADS)

    Leggett, Charles; Baines, John; Bold, Tomasz; Calafiura, Paolo; Farrell, Steven; van Gemmeren, Peter; Malon, David; Ritsch, Elmar; Stewart, Graeme; Snyder, Scott; Tsulaia, Vakhtang; Wynne, Benjamin; ATLAS Collaboration

    2017-10-01

    ATLAS’s current software framework, Gaudi/Athena, has been very successful for the experiment in LHC Runs 1 and 2. However, its single threaded design has been recognized for some time to be increasingly problematic as CPUs have increased core counts and decreased available memory per core. Even the multi-process version of Athena, AthenaMP, will not scale to the range of architectures we expect to use beyond Run2. After concluding a rigorous requirements phase, where many design components were examined in detail, ATLAS has begun the migration to a new data-flow driven, multi-threaded framework, which enables the simultaneous processing of singleton, thread unsafe legacy Algorithms, cloned Algorithms that execute concurrently in their own threads with different Event contexts, and fully re-entrant, thread safe Algorithms. In this paper we report on the process of modifying the framework to safely process multiple concurrent events in different threads, which entails significant changes in the underlying handling of features such as event and time dependent data, asynchronous callbacks, metadata, integration with the online High Level Trigger for partial processing in certain regions of interest, concurrent I/O, as well as ensuring thread safety of core services. We also report on upgrading the framework to handle Algorithms that are fully re-entrant.

  4. Multithreaded Stochastic PDES for Reactions and Diffusions in Neurons.

    PubMed

    Lin, Zhongwei; Tropper, Carl; Mcdougal, Robert A; Patoary, Mohammand Nazrul Ishlam; Lytton, William W; Yao, Yiping; Hines, Michael L

    2017-07-01

    Cells exhibit stochastic behavior when the number of molecules is small. Hence a stochastic reaction-diffusion simulator capable of working at scale can provide a more accurate view of molecular dynamics within the cell. This paper describes a parallel discrete event simulator, Neuron Time Warp-Multi Thread (NTW-MT), developed for the simulation of reaction diffusion models of neurons. To the best of our knowledge, this is the first parallel discrete event simulator oriented towards stochastic simulation of chemical reactions in a neuron. The simulator was developed as part of the NEURON project. NTW-MT is optimistic and thread-based, which attempts to capitalize on multi-core architectures used in high performance machines. It makes use of a multi-level queue for the pending event set and a single roll-back message in place of individual anti-messages to disperse contention and decrease the overhead of processing rollbacks. Global Virtual Time is computed asynchronously both within and among processes to get rid of the overhead for synchronizing threads. Memory usage is managed in order to avoid locking and unlocking when allocating and de-allocating memory and to maximize cache locality. We verified our simulator on a calcium buffer model. We examined its performance on a calcium wave model, comparing it to the performance of a process based optimistic simulator and a threaded simulator which uses a single priority queue for each thread. Our multi-threaded simulator is shown to achieve superior performance to these simulators. Finally, we demonstrated the scalability of our simulator on a larger CICR model and a more detailed CICR model.

  5. Optimal Configuration and Deployment of Software on Multi-Core Processing Architectures

    DTIC Science & Technology

    2008-07-01

    between the event generating threads and the collector thread is implemented through semaphores . The Perseus data logger is designed to minimize the...performance counters (through the PAPI API) and opens up access to the shared memory logger through a semaphore and Remote Procedure Call (RPC) buffer... synchronization events. Using this rich data, the TMAM is able to output all of the information necessary to identify precisely which pairs of thread

  6. A multi-threaded version of MCFM

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Campbell, John M.; Ellis, R. Keith; Giele, Walter T.

    We report on our findings modifying MCFM using OpenMP to implement multi-threading. By using OpenMP, the modified MCFM will execute on any processor, automatically adjusting to the number of available threads. We then modified the integration routine VEGAS to distribute the event evaluation over the threads, while combining all events at the end of every iteration to optimize the numerical integration. Furthermore, we took special care so that the results of the Monte Carlo integration were independent of the number of threads used, to facilitate the validation of the OpenMP version of MCFM.

  7. Using a source-to-source transformation to introduce multi-threading into the AliRoot framework for a parallel event reconstruction

    NASA Astrophysics Data System (ADS)

    Lohn, Stefan B.; Dong, Xin; Carminati, Federico

    2012-12-01

    Chip-Multiprocessors are going to support massive parallelism by many additional physical and logical cores. Improving performance can no longer be obtained by increasing clock-frequency because the technical limits are almost reached. Instead, parallel execution must be used to gain performance. Resources like main memory, the cache hierarchy, bandwidth of the memory bus or links between cores and sockets are not going to be improved as fast. Hence, parallelism can only result into performance gains if the memory usage is optimized and the communication between threads is minimized. Besides concurrent programming has become a domain for experts. Implementing multi-threading is error prone and labor-intensive. A full reimplementation of the whole AliRoot source-code is unaffordable. This paper describes the effort to evaluate the adaption of AliRoot to the needs of multi-threading and to provide the capability of parallel processing by using a semi-automatic source-to-source transformation to address the problems as described before and to provide a straight-forward way of parallelization with almost no interference between threads. This makes the approach simple and reduces the required manual changes in the code. In a first step, unconditional thread-safety will be introduced to bring the original sequential and thread unaware source-code into the position of utilizing multi-threading. Afterwards further investigations have to be performed to point out candidates of classes that are useful to share amongst threads. Then in a second step, the transformation has to change the code to share these classes and finally to verify if there are anymore invalid interferences between threads.

  8. Evolution of the ATLAS Software Framework towards Concurrency

    NASA Astrophysics Data System (ADS)

    Jones, R. W. L.; Stewart, G. A.; Leggett, C.; Wynne, B. M.

    2015-05-01

    The ATLAS experiment has successfully used its Gaudi/Athena software framework for data taking and analysis during the first LHC run, with billions of events successfully processed. However, the design of Gaudi/Athena dates from early 2000 and the software and the physics code has been written using a single threaded, serial design. This programming model has increasing difficulty in exploiting the potential of current CPUs, which offer their best performance only through taking full advantage of multiple cores and wide vector registers. Future CPU evolution will intensify this trend, with core counts increasing and memory per core falling. Maximising performance per watt will be a key metric, so all of these cores must be used as efficiently as possible. In order to address the deficiencies of the current framework, ATLAS has embarked upon two projects: first, a practical demonstration of the use of multi-threading in our reconstruction software, using the GaudiHive framework; second, an exercise to gather requirements for an updated framework, going back to the first principles of how event processing occurs. In this paper we report on both these aspects of our work. For the hive based demonstrators, we discuss what changes were necessary in order to allow the serially designed ATLAS code to run, both to the framework and to the tools and algorithms used. We report on what general lessons were learned about the code patterns that had been employed in the software and which patterns were identified as particularly problematic for multi-threading. These lessons were fed into our considerations of a new framework and we present preliminary conclusions on this work. In particular we identify areas where the framework can be simplified in order to aid the implementation of a concurrent event processing scheme. Finally, we discuss the practical difficulties involved in migrating a large established code base to a multi-threaded framework and how this can be achieved for LHC Run 3.

  9. Software Defined Radio with Parallelized Software Architecture

    NASA Technical Reports Server (NTRS)

    Heckler, Greg

    2013-01-01

    This software implements software-defined radio procession over multi-core, multi-CPU systems in a way that maximizes the use of CPU resources in the system. The software treats each processing step in either a communications or navigation modulator or demodulator system as an independent, threaded block. Each threaded block is defined with a programmable number of input or output buffers; these buffers are implemented using POSIX pipes. In addition, each threaded block is assigned a unique thread upon block installation. A modulator or demodulator system is built by assembly of the threaded blocks into a flow graph, which assembles the processing blocks to accomplish the desired signal processing. This software architecture allows the software to scale effortlessly between single CPU/single-core computers or multi-CPU/multi-core computers without recompilation. NASA spaceflight and ground communications systems currently rely exclusively on ASICs or FPGAs. This software allows low- and medium-bandwidth (100 bps to .50 Mbps) software defined radios to be designed and implemented solely in C/C++ software, while lowering development costs and facilitating reuse and extensibility.

  10. a Spatiotemporal Aggregation Query Method Using Multi-Thread Parallel Technique Based on Regional Division

    NASA Astrophysics Data System (ADS)

    Liao, S.; Chen, L.; Li, J.; Xiong, W.; Wu, Q.

    2015-07-01

    Existing spatiotemporal database supports spatiotemporal aggregation query over massive moving objects datasets. Due to the large amounts of data and single-thread processing method, the query speed cannot meet the application requirements. On the other hand, the query efficiency is more sensitive to spatial variation then temporal variation. In this paper, we proposed a spatiotemporal aggregation query method using multi-thread parallel technique based on regional divison and implemented it on the server. Concretely, we divided the spatiotemporal domain into several spatiotemporal cubes, computed spatiotemporal aggregation on all cubes using the technique of multi-thread parallel processing, and then integrated the query results. By testing and analyzing on the real datasets, this method has improved the query speed significantly.

  11. Thread scheduling for GPU-based OPC simulation on multi-thread

    NASA Astrophysics Data System (ADS)

    Lee, Heejun; Kim, Sangwook; Hong, Jisuk; Lee, Sooryong; Han, Hwansoo

    2018-03-01

    As semiconductor product development based on shrinkage continues, the accuracy and difficulty required for the model based optical proximity correction (MBOPC) is increasing. OPC simulation time, which is the most timeconsuming part of MBOPC, is rapidly increasing due to high pattern density in a layout and complex OPC model. To reduce OPC simulation time, we attempt to apply graphic processing unit (GPU) to MBOPC because OPC process is good to be programmed in parallel. We address some issues that may typically happen during GPU-based OPC simulation in multi thread system, such as "out of memory" and "GPU idle time". To overcome these problems, we propose a thread scheduling method, which manages OPC jobs in multiple threads in such a way that simulations jobs from multiple threads are alternatively executed on GPU while correction jobs are executed at the same time in each CPU cores. It was observed that the amount of GPU peak memory usage decreases by up to 35%, and MBOPC runtime also decreases by 4%. In cases where out of memory issues occur in a multi-threaded environment, the thread scheduler was used to improve MBOPC runtime up to 23%.

  12. Software Defined Radio with Parallelized Software Architecture

    NASA Technical Reports Server (NTRS)

    Heckler, Greg

    2013-01-01

    This software implements software-defined radio procession over multicore, multi-CPU systems in a way that maximizes the use of CPU resources in the system. The software treats each processing step in either a communications or navigation modulator or demodulator system as an independent, threaded block. Each threaded block is defined with a programmable number of input or output buffers; these buffers are implemented using POSIX pipes. In addition, each threaded block is assigned a unique thread upon block installation. A modulator or demodulator system is built by assembly of the threaded blocks into a flow graph, which assembles the processing blocks to accomplish the desired signal processing. This software architecture allows the software to scale effortlessly between single CPU/single-core computers or multi-CPU/multi-core computers without recompilation. NASA spaceflight and ground communications systems currently rely exclusively on ASICs or FPGAs. This software allows low- and medium-bandwidth (100 bps to approx.50 Mbps) software defined radios to be designed and implemented solely in C/C++ software, while lowering development costs and facilitating reuse and extensibility.

  13. Expressing Parallelism with ROOT

    NASA Astrophysics Data System (ADS)

    Piparo, D.; Tejedor, E.; Guiraud, E.; Ganis, G.; Mato, P.; Moneta, L.; Valls Pla, X.; Canal, P.

    2017-10-01

    The need for processing the ever-increasing amount of data generated by the LHC experiments in a more efficient way has motivated ROOT to further develop its support for parallelism. Such support is being tackled both for shared-memory and distributed-memory environments. The incarnations of the aforementioned parallelism are multi-threading, multi-processing and cluster-wide executions. In the area of multi-threading, we discuss the new implicit parallelism and related interfaces, as well as the new building blocks to safely operate with ROOT objects in a multi-threaded environment. Regarding multi-processing, we review the new MultiProc framework, comparing it with similar tools (e.g. multiprocessing module in Python). Finally, as an alternative to PROOF for cluster-wide executions, we introduce the efforts on integrating ROOT with state-of-the-art distributed data processing technologies like Spark, both in terms of programming model and runtime design (with EOS as one of the main components). For all the levels of parallelism, we discuss, based on real-life examples and measurements, how our proposals can increase the productivity of scientists.

  14. Expressing Parallelism with ROOT

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Piparo, D.; Tejedor, E.; Guiraud, E.

    The need for processing the ever-increasing amount of data generated by the LHC experiments in a more efficient way has motivated ROOT to further develop its support for parallelism. Such support is being tackled both for shared-memory and distributed-memory environments. The incarnations of the aforementioned parallelism are multi-threading, multi-processing and cluster-wide executions. In the area of multi-threading, we discuss the new implicit parallelism and related interfaces, as well as the new building blocks to safely operate with ROOT objects in a multi-threaded environment. Regarding multi-processing, we review the new MultiProc framework, comparing it with similar tools (e.g. multiprocessing module inmore » Python). Finally, as an alternative to PROOF for cluster-wide executions, we introduce the efforts on integrating ROOT with state-of-the-art distributed data processing technologies like Spark, both in terms of programming model and runtime design (with EOS as one of the main components). For all the levels of parallelism, we discuss, based on real-life examples and measurements, how our proposals can increase the productivity of scientists.« less

  15. Processing communications events in parallel active messaging interface by awakening thread from wait state

    DOEpatents

    Archer, Charles J; Blocksome, Michael A; Ratterman, Joseph D; Smith, Brian E

    2013-10-22

    Processing data communications events in a parallel active messaging interface (`PAMI`) of a parallel computer that includes compute nodes that execute a parallel application, with the PAMI including data communications endpoints, and the endpoints are coupled for data communications through the PAMI and through other data communications resources, including determining by an advance function that there are no actionable data communications events pending for its context, placing by the advance function its thread of execution into a wait state, waiting for a subsequent data communications event for the context; responsive to occurrence of a subsequent data communications event for the context, awakening by the thread from the wait state; and processing by the advance function the subsequent data communications event now pending for the context.

  16. Multi-threading: A new dimension to massively parallel scientific computation

    NASA Astrophysics Data System (ADS)

    Nielsen, Ida M. B.; Janssen, Curtis L.

    2000-06-01

    Multi-threading is becoming widely available for Unix-like operating systems, and the application of multi-threading opens new ways for performing parallel computations with greater efficiency. We here briefly discuss the principles of multi-threading and illustrate the application of multi-threading for a massively parallel direct four-index transformation of electron repulsion integrals. Finally, other potential applications of multi-threading in scientific computing are outlined.

  17. Servicing a globally broadcast interrupt signal in a multi-threaded computer

    DOEpatents

    Attinella, John E.; Davis, Kristan D.; Musselman, Roy G.; Satterfield, David L.

    2015-12-29

    Methods, apparatuses, and computer program products for servicing a globally broadcast interrupt signal in a multi-threaded computer comprising a plurality of processor threads. Embodiments include an interrupt controller indicating in a plurality of local interrupt status locations that a globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include a thread determining that a local interrupt status location corresponding to the thread indicates that the globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include the thread processing one or more entries in a global interrupt status bit queue based on whether global interrupt status bits associated with the globally broadcast interrupt signal are locked. Each entry in the global interrupt status bit queue corresponds to a queued global interrupt.

  18. Processing data communications events by awakening threads in parallel active messaging interface of a parallel computer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Archer, Charles J.; Blocksome, Michael A.; Ratterman, Joseph D.

    Processing data communications events in a parallel active messaging interface (`PAMI`) of a parallel computer that includes compute nodes that execute a parallel application, with the PAMI including data communications endpoints, and the endpoints are coupled for data communications through the PAMI and through other data communications resources, including determining by an advance function that there are no actionable data communications events pending for its context, placing by the advance function its thread of execution into a wait state, waiting for a subsequent data communications event for the context; responsive to occurrence of a subsequent data communications event for themore » context, awakening by the thread from the wait state; and processing by the advance function the subsequent data communications event now pending for the context.« less

  19. Development of an Autonomous Navigation Technology Test Vehicle

    DTIC Science & Technology

    2004-08-01

    as an independent thread on processors using the Linux operating system. The computer hardware selected for the nodes that host the MRS threads...communications system design. Linux was chosen as the operating system for all of the single board computers used on the Mule. Linux was specifically...used for system analysis and development. The simple realization of multi-thread processing and inter-process communications in Linux made it a

  20. Geant4 Computing Performance Benchmarking and Monitoring

    DOE PAGES

    Dotti, Andrea; Elvira, V. Daniel; Folger, Gunter; ...

    2015-12-23

    Performance evaluation and analysis of large scale computing applications is essential for optimal use of resources. As detector simulation is one of the most compute intensive tasks and Geant4 is the simulation toolkit most widely used in contemporary high energy physics (HEP) experiments, it is important to monitor Geant4 through its development cycle for changes in computing performance and to identify problems and opportunities for code improvements. All Geant4 development and public releases are being profiled with a set of applications that utilize different input event samples, physics parameters, and detector configurations. Results from multiple benchmarking runs are compared tomore » previous public and development reference releases to monitor CPU and memory usage. Observed changes are evaluated and correlated with code modifications. Besides the full summary of call stack and memory footprint, a detailed call graph analysis is available to Geant4 developers for further analysis. The set of software tools used in the performance evaluation procedure, both in sequential and multi-threaded modes, include FAST, IgProf and Open|Speedshop. In conclusion, the scalability of the CPU time and memory performance in multi-threaded application is evaluated by measuring event throughput and memory gain as a function of the number of threads for selected event samples.« less

  1. Development of a Next Generation Concurrent Framework for the ATLAS Experiment

    NASA Astrophysics Data System (ADS)

    Calafiura, P.; Lampl, W.; Leggett, C.; Malon, D.; Stewart, G.; Wynne, B.

    2015-12-01

    The ATLAS experiment has successfully used its Gaudi/Athena software framework for data taking and analysis during the first LHC run, with billions of events successfully processed. However, the design of Gaudi/Athena dates from early 2000 and the software and the physics code has been written using a single threaded, serial design. This programming model has increasing difficulty in exploiting the potential of current CPUs, which offer their best performance only through taking full advantage of multiple cores and wide vector registers. Future CPU evolution will intensify this trend, with core counts increasing and memory per core falling. With current memory consumption for 64 bit ATLAS reconstruction in a high luminosity environment approaching 4GB, it will become impossible to fully occupy all cores in a machine without exhausting available memory. However, since maximizing performance per watt will be a key metric, a mechanism must be found to use all cores as efficiently as possible. In this paper we report on our progress with a practical demonstration of the use of multithreading in the ATLAS reconstruction software, using the GaudiHive framework. We have expanded support to Calorimeter, Inner Detector, and Tracking code, discussing what changes were necessary in order to allow the serially designed ATLAS code to run, both to the framework and to the tools and algorithms used. We report on both the performance gains, and what general lessons were learned about the code patterns that had been employed in the software and which patterns were identified as particularly problematic for multi-threading. We also present our findings on implementing a hybrid multi-threaded / multi-process framework, to take advantage of the strengths of each type of concurrency, while avoiding some of their corresponding limitations.

  2. Efficient methods for implementation of multi-level nonrigid mass-preserving image registration on GPUs and multi-threaded CPUs.

    PubMed

    Ellingwood, Nathan D; Yin, Youbing; Smith, Matthew; Lin, Ching-Long

    2016-04-01

    Faster and more accurate methods for registration of images are important for research involved in conducting population-based studies that utilize medical imaging, as well as improvements for use in clinical applications. We present a novel computation- and memory-efficient multi-level method on graphics processing units (GPU) for performing registration of two computed tomography (CT) volumetric lung images. We developed a computation- and memory-efficient Diffeomorphic Multi-level B-Spline Transform Composite (DMTC) method to implement nonrigid mass-preserving registration of two CT lung images on GPU. The framework consists of a hierarchy of B-Spline control grids of increasing resolution. A similarity criterion known as the sum of squared tissue volume difference (SSTVD) was adopted to preserve lung tissue mass. The use of SSTVD consists of the calculation of the tissue volume, the Jacobian, and their derivatives, which makes its implementation on GPU challenging due to memory constraints. The use of the DMTC method enabled reduced computation and memory storage of variables with minimal communication between GPU and Central Processing Unit (CPU) due to ability to pre-compute values. The method was assessed on six healthy human subjects. Resultant GPU-generated displacement fields were compared against the previously validated CPU counterpart fields, showing good agreement with an average normalized root mean square error (nRMS) of 0.044±0.015. Runtime and performance speedup are compared between single-threaded CPU, multi-threaded CPU, and GPU algorithms. Best performance speedup occurs at the highest resolution in the GPU implementation for the SSTVD cost and cost gradient computations, with a speedup of 112 times that of the single-threaded CPU version and 11 times over the twelve-threaded version when considering average time per iteration using a Nvidia Tesla K20X GPU. The proposed GPU-based DMTC method outperforms its multi-threaded CPU version in terms of runtime. Total registration time reduced runtime to 2.9min on the GPU version, compared to 12.8min on twelve-threaded CPU version and 112.5min on a single-threaded CPU. Furthermore, the GPU implementation discussed in this work can be adapted for use of other cost functions that require calculation of the first derivatives. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  3. Multi-threaded integration of HTC-Vive and MeVisLab

    NASA Astrophysics Data System (ADS)

    Gunacker, Simon; Gall, Markus; Schmalstieg, Dieter; Egger, Jan

    2018-03-01

    This work presents how Virtual Reality (VR) can easily be integrated into medical applications via a plugin for a medical image processing framework called MeVisLab. A multi-threaded plugin has been developed using OpenVR, a VR library that can be used for developing vendor and platform independent VR applications. The plugin is tested using the HTC Vive, a head-mounted display developed by HTC and Valve Corporation.

  4. Using all of your CPU's in HIPE

    NASA Astrophysics Data System (ADS)

    Jacobson, J. D.; Fadda, D.

    2012-09-01

    Modern computer architectures increasingly feature multi-core CPU's. For example, the MacbookPro features the Intel quad-core i7 processors. Through the use of hyper-threading, where each core can execute two threads simultaneously, the quad-core i7 can support eight simultaneous processing threads. All this on your laptop! This CPU power can now be put into service by scientists to perform data reduction tasks, but only if the software has been designed to take advantage of the multiple processor architectures. Up to now, software written for Herschel data reduction (HIPE), written in Jython and JAVA, is single-threaded and can only utilize a single processor. Users of HIPE do not get any advantage from the additional processors. Why not put all of the CPU resources to work reducing your data? We present a multi-threaded software application that corrects long-term transients in the signal from the PACS unchopped spectroscopy line scan mode. In this poster, we present a multi-threaded software framework to achieve performance improvements from parallel execution. We will show how a task to correct transients in the PACS Spectroscopy Pipeline for the un-chopped line scan mode, has been threaded. This computation-intensive task uses either a one-parameter or a three parameter exponential function, to characterize the transient. The task uses a JAVA implementation of Minpack, translated from the C (Moshier) and IDL (Markwardt) by the authors, to optimize the correction parameters. We also explain how to determine if a task can benefit from threading (Amdahl's Law), and if it is safe to thread. The design and implementation, using the JAVA concurrency package completions service is described. Pitfalls, timing bugs, thread safety, resource control, testing and performance improvements are described and plotted.

  5. Data Parallel Bin-Based Indexing for Answering Queries on Multi-Core Architectures

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gosink, Luke; Wu, Kesheng; Bethel, E. Wes

    2009-06-02

    The multi-core trend in CPUs and general purpose graphics processing units (GPUs) offers new opportunities for the database community. The increase of cores at exponential rates is likely to affect virtually every server and client in the coming decade, and presents database management systems with a huge, compelling disruption that will radically change how processing is done. This paper presents a new parallel indexing data structure for answering queries that takes full advantage of the increasing thread-level parallelism emerging in multi-core architectures. In our approach, our Data Parallel Bin-based Index Strategy (DP-BIS) first bins the base data, and then partitionsmore » and stores the values in each bin as a separate, bin-based data cluster. In answering a query, the procedures for examining the bin numbers and the bin-based data clusters offer the maximum possible level of concurrency; each record is evaluated by a single thread and all threads are processed simultaneously in parallel. We implement and demonstrate the effectiveness of DP-BIS on two multi-core architectures: a multi-core CPU and a GPU. The concurrency afforded by DP-BIS allows us to fully utilize the thread-level parallelism provided by each architecture--for example, our GPU-based DP-BIS implementation simultaneously evaluates over 12,000 records with an equivalent number of concurrently executing threads. In comparing DP-BIS's performance across these architectures, we show that the GPU-based DP-BIS implementation requires significantly less computation time to answer a query than the CPU-based implementation. We also demonstrate in our analysis that DP-BIS provides better overall performance than the commonly utilized CPU and GPU-based projection index. Finally, due to data encoding, we show that DP-BIS accesses significantly smaller amounts of data than index strategies that operate solely on a column's base data; this smaller data footprint is critical for parallel processors that possess limited memory resources (e.g., GPUs).« less

  6. Scalable Probabilistic Inference for Global Seismic Monitoring

    NASA Astrophysics Data System (ADS)

    Arora, N. S.; Dear, T.; Russell, S.

    2011-12-01

    We describe a probabilistic generative model for seismic events, their transmission through the earth, and their detection (or mis-detection) at seismic stations. We also describe an inference algorithm that constructs the most probable event bulletin explaining the observed set of detections. The model and inference are called NET-VISA (network processing vertically integrated seismic analysis) and is designed to replace the current automated network processing at the IDC, the SEL3 bulletin. Our results (attached table) demonstrate that NET-VISA significantly outperforms SEL3 by reducing the missed events from 30.3% down to 12.5%. The difference is even more dramatic for smaller magnitude events. NET-VISA has no difficulty in locating nuclear explosions as well. The attached figure demonstrates the location predicted by NET-VISA versus other bulletins for the second DPRK event. Further evaluation on dense regional networks demonstrates that NET-VISA finds many events missed in the LEB bulletin, which is produced by the human analysts. Large aftershock sequences, as produced by the 2004 December Sumatra earthquake and the 2011 March Tohoku earthquake, can pose a significant load for automated processing, often delaying the IDC bulletins by weeks or months. Indeed these sequences can overload the serial NET-VISA inference as well. We describe an enhancement to NET-VISA to make it multi-threaded, and hence take full advantage of the processing power of multi-core and -cpu machines. Our experiments show that the new inference algorithm is able to achieve 80% efficiency in parallel speedup.

  7. FODEM: A Multi-Threaded Research and Development Method for Educational Technology

    ERIC Educational Resources Information Center

    Suhonen, Jarkko; de Villiers, M. Ruth; Sutinen, Erkki

    2012-01-01

    Formative development method (FODEM) is a multithreaded design approach that was originated to support the design and development of various types of educational technology innovations, such as learning tools, and online study programmes. The threaded and agile structure of the approach provides flexibility to the design process. Intensive…

  8. Study of a Fine Grained Threaded Framework Design

    NASA Astrophysics Data System (ADS)

    Jones, C. D.

    2012-12-01

    Traditionally, HEP experiments exploit the multiple cores in a CPU by having each core process one event. However, future PC designs are expected to use CPUs which double the number of processing cores at the same rate as the cost of memory falls by a factor of two. This effectively means the amount of memory per processing core will remain constant. This is a major challenge for LHC processing frameworks since the LHC is expected to deliver more complex events (e.g. greater pileup events) in the coming years while the LHC experiment's frameworks are already memory constrained. Therefore in the not so distant future we may need to be able to efficiently use multiple cores to process one event. In this presentation we will discuss a design for an HEP processing framework which can allow very fine grained parallelization within one event as well as supporting processing multiple events simultaneously while minimizing the memory footprint of the job. The design is built around the libdispatch framework created by Apple Inc. (a port for Linux is available) whose central concept is the use of task queues. This design also accommodates the reality that not all code will be thread safe and therefore allows one to easily mark modules or sub parts of modules as being thread unsafe. In addition, the design efficiently handles the requirement that events in one run must all be processed before starting to process events from a different run. After explaining the design we will provide measurements from simulating different processing scenarios where the processing times used for the simulation are drawn from processing times measured from actual CMS event processing.

  9. Accelerated event-by-event Monte Carlo microdosimetric calculations of electrons and protons tracks on a multi-core CPU and a CUDA-enabled GPU.

    PubMed

    Kalantzis, Georgios; Tachibana, Hidenobu

    2014-01-01

    For microdosimetric calculations event-by-event Monte Carlo (MC) methods are considered the most accurate. The main shortcoming of those methods is the extensive requirement for computational time. In this work we present an event-by-event MC code of low projectile energy electron and proton tracks for accelerated microdosimetric MC simulations on a graphic processing unit (GPU). Additionally, a hybrid implementation scheme was realized by employing OpenMP and CUDA in such a way that both GPU and multi-core CPU were utilized simultaneously. The two implementation schemes have been tested and compared with the sequential single threaded MC code on the CPU. Performance comparison was established on the speed-up for a set of benchmarking cases of electron and proton tracks. A maximum speedup of 67.2 was achieved for the GPU-based MC code, while a further improvement of the speedup up to 20% was achieved for the hybrid approach. The results indicate the capability of our CPU-GPU implementation for accelerated MC microdosimetric calculations of both electron and proton tracks without loss of accuracy. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  10. Real time display Fourier-domain OCT using multi-thread parallel computing with data vectorization

    NASA Astrophysics Data System (ADS)

    Eom, Tae Joong; Kim, Hoon Seop; Kim, Chul Min; Lee, Yeung Lak; Choi, Eun-Seo

    2011-03-01

    We demonstrate a real-time display of processed OCT images using multi-thread parallel computing with a quad-core CPU of a personal computer. The data of each A-line are treated as one vector to maximize the data translation rate between the cores of the CPU and RAM stored image data. A display rate of 29.9 frames/sec for processed OCT data (4096 FFT-size x 500 A-scans) is achieved in our system using a wavelength swept source with 52-kHz swept frequency. The data processing times of the OCT image and a Doppler OCT image with a 4-time average are 23.8 msec and 91.4 msec.

  11. Parallel Lattice Basis Reduction Using a Multi-threaded Schnorr-Euchner LLL Algorithm

    NASA Astrophysics Data System (ADS)

    Backes, Werner; Wetzel, Susanne

    In this paper, we introduce a new parallel variant of the LLL lattice basis reduction algorithm. Our new, multi-threaded algorithm is the first to provide an efficient, parallel implementation of the Schorr-Euchner algorithm for today’s multi-processor, multi-core computer architectures. Experiments with sparse and dense lattice bases show a speed-up factor of about 1.8 for the 2-thread and about factor 3.2 for the 4-thread version of our new parallel lattice basis reduction algorithm in comparison to the traditional non-parallel algorithm.

  12. Topical perspective on massive threading and parallelism.

    PubMed

    Farber, Robert M

    2011-09-01

    Unquestionably computer architectures have undergone a recent and noteworthy paradigm shift that now delivers multi- and many-core systems with tens to many thousands of concurrent hardware processing elements per workstation or supercomputer node. GPGPU (General Purpose Graphics Processor Unit) technology in particular has attracted significant attention as new software development capabilities, namely CUDA (Compute Unified Device Architecture) and OpenCL™, have made it possible for students as well as small and large research organizations to achieve excellent speedup for many applications over more conventional computing architectures. The current scientific literature reflects this shift with numerous examples of GPGPU applications that have achieved one, two, and in some special cases, three-orders of magnitude increased computational performance through the use of massive threading to exploit parallelism. Multi-core architectures are also evolving quickly to exploit both massive-threading and massive-parallelism such as the 1.3 million threads Blue Waters supercomputer. The challenge confronting scientists in planning future experimental and theoretical research efforts--be they individual efforts with one computer or collaborative efforts proposing to use the largest supercomputers in the world is how to capitalize on these new massively threaded computational architectures--especially as not all computational problems will scale to massive parallelism. In particular, the costs associated with restructuring software (and potentially redesigning algorithms) to exploit the parallelism of these multi- and many-threaded machines must be considered along with application scalability and lifespan. This perspective is an overview of the current state of threading and parallelize with some insight into the future. Published by Elsevier Inc.

  13. Large Scale Document Inversion using a Multi-threaded Computing System

    PubMed Central

    Jung, Sungbo; Chang, Dar-Jen; Park, Juw Won

    2018-01-01

    Current microprocessor architecture is moving towards multi-core/multi-threaded systems. This trend has led to a surge of interest in using multi-threaded computing devices, such as the Graphics Processing Unit (GPU), for general purpose computing. We can utilize the GPU in computation as a massive parallel coprocessor because the GPU consists of multiple cores. The GPU is also an affordable, attractive, and user-programmable commodity. Nowadays a lot of information has been flooded into the digital domain around the world. Huge volume of data, such as digital libraries, social networking services, e-commerce product data, and reviews, etc., is produced or collected every moment with dramatic growth in size. Although the inverted index is a useful data structure that can be used for full text searches or document retrieval, a large number of documents will require a tremendous amount of time to create the index. The performance of document inversion can be improved by multi-thread or multi-core GPU. Our approach is to implement a linear-time, hash-based, single program multiple data (SPMD), document inversion algorithm on the NVIDIA GPU/CUDA programming platform utilizing the huge computational power of the GPU, to develop high performance solutions for document indexing. Our proposed parallel document inversion system shows 2-3 times faster performance than a sequential system on two different test datasets from PubMed abstract and e-commerce product reviews. CCS Concepts •Information systems➝Information retrieval • Computing methodologies➝Massively parallel and high-performance simulations. PMID:29861701

  14. Large Scale Document Inversion using a Multi-threaded Computing System.

    PubMed

    Jung, Sungbo; Chang, Dar-Jen; Park, Juw Won

    2017-06-01

    Current microprocessor architecture is moving towards multi-core/multi-threaded systems. This trend has led to a surge of interest in using multi-threaded computing devices, such as the Graphics Processing Unit (GPU), for general purpose computing. We can utilize the GPU in computation as a massive parallel coprocessor because the GPU consists of multiple cores. The GPU is also an affordable, attractive, and user-programmable commodity. Nowadays a lot of information has been flooded into the digital domain around the world. Huge volume of data, such as digital libraries, social networking services, e-commerce product data, and reviews, etc., is produced or collected every moment with dramatic growth in size. Although the inverted index is a useful data structure that can be used for full text searches or document retrieval, a large number of documents will require a tremendous amount of time to create the index. The performance of document inversion can be improved by multi-thread or multi-core GPU. Our approach is to implement a linear-time, hash-based, single program multiple data (SPMD), document inversion algorithm on the NVIDIA GPU/CUDA programming platform utilizing the huge computational power of the GPU, to develop high performance solutions for document indexing. Our proposed parallel document inversion system shows 2-3 times faster performance than a sequential system on two different test datasets from PubMed abstract and e-commerce product reviews. •Information systems➝Information retrieval • Computing methodologies➝Massively parallel and high-performance simulations.

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Attinella, John E.; Davis, Kristan D.; Musselman, Roy G.

    Methods, apparatuses, and computer program products for servicing a globally broadcast interrupt signal in a multi-threaded computer comprising a plurality of processor threads. Embodiments include an interrupt controller indicating in a plurality of local interrupt status locations that a globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include a thread determining that a local interrupt status location corresponding to the thread indicates that the globally broadcast interrupt signal has been received by the interrupt controller. Embodiments also include the thread processing one or more entries in a global interrupt status bit queue based on whethermore » global interrupt status bits associated with the globally broadcast interrupt signal are locked. Each entry in the global interrupt status bit queue corresponds to a queued global interrupt.« less

  16. AN MHD AVALANCHE IN A MULTI-THREADED CORONAL LOOP

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hood, A. W.; Cargill, P. J.; Tam, K. V.

    For the first time, we demonstrate how an MHD avalanche might occur in a multithreaded coronal loop. Considering 23 non-potential magnetic threads within a loop, we use 3D MHD simulations to show that only one thread needs to be unstable in order to start an avalanche even when the others are below marginal stability. This has significant implications for coronal heating in that it provides for energy dissipation with a trigger mechanism. The instability of the unstable thread follows the evolution determined in many earlier investigations. However, once one stable thread is disrupted, it coalesces with a neighboring thread andmore » this process disrupts other nearby threads. Coalescence with these disrupted threads then occurs leading to the disruption of yet more threads as the avalanche develops. Magnetic energy is released in discrete bursts as the surrounding stable threads are disrupted. The volume integrated heating, as a function of time, shows short spikes suggesting that the temporal form of the heating is more like that of nanoflares than of constant heating.« less

  17. SU-E-T-531: Performance Evaluation of Multithreaded Geant4 for Proton Therapy Dose Calculations in a High Performance Computing Facility

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shin, J; Coss, D; McMurry, J

    Purpose: To evaluate the efficiency of multithreaded Geant4 (Geant4-MT, version 10.0) for proton Monte Carlo dose calculations using a high performance computing facility. Methods: Geant4-MT was used to calculate 3D dose distributions in 1×1×1 mm3 voxels in a water phantom and patient's head with a 150 MeV proton beam covering approximately 5×5 cm2 in the water phantom. Three timestamps were measured on the fly to separately analyze the required time for initialization (which cannot be parallelized), processing time of individual threads, and completion time. Scalability of averaged processing time per thread was calculated as a function of thread number (1,more » 100, 150, and 200) for both 1M and 50 M histories. The total memory usage was recorded. Results: Simulations with 50 M histories were fastest with 100 threads, taking approximately 1.3 hours and 6 hours for the water phantom and the CT data, respectively with better than 1.0 % statistical uncertainty. The calculations show 1/N scalability in the event loops for both cases. The gains from parallel calculations started to decrease with 150 threads. The memory usage increases linearly with number of threads. No critical failures were observed during the simulations. Conclusion: Multithreading in Geant4-MT decreased simulation time in proton dose distribution calculations by a factor of 64 and 54 at a near optimal 100 threads for water phantom and patient's data respectively. Further simulations will be done to determine the efficiency at the optimal thread number. Considering the trend of computer architecture development, utilizing Geant4-MT for radiotherapy simulations is an excellent cost-effective alternative for a distributed batch queuing system. However, because the scalability depends highly on simulation details, i.e., the ratio of the processing time of one event versus waiting time to access for the shared event queue, a performance evaluation as described is recommended.« less

  18. Multi-Threaded DNA Tag/Anti-Tag Library Generator for Multi-Core Platforms

    DTIC Science & Technology

    2009-05-01

    base pair)  Watson ‐ Crick  strand pairs that bind perfectly within pairs, but poorly across pairs. A variety  of  DNA  strand hybridization metrics...AFRL-RI-RS-TR-2009-131 Final Technical Report May 2009 MULTI-THREADED DNA TAG/ANTI-TAG LIBRARY GENERATOR FOR MULTI-CORE PLATFORMS...TYPE Final 3. DATES COVERED (From - To) Jun 08 – Feb 09 4. TITLE AND SUBTITLE MULTI-THREADED DNA TAG/ANTI-TAG LIBRARY GENERATOR FOR MULTI-CORE

  19. Optimized FPGA Implementation of Multi-Rate FIR Filters Through Thread Decomposition

    NASA Technical Reports Server (NTRS)

    Zheng, Jason Xin; Nguyen, Kayla; He, Yutao

    2010-01-01

    Multirate (decimation/interpolation) filters are among the essential signal processing components in spaceborne instruments where Finite Impulse Response (FIR) filters are often used to minimize nonlinear group delay and finite-precision effects. Cascaded (multi-stage) designs of Multi-Rate FIR (MRFIR) filters are further used for large rate change ratio, in order to lower the required throughput while simultaneously achieving comparable or better performance than single-stage designs. Traditional representation and implementation of MRFIR employ polyphase decomposition of the original filter structure, whose main purpose is to compute only the needed output at the lowest possible sampling rate. In this paper, an alternative representation and implementation technique, called TD-MRFIR (Thread Decomposition MRFIR), is presented. The basic idea is to decompose MRFIR into output computational threads, in contrast to a structural decomposition of the original filter as done in the polyphase decomposition. Each thread represents an instance of the finite convolution required to produce a single output of the MRFIR. The filter is thus viewed as a finite collection of concurrent threads. The technical details of TD-MRFIR will be explained, first showing its applicability to the implementation of downsampling, upsampling, and resampling FIR filters, and then describing a general strategy to optimally allocate the number of filter taps. A particular FPGA design of multi-stage TD-MRFIR for the L-band radar of NASA's SMAP (Soil Moisture Active Passive) instrument is demonstrated; and its implementation results in several targeted FPGA devices are summarized in terms of the functional (bit width, fixed-point error) and performance (time closure, resource usage, and power estimation) parameters.

  20. Real-time SHVC software decoding with multi-threaded parallel processing

    NASA Astrophysics Data System (ADS)

    Gudumasu, Srinivas; He, Yuwen; Ye, Yan; He, Yong; Ryu, Eun-Seok; Dong, Jie; Xiu, Xiaoyu

    2014-09-01

    This paper proposes a parallel decoding framework for scalable HEVC (SHVC). Various optimization technologies are implemented on the basis of SHVC reference software SHM-2.0 to achieve real-time decoding speed for the two layer spatial scalability configuration. SHVC decoder complexity is analyzed with profiling information. The decoding process at each layer and the up-sampling process are designed in parallel and scheduled by a high level application task manager. Within each layer, multi-threaded decoding is applied to accelerate the layer decoding speed. Entropy decoding, reconstruction, and in-loop processing are pipeline designed with multiple threads based on groups of coding tree units (CTU). A group of CTUs is treated as a processing unit in each pipeline stage to achieve a better trade-off between parallelism and synchronization. Motion compensation, inverse quantization, and inverse transform modules are further optimized with SSE4 SIMD instructions. Simulations on a desktop with an Intel i7 processor 2600 running at 3.4 GHz show that the parallel SHVC software decoder is able to decode 1080p spatial 2x at up to 60 fps (frames per second) and 1080p spatial 1.5x at up to 50 fps for those bitstreams generated with SHVC common test conditions in the JCT-VC standardization group. The decoding performance at various bitrates with different optimization technologies and different numbers of threads are compared in terms of decoding speed and resource usage, including processor and memory.

  1. OpenGeoSys-GEMS: Hybrid parallelization of a reactive transport code with MPI and threads

    NASA Astrophysics Data System (ADS)

    Kosakowski, G.; Kulik, D. A.; Shao, H.

    2012-04-01

    OpenGeoSys-GEMS is a generic purpose reactive transport code based on the operator splitting approach. The code couples the Finite-Element groundwater flow and multi-species transport modules of the OpenGeoSys (OGS) project (http://www.ufz.de/index.php?en=18345) with the GEM-Selektor research package to model thermodynamic equilibrium of aquatic (geo)chemical systems utilizing the Gibbs Energy Minimization approach (http://gems.web.psi.ch/). The combination of OGS and the GEM-Selektor kernel (GEMS3K) is highly flexible due to the object-oriented modular code structures and the well defined (memory based) data exchange modules. Like other reactive transport codes, the practical applicability of OGS-GEMS is often hampered by the long calculation time and large memory requirements. • For realistic geochemical systems which might include dozens of mineral phases and several (non-ideal) solid solutions the time needed to solve the chemical system with GEMS3K may increase exceptionally. • The codes are coupled in a sequential non-iterative loop. In order to keep the accuracy, the time step size is restricted. In combination with a fine spatial discretization the time step size may become very small which increases calculation times drastically even for small 1D problems. • The current version of OGS is not optimized for memory use and the MPI version of OGS does not distribute data between nodes. Even for moderately small 2D problems the number of MPI processes that fit into memory of up-to-date workstations or HPC hardware is limited. One strategy to overcome the above mentioned restrictions of OGS-GEMS is to parallelize the coupled code. For OGS a parallelized version already exists. It is based on a domain decomposition method implemented with MPI and provides a parallel solver for fluid and mass transport processes. In the coupled code, after solving fluid flow and solute transport, geochemical calculations are done in form of a central loop over all finite element nodes with calls to GEMS3K and consecutive calculations of changed material parameters. In a first step the existing MPI implementation was utilized to parallelize this loop. Calculations were split between the MPI processes and afterwards data was synchronized by using MPI communication routines. Furthermore, multi-threaded calculation of the loop was implemented with help of the boost thread library (http://www.boost.org). This implementation provides a flexible environment to distribute calculations between several threads. For each MPI process at least one and up to several dozens of worker threads are spawned. These threads do not replicate the complete OGS-GEM data structure and use only a limited amount of memory. Calculation of the central geochemical loop is shared between all threads. Synchronization between the threads is done by barrier commands. The overall number of local threads times MPI processes should match the number of available computing nodes. The combination of multi-threading and MPI provides an effective and flexible environment to speed up OGS-GEMS calculations while limiting the required memory use. Test calculations on different hardware show that for certain types of applications tremendous speedups are possible.

  2. A knittable fiber-shaped supercapacitor based on natural cotton thread for wearable electronics

    NASA Astrophysics Data System (ADS)

    Zhou, Qianlong; Jia, Chunyang; Ye, Xingke; Tang, Zhonghua; Wan, Zhongquan

    2016-09-01

    At present, the topic of building high-performance, miniaturized and mechanically flexible energy storage modules which can be directly integrated into textile based wearable electronics is a hotspot in the wearable technology field. In this paper, we reported a highly flexible fiber-shaped electrode fabricated through a one-step convenient hydrothermal process. The prepared graphene hydrogels/multi-walled carbon nanotubes-cotton thread derived from natural cotton thread is electrochemically active and mechanically strong. Fiber-shaped supercapacitor based on the prepared fiber electrodes and polyvinyl alcohol-H3PO4 gel electrolyte exhibits good capacitive performance (97.73 μF cm-1 at scan rate of 2 mV s-1), long cycle life (95.51% capacitance retention after 8000 charge-discharge cycles) and considerable stability (90.75% capacitance retention after 500 continuous bending cycles). Due to its good mechanical and electrochemical properties, the graphene hydrogels/multi-walled carbon nanotubes-cotton thread based all-solid fiber-shaped supercapacitor can be directly knitted into fabrics and maintain its original capacitive performance. Such a low-cost textile thread based versatile energy storage device may hold great potential for future wearable electronics applications.

  3. Muon g-2 Reconstruction and Analysis Framework for the Muon Anomalous Precession Frequency

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Khaw, Kim Siang

    The Muon g-2 experiment at Fermilab, with the aim to measure the muon anomalous magnetic moment to an unprecedented level of 140~ppb, has started beam and detector commissioning in Summer 2017. To deal with incoming data projected to be around tens of petabytes, a robust data reconstruction and analysis chain based on Fermilab's \\textit{art} event-processing framework is developed. Herein, I report the current status of the framework, together with its novel features such as multi-threaded algorithms for online data quality monitor (DQM) and fast-turnaround operation (nearline). Performance of the framework during the commissioning run is also discussed.

  4. Application of Intel Many Integrated Core (MIC) architecture to the Yonsei University planetary boundary layer scheme in Weather Research and Forecasting model

    NASA Astrophysics Data System (ADS)

    Huang, Melin; Huang, Bormin; Huang, Allen H.

    2014-10-01

    The Weather Research and Forecasting (WRF) model provided operational services worldwide in many areas and has linked to our daily activity, in particular during severe weather events. The scheme of Yonsei University (YSU) is one of planetary boundary layer (PBL) models in WRF. The PBL is responsible for vertical sub-grid-scale fluxes due to eddy transports in the whole atmospheric column, determines the flux profiles within the well-mixed boundary layer and the stable layer, and thus provide atmospheric tendencies of temperature, moisture (including clouds), and horizontal momentum in the entire atmospheric column. The YSU scheme is very suitable for massively parallel computation as there are no interactions among horizontal grid points. To accelerate the computation process of the YSU scheme, we employ Intel Many Integrated Core (MIC) Architecture as it is a multiprocessor computer structure with merits of efficient parallelization and vectorization essentials. Our results show that the MIC-based optimization improved the performance of the first version of multi-threaded code on Xeon Phi 5110P by a factor of 2.4x. Furthermore, the same CPU-based optimizations improved the performance on Intel Xeon E5-2603 by a factor of 1.6x as compared to the first version of multi-threaded code.

  5. Reducing False Positives in Runtime Analysis of Deadlocks

    NASA Technical Reports Server (NTRS)

    Bensalem, Saddek; Havelund, Klaus; Clancy, Daniel (Technical Monitor)

    2002-01-01

    This paper presents an improvement of a standard algorithm for detecting dead-lock potentials in multi-threaded programs, in that it reduces the number of false positives. The standard algorithm works as follows. The multi-threaded program under observation is executed, while lock and unlock events are observed. A graph of locks is built, with edges between locks symbolizing locking orders. Any cycle in the graph signifies a potential for a deadlock. The typical standard example is the group of dining philosophers sharing forks. The algorithm is interesting because it can catch deadlock potentials even though no deadlocks occur in the examined trace, and at the same time it scales very well in contrast t o more formal approaches to deadlock detection. The algorithm, however, can yield false positives (as well as false negatives). The extension of the algorithm described in this paper reduces the amount of false positives for three particular cases: when a gate lock protects a cycle, when a single thread introduces a cycle, and when the code segments in different threads that cause the cycle can actually not execute in parallel. The paper formalizes a theory for dynamic deadlock detection and compares it to model checking and static analysis techniques. It furthermore describes an implementation for analyzing Java programs and its application to two case studies: a planetary rover and a space craft altitude control system.

  6. Self-cleaning threaded rod spinneret for high-efficiency needleless electrospinning

    NASA Astrophysics Data System (ADS)

    Zheng, Gaofeng; Jiang, Jiaxin; Wang, Xiang; Li, Wenwang; Zhong, Weizheng; Guo, Shumin

    2018-07-01

    High-efficiency production of nanofibers is the key to the application of electrospinning technology. This work focuses on multi-jet electrospinning, in which a threaded rod electrode is utilized as the needless spinneret to achieve high-efficiency production of nanofibers. A slipper block, which fits into and moves through the threaded rod, is designed to transfer polymer solution evenly to the surface of the rod spinneret. The relative motion between the slipper block and the threaded rod electrode promotes the instable fluctuation of the solution surface, thus the rotation of threaded rod electrode decreases the critical voltage for the initial multi-jet ejection and the diameter of nanofibers. The residual solution on the surface of threaded rod is cleaned up by the moving slipper block, showing a great self-cleaning ability, which ensures the stable multi-jet ejection and increases the productivity of nanofibers. Each thread of the threaded rod electrode serves as an independent spinneret, which enhances the electric field strength and constrains the position of the Taylor cone, resulting in high productivity of uniform nanofibers. The diameter of nanofibers decreases with the increase of threaded rod rotation speed, and the productivity increases with the solution flow rate. The rotation of electrode provides an excess force for the ejection of charged jets, which also contributes to the high-efficiency production of nanofibers. The maximum productivity of nanofibers from the threaded rod spinneret is 5-6 g/h, about 250-300 times as high as that from the single-needle spinneret. The self-cleaning threaded rod spinneret is an effective way to realize continuous multi-jet electrospinning, which promotes industrial applications of uniform nanofibrous membrane.

  7. Application of Advanced Multi-Core Processor Technologies to Oceanographic Research

    DTIC Science & Technology

    2013-09-30

    STM32 NXP LPC series No Proprietary Microchip PIC32/DSPIC No > 500 mW; < 5 W ARM Cortex TI OMAP TI Sitara Broadcom BCM2835 Varies FPGA...1 DISTRIBUTION STATEMENT A. Approved for public release; distribution is unlimited. Application of Advanced Multi-Core Processor Technologies...state-of-the-art information processing architectures. OBJECTIVES Next-generation processor architectures (multi-core, multi-threaded) hold the

  8. A review of event processing frameworks used in HEP

    DOE PAGES

    Sexton-Kennedy, E.

    2015-12-23

    Today there are many different experimental event processing frameworks in use by running or about to be running experiments. This talk will discuss the different components of these frameworks. In the past there have been attempts at shared framework projects for example the collaborations on the BaBar framework (between BaBar, CDF, and CLEO), on the Gaudi framework (between LHCb and ATLAS), on AliROOT/FairROOT (between Alice and GSI/Fair), and in some ways on art (Fermilab based experiments) and CMS’ framework. However, for reasons that will be discussed, these collaborations did not result in common frameworks shared among the intended experiments. Thoughmore » importantly, two of the resulting projects have succeeded in providing frameworks that are shared among many customer experiments: Fermilab's art framework and GSI/Fair's FairROOT. Interestingly, several projects are considering remerging their frameworks after many years apart. I'll report on an investigation and analysis of these realities. In addition, with the advent of the need for multi-threaded frameworks and the scarce available manpower, it is important to collaborate in the future, however it is also important to understand why previous attempts at multi-experiment frameworks either worked or didn't work.« less

  9. Multi-threading performance of Geant4, MCNP6, and PHITS Monte Carlo codes for tetrahedral-mesh geometry.

    PubMed

    Han, Min Cheol; Yeom, Yeon Soo; Lee, Hyun Su; Shin, Bangho; Kim, Chan Hyeong; Furuta, Takuya

    2018-05-04

    In this study, the multi-threading performance of the Geant4, MCNP6, and PHITS codes was evaluated as a function of the number of threads (N) and the complexity of the tetrahedral-mesh phantom. For this, three tetrahedral-mesh phantoms of varying complexity (simple, moderately complex, and highly complex) were prepared and implemented in the three different Monte Carlo codes, in photon and neutron transport simulations. Subsequently, for each case, the initialization time, calculation time, and memory usage were measured as a function of the number of threads used in the simulation. It was found that for all codes, the initialization time significantly increased with the complexity of the phantom, but not with the number of threads. Geant4 exhibited much longer initialization time than the other codes, especially for the complex phantom (MRCP). The improvement of computation speed due to the use of a multi-threaded code was calculated as the speed-up factor, the ratio of the computation speed on a multi-threaded code to the computation speed on a single-threaded code. Geant4 showed the best multi-threading performance among the codes considered in this study, with the speed-up factor almost linearly increasing with the number of threads, reaching ~30 when N  =  40. PHITS and MCNP6 showed a much smaller increase of the speed-up factor with the number of threads. For PHITS, the speed-up factors were low when N  =  40. For MCNP6, the increase of the speed-up factors was better, but they were still less than ~10 when N  =  40. As for memory usage, Geant4 was found to use more memory than the other codes. In addition, compared to that of the other codes, the memory usage of Geant4 more rapidly increased with the number of threads, reaching as high as ~74 GB when N  =  40 for the complex phantom (MRCP). It is notable that compared to that of the other codes, the memory usage of PHITS was much lower, regardless of both the complexity of the phantom and the number of threads, hardly increasing with the number of threads for the MRCP.

  10. Nebo: An efficient, parallel, and portable domain-specific language for numerically solving partial differential equations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Earl, Christopher; Might, Matthew; Bagusetty, Abhishek

    This study presents Nebo, a declarative domain-specific language embedded in C++ for discretizing partial differential equations for transport phenomena on multiple architectures. Application programmers use Nebo to write code that appears sequential but can be run in parallel, without editing the code. Currently Nebo supports single-thread execution, multi-thread execution, and many-core (GPU-based) execution. With single-thread execution, Nebo performs on par with code written by domain experts. With multi-thread execution, Nebo can linearly scale (with roughly 90% efficiency) up to 12 cores, compared to its single-thread execution. Moreover, Nebo’s many-core execution can be over 140x faster than its single-thread execution.

  11. Nebo: An efficient, parallel, and portable domain-specific language for numerically solving partial differential equations

    DOE PAGES

    Earl, Christopher; Might, Matthew; Bagusetty, Abhishek; ...

    2016-01-26

    This study presents Nebo, a declarative domain-specific language embedded in C++ for discretizing partial differential equations for transport phenomena on multiple architectures. Application programmers use Nebo to write code that appears sequential but can be run in parallel, without editing the code. Currently Nebo supports single-thread execution, multi-thread execution, and many-core (GPU-based) execution. With single-thread execution, Nebo performs on par with code written by domain experts. With multi-thread execution, Nebo can linearly scale (with roughly 90% efficiency) up to 12 cores, compared to its single-thread execution. Moreover, Nebo’s many-core execution can be over 140x faster than its single-thread execution.

  12. Simulation of LHC events on a millions threads

    NASA Astrophysics Data System (ADS)

    Childers, J. T.; Uram, T. D.; LeCompte, T. J.; Papka, M. E.; Benjamin, D. P.

    2015-12-01

    Demand for Grid resources is expected to double during LHC Run II as compared to Run I; the capacity of the Grid, however, will not double. The HEP community must consider how to bridge this computing gap by targeting larger compute resources and using the available compute resources as efficiently as possible. Argonne's Mira, the fifth fastest supercomputer in the world, can run roughly five times the number of parallel processes that the ATLAS experiment typically uses on the Grid. We ported Alpgen, a serial x86 code, to run as a parallel application under MPI on the Blue Gene/Q architecture. By analysis of the Alpgen code, we reduced the memory footprint to allow running 64 threads per node, utilizing the four hardware threads available per core on the PowerPC A2 processor. Event generation and unweighting, typically run as independent serial phases, are coupled together in a single job in this scenario, reducing intermediate writes to the filesystem. By these optimizations, we have successfully run LHC proton-proton physics event generation at the scale of a million threads, filling two-thirds of Mira.

  13. 76 FR 57766 - Entergy Nuclear Vermont Yankee, LLC and Entergy Nuclear Operations, Inc.; Notice of Issuance of...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-09-16

    ...) due to leakage through the shaft to piston thread seals. This event did potentially affect the ability... combined thread seal leakage and accumulator leakage impacted the ability of the SRVs to satisfy design... Management Directive 8.11, ``Review Process for 10 CFR 2.206 Petitions'' (ADAMS Accession No. ML041770328...

  14. Historical perspectives on channel pattern in the Clark Fork River, Montana and implications for post-dam removal restoration

    NASA Astrophysics Data System (ADS)

    Woelfle-Erskine, C. A.; Wilcox, A. C.

    2009-12-01

    Active restoration approaches such as channel reconstruction have moved beyond the realm of small streams and are being applied to larger rivers. Uncertainties arising from limited knowledge, fluvial and ecosystem variability, and contaminants are especially significant in restoration of large rivers, where project costs and the social, infrastructural, and ecological costs of failure are high. We use the case of Milltown Dam removal on the Clark Fork River, Montana and subsequent channel reconstruction in the former reservoir to examine the use of historical research and uncertainty analysis in river restoration. At a cost of approximately $120 million, the Milltown Dam removal involves the mechanical removal of approximately 2 million cubic meters of sediments contaminated by upstream mining, followed by restoration of the former reservoir reach in which a single-thread meandering channel is being constructed. Historical maps, surveys, photographs, and accounts suggest a conceptual model of a multi-thread, anastomosing river in the reach targeted for channel reconstruction, upstream of the confluence of the Clark Fork and Blackfoot Rivers. We supplemented historical research with analysis of aerial photographs, topographic data, and USGS stage-discharge measurements in a lotic but reservoir-influenced reach of the Clark Fork River within our study area to estimate avulsion frequency (0.8 avulsions/year over a 70-year period) and average rates of lateral migration and aggradation. These were used to calculate the mobility number, a dimensionless relationship between channel filling and lateral migration timescales that can be used to predict whether a river’s planform is single or multi-threaded. The mobility number within our study reach ranged from 0.6 (multi-thread channel) to 1.7 (transitional channel). We predict that, in the absence of active channel reconstruction, the post-dam channel pattern would evolve to one that alternates between single and multi-threaded. We propose that multiple working hypotheses should be applied to managing uncertainty as part of an adaptive management plan for restoration in our study area and elsewhere. In this approach, restoration planning and implementation would be underpinned by an explicitly identified set of uncertainties and hypotheses about channel processes and post-restoration responses. This framework would allow for and embrace channel processes such as bifurcations and avulsions that are excluded from dominant approaches to channel reconstruction, which emphasize single-thread meandering planforms.

  15. Platform-Independence and Scheduling In a Multi-Threaded Real-Time Simulation

    NASA Technical Reports Server (NTRS)

    Sugden, Paul P.; Rau, Melissa A.; Kenney, P. Sean

    2001-01-01

    Aviation research often relies on real-time, pilot-in-the-loop flight simulation as a means to develop new flight software, flight hardware, or pilot procedures. Often these simulations become so complex that a single processor is incapable of performing the necessary computations within a fixed time-step. Threads are an elegant means to distribute the computational work-load when running on a symmetric multi-processor machine. However, programming with threads often requires operating system specific calls that reduce code portability and maintainability. While a multi-threaded simulation allows a significant increase in the simulation complexity, it also increases the workload of a simulation operator by requiring that the operator determine which models run on which thread. To address these concerns an object-oriented design was implemented in the NASA Langley Standard Real-Time Simulation in C++ (LaSRS++) application framework. The design provides a portable and maintainable means to use threads and also provides a mechanism to automatically load balance the simulation models.

  16. Data Acquisition System for Multi-Frequency Radar Flight Operations Preparation

    NASA Technical Reports Server (NTRS)

    Leachman, Jonathan

    2010-01-01

    A three-channel data acquisition system was developed for the NASA Multi-Frequency Radar (MFR) system. The system is based on a commercial-off-the-shelf (COTS) industrial PC (personal computer) and two dual-channel 14-bit digital receiver cards. The decimated complex envelope representations of the three radar signals are passed to the host PC via the PCI bus, and then processed in parallel by multiple cores of the PC CPU (central processing unit). The innovation is this parallelization of the radar data processing using multiple cores of a standard COTS multi-core CPU. The data processing portion of the data acquisition software was built using autonomous program modules or threads, which can run simultaneously on different cores. A master program module calculates the optimal number of processing threads, launches them, and continually supplies each with data. The benefit of this new parallel software architecture is that COTS PCs can be used to implement increasingly complex processing algorithms on an increasing number of radar range gates and data rates. As new PCs become available with higher numbers of CPU cores, the software will automatically utilize the additional computational capacity.

  17. Multi-Core Processors: An Enabling Technology for Embedded Distributed Model-Based Control (Postprint)

    DTIC Science & Technology

    2008-07-01

    generation of process partitioning, a thread pipelining becomes possible. In this paper we briefly summarize the requirements and trends for FADEC based... FADEC environment, presenting a hypothetical realization of an example application. Finally we discuss the application of Time-Triggered...based control applications of the future. 15. SUBJECT TERMS Gas turbine, FADEC , Multi-core processing technology, disturbed based control

  18. CMS Readiness for Multi-Core Workload Scheduling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Perez-Calero Yzquierdo, A.; Balcas, J.; Hernandez, J.

    In the present run of the LHC, CMS data reconstruction and simulation algorithms benefit greatly from being executed as multiple threads running on several processor cores. The complexity of the Run 2 events requires parallelization of the code to reduce the memory-per- core footprint constraining serial execution programs, thus optimizing the exploitation of present multi-core processor architectures. The allocation of computing resources for multi-core tasks, however, becomes a complex problem in itself. The CMS workload submission infrastructure employs multi-slot partitionable pilots, built on HTCondor and GlideinWMS native features, to enable scheduling of single and multi-core jobs simultaneously. This provides amore » solution for the scheduling problem in a uniform way across grid sites running a diversity of gateways to compute resources and batch system technologies. This paper presents this strategy and the tools on which it has been implemented. The experience of managing multi-core resources at the Tier-0 and Tier-1 sites during 2015, along with the deployment phase to Tier-2 sites during early 2016 is reported. The process of performance monitoring and optimization to achieve efficient and flexible use of the resources is also described.« less

  19. CMS readiness for multi-core workload scheduling

    NASA Astrophysics Data System (ADS)

    Perez-Calero Yzquierdo, A.; Balcas, J.; Hernandez, J.; Aftab Khan, F.; Letts, J.; Mason, D.; Verguilov, V.

    2017-10-01

    In the present run of the LHC, CMS data reconstruction and simulation algorithms benefit greatly from being executed as multiple threads running on several processor cores. The complexity of the Run 2 events requires parallelization of the code to reduce the memory-per- core footprint constraining serial execution programs, thus optimizing the exploitation of present multi-core processor architectures. The allocation of computing resources for multi-core tasks, however, becomes a complex problem in itself. The CMS workload submission infrastructure employs multi-slot partitionable pilots, built on HTCondor and GlideinWMS native features, to enable scheduling of single and multi-core jobs simultaneously. This provides a solution for the scheduling problem in a uniform way across grid sites running a diversity of gateways to compute resources and batch system technologies. This paper presents this strategy and the tools on which it has been implemented. The experience of managing multi-core resources at the Tier-0 and Tier-1 sites during 2015, along with the deployment phase to Tier-2 sites during early 2016 is reported. The process of performance monitoring and optimization to achieve efficient and flexible use of the resources is also described.

  20. EventThread: Visual Summarization and Stage Analysis of Event Sequence Data.

    PubMed

    Guo, Shunan; Xu, Ke; Zhao, Rongwen; Gotz, David; Zha, Hongyuan; Cao, Nan

    2018-01-01

    Event sequence data such as electronic health records, a person's academic records, or car service records, are ordered series of events which have occurred over a period of time. Analyzing collections of event sequences can reveal common or semantically important sequential patterns. For example, event sequence analysis might reveal frequently used care plans for treating a disease, typical publishing patterns of professors, and the patterns of service that result in a well-maintained car. It is challenging, however, to visually explore large numbers of event sequences, or sequences with large numbers of event types. Existing methods focus on extracting explicitly matching patterns of events using statistical analysis to create stages of event progression over time. However, these methods fail to capture latent clusters of similar but not identical evolutions of event sequences. In this paper, we introduce a novel visualization system named EventThread which clusters event sequences into threads based on tensor analysis and visualizes the latent stage categories and evolution patterns by interactively grouping the threads by similarity into time-specific clusters. We demonstrate the effectiveness of EventThread through usage scenarios in three different application domains and via interviews with an expert user.

  1. Situation exploration in a persistent surveillance system with multidimensional data

    NASA Astrophysics Data System (ADS)

    Habibi, Mohammad S.

    2013-03-01

    There is an emerging need for fusing hard and soft sensor data in an efficient surveillance system to provide accurate estimation of situation awareness. These mostly abstract, multi-dimensional and multi-sensor data pose a great challenge to the user in performing analysis of multi-threaded events efficiently and cohesively. To address this concern an interactive Visual Analytics (VA) application is developed for rapid assessment and evaluation of different hypotheses based on context-sensitive ontology spawn from taxonomies describing human/human and human/vehicle/object interactions. A methodology is described here for generating relevant ontology in a Persistent Surveillance System (PSS) and demonstrates how they can be utilized in the context of PSS to track and identify group activities pertaining to potential threats. The proposed VA system allows for visual analysis of raw data as well as metadata that have spatiotemporal representation and content-based implications. Additionally in this paper, a technique for rapid search of tagged information contingent to ranking and confidence is explained for analysis of multi-dimensional data. Lastly the issue of uncertainty associated with processing and interpretation of heterogeneous data is also addressed.

  2. Experimental investigation of effects of stitching orientation on forming behaviors of 2D P-aramid multilayer woven preform

    NASA Astrophysics Data System (ADS)

    Abtew, Mulat Alubel; Boussu, François; Bruniaux, Pascal; Loghin, Carmen; Cristian, Irina; Chen, Yan; Wang, Lichuan

    2018-05-01

    In many textile applications stitching process is one of the widely used methods to join the multi-layer fabric plies not only due to its easy applicability and flexible production but also provide structural integrity throughout-the-thickness of materials. In this research, the influences of stitching pattern on various molding characteristics of multi-layer 2D para-aramid plain woven fabrics while deformation was investigated. The fabrics were made of high performance fiber with 930dtex yarn linear density and fabric areal density of 200gm/m2. First, different stitch pattern (orientation) was applied for joining the mentioned multi-layered fabrics keeping other stitching parameters such as stitch gap, stitch thread tension, stitch length, stitch type, stitch thread type etc. constant throughout the study. Then, a pneumatic based molding device with a low speed forming process specially designed for preforming of textile with a predefined hemispherical shape of punch. The result shows that stitching pattern is one of the parameter that influences the different molding behavior and should be consider while molding stitched multi-layer fabrics.

  3. Parallelization strategies for continuum-generalized method of moments on the multi-thread systems

    NASA Astrophysics Data System (ADS)

    Bustamam, A.; Handhika, T.; Ernastuti, Kerami, D.

    2017-07-01

    Continuum-Generalized Method of Moments (C-GMM) covers the Generalized Method of Moments (GMM) shortfall which is not as efficient as Maximum Likelihood estimator by using the continuum set of moment conditions in a GMM framework. However, this computation would take a very long time since optimizing regularization parameter. Unfortunately, these calculations are processed sequentially whereas in fact all modern computers are now supported by hierarchical memory systems and hyperthreading technology, which allowing for parallel computing. This paper aims to speed up the calculation process of C-GMM by designing a parallel algorithm for C-GMM on the multi-thread systems. First, parallel regions are detected for the original C-GMM algorithm. There are two parallel regions in the original C-GMM algorithm, that are contributed significantly to the reduction of computational time: the outer-loop and the inner-loop. Furthermore, this parallel algorithm will be implemented with standard shared-memory application programming interface, i.e. Open Multi-Processing (OpenMP). The experiment shows that the outer-loop parallelization is the best strategy for any number of observations.

  4. Development and Evaluation of Vectorised and Multi-Core Event Reconstruction Algorithms within the CMS Software Framework

    NASA Astrophysics Data System (ADS)

    Hauth, T.; Innocente and, V.; Piparo, D.

    2012-12-01

    The processing of data acquired by the CMS detector at LHC is carried out with an object-oriented C++ software framework: CMSSW. With the increasing luminosity delivered by the LHC, the treatment of recorded data requires extraordinary large computing resources, also in terms of CPU usage. A possible solution to cope with this task is the exploitation of the features offered by the latest microprocessor architectures. Modern CPUs present several vector units, the capacity of which is growing steadily with the introduction of new processor generations. Moreover, an increasing number of cores per die is offered by the main vendors, even on consumer hardware. Most recent C++ compilers provide facilities to take advantage of such innovations, either by explicit statements in the programs sources or automatically adapting the generated machine instructions to the available hardware, without the need of modifying the existing code base. Programming techniques to implement reconstruction algorithms and optimised data structures are presented, that aim to scalable vectorization and parallelization of the calculations. One of their features is the usage of new language features of the C++11 standard. Portions of the CMSSW framework are illustrated which have been found to be especially profitable for the application of vectorization and multi-threading techniques. Specific utility components have been developed to help vectorization and parallelization. They can easily become part of a larger common library. To conclude, careful measurements are described, which show the execution speedups achieved via vectorised and multi-threaded code in the context of CMSSW.

  5. Playback system designed for X-Band SAR

    NASA Astrophysics Data System (ADS)

    Yuquan, Liu; Changyong, Dou

    2014-03-01

    SAR(Synthetic Aperture Radar) has extensive application because it is daylight and weather independent. In particular, X-Band SAR strip map, designed by Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, provides high ground resolution images, at the same time it has a large spatial coverage and a short acquisition time, so it is promising in multi-applications. When sudden disaster comes, the emergency situation acquires radar signal data and image as soon as possible, in order to take action to reduce loss and save lives in the first time. This paper summarizes a type of X-Band SAR playback processing system designed for disaster response and scientific needs. It describes SAR data workflow includes the payload data transmission and reception process. Playback processing system completes signal analysis on the original data, providing SAR level 0 products and quick image. Gigabit network promises radar signal transmission efficiency from recorder to calculation unit. Multi-thread parallel computing and ping pong operation can ensure computation speed. Through gigabit network, multi-thread parallel computing and ping pong operation, high speed data transmission and processing meet the SAR radar data playback real time requirement.

  6. Implementation of GPU accelerated SPECT reconstruction with Monte Carlo-based scatter correction.

    PubMed

    Bexelius, Tobias; Sohlberg, Antti

    2018-06-01

    Statistical SPECT reconstruction can be very time-consuming especially when compensations for collimator and detector response, attenuation, and scatter are included in the reconstruction. This work proposes an accelerated SPECT reconstruction algorithm based on graphics processing unit (GPU) processing. Ordered subset expectation maximization (OSEM) algorithm with CT-based attenuation modelling, depth-dependent Gaussian convolution-based collimator-detector response modelling, and Monte Carlo-based scatter compensation was implemented using OpenCL. The OpenCL implementation was compared against the existing multi-threaded OSEM implementation running on a central processing unit (CPU) in terms of scatter-to-primary ratios, standardized uptake values (SUVs), and processing speed using mathematical phantoms and clinical multi-bed bone SPECT/CT studies. The difference in scatter-to-primary ratios, visual appearance, and SUVs between GPU and CPU implementations was minor. On the other hand, at its best, the GPU implementation was noticed to be 24 times faster than the multi-threaded CPU version on a normal 128 × 128 matrix size 3 bed bone SPECT/CT data set when compensations for collimator and detector response, attenuation, and scatter were included. GPU SPECT reconstructions show great promise as an every day clinical reconstruction tool.

  7. Online friends, offline loved ones, and full-time media: young adult "mass personal" use of communication resources for informational and emotional support.

    PubMed

    Love, Brad; Donovan, Erin E

    2014-06-01

    As Web 2.0 technologies proliferate, patient education is changing dramatically. Information about prevention and survivorship arrives from a mix of sources. The present manuscript describes a study to shed light on how young adults (YAs) affected by cancer manage the digital world. Our investigation was guided by a research question asking how young adults affected by cancer engage in communication work in an environment of mass personal communication. The sample for this research consisted of 500 posts comprising 50 complete threads from an online support community for young adults affected by cancer. Threads were purposively sampled in a multi-stage process. Researchers used constant comparison to define themes, examining text in increments. Individuals harnessed assets of various communication tools for the purposes of message preparation and credibility checking. YAs demonstrated the multi-channel way they move between channels for different purposes, driven by preparation for future interactions. The result is a process that allows co-creation of knowledge in a trusted community. Findings indicate that completing communication work through multiple channels in a deliberate and savvy way is normal for YAs, particularly for message preparation and credibility checking. The multidirectional nature of digital tools plays an important role for YAs, as interactive resources appear to be the first or second stop for information after key events in the cancer trajectory. Results from this study are important as guidance to help manage the volume and depth of information common to the cancer experience in the Web 2.0 world.

  8. Collagen insulated from tensile damage by domains that unfold reversibly: in situ X-ray investigation of mechanical yield and damage repair in the mussel byssus

    PubMed Central

    Harrington, Matthew J.; Gupta, Himadri S.; Fratzl, Peter; Waite, J. Herbert

    2009-01-01

    The byssal threads of the California mussel, Mytilus californianus, are highly hysteretic, elastomeric fibers that collectively perform a holdfast function in wave-swept rocky seashore habitats. Following cyclic loading past the mechanical yield point, threads exhibit a damage-dependent reduction in mechanical performance. However, the distal portion of the byssal thread is capable of recovering initial material properties through a time-dependent healing process in the absence of active cellular metabolism. Byssal threads are composed almost exclusively of multi-domain hybrid collagens known as preCols, which largely determine the mechanical properties of the thread. Here, the structure-property relationships that govern thread mechanical performance are further probed. The molecular rearrangements that occur during yield and damage repair were investigated using time-resolved in situ wide angle X-ray diffraction (WAXD) coupled with cyclic tensile loading of threads and through thermally enhanced damage-repair studies. Results indicate that the collagen domains in byssal preCols are mechanically protected by the unfolding of sacrificial non-collagenous domains that refold on a slower time-scale. Time-dependent healing is primarily attributed to stochastic recoupling of broken histidine-metal coordination complexes. PMID:19275941

  9. Implementation of a multi-threaded framework for large-scale scientific applications

    DOE PAGES

    Sexton-Kennedy, E.; Gartung, Patrick; Jones, C. D.; ...

    2015-05-22

    The CMS experiment has recently completed the development of a multi-threaded capable application framework. In this paper, we will discuss the design, implementation and application of this framework to production applications in CMS. For the 2015 LHC run, this functionality is particularly critical for both our online and offline production applications, which depend on faster turn-around times and a reduced memory footprint relative to before. These applications are complex codes, each including a large number of physics-driven algorithms. While the framework is capable of running a mix of thread-safe and 'legacy' modules, algorithms running in our production applications need tomore » be thread-safe for optimal use of this multi-threaded framework at a large scale. Towards this end, we discuss the types of changes, which were necessary for our algorithms to achieve good performance of our multithreaded applications in a full-scale application. Lastly performance numbers for what has been achieved for the 2015 run are presented.« less

  10. FastGCN: A GPU Accelerated Tool for Fast Gene Co-Expression Networks

    PubMed Central

    Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun

    2015-01-01

    Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out. PMID:25602758

  11. FastGCN: a GPU accelerated tool for fast gene co-expression networks.

    PubMed

    Liang, Meimei; Zhang, Futao; Jin, Gulei; Zhu, Jun

    2015-01-01

    Gene co-expression networks comprise one type of valuable biological networks. Many methods and tools have been published to construct gene co-expression networks; however, most of these tools and methods are inconvenient and time consuming for large datasets. We have developed a user-friendly, accelerated and optimized tool for constructing gene co-expression networks that can fully harness the parallel nature of GPU (Graphic Processing Unit) architectures. Genetic entropies were exploited to filter out genes with no or small expression changes in the raw data preprocessing step. Pearson correlation coefficients were then calculated. After that, we normalized these coefficients and employed the False Discovery Rate to control the multiple tests. At last, modules identification was conducted to construct the co-expression networks. All of these calculations were implemented on a GPU. We also compressed the coefficient matrix to save space. We compared the performance of the GPU implementation with those of multi-core CPU implementations with 16 CPU threads, single-thread C/C++ implementation and single-thread R implementation. Our results show that GPU implementation largely outperforms single-thread C/C++ implementation and single-thread R implementation, and GPU implementation outperforms multi-core CPU implementation when the number of genes increases. With the test dataset containing 16,000 genes and 590 individuals, we can achieve greater than 63 times the speed using a GPU implementation compared with a single-thread R implementation when 50 percent of genes were filtered out and about 80 times the speed when no genes were filtered out.

  12. Event Reconstruction for Many-core Architectures using Java

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Graf, Norman A.; /SLAC

    Although Moore's Law remains technically valid, the performance enhancements in computing which traditionally resulted from increased CPU speeds ended years ago. Chip manufacturers have chosen to increase the number of core CPUs per chip instead of increasing clock speed. Unfortunately, these extra CPUs do not automatically result in improvements in simulation or reconstruction times. To take advantage of this extra computing power requires changing how software is written. Event reconstruction is globally serial, in the sense that raw data has to be unpacked first, channels have to be clustered to produce hits before those hits are identified as belonging tomore » a track or shower, tracks have to be found and fit before they are vertexed, etc. However, many of the individual procedures along the reconstruction chain are intrinsically independent and are perfect candidates for optimization using multi-core architecture. Threading is perhaps the simplest approach to parallelizing a program and Java includes a powerful threading facility built into the language. We have developed a fast and flexible reconstruction package (org.lcsim) written in Java that has been used for numerous physics and detector optimization studies. In this paper we present the results of our studies on optimizing the performance of this toolkit using multiple threads on many-core architectures.« less

  13. Evolution of channel morphology in a large river subject to rectification

    NASA Astrophysics Data System (ADS)

    Scorpio, Vittoria; Mastronunzio, Marco; Proto, Matteo; Zen, Simone; Bertoldi, Walter; Prà, Elena Dai; Comiti, Francesco; Surian, Nicola; Zolezzi, Guido

    2016-04-01

    Many large rivers in Europe have been subject to heavy modifications for land reclamation and flood mitigation through centuries. As a consequence, the study of the pre-alteration morphological patterns and of the related channel evolution following the anthropic modifications is rather challenging. The Adige River is the second longest river in Italy and drains 12,100 km2 of the Eastern Italian Alps. Currently, it features a straight to sinuous pattern and an average channel width of 40-60 m. A massive rectification scheme aiming at land reclamation of the Adige valley bottom was planned in the late 18th century, and implemented starting in the first decades of 19th century. Nowadays, it can be considered one of the most altered rivers in Italy, not only due to channelization but also to the presence of many hydropower reservoirs and check-dams along its tributaries. This study aims to the reconstruction of the Adige River's evolutionary trajectory over the last 250 years, and comprehension of key control factors driving channel evolution. A multi-temporal analysis of historical maps and orthophotos from 1776, to 2006 was performed in order to assess channel modifications. In addition, land use changes at the basin scale, years of occurrence of most relevant flood events, and climate variability over the investigated period were analyzed. The detailed topographical map surveyed in 1803 was taken as a reference, and the study sector (115 km long) was divided into 39 reaches. Active channel, bars, riparian vegetation and channel control works were geo-processed. Results show that the Adige River suffered the most intense alteration from 1803 to 1855, and especially from 1847 to 1855. During this period channel narrowing ranged from 14% to 70%, coupled with pattern changes and decreases in the braiding, sinuosity and anabrancing indices. Most important alterations occurred in the reaches presenting a multi-thread morphology in 1803, as their average width declined from 220 m to 110 m. On the contrary, reaches originally sinuous remained quite stable, decreasing from 100 m to 95 m. Overall, relevant channel morphology modifications took place by 1855, when channel configuration had shifted from alternating longitudinal sequences of multi-thread and single-thread, at the beginning of the 19th century, to mainly single-thread. Total length of multi-thread reaches shifted from 31% in 1805, to 22% in 1847, to 8% in the 1855. On the contrary, sinuous and straight patterns increased from 26% (in 1803) to 62% (in 1847), up to 77% of the whole studied river length in 1855. Nevertheless, overall increases in channel braiding and mean channel width was observed downstream of the confluences with the main tributaries. Analysis of the evolutionary trajectory of channel morphology and of controlling factors, shows that human disturbances have largely prevailed over climatic influences in constraining the Adige's dynamics and morphology, mainly because of channelization causing sharp changes in channel pattern and width that occurred during the 19th century.

  14. Using Multi-threading for the Automatic Load Balancing of 2D Adaptive Finite Element Meshes

    NASA Technical Reports Server (NTRS)

    Heber, Gerd; Biswas, Rupak; Thulasiraman, Parimala; Gao, Guang R.; Saini, Subhash (Technical Monitor)

    1998-01-01

    In this paper, we present a multi-threaded approach for the automatic load balancing of adaptive finite element (FE) meshes The platform of our choice is the EARTH multi-threaded system which offers sufficient capabilities to tackle this problem. We implement the adaption phase of FE applications oil triangular meshes and exploit the EARTH token mechanism to automatically balance the resulting irregular and highly nonuniform workload. We discuss the results of our experiments oil EARTH-SP2, on implementation of EARTH on the IBM SP2 with different load balancing strategies that are built into the runtime system.

  15. Multi-threaded parallel simulation of non-local non-linear problems in ultrashort laser pulse propagation in the presence of plasma

    NASA Astrophysics Data System (ADS)

    Baregheh, Mandana; Mezentsev, Vladimir; Schmitz, Holger

    2011-06-01

    We describe a parallel multi-threaded approach for high performance modelling of wide class of phenomena in ultrafast nonlinear optics. Specific implementation has been performed using the highly parallel capabilities of a programmable graphics processor.

  16. Multi-thread parallel algorithm for reconstructing 3D large-scale porous structures

    NASA Astrophysics Data System (ADS)

    Ju, Yang; Huang, Yaohui; Zheng, Jiangtao; Qian, Xu; Xie, Heping; Zhao, Xi

    2017-04-01

    Geomaterials inherently contain many discontinuous, multi-scale, geometrically irregular pores, forming a complex porous structure that governs their mechanical and transport properties. The development of an efficient reconstruction method for representing porous structures can significantly contribute toward providing a better understanding of the governing effects of porous structures on the properties of porous materials. In order to improve the efficiency of reconstructing large-scale porous structures, a multi-thread parallel scheme was incorporated into the simulated annealing reconstruction method. In the method, four correlation functions, which include the two-point probability function, the linear-path functions for the pore phase and the solid phase, and the fractal system function for the solid phase, were employed for better reproduction of the complex well-connected porous structures. In addition, a random sphere packing method and a self-developed pre-conditioning method were incorporated to cast the initial reconstructed model and select independent interchanging pairs for parallel multi-thread calculation, respectively. The accuracy of the proposed algorithm was evaluated by examining the similarity between the reconstructed structure and a prototype in terms of their geometrical, topological, and mechanical properties. Comparisons of the reconstruction efficiency of porous models with various scales indicated that the parallel multi-thread scheme significantly shortened the execution time for reconstruction of a large-scale well-connected porous model compared to a sequential single-thread procedure.

  17. The CMS TierO goes Cloud and Grid for LHC Run 2

    NASA Astrophysics Data System (ADS)

    Hufnagel, Dirk

    2015-12-01

    In 2015, CMS will embark on a new era of collecting LHC collisions at unprecedented rates and complexity. This will put a tremendous stress on our computing systems. Prompt Processing of the raw data by the Tier-0 infrastructure will no longer be constrained to CERN alone due to the significantly increased resource requirements. In LHC Run 2, we will need to operate it as a distributed system utilizing both the CERN Cloud-based Agile Infrastructure and a significant fraction of the CMS Tier-1 Grid resources. In another big change for LHC Run 2, we will process all data using the multi-threaded framework to deal with the increased event complexity and to ensure efficient use of the resources. This contribution will cover the evolution of the Tier-0 infrastructure and present scale testing results and experiences from the first data taking in 2015.

  18. A software bus for thread objects

    NASA Technical Reports Server (NTRS)

    Callahan, John R.; Li, Dehuai

    1995-01-01

    The authors have implemented a software bus for lightweight threads in an object-oriented programming environment that allows for rapid reconfiguration and reuse of thread objects in discrete-event simulation experiments. While previous research in object-oriented, parallel programming environments has focused on direct communication between threads, our lightweight software bus, called the MiniBus, provides a means to isolate threads from their contexts of execution by restricting communications between threads to message-passing via their local ports only. The software bus maintains a topology of connections between these ports. It routes, queues, and delivers messages according to this topology. This approach allows for rapid reconfiguration and reuse of thread objects in other systems without making changes to the specifications or source code. A layered approach that provides the needed transparency to developers is presented. Examples of using the MiniBus are given, and the value of bus architectures in building and conducting simulations of discrete-event systems is discussed.

  19. 60Ma of legume nodulation. What's new? What's changing?

    PubMed

    Sprent, Janet I

    2008-01-01

    Current evidence suggests that legumes evolved about 60 million years ago. Genetic material for nodulation was recruited from existing DNA, often following gene duplication. The initial process of infection probably did not involve either root hairs or infection threads. From this initial event, two branched pathways of nodule developmental processes evolved, one involving and one not involving the development of infection threads to 'escort' bacteria to young nodule cells. Extant legumes have a wide range of nodule structures and at least 25% of them do not have infection threads. The latter have uniform infected tissue whereas those that have infection threads have infected cells interspersed with uninfected (interstitial) cells. Each type of nodule may develop indeterminately, with an apical meristem, or show determinate growth. These nodule structures are host determined and are largely congruent with taxonomic position. In addition to variation on the plant side, the last 10 years have seen the recognition of many new types of 'rhizobia', bacteria that can induce nodulation and fix nitrogen. It is not yet possible to fit these into the emerging pattern of nodule evolution.

  20. Fast parallel algorithm for slicing STL based on pipeline

    NASA Astrophysics Data System (ADS)

    Ma, Xulong; Lin, Feng; Yao, Bo

    2016-05-01

    In Additive Manufacturing field, the current researches of data processing mainly focus on a slicing process of large STL files or complicated CAD models. To improve the efficiency and reduce the slicing time, a parallel algorithm has great advantages. However, traditional algorithms can't make full use of multi-core CPU hardware resources. In the paper, a fast parallel algorithm is presented to speed up data processing. A pipeline mode is adopted to design the parallel algorithm. And the complexity of the pipeline algorithm is analyzed theoretically. To evaluate the performance of the new algorithm, effects of threads number and layers number are investigated by a serial of experiments. The experimental results show that the threads number and layers number are two remarkable factors to the speedup ratio. The tendency of speedup versus threads number reveals a positive relationship which greatly agrees with the Amdahl's law, and the tendency of speedup versus layers number also keeps a positive relationship agreeing with Gustafson's law. The new algorithm uses topological information to compute contours with a parallel method of speedup. Another parallel algorithm based on data parallel is used in experiments to show that pipeline parallel mode is more efficient. A case study at last shows a suspending performance of the new parallel algorithm. Compared with the serial slicing algorithm, the new pipeline parallel algorithm can make full use of the multi-core CPU hardware, accelerate the slicing process, and compared with the data parallel slicing algorithm, the new slicing algorithm in this paper adopts a pipeline parallel model, and a much higher speedup ratio and efficiency is achieved.

  1. CNT coated thread micro-electro-mechanical system for finger proprioception sensing

    NASA Astrophysics Data System (ADS)

    Shafi, A. A.; Wicaksono, D. H. B.

    2017-04-01

    In this paper, we aim to fabricate cotton thread based sensor for proprioceptive application. Cotton threads are utilized as the structural component of flexible sensors. The thread is coated with multi-walled carbon nanotube (MWCNT) dispersion by using facile conventional dipping-drying method. The electrical characterization of the coated thread found that the resistance per meter of the coated thread decreased with increasing the number of dipping. The CNT coated thread sensor works based on piezoresistive theory in which the resistance of the coated thread changes when force is applied. This thread sensor is sewed on glove at the index finger between middle and proximal phalanx parts and the resistance change is measured upon grasping mechanism. The thread based microelectromechanical system (MEMS) enables the flexible sensor to easily fit perfectly on the finger joint and gives reliable response as proprioceptive sensing.

  2. Applying Jlint to Space Exploration Software

    NASA Technical Reports Server (NTRS)

    Artho, Cyrille; Havelund, Klaus

    2004-01-01

    Java is a very successful programming language which is also becoming widespread in embedded systems, where software correctness is critical. Jlint is a simple but highly efficient static analyzer that checks a Java program for several common errors, such as null pointer exceptions, and overflow errors. It also includes checks for multi-threading problems, such as deadlocks and data races. The case study described here shows the effectiveness of Jlint in find-false positives in the multi-threading warnings gives an insight into design patterns commonly used in multi-threaded code. The results show that a few analysis techniques are sufficient to avoid almost all false positives. These techniques include investigating all possible callers and a few code idioms. Verifying the correct application of these patterns is still crucial, because their correct usage is not trivial.

  3. GPU COMPUTING FOR PARTICLE TRACKING

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nishimura, Hiroshi; Song, Kai; Muriki, Krishna

    2011-03-25

    This is a feasibility study of using a modern Graphics Processing Unit (GPU) to parallelize the accelerator particle tracking code. To demonstrate the massive parallelization features provided by GPU computing, a simplified TracyGPU program is developed for dynamic aperture calculation. Performances, issues, and challenges from introducing GPU are also discussed. General purpose Computation on Graphics Processing Units (GPGPU) bring massive parallel computing capabilities to numerical calculation. However, the unique architecture of GPU requires a comprehensive understanding of the hardware and programming model to be able to well optimize existing applications. In the field of accelerator physics, the dynamic aperture calculationmore » of a storage ring, which is often the most time consuming part of the accelerator modeling and simulation, can benefit from GPU due to its embarrassingly parallel feature, which fits well with the GPU programming model. In this paper, we use the Tesla C2050 GPU which consists of 14 multi-processois (MP) with 32 cores on each MP, therefore a total of 448 cores, to host thousands ot threads dynamically. Thread is a logical execution unit of the program on GPU. In the GPU programming model, threads are grouped into a collection of blocks Within each block, multiple threads share the same code, and up to 48 KB of shared memory. Multiple thread blocks form a grid, which is executed as a GPU kernel. A simplified code that is a subset of Tracy++ [2] is developed to demonstrate the possibility of using GPU to speed up the dynamic aperture calculation by having each thread track a particle.« less

  4. History of river regulation of the Noce River (NE Italy) and related bio-morphodynamic responses

    NASA Astrophysics Data System (ADS)

    Serlet, Alyssa; Scorpio, Vittoria; Mastronunzio, Marco; Proto, Matteo; Zen, Simone; Zolezzi, Guido; Bertoldi, Walter; Comiti, Francesco; Prà, Elena Dai; Surian, Nicola; Gurnell, Angela

    2016-04-01

    The Noce River is a hydropower-regulated Alpine stream in Northern-East Italy and a major tributary of the Adige River, the second longest Italian river. The objective of the research is to investigate the response of the lower course of the Noce to two main stages of hydromorphological regulation; channelization/ diversion and, one century later, hydropower regulation. This research uses a historical reconstruction to link the geomorphic response with natural and human-induced factors by identifying morphological and vegetation features from historical maps and airborne photogrammetry and implementing a quantitative analysis of the river response to channelization and flow / sediment supply regulation related to hydropower development. A descriptive overview is presented. The concept of evolutionary trajectory is integrated with predictions from morphodynamic theories for river bars that allow increased insight to investigate the river response to a complex sequence of regulatory events such as development of bars, islands and riparian vegetation. Until the mid-19th century the river had a multi-thread channel pattern. Thereafter (1852) the river was straightened and diverted. Upstream of Mezzolombardo village the river was constrained between embankments of approximately 100 m width while downstream they are of approximately 50 m width. Since channelization some interesting geomorphic changes have appeared in the river e.g. the appearance of alternate bars in the channel. In 1926 there was a breach in the right bank of the downstream part that resulted in a multi-thread river reach which can be viewed as a recovery to the earlier multi-thread pattern. After the 1950's the flow and sediment supply became strongly regulated by hydropower development. The analysis of aerial images reveals that the multi-thread reach became progressively stabilized by vegetation development over the bars, though signs of some dynamics can still be recognizable today, despite the strong hydropeaking that dominates the flow regime. The results of the historical analysis will be used in a larger framework that focuses on interdisciplinary research of interactions between flow, sediment and vegetation in regulated rivers and aims to enhance knowledge on the interplay between river bars and vegetation in the perspective of providing enhanced tools for river rehabilitation and restoration.

  5. Optimized FPGA Implementation of Multi-Rate FIR Filters Through Thread Decomposition

    NASA Technical Reports Server (NTRS)

    Kobayashi, Kayla N.; He, Yutao; Zheng, Jason X.

    2011-01-01

    Multi-rate finite impulse response (MRFIR) filters are among the essential signal-processing components in spaceborne instruments where finite impulse response filters are often used to minimize nonlinear group delay and finite precision effects. Cascaded (multistage) designs of MRFIR filters are further used for large rate change ratio in order to lower the required throughput, while simultaneously achieving comparable or better performance than single-stage designs. Traditional representation and implementation of MRFIR employ polyphase decomposition of the original filter structure, whose main purpose is to compute only the needed output at the lowest possible sampling rate. In this innovation, an alternative representation and implementation technique called TD-MRFIR (Thread Decomposition MRFIR) is presented. The basic idea is to decompose MRFIR into output computational threads, in contrast to a structural decomposition of the original filter as done in the polyphase decomposition. A naive implementation of a decimation filter consisting of a full FIR followed by a downsampling stage is very inefficient, as most of the computations performed by the FIR state are discarded through downsampling. In fact, only 1/M of the total computations are useful (M being the decimation factor). Polyphase decomposition provides an alternative view of decimation filters, where the downsampling occurs before the FIR stage, and the outputs are viewed as the sum of M sub-filters with length of N/M taps. Although this approach leads to more efficient filter designs, in general the implementation is not straightforward if the numbers of multipliers need to be minimized. In TD-MRFIR, each thread represents an instance of the finite convolution required to produce a single output of the MRFIR. The filter is thus viewed as a finite collection of concurrent threads. Each of the threads completes when a convolution result (filter output value) is computed, and activated when the first input of the convolution becomes available. Thus, the new threads get spawned at exactly the rate of N/M, where N is the total number of taps, and M is the decimation factor. Existing threads retire at the same rate of N/M. The implementation of an MRFIR is thus transformed into a problem to statically schedule the minimum number of multipliers such that all threads can be completed on time. Solving the static scheduling problem is rather straightforward if one examines the Thread Decomposition Diagram, which is a table-like diagram that has rows representing computation threads and columns representing time. The control logic of the MRFIR can be implemented using simple counters. Instead of decomposing MRFIRs into subfilters as suggested by polyphase decomposition, the thread decomposition diagrams transform the problem into a familiar one of static scheduling, which can be easily solved as the input rate is constant.

  6. Energy-aware Thread and Data Management in Heterogeneous Multi-core, Multi-memory Systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Su, Chun-Yi

    By 2004, microprocessor design focused on multicore scaling—increasing the number of cores per die in each generation—as the primary strategy for improving performance. These multicore processors typically equip multiple memory subsystems to improve data throughput. In addition, these systems employ heterogeneous processors such as GPUs and heterogeneous memories like non-volatile memory to improve performance, capacity, and energy efficiency. With the increasing volume of hardware resources and system complexity caused by heterogeneity, future systems will require intelligent ways to manage hardware resources. Early research to improve performance and energy efficiency on heterogeneous, multi-core, multi-memory systems focused on tuning a single primitivemore » or at best a few primitives in the systems. The key limitation of past efforts is their lack of a holistic approach to resource management that balances the tradeoff between performance and energy consumption. In addition, the shift from simple, homogeneous systems to these heterogeneous, multicore, multi-memory systems requires in-depth understanding of efficient resource management for scalable execution, including new models that capture the interchange between performance and energy, smarter resource management strategies, and novel low-level performance/energy tuning primitives and runtime systems. Tuning an application to control available resources efficiently has become a daunting challenge; managing resources in automation is still a dark art since the tradeoffs among programming, energy, and performance remain insufficiently understood. In this dissertation, I have developed theories, models, and resource management techniques to enable energy-efficient execution of parallel applications through thread and data management in these heterogeneous multi-core, multi-memory systems. I study the effect of dynamic concurrent throttling on the performance and energy of multi-core, non-uniform memory access (NUMA) systems. I use critical path analysis to quantify memory contention in the NUMA memory system and determine thread mappings. In addition, I implement a runtime system that combines concurrent throttling and a novel thread mapping algorithm to manage thread resources and improve energy efficient execution in multi-core, NUMA systems.« less

  7. Implementation of the ATLAS trigger within the multi-threaded software framework AthenaMT

    NASA Astrophysics Data System (ADS)

    Wynne, Ben; ATLAS Collaboration

    2017-10-01

    We present an implementation of the ATLAS High Level Trigger, HLT, that provides parallel execution of trigger algorithms within the ATLAS multithreaded software framework, AthenaMT. This development will enable the ATLAS HLT to meet future challenges due to the evolution of computing hardware and upgrades of the Large Hadron Collider, LHC, and ATLAS Detector. During the LHC data-taking period starting in 2021, luminosity will reach up to three times the original design value. Luminosity will increase further, to up to 7.5 times the design value, in 2026 following LHC and ATLAS upgrades. This includes an upgrade of the ATLAS trigger architecture that will result in an increase in the HLT input rate by a factor of 4 to 10 compared to the current maximum rate of 100 kHz. The current ATLAS multiprocess framework, AthenaMP, manages a number of processes that each execute algorithms sequentially for different events. AthenaMT will provide a fully multi-threaded environment that will additionally enable concurrent execution of algorithms within an event. This has the potential to significantly reduce the memory footprint on future manycore devices. An additional benefit of the HLT implementation within AthenaMT is that it facilitates the integration of offline code into the HLT. The trigger must retain high rejection in the face of increasing numbers of pileup collisions. This will be achieved by greater use of offline algorithms that are designed to maximize the discrimination of signal from background. Therefore a unification of the HLT and offline reconstruction software environment is required. This has been achieved while at the same time retaining important HLT-specific optimisations that minimize the computation performed to reach a trigger decision. Such optimizations include early event rejection and reconstruction within restricted geometrical regions. We report on an HLT prototype in which the need for HLT-specific components has been reduced to a minimum. Promising results have been obtained with a prototype that includes the key elements of trigger functionality including regional reconstruction and early event rejection. We report on the first experience of migrating trigger selections to this new framework and present the next steps towards a full implementation of the ATLAS trigger.

  8. Modeling Cooperative Threads to Project GPU Performance for Adaptive Parallelism

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Meng, Jiayuan; Uram, Thomas; Morozov, Vitali A.

    Most accelerators, such as graphics processing units (GPUs) and vector processors, are particularly suitable for accelerating massively parallel workloads. On the other hand, conventional workloads are developed for multi-core parallelism, which often scale to only a few dozen OpenMP threads. When hardware threads significantly outnumber the degree of parallelism in the outer loop, programmers are challenged with efficient hardware utilization. A common solution is to further exploit the parallelism hidden deep in the code structure. Such parallelism is less structured: parallel and sequential loops may be imperfectly nested within each other, neigh boring inner loops may exhibit different concurrency patternsmore » (e.g. Reduction vs. Forall), yet have to be parallelized in the same parallel section. Many input-dependent transformations have to be explored. A programmer often employs a larger group of hardware threads to cooperatively walk through a smaller outer loop partition and adaptively exploit any encountered parallelism. This process is time-consuming and error-prone, yet the risk of gaining little or no performance remains high for such workloads. To reduce risk and guide implementation, we propose a technique to model workloads with limited parallelism that can automatically explore and evaluate transformations involving cooperative threads. Eventually, our framework projects the best achievable performance and the most promising transformations without implementing GPU code or using physical hardware. We envision our technique to be integrated into future compilers or optimization frameworks for autotuning.« less

  9. Mechanism of supporting sub-communicator collectives with O(64) counters as opposed to one counter for each sub-communicator

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kumar, Sameer; Mamidala, Amith R.; Ratterman, Joseph D.

    A system and method for enhancing barrier collective synchronization on a computer system comprises a computer system including a data storage device. The computer system includes a program stored in the data storage device and steps of the program being executed by a processor. The system includes providing a plurality of communicators for storing state information for a bather algorithm. Each communicator designates a master core in a multi-processor environment of the computer system. The system allocates or designates one counter for each of a plurality of threads. The system configures a table with a number of entries equal tomore » the maximum number of threads. The system sets a table entry with an ID associated with a communicator when a process thread initiates a collective. The system determines an allocated or designated counter by searching entries in the table.« less

  10. Mechanism of supporting sub-communicator collectives with o(64) counters as opposed to one counter for each sub-communicator

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Blocksome, Michael; Kumar, Sameer; Mamidala, Amith R.

    A system and method for enhancing barrier collective synchronization on a computer system comprises a computer system including a data storage device. The computer system includes a program stored in the data storage device and steps of the program being executed by a processor. The system includes providing a plurality of communicators for storing state information for a barrier algorithm. Each communicator designates a master core in a multi-processor environment of the computer system. The system allocates or designates one counter for each of a plurality of threads. The system configures a table with a number of entries equal tomore » the maximum number of threads. The system sets a table entry with an ID associated with a communicator when a process thread initiates a collective. The system determines an allocated or designated counter by searching entries in the table.« less

  11. Mechanism of supporting sub-communicator collectives with O(64) counters as opposed to one counter for each sub-communicator

    DOEpatents

    Kumar, Sameer; Mamidala, Amith R.; Ratterman, Joseph D.; Blocksome, Michael; Miller, Douglas

    2013-09-03

    A system and method for enhancing barrier collective synchronization on a computer system comprises a computer system including a data storage device. The computer system includes a program stored in the data storage device and steps of the program being executed by a processor. The system includes providing a plurality of communicators for storing state information for a bather algorithm. Each communicator designates a master core in a multi-processor environment of the computer system. The system allocates or designates one counter for each of a plurality of threads. The system configures a table with a number of entries equal to the maximum number of threads. The system sets a table entry with an ID associated with a communicator when a process thread initiates a collective. The system determines an allocated or designated counter by searching entries in the table.

  12. Shadow-Bitcoin: Scalable Simulation via Direct Execution of Multi-Threaded Applications

    DTIC Science & Technology

    2015-08-10

    Shadow- Bitcoin : Scalable Simulation via Direct Execution of Multi-threaded Applications Andrew Miller University of Maryland amiller@cs.umd.edu Rob...Shadow plug-in that directly executes the Bitcoin reference client software. To demonstrate the usefulness of this tool, we present novel denial-of...service attacks against the Bit- coin software that exploit low-level implementation ar- tifacts in the Bitcoin reference client; our determinis- tic

  13. Observation and modelling of the Fe XXI line profile observed by IRIS during the impulsive phase of flares

    NASA Astrophysics Data System (ADS)

    Polito, V.; Testa, P.; De Pontieu, B.; Allred, J. C.

    2017-12-01

    The observation of the high temperature (above 10 MK) Fe XXI 1354.1 A line with the Interface Region Imaging Spectrograph (IRIS) has provided significant insights into the chromospheric evaporation process in flares. In particular, the line is often observed to be completely blueshifted, in contrast to previous observations at lower spatial and spectral resolution, and in agreement with predictions from theoretical models. Interestingly, the line is also observed to be mostly symmetric and with a large excess above the thermal width. One popular interpretation for the excess broadening is given by assuming a superposition of flows from different loop strands. In this work, we perform a statistical analysis of Fe XXI line profiles observed by IRIS during the impulsive phase of flares and compare our results with hydrodynamic simulations of multi-thread flare loops performed with the 1D RADYN code. Our results indicate that the multi-thread models cannot easily reproduce the symmetry of the line and that some other physical process might need to be invoked in order to explain the observed profiles.

  14. Parallelizing ATLAS Reconstruction and Simulation: Issues and Optimization Solutions for Scaling on Multi- and Many-CPU Platforms

    NASA Astrophysics Data System (ADS)

    Leggett, C.; Binet, S.; Jackson, K.; Levinthal, D.; Tatarkhanov, M.; Yao, Y.

    2011-12-01

    Thermal limitations have forced CPU manufacturers to shift from simply increasing clock speeds to improve processor performance, to producing chip designs with multi- and many-core architectures. Further the cores themselves can run multiple threads as a zero overhead context switch allowing low level resource sharing (Intel Hyperthreading). To maximize bandwidth and minimize memory latency, memory access has become non uniform (NUMA). As manufacturers add more cores to each chip, a careful understanding of the underlying architecture is required in order to fully utilize the available resources. We present AthenaMP and the Atlas event loop manager, the driver of the simulation and reconstruction engines, which have been rewritten to make use of multiple cores, by means of event based parallelism, and final stage I/O synchronization. However, initial studies on 8 andl6 core Intel architectures have shown marked non-linearities as parallel process counts increase, with as much as 30% reductions in event throughput in some scenarios. Since the Intel Nehalem architecture (both Gainestown and Westmere) will be the most common choice for the next round of hardware procurements, an understanding of these scaling issues is essential. Using hardware based event counters and Intel's Performance Tuning Utility, we have studied the performance bottlenecks at the hardware level, and discovered optimization schemes to maximize processor throughput. We have also produced optimization mechanisms, common to all large experiments, that address the extreme nature of today's HEP code, which due to it's size, places huge burdens on the memory infrastructure of today's processors.

  15. Development and study of a parallel algorithm of iteratively forming latent functionally-determined structures for classification and analysis of meteorological data

    NASA Astrophysics Data System (ADS)

    Sorokin, V. A.; Volkov, Yu V.; Sherstneva, A. I.; Botygin, I. A.

    2016-11-01

    This paper overviews a method of generating climate regions based on an analytic signal theory. When applied to atmospheric surface layer temperature data sets, the method allows forming climatic structures with the corresponding changes in the temperature to make conclusions on the uniformity of climate in an area and to trace the climate changes in time by analyzing the type group shifts. The algorithm is based on the fact that the frequency spectrum of the thermal oscillation process is narrow-banded and has only one mode for most weather stations. This allows using the analytic signal theory, causality conditions and introducing an oscillation phase. The annual component of the phase, being a linear function, was removed by the least squares method. The remaining phase fluctuations allow consistent studying of their coordinated behavior and timing, using the Pearson correlation coefficient for dependence evaluation. This study includes program experiments to evaluate the calculation efficiency in the phase grouping task. The paper also overviews some single-threaded and multi-threaded computing models. It is shown that the phase grouping algorithm for meteorological data can be parallelized and that a multi-threaded implementation leads to a 25-30% increase in the performance.

  16. Large-scale automated image analysis for computational profiling of brain tissue surrounding implanted neuroprosthetic devices using Python.

    PubMed

    Rey-Villamizar, Nicolas; Somasundar, Vinay; Megjhani, Murad; Xu, Yan; Lu, Yanbin; Padmanabhan, Raghav; Trett, Kristen; Shain, William; Roysam, Badri

    2014-01-01

    In this article, we describe the use of Python for large-scale automated server-based bio-image analysis in FARSIGHT, a free and open-source toolkit of image analysis methods for quantitative studies of complex and dynamic tissue microenvironments imaged by modern optical microscopes, including confocal, multi-spectral, multi-photon, and time-lapse systems. The core FARSIGHT modules for image segmentation, feature extraction, tracking, and machine learning are written in C++, leveraging widely used libraries including ITK, VTK, Boost, and Qt. For solving complex image analysis tasks, these modules must be combined into scripts using Python. As a concrete example, we consider the problem of analyzing 3-D multi-spectral images of brain tissue surrounding implanted neuroprosthetic devices, acquired using high-throughput multi-spectral spinning disk step-and-repeat confocal microscopy. The resulting images typically contain 5 fluorescent channels. Each channel consists of 6000 × 10,000 × 500 voxels with 16 bits/voxel, implying image sizes exceeding 250 GB. These images must be mosaicked, pre-processed to overcome imaging artifacts, and segmented to enable cellular-scale feature extraction. The features are used to identify cell types, and perform large-scale analysis for identifying spatial distributions of specific cell types relative to the device. Python was used to build a server-based script (Dell 910 PowerEdge servers with 4 sockets/server with 10 cores each, 2 threads per core and 1TB of RAM running on Red Hat Enterprise Linux linked to a RAID 5 SAN) capable of routinely handling image datasets at this scale and performing all these processing steps in a collaborative multi-user multi-platform environment. Our Python script enables efficient data storage and movement between computers and storage servers, logs all the processing steps, and performs full multi-threaded execution of all codes, including open and closed-source third party libraries.

  17. GPU accelerated dynamic functional connectivity analysis for functional MRI data.

    PubMed

    Akgün, Devrim; Sakoğlu, Ünal; Esquivel, Johnny; Adinoff, Bryon; Mete, Mutlu

    2015-07-01

    Recent advances in multi-core processors and graphics card based computational technologies have paved the way for an improved and dynamic utilization of parallel computing techniques. Numerous applications have been implemented for the acceleration of computationally-intensive problems in various computational science fields including bioinformatics, in which big data problems are prevalent. In neuroimaging, dynamic functional connectivity (DFC) analysis is a computationally demanding method used to investigate dynamic functional interactions among different brain regions or networks identified with functional magnetic resonance imaging (fMRI) data. In this study, we implemented and analyzed a parallel DFC algorithm based on thread-based and block-based approaches. The thread-based approach was designed to parallelize DFC computations and was implemented in both Open Multi-Processing (OpenMP) and Compute Unified Device Architecture (CUDA) programming platforms. Another approach developed in this study to better utilize CUDA architecture is the block-based approach, where parallelization involves smaller parts of fMRI time-courses obtained by sliding-windows. Experimental results showed that the proposed parallel design solutions enabled by the GPUs significantly reduce the computation time for DFC analysis. Multicore implementation using OpenMP on 8-core processor provides up to 7.7× speed-up. GPU implementation using CUDA yielded substantial accelerations ranging from 18.5× to 157× speed-up once thread-based and block-based approaches were combined in the analysis. Proposed parallel programming solutions showed that multi-core processor and CUDA-supported GPU implementations accelerated the DFC analyses significantly. Developed algorithms make the DFC analyses more practical for multi-subject studies with more dynamic analyses. Copyright © 2015 Elsevier Ltd. All rights reserved.

  18. Hardware based redundant multi-threading inside a GPU for improved reliability

    DOEpatents

    Sridharan, Vilas; Gurumurthi, Sudhanva

    2015-05-05

    A system and method for verifying computation output using computer hardware are provided. Instances of computation are generated and processed on hardware-based processors. As instances of computation are processed, each instance of computation receives a load accessible to other instances of computation. Instances of output are generated by processing the instances of computation. The instances of output are verified against each other in a hardware based processor to ensure accuracy of the output.

  19. RACER: Effective Race Detection Using AspectJ

    NASA Technical Reports Server (NTRS)

    Bodden, Eric; Havelund, Klaus

    2008-01-01

    The limits of coding with joint constraints on detected and undetected error rates Programming errors occur frequently in large software systems, and even more so if these systems are concurrent. In the past, researchers have developed specialized programs to aid programmers detecting concurrent programming errors such as deadlocks, livelocks, starvation and data races. In this work we propose a language extension to the aspect-oriented programming language AspectJ, in the form of three new built-in pointcuts, lock(), unlock() and may be Shared(), which allow programmers to monitor program events where locks are granted or handed back, and where values are accessed that may be shared amongst multiple Java threads. We decide thread-locality using a static thread-local objects analysis developed by others. Using the three new primitive pointcuts, researchers can directly implement efficient monitoring algorithms to detect concurrent programming errors online. As an example, we expose a new algorithm which we call RACER, an adoption of the well-known ERASER algorithm to the memory model of Java. We implemented the new pointcuts as an extension to the Aspect Bench Compiler, implemented the RACER algorithm using this language extension and then applied the algorithm to the NASA K9 Rover Executive. Our experiments proved our implementation very effective. In the Rover Executive RACER finds 70 data races. Only one of these races was previously known.We further applied the algorithm to two other multi-threaded programs written by Computer Science researchers, in which we found races as well.

  20. Cable-type supercapacitors of three-dimensional cotton thread based multi-grade nanostructures for wearable energy storage.

    PubMed

    Liu, Nishuang; Ma, Wenzhen; Tao, Jiayou; Zhang, Xianghui; Su, Jun; Li, Luying; Yang, Congxing; Gao, Yihua; Golberg, Dmitri; Bando, Yoshio

    2013-09-20

    A novel cable-type flexible supercapacitor with excellent performance is fabricated using 3D polypyrrole(PPy)-MnO2 -CNT-cotton thread multi-grade nanostructure-based electrodes. The multiple supercapacitors with a high areal capacitance 1.49 F cm(-2) at a scan rate of 1 mV s(-1) connected in series and in parallel can successfully drive a LED segment display. Such an excellent performance is attributed to the cumulative effect of conducting single-walled carbon nanotubes on cotton thread, active mesoporous flower-like MnO2 nanoplates, and PPy conductive wrapping layer improving the conductivity, and acting as pseudocapacitance material simultaneously. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  1. A Multi-Threaded Cryptographic Pseudorandom Number Generator Test Suite

    DTIC Science & Technology

    2016-09-01

    bitcoin thieves, Google releases patch. (2013, Aug. 16). SiliconANGLE. [Online]. Available: http://siliconangle.com/blog/2013/ 08/16/android-crypto-prng...flaw-aided- bitcoin -thieves-google-releases-patch/ [5] M. Gondree. (2014, Sep. 28). NPS POSIX thread pool library. [Online]. Available: https

  2. Shared prefetching to reduce execution skew in multi-threaded systems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eichenberger, Alexandre E; Gunnels, John A

    Mechanisms are provided for optimizing code to perform prefetching of data into a shared memory of a computing device that is shared by a plurality of threads that execute on the computing device. A memory stream of a portion of code that is shared by the plurality of threads is identified. A set of prefetch instructions is distributed across the plurality of threads. Prefetch instructions are inserted into the instruction sequences of the plurality of threads such that each instruction sequence has a separate sub-portion of the set of prefetch instructions, thereby generating optimized code. Executable code is generated basedmore » on the optimized code and stored in a storage device. The executable code, when executed, performs the prefetches associated with the distributed set of prefetch instructions in a shared manner across the plurality of threads.« less

  3. Results of SEI Independent Research and Development Projects

    DTIC Science & Technology

    2008-12-01

    contained there. When laptops with a dual-core processor came out, ITunes fails crashed. ITunes was designed as multi-threaded application, but until...involving product portfolio, in-bound technical marketing, research and development, product engineering, supply chain, and out-bound sales and marketing...of quality and process improvement professionals to the marketing, product engineering, supply chain, product test and sales professionals. 3

  4. Accelerating the Gillespie Exact Stochastic Simulation Algorithm using hybrid parallel execution on graphics processing units.

    PubMed

    Komarov, Ivan; D'Souza, Roshan M

    2012-01-01

    The Gillespie Stochastic Simulation Algorithm (GSSA) and its variants are cornerstone techniques to simulate reaction kinetics in situations where the concentration of the reactant is too low to allow deterministic techniques such as differential equations. The inherent limitations of the GSSA include the time required for executing a single run and the need for multiple runs for parameter sweep exercises due to the stochastic nature of the simulation. Even very efficient variants of GSSA are prohibitively expensive to compute and perform parameter sweeps. Here we present a novel variant of the exact GSSA that is amenable to acceleration by using graphics processing units (GPUs). We parallelize the execution of a single realization across threads in a warp (fine-grained parallelism). A warp is a collection of threads that are executed synchronously on a single multi-processor. Warps executing in parallel on different multi-processors (coarse-grained parallelism) simultaneously generate multiple trajectories. Novel data-structures and algorithms reduce memory traffic, which is the bottleneck in computing the GSSA. Our benchmarks show an 8×-120× performance gain over various state-of-the-art serial algorithms when simulating different types of models.

  5. Efficient Parallelization of a Dynamic Unstructured Application on the Tera MTA

    NASA Technical Reports Server (NTRS)

    Oliker, Leonid; Biswas, Rupak

    1999-01-01

    The success of parallel computing in solving real-life computationally-intensive problems relies on their efficient mapping and execution on large-scale multiprocessor architectures. Many important applications are both unstructured and dynamic in nature, making their efficient parallel implementation a daunting task. This paper presents the parallelization of a dynamic unstructured mesh adaptation algorithm using three popular programming paradigms on three leading supercomputers. We examine an MPI message-passing implementation on the Cray T3E and the SGI Origin2OOO, a shared-memory implementation using cache coherent nonuniform memory access (CC-NUMA) of the Origin2OOO, and a multi-threaded version on the newly-released Tera Multi-threaded Architecture (MTA). We compare several critical factors of this parallel code development, including runtime, scalability, programmability, and memory overhead. Our overall results demonstrate that multi-threaded systems offer tremendous potential for quickly and efficiently solving some of the most challenging real-life problems on parallel computers.

  6. High Resolution Modelling of the Congo River's Multi-Threaded Main Stem Hydraulics

    NASA Astrophysics Data System (ADS)

    Carr, A. B.; Trigg, M.; Tshimanga, R.; Neal, J. C.; Borman, D.; Smith, M. W.; Bola, G.; Kabuya, P.; Mushie, C. A.; Tschumbu, C. L.

    2017-12-01

    We present the results of a summer 2017 field campaign by members of the Congo River users Hydraulics and Morphology (CRuHM) project, and a subsequent reach-scale hydraulic modelling study on the Congo's main stem. Sonar bathymetry, ADCP transects, and water surface elevation data have been collected along the Congo's heavily multi-threaded middle reach, which exhibits complex in-channel hydraulic processes that are not well understood. To model the entire basin's hydrodynamics, these in-channel hydraulic processes must be parameterised since it is not computationally feasible to represent them explicitly. Furthermore, recent research suggests that relative to other large global rivers, in-channel flows on the Congo represent a relatively large proportion of total flow through the river-floodplain system. We therefore regard sufficient representation of in-channel hydraulic processes as a Congo River hydrodynamic research priority. To enable explicit representation of in-channel hydraulics, we develop a reach-scale (70 km), high resolution hydraulic model. Simulation of flow through individual channel threads provides new information on flow depths and velocities, and will be used to inform the parameterisation of a broader basin-scale hydrodynamic model. The basin-scale model will ultimately be used to investigate floodplain fluxes, flood wave attenuation, and the impact of future hydrological change scenarios on basin hydrodynamics. This presentation will focus on the methodology we use to develop a reach-scale bathymetric DEM. The bathymetry of only a small proportion of channel threads can realistically be captured, necessitating some estimation of the bathymetry of channels not surveyed. We explore different approaches to this bathymetry estimation, and the extent to which it influences hydraulic model predictions. The CRuHM project is a consortium comprising the Universities of Kinshasa, Rhodes, Dar es Salaam, Bristol, and Leeds, and is funded by Royal Society-DFID Africa Capacity Building Initiative. The project aims to strengthen institutional research capacity and advance our understanding of the hydrology, hydrodynamics and sediment dynamics of the world's second largest river system through fieldwork and development of numerical models.

  7. A Magnetic Reconnection Event in the Solar Atmosphere Driven by Relaxation of a Twisted Arch Filament System

    NASA Astrophysics Data System (ADS)

    Huang, Zhenghua; Mou, Chaozhou; Fu, Hui; Deng, Linhua; Li, Bo; Xia, Lidong

    2018-02-01

    We present high-resolution observations of a magnetic reconnection event in the solar atmosphere taken with the New Vacuum Solar Telescope, Atmospheric Imaging Assembly (AIA), and Helioseismic and Magnetic Imager (HMI). The reconnection event occurred between the threads of a twisted arch filament system (AFS) and coronal loops. Our observations reveal that the relaxation of the twisted AFS drives some of its threads to encounter the coronal loops, providing inflows of the reconnection. The reconnection is evidenced by flared X-shape features in the AIA images, a current-sheet-like feature apparently connecting post-reconnection loops in the Hα + 1 Å images, small-scale magnetic cancelation in the HMI magnetograms and flows with speeds of 40–80 km s‑1 along the coronal loops. The post-reconnection coronal loops seen in the AIA 94 Å passband appear to remain bright for a relatively long time, suggesting that they have been heated and/or filled up by dense plasmas previously stored in the AFS threads. Our observations suggest that the twisted magnetic system could release its free magnetic energy into the upper solar atmosphere through reconnection processes. While the plasma pressure in the reconnecting flux tubes are significantly different, the reconfiguration of field lines could result in transferring of mass among them and induce heating therein.

  8. The Combined Effects of Response Time and Message Content on Growth Patterns of Discussion Threads in Computer-Supported Collaborative Argumentation

    ERIC Educational Resources Information Center

    Jeong, Allan

    2004-01-01

    This study examined the effects of response time and message content on the growth patterns of discussion threads in computer-supported collaborative argumentation. Event sequence analysis was used to measure response times between threaded messages and responses containing arguments, evidence, critiques, evaluations, and other comments from…

  9. Formation and Evolution of a Multi-Threaded Prominence

    NASA Technical Reports Server (NTRS)

    Luna, M.; Karpen, J. T.; DeVore, C. R.

    2012-01-01

    We investigate the process of formation and subsequent evolution of prominence plasma in a filament channel and its overlying arcade. We construct a three-dimensional time-dependent model of a filament-channel prominence suitable to be compared with observations. We combine this magnetic field structure with one-dimensional independent simulations of many flux tubes. The magnetic structure is a three-dimensional sheared double arcade, and the thermal non-equilibrium process governs the plasma evolution. We have found that the condensations in the corona can be divided into two populations: threads and blobs. Threads are massive condensations that linger in the field line dips. Blobs are ubiquitous small condensations that are produced throughout the filament and overlying arcade magnetic structure, and rapidly fall to the chromosphere. The total prominence mass is in agreement with observations. The threads are the principal contributors to the total mass, whereas the blob contribution is small. The motion of the threads is basically horizontal, while blobs move in all directions along the field. The peak velocities for both populations are comparable, but there is a weak tendency for the velocity to increase with the inclination, and the blobs with motion near vertical have the largest values of the velocity. We have generated synthetic images of the whole structure in an H proxy and in two EUV channels of the AIA instrument aboard SDO. These images show the plasma at cool, warm and hot temperatures. The theoretical differential emission measure of our system agrees very well with observations in the temperature range log T = 4.6-5.7. We conclude that the sheared-arcade magnetic structure and plasma dynamics fit well the abundant observational evidence.

  10. A Massively Parallel Computational Method of Reading Index Files for SOAPsnv.

    PubMed

    Zhu, Xiaoqian; Peng, Shaoliang; Liu, Shaojie; Cui, Yingbo; Gu, Xiang; Gao, Ming; Fang, Lin; Fang, Xiaodong

    2015-12-01

    SOAPsnv is the software used for identifying the single nucleotide variation in cancer genes. However, its performance is yet to match the massive amount of data to be processed. Experiments reveal that the main performance bottleneck of SOAPsnv software is the pileup algorithm. The original pileup algorithm's I/O process is time-consuming and inefficient to read input files. Moreover, the scalability of the pileup algorithm is also poor. Therefore, we designed a new algorithm, named BamPileup, aiming to improve the performance of sequential read, and the new pileup algorithm implemented a parallel read mode based on index. Using this method, each thread can directly read the data start from a specific position. The results of experiments on the Tianhe-2 supercomputer show that, when reading data in a multi-threaded parallel I/O way, the processing time of algorithm is reduced to 3.9 s and the application program can achieve a speedup up to 100×. Moreover, the scalability of the new algorithm is also satisfying.

  11. Multigroup Monte Carlo on GPUs: Comparison of history- and event-based algorithms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hamilton, Steven P.; Slattery, Stuart R.; Evans, Thomas M.

    This article presents an investigation of the performance of different multigroup Monte Carlo transport algorithms on GPUs with a discussion of both history-based and event-based approaches. Several algorithmic improvements are introduced for both approaches. By modifying the history-based algorithm that is traditionally favored in CPU-based MC codes to occasionally filter out dead particles to reduce thread divergence, performance exceeds that of either the pure history-based or event-based approaches. The impacts of several algorithmic choices are discussed, including performance studies on Kepler and Pascal generation NVIDIA GPUs for fixed source and eigenvalue calculations. Single-device performance equivalent to 20–40 CPU cores onmore » the K40 GPU and 60–80 CPU cores on the P100 GPU is achieved. Last, in addition, nearly perfect multi-device parallel weak scaling is demonstrated on more than 16,000 nodes of the Titan supercomputer.« less

  12. Multigroup Monte Carlo on GPUs: Comparison of history- and event-based algorithms

    DOE PAGES

    Hamilton, Steven P.; Slattery, Stuart R.; Evans, Thomas M.

    2017-12-22

    This article presents an investigation of the performance of different multigroup Monte Carlo transport algorithms on GPUs with a discussion of both history-based and event-based approaches. Several algorithmic improvements are introduced for both approaches. By modifying the history-based algorithm that is traditionally favored in CPU-based MC codes to occasionally filter out dead particles to reduce thread divergence, performance exceeds that of either the pure history-based or event-based approaches. The impacts of several algorithmic choices are discussed, including performance studies on Kepler and Pascal generation NVIDIA GPUs for fixed source and eigenvalue calculations. Single-device performance equivalent to 20–40 CPU cores onmore » the K40 GPU and 60–80 CPU cores on the P100 GPU is achieved. Last, in addition, nearly perfect multi-device parallel weak scaling is demonstrated on more than 16,000 nodes of the Titan supercomputer.« less

  13. Flare particle acceleration in the interaction of twisted coronal flux ropes

    NASA Astrophysics Data System (ADS)

    Threlfall, J.; Hood, A. W.; Browning, P. K.

    2018-03-01

    Aim. The aim of this work is to investigate and characterise non-thermal particle behaviour in a three-dimensional (3D) magnetohydrodynamical (MHD) model of unstable multi-threaded flaring coronal loops. Methods: We have used a numerical scheme which solves the relativistic guiding centre approximation to study the motion of electrons and protons. The scheme uses snapshots from high resolution numerical MHD simulations of coronal loops containing two threads, where a single thread becomes unstable and (in one case) destabilises and merges with an additional thread. Results: The particle responses to the reconnection and fragmentation in MHD simulations of two loop threads are examined in detail. We illustrate the role played by uniform background resistivity and distinguish this from the role of anomalous resistivity using orbits in an MHD simulation where only one thread becomes unstable without destabilising further loop threads. We examine the (scalable) orbit energy gains and final positions recovered at different stages of a second MHD simulation wherein a secondary loop thread is destabilised by (and merges with) the first thread. We compare these results with other theoretical particle acceleration models in the context of observed energetic particle populations during solar flares.

  14. Comprehensive Software Simulation on Ground Power Supply for Launch Pads and Processing Facilities at NASA Kennedy Space Center

    NASA Technical Reports Server (NTRS)

    Dominguez, Jesus A.; Victor, Elias; Vasquez, Angel L.; Urbina, Alfredo R.

    2017-01-01

    A multi-threaded software application has been developed in-house by the Ground Special Power (GSP) team at NASA Kennedy Space Center (KSC) to separately simulate and fully emulate all units that supply VDC power and battery-based power backup to multiple KSC launch ground support systems for NASA Space Launch Systems (SLS) rocket.

  15. Multi-petascale highly efficient parallel supercomputer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Asaad, Sameh; Bellofatto, Ralph E.; Blocksome, Michael A.

    A Multi-Petascale Highly Efficient Parallel Supercomputer of 100 petaflop-scale includes node architectures based upon System-On-a-Chip technology, where each processing node comprises a single Application Specific Integrated Circuit (ASIC). The ASIC nodes are interconnected by a five dimensional torus network that optimally maximize the throughput of packet communications between nodes and minimize latency. The network implements collective network and a global asynchronous network that provides global barrier and notification functions. Integrated in the node design include a list-based prefetcher. The memory system implements transaction memory, thread level speculation, and multiversioning cache that improves soft error rate at the same time andmore » supports DMA functionality allowing for parallel processing message-passing.« less

  16. Present Situation of the Anti-Fatigue Processing of High-Strength Steel Internal Thread Based on Cold Extrusion Technology: A Review

    NASA Astrophysics Data System (ADS)

    Miao, Hong; Jiang, Cheng; Liu, Sixing; Zhang, Shanwen; Zhang, Yanjun

    2017-03-01

    The adoption of cold-extrusion forming for internal thread net forming becomes an important component of anti-fatigue processing with the development of internal thread processing towards high performance, low cost and low energy consumption. It has vast application foreground in the field of aviation, spaceflight, high speed train and etc. The internal thread processing and anti-fatigue manufacture technology are summarized. In terms of the perspective of processing quality and fatigue serving life, the advantages and disadvantages of the processing methods from are compared. The internal thread cold-extrusion processing technology is investigated for the purpose of improving the anti-fatigue serving life of internal thread. The superiorities of the plastic deformation law and surface integrity of the metal layer in the course of cold extrusion for improving its stability and economy are summed up. The proposed research forecasts the development tendency of the internal thread anti-fatigue manufacturing technology.

  17. Integrating Health Information Systems into a Database Course: A Case Study

    ERIC Educational Resources Information Center

    Anderson, Nicole; Zhang, Mingrui; McMaster, Kirby

    2011-01-01

    Computer Science is a rich field with many growing application areas, such as Health Information Systems. What we suggest here is that multi-disciplinary threads can be introduced to supplement, enhance, and strengthen the primary area of study in a course. We call these supplementary materials "threads," because they are executed…

  18. Electronic and optical properties of GaN/AlN quantum dots with adjacent threading dislocations

    NASA Astrophysics Data System (ADS)

    Ye, Han; Lu, Peng-Fei; Yu, Zhong-Yuan; Yao, Wen-Jie; Chen, Zhi-Hui; Jia, Bo-Yong; Liu, Yu-Min

    2010-04-01

    We present a theory to simulate a coherent GaN QD with an adjacent pure edge threading dislocation by using a finite element method. The piezoelectric effects and the strain modified band edges are investigated in the framework of multi-band k · p theory to calculate the electron and the heavy hole energy levels. The linear optical absorption coefficients corresponding to the interband ground state transition are obtained via the density matrix approach and perturbation expansion method. The results indicate that the strain distribution of the threading dislocation affects the electronic structure. Moreover, the ground state transition behaviour is also influenced by the position of the adjacent threading dislocation.

  19. CUDA Optimization Strategies for Compute- and Memory-Bound Neuroimaging Algorithms

    PubMed Central

    Lee, Daren; Dinov, Ivo; Dong, Bin; Gutman, Boris; Yanovsky, Igor; Toga, Arthur W.

    2011-01-01

    As neuroimaging algorithms and technology continue to grow faster than CPU performance in complexity and image resolution, data-parallel computing methods will be increasingly important. The high performance, data-parallel architecture of modern graphical processing units (GPUs) can reduce computational times by orders of magnitude. However, its massively threaded architecture introduces challenges when GPU resources are exceeded. This paper presents optimization strategies for compute- and memory-bound algorithms for the CUDA architecture. For compute-bound algorithms, the registers are reduced through variable reuse via shared memory and the data throughput is increased through heavier thread workloads and maximizing the thread configuration for a single thread block per multiprocessor. For memory-bound algorithms, fitting the data into the fast but limited GPU resources is achieved through reorganizing the data into self-contained structures and employing a multi-pass approach. Memory latencies are reduced by selecting memory resources whose cache performance are optimized for the algorithm's access patterns. We demonstrate the strategies on two computationally expensive algorithms and achieve optimized GPU implementations that perform up to 6× faster than unoptimized ones. Compared to CPU implementations, we achieve peak GPU speedups of 129× for the 3D unbiased nonlinear image registration technique and 93× for the non-local means surface denoising algorithm. PMID:21159404

  20. CUDA optimization strategies for compute- and memory-bound neuroimaging algorithms.

    PubMed

    Lee, Daren; Dinov, Ivo; Dong, Bin; Gutman, Boris; Yanovsky, Igor; Toga, Arthur W

    2012-06-01

    As neuroimaging algorithms and technology continue to grow faster than CPU performance in complexity and image resolution, data-parallel computing methods will be increasingly important. The high performance, data-parallel architecture of modern graphical processing units (GPUs) can reduce computational times by orders of magnitude. However, its massively threaded architecture introduces challenges when GPU resources are exceeded. This paper presents optimization strategies for compute- and memory-bound algorithms for the CUDA architecture. For compute-bound algorithms, the registers are reduced through variable reuse via shared memory and the data throughput is increased through heavier thread workloads and maximizing the thread configuration for a single thread block per multiprocessor. For memory-bound algorithms, fitting the data into the fast but limited GPU resources is achieved through reorganizing the data into self-contained structures and employing a multi-pass approach. Memory latencies are reduced by selecting memory resources whose cache performance are optimized for the algorithm's access patterns. We demonstrate the strategies on two computationally expensive algorithms and achieve optimized GPU implementations that perform up to 6× faster than unoptimized ones. Compared to CPU implementations, we achieve peak GPU speedups of 129× for the 3D unbiased nonlinear image registration technique and 93× for the non-local means surface denoising algorithm. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  1. ATLAS DataFlow Infrastructure: Recent results from ATLAS cosmic and first-beam data-taking

    NASA Astrophysics Data System (ADS)

    Vandelli, Wainer; ATLAS TDAQ Collaboration

    2010-04-01

    The ATLAS DataFlow infrastructure is responsible for the collection and conveyance of event data from the detector front-end electronics to the mass storage. Several optimized and multi-threaded applications fulfill this purpose operating over a multi-stage Gigabit Ethernet network which is the backbone of the ATLAS Trigger and Data Acquisition System. The system must be able to efficiently transport event-data with high reliability, while providing aggregated bandwidths larger than 5 GByte/s and coping with many thousands network connections. Nevertheless, routing and streaming capabilities and monitoring and data accounting functionalities are also fundamental requirements. During 2008, a few months of ATLAS cosmic data-taking and the first experience with the LHC beams provided an unprecedented test-bed for the evaluation of the performance of the ATLAS DataFlow, in terms of functionality, robustness and stability. Besides, operating the system far from its design specifications helped in exercising its flexibility and contributed in understanding its limitations. Moreover, the integration with the detector and the interfacing with the off-line data processing and management have been able to take advantage of this extended data taking-period as well. In this paper we report on the usage of the DataFlow infrastructure during the ATLAS data-taking. These results, backed-up by complementary performance tests, validate the architecture of the ATLAS DataFlow and prove that the system is robust, flexible and scalable enough to cope with the final requirements of the ATLAS experiment.

  2. Halvade-RNA: Parallel variant calling from transcriptomic data using MapReduce.

    PubMed

    Decap, Dries; Reumers, Joke; Herzeel, Charlotte; Costanza, Pascal; Fostier, Jan

    2017-01-01

    Given the current cost-effectiveness of next-generation sequencing, the amount of DNA-seq and RNA-seq data generated is ever increasing. One of the primary objectives of NGS experiments is calling genetic variants. While highly accurate, most variant calling pipelines are not optimized to run efficiently on large data sets. However, as variant calling in genomic data has become common practice, several methods have been proposed to reduce runtime for DNA-seq analysis through the use of parallel computing. Determining the effectively expressed variants from transcriptomics (RNA-seq) data has only recently become possible, and as such does not yet benefit from efficiently parallelized workflows. We introduce Halvade-RNA, a parallel, multi-node RNA-seq variant calling pipeline based on the GATK Best Practices recommendations. Halvade-RNA makes use of the MapReduce programming model to create and manage parallel data streams on which multiple instances of existing tools such as STAR and GATK operate concurrently. Whereas the single-threaded processing of a typical RNA-seq sample requires ∼28h, Halvade-RNA reduces this runtime to ∼2h using a small cluster with two 20-core machines. Even on a single, multi-core workstation, Halvade-RNA can significantly reduce runtime compared to using multi-threading, thus providing for a more cost-effective processing of RNA-seq data. Halvade-RNA is written in Java and uses the Hadoop MapReduce 2.0 API. It supports a wide range of distributions of Hadoop, including Cloudera and Amazon EMR.

  3. Tool Removes Coil-Spring Thread Inserts

    NASA Technical Reports Server (NTRS)

    Collins, Gerald J., Jr.; Swenson, Gary J.; Mcclellan, J. Scott

    1991-01-01

    Tool removes coil-spring thread inserts from threaded holes. Threads into hole, pries insert loose, grips insert, then pulls insert to thread it out of hole. Effects essentially reverse of insertion process to ease removal and avoid further damage to threaded inner surface of hole.

  4. Kernel optimization for short-range molecular dynamics

    NASA Astrophysics Data System (ADS)

    Hu, Changjun; Wang, Xianmeng; Li, Jianjiang; He, Xinfu; Li, Shigang; Feng, Yangde; Yang, Shaofeng; Bai, He

    2017-02-01

    To optimize short-range force computations in Molecular Dynamics (MD) simulations, multi-threading and SIMD optimizations are presented in this paper. With respect to multi-threading optimization, a Partition-and-Separate-Calculation (PSC) method is designed to avoid write conflicts caused by using Newton's third law. Serial bottlenecks are eliminated with no additional memory usage. The method is implemented by using the OpenMP model. Furthermore, the PSC method is employed on Intel Xeon Phi coprocessors in both native and offload models. We also evaluate the performance of the PSC method under different thread affinities on the MIC architecture. In the SIMD execution, we explain the performance influence in the PSC method, considering the "if-clause" of the cutoff radius check. The experiment results show that our PSC method is relatively more efficient compared to some traditional methods. In double precision, our 256-bit SIMD implementation is about 3 times faster than the scalar version.

  5. Systematic and Scalable Testing of Concurrent Programs

    DTIC Science & Technology

    2013-12-16

    The evaluation of CHESS [107] checked eight different programs ranging from process management libraries to a distributed execution engine to a research...tool (§3.1) targets systematic testing of scheduling nondeterminism in multi- threaded components of the Omega cluster management system [129], while...tool for systematic testing of multithreaded com- ponents of the Omega cluster management system [129]. In particular, §3.1.1 defines a model for

  6. Parallelization of interpolation, solar radiation and water flow simulation modules in GRASS GIS using OpenMP

    NASA Astrophysics Data System (ADS)

    Hofierka, Jaroslav; Lacko, Michal; Zubal, Stanislav

    2017-10-01

    In this paper, we describe the parallelization of three complex and computationally intensive modules of GRASS GIS using the OpenMP application programming interface for multi-core computers. These include the v.surf.rst module for spatial interpolation, the r.sun module for solar radiation modeling and the r.sim.water module for water flow simulation. We briefly describe the functionality of the modules and parallelization approaches used in the modules. Our approach includes the analysis of the module's functionality, identification of source code segments suitable for parallelization and proper application of OpenMP parallelization code to create efficient threads processing the subtasks. We document the efficiency of the solutions using the airborne laser scanning data representing land surface in the test area and derived high-resolution digital terrain model grids. We discuss the performance speed-up and parallelization efficiency depending on the number of processor threads. The study showed a substantial increase in computation speeds on a standard multi-core computer while maintaining the accuracy of results in comparison to the output from original modules. The presented parallelization approach showed the simplicity and efficiency of the parallelization of open-source GRASS GIS modules using OpenMP, leading to an increased performance of this geospatial software on standard multi-core computers.

  7. Multi-phase SPH modelling of violent hydrodynamics on GPUs

    NASA Astrophysics Data System (ADS)

    Mokos, Athanasios; Rogers, Benedict D.; Stansby, Peter K.; Domínguez, José M.

    2015-11-01

    This paper presents the acceleration of multi-phase smoothed particle hydrodynamics (SPH) using a graphics processing unit (GPU) enabling large numbers of particles (10-20 million) to be simulated on just a single GPU card. With novel hardware architectures such as a GPU, the optimum approach to implement a multi-phase scheme presents some new challenges. Many more particles must be included in the calculation and there are very different speeds of sound in each phase with the largest speed of sound determining the time step. This requires efficient computation. To take full advantage of the hardware acceleration provided by a single GPU for a multi-phase simulation, four different algorithms are investigated: conditional statements, binary operators, separate particle lists and an intermediate global function. Runtime results show that the optimum approach needs to employ separate cell and neighbour lists for each phase. The profiler shows that this approach leads to a reduction in both memory transactions and arithmetic operations giving significant runtime gains. The four different algorithms are compared to the efficiency of the optimised single-phase GPU code, DualSPHysics, for 2-D and 3-D simulations which indicate that the multi-phase functionality has a significant computational overhead. A comparison with an optimised CPU code shows a speed up of an order of magnitude over an OpenMP simulation with 8 threads and two orders of magnitude over a single thread simulation. A demonstration of the multi-phase SPH GPU code is provided by a 3-D dam break case impacting an obstacle. This shows better agreement with experimental results than an equivalent single-phase code. The multi-phase GPU code enables a convergence study to be undertaken on a single GPU with a large number of particles that otherwise would have required large high performance computing resources.

  8. Exploration of microfluidic devices based on multi-filament threads and textiles: A review

    PubMed Central

    Nilghaz, A.; Ballerini, D. R.; Shen, W.

    2013-01-01

    In this paper, we review the recent progress in the development of low-cost microfluidic devices based on multifilament threads and textiles for semi-quantitative diagnostic and environmental assays. Hydrophilic multifilament threads are capable of transporting aqueous and non-aqueous fluids via capillary action and possess desirable properties for building fluid transport pathways in microfluidic devices. Thread can be sewn onto various support materials to form fluid transport channels without the need for the patterned hydrophobic barriers essential for paper-based microfluidic devices. Thread can also be used to manufacture fabrics which can be patterned to achieve suitable hydrophilic-hydrophobic contrast, creating hydrophilic channels which allow the control of fluids flow. Furthermore, well established textile patterning methods and combination of hydrophilic and hydrophobic threads can be applied to fabricate low-cost microfluidic devices that meet the low-cost and low-volume requirements. In this paper, we review the current limitations and shortcomings of multifilament thread and textile-based microfluidics, and the research efforts to date on the development of fluid flow control concepts and fabrication methods. We also present a summary of different methods for modelling the fluid capillary flow in microfluidic thread and textile-based systems. Finally, we summarized the published works of thread surface treatment methods and the potential of combining multifilament thread with other materials to construct devices with greater functionality. We believe these will be important research focuses of thread- and textile-based microfluidics in future. PMID:24086179

  9. HELICAL MOTIONS OF FINE-STRUCTURE PROMINENCE THREADS OBSERVED BY HINODE AND IRIS

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Okamoto, Takenori J.; Liu, Wei; Tsuneta, Saku, E-mail: joten.okamoto@nao.ac.jp

    Fine-structure dynamics in solar prominences holds critical clues to understanding their physical nature of significant space-weather implications. We report evidence of rotational motions of horizontal helical threads in two active-region prominences observed by the Hinode and/or Interface Region Imaging Spectrograph satellites at high resolution. In the first event, we found transverse motions of brightening threads at speeds up to 55 km s{sup -1} seen in the plane of the sky. Such motions appeared as sinusoidal space–time trajectories with a typical period of ∼390 s, which is consistent with plane-of-sky projections of rotational motions. Phase delays at different locations suggest themore » propagation of twists along the threads at phase speeds of 90–270 km s{sup -1}. At least 15 episodes of such motions occurred in two days, none associated with an eruption. For these episodes, the plane-of-sky speed is linearly correlated with the vertical travel distance, suggestive of a constant angular speed. In the second event, we found Doppler velocities of 30–40 km s{sup -1} in opposite directions in the top and bottom portions of the prominence, comparable to the plane-of-sky speed. The moving threads have about twice broader line widths than stationary threads. These observations, when taken together, provide strong evidence for rotations of helical prominence threads, which were likely driven by unwinding twists triggered by magnetic reconnection between twisted prominence magnetic fields and ambient coronal fields.« less

  10. List-mode PET image reconstruction for motion correction using the Intel XEON PHI co-processor

    NASA Astrophysics Data System (ADS)

    Ryder, W. J.; Angelis, G. I.; Bashar, R.; Gillam, J. E.; Fulton, R.; Meikle, S.

    2014-03-01

    List-mode image reconstruction with motion correction is computationally expensive, as it requires projection of hundreds of millions of rays through a 3D array. To decrease reconstruction time it is possible to use symmetric multiprocessing computers or graphics processing units. The former can have high financial costs, while the latter can require refactoring of algorithms. The Xeon Phi is a new co-processor card with a Many Integrated Core architecture that can run 4 multiple-instruction, multiple data threads per core with each thread having a 512-bit single instruction, multiple data vector register. Thus, it is possible to run in the region of 220 threads simultaneously. The aim of this study was to investigate whether the Xeon Phi co-processor card is a viable alternative to an x86 Linux server for accelerating List-mode PET image reconstruction for motion correction. An existing list-mode image reconstruction algorithm with motion correction was ported to run on the Xeon Phi coprocessor with the multi-threading implemented using pthreads. There were no differences between images reconstructed using the Phi co-processor card and images reconstructed using the same algorithm run on a Linux server. However, it was found that the reconstruction runtimes were 3 times greater for the Phi than the server. A new version of the image reconstruction algorithm was developed in C++ using OpenMP for mutli-threading and the Phi runtimes decreased to 1.67 times that of the host Linux server. Data transfer from the host to co-processor card was found to be a rate-limiting step; this needs to be carefully considered in order to maximize runtime speeds. When considering the purchase price of a Linux workstation with Xeon Phi co-processor card and top of the range Linux server, the former is a cost-effective computation resource for list-mode image reconstruction. A multi-Phi workstation could be a viable alternative to cluster computers at a lower cost for medical imaging applications.

  11. Neuropil threads occur in dendrites of tangle-bearing nerve cells.

    PubMed

    Braak, H; Braak, E

    1988-01-01

    Transparent Golgi preparations counterstained for Alzheimer's neurofibrillary changes rendered possible the demonstration of neuropil threads in defined cellular processes. Only dendrites of tangle-bearing cortical nerve cells were found to contain neuropil threads. Processes of glial cells as well as axons present in the material were devoid of neuropil threads.

  12. Too Much Control Can Hurt: A Threaded Cognition Model of the Attentional Blink

    ERIC Educational Resources Information Center

    Taatgen, Niels A.; Juvina, Ion; Schipper, Marc; Borst, Jelmer P.; Martens, Sander

    2009-01-01

    Explanations for the attentional blink (AB; a deficit in identifying the second of two targets when presented 200-500ms after the first) have recently shifted from limitations in memory consolidation to disruptions in cognitive control. With a new model based on the threaded cognition theory of multi-tasking we propose a different explanation: the…

  13. Thread concept for automatic task parallelization in image analysis

    NASA Astrophysics Data System (ADS)

    Lueckenhaus, Maximilian; Eckstein, Wolfgang

    1998-09-01

    Parallel processing of image analysis tasks is an essential method to speed up image processing and helps to exploit the full capacity of distributed systems. However, writing parallel code is a difficult and time-consuming process and often leads to an architecture-dependent program that has to be re-implemented when changing the hardware. Therefore it is highly desirable to do the parallelization automatically. For this we have developed a special kind of thread concept for image analysis tasks. Threads derivated from one subtask may share objects and run in the same context but may process different threads of execution and work on different data in parallel. In this paper we describe the basics of our thread concept and show how it can be used as basis of an automatic task parallelization to speed up image processing. We further illustrate the design and implementation of an agent-based system that uses image analysis threads for generating and processing parallel programs by taking into account the available hardware. The tests made with our system prototype show that the thread concept combined with the agent paradigm is suitable to speed up image processing by an automatic parallelization of image analysis tasks.

  14. Accelerating population balance-Monte Carlo simulation for coagulation dynamics from the Markov jump model, stochastic algorithm and GPU parallel computing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xu, Zuwei; Zhao, Haibo, E-mail: klinsmannzhb@163.com; Zheng, Chuguang

    2015-01-15

    This paper proposes a comprehensive framework for accelerating population balance-Monte Carlo (PBMC) simulation of particle coagulation dynamics. By combining Markov jump model, weighted majorant kernel and GPU (graphics processing unit) parallel computing, a significant gain in computational efficiency is achieved. The Markov jump model constructs a coagulation-rule matrix of differentially-weighted simulation particles, so as to capture the time evolution of particle size distribution with low statistical noise over the full size range and as far as possible to reduce the number of time loopings. Here three coagulation rules are highlighted and it is found that constructing appropriate coagulation rule providesmore » a route to attain the compromise between accuracy and cost of PBMC methods. Further, in order to avoid double looping over all simulation particles when considering the two-particle events (typically, particle coagulation), the weighted majorant kernel is introduced to estimate the maximum coagulation rates being used for acceptance–rejection processes by single-looping over all particles, and meanwhile the mean time-step of coagulation event is estimated by summing the coagulation kernels of rejected and accepted particle pairs. The computational load of these fast differentially-weighted PBMC simulations (based on the Markov jump model) is reduced greatly to be proportional to the number of simulation particles in a zero-dimensional system (single cell). Finally, for a spatially inhomogeneous multi-dimensional (multi-cell) simulation, the proposed fast PBMC is performed in each cell, and multiple cells are parallel processed by multi-cores on a GPU that can implement the massively threaded data-parallel tasks to obtain remarkable speedup ratio (comparing with CPU computation, the speedup ratio of GPU parallel computing is as high as 200 in a case of 100 cells with 10 000 simulation particles per cell). These accelerating approaches of PBMC are demonstrated in a physically realistic Brownian coagulation case. The computational accuracy is validated with benchmark solution of discrete-sectional method. The simulation results show that the comprehensive approach can attain very favorable improvement in cost without sacrificing computational accuracy.« less

  15. Parallel Computer System for 3D Visualization Stereo on GPU

    NASA Astrophysics Data System (ADS)

    Al-Oraiqat, Anas M.; Zori, Sergii A.

    2018-03-01

    This paper proposes the organization of a parallel computer system based on Graphic Processors Unit (GPU) for 3D stereo image synthesis. The development is based on the modified ray tracing method developed by the authors for fast search of tracing rays intersections with scene objects. The system allows significant increase in the productivity for the 3D stereo synthesis of photorealistic quality. The generalized procedure of 3D stereo image synthesis on the Graphics Processing Unit/Graphics Processing Clusters (GPU/GPC) is proposed. The efficiency of the proposed solutions by GPU implementation is compared with single-threaded and multithreaded implementations on the CPU. The achieved average acceleration in multi-thread implementation on the test GPU and CPU is about 7.5 and 1.6 times, respectively. Studying the influence of choosing the size and configuration of the computational Compute Unified Device Archi-tecture (CUDA) network on the computational speed shows the importance of their correct selection. The obtained experimental estimations can be significantly improved by new GPUs with a large number of processing cores and multiprocessors, as well as optimized configuration of the computing CUDA network.

  16. Simulating electron wave dynamics in graphene superlattices exploiting parallel processing advantages

    NASA Astrophysics Data System (ADS)

    Rodrigues, Manuel J.; Fernandes, David E.; Silveirinha, Mário G.; Falcão, Gabriel

    2018-01-01

    This work introduces a parallel computing framework to characterize the propagation of electron waves in graphene-based nanostructures. The electron wave dynamics is modeled using both "microscopic" and effective medium formalisms and the numerical solution of the two-dimensional massless Dirac equation is determined using a Finite-Difference Time-Domain scheme. The propagation of electron waves in graphene superlattices with localized scattering centers is studied, and the role of the symmetry of the microscopic potential in the electron velocity is discussed. The computational methodologies target the parallel capabilities of heterogeneous multi-core CPU and multi-GPU environments and are built with the OpenCL parallel programming framework which provides a portable, vendor agnostic and high throughput-performance solution. The proposed heterogeneous multi-GPU implementation achieves speedup ratios up to 75x when compared to multi-thread and multi-core CPU execution, reducing simulation times from several hours to a couple of minutes.

  17. Data preprocessing for determining outer/inner parallelization in the nested loop problem using OpenMP

    NASA Astrophysics Data System (ADS)

    Handhika, T.; Bustamam, A.; Ernastuti, Kerami, D.

    2017-07-01

    Multi-thread programming using OpenMP on the shared-memory architecture with hyperthreading technology allows the resource to be accessed by multiple processors simultaneously. Each processor can execute more than one thread for a certain period of time. However, its speedup depends on the ability of the processor to execute threads in limited quantities, especially the sequential algorithm which contains a nested loop. The number of the outer loop iterations is greater than the maximum number of threads that can be executed by a processor. The thread distribution technique that had been found previously only be applied by the high-level programmer. This paper generates a parallelization procedure for low-level programmer in dealing with 2-level nested loop problems with the maximum number of threads that can be executed by a processor is smaller than the number of the outer loop iterations. Data preprocessing which is related to the number of the outer loop and the inner loop iterations, the computational time required to execute each iteration and the maximum number of threads that can be executed by a processor are used as a strategy to determine which parallel region that will produce optimal speedup.

  18. Using Multithreading for the Automatic Load Balancing of 2D Adaptive Finite Element Meshes

    NASA Technical Reports Server (NTRS)

    Heber, Gerd; Biswas, Rupak; Thulasiraman, Parimala; Gao, Guang R.; Bailey, David H. (Technical Monitor)

    1998-01-01

    In this paper, we present a multi-threaded approach for the automatic load balancing of adaptive finite element (FE) meshes. The platform of our choice is the EARTH multi-threaded system which offers sufficient capabilities to tackle this problem. We implement the question phase of FE applications on triangular meshes, and exploit the EARTH token mechanism to automatically balance the resulting irregular and highly nonuniform workload. We discuss the results of our experiments on EARTH-SP2, an implementation of EARTH on the IBM SP2, with different load balancing strategies that are built into the runtime system.

  19. A Review of Lightweight Thread Approaches for High Performance Computing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Castello, Adrian; Pena, Antonio J.; Seo, Sangmin

    High-level, directive-based solutions are becoming the programming models (PMs) of the multi/many-core architectures. Several solutions relying on operating system (OS) threads perfectly work with a moderate number of cores. However, exascale systems will spawn hundreds of thousands of threads in order to exploit their massive parallel architectures and thus conventional OS threads are too heavy for that purpose. Several lightweight thread (LWT) libraries have recently appeared offering lighter mechanisms to tackle massive concurrency. In order to examine the suitability of LWTs in high-level runtimes, we develop a set of microbenchmarks consisting of commonlyfound patterns in current parallel codes. Moreover, wemore » study the semantics offered by some LWT libraries in order to expose the similarities between different LWT application programming interfaces. This study reveals that a reduced set of LWT functions can be sufficient to cover the common parallel code patterns and that those LWT libraries perform better than OS threads-based solutions in cases where task and nested parallelism are becoming more popular with new architectures.« less

  20. IOPA: I/O-aware parallelism adaption for parallel programs

    PubMed Central

    Liu, Tao; Liu, Yi; Qian, Chen; Qian, Depei

    2017-01-01

    With the development of multi-/many-core processors, applications need to be written as parallel programs to improve execution efficiency. For data-intensive applications that use multiple threads to read/write files simultaneously, an I/O sub-system can easily become a bottleneck when too many of these types of threads exist; on the contrary, too few threads will cause insufficient resource utilization and hurt performance. Therefore, programmers must pay much attention to parallelism control to find the appropriate number of I/O threads for an application. This paper proposes a parallelism control mechanism named IOPA that can adjust the parallelism of applications to adapt to the I/O capability of a system and balance computing resources and I/O bandwidth. The programming interface of IOPA is also provided to programmers to simplify parallel programming. IOPA is evaluated using multiple applications with both solid state and hard disk drives. The results show that the parallel applications using IOPA can achieve higher efficiency than those with a fixed number of threads. PMID:28278236

  1. IOPA: I/O-aware parallelism adaption for parallel programs.

    PubMed

    Liu, Tao; Liu, Yi; Qian, Chen; Qian, Depei

    2017-01-01

    With the development of multi-/many-core processors, applications need to be written as parallel programs to improve execution efficiency. For data-intensive applications that use multiple threads to read/write files simultaneously, an I/O sub-system can easily become a bottleneck when too many of these types of threads exist; on the contrary, too few threads will cause insufficient resource utilization and hurt performance. Therefore, programmers must pay much attention to parallelism control to find the appropriate number of I/O threads for an application. This paper proposes a parallelism control mechanism named IOPA that can adjust the parallelism of applications to adapt to the I/O capability of a system and balance computing resources and I/O bandwidth. The programming interface of IOPA is also provided to programmers to simplify parallel programming. IOPA is evaluated using multiple applications with both solid state and hard disk drives. The results show that the parallel applications using IOPA can achieve higher efficiency than those with a fixed number of threads.

  2. Multithreading with separate data to improve the performance of Backpropagation method

    NASA Astrophysics Data System (ADS)

    Dhamma, Mulia; Zarlis, Muhammad; Budhiarti Nababan, Erna

    2017-12-01

    Backpropagation is one method of artificial neural network that can make a prediction for a new data with learning by supervised of the past data. The learning process of backpropagation method will become slow if we give too much data for backpropagation method to learn the data. Multithreading with a separate data inside of each thread are being used in order to improve the performance of backpropagtion method . Base on the research for 39 data and also 5 times experiment with separate data into 2 thread, the result showed that the average epoch become 6490 when using 2 thread and 453049 epoch when using only 1 thread. The most lowest epoch for 2 thread is 1295 and 1 thread is 356116. The process of improvement is caused by the minimum error from 2 thread that has been compared to take the weight and bias value. This process will be repeat as long as the backpropagation do learning.

  3. WaveJava: Wavelet-based network computing

    NASA Astrophysics Data System (ADS)

    Ma, Kun; Jiao, Licheng; Shi, Zhuoer

    1997-04-01

    Wavelet is a powerful theory, but its successful application still needs suitable programming tools. Java is a simple, object-oriented, distributed, interpreted, robust, secure, architecture-neutral, portable, high-performance, multi- threaded, dynamic language. This paper addresses the design and development of a cross-platform software environment for experimenting and applying wavelet theory. WaveJava, a wavelet class library designed by the object-orient programming, is developed to take advantage of the wavelets features, such as multi-resolution analysis and parallel processing in the networking computing. A new application architecture is designed for the net-wide distributed client-server environment. The data are transmitted with multi-resolution packets. At the distributed sites around the net, these data packets are done the matching or recognition processing in parallel. The results are fed back to determine the next operation. So, the more robust results can be arrived quickly. The WaveJava is easy to use and expand for special application. This paper gives a solution for the distributed fingerprint information processing system. It also fits for some other net-base multimedia information processing, such as network library, remote teaching and filmless picture archiving and communications.

  4. Observations and implications of large-amplitude longitudinal oscillations in a solar filament

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Luna, M.; Knizhnik, K.; Muglach, K.

    On 2010 August 20, an energetic disturbance triggered large-amplitude longitudinal oscillations in a nearby filament. The triggering mechanism appears to be episodic jets connecting the energetic event with the filament threads. In the present work, we analyze this periodic motion in a large fraction of the filament to characterize the underlying physics of the oscillation as well as the filament properties. The results support our previous theoretical conclusions that the restoring force of large-amplitude longitudinal oscillations is solar gravity, and the damping mechanism is the ongoing accumulation of mass onto the oscillating threads. Based on our previous work, we usedmore » the fitted parameters to determine the magnitude and radius of curvature of the dipped magnetic field along the filament, as well as the mass accretion rate onto the filament threads. These derived properties are nearly uniform along the filament, indicating a remarkable degree of cohesiveness throughout the filament channel. Moreover, the estimated mass accretion rate implies that the footpoint heating responsible for the thread formation, according to the thermal nonequilibrium model, agrees with previous coronal heating estimates. We estimate the magnitude of the energy released in the nearby event by studying the dynamic response of the filament threads, and discuss the implications of our study for filament structure and heating.« less

  5. Vectorization, threading, and cache-blocking considerations for hydrocodes on emerging architectures

    DOE PAGES

    Fung, J.; Aulwes, R. T.; Bement, M. T.; ...

    2015-07-14

    This work reports on considerations for improving computational performance in preparation for current and expected changes to computer architecture. The algorithms studied will include increasingly complex prototypes for radiation hydrodynamics codes, such as gradient routines and diffusion matrix assembly (e.g., in [1-6]). The meshes considered for the algorithms are structured or unstructured meshes. The considerations applied for performance improvements are meant to be general in terms of architecture (not specifically graphical processing unit (GPUs) or multi-core machines, for example) and include techniques for vectorization, threading, tiling, and cache blocking. Out of a survey of optimization techniques on applications such asmore » diffusion and hydrodynamics, we make general recommendations with a view toward making these techniques conceptually accessible to the applications code developer. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.« less

  6. Clinical leadership as an integral curriculum thread in pre-registration nursing programmes.

    PubMed

    Brown, Angela; Dewing, Jan; Crookes, Patrick

    2016-03-01

    In recent years there has been a growth in leadership development frameworks in health for the existing workforce. There has also been a related abundance of leadership programmes developed specifically for qualified nurses. There is a groundswell of opinion that clinical leadership preparation needs to extend to preparatory programmes leading to registration as a nurse. To this end a doctoral research study has been completed that focused specifically on the identification and verification of the antecedents of clinical leadership (leadership and management) so they can shape the curriculum content and the best way to deliver the curriculum content as a curriculum thread. To conceptualise how the curriculum content, identified and verified empirically, can be structured within a curriculum thread and to contribute to the discussion on effective pedagogical approaches and educational strategies for learning and teaching of clinical leadership. A multi-method design was utilised in the research in Australia. Drawing on core principles in critical social theory, an integral curriculum thread is proposed for pre-registration nursing programmes that identifies the antecedents of clinical leadership; the core concepts, together with the continuum of enlightenment, empowerment, and emancipation. The curriculum content, the effective pedagogical approaches and the educational strategies are supported theoretically and we believe this offers a design template for action and a way of thinking about this important aspect of preparatory nursing education. Moreover, we hope to have created a process contributing to a heighten sense of awareness in the nursing student (and other key stakeholders) of the what, how and when of clinical leadership for a novice registered nurse. The next stage is to further test through research the proposed integral curriculum thread. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Multi-threaded ATLAS simulation on Intel Knights Landing processors

    NASA Astrophysics Data System (ADS)

    Farrell, Steven; Calafiura, Paolo; Leggett, Charles; Tsulaia, Vakhtang; Dotti, Andrea; ATLAS Collaboration

    2017-10-01

    The Knights Landing (KNL) release of the Intel Many Integrated Core (MIC) Xeon Phi line of processors is a potential game changer for HEP computing. With 72 cores and deep vector registers, the KNL cards promise significant performance benefits for highly-parallel, compute-heavy applications. Cori, the newest supercomputer at the National Energy Research Scientific Computing Center (NERSC), was delivered to its users in two phases with the first phase online at the end of 2015 and the second phase now online at the end of 2016. Cori Phase 2 is based on the KNL architecture and contains over 9000 compute nodes with 96GB DDR4 memory. ATLAS simulation with the multithreaded Athena Framework (AthenaMT) is a good potential use-case for the KNL architecture and supercomputers like Cori. ATLAS simulation jobs have a high ratio of CPU computation to disk I/O and have been shown to scale well in multi-threading and across many nodes. In this paper we will give an overview of the ATLAS simulation application with details on its multi-threaded design. Then, we will present a performance analysis of the application on KNL devices and compare it to a traditional x86 platform to demonstrate the capabilities of the architecture and evaluate the benefits of utilizing KNL platforms like Cori for ATLAS production.

  8. MetAlign 3.0: performance enhancement by efficient use of advances in computer hardware.

    PubMed

    Lommen, Arjen; Kools, Harrie J

    2012-08-01

    A new, multi-threaded version of the GC-MS and LC-MS data processing software, metAlign, has been developed which is able to utilize multiple cores on one PC. This new version was tested using three different multi-core PCs with different operating systems. The performance of noise reduction, baseline correction and peak-picking was 8-19 fold faster compared to the previous version on a single core machine from 2008. The alignment was 5-10 fold faster. Factors influencing the performance enhancement are discussed. Our observations show that performance scales with the increase in processor core numbers we currently see in consumer PC hardware development.

  9. PolarHub: A Global Hub for Polar Data Discovery

    NASA Astrophysics Data System (ADS)

    Li, W.

    2014-12-01

    This paper reports the outcome of a NSF project in developing a large-scale web crawler PolarHub to discover automatically the distributed polar dataset in the format of OGC web services (OWS) in the cyberspace. PolarHub is a machine robot; its goal is to visit as many webpages as possible to find those containing information about polar OWS, extract this information and store it into the backend data repository. This is a very challenging task given huge data volume of webpages on the Web. Three unique features was introduced in PolarHub to make it distinctive from earlier crawler solutions: (1) a multi-task, multi-user, multi-thread support to the crawling tasks; (2) an extensive use of thread pool and Data Access Object (DAO) design patterns to separate persistent data storage and business logic to achieve high extendibility of the crawler tool; (3) a pattern-matching based customizable crawling algorithm to support discovery of multi-type geospatial web services; and (4) a universal and portable client-server communication mechanism combining a server-push and client pull strategies for enhanced asynchronous processing. A series of experiments were conducted to identify the impact of crawling parameters to the overall system performance. The geographical distribution pattern of all PolarHub identified services is also demonstrated. We expect this work to make a major contribution to the field of geospatial information retrieval and geospatial interoperability, to bridge the gap between data provider and data consumer, and to accelerate polar science by enhancing the accessibility and reusability of adequate polar data.

  10. Cribellate thread production in spiders: Complex processing of nano-fibres into a functional capture thread.

    PubMed

    Joel, Anna-Christin; Kappel, Peter; Adamova, Hana; Baumgartner, Werner; Scholz, Ingo

    2015-11-01

    Spider silk production has been studied intensively in the last years. However, capture threads of cribellate spiders employ an until now often unnoticed alternative of thread production. This thread in general is highly interesting, as it not only involves a controlled arrangement of three types of threads with one being nano-scale fibres (cribellate fibres), but also a special comb-like structure on the metatarsus of the fourth leg (calamistrum) for its production. We found the cribellate fibres organized as a mat, enclosing two parallel larger fibres (axial fibres) and forming the typical puffy structure of cribellate threads. Mat and axial fibres are punctiform connected to each other between two puffs, presumably by the action of the median spinnerets. However, this connection alone does not lead to the typical puffy shape of a cribellate thread. Removing the calamistrum, we found a functional capture thread still being produced, but the puffy shape of the thread was lost. Therefore, the calamistrum is not necessary for the extraction or combination of fibres, but for further processing of the nano-scale cribellate fibres. Using data from Uloborus plumipes we were able to develop a model of the cribellate thread production, probably universally valid for cribellate spiders. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. Hydraulic conditions of flood flows in a Polish Carpathian river subjected to variable human impacts

    NASA Astrophysics Data System (ADS)

    Radecki-Pawlik, Artur; Czech, Wiktoria; Wyżga, Bartłomiej; Mikuś, Paweł; Zawiejska, Joanna; Ruiz-Villanueva, Virginia

    2016-04-01

    Channel morphology of the Czarny Dunajec River, Polish Carpathians, has been considerably modified as a result of channelization and gravel-mining induced channel incision, and now it varies from a single-thread, incised or regulated channel to an unmanaged, multi-thread channel. We investigated effects of these distinct channel morphologies on the conditions for flood flows in a study of 25 cross-sections from the middle river course where the Czarny Dunajec receives no significant tributaries and flood discharges increase little in the downstream direction. Cross-sectional morphology, channel slope and roughness of particular cross-section parts were used as input data for the hydraulic modelling performed with the 1D steady-flow HEC-RAS model for discharges with recurrence interval from 1.5 to 50 years. The model for each cross-section was calibrated with the water level of a 20-year flood from May 2014, determined shortly after the flood on the basis of high-water marks. Results indicated that incised and channelized river reaches are typified by similar flow widths and cross-sectional flow areas, which are substantially smaller than those in the multi-thread reach. However, because of steeper channel slope in the incised reach than in the channelized reach, the three river reaches differ in unit stream power and bed shear stress, which attain the highest values in the incised reach, intermediate values in the channelized reach, and the lowest ones in the multi-thread reach. These patterns of flow power and hydraulic forces are reflected in significant differences in river competence between the three river reaches. Since the introduction of the channelization scheme 30 years ago, sedimentation has reduced its initial flow conveyance by more than half and elevated water stages at given flood discharges by about 0.5-0.7 m. This partly reflects a progressive growth of natural levees along artificially stabilized channel banks. By contrast, sediments of natural levees deposited along the multi-thread channel and subsequently eroded in the course of lateral channel migration and floodplain reworking; as a result, they do not reduce the conveyance of floodplain flows in this reach. This study was performed within the scope of the Research Project DEC-2013/09/B/ST10/00056 financed by the National Science Centre of Poland.

  12. 49 CFR 178.56 - Specification 4AA480 welded steel cylinders.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... by spinning process not permitted. (b) Steel. The limiting chemical composition of steel authorized... equipment and processes adequate to ensure that each cylinder produced conforms to the requirements of this... welding or by threads. If threads are used they must comply with the following: (i) Threads must be clean...

  13. 49 CFR 178.56 - Specification 4AA480 welded steel cylinders.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... by spinning process not permitted. (b) Steel. The limiting chemical composition of steel authorized... equipment and processes adequate to ensure that each cylinder produced conforms to the requirements of this... welding or by threads. If threads are used they must comply with the following: (i) Threads must be clean...

  14. 49 CFR 178.56 - Specification 4AA480 welded steel cylinders.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... by spinning process not permitted. (b) Steel. The limiting chemical composition of steel authorized... equipment and processes adequate to ensure that each cylinder produced conforms to the requirements of this... welding or by threads. If threads are used they must comply with the following: (i) Threads must be clean...

  15. Supporting Graduate Student Writers with VoiceThread

    ERIC Educational Resources Information Center

    Gonzalez, Michelle; Moore, Noreen S.

    2018-01-01

    This qualitative case study examined the influence of the use of VoiceThread technology on the feedback process for thesis writing in two online asynchronous graduate courses. The influence on instructor feedback process and graduate student writers' perceptions of the use of VoiceThread were the foci of the study. Master's-level students (n = 18)…

  16. 49 CFR 178.56 - Specification 4AA480 welded steel cylinders.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... by spinning process not permitted. (b) Steel. The limiting chemical composition of steel authorized... equipment and processes adequate to ensure that each cylinder produced conforms to the requirements of this... welding or by threads. If threads are used they must comply with the following: (i) Threads must be clean...

  17. Effects of thread interruptions on tool pins in friction stir welding of AA6061

    DOE PAGES

    Reza-E-Rabby, Md.; Tang, Wei; Reynolds, Anthony P.

    2017-06-21

    In this paper, effects of pin thread and thread interruptions (flats) on weld quality and process response parameters during friction stir welding (FSW) of 6061 aluminium alloy were quantified. Otherwise, identical smooth and threaded pins with zero to four flats were adopted for FSW. Weldability and process response variables were examined. Results showed that threads with flats significantly improved weld quality and reduced in-plane forces. A three-flat threaded pin led to production of defect-free welds under all examined welding conditions. Spectral analyses of in-plane forces and weld cross-sectional analysis were performed to establish correlation among pin flats, force dynamics andmore » defect formation. Finally, the lowest in-plane force spectra amplitudes were consistently observed for defect-free welds.« less

  18. Effects of thread interruptions on tool pins in friction stir welding of AA6061

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Reza-E-Rabby, Md.; Tang, Wei; Reynolds, Anthony P.

    In this paper, effects of pin thread and thread interruptions (flats) on weld quality and process response parameters during friction stir welding (FSW) of 6061 aluminium alloy were quantified. Otherwise, identical smooth and threaded pins with zero to four flats were adopted for FSW. Weldability and process response variables were examined. Results showed that threads with flats significantly improved weld quality and reduced in-plane forces. A three-flat threaded pin led to production of defect-free welds under all examined welding conditions. Spectral analyses of in-plane forces and weld cross-sectional analysis were performed to establish correlation among pin flats, force dynamics andmore » defect formation. Finally, the lowest in-plane force spectra amplitudes were consistently observed for defect-free welds.« less

  19. Model Checking with Multi-Threaded IC3 Portfolios

    DTIC Science & Technology

    2015-01-15

    different runs varies randomly depending on the thread interleaving. The use of a portfolio of solvers to maximize the likelihood of a quick solution is...empirically show (cf. Sec. 5.2) that the predictions based on this formula have high accuracy. Note that each solver in the portfolio potentially searches...speedup of over 300. We also show that widening the proof search of ic3 by randomizing its SAT solver is not as effective as paral- lelization

  20. Threaded Cognition: An Integrated Theory of Concurrent Multitasking

    ERIC Educational Resources Information Center

    Salvucci, Dario D.; Taatgen, Niels A.

    2008-01-01

    The authors propose the idea of threaded cognition, an integrated theory of concurrent multitasking--that is, performing 2 or more tasks at once. Threaded cognition posits that streams of thought can be represented as threads of processing coordinated by a serial procedural resource and executed across other available resources (e.g., perceptual…

  1. Slowing down of ring polymer diffusion caused by inter-ring threading.

    PubMed

    Lee, Eunsang; Kim, Soree; Jung, YounJoon

    2015-06-01

    Diffusion of long ring polymers in a melt is much slower than the reorganization of their internal structures. While direct evidence for entanglements has not been observed in the long ring polymers unlike linear polymer melts, threading between the rings is suspected to be the main reason for slowing down of ring polymer diffusion. It is, however, difficult to define the threading configuration between two rings because the rings have no chain end. In this work, evidence for threading dynamics of ring polymers is presented by using molecular dynamics simulation and applying a novel analysis method. The simulation results are analyzed in terms of the statistics of persistence and exchange times that have proved useful in studying heterogeneous dynamics of glassy systems. It is found that the threading time of ring polymer melts increases more rapidly with the degree of polymerization than that of linear polymer melts. This indicates that threaded ring polymers cannot diffuse until an unthreading event occurs, which results in the slowing down of ring polymer diffusion. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. 49 CFR 179.300-13 - Venting, loading and unloading valves.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... HAZARDOUS MATERIALS SAFETY ADMINISTRATION, DEPARTMENT OF TRANSPORTATION (CONTINUED) SPECIFICATIONS FOR TANK CARS Specifications for Multi-Unit Tank Car Tanks (Classes DOT-106A and 110AW) § 179.300-13 Venting... (h)(3)(ii). Threads for the clean-out/inspection ports of DOT Specification 110A multi-unit tank car...

  3. Constructing Neuronal Network Models in Massively Parallel Environments.

    PubMed

    Ippen, Tammo; Eppler, Jochen M; Plesser, Hans E; Diesmann, Markus

    2017-01-01

    Recent advances in the development of data structures to represent spiking neuron network models enable us to exploit the complete memory of petascale computers for a single brain-scale network simulation. In this work, we investigate how well we can exploit the computing power of such supercomputers for the creation of neuronal networks. Using an established benchmark, we divide the runtime of simulation code into the phase of network construction and the phase during which the dynamical state is advanced in time. We find that on multi-core compute nodes network creation scales well with process-parallel code but exhibits a prohibitively large memory consumption. Thread-parallel network creation, in contrast, exhibits speedup only up to a small number of threads but has little overhead in terms of memory. We further observe that the algorithms creating instances of model neurons and their connections scale well for networks of ten thousand neurons, but do not show the same speedup for networks of millions of neurons. Our work uncovers that the lack of scaling of thread-parallel network creation is due to inadequate memory allocation strategies and demonstrates that thread-optimized memory allocators recover excellent scaling. An analysis of the loop order used for network construction reveals that more complex tests on the locality of operations significantly improve scaling and reduce runtime by allowing construction algorithms to step through large networks more efficiently than in existing code. The combination of these techniques increases performance by an order of magnitude and harnesses the increasingly parallel compute power of the compute nodes in high-performance clusters and supercomputers.

  4. Constructing Neuronal Network Models in Massively Parallel Environments

    PubMed Central

    Ippen, Tammo; Eppler, Jochen M.; Plesser, Hans E.; Diesmann, Markus

    2017-01-01

    Recent advances in the development of data structures to represent spiking neuron network models enable us to exploit the complete memory of petascale computers for a single brain-scale network simulation. In this work, we investigate how well we can exploit the computing power of such supercomputers for the creation of neuronal networks. Using an established benchmark, we divide the runtime of simulation code into the phase of network construction and the phase during which the dynamical state is advanced in time. We find that on multi-core compute nodes network creation scales well with process-parallel code but exhibits a prohibitively large memory consumption. Thread-parallel network creation, in contrast, exhibits speedup only up to a small number of threads but has little overhead in terms of memory. We further observe that the algorithms creating instances of model neurons and their connections scale well for networks of ten thousand neurons, but do not show the same speedup for networks of millions of neurons. Our work uncovers that the lack of scaling of thread-parallel network creation is due to inadequate memory allocation strategies and demonstrates that thread-optimized memory allocators recover excellent scaling. An analysis of the loop order used for network construction reveals that more complex tests on the locality of operations significantly improve scaling and reduce runtime by allowing construction algorithms to step through large networks more efficiently than in existing code. The combination of these techniques increases performance by an order of magnitude and harnesses the increasingly parallel compute power of the compute nodes in high-performance clusters and supercomputers. PMID:28559808

  5. Using OpenMP vs. Threading Building Blocks for Medical Imaging on Multi-cores

    NASA Astrophysics Data System (ADS)

    Kegel, Philipp; Schellmann, Maraike; Gorlatch, Sergei

    We compare two parallel programming approaches for multi-core systems: the well-known OpenMP and the recently introduced Threading Building Blocks (TBB) library by Intel®. The comparison is made using the parallelization of a real-world numerical algorithm for medical imaging. We develop several parallel implementations, and compare them w.r.t. programming effort, programming style and abstraction, and runtime performance. We show that TBB requires a considerable program re-design, whereas with OpenMP simple compiler directives are sufficient. While TBB appears to be less appropriate for parallelizing existing implementations, it fosters a good programming style and higher abstraction level for newly developed parallel programs. Our experimental measurements on a dual quad-core system demonstrate that OpenMP slightly outperforms TBB in our implementation.

  6. Integrating end-to-end threads of control into object-oriented analysis and design

    NASA Technical Reports Server (NTRS)

    Mccandlish, Janet E.; Macdonald, James R.; Graves, Sara J.

    1993-01-01

    Current object-oriented analysis and design methodologies fall short in their use of mechanisms for identifying threads of control for the system being developed. The scenarios which typically describe a system are more global than looking at the individual objects and representing their behavior. Unlike conventional methodologies that use data flow and process-dependency diagrams, object-oriented methodologies do not provide a model for representing these global threads end-to-end. Tracing through threads of control is key to ensuring that a system is complete and timing constraints are addressed. The existence of multiple threads of control in a system necessitates a partitioning of the system into processes. This paper describes the application and representation of end-to-end threads of control to the object-oriented analysis and design process using object-oriented constructs. The issue of representation is viewed as a grouping problem, that is, how to group classes/objects at a higher level of abstraction so that the system may be viewed as a whole with both classes/objects and their associated dynamic behavior. Existing object-oriented development methodology techniques are extended by adding design-level constructs termed logical composite classes and process composite classes. Logical composite classes are design-level classes which group classes/objects both logically and by thread of control information. Process composite classes further refine the logical composite class groupings by using process partitioning criteria to produce optimum concurrent execution results. The goal of these design-level constructs is to ultimately provide the basis for a mechanism that can support the creation of process composite classes in an automated way. Using an automated mechanism makes it easier to partition a system into concurrently executing elements that can be run in parallel on multiple processors.

  7. Design and Test Requirements for Space Flight Pressurized Systems

    DTIC Science & Technology

    2014-11-26

    5.7.2 Fire Resistant Fluids ......................................................................................... 24 5.7.3 Accumulators...5.1.7 Threaded Parts [5.1.7-1] All threaded parts in components shall be securely locked to resist uncoupling forces by acceptable safe design...event of force balance on both inlet ports may occur, thus causing the shutoff valve to restrict flow from the outlet port. 5.7.2 Fire Resistant

  8. Image-based 3D reconstruction and virtual environmental walk-through

    NASA Astrophysics Data System (ADS)

    Sun, Jifeng; Fang, Lixiong; Luo, Ying

    2001-09-01

    We present a 3D reconstruction method, which combines geometry-based modeling, image-based modeling and rendering techniques. The first component is an interactive geometry modeling method which recovery of the basic geometry of the photographed scene. The second component is model-based stereo algorithm. We discus the image processing problems and algorithms of walking through in virtual space, then designs and implement a high performance multi-thread wandering algorithm. The applications range from architectural planning and archaeological reconstruction to virtual environments and cinematic special effects.

  9. Electronic Structure Calculations and Adaptation Scheme in Multi-core Computing Environments

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Seshagiri, Lakshminarasimhan; Sosonkina, Masha; Zhang, Zhao

    2009-05-20

    Multi-core processing environments have become the norm in the generic computing environment and are being considered for adding an extra dimension to the execution of any application. The T2 Niagara processor is a very unique environment where it consists of eight cores having a capability of running eight threads simultaneously in each of the cores. Applications like General Atomic and Molecular Electronic Structure (GAMESS), used for ab-initio molecular quantum chemistry calculations, can be good indicators of the performance of such machines and would be a guideline for both hardware designers and application programmers. In this paper we try to benchmarkmore » the GAMESS performance on a T2 Niagara processor for a couple of molecules. We also show the suitability of using a middleware based adaptation algorithm on GAMESS on such a multi-core environment.« less

  10. Scheduler for multiprocessor system switch with selective pairing

    DOEpatents

    Gara, Alan; Gschwind, Michael Karl; Salapura, Valentina

    2015-01-06

    System, method and computer program product for scheduling threads in a multiprocessing system with selective pairing of processor cores for increased processing reliability. A selective pairing facility is provided that selectively connects, i.e., pairs, multiple microprocessor or processor cores to provide one highly reliable thread (or thread group). The method configures the selective pairing facility to use checking provide one highly reliable thread for high-reliability and allocate threads to corresponding processor cores indicating need for hardware checking. The method configures the selective pairing facility to provide multiple independent cores and allocate threads to corresponding processor cores indicating inherent resilience.

  11. On-line monitoring of multi-component strain development in a tufting needle using optical fibre Bragg grating sensors

    NASA Astrophysics Data System (ADS)

    Chehura, Edmon; Dell'Anno, Giuseppe; Huet, Tristan; Staines, Stephen; James, Stephen W.; Partridge, Ivana K.; Tatam, Ralph P.

    2014-07-01

    Dynamic loadings induced on a tufting needle during the tufting of dry carbon fibre preform via a commercial robot-controlled tufting head were investigated in situ and in real-time using optical fibre Bragg grating (FBG) sensors bonded to the needle shaft. The sensors were configured such that the axial strain and bending moments experienced by the needle could be measured. A study of the influence of thread and thread type on the strain imparted to the needle revealed axial strain profiles which had equivalent trends but different magnitudes. The mean of the maximum axial compression strains measured during the tufting of a 4-ply quasi-isotropic carbon fibre dry preform were - 499 ± 79 μɛ, - 463 ± 51 μɛ and - 431 ± 59 μɛ for a needle without thread, with metal wire and with Kevlar® thread, respectively. The needle similarly exhibited bending moments of different magnitude when the different needle feeding configurations were used.

  12. When the lowest energy does not induce native structures: parallel minimization of multi-energy values by hybridizing searching intelligences.

    PubMed

    Lü, Qiang; Xia, Xiao-Yan; Chen, Rong; Miao, Da-Jun; Chen, Sha-Sha; Quan, Li-Jun; Li, Hai-Ou

    2012-01-01

    Protein structure prediction (PSP), which is usually modeled as a computational optimization problem, remains one of the biggest challenges in computational biology. PSP encounters two difficult obstacles: the inaccurate energy function problem and the searching problem. Even if the lowest energy has been luckily found by the searching procedure, the correct protein structures are not guaranteed to obtain. A general parallel metaheuristic approach is presented to tackle the above two problems. Multi-energy functions are employed to simultaneously guide the parallel searching threads. Searching trajectories are in fact controlled by the parameters of heuristic algorithms. The parallel approach allows the parameters to be perturbed during the searching threads are running in parallel, while each thread is searching the lowest energy value determined by an individual energy function. By hybridizing the intelligences of parallel ant colonies and Monte Carlo Metropolis search, this paper demonstrates an implementation of our parallel approach for PSP. 16 classical instances were tested to show that the parallel approach is competitive for solving PSP problem. This parallel approach combines various sources of both searching intelligences and energy functions, and thus predicts protein conformations with good quality jointly determined by all the parallel searching threads and energy functions. It provides a framework to combine different searching intelligence embedded in heuristic algorithms. It also constructs a container to hybridize different not-so-accurate objective functions which are usually derived from the domain expertise.

  13. When the Lowest Energy Does Not Induce Native Structures: Parallel Minimization of Multi-Energy Values by Hybridizing Searching Intelligences

    PubMed Central

    Lü, Qiang; Xia, Xiao-Yan; Chen, Rong; Miao, Da-Jun; Chen, Sha-Sha; Quan, Li-Jun; Li, Hai-Ou

    2012-01-01

    Background Protein structure prediction (PSP), which is usually modeled as a computational optimization problem, remains one of the biggest challenges in computational biology. PSP encounters two difficult obstacles: the inaccurate energy function problem and the searching problem. Even if the lowest energy has been luckily found by the searching procedure, the correct protein structures are not guaranteed to obtain. Results A general parallel metaheuristic approach is presented to tackle the above two problems. Multi-energy functions are employed to simultaneously guide the parallel searching threads. Searching trajectories are in fact controlled by the parameters of heuristic algorithms. The parallel approach allows the parameters to be perturbed during the searching threads are running in parallel, while each thread is searching the lowest energy value determined by an individual energy function. By hybridizing the intelligences of parallel ant colonies and Monte Carlo Metropolis search, this paper demonstrates an implementation of our parallel approach for PSP. 16 classical instances were tested to show that the parallel approach is competitive for solving PSP problem. Conclusions This parallel approach combines various sources of both searching intelligences and energy functions, and thus predicts protein conformations with good quality jointly determined by all the parallel searching threads and energy functions. It provides a framework to combine different searching intelligence embedded in heuristic algorithms. It also constructs a container to hybridize different not-so-accurate objective functions which are usually derived from the domain expertise. PMID:23028708

  14. Symbolic Analysis of Concurrent Programs with Polymorphism

    NASA Technical Reports Server (NTRS)

    Rungta, Neha Shyam

    2010-01-01

    The current trend of multi-core and multi-processor computing is causing a paradigm shift from inherently sequential to highly concurrent and parallel applications. Certain thread interleavings, data input values, or combinations of both often cause errors in the system. Systematic verification techniques such as explicit state model checking and symbolic execution are extensively used to detect errors in such systems [7, 9]. Explicit state model checking enumerates possible thread schedules and input data values of a program in order to check for errors [3, 9]. To partially mitigate the state space explosion from data input values, symbolic execution techniques substitute data input values with symbolic values [5, 7, 6]. Explicit state model checking and symbolic execution techniques used in conjunction with exhaustive search techniques such as depth-first search are unable to detect errors in medium to large-sized concurrent programs because the number of behaviors caused by data and thread non-determinism is extremely large. We present an overview of abstraction-guided symbolic execution for concurrent programs that detects errors manifested by a combination of thread schedules and data values [8]. The technique generates a set of key program locations relevant in testing the reachability of the target locations. The symbolic execution is then guided along these locations in an attempt to generate a feasible execution path to the error state. This allows the execution to focus in parts of the behavior space more likely to contain an error.

  15. metAlignID: a high-throughput software tool set for automated detection of trace level contaminants in comprehensive LECO two-dimensional gas chromatography time-of-flight mass spectrometry data.

    PubMed

    Lommen, Arjen; van der Kamp, Henk J; Kools, Harrie J; van der Lee, Martijn K; van der Weg, Guido; Mol, Hans G J

    2012-11-09

    A new alternative data processing tool set, metAlignID, is developed for automated pre-processing and library-based identification and concentration estimation of target compounds after analysis by comprehensive two-dimensional gas chromatography with mass spectrometric detection. The tool set has been developed for and tested on LECO data. The software is developed to run multi-threaded (one thread per processor core) on a standard PC (personal computer) under different operating systems and is as such capable of processing multiple data sets simultaneously. Raw data files are converted into netCDF (network Common Data Form) format using a fast conversion tool. They are then preprocessed using previously developed algorithms originating from metAlign software. Next, the resulting reduced data files are searched against a user-composed library (derived from user or commercial NIST-compatible libraries) (NIST=National Institute of Standards and Technology) and the identified compounds, including an indicative concentration, are reported in Excel format. Data can be processed batch wise. The overall time needed for conversion together with processing and searching of 30 raw data sets for 560 compounds is routinely within an hour. The screening performance is evaluated for detection of pesticides and contaminants in raw data obtained after analysis of soil and plant samples. Results are compared to the existing data-handling routine based on proprietary software (LECO, ChromaTOF). The developed software tool set, which is freely downloadable at www.metalign.nl, greatly accelerates data-analysis and offers more options for fine-tuning automated identification toward specific application needs. The quality of the results obtained is slightly better than the standard processing and also adds a quantitative estimate. The software tool set in combination with two-dimensional gas chromatography coupled to time-of-flight mass spectrometry shows great potential as a highly-automated and fast multi-residue instrumental screening method. Copyright © 2012 Elsevier B.V. All rights reserved.

  16. Merced

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hedstrom, Gerald; Beck, Bret; Mattoon, Caleb

    2016-10-01

    Merced performs a multi-dimensional integral tl generate so-called 'transfer matrices' for use in deterministic radiation transport applications. It produces transfer matrices on the user-defind energy grid. The angular dependence of outgoing products is captured in a Legendre expansion, up to a user-specified maximun Legendre order. Merced calculations can use multi-threading for enhanced performance on a single compute node.

  17. Language Issues in Mobile Program Security

    DTIC Science & Technology

    1998-01-01

    primitives for instance synchronous operations Nondeterminism and Privacy Now suppose we introduce nondeterminism via a simple concurrent language...code setting is that the only observable events are those that can be observed from within a mobile program using language primitives and any host...Possibilistic NI is given in It uses a main thread and two triggered threads each with a busy wait loop implementing a semaphore to copy every bit of

  18. Lack of ubiquitin immunoreactivities at both ends of neuropil threads. Possible bidirectional growth of neuropil threads.

    PubMed

    Iwatsubo, T; Hasegawa, M; Esaki, Y; Ihara, Y

    1992-02-01

    Immunocytochemically, neuropil threads (curly fibers) were investigated in the Alzheimer's disease brain using a confocal laser scanning fluorescence microscope by double labeling with tau/ubiquitin antibodies. Ubiquitin immunoreactivities were found to be lacking at one or both ends in more than 40% of tau-positive threads. Immunoelectron microscopy showed that bundles of paired helical filaments, which constitute neuropil threads, were positive for ubiquitin around their midportions, but often negative at their ends. Since it is reasonable to postulate that tau deposition as paired helical filaments precedes ubiquitination, the aforementioned observation suggests that the ends of the threads are newly formed portions, and thus the threads are often growing bidirectionally in small neuronal processes.

  19. High-resolution Observations of Downflows at One End of a Pre-eruption Filament

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Qin; Deng, Na; Jing, Ju

    Studying the dynamics of filaments at the pre-eruption phase can shed light on the precursor of eruptive events. Such high-resolution studies (of the order of 0.″1) are highly desirable yet very rare. In this work, we present a detailed observation of a pre-eruption evolution of a filament obtained by the 1.6 m New Solar Telescope (NST) at the Big Bear Solar Observatory (BBSO). One end of the filament is anchored at the sunspot in the NOAA active region (AR) 11515, which is well observed by NST H α off-bands from four hours before to one hour after the filament eruption.more » A M1.6 flare is associated with the eruption. We observed persistent downflowing materials along the H α multi-threaded component of the loop toward the AR end during the pre-eruption phase. We traced the trajectories of plasma blobs along the H α threads and obtained a plane-of-sky velocity of 45 km s{sup −1} on average. Furthermore, we estimated the real velocities of the downflows and the altitude of the filament by matching the observed H α threads with magnetic field lines extrapolated from a nonlinear force-free field model. Observations of chromospheric brightenings at the footpoints of the falling plasma blobs are also presented. The lower limit of the kinetic energy per second of the downflows through the brightenings is found to be ∼10{sup 21} erg. Larger FOV observations from BBSO full-disk H α images show that the AR end of the filament started ascending four hours before the flare. We attribute the observed downflows at the AR end of the filament to the draining effect of the filament rising prior to its eruption. During the slow-rise phase, the downflows continuously drained away ∼10{sup 15}g mass from the filament over a few hours, which is believed to be essential for the instability, and could be an important precursor of eruptive events.« less

  20. 29 CFR 1910.103 - Hydrogen.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ..., threaded, socket, or compression fittings. Gaskets and thread sealants shall be suitable for hydrogen... closures. (v) Piping, tubing, and fittings. (a) Piping, tubing, and fittings and gasket and thread sealants... including process or analytical equipment. (c) Be located 25 feet from concentrations of people. (d) Be...

  1. Thread bonds in molecules

    NASA Astrophysics Data System (ADS)

    Ivlev, B.

    2017-07-01

    Unusual chemical bonds are proposed. Each bond is characterized by the thread of a small radius, 10-11 cm, extended between two nuclei in a molecule. An analogue of a potential well, of the depth of MeV scale, is formed within the thread. This occurs due to the local reduction of zero point electromagnetic energy. This is similar to formation of the Casimir well. The electron-photon interaction only is not sufficient for formation of thread state. The mechanism of electron mass generation is involved in the close vicinity, 10-16 cm, of the thread. Thread bonds are stable and cannot be created or destructed in chemical or optical processes.

  2. Sequential protein unfolding through a carbon nanotube pore

    NASA Astrophysics Data System (ADS)

    Xu, Zhonghe; Zhang, Shuang; Weber, Jeffrey K.; Luan, Binquan; Zhou, Ruhong; Li, Jingyuan

    2016-06-01

    An assortment of biological processes, like protein degradation and the transport of proteins across membranes, depend on protein unfolding events mediated by nanopore interfaces. In this work, we exploit fully atomistic simulations of an artificial, CNT-based nanopore to investigate the nature of ubiquitin unfolding. With one end of the protein subjected to an external force, we observe non-canonical unfolding behaviour as ubiquitin is pulled through the pore opening. Secondary structural elements are sequentially detached from the protein and threaded into the nanotube, interestingly, the remaining part maintains native-like characteristics. The constraints of the nanopore interface thus facilitate the formation of stable ``unfoldon'' motifs above the nanotube aperture that can exist in the absence of specific native contacts with the other secondary structure. Destruction of these unfoldons gives rise to distinct force peaks in our simulations, providing us with a sensitive probe for studying the kinetics of serial unfolding events. Our detailed analysis of nanopore-mediated protein unfolding events not only provides insight into how related processes might proceed in the cell, but also serves to deepen our understanding of structural arrangements which form the basis for protein conformational stability.An assortment of biological processes, like protein degradation and the transport of proteins across membranes, depend on protein unfolding events mediated by nanopore interfaces. In this work, we exploit fully atomistic simulations of an artificial, CNT-based nanopore to investigate the nature of ubiquitin unfolding. With one end of the protein subjected to an external force, we observe non-canonical unfolding behaviour as ubiquitin is pulled through the pore opening. Secondary structural elements are sequentially detached from the protein and threaded into the nanotube, interestingly, the remaining part maintains native-like characteristics. The constraints of the nanopore interface thus facilitate the formation of stable ``unfoldon'' motifs above the nanotube aperture that can exist in the absence of specific native contacts with the other secondary structure. Destruction of these unfoldons gives rise to distinct force peaks in our simulations, providing us with a sensitive probe for studying the kinetics of serial unfolding events. Our detailed analysis of nanopore-mediated protein unfolding events not only provides insight into how related processes might proceed in the cell, but also serves to deepen our understanding of structural arrangements which form the basis for protein conformational stability. Electronic supplementary information (ESI) available. See DOI: 10.1039/c6nr00410e

  3. “Superluminal” FITS File Processing on Multiprocessors: Zero Time Endian Conversion Technique

    NASA Astrophysics Data System (ADS)

    Eguchi, Satoshi

    2013-05-01

    The FITS is the standard file format in astronomy, and it has been extended to meet the astronomical needs of the day. However, astronomical datasets have been inflating year by year. In the case of the ALMA telescope, a ˜TB-scale four-dimensional data cube may be produced for one target. Considering that typical Internet bandwidth is tens of MB/s at most, the original data cubes in FITS format are hosted on a VO server, and the region which a user is interested in should be cut out and transferred to the user (Eguchi et al. 2012). The system will equip a very high-speed disk array to process a TB-scale data cube in 10 s, and disk I/O speed, endian conversion, and data processing speeds will be comparable. Hence, reducing the endian conversion time is one of issues to solve in our system. In this article, I introduce a technique named “just-in-time endian conversion”, which delays the endian conversion for each pixel just before it is really needed, to sweep out the endian conversion time; by applying this method, the FITS processing speed increases 20% for single threading and 40% for multi-threading compared to CFITSIO. The speedup tightly relates to modern CPU architecture to improve the efficiency of instruction pipelines due to break of “causality”, a programmed instruction code sequence.

  4. Lack of ubiquitin immunoreactivities at both ends of neuropil threads. Possible bidirectional growth of neuropil threads.

    PubMed Central

    Iwatsubo, T.; Hasegawa, M.; Esaki, Y.; Ihara, Y.

    1992-01-01

    Immunocytochemically, neuropil threads (curly fibers) were investigated in the Alzheimer's disease brain using a confocal laser scanning fluorescence microscope by double labeling with tau/ubiquitin antibodies. Ubiquitin immunoreactivities were found to be lacking at one or both ends in more than 40% of tau-positive threads. Immunoelectron microscopy showed that bundles of paired helical filaments, which constitute neuropil threads, were positive for ubiquitin around their midportions, but often negative at their ends. Since it is reasonable to postulate that tau deposition as paired helical filaments precedes ubiquitination, the aforementioned observation suggests that the ends of the threads are newly formed portions, and thus the threads are often growing bidirectionally in small neuronal processes. Images Figure 1 Figure 2 PMID:1310831

  5. 49 CFR 178.38 - Specification 3B seamless steel cylinders.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... the heat number. (d) Manufacture. Cylinders must be manufactured using equipment and processes... plugs, etc.) for those openings. Threads, conforming to the following, are required on all openings: (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads when used, must be of a...

  6. 49 CFR 178.38 - Specification 3B seamless steel cylinders.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... the heat number. (d) Manufacture. Cylinders must be manufactured using equipment and processes... plugs, etc.) for those openings. Threads, conforming to the following, are required on all openings: (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads when used, must be of a...

  7. 49 CFR 178.38 - Specification 3B seamless steel cylinders.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... the heat number. (d) Manufacture. Cylinders must be manufactured using equipment and processes... plugs, etc.) for those openings. Threads, conforming to the following, are required on all openings: (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads when used, must be of a...

  8. 49 CFR 178.38 - Specification 3B seamless steel cylinders.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... the heat number. (d) Manufacture. Cylinders must be manufactured using equipment and processes... plugs, etc.) for those openings. Threads, conforming to the following, are required on all openings: (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads when used, must be of a...

  9. Memory and Energy Optimization Strategies for Multithreaded Operating System on the Resource-Constrained Wireless Sensor Node

    PubMed Central

    Liu, Xing; Hou, Kun Mean; de Vaulx, Christophe; Xu, Jun; Yang, Jianfeng; Zhou, Haiying; Shi, Hongling; Zhou, Peng

    2015-01-01

    Memory and energy optimization strategies are essential for the resource-constrained wireless sensor network (WSN) nodes. In this article, a new memory-optimized and energy-optimized multithreaded WSN operating system (OS) LiveOS is designed and implemented. Memory cost of LiveOS is optimized by using the stack-shifting hybrid scheduling approach. Different from the traditional multithreaded OS in which thread stacks are allocated statically by the pre-reservation, thread stacks in LiveOS are allocated dynamically by using the stack-shifting technique. As a result, memory waste problems caused by the static pre-reservation can be avoided. In addition to the stack-shifting dynamic allocation approach, the hybrid scheduling mechanism which can decrease both the thread scheduling overhead and the thread stack number is also implemented in LiveOS. With these mechanisms, the stack memory cost of LiveOS can be reduced more than 50% if compared to that of a traditional multithreaded OS. Not is memory cost optimized, but also the energy cost is optimized in LiveOS, and this is achieved by using the multi-core “context aware” and multi-core “power-off/wakeup” energy conservation approaches. By using these approaches, energy cost of LiveOS can be reduced more than 30% when compared to the single-core WSN system. Memory and energy optimization strategies in LiveOS not only prolong the lifetime of WSN nodes, but also make the multithreaded OS feasible to run on the memory-constrained WSN nodes. PMID:25545264

  10. Employing Nested OpenMP for the Parallelization of Multi-Zone Computational Fluid Dynamics Applications

    NASA Technical Reports Server (NTRS)

    Ayguade, Eduard; Gonzalez, Marc; Martorell, Xavier; Jost, Gabriele

    2004-01-01

    In this paper we describe the parallelization of the multi-zone code versions of the NAS Parallel Benchmarks employing multi-level OpenMP parallelism. For our study we use the NanosCompiler, which supports nesting of OpenMP directives and provides clauses to control the grouping of threads, load balancing, and synchronization. We report the benchmark results, compare the timings with those of different hybrid parallelization paradigms and discuss OpenMP implementation issues which effect the performance of multi-level parallel applications.

  11. 49 CFR 178.68 - Specification 4E welded aluminum cylinders.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... to cylinder by welding by inert gas shielded arc process or by threads. If threads are used, they... process and cylinders with longitudinal seams are not authorized. (b) Authorized material. The cylinder...'s lot number. (d) Manufacture. Cylinders must be manufactured using equipment and processes adequate...

  12. 49 CFR 178.68 - Specification 4E welded aluminum cylinders.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... to cylinder by welding by inert gas shielded arc process or by threads. If threads are used, they... process and cylinders with longitudinal seams are not authorized. (b) Authorized material. The cylinder...'s lot number. (d) Manufacture. Cylinders must be manufactured using equipment and processes adequate...

  13. 49 CFR 178.68 - Specification 4E welded aluminum cylinders.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... to cylinder by welding by inert gas shielded arc process or by threads. If threads are used, they... process and cylinders with longitudinal seams are not authorized. (b) Authorized material. The cylinder...'s lot number. (d) Manufacture. Cylinders must be manufactured using equipment and processes adequate...

  14. Adapting the serial Alpgen parton-interaction generator to simulate LHC collisions on millions of parallel threads

    NASA Astrophysics Data System (ADS)

    Childers, J. T.; Uram, T. D.; LeCompte, T. J.; Papka, M. E.; Benjamin, D. P.

    2017-01-01

    As the LHC moves to higher energies and luminosity, the demand for computing resources increases accordingly and will soon outpace the growth of the Worldwide LHC Computing Grid. To meet this greater demand, event generation Monte Carlo was targeted for adaptation to run on Mira, the supercomputer at the Argonne Leadership Computing Facility. Alpgen is a Monte Carlo event generation application that is used by LHC experiments in the simulation of collisions that take place in the Large Hadron Collider. This paper details the process by which Alpgen was adapted from a single-processor serial-application to a large-scale parallel-application and the performance that was achieved.

  15. Threaded cognition: an integrated theory of concurrent multitasking.

    PubMed

    Salvucci, Dario D; Taatgen, Niels A

    2008-01-01

    The authors propose the idea of threaded cognition, an integrated theory of concurrent multitasking--that is, performing 2 or more tasks at once. Threaded cognition posits that streams of thought can be represented as threads of processing coordinated by a serial procedural resource and executed across other available resources (e.g., perceptual and motor resources). The theory specifies a parsimonious mechanism that allows for concurrent execution, resource acquisition, and resolution of resource conflicts, without the need for specialized executive processes. By instantiating this mechanism as a computational model, threaded cognition provides explicit predictions of how multitasking behavior can result in interference, or lack thereof, for a given set of tasks. The authors illustrate the theory in model simulations of several representative domains ranging from simple laboratory tasks such as dual-choice tasks to complex real-world domains such as driving and driver distraction. (c) 2008 APA, all rights reserved

  16. High-Level Data Races

    NASA Technical Reports Server (NTRS)

    Artho, Cyrille; Havelund, Klaus; Biere, Armin; Koga, Dennis (Technical Monitor)

    2003-01-01

    Data races are a common problem in concurrent and multi-threaded programming. They are hard to detect without proper tool support. Despite the successful application of these tools, experience shows that the notion of data race is not powerful enough to capture certain types of inconsistencies occurring in practice. In this paper we investigate data races on a higher abstraction layer. This enables us to detect inconsistent uses of shared variables, even if no classical race condition occurs. For example, a data structure representing a coordinate pair may have to be treated atomically. By lifting the meaning of a data race to a higher level, such problems can now be covered. The paper defines the concepts view and view consistency to give a notation for this novel kind of property. It describes what kinds of errors can be detected with this new definition, and where its limitations are. It also gives a formal guideline for using data structures in a multi-threading environment.

  17. High-Performance, Multi-Node File Copies and Checksums for Clustered File Systems

    NASA Technical Reports Server (NTRS)

    Kolano, Paul Z.; Ciotti, Robert B.

    2012-01-01

    Modern parallel file systems achieve high performance using a variety of techniques, such as striping files across multiple disks to increase aggregate I/O bandwidth and spreading disks across multiple servers to increase aggregate interconnect bandwidth. To achieve peak performance from such systems, it is typically necessary to utilize multiple concurrent readers/writers from multiple systems to overcome various singlesystem limitations, such as number of processors and network bandwidth. The standard cp and md5sum tools of GNU coreutils found on every modern Unix/Linux system, however, utilize a single execution thread on a single CPU core of a single system, and hence cannot take full advantage of the increased performance of clustered file systems. Mcp and msum are drop-in replacements for the standard cp and md5sum programs that utilize multiple types of parallelism and other optimizations to achieve maximum copy and checksum performance on clustered file systems. Multi-threading is used to ensure that nodes are kept as busy as possible. Read/write parallelism allows individual operations of a single copy to be overlapped using asynchronous I/O. Multinode cooperation allows different nodes to take part in the same copy/checksum. Split-file processing allows multiple threads to operate concurrently on the same file. Finally, hash trees allow inherently serial checksums to be performed in parallel. Mcp and msum provide significant performance improvements over standard cp and md5sum using multiple types of parallelism and other optimizations. The total speed-ups from all improvements are significant. Mcp improves cp performance over 27x, msum improves md5sum performance almost 19x, and the combination of mcp and msum improves verified copies via cp and md5sum by almost 22x. These improvements come in the form of drop-in replacements for cp and md5sum, so are easily used and are available for download as open source software at http://mutil.sourceforge.net.

  18. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Skochko, G.W.; Herrmann, T.P.

    Axial load cycling fatigue tests of threaded fasteners are useful in determining fastener fatigue failure or design properties. By using appropriate design factors between the failure and design fatigue strengths, such tests are used to establish fatigue failure and design parameters of fasteners for axial and bending cyclic load conditions. This paper reviews the factors which influence the fatigue strength of low Alloy steel threaded fasteners, identifies those most significant to fatigue strength, and provides design guidelines based on the direct evaluation of fatigue tests of threaded fasteners. Influences on fatigue strength of thread manufacturing process (machining and rolling ofmore » threads), effect of fastener membrane and bending stresses, thread root radii, fastener sizes, fastener tensile strength, stress relaxation, mean stress, and test temperature are discussed.« less

  19. 49 CFR 178.36 - Specification 3A and 3AX seamless steel cylinders.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    .... Cylinders must be manufactured using equipment and processes adequate to ensure that each cylinder produced... openings. Threads are required on openings. (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads, when used, must be of length not less than as specified for American Standard...

  20. 49 CFR 178.45 - Specification 3T seamless steel cylinder.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ..., or electric furnace process steel of uniform quality is authorized. The steel analysis must conform... inches in diameter. (4) All openings must be circular. (5) All openings must be threaded. Threads must be in compliance with the following: (i) Each thread must be clean cut, even, without any checks, and to...

  1. 16 CFR 1632.4 - Mattress test procedure.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... directly over the thread or in the depression created by the quilting process on the half of the test... characteristic such as permanent press or flame resistance. It shall have 120-210 threads per square inch and... their full lengths over the thread or depression, then the cigarettes shall be positioned in a manner...

  2. 49 CFR 178.45 - Specification 3T seamless steel cylinder.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ..., or electric furnace process steel of uniform quality is authorized. The steel analysis must conform... inches in diameter. (4) All openings must be circular. (5) All openings must be threaded. Threads must be in compliance with the following: (i) Each thread must be clean cut, even, without any checks, and to...

  3. 49 CFR 178.45 - Specification 3T seamless steel cylinder.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ..., or electric furnace process steel of uniform quality is authorized. The steel analysis must conform... inches in diameter. (4) All openings must be circular. (5) All openings must be threaded. Threads must be in compliance with the following: (i) Each thread must be clean cut, even, without any checks, and to...

  4. 16 CFR 1512.4 - Mechanical requirements.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... of edges, or any burrs or spurs caused during the shearing process. (c) Integrity. There shall be no.... Recommended quality thread form is specified in Handbook H28, “Screw Thread Standards for Federal Service,” 1..., 262, and 263, “General Purpose Screw Threads.” 2 1 Copies may be obtained from: Superintendent of...

  5. 16 CFR 1632.4 - Mattress test procedure.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... directly over the thread or in the depression created by the quilting process on the half of the test... as permanent press or flame resistance. It shall have 120-210 threads per square inch and fabric... their full lengths over the thread or depression, then the cigarettes shall be positioned in a manner...

  6. 49 CFR 178.36 - Specification 3A and 3AX seamless steel cylinders.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    .... Cylinders must be manufactured using equipment and processes adequate to ensure that each cylinder produced... openings. Threads are required on openings. (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads, when used, must be of length not less than as specified for American Standard...

  7. 16 CFR 1512.4 - Mechanical requirements.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... burrs of spurs caused during the shearing process. (c) Integrity. There shall be no visible fracture of... quality thread form is specified in Handbook H28, “Screw Thread Standards for Federal Service,” 1 issued..., 262, and 263, “General Purpose Screw Threads.” 2 1 Copies may be obtained from: Superintendent of...

  8. 16 CFR 1512.4 - Mechanical requirements.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... of edges, or any burrs or spurs caused during the shearing process. (c) Integrity. There shall be no.... Recommended quality thread form is specified in Handbook H28, “Screw Thread Standards for Federal Service,” 1..., 262, and 263, “General Purpose Screw Threads.” 2 1 Copies may be obtained from: Superintendent of...

  9. 49 CFR 178.36 - Specification 3A and 3AX seamless steel cylinders.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    .... Cylinders must be manufactured using equipment and processes adequate to ensure that each cylinder produced... openings. Threads are required on openings. (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads, when used, must be of length not less than as specified for American Standard...

  10. HPC Profiling with the Sun Studio™ Performance Tools

    NASA Astrophysics Data System (ADS)

    Itzkowitz, Marty; Maruyama, Yukon

    In this paper, we describe how to use the Sun Studio Performance Tools to understand the nature and causes of application performance problems. We first explore CPU and memory performance problems for single-threaded applications, giving some simple examples. Then, we discuss multi-threaded performance issues, such as locking and false-sharing of cache lines, in each case showing how the tools can help. We go on to describe OpenMP applications and the support for them in the performance tools. Then we discuss MPI applications, and the techniques used to profile them. Finally, we present our conclusions.

  11. 29 CFR 530.1 - Definitions.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... knitting process is performed; and the manufacture of bathing suits from any purchased fabric: Provided... other finishing of knitted shirts made in the same establishment as that where the knitting process is... embroidery, thread splitting, embroidery thread cutting, scallop cutting, lace cutting, lace making-up...

  12. 29 CFR 530.1 - Definitions.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... knitting process is performed; and the manufacture of bathing suits from any purchased fabric: Provided... other finishing of knitted shirts made in the same establishment as that where the knitting process is... embroidery, thread splitting, embroidery thread cutting, scallop cutting, lace cutting, lace making-up...

  13. 29 CFR 530.1 - Definitions.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... knitting process is performed; and the manufacture of bathing suits from any purchased fabric: Provided... other finishing of knitted shirts made in the same establishment as that where the knitting process is... embroidery, thread splitting, embroidery thread cutting, scallop cutting, lace cutting, lace making-up...

  14. 29 CFR 530.1 - Definitions.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... knitting process is performed; and the manufacture of bathing suits from any purchased fabric: Provided... other finishing of knitted shirts made in the same establishment as that where the knitting process is... embroidery, thread splitting, embroidery thread cutting, scallop cutting, lace cutting, lace making-up...

  15. Scheduling based on a dynamic resource connection

    NASA Astrophysics Data System (ADS)

    Nagiyev, A. E.; Botygin, I. A.; Shersntneva, A. I.; Konyaev, P. A.

    2017-02-01

    The practical using of distributed computing systems associated with many problems, including troubles with the organization of an effective interaction between the agents located at the nodes of the system, with the specific configuration of each node of the system to perform a certain task, with the effective distribution of the available information and computational resources of the system, with the control of multithreading which implements the logic of solving research problems and so on. The article describes the method of computing load balancing in distributed automatic systems, focused on the multi-agency and multi-threaded data processing. The scheme of the control of processing requests from the terminal devices, providing the effective dynamic scaling of computing power under peak load is offered. The results of the model experiments research of the developed load scheduling algorithm are set out. These results show the effectiveness of the algorithm even with a significant expansion in the number of connected nodes and zoom in the architecture distributed computing system.

  16. The research and development of the non-contact detection of the tubing internal thread with a line structured light

    NASA Astrophysics Data System (ADS)

    Hu, Yuanyuan; Xu, Yingying; Hao, Qun; Hu, Yao

    2013-12-01

    The tubing internal thread plays an irreplaceable role in the petroleum equipment. The unqualified tubing can directly lead to leakage, slippage and bring huge losses for oil industry. For the purpose of improving efficiency and precision of tubing internal thread detection, we develop a new non-contact tubing internal thread measurement system based on the laser triangulation principle. Firstly, considering that the tubing thread had a small diameter and relatively smooth surface, we built a set of optical system with a line structured light to irradiate the internal thread surface and obtain an image which contains the internal thread profile information through photoelectric sensor. Secondly, image processing techniques were used to do the edge detection of the internal thread from the obtained image. One key method was the sub-pixel technique which greatly improved the detection accuracy under the same hardware conditions. Finally, we restored the real internal thread contour information on the basis of laser triangulation method and calculated tubing thread parameters such as the pitch, taper and tooth type angle. In this system, the profile of several thread teeth can be obtained at the same time. Compared with other existing scanning methods using point light and stepper motor, this system greatly improves the detection efficiency. Experiment results indicate that this system can achieve the high precision and non-contact measurement of the tubing internal thread.

  17. Adaptive and mobile ground sensor array.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Holzrichter, Michael Warren; O'Rourke, William T.; Zenner, Jennifer

    The goal of this LDRD was to demonstrate the use of robotic vehicles for deploying and autonomously reconfiguring seismic and acoustic sensor arrays with high (centimeter) accuracy to obtain enhancement of our capability to locate and characterize remote targets. The capability to accurately place sensors and then retrieve and reconfigure them allows sensors to be placed in phased arrays in an initial monitoring configuration and then to be reconfigured in an array tuned to the specific frequencies and directions of the selected target. This report reviews the findings and accomplishments achieved during this three-year project. This project successfully demonstrated autonomousmore » deployment and retrieval of a payload package with an accuracy of a few centimeters using differential global positioning system (GPS) signals. It developed an autonomous, multisensor, temporally aligned, radio-frequency communication and signal processing capability, and an array optimization algorithm, which was implemented on a digital signal processor (DSP). Additionally, the project converted the existing single-threaded, monolithic robotic vehicle control code into a multi-threaded, modular control architecture that enhances the reuse of control code in future projects.« less

  18. Development of an extensible dual-core wireless sensing node for cyber-physical systems

    NASA Astrophysics Data System (ADS)

    Kane, Michael; Zhu, Dapeng; Hirose, Mitsuhito; Dong, Xinjun; Winter, Benjamin; Häckell, Mortiz; Lynch, Jerome P.; Wang, Yang; Swartz, A.

    2014-04-01

    The introduction of wireless telemetry into the design of monitoring and control systems has been shown to reduce system costs while simplifying installations. To date, wireless nodes proposed for sensing and actuation in cyberphysical systems have been designed using microcontrollers with one computational pipeline (i.e., single-core microcontrollers). While concurrent code execution can be implemented on single-core microcontrollers, concurrency is emulated by splitting the pipeline's resources to support multiple threads of code execution. For many applications, this approach to multi-threading is acceptable in terms of speed and function. However, some applications such as feedback controls demand deterministic timing of code execution and maximum computational throughput. For these applications, the adoption of multi-core processor architectures represents one effective solution. Multi-core microcontrollers have multiple computational pipelines that can execute embedded code in parallel and can be interrupted independent of one another. In this study, a new wireless platform named Martlet is introduced with a dual-core microcontroller adopted in its design. The dual-core microcontroller design allows Martlet to dedicate one core to standard wireless sensor operations while the other core is reserved for embedded data processing and real-time feedback control law execution. Another distinct feature of Martlet is a standardized hardware interface that allows specialized daughter boards (termed wing boards) to be interfaced to the Martlet baseboard. This extensibility opens opportunity to encapsulate specialized sensing and actuation functions in a wing board without altering the design of Martlet. In addition to describing the design of Martlet, a few example wings are detailed, along with experiments showing the Martlet's ability to monitor and control physical systems such as wind turbines and buildings.

  19. Simulation of DKIST solar adaptive optics system

    NASA Astrophysics Data System (ADS)

    Marino, Jose; Carlisle, Elizabeth; Schmidt, Dirk

    2016-07-01

    Solar adaptive optics (AO) simulations are a valuable tool to guide the design and optimization process of current and future solar AO and multi-conjugate AO (MCAO) systems. Solar AO and MCAO systems rely on extended object cross-correlating Shack-Hartmann wavefront sensors to measure the wavefront. Accurate solar AO simulations require computationally intensive operations, which have until recently presented a prohibitive computational cost. We present an update on the status of a solar AO and MCAO simulation tool being developed at the National Solar Observatory. The simulation tool is a multi-threaded application written in the C++ language that takes advantage of current large multi-core CPU computer systems and fast ethernet connections to provide accurate full simulation of solar AO and MCAO systems. It interfaces with KAOS, a state of the art solar AO control software developed by the Kiepenheuer-Institut fuer Sonnenphysik, that provides reliable AO control. We report on the latest results produced by the solar AO simulation tool.

  20. 49 CFR 178.37 - Specification 3AA and 3AAX seamless steel cylinders.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... processes adequate to ensure that each cylinder produced conforms to the requirements of this subpart. No..., fuse plugs, etc.) for those openings. Threads are required on openings. (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads, when used, must be of a length not less than as...

  1. 49 CFR 178.37 - Specification 3AA and 3AAX seamless steel cylinders.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... processes adequate to ensure that each cylinder produced conforms to the requirements of this subpart. No..., fuse plugs, etc.) for those openings. Threads are required on openings. (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads, when used, must be of a length not less than as...

  2. 49 CFR 178.37 - Specification 3AA and 3AAX seamless steel cylinders.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... processes adequate to ensure that each cylinder produced conforms to the requirements of this subpart. No..., fuse plugs, etc.) for those openings. Threads are required on openings. (1) Threads must be clean cut, even, without checks, and to gauge. (2) Taper threads, when used, must be of a length not less than as...

  3. 16 CFR § 1512.4 - Mechanical requirements.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... of edges, or any burrs or spurs caused during the shearing process. (c) Integrity. There shall be no.... Recommended quality thread form is specified in Handbook H28, “Screw Thread Standards for Federal Service,” 1..., 262, and 263, “General Purpose Screw Threads.” 2 1 Copies may be obtained from: Superintendent of...

  4. A comparative study of history-based versus vectorized Monte Carlo methods in the GPU/CUDA environment for a simple neutron eigenvalue problem

    NASA Astrophysics Data System (ADS)

    Liu, Tianyu; Du, Xining; Ji, Wei; Xu, X. George; Brown, Forrest B.

    2014-06-01

    For nuclear reactor analysis such as the neutron eigenvalue calculations, the time consuming Monte Carlo (MC) simulations can be accelerated by using graphics processing units (GPUs). However, traditional MC methods are often history-based, and their performance on GPUs is affected significantly by the thread divergence problem. In this paper we describe the development of a newly designed event-based vectorized MC algorithm for solving the neutron eigenvalue problem. The code was implemented using NVIDIA's Compute Unified Device Architecture (CUDA), and tested on a NVIDIA Tesla M2090 GPU card. We found that although the vectorized MC algorithm greatly reduces the occurrence of thread divergence thus enhancing the warp execution efficiency, the overall simulation speed is roughly ten times slower than the history-based MC code on GPUs. Profiling results suggest that the slow speed is probably due to the memory access latency caused by the large amount of global memory transactions. Possible solutions to improve the code efficiency are discussed.

  5. Problems and Processes in Medical Encounters: The CASES method of dialogue analysis

    PubMed Central

    Laws, M. Barton; Taubin, Tatiana; Bezreh, Tanya; Lee, Yoojin; Beach, Mary Catherine; Wilson, Ira B.

    2013-01-01

    Objective To develop methods to reliably capture structural and dynamic temporal features of clinical interactions. Methods Observational study of 50 audio-recorded routine outpatient visits to HIV specialty clinics, using innovative analytic methods. The Comprehensive Analysis of the Structure of Encounters System (CASES) uses transcripts coded for speech acts, then imposes larger-scale structural elements: threads – the problems or issues addressed; and processes within threads –basic tasks of clinical care labeled Presentation, Information, Resolution (decision making) and Engagement (interpersonal exchange). Threads are also coded for the nature of resolution. Results 61% of utterances are in presentation processes. Provider verbal dominance is greatest in information and resolution processes, which also contain a high proportion of provider directives. About half of threads result in no action or decision. Information flows predominantly from patient to provider in presentation processes, and from provider to patient in information processes. Engagement is rare. Conclusions In this data, resolution is provider centered; more time for patient participation in resolution, or interpersonal engagement, would have to come from presentation. Practice Implications Awareness of the use of time in clinical encounters, and the interaction processes associated with various tasks, may help make clinical communication more efficient and effective. PMID:23391684

  6. Problems and processes in medical encounters: the cases method of dialogue analysis.

    PubMed

    Laws, M Barton; Taubin, Tatiana; Bezreh, Tanya; Lee, Yoojin; Beach, Mary Catherine; Wilson, Ira B

    2013-05-01

    To develop methods to reliably capture structural and dynamic temporal features of clinical interactions. Observational study of 50 audio-recorded routine outpatient visits to HIV specialty clinics, using innovative analytic methods. The comprehensive analysis of the structure of encounters system (CASES) uses transcripts coded for speech acts, then imposes larger-scale structural elements: threads--the problems or issues addressed; and processes within threads--basic tasks of clinical care labeled presentation, information, resolution (decision making) and Engagement (interpersonal exchange). Threads are also coded for the nature of resolution. 61% of utterances are in presentation processes. Provider verbal dominance is greatest in information and resolution processes, which also contain a high proportion of provider directives. About half of threads result in no action or decision. Information flows predominantly from patient to provider in presentation processes, and from provider to patient in information processes. Engagement is rare. In this data, resolution is provider centered; more time for patient participation in resolution, or interpersonal engagement, would have to come from presentation. Awareness of the use of time in clinical encounters, and the interaction processes associated with various tasks, may help make clinical communication more efficient and effective. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  7. a Cache Design Method for Spatial Information Visualization in 3d Real-Time Rendering Engine

    NASA Astrophysics Data System (ADS)

    Dai, X.; Xiong, H.; Zheng, X.

    2012-07-01

    A well-designed cache system has positive impacts on the 3D real-time rendering engine. As the amount of visualization data getting larger, the effects become more obvious. They are the base of the 3D real-time rendering engine to smoothly browsing through the data, which is out of the core memory, or from the internet. In this article, a new kind of caches which are based on multi threads and large file are introduced. The memory cache consists of three parts, the rendering cache, the pre-rendering cache and the elimination cache. The rendering cache stores the data that is rendering in the engine; the data that is dispatched according to the position of the view point in the horizontal and vertical directions is stored in the pre-rendering cache; the data that is eliminated from the previous cache is stored in the eliminate cache and is going to write to the disk cache. Multi large files are used in the disk cache. When a disk cache file size reaches the limit length(128M is the top in the experiment), no item will be eliminated from the file, but a new large cache file will be created. If the large file number is greater than the maximum number that is pre-set, the earliest file will be deleted from the disk. In this way, only one file is opened for writing and reading, and the rest are read-only so the disk cache can be used in a high asynchronous way. The size of the large file is limited in order to map to the core memory to save loading time. Multi-thread is used to update the cache data. The threads are used to load data to the rendering cache as soon as possible for rendering, to load data to the pre-rendering cache for rendering next few frames, and to load data to the elimination cache which is not necessary for the moment. In our experiment, two threads are designed. The first thread is to organize the memory cache according to the view point, and created two threads: the adding list and the deleting list, the adding list index the data that should be loaded to the pre-rendering cache immediately, the deleting list index the data that is no longer visible in the rendering scene and should be moved to the eliminate cache; the other thread is to move the data in the memory and disk cache according to the adding and the deleting list, and create the download requests when the data is indexed in the adding but cannot be found either in memory cache or disk cache, eliminate cache data is moved to the disk cache when the adding list and deleting are empty. The cache designed as described above in our experiment shows reliable and efficient, and the data loading time and files I/O time decreased sharply, especially when the rendering data getting larger.

  8. In-Storage Embedded Accelerator for Sparse Pattern Processing

    DTIC Science & Technology

    2016-08-13

    performance of RAM disk. Since this configuration offloads most of processing onto the FPGA, the host software consists of only two threads for...more. Fig. 13. Document Processed vs CPU Threads Note that BlueDBM efficiency comes from our in-store processing paradigm that uses the FPGA...In-Storage Embedded Accelerator for Sparse Pattern Processing Sang-Woo Jun*, Huy T. Nguyen#, Vijay Gadepally#*, and Arvind* #MIT Lincoln Laboratory

  9. Validation of a virtual source model of medical linac for Monte Carlo dose calculation using multi-threaded Geant4

    NASA Astrophysics Data System (ADS)

    Aboulbanine, Zakaria; El Khayati, Naïma

    2018-04-01

    The use of phase space in medical linear accelerator Monte Carlo (MC) simulations significantly improves the execution time and leads to results comparable to those obtained from full calculations. The classical representation of phase space stores directly the information of millions of particles, producing bulky files. This paper presents a virtual source model (VSM) based on a reconstruction algorithm, taking as input a compressed file of roughly 800 kb derived from phase space data freely available in the International Atomic Energy Agency (IAEA) database. This VSM includes two main components; primary and scattered particle sources, with a specific reconstruction method developed for each. Energy spectra and other relevant variables were extracted from IAEA phase space and stored in the input description data file for both sources. The VSM was validated for three photon beams: Elekta Precise 6 MV/10 MV and a Varian TrueBeam 6 MV. Extensive calculations in water and comparisons between dose distributions of the VSM and IAEA phase space were performed to estimate the VSM precision. The Geant4 MC toolkit in multi-threaded mode (Geant4-[mt]) was used for fast dose calculations and optimized memory use. Four field configurations were chosen for dose calculation validation to test field size and symmetry effects, , , and for squared fields, and for an asymmetric rectangular field. Good agreement in terms of formalism, for 3%/3 mm and 2%/3 mm criteria, for each evaluated radiation field and photon beam was obtained within a computation time of 60 h on a single WorkStation for a 3 mm voxel matrix. Analyzing the VSM’s precision in high dose gradient regions, using the distance to agreement concept (DTA), showed also satisfactory results. In all investigated cases, the mean DTA was less than 1 mm in build-up and penumbra regions. In regards to calculation efficiency, the event processing speed is six times faster using Geant4-[mt] compared to sequential Geant4, when running the same simulation code for both. The developed VSM for 6 MV/10 MV beams widely used, is a general concept easy to adapt in order to reconstruct comparable beam qualities for various linac configurations, facilitating its integration for MC treatment planning purposes.

  10. 75 FR 34943 - Defense Federal Acquisition Regulation Supplement; Para-Aramid Fibers and Yarns Manufactured in a...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-06-21

    ... needed for national defense purposes. Another respondent notes that DuPont is in the process of building.... Domestic Para-Aramid Sewing Thread May Be of Lower Quality One respondent fully supported the interim rule... specification to use para-aramid thread that was heavier and weaker than the commercial thread that was used in...

  11. Parallel mutual information estimation for inferring gene regulatory networks on GPUs

    PubMed Central

    2011-01-01

    Background Mutual information is a measure of similarity between two variables. It has been widely used in various application domains including computational biology, machine learning, statistics, image processing, and financial computing. Previously used simple histogram based mutual information estimators lack the precision in quality compared to kernel based methods. The recently introduced B-spline function based mutual information estimation method is competitive to the kernel based methods in terms of quality but at a lower computational complexity. Results We present a new approach to accelerate the B-spline function based mutual information estimation algorithm with commodity graphics hardware. To derive an efficient mapping onto this type of architecture, we have used the Compute Unified Device Architecture (CUDA) programming model to design and implement a new parallel algorithm. Our implementation, called CUDA-MI, can achieve speedups of up to 82 using double precision on a single GPU compared to a multi-threaded implementation on a quad-core CPU for large microarray datasets. We have used the results obtained by CUDA-MI to infer gene regulatory networks (GRNs) from microarray data. The comparisons to existing methods including ARACNE and TINGe show that CUDA-MI produces GRNs of higher quality in less time. Conclusions CUDA-MI is publicly available open-source software, written in CUDA and C++ programming languages. It obtains significant speedup over sequential multi-threaded implementation by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs. PMID:21672264

  12. Ultrastructural characterization of the tau-immunoreactive tubules in the oligodendroglial perikarya and their inner loop processes in progressive supranuclear palsy.

    PubMed

    Arima, K; Nakamura, M; Sunohara, N; Ogawa, M; Anno, M; Izumiyama, Y; Hirai, S; Ikeda, K

    1997-06-01

    Coiled bodies and interfascicular threads are conspicuous white matter abnormalities of brains of patients with progressive supranuclear palsy (PSP). Both structures are argyrophilic and immunoreactive for the microtubule-binding protein tau. This report concerns the ultrastructural localization of interfascicular threads and their relationship to coiled bodies in five PSP patients. We showed for the first time that abnormal tubules with a 13- to 15-nm diameter and fuzzy outer contours were the common structures of coiled bodies in the oligodendroglial perikarya and of interfascicular threads. Moreover, the tubules were immunolabeled by anti-tau antibodies. The abnormal tau-positive tubules of interfascicular threads were located in the inner loop of the myelin sheath. Our study further indicated that the thread-like structures in the white matter comprised, at least in part, oligodendroglial processes, and that they were also present in gray matter. We consider that the formation of coiled bodies in the perikarya and of interfascicular threads represents a common cytoskeletal abnormality of the oligodendroglia of PSP patients. Moreover, even though the white matter alterations of PSP resemble those of corticobasal degeneration, there are certain ultrastructural differences in the abnormal oligodendroglial tubules of the two diseases.

  13. Multithreading in vector processors

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Evangelinos, Constantinos; Kim, Changhoan; Nair, Ravi

    In one embodiment, a system includes a processor having a vector processing mode and a multithreading mode. The processor is configured to operate on one thread per cycle in the multithreading mode. The processor includes a program counter register having a plurality of program counters, and the program counter register is vectorized. Each program counter in the program counter register represents a distinct corresponding thread of a plurality of threads. The processor is configured to execute the plurality of threads by activating the plurality of program counters in a round robin cycle.

  14. 31 CFR 601.2 - Description of paper.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... the Secretary of the Treasury, security threads embedded beneath the surface of the paper during the manufacturing process. Security threads shall contain graphics consisting of the designation “USA” and the..., incorporated in the body of the paper while in the process of manufacture and evenly distributed throughout. In...

  15. 31 CFR 601.2 - Description of paper.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... the Secretary of the Treasury, security threads embedded beneath the surface of the paper during the manufacturing process. Security threads shall contain graphics consisting of the designation “USA” and the..., incorporated in the body of the paper while in the process of manufacture and evenly distributed throughout. In...

  16. 31 CFR 601.2 - Description of paper.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... the Secretary of the Treasury, security threads embedded beneath the surface of the paper during the manufacturing process. Security threads shall contain graphics consisting of the designation “USA” and the..., incorporated in the body of the paper while in the process of manufacture and evenly distributed throughout. In...

  17. 31 CFR 601.2 - Description of paper.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... the Secretary of the Treasury, security threads embedded beneath the surface of the paper during the manufacturing process. Security threads shall contain graphics consisting of the designation “USA” and the..., incorporated in the body of the paper while in the process of manufacture and evenly distributed throughout. In...

  18. 31 CFR 601.2 - Description of paper.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... the Secretary of the Treasury, security threads embedded beneath the surface of the paper during the manufacturing process. Security threads shall contain graphics consisting of the designation “USA” and the..., incorporated in the body of the paper while in the process of manufacture and evenly distributed throughout. In...

  19. Adapting the serial Alpgen parton-interaction generator to simulate LHC collisions on millions of parallel threads

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Childers, J. T.; Uram, T. D.; LeCompte, T. J.

    As the LHC moves to higher energies and luminosity, the demand for computing resources increases accordingly and will soon outpace the growth of the World- wide LHC Computing Grid. To meet this greater demand, event generation Monte Carlo was targeted for adaptation to run on Mira, the supercomputer at the Argonne Leadership Computing Facility. Alpgen is a Monte Carlo event generation application that is used by LHC experiments in the simulation of collisions that take place in the Large Hadron Collider. This paper details the process by which Alpgen was adapted from a single-processor serial-application to a large-scale parallel-application andmore » the performance that was achieved.« less

  20. Adapting the serial Alpgen parton-interaction generator to simulate LHC collisions on millions of parallel threads

    DOE PAGES

    Childers, J. T.; Uram, T. D.; LeCompte, T. J.; ...

    2016-09-29

    As the LHC moves to higher energies and luminosity, the demand for computing resources increases accordingly and will soon outpace the growth of the Worldwide LHC Computing Grid. To meet this greater demand, event generation Monte Carlo was targeted for adaptation to run on Mira, the supercomputer at the Argonne Leadership Computing Facility. Alpgen is a Monte Carlo event generation application that is used by LHC experiments in the simulation of collisions that take place in the Large Hadron Collider. Finally, this paper details the process by which Alpgen was adapted from a single-processor serial-application to a large-scale parallel-application andmore » the performance that was achieved.« less

  1. Adapting the serial Alpgen parton-interaction generator to simulate LHC collisions on millions of parallel threads

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Childers, J. T.; Uram, T. D.; LeCompte, T. J.

    As the LHC moves to higher energies and luminosity, the demand for computing resources increases accordingly and will soon outpace the growth of the Worldwide LHC Computing Grid. To meet this greater demand, event generation Monte Carlo was targeted for adaptation to run on Mira, the supercomputer at the Argonne Leadership Computing Facility. Alpgen is a Monte Carlo event generation application that is used by LHC experiments in the simulation of collisions that take place in the Large Hadron Collider. Finally, this paper details the process by which Alpgen was adapted from a single-processor serial-application to a large-scale parallel-application andmore » the performance that was achieved.« less

  2. Concurrent Breakpoints

    DTIC Science & Technology

    2011-12-18

    Proceedings of the SIGMET- RICS Symposium on Parallel and Distributed Tools, pages 48–59, 1998. [8] A. Dinning and E. Schonberg . Detecting access...multi- threaded programs. ACM Trans. Comput. Syst., 15(4):391– 411, 1997. [38] E. Schonberg . On-the-fly detection of access anomalies. In Proceedings

  3. Riparian vegetation controls on channels formed in non-cohesive sediment

    NASA Astrophysics Data System (ADS)

    Gran, K.; Tal, M.; Paola, C.

    2002-05-01

    Riparian vegetation can significantly influence the morphology of a river, affecting channel geometry and flow dynamics. In channels formed in non-cohesive material, vegetation is the main source of bank cohesion and could affect the overall behavior of the river, potentially constraining the flow from a multi-thread channel to a single-thread channel. To examine the effects of riparian vegetation on streams formed in non-cohesive material, we conducted a series of physical experiments at the St. Anthony Falls Laboratory. The first set of experiments examines the effects of varying densities of vegetation on braided stream dynamics. Water discharge, sediment discharge, and grain size were held constant. For each run, we allowed a braided system to develop, then halved the discharge, and seeded the flume with alfalfa (Medicago sativa). After ten to fourteen days of growth, we returned the discharge to its original value and continued the run for 30-36 hours. Our results show that the influence of vegetation on the overall river pattern varied systematically with the spatial density of plant stems. The vegetation reduced the number of active channels and increased bank stability, leading to lower lateral migration rates, narrower and deeper channels, and an increase in channel relief. All these effects increased with vegetation density. Vegetation also influenced flow dynamics, increasing the variance of flow direction in the vegetated runs, and increasing scour depths through strong downwelling where the flow collided with relatively resistant banks. This oblique bank collision provides a new mechanism for producing secondary flows. We found these bank collision driven secondary flows to be more important than the classical curvature-driven mechanism in the vegetated runs. The next set of experiments examines more closely how the channel pattern evolves through time, allowing for both channel migration and successive vegetation growth. In these on-going experiments, vegetation is reseeded following repeat high flow events, simulating the natural process of vegetation encroachment on the floodplain and channel.

  4. On the Performance of an Algebraic MultigridSolver on Multicore Clusters

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Baker, A H; Schulz, M; Yang, U M

    2010-04-29

    Algebraic multigrid (AMG) solvers have proven to be extremely efficient on distributed-memory architectures. However, when executed on modern multicore cluster architectures, we face new challenges that can significantly harm AMG's performance. We discuss our experiences on such an architecture and present a set of techniques that help users to overcome the associated problems, including thread and process pinning and correct memory associations. We have implemented most of the techniques in a MultiCore SUPport library (MCSup), which helps to map OpenMP applications to multicore machines. We present results using both an MPI-only and a hybrid MPI/OpenMP model.

  5. Architecture of the software for LAMOST fiber positioning subsystem

    NASA Astrophysics Data System (ADS)

    Peng, Xiaobo; Xing, Xiaozheng; Hu, Hongzhuan; Zhai, Chao; Li, Weimin

    2004-09-01

    The architecture of the software which controls the LAMOST fiber positioning sub-system is described. The software is composed of two parts as follows: a main control program in a computer and a unit controller program in a MCS51 single chip microcomputer ROM. And the function of the software includes: Client/Server model establishment, observation planning, collision handling, data transmission, pulse generation, CCD control, image capture and processing, and data analysis etc. Particular attention is paid to the ways in which different parts of the software can communicate. Also software techniques for multi threads, SOCKET programming, Microsoft Windows message response, and serial communications are discussed.

  6. NEW VACUUM SOLAR TELESCOPE OBSERVATIONS OF A FLUX ROPE TRACKED BY A FILAMENT ACTIVATION

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Shuhong; Zhang, Jun; Liu, Zhong

    2014-04-01

    One main goal of the New Vacuum Solar Telescope (NVST) which is located at the Fuxian Solar Observatory is to image the Sun at high resolution. Based on the high spatial and temporal resolution NVST Hα data and combined with the simultaneous observations from the Solar Dynamics Observatory for the first time, we investigate a flux rope tracked by filament activation. The filament material is initially located at one end of the flux rope and fills in a section of the rope; the filament is then activated by magnetic field cancellation. The activated filament rises and flows along helical threads,more » tracking the twisted flux rope structure. The length of the flux rope is about 75 Mm, the average width of its individual threads is 1.11 Mm, and the estimated twist is 1π. The flux rope appears as a dark structure in Hα images, a partial dark and partial bright structure in 304 Å, and as a bright structure in 171 Å and 131 Å images. During this process, the overlying coronal loops are quite steady since the filament is confined within the flux rope and does not erupt successfully. It seems that, for the event in this study, the filament is located and confined within the flux rope threads, instead of being suspended in the dips of twisted magnetic flux.« less

  7. Cpu/gpu Computing for AN Implicit Multi-Block Compressible Navier-Stokes Solver on Heterogeneous Platform

    NASA Astrophysics Data System (ADS)

    Deng, Liang; Bai, Hanli; Wang, Fang; Xu, Qingxin

    2016-06-01

    CPU/GPU computing allows scientists to tremendously accelerate their numerical codes. In this paper, we port and optimize a double precision alternating direction implicit (ADI) solver for three-dimensional compressible Navier-Stokes equations from our in-house Computational Fluid Dynamics (CFD) software on heterogeneous platform. First, we implement a full GPU version of the ADI solver to remove a lot of redundant data transfers between CPU and GPU, and then design two fine-grain schemes, namely “one-thread-one-point” and “one-thread-one-line”, to maximize the performance. Second, we present a dual-level parallelization scheme using the CPU/GPU collaborative model to exploit the computational resources of both multi-core CPUs and many-core GPUs within the heterogeneous platform. Finally, considering the fact that memory on a single node becomes inadequate when the simulation size grows, we present a tri-level hybrid programming pattern MPI-OpenMP-CUDA that merges fine-grain parallelism using OpenMP and CUDA threads with coarse-grain parallelism using MPI for inter-node communication. We also propose a strategy to overlap the computation with communication using the advanced features of CUDA and MPI programming. We obtain speedups of 6.0 for the ADI solver on one Tesla M2050 GPU in contrast to two Xeon X5670 CPUs. Scalability tests show that our implementation can offer significant performance improvement on heterogeneous platform.

  8. The role of feedback mechanisms in historic channel changes of the lower Rio Grande in the Big Bend region

    NASA Astrophysics Data System (ADS)

    Dean, David J.; Schmidt, John C.

    2011-03-01

    Over the last century, large-scale water development of the upper Rio Grande in the U.S. and Mexico, and of the Rio Conchos in Mexico, has resulted in progressive channel narrowing of the lower Rio Grande in the Big Bend region. We used methods operating at multiple spatial and temporal scales to analyze the rate, magnitude, and processes responsible for channel narrowing. These methods included: hydrologic analysis of historic stream gage data, analysis of notes of measured discharges, historic oblique and aerial photograph analysis, and stratigraphic and dendrogeomorphic analysis of inset floodplain deposits. Our analyses indicate that frequent large floods between 1900 and the mid-1940s acted as a negative feedback mechanism and maintained a wide, sandy, multi-threaded river. Declines in mean and peak flow in the mid-1940s resulted in progressive channel narrowing. Channel narrowing has been temporarily interrupted by occasional large floods that widened the channel, however, channel narrowing has always resumed. After large floods in 1990 and 1991, the active channel width of the lower Rio Grande has narrowed by 36-52%. Narrowing has occurred by the vertical accretion of fine-grained deposits on top of sand and gravel bars, inset within natural levees. Channel narrowing by vertical accretion occurred simultaneously with a rapid invasion of non-native riparian vegetation ( Tamarix spp., Arundo donax) which created a positive feedback and exacerbated the processes of channel narrowing and vertical accretion. In two floodplain trenches, we measured 2.75 and 3.5 m of vertical accretion between 1993 and 2008. In some localities, nearly 90% of bare, active channel bars were converted to vegetated floodplain during the same period. Upward shifts of stage-discharge relations occurred resulting in over-bank flooding at lower discharges, and continued vertical accretion despite a progressive reduction in stream flow. Thus, although the magnitude of the average annual flood was reduced between 40 and 50%, over-bank flooding continued. These changes reflect a shift in the geomorphic nature of the Rio Grande from a wide, laterally unstable, multi-thread river, to a laterally stable, single-thread channel with cohesive, vertical banks, and few active in-channel bars.

  9. Co Modeling and Co Synthesis of Safety Critical Multi threaded Embedded Software for Multi Core Embedded Platforms

    DTIC Science & Technology

    2017-03-20

    computation, Prime Implicates, Boolean Abstraction, real- time embedded software, software synthesis, correct by construction software design , model...types for time -dependent data-flow networks". J.-P. Talpin, P. Jouvelot, S. Shukla. ACM-IEEE Conference on Methods and Models for System Design ...information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing   data sources, gathering and

  10. 46 CFR 56.70-15 - Procedure.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... shall be prepared for each process and welding position to be employed in the fabrication. (1) Girth...) processes. Classes I, I-L, and II-L piping are required to have the inside of the pipe machined for good fit...). (1) Where seal welding of threaded joints is performed, threads shall be entirely covered by the seal...

  11. 46 CFR 56.70-15 - Procedure.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... shall be prepared for each process and welding position to be employed in the fabrication. (1) Girth...) processes. Classes I, I-L, and II-L piping are required to have the inside of the pipe machined for good fit...). (1) Where seal welding of threaded joints is performed, threads shall be entirely covered by the seal...

  12. 46 CFR 56.70-15 - Procedure.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... shall be prepared for each process and welding position to be employed in the fabrication. (1) Girth...) processes. Classes I, I-L, and II-L piping are required to have the inside of the pipe machined for good fit...). (1) Where seal welding of threaded joints is performed, threads shall be entirely covered by the seal...

  13. 46 CFR 56.70-15 - Procedure.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... shall be prepared for each process and welding position to be employed in the fabrication. (1) Girth...) processes. Classes I, I-L, and II-L piping are required to have the inside of the pipe machined for good fit...). (1) Where seal welding of threaded joints is performed, threads shall be entirely covered by the seal...

  14. 46 CFR 56.70-15 - Procedure.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... shall be prepared for each process and welding position to be employed in the fabrication. (1) Girth...) processes. Classes I, I-L, and II-L piping are required to have the inside of the pipe machined for good fit...). (1) Where seal welding of threaded joints is performed, threads shall be entirely covered by the seal...

  15. Racist Facebook Event against Native Americans: Preservice Teachers Explore Ethical and Critical Multicultural Implications

    ERIC Educational Resources Information Center

    Aragon, Antonette; Kaminski, Karen

    2012-01-01

    This exploratory case study sought to analyze data from Collaborative Learning Modalities (Brantmeier, Aragon & Folkestad, 2011) on-line threaded discussions in a teacher education course where pre-service teachers examined the nature of a Facebook event revealing unethical and racist notions against Native Americans. In 2010 a university…

  16. Modeling self-organization of novel organic materials

    NASA Astrophysics Data System (ADS)

    Sayar, Mehmet

    In this thesis, the structural organization of oligomeric multi-block molecules is analyzed by computational analysis of coarse-grained models. These molecules form nanostructures with different dimensionalities, and the nanostructured nature of these materials leads to novel structural properties at different length scales. Previously, a number of oligomeric triblock rodcoil molecules have been shown to self-organize into mushroom shaped noncentrosymmetric nanostructures. Interestingly, thin films of these molecules contain polar domains and a finite macroscopic polarization. However, the fully polarized state is not the equilibrium state. In the first chapter, by solving a model with dipolar and Ising-like short range interactions, we show that polar domains are stable in films composed of aggregates as opposed to isolated molecules. Unlike classical molecular systems, these nanoaggregates have large intralayer spacings (a ≈ 6 nm), leading to a reduction in the repulsive dipolar interactions that oppose polar order within layers. This enables the formation of a striped pattern with polar domains of alternating directions. The energies of the possible structures at zero temperature are computed exactly and results of Monte Carlo simulations are provided at non-zero temperatures. In the second chapter, the macroscopic polarization of such nanostructured films is analyzed in the presence of a short range surface interaction. The surface interaction leads to a periodic domain structure where the balance between the up and down domains is broken, and therefore films of finite thickness have a net macroscopic polarization. The polarization per unit volume is a function of film thickness and strength of the surface interaction. Finally, in chapter three, self-organization of organic molecules into a network of one dimensional objects is analyzed. Multi-block organic dendron rodcoil molecules were found to self-organize into supramolecular nanoribbons (threads) and form gels at very low concentrations. Here, the formation and structural properties of these networks are studied with Monte Carlo simulations. The model gelators can form intra and inter-thread bonds, and the threads have a finite stiffness. The results suggest that the high persistence length is a result of the interplay of thread stiffness and inter-thread interactions. Furthermore, this high persistence length enables the formation of networks at low concentrations.

  17. Real-time inextensible surgical thread simulation.

    PubMed

    Xu, Lang; Liu, Qian

    2018-03-27

    This paper discusses a real-time simulation method of inextensible surgical thread based on the Cosserat rod theory using position-based dynamics (PBD). The method realizes stable twining and knotting of surgical thread while including inextensibility, bending, twisting and coupling effects. The Cosserat rod theory is used to model the nonlinear elastic behavior of surgical thread. The surgical thread model is solved with PBD to achieve a real-time, extremely stable simulation. Due to the one-dimensional linear structure of surgical thread, the direct solution of the distance constraint based on tridiagonal matrix algorithm is used to enhance stretching resistance in every constraint projection iteration. In addition, continuous collision detection and collision response guarantee a large time step and high performance. Furthermore, friction is integrated into the constraint projection process to stabilize the twining of multiple threads and complex contact situations. Through comparisons with existing methods, the surgical thread maintains constant length under large deformation after applying the direct distance constraint in our method. The twining and knotting of multiple threads correspond to stable solutions to contact and friction forces. A surgical suture scene is also modeled to demonstrate the practicality and simplicity of our method. Our method achieves stable and fast simulation of inextensible surgical thread. Benefiting from the unified particle framework, the rigid body, elastic rod, and soft body can be simultaneously simulated. The method is appropriate for applications in virtual surgery that require multiple dynamic bodies.

  18. A generic flexible and robust approach for intelligent real-time video-surveillance systems

    NASA Astrophysics Data System (ADS)

    Desurmont, Xavier; Delaigle, Jean-Francois; Bastide, Arnaud; Macq, Benoit

    2004-05-01

    In this article we present a generic, flexible and robust approach for an intelligent real-time video-surveillance system. A previous version of the system was presented in [1]. The goal of these advanced tools is to provide help to operators by detecting events of interest in visual scenes and highlighting alarms and compute statistics. The proposed system is a multi-camera platform able to handle different standards of video inputs (composite, IP, IEEE1394 ) and which can basically compress (MPEG4), store and display them. This platform also integrates advanced video analysis tools, such as motion detection, segmentation, tracking and interpretation. The design of the architecture is optimised to playback, display, and process video flows in an efficient way for video-surveillance application. The implementation is distributed on a scalable computer cluster based on Linux and IP network. It relies on POSIX threads for multitasking scheduling. Data flows are transmitted between the different modules using multicast technology and under control of a TCP-based command network (e.g. for bandwidth occupation control). We report here some results and we show the potential use of such a flexible system in third generation video surveillance system. We illustrate the interest of the system in a real case study, which is the indoor surveillance.

  19. Testing the Digital Thread in Support of Model-Based Manufacturing and Inspection

    PubMed Central

    Hedberg, Thomas; Lubell, Joshua; Fischer, Lyle; Maggiano, Larry; Feeney, Allison Barnard

    2016-01-01

    A number of manufacturing companies have reported anecdotal evidence describing the benefits of Model-Based Enterprise (MBE). Based on this evidence, major players in industry have embraced a vision to deploy MBE. In our view, the best chance of realizing this vision is the creation of a single “digital thread.” Under MBE, there exists a Model-Based Definition (MBD), created by the Engineering function, that downstream functions reuse to complete Model-Based Manufacturing and Model-Based Inspection activities. The ensemble of data that enables the combination of model-based definition, manufacturing, and inspection defines this digital thread. Such a digital thread would enable real-time design and analysis, collaborative process-flow development, automated artifact creation, and full-process traceability in a seamless real-time collaborative development among project participants. This paper documents the strengths and weaknesses in the current, industry strategies for implementing MBE. It also identifies gaps in the transition and/or exchange of data between various manufacturing processes. Lastly, this paper presents measured results from a study of model-based processes compared to drawing-based processes and provides evidence to support the anecdotal evidence and vision made by industry. PMID:27325911

  20. Landslide: Systematic Dynamic Race Detection in Kernel Space

    DTIC Science & Technology

    2012-05-01

    schedule_in_flight← true; CAUSE_TIMER_INTERRUPT(); end if end function Thread Scheduling Finally, the Landslide scheduler is responsible for managing ...child process vanish() simultaneously. • double_wait: Tests interactions of multiple waiters on a single child. • double_thread_fork: Tests for...conditions using Landslide. We describe them here. • Too many waiters allowed. Using the double_wait test case, Group 1 found a bug in which more threads

  1. Asynchronous Message Service Reference Implementation

    NASA Technical Reports Server (NTRS)

    Burleigh, Scott C.

    2011-01-01

    This software provides a library of middleware functions with a simple application programming interface, enabling implementation of distributed applications in conformance with the CCSDS AMS (Consultative Committee for Space Data Systems Asynchronous Message Service) specification. The AMS service, and its protocols, implement an architectural concept under which the modules of mission systems may be designed as if they were to operate in isolation, each one producing and consuming mission information without explicit awareness of which other modules are currently operating. Communication relationships among such modules are self-configuring; this tends to minimize complexity in the development and operations of modular data systems. A system built on this model is a society of generally autonomous, inter-operating modules that may fluctuate freely over time in response to changing mission objectives, modules functional upgrades, and recovery from individual module failure. The purpose of AMS, then, is to reduce mission cost and risk by providing standard, reusable infrastructure for the exchange of information among data system modules in a manner that is simple to use, highly automated, flexible, robust, scalable, and efficient. The implementation is designed to spawn multiple threads of AMS functionality under the control of an AMS application program. These threads enable all members of an AMS-based, distributed application to discover one another in real time, subscribe to messages on specific topics, and to publish messages on specific topics. The query/reply (client/server) communication model is also supported. Message exchange is optionally subject to encryption (to support confidentiality) and authorization. Fault tolerance measures in the discovery protocol minimize the likelihood of overall application failure due to any single operational error anywhere in the system. The multi-threaded design simplifies processing while enabling application nodes to operate at high speeds; linked lists protected by mutex semaphores and condition variables are used for efficient, inter-thread communication. Applications may use a variety of transport protocols underlying AMS itself, including TCP (Transmission Control Protocol), UDP (User Datagram Protocol), and message queues.

  2. Running ATLAS workloads within massively parallel distributed applications using Athena Multi-Process framework (AthenaMP)

    NASA Astrophysics Data System (ADS)

    Calafiura, Paolo; Leggett, Charles; Seuster, Rolf; Tsulaia, Vakhtang; Van Gemmeren, Peter

    2015-12-01

    AthenaMP is a multi-process version of the ATLAS reconstruction, simulation and data analysis framework Athena. By leveraging Linux fork and copy-on-write mechanisms, it allows for sharing of memory pages between event processors running on the same compute node with little to no change in the application code. Originally targeted to optimize the memory footprint of reconstruction jobs, AthenaMP has demonstrated that it can reduce the memory usage of certain configurations of ATLAS production jobs by a factor of 2. AthenaMP has also evolved to become the parallel event-processing core of the recently developed ATLAS infrastructure for fine-grained event processing (Event Service) which allows the running of AthenaMP inside massively parallel distributed applications on hundreds of compute nodes simultaneously. We present the architecture of AthenaMP, various strategies implemented by AthenaMP for scheduling workload to worker processes (for example: Shared Event Queue and Shared Distributor of Event Tokens) and the usage of AthenaMP in the diversity of ATLAS event processing workloads on various computing resources: Grid, opportunistic resources and HPC.

  3. LONG DURATION FLARE EMISSION: IMPULSIVE HEATING OR GRADUAL HEATING?

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Qiu, Jiong; Longcope, Dana W.

    Flare emissions in X-ray and EUV wavelengths have previously been modeled as the plasma response to impulsive heating from magnetic reconnection. Some flares exhibit gradually evolving X-ray and EUV light curves, which are believed to result from superposition of an extended sequence of impulsive heating events occurring in different adjacent loops or even unresolved threads within each loop. In this paper, we apply this approach to a long duration two-ribbon flare SOL2011-09-13T22 observed by the Atmosphere Imaging Assembly (AIA). We find that to reconcile with observed signatures of flare emission in multiple EUV wavelengths, each thread should be heated inmore » two phases, an intense impulsive heating followed by a gradual, low-rate heating tail that is attenuated over 20–30 minutes. Each AIA resolved single loop may be composed of several such threads. The two-phase heating scenario is supported by modeling with both a zero-dimensional and a 1D hydrodynamic code. We discuss viable physical mechanisms for the two-phase heating in a post-reconnection thread.« less

  4. Falcon: a highly flexible open-source software for closed-loop neuroscience.

    PubMed

    Ciliberti, Davide; Kloosterman, Fabian

    2017-08-01

    Closed-loop experiments provide unique insights into brain dynamics and function. To facilitate a wide range of closed-loop experiments, we created an open-source software platform that enables high-performance real-time processing of streaming experimental data. We wrote Falcon, a C++ multi-threaded software in which the user can load and execute an arbitrary processing graph. Each node of a Falcon graph is mapped to a single thread and nodes communicate with each other through thread-safe buffers. The framework allows for easy implementation of new processing nodes and data types. Falcon was tested both on a 32-core and a 4-core workstation. Streaming data was read from either a commercial acquisition system (Neuralynx) or the open-source Open Ephys hardware, while closed-loop TTL pulses were generated with a USB module for digital output. We characterized the round-trip latency of our Falcon-based closed-loop system, as well as the specific latency contribution of the software architecture, by testing processing graphs with up to 32 parallel pipelines and eight serial stages. We finally deployed Falcon in a task of real-time detection of population bursts recorded live from the hippocampus of a freely moving rat. On Neuralynx hardware, round-trip latency was well below 1 ms and stable for at least 1 h, while on Open Ephys hardware latencies were below 15 ms. The latency contribution of the software was below 0.5 ms. Round-trip and software latencies were similar on both 32- and 4-core workstations. Falcon was used successfully to detect population bursts online with ~40 ms average latency. Falcon is a novel open-source software for closed-loop neuroscience. It has sub-millisecond intrinsic latency and gives the experimenter direct control of CPU resources. We envisage Falcon to be a useful tool to the neuroscientific community for implementing a wide variety of closed-loop experiments, including those requiring use of complex data structures and real-time execution of computationally intensive algorithms, such as population neural decoding/encoding from large cell assemblies.

  5. Falcon: a highly flexible open-source software for closed-loop neuroscience

    NASA Astrophysics Data System (ADS)

    Ciliberti, Davide; Kloosterman, Fabian

    2017-08-01

    Objective. Closed-loop experiments provide unique insights into brain dynamics and function. To facilitate a wide range of closed-loop experiments, we created an open-source software platform that enables high-performance real-time processing of streaming experimental data. Approach. We wrote Falcon, a C++ multi-threaded software in which the user can load and execute an arbitrary processing graph. Each node of a Falcon graph is mapped to a single thread and nodes communicate with each other through thread-safe buffers. The framework allows for easy implementation of new processing nodes and data types. Falcon was tested both on a 32-core and a 4-core workstation. Streaming data was read from either a commercial acquisition system (Neuralynx) or the open-source Open Ephys hardware, while closed-loop TTL pulses were generated with a USB module for digital output. We characterized the round-trip latency of our Falcon-based closed-loop system, as well as the specific latency contribution of the software architecture, by testing processing graphs with up to 32 parallel pipelines and eight serial stages. We finally deployed Falcon in a task of real-time detection of population bursts recorded live from the hippocampus of a freely moving rat. Main results. On Neuralynx hardware, round-trip latency was well below 1 ms and stable for at least 1 h, while on Open Ephys hardware latencies were below 15 ms. The latency contribution of the software was below 0.5 ms. Round-trip and software latencies were similar on both 32- and 4-core workstations. Falcon was used successfully to detect population bursts online with ~40 ms average latency. Significance. Falcon is a novel open-source software for closed-loop neuroscience. It has sub-millisecond intrinsic latency and gives the experimenter direct control of CPU resources. We envisage Falcon to be a useful tool to the neuroscientific community for implementing a wide variety of closed-loop experiments, including those requiring use of complex data structures and real-time execution of computationally intensive algorithms, such as population neural decoding/encoding from large cell assemblies.

  6. Contributions to systemic analysis for worm screw production using thread whirling devices

    NASA Astrophysics Data System (ADS)

    Cretu, G.

    2017-08-01

    The paper aims to achieve a systemic analysis of worms processing using whirling threaded devices with highlighting all the factors involved in this system. It will also carry out an analysis of these factors depending on specific conditions such machining. Are also presented the stages of experimentation program and ways of processing for data obtained.

  7. The signal extraction of fetal heart rate based on wavelet transform and BP neural network

    NASA Astrophysics Data System (ADS)

    Yang, Xiao Hong; Zhang, Bang-Cheng; Fu, Hu Dai

    2005-04-01

    This paper briefly introduces the collection and recognition of bio-medical signals, designs the method to collect FM signals. A detailed discussion on the system hardware, structure and functions is also given. Under LabWindows/CVI,the hardware and the driver do compatible, the hardware equipment work properly actively. The paper adopts multi threading technology for real-time analysis and makes use of latency time of CPU effectively, expedites program reflect speed, improves the program to perform efficiency. One threading is collecting data; the other threading is analyzing data. Using the method, it is broaden to analyze the signal in real-time. Wavelet transform to remove the main interference in the FM and by adding time-window to recognize with BP network; Finally the results of collecting signals and BP networks are discussed. 8 pregnant women's signals of FM were collected successfully by using the sensor. The correctness rate of BP network recognition is about 83.3% by using the above measure.

  8. Fatigue acceptance test limit criterion for larger diameter rolled thread fasteners

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kephart, A.R.

    1997-05-01

    This document describes a fatigue lifetime acceptance test criterion by which studs having rolled threads, larger than 1.0 inches in diameter, can be assured to meet minimum quality attributes associated with a controlled rolling process. This criterion is derived from a stress dependent, room temperature air fatigue database for test studs having a 0.625 inch diameter threads of Alloys X-750 HTH and direct aged 625. Anticipated fatigue lives of larger threads are based on thread root elastic stress concentration factors which increase with increasing thread diameters. Over the thread size range of interest, a 30% increase in notch stress ismore » equivalent to a factor of five (5X) reduction in fatigue life. The resulting diameter dependent fatigue acceptance criterion is normalized to the aerospace rolled thread acceptance standards for a 1.0 inch diameter, 0.125 inch pitch, Unified National thread with a controlled Root radius (UNR). Testing was conducted at a stress of 50% of the minimum specified material ultimate strength, 80 Ksi, and at a stress ratio (R) of 0.10. Limited test data for fastener diameters of 1.00 to 2.25 inches are compared to the acceptance criterion. Sensitivity of fatigue life of threads to test nut geometry variables was also shown to be dependent on notch stress conditions. Bearing surface concavity of the compression nuts and thread flank contact mismatch conditions can significantly affect the fastener fatigue life. Without improved controls these conditions could potentially provide misleading acceptance data. Alternate test nut geometry features are described and implemented in the rolled thread stud specification, MIL-DTL-24789(SH), to mitigate the potential effects on fatigue acceptance data.« less

  9. Function and evolution of a Lotus japonicus AP2/ERF family transcription factor that is required for development of infection threads

    PubMed Central

    Yano, Koji; Aoki, Seishiro; Liu, Meng; Umehara, Yosuke; Suganuma, Norio; Iwasaki, Wataru; Sato, Shusei; Soyano, Takashi; Kouchi, Hiroshi

    2017-01-01

    Abstract Legume-rhizobium symbiosis is achieved by two major events evolutionarily acquired: root hair infection and organogenesis. Infection thread (IT) development is a distinct element for rhizobial infection. Through ITs, rhizobia are efficiently transported from infection foci on root hairs to dividing meristematic cortical cells. To unveil this process, we performed genetic screening using Lotus japonicus MG-20 and isolated symbiotic mutant lines affecting nodulation, root hair morphology, and IT development. Map-based cloning identified an AP2/ERF transcription factor gene orthologous to Medicago truncatula ERN1. LjERN1 was activated in response to rhizobial infection and depended on CYCLOPS and NSP2. Legumes conserve an ERN1 homolog, ERN2, that functions redundantly with ERN1 in M. truncatula. Phylogenetic analysis showed that the lineages of ERN1 and ERN2 genes originated from a gene duplication event in the common ancestor of legume plants. However, genomic analysis suggested the lack of ERN2 gene in the L. japonicus genome, consistent with Ljern1 mutants exhibited a root hair phenotype that is observed in ern1/ern2 double mutants in M. truncatula. Molecular evolutionary analysis suggested that the nonsynonymous/synonymous rate ratios of legume ERN1 genes was almost identical to that of non-legume plants, whereas the ERN2 genes experienced a relaxed selective constraint. PMID:28028038

  10. Pausing and activating thread state upon pin assertion by external logic monitoring polling loop exit time condition

    DOEpatents

    Chen, Dong; Giampapa, Mark; Heidelberger, Philip; Ohmacht, Martin; Satterfield, David L; Steinmacher-Burow, Burkhard; Sugavanam, Krishnan

    2013-05-21

    A system and method for enhancing performance of a computer which includes a computer system including a data storage device. The computer system includes a program stored in the data storage device and steps of the program are executed by a processer. The processor processes instructions from the program. A wait state in the processor waits for receiving specified data. A thread in the processor has a pause state wherein the processor waits for specified data. A pin in the processor initiates a return to an active state from the pause state for the thread. A logic circuit is external to the processor, and the logic circuit is configured to detect a specified condition. The pin initiates a return to the active state of the thread when the specified condition is detected using the logic circuit.

  11. The Digital Thread as the Key Enabler

    DTIC Science & Technology

    2016-11-01

    17 Defense AT&L: November-December 2016 The Digital Thread as the Key Enabler Col. Keith Bearden, USAF Bearden is the deputy director of...enabling you to do your job better, faster and cheaper. There is one initiative, the key enabler, to accomplish this goal—the digital thread . But let’s... process that would allow for rapid cross- domain analysis and technology transition prior to bending metal. • Re-establish a culture of “hands-on

  12. SISSY: An example of a multi-threaded, networked, object-oriented databased application

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Scipioni, B.; Liu, D.; Song, T.

    1993-05-01

    The Systems Integration Support SYstem (SISSY) is presented and its capabilities and techniques are discussed. It is fully automated data collection and analysis system supporting the SSCL`s systems analysis activities as they relate to the Physics Detector and Simulation Facility (PDSF). SISSY itself is a paradigm of effective computing on the PDSF. It uses home-grown code (C++), network programming (RPC, SNMP), relational (SYBASE) and object-oriented (ObjectStore) DBMSs, UNIX operating system services (IRIX threads, cron, system utilities, shells scripts, etc.), and third party software applications (NetCentral Station, Wingz, DataLink) all of which act together as a single application to monitor andmore » analyze the PDSF.« less

  13. Vectorization for Molecular Dynamics on Intel Xeon Phi Corpocessors

    NASA Astrophysics Data System (ADS)

    Yi, Hongsuk

    2014-03-01

    Many modern processors are capable of exploiting data-level parallelism through the use of single instruction multiple data (SIMD) execution. The new Intel Xeon Phi coprocessor supports 512 bit vector registers for the high performance computing. In this paper, we have developed a hierarchical parallelization scheme for accelerated molecular dynamics simulations with the Terfoff potentials for covalent bond solid crystals on Intel Xeon Phi coprocessor systems. The scheme exploits multi-level parallelism computing. We combine thread-level parallelism using a tightly coupled thread-level and task-level parallelism with 512-bit vector register. The simulation results show that the parallel performance of SIMD implementations on Xeon Phi is apparently superior to their x86 CPU architecture.

  14. Study of Measurement Strategies of Geometric Deviation of the Position of the Threaded Holes

    NASA Astrophysics Data System (ADS)

    Drbul, Mário; Martikan, Pavol; Sajgalik, Michal; Czan, Andrej; Broncek, Jozef; Babik, Ondrej

    2017-12-01

    Verification of product and quality control is an integral part of current production process. In terms of functional requirements and product interoperability, it is necessary to analyze their dimensional and also geometric specifications. Threaded holes are verified elements too, which are a substantial part of detachable screw connections and have a broad presence in engineering products. This paper deals with on the analysing of measurement strategies of verification geometric deviation of the position of the threaded holes, which are the indirect method of measuring threaded pins when applying different measurement strategies which can affect the result of the verification of the product..

  15. A Tool for Intersecting Context-Free Grammars and Its Applications

    NASA Technical Reports Server (NTRS)

    Gange, Graeme; Navas, Jorge A.; Schachte, Peter; Sondergaard, Harald; Stuckey, Peter J.

    2015-01-01

    This paper describes a tool for intersecting context-free grammars. Since this problem is undecidable the tool follows a refinement-based approach and implements a novel refinement which is complete for regularly separable grammars. We show its effectiveness for safety verification of recursive multi-threaded programs.

  16. Challenges in scaling NLO generators to leadership computers

    NASA Astrophysics Data System (ADS)

    Benjamin, D.; Childers, JT; Hoeche, S.; LeCompte, T.; Uram, T.

    2017-10-01

    Exascale computing resources are roughly a decade away and will be capable of 100 times more computing than current supercomputers. In the last year, Energy Frontier experiments crossed a milestone of 100 million core-hours used at the Argonne Leadership Computing Facility, Oak Ridge Leadership Computing Facility, and NERSC. The Fortran-based leading-order parton generator called Alpgen was successfully scaled to millions of threads to achieve this level of usage on Mira. Sherpa and MadGraph are next-to-leading order generators used heavily by LHC experiments for simulation. Integration times for high-multiplicity or rare processes can take a week or more on standard Grid machines, even using all 16-cores. We will describe our ongoing work to scale the Sherpa generator to thousands of threads on leadership-class machines and reduce run-times to less than a day. This work allows the experiments to leverage large-scale parallel supercomputers for event generation today, freeing tens of millions of grid hours for other work, and paving the way for future applications (simulation, reconstruction) on these and future supercomputers.

  17. Modern multicore and manycore architectures: Modelling, optimisation and benchmarking a multiblock CFD code

    NASA Astrophysics Data System (ADS)

    Hadade, Ioan; di Mare, Luca

    2016-08-01

    Modern multicore and manycore processors exhibit multiple levels of parallelism through a wide range of architectural features such as SIMD for data parallel execution or threads for core parallelism. The exploitation of multi-level parallelism is therefore crucial for achieving superior performance on current and future processors. This paper presents the performance tuning of a multiblock CFD solver on Intel SandyBridge and Haswell multicore CPUs and the Intel Xeon Phi Knights Corner coprocessor. Code optimisations have been applied on two computational kernels exhibiting different computational patterns: the update of flow variables and the evaluation of the Roe numerical fluxes. We discuss at great length the code transformations required for achieving efficient SIMD computations for both kernels across the selected devices including SIMD shuffles and transpositions for flux stencil computations and global memory transformations. Core parallelism is expressed through threading based on a number of domain decomposition techniques together with optimisations pertaining to alleviating NUMA effects found in multi-socket compute nodes. Results are correlated with the Roofline performance model in order to assert their efficiency for each distinct architecture. We report significant speedups for single thread execution across both kernels: 2-5X on the multicore CPUs and 14-23X on the Xeon Phi coprocessor. Computations at full node and chip concurrency deliver a factor of three speedup on the multicore processors and up to 24X on the Xeon Phi manycore coprocessor.

  18. Spider-web inspired multi-resolution graphene tactile sensor.

    PubMed

    Liu, Lu; Huang, Yu; Li, Fengyu; Ma, Ying; Li, Wenbo; Su, Meng; Qian, Xin; Ren, Wanjie; Tang, Kanglai; Song, Yanlin

    2018-05-08

    Multi-dimensional accurate response and smooth signal transmission are critical challenges in the advancement of multi-resolution recognition and complex environment analysis. Inspired by the structure-activity relationship between discrepant microstructures of the spiral and radial threads in a spider web, we designed and printed graphene with porous and densely-packed microstructures to integrate into a multi-resolution graphene tactile sensor. The three-dimensional (3D) porous graphene structure performs multi-dimensional deformation responses. The laminar densely-packed graphene structure contributes excellent conductivity with flexible stability. The spider-web inspired printed pattern inherits orientational and locational kinesis tracking. The multi-structure construction with homo-graphene material can integrate discrepant electronic properties with remarkable flexibility, which will attract enormous attention for electronic skin, wearable devices and human-machine interactions.

  19. Visualization design and verification of Ada tasking using timing diagrams

    NASA Technical Reports Server (NTRS)

    Vidale, R. F.; Szulewski, P. A.; Weiss, J. B.

    1986-01-01

    The use of timing diagrams is recommended in the design and testing of multi-task Ada programs. By displaying the task states vs. time, timing diagrams can portray the simultaneous threads of data flow and control which characterize tasking programs. This description of the system's dynamic behavior from conception to testing is a necessary adjunct to other graphical techniques, such as structure charts, which essentially give a static view of the system. A series of steps is recommended which incorporates timing diagrams into the design process. Finally, a description is provided of a prototype Ada Execution Analyzer (AEA) which automates the production of timing diagrams from VAX/Ada debugger output.

  20. Crafting threads of diblock copolymer micelles via flow-enabled self-assembly.

    PubMed

    Li, Bo; Han, Wei; Jiang, Beibei; Lin, Zhiqun

    2014-03-25

    Hierarchically assembled amphiphilic diblock copolymer micelles were exquisitely crafted over large areas by capitalizing on two concurrent self-assembling processes at different length scales, namely, the periodic threads composed of a monolayer or a bilayer of diblock copolymer micelles precisely positioned by flow-enabled self-assembly (FESA) on the microscopic scale and the self-assembly of amphiphilic diblock copolymer micelles into ordered arrays within an individual thread on the nanometer scale. A minimum spacing between two adjacent threads λmin was observed. A model was proposed to rationalize the relationship between the thread width and λmin. Such FESA of diblock copolymer micelles is remarkably controllable and easy to implement. It opens up possibilities for lithography-free positioning and patterning of diblock copolymer micelles for various applications in template fabrication of periodic inorganic nanostructures, nanoelectronics, optoelectronics, magnetic devices, and biotechnology.

  1. Inertia Compensation While Scanning Screw Threads on Coordinate Measuring Machines

    NASA Astrophysics Data System (ADS)

    Kosarevsky, Sergey; Latypov, Viktor

    2010-01-01

    Usage of scanning coordinate-measuring machines for inspection of screw threads has become a common practice nowadays. Compared to touch trigger probing, scanning capabilities allow to speed up the measuring process while still maintaining high accuracy. However, in some cases accuracy drastically depends on the scanning speed. In this paper a compensation method is proposed allowing to reduce the influence of inertia of the probing system while scanning screw threads on coordinate-measuring machines.

  2. Successive Two-sided Loop Jets Caused by Magnetic Reconnection between Two Adjacent Filamentary Threads

    NASA Astrophysics Data System (ADS)

    Tian, Zhanjun; Liu, Yu; Shen, Yuandeng; Elmhamdi, Abouazza; Su, Jiangtao; Liu, Ying D.; Kordi, Ayman. S.

    2017-08-01

    We present observational analysis of two successive two-sided loop jets observed by the ground-based New Vacuum Solar Telescope and the space-borne Solar Dynamics Observatory. The two successive two-sided loop jets manifested similar evolution processes and both were associated with the interaction of two small-scale adjacent filamentary threads, magnetic emerging, and cancellation processes at the jet’s source region. High temporal and high spatial resolution observations reveal that the two adjacent ends of the two filamentary threads are rooted in opposite magnetic polarities within the source region. The two threads approached each other, and then an obvious brightening patch is observed at the interaction position. Subsequently, a pair of hot plasma ejections are observed heading in opposite directions along the paths of the two filamentary threads at a typical speed for two-sided loop jets of the order 150 km s-1. Close to the end of the second jet, we report the formation of a bright hot loop structure at the source region, which suggests the formation of new loops during the interaction. Based on the observational results, we propose that the observed two-sided loop jets are caused by magnetic reconnection between the two adjacent filamentary threads, largely different from the previous scenario that a two-sided loop jet is generated by magnetic reconnection between an emerging bipole and the overlying horizontal magnetic fields.

  3. 75 FR 25839 - Foreign-Trade Zone 26 Atlanta, Georgia, Application for Subzone, Yates Bleachery Company (Textile...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-05-10

    ...), high thread count (180 threads per inch and higher) fabrics under FTZ procedures based on a tolling... process any other customer- owned fabric under FTZ procedures. Subzone status would allow for deferral of...

  4. Effect of Pin Geometry on the Mechanical Strength of Friction-Stir-Welded Polypropylene Composite Plates

    NASA Astrophysics Data System (ADS)

    Kordestani, F.; Ashenai Ghasemi, F.; Arab, N. B. M.

    2017-09-01

    Friction stir welding (FSW) is a solid-state welding process, which has successfully been applied in aerospace and automotive industries for joining materials. The friction stir tool is the key element in the FSW process. In this study, the effect of four different tool pin geometries on the mechanical properties of two types of polypropylene composite plates, with 30% glass and carbon fiber, respectively, were investigated. For this purpose, four pins of different geometry, namely, a threaded-tapered pin, square pin, four-flute threaded pin, and threaded-tapered pin with a chamfer were made and used to carry out the butt welding of 5-mm-thick plates. The standard tensile and Izod impact tests were performed to evaluate the tensile strength and impact toughness of welded specimens. The results indicated that the threaded-tapered pin with a chamfer produced welds with a better surface appearance and higher tensile and impact strengths. The tests also showed that, with the threaded-tapered pin with a chamfer, the impact strength of the glass- and carbon-fiber composite welds were about 40 and 50%, respectively, of that of the base materials.

  5. Improvement and speed optimization of numerical tsunami modelling program using OpenMP technology

    NASA Astrophysics Data System (ADS)

    Chernov, A.; Zaytsev, A.; Yalciner, A.; Kurkin, A.

    2009-04-01

    Currently, the basic problem of tsunami modeling is low speed of calculations which is unacceptable for services of the operative notification. Existing algorithms of numerical modeling of hydrodynamic processes of tsunami waves are developed without taking the opportunities of modern computer facilities. There is an opportunity to have considerable acceleration of process of calculations by using parallel algorithms. We discuss here new approach to parallelization tsunami modeling code using OpenMP Technology (for multiprocessing systems with the general memory). Nowadays, multiprocessing systems are easily accessible for everyone. The cost of the use of such systems becomes much lower comparing to the costs of clusters. This opportunity also benefits all programmers to apply multithreading algorithms on desktop computers of researchers. Other important advantage of the given approach is the mechanism of the general memory - there is no necessity to send data on slow networks (for example Ethernet). All memory is the common for all computing processes; it causes almost linear scalability of the program and processes. In the new version of NAMI DANCE using OpenMP technology and multi-threading algorithm provide 80% gain in speed in comparison with the one-thread version for dual-processor unit. The speed increased and 320% gain was attained for four core processor unit of PCs. Thus, it was possible to reduce considerably time of performance of calculations on the scientific workstations (desktops) without complete change of the program and user interfaces. The further modernization of algorithms of preparation of initial data and processing of results using OpenMP looks reasonable. The final version of NAMI DANCE with the increased computational speed can be used not only for research purposes but also in real time Tsunami Warning Systems.

  6. [Mechanical behaviour of three types of surgical knots using 4/0 monofilament].

    PubMed

    Gil Santos, Luis; Más-Estellés, Jorge; Salmerón Sánchez, Manuel; Barrios, Carlos

    2012-01-01

    To experimentally study the behaviour of 4 types of monofilament with 3 knotting techniques, very often used in surgery, employing mechanical tests. Four 4/0 monofilaments were chosen, two of nylon, and the other two of polypropylene. Three types of knot designs were made with each thread. The first design (D-S-S) consisted of a double half-knot (D) followed by one single (S) in the opposite direction, a third S in turn in the opposite direction. The configuration of the second design was D-S-D, and the third, S-S-D. A mechanical fracture test was performed by stretching the thread at a rate of 4N/s, the force and deformation being recorded at intervals of 100 ms. There was a decrease in the force and deformation in the tests on threads with knots compared to threads without knots. In all cases the rupture of the thread occurred in the knot area, presumably due to damage caused to the thread during the knotting process. The D-S-D knot had the greatest resistance with polypropylene threads, and S-S-D provided the greatest resistance with nylon threads Polypropylene threads, with D-S-D knots, should be more indicated to suture tissues that have to support great forces and with little deformation of the suture (e.g., tendons). Nylon threads, with S-S-D knots, would be better indicated for tissues that have to support smaller forces and that require greater elasticity (e.g., skin). Copyright © 2011 AEC. Published by Elsevier Espana. All rights reserved.

  7. Attenuation of the tip vortex flow using a flexible thread

    NASA Astrophysics Data System (ADS)

    Lee, Seung-Jae; Shin, Jin-Woo; Arndt, Roger E. A.; Suh, Jung-Chun

    2018-01-01

    Tip vortex cavitation (TVC) is important in a number of practical engineering applications. The onset of TVC is a critical concern for navy surface ships and submarines that aim to increase their capability to evade detection. A flexible thread attachment at blade tips was recently suggested as a new method to delay the onset of TVC. Although the occurrence of TVC can be reduced using a flexible thread, no scientific investigation focusing on its mechanisms has been undertaken. Thus, herein, we experimentally investigated the use of the flexible thread to suppress TVC from an elliptical wing. These investigations were performed in a cavitation tunnel and involved an observation of TVC using high-speed cameras, motion tracking of the thread using image-processing techniques, and near-field flow measurements performed using stereoscopic particle image velocimetry. The experimental data suggested that the flexible thread affects the axial velocity field more than the circumferential velocity field around the TVC axis. Furthermore, we observed no clear dependence of the vortex core size, circulation, and flow unsteadiness on TVC suppression. However, the presence of the thread at the wing tip led to a notable reduction in the streamwise velocity field, thereby alleviating TVC.

  8. Structure and Dynamics of Quiescent Prominence Eruptions

    NASA Astrophysics Data System (ADS)

    Su, Y.; Lu, M.; van Ballegooijen, A.

    2012-05-01

    We present a survey on the fine structure and dynamics of quiescent prominence eruptions observed both on the disk and at the limb. We have identified 45 quiescent prominence eruptions by looking at the SDO (Solar Dynamics Observatory)/AIA (Atmospheric Imaging Assembly) daily movies from April to June in 2011. Among these events, there are 24 symmetric eruptions (coherent loop-like eruptions) and 21 asymmetric eruptions (one footpoint lifts off) as shown by AIA and STEREO/EUVI observations. Vertical filament threads are identified in 10 out of the 45 events, while horizontal threads are observed in almost all eruptions. We find 23 events with twisting/untwisting motions. For 14 selected limb events, we carry out a detailed study of the eruption dynamics using AIA observations at 304 Å. We find that the initial heights of these erupting prominences are located around 50-110 Mm above the limb. The eruptions start from a speed of less than 5 km/s, then increase to several tens km/s in the AIA field of view. The maximum speed of these events is 50 km/s. The acceleration plots show a positive acceleration in the range of 0 to 20 m/s2. No significant difference is identified in the dynamics of the symmetric and asymmetric eruptions.

  9. Vision-Based Finger Detection, Tracking, and Event Identification Techniques for Multi-Touch Sensing and Display Systems

    PubMed Central

    Chen, Yen-Lin; Liang, Wen-Yew; Chiang, Chuan-Yen; Hsieh, Tung-Ju; Lee, Da-Cheng; Yuan, Shyan-Ming; Chang, Yang-Lang

    2011-01-01

    This study presents efficient vision-based finger detection, tracking, and event identification techniques and a low-cost hardware framework for multi-touch sensing and display applications. The proposed approach uses a fast bright-blob segmentation process based on automatic multilevel histogram thresholding to extract the pixels of touch blobs obtained from scattered infrared lights captured by a video camera. The advantage of this automatic multilevel thresholding approach is its robustness and adaptability when dealing with various ambient lighting conditions and spurious infrared noises. To extract the connected components of these touch blobs, a connected-component analysis procedure is applied to the bright pixels acquired by the previous stage. After extracting the touch blobs from each of the captured image frames, a blob tracking and event recognition process analyzes the spatial and temporal information of these touch blobs from consecutive frames to determine the possible touch events and actions performed by users. This process also refines the detection results and corrects for errors and occlusions caused by noise and errors during the blob extraction process. The proposed blob tracking and touch event recognition process includes two phases. First, the phase of blob tracking associates the motion correspondence of blobs in succeeding frames by analyzing their spatial and temporal features. The touch event recognition process can identify meaningful touch events based on the motion information of touch blobs, such as finger moving, rotating, pressing, hovering, and clicking actions. Experimental results demonstrate that the proposed vision-based finger detection, tracking, and event identification system is feasible and effective for multi-touch sensing applications in various operational environments and conditions. PMID:22163990

  10. Application of SNMP on CATV

    NASA Astrophysics Data System (ADS)

    Huang, Hong-bin; Liu, Wei-ping; Chen, Shun-er; Zheng, Liming

    2005-02-01

    A new type of CATV network management system developed by universal MCU, which supports SNMP, is proposed in this paper. From the point of view in both hardware and software, the function and method of every modules inside the system, which include communications in the physical layer, protocol process, data process, and etc, are analyzed. In our design, the management system takes IP MAN as data transmission channel and every controlled object in the management structure has a SNMP agent. In the SNMP agent developed, there are four function modules, including physical layer communication module, protocol process module, internal data process module and MIB management module. In the paper, the structure and function of every module are designed and demonstrated while the related hardware circuit, software flow as well as the experimental results are tested. Furthermore, by introducing RTOS into the software programming, the universal MCU procedure can conducts such multi-thread management as fast Ethernet controller driving, TCP/IP process, serial port signal monitoring and so on, which greatly improves efficiency of CPU.

  11. Three-dimensional Aerodynamic Instability in Multi-stage Axial Compressors

    NASA Technical Reports Server (NTRS)

    Suder, Kenneth (Technical Monitor); Tan, Choon-Sooi

    2003-01-01

    Four separate tasks are reported. The first task: A Computational Model for Short Wavelength Stall Inception and Development In Multi-Stage Compressors; the second task: Three-dimensional Rotating Stall Inception and Effects of Rotating Tip Clearance Asymmetry in Axial Compressors; the third task:Development of an Effective Computational Methodology for Body Force Representation of High-speed Rotor 37; and the fourth task:Development of Circumferential Inlet Distortion through a Representative Eleven Stage High-speed axial compressor. The common theme that threaded throughout these four tasks is the conceptual framework that consists of quantifying flow processes at the fadcompressor blade passage level to define the compressor performance characteristics needed for addressing physical phenomena such compressor aerodynamic instability and compressor response to flow distoriton with length scales larger than compressor blade-to-blade spacing at the system level. The results from these two levels can be synthesized to: (1) simulate compressor aerodynamic instability inception local to a blade rotor tip and its development from a local flow event into the nonlinear limit cycle instability that involves the entire compressor as was demonstrated in the first task; (2) determine the conditions under which compressor stability assessment based on two-dimensional model may not be adequate and the effects of self-induced flow distortion on compressor stability limit as in the second task; (3) quantify multistage compressor response to inlet distortion in stagnation pressure as illustrated in the fourth task; and (4) elucidate its potential applicability for compressor map generation under uniform as well as non-uniform inlet flow given three-dimensional Navier-Stokes solution for each individual blade row as was demonstrated in the third task.

  12. 75 FR 65609 - Determination Under the Textile and Apparel Commercial Availability Provision of the Dominican...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-10-26

    .../50 x 2/50 Thread Count (warp): 32 to 33 cm Thread Count (weft): 25.5 to 26.5 picks/cm Weave Type...: Piece dyed, then over dyed (single uniform color) Finishing Processes: (CRF) Finish softeners, singed...

  13. Benchmark and Framework for Encouraging Research on Multi-Threaded Testing Tools

    NASA Technical Reports Server (NTRS)

    Havelund, Klaus; Stoller, Scott D.; Ur, Shmuel

    2003-01-01

    A problem that has been getting prominence in testing is that of looking for intermittent bugs. Multi-threaded code is becoming very common, mostly on the server side. As there is no silver bullet solution, research focuses on a variety of partial solutions. In this paper (invited by PADTAD 2003) we outline a proposed project to facilitate research. The project goals are as follows. The first goal is to create a benchmark that can be used to evaluate different solutions. The benchmark, apart from containing programs with documented bugs, will include other artifacts, such as traces, that are useful for evaluating some of the technologies. The second goal is to create a set of tools with open API s that can be used to check ideas without building a large system. For example an instrumentor will be available, that could be used to test temporal noise making heuristics. The third goal is to create a focus for the research in this area around which a community of people who try to solve similar problems with different techniques, could congregate.

  14. Designing Next Generation Massively Multithreaded Architectures for Irregular Applications

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tumeo, Antonino; Secchi, Simone; Villa, Oreste

    Irregular applications, such as data mining or graph-based computations, show unpredictable memory/network access patterns and control structures. Massively multi-threaded architectures with large node count, like the Cray XMT, have been shown to address their requirements better than commodity clusters. In this paper we present the approaches that we are currently pursuing to design future generations of these architectures. First, we introduce the Cray XMT and compare it to other multithreaded architectures. We then propose an evolution of the architecture, integrating multiple cores per node and next generation network interconnect. We advocate the use of hardware support for remote memory referencemore » aggregation to optimize network utilization. For this evaluation we developed a highly parallel, custom simulation infrastructure for multi-threaded systems. Our simulator executes unmodified XMT binaries with very large datasets, capturing effects due to contention and hot-spotting, while predicting execution times with greater than 90% accuracy. We also discuss the FPGA prototyping approach that we are employing to study efficient support for irregular applications in next generation manycore processors.« less

  15. WOMBAT: A Scalable and High-performance Astrophysical Magnetohydrodynamics Code

    NASA Astrophysics Data System (ADS)

    Mendygral, P. J.; Radcliffe, N.; Kandalla, K.; Porter, D.; O'Neill, B. J.; Nolting, C.; Edmon, P.; Donnert, J. M. F.; Jones, T. W.

    2017-02-01

    We present a new code for astrophysical magnetohydrodynamics specifically designed and optimized for high performance and scaling on modern and future supercomputers. We describe a novel hybrid OpenMP/MPI programming model that emerged from a collaboration between Cray, Inc. and the University of Minnesota. This design utilizes MPI-RMA optimized for thread scaling, which allows the code to run extremely efficiently at very high thread counts ideal for the latest generation of multi-core and many-core architectures. Such performance characteristics are needed in the era of “exascale” computing. We describe and demonstrate our high-performance design in detail with the intent that it may be used as a model for other, future astrophysical codes intended for applications demanding exceptional performance.

  16. Cascading processes and interactions in torrent catchments and their influence on the damage pattern

    NASA Astrophysics Data System (ADS)

    Keiler, Margreth; Gebbers, David

    2014-05-01

    Research on single geomorphological processes during damaging events has a long history; however, comprehensive documentations and analyses of the events have been conducted not until the late 1980s. Thus, for highly damaging events insights about triggering, the evolution and the impacts of processes during an event and the resulting damage were produced. Though, in the majority of cases the processes were studied in a well-defined procedure of one disciplinary focus. These focused studies neglect mutable influences which may alter the sequence of the process or the event. During damaging events multiple geomorphological processes are active which leads to the assumption that they have a certain impact on each other and the course of damaging effect. Consequently, for a comprehensive hazard and risk analysis all processes of a catchment have to be analysed and evaluated quantitatively and qualitatively (MARZOCCHI, 2007). Although the demand for a sophisticated risk management is increasing, the research on interactions as well as on physical vulnerability to multiple hazards, including the different processes impact effects, is still very limited (KAPPES et al., 2010, 2011). The challenges in this field are the quantity of data needed, and furthermore to conduct this kind of analysis is very complex and complicated (KAPPES et al. 2012). Yet, knowledge about possible interactions and resulting impact effects could significantly contribute to the reduction of risk in a region. The objective of this study is to analyse, i) how geomorphological processes interact with each other and with other factors of the surrounding during a damaging event, ii) what influences those interactions have on the resulting damage of the event and iii) whether or not different events are comparable in terms of those interactions and their impacts. To meet these objectives, 15 damaging torrent events, which occurred between 2000 and 2011 in the Bernese Oberland and the Pennine Alps, Switzerland, were analysed on the basis of event reports and general catchment parameters. The interactions were classified into different categories regarding a process and the interacting counterpart (another process, with structures or disposition) and the temporal and spatial extent in which these interactions occurred. Additionally, positive and negative feedbacks of the processes were considered. First results highlight that some types of interaction can be extracted in several events and that their temporal and spatial extent is comparable. However, the analysis indicates that single interaction exhibits multi-path consequences which are a challenge for general propositions of interactions influencing damage patterns. In the further step of this study, clusters of interactions which could occur in different events in similar ways are analysed in more detail. REFERENCES Kappes, M.S., Papathoma-Köhle, M. & Keiler, M. 2011: Assessing physical vulnerability for multi-hazards using an indicator-based methodology, Applied Geography, 32, 577-590. Kappes, M.S., Keiler, M. & Glade, T. 2010: From single- to multi-hazard risk analyses: a concept addressing emerging challenges. In: Malet, J.-P.; Glade, T. & N. Casagli (eds.) Mountain Risks: bringing science to society. Proceedings of the 'Mountain Risks' International Conference, Firenze, Italy. Strasbourg. CERG Editions: 351-356 Kappes, M. S., Keiler, M., von Eleverfeldt, K., Glade, T. 2012: Challenges of analyzing multi-hazard risk: a review. NAT HAZARDS 64: 1925-1958. Marzocchi, W., Mastellone, M.L., Ruocco, A. 2009: Principles of multi-risk assessment: Interaction amongst natural and man-induced risks. European Commission. Brussels.

  17. Using Discrete Event Simulation for Programming Model Exploration at Extreme-Scale: Macroscale Components for the Structural Simulation Toolkit (SST).

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wilke, Jeremiah J; Kenny, Joseph P.

    2015-02-01

    Discrete event simulation provides a powerful mechanism for designing and testing new extreme- scale programming models for high-performance computing. Rather than debug, run, and wait for results on an actual system, design can first iterate through a simulator. This is particularly useful when test beds cannot be used, i.e. to explore hardware or scales that do not yet exist or are inaccessible. Here we detail the macroscale components of the structural simulation toolkit (SST). Instead of depending on trace replay or state machines, the simulator is architected to execute real code on real software stacks. Our particular user-space threading frameworkmore » allows massive scales to be simulated even on small clusters. The link between the discrete event core and the threading framework allows interesting performance metrics like call graphs to be collected from a simulated run. Performance analysis via simulation can thus become an important phase in extreme-scale programming model and runtime system design via the SST macroscale components.« less

  18. A Fast and On-Machine Measuring System Using the Laser Displacement Sensor for the Contour Parameters of the Drill Pipe Thread.

    PubMed

    Dong, Zhixu; Sun, Xingwei; Chen, Changzheng; Sun, Mengnan

    2018-04-13

    The inconvenient loading and unloading of a long and heavy drill pipe gives rise to the difficulty in measuring the contour parameters of its threads at both ends. To solve this problem, in this paper we take the SCK230 drill pipe thread-repairing machine tool as a carrier to design and achieve a fast and on-machine measuring system based on a laser probe. This system drives a laser displacement sensor to acquire the contour data of a certain axial section of the thread by using the servo function of a CNC machine tool. To correct the sensor's measurement errors caused by the measuring point inclination angle, an inclination error model is built to compensate data in real time. To better suppress random error interference and ensure real contour information, a new wavelet threshold function is proposed to process data through the wavelet threshold denoising. Discrete data after denoising is segmented according to the geometrical characteristics of the drill pipe thread, and the regression model of the contour data in each section is fitted by using the method of weighted total least squares (WTLS). Then, the thread parameters are calculated in real time to judge the processing quality. Inclination error experiments show that the proposed compensation model is accurate and effective, and it can improve the data acquisition accuracy of a sensor. Simulation results indicate that the improved threshold function is of better continuity and self-adaptability, which makes sure that denoising effects are guaranteed, and, meanwhile, the complete elimination of real data distorted in random errors is avoided. Additionally, NC50 thread-testing experiments show that the proposed on-machine measuring system can complete the measurement of a 25 mm thread in 7.8 s, with a measurement accuracy of ±8 μm and repeatability limit ≤ 4 μm (high repeatability), and hence the accuracy and efficiency of measurement are both improved.

  19. A Fast and On-Machine Measuring System Using the Laser Displacement Sensor for the Contour Parameters of the Drill Pipe Thread

    PubMed Central

    Sun, Xingwei; Chen, Changzheng; Sun, Mengnan

    2018-01-01

    The inconvenient loading and unloading of a long and heavy drill pipe gives rise to the difficulty in measuring the contour parameters of its threads at both ends. To solve this problem, in this paper we take the SCK230 drill pipe thread-repairing machine tool as a carrier to design and achieve a fast and on-machine measuring system based on a laser probe. This system drives a laser displacement sensor to acquire the contour data of a certain axial section of the thread by using the servo function of a CNC machine tool. To correct the sensor’s measurement errors caused by the measuring point inclination angle, an inclination error model is built to compensate data in real time. To better suppress random error interference and ensure real contour information, a new wavelet threshold function is proposed to process data through the wavelet threshold denoising. Discrete data after denoising is segmented according to the geometrical characteristics of the drill pipe thread, and the regression model of the contour data in each section is fitted by using the method of weighted total least squares (WTLS). Then, the thread parameters are calculated in real time to judge the processing quality. Inclination error experiments show that the proposed compensation model is accurate and effective, and it can improve the data acquisition accuracy of a sensor. Simulation results indicate that the improved threshold function is of better continuity and self-adaptability, which makes sure that denoising effects are guaranteed, and, meanwhile, the complete elimination of real data distorted in random errors is avoided. Additionally, NC50 thread-testing experiments show that the proposed on-machine measuring system can complete the measurement of a 25 mm thread in 7.8 s, with a measurement accuracy of ±8 μm and repeatability limit ≤ 4 μm (high repeatability), and hence the accuracy and efficiency of measurement are both improved. PMID:29652836

  20. Evaluating the use of laser radiation in cleaning of copper embroidery threads on archaeological Egyptian textiles

    NASA Astrophysics Data System (ADS)

    Abdel-Kareem, Omar; Harith, M. A.

    2008-07-01

    Cleaning of copper embroidery threads on archaeological textiles is still a complicated conservation process, as most textile conservators believe that the advantages of using traditional cleaning techniques are less than their disadvantages. In this study, the uses of laser cleaning method and two modified recipes of wet cleaning methods were evaluated for cleaning of the corroded archaeological Egyptian copper embroidery threads on an archaeological Egyptian textile fabric. Some corroded copper thread samples were cleaned using modified recipes of wet cleaning method; other corroded copper thread samples were cleaned with Q-switched Nd:YAG laser radiation of wavelength 532 nm. All tested metal thread samples before and after cleaning were investigated using a light microscope and a scanning electron microscope with an energy dispersive X-ray analysis unit. Also the laser-induced breakdown spectroscopy (LIBS) technique was used for the elemental analysis of laser-cleaned samples to follow up the laser cleaning procedure. The results show that laser cleaning is the most effective method among all tested methods in the cleaning of corroded copper threads. It can be used safely in removing the corrosion products without any damage to both metal strips and fibrous core. The tested laser cleaning technique has solved the problems caused by other traditional cleaning techniques that are commonly used in the cleaning of metal threads on museum textiles.

  1. A Multi-mission Event-Driven Component-Based System for Support of Flight Software Development, ATLO, and Operations first used by the Mars Science Laboratory (MSL) Project

    NASA Technical Reports Server (NTRS)

    Dehghani, Navid; Tankenson, Michael

    2006-01-01

    This paper details an architectural description of the Mission Data Processing and Control System (MPCS), an event-driven, multi-mission ground data processing components providing uplink, downlink, and data management capabilities which will support the Mars Science Laboratory (MSL) project as its first target mission. MPCS is developed based on a set of small reusable components, implemented in Java, each designed with a specific function and well-defined interfaces. An industry standard messaging bus is used to transfer information among system components. Components generate standard messages which are used to capture system information, as well as triggers to support the event-driven architecture of the system. Event-driven systems are highly desirable for processing high-rate telemetry (science and engineering) data, and for supporting automation for many mission operations processes.

  2. Fabrication of drug-loaded electrospun aligned fibrous threads for suture applications.

    PubMed

    He, Chuang-Long; Huang, Zheng-Ming; Han, Xiao-Jian

    2009-04-01

    In this work, drug-loaded fibers and threads were successfully fabricated by combining electrospinning with aligned fibers collection. Two different electrospinning processes, that is, blend and coaxial electrospinning, to incorporate a model drug tetracycline hydrochloride (TCH) into poly(L-lactic acid) (PLLA) fibers have been used and compared with each other. The resulting composite ultrafine fibers and threads were characterized through scanning electron microscopy, transmission electron microscopy, Fourier transform infrared spectroscopy, X-ray diffraction, differential scanning calorimetry, and tensile testing. It has been shown that average diameters of the fibers made from the same polymer concentration depended on the processing method. The blend TCH/PLLA fibers showed the smallest fiber diameter, whereas neat PLLA fibers and core-shell TCH-PLLA fibers showed a larger proximal average diameter. Higher rotating speed of a wheel collector is helpful for obtaining better-aligned fibers. Both the polymer and the drug in the electrospun fibers have poor crystalline property. In vitro release study indicated that threads made from the core-shell fibers could suppress the initial burst release and provide a sustained drug release useful for the release of growth factor or other therapeutic drugs. On the other hand, the threads from the blend fibers produced a large initial burst release that may be used to prevent bacteria infection. A combination of these results suggests that electrospinning technique provides a novel way to fabricate medical agents-loaded fibrous threads for tissue suturing and tissue regeneration applications. Copyright 2008 Wiley Periodicals, Inc.

  3. Successive Two-sided Loop Jets Caused by Magnetic Reconnection between Two Adjacent Filamentary Threads

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tian, Zhanjun; Liu, Yu; Shen, Yuandeng

    We present observational analysis of two successive two-sided loop jets observed by the ground-based New Vacuum Solar Telescope and the space-borne Solar Dynamics Observatory . The two successive two-sided loop jets manifested similar evolution processes and both were associated with the interaction of two small-scale adjacent filamentary threads, magnetic emerging, and cancellation processes at the jet’s source region. High temporal and high spatial resolution observations reveal that the two adjacent ends of the two filamentary threads are rooted in opposite magnetic polarities within the source region. The two threads approached each other, and then an obvious brightening patch is observedmore » at the interaction position. Subsequently, a pair of hot plasma ejections are observed heading in opposite directions along the paths of the two filamentary threads at a typical speed for two-sided loop jets of the order 150 km s{sup −1}. Close to the end of the second jet, we report the formation of a bright hot loop structure at the source region, which suggests the formation of new loops during the interaction. Based on the observational results, we propose that the observed two-sided loop jets are caused by magnetic reconnection between the two adjacent filamentary threads, largely different from the previous scenario that a two-sided loop jet is generated by magnetic reconnection between an emerging bipole and the overlying horizontal magnetic fields.« less

  4. Influence of multi-cycle loading on the structure and mechanics of marine mussel plaques.

    PubMed

    Wilhelm, Menaka H; Filippidi, Emmanouela; Waite, J Herbert; Valentine, Megan T

    2017-10-18

    The proteinaceous byssal plaque-thread structures created by marine mussels exhibit extraordinary load-bearing capability. Although the nanoscopic protein interactions that support interfacial adhesion are increasingly understood, major mechanistic questions about how mussel plaques maintain toughness on supramolecular scales remain unanswered. This study explores the mechanical properties of whole mussel plaques subjected to repetitive loading cycles, with varied recovery times. Mechanical measurements were complemented with scanning electron microscopy to investigate strain-induced structural changes after yield. Multicyclic loading of plaques decreases their low-strain stiffness and introduces irreversible, strain-dependent plastic damage within the plaque microstructure. However, strain history does not compromise critical strength or maximum extension compared with plaques monotonically loaded to failure. These results suggest that a multiplicity of force transfer mechanisms between the thread and plaque-substrate interface allow the plaque-thread structure to accommodate a wide range of extensions as it continues to bear load. This improved understanding of the mussel system at micron-to-millimeter lengthscales offers strategies for including similar fail-safe mechanisms in the design of soft, tough and resilient synthetic structures.

  5. 30 CFR 7.304 - Technical requirements.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... (coarse, loose fitting) threads. The covers shall be secured against loosening. (5) Flat surfaces between... flame-arresting path shall be finished during the manufacturing process to not more than 250 microinches... requirements as the pole piece. (12) Coil-thread inserts, if used in holes for fastenings, shall meet the...

  6. 75 FR 42054 - Endangered and Threatened Wildlife and Plants; Revised Critical Habitat for Brodiaea filifolia

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-07-20

    ... Critical Habitat for Brodiaea filifolia (Thread-leaved Brodiaea) AGENCY: Fish and Wildlife Service... revised designation of critical habitat for Brodiaea filifolia (thread-leaved brodiaea) under the... the existing consultation process because Brodiaea filifolia is federally listed as a threatened...

  7. 30 CFR 7.304 - Technical requirements.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... (coarse, loose fitting) threads. The covers shall be secured against loosening. (5) Flat surfaces between... flame-arresting path shall be finished during the manufacturing process to not more than 250 microinches... requirements as the pole piece. (12) Coil-thread inserts, if used in holes for fastenings, shall meet the...

  8. 30 CFR 7.304 - Technical requirements.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... (coarse, loose fitting) threads. The covers shall be secured against loosening. (5) Flat surfaces between... flame-arresting path shall be finished during the manufacturing process to not more than 250 microinches... requirements as the pole piece. (12) Coil-thread inserts, if used in holes for fastenings, shall meet the...

  9. 30 CFR 7.304 - Technical requirements.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... (coarse, loose fitting) threads. The covers shall be secured against loosening. (5) Flat surfaces between... flame-arresting path shall be finished during the manufacturing process to not more than 250 microinches... requirements as the pole piece. (12) Coil-thread inserts, if used in holes for fastenings, shall meet the...

  10. 30 CFR 7.304 - Technical requirements.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... (coarse, loose fitting) threads. The covers shall be secured against loosening. (5) Flat surfaces between... flame-arresting path shall be finished during the manufacturing process to not more than 250 microinches... requirements as the pole piece. (12) Coil-thread inserts, if used in holes for fastenings, shall meet the...

  11. Validation of a virtual source model of medical linac for Monte Carlo dose calculation using multi-threaded Geant4.

    PubMed

    Aboulbanine, Zakaria; El Khayati, Naïma

    2018-04-13

    The use of phase space in medical linear accelerator Monte Carlo (MC) simulations significantly improves the execution time and leads to results comparable to those obtained from full calculations. The classical representation of phase space stores directly the information of millions of particles, producing bulky files. This paper presents a virtual source model (VSM) based on a reconstruction algorithm, taking as input a compressed file of roughly 800 kb derived from phase space data freely available in the International Atomic Energy Agency (IAEA) database. This VSM includes two main components; primary and scattered particle sources, with a specific reconstruction method developed for each. Energy spectra and other relevant variables were extracted from IAEA phase space and stored in the input description data file for both sources. The VSM was validated for three photon beams: Elekta Precise 6 MV/10 MV and a Varian TrueBeam 6 MV. Extensive calculations in water and comparisons between dose distributions of the VSM and IAEA phase space were performed to estimate the VSM precision. The Geant4 MC toolkit in multi-threaded mode (Geant4-[mt]) was used for fast dose calculations and optimized memory use. Four field configurations were chosen for dose calculation validation to test field size and symmetry effects, [Formula: see text] [Formula: see text], [Formula: see text] [Formula: see text], and [Formula: see text] [Formula: see text] for squared fields, and [Formula: see text] [Formula: see text] for an asymmetric rectangular field. Good agreement in terms of [Formula: see text] formalism, for 3%/3 mm and 2%/3 mm criteria, for each evaluated radiation field and photon beam was obtained within a computation time of 60 h on a single WorkStation for a 3 mm voxel matrix. Analyzing the VSM's precision in high dose gradient regions, using the distance to agreement concept (DTA), showed also satisfactory results. In all investigated cases, the mean DTA was less than 1 mm in build-up and penumbra regions. In regards to calculation efficiency, the event processing speed is six times faster using Geant4-[mt] compared to sequential Geant4, when running the same simulation code for both. The developed VSM for 6 MV/10 MV beams widely used, is a general concept easy to adapt in order to reconstruct comparable beam qualities for various linac configurations, facilitating its integration for MC treatment planning purposes.

  12. Enabling communication concurrency through flexible MPI endpoints

    DOE PAGES

    Dinan, James; Grant, Ryan E.; Balaji, Pavan; ...

    2014-09-23

    MPI defines a one-to-one relationship between MPI processes and ranks. This model captures many use cases effectively; however, it also limits communication concurrency and interoperability between MPI and programming models that utilize threads. Our paper describes the MPI endpoints extension, which relaxes the longstanding one-to-one relationship between MPI processes and ranks. Using endpoints, an MPI implementation can map separate communication contexts to threads, allowing them to drive communication independently. Also, endpoints enable threads to be addressable in MPI operations, enhancing interoperability between MPI and other programming models. Furthermore, these characteristics are illustrated through several examples and an empirical study thatmore » contrasts current multithreaded communication performance with the need for high degrees of communication concurrency to achieve peak communication performance.« less

  13. [Effects of hyperbaric oxygenation on oxidative phosphorylation in post-nephrotomy tissues sutured with different surgical threads (an experimental study)].

    PubMed

    Kostenko, V A

    1998-01-01

    The activity of mitochondrial respiration and oxidative phosphorylation (OP) was studied in white rats subjected to nephrotomy. The suture was made with absorbable surgical threads such as catgut plain, biofil (from dura mater spinalis of the cattle), dexon II (polyglycolic acid). The use of catgut plain inhibits biosynthetic processes 7 and 14 days after operation. Hyperbaric oxygenation enhances oxidative phosphorylation in postoperative renal tissue sutured with different biological and synthetic absorbable surgical threads (catgut, biofil, dexon II) and prevents sharp depression of the above processes in the course of catgut biodegradation. This fact is of great importance for reduction of normal functional and metabolic activity of the operated kidney.

  14. Rayleigh instability at small length scales.

    PubMed

    Gopan, Nandu; Sathian, Sarith P

    2014-09-01

    The Rayleigh instability (also called the Plateau-Rayleigh instability) of a nanosized liquid propane thread is investigated using molecular dynamics (MD). The validity of classical predictions at small length scales is verified by comparing the temporal evolution of liquid thread simulated by MD against classical predictions. Previous works have shown that thermal fluctuations become dominant at small length scales. The role and influence of the stochastic nature of thermal fluctuations in determining the instability at small length scale is also investigated. Thermal fluctuations are seen to dominate and accelerate the breakup process only during the last stages of breakup. The simulations also reveal that the breakup profile of nanoscale threads undergo modification due to reorganization of molecules by the evaporation-condensation process.

  15. Enabling communication concurrency through flexible MPI endpoints

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dinan, James; Grant, Ryan E.; Balaji, Pavan

    MPI defines a one-to-one relationship between MPI processes and ranks. This model captures many use cases effectively; however, it also limits communication concurrency and interoperability between MPI and programming models that utilize threads. Our paper describes the MPI endpoints extension, which relaxes the longstanding one-to-one relationship between MPI processes and ranks. Using endpoints, an MPI implementation can map separate communication contexts to threads, allowing them to drive communication independently. Also, endpoints enable threads to be addressable in MPI operations, enhancing interoperability between MPI and other programming models. Furthermore, these characteristics are illustrated through several examples and an empirical study thatmore » contrasts current multithreaded communication performance with the need for high degrees of communication concurrency to achieve peak communication performance.« less

  16. Enabling communication concurrency through flexible MPI endpoints

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dinan, James; Grant, Ryan E.; Balaji, Pavan

    MPI defines a one-to-one relationship between MPI processes and ranks. This model captures many use cases effectively; however, it also limits communication concurrency and interoperability between MPI and programming models that utilize threads. This paper describes the MPI endpoints extension, which relaxes the longstanding one-to-one relationship between MPI processes and ranks. Using endpoints, an MPI implementation can map separate communication contexts to threads, allowing them to drive communication independently. Endpoints also enable threads to be addressable in MPI operations, enhancing interoperability between MPI and other programming models. These characteristics are illustrated through several examples and an empirical study that contrastsmore » current multithreaded communication performance with the need for high degrees of communication concurrency to achieve peak communication performance.« less

  17. Function and evolution of a Lotus japonicus AP2/ERF family transcription factor that is required for development of infection threads.

    PubMed

    Yano, Koji; Aoki, Seishiro; Liu, Meng; Umehara, Yosuke; Suganuma, Norio; Iwasaki, Wataru; Sato, Shusei; Soyano, Takashi; Kouchi, Hiroshi; Kawaguchi, Masayoshi

    2017-04-01

    Legume-rhizobium symbiosis is achieved by two major events evolutionarily acquired: root hair infection and organogenesis. Infection thread (IT) development is a distinct element for rhizobial infection. Through ITs, rhizobia are efficiently transported from infection foci on root hairs to dividing meristematic cortical cells. To unveil this process, we performed genetic screening using Lotus japonicus MG-20 and isolated symbiotic mutant lines affecting nodulation, root hair morphology, and IT development. Map-based cloning identified an AP2/ERF transcription factor gene orthologous to Medicago truncatula ERN1. LjERN1 was activated in response to rhizobial infection and depended on CYCLOPS and NSP2. Legumes conserve an ERN1 homolog, ERN2, that functions redundantly with ERN1 in M. truncatula. Phylogenetic analysis showed that the lineages of ERN1 and ERN2 genes originated from a gene duplication event in the common ancestor of legume plants. However, genomic analysis suggested the lack of ERN2 gene in the L. japonicus genome, consistent with Ljern1 mutants exhibited a root hair phenotype that is observed in ern1/ern2 double mutants in M. truncatula. Molecular evolutionary analysis suggested that the nonsynonymous/synonymous rate ratios of legume ERN1 genes was almost identical to that of non-legume plants, whereas the ERN2 genes experienced a relaxed selective constraint. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  18. Development and realization of the open fault diagnosis system based on XPE

    NASA Astrophysics Data System (ADS)

    Deng, Hui; Wang, TaiYong; He, HuiLong; Xu, YongGang; Zeng, JuXiang

    2005-12-01

    To make the complex mechanical equipment work in good service, the technology for realizing an embedded open system is introduced systematically, including open hardware configuration, customized embedded operation system and open software structure. The ETX technology is adopted in this system, integrating the CPU main-board functions, and achieving the quick, real-time signal acquisition and intelligent data analysis with applying DSP and CPLD data acquisition card. Under the open configuration, the signal bus mode such as PCI, ISA and PC/104 can be selected and the styles of the signals can be chosen too. In addition, through customizing XPE system, adopting the EWF (Enhanced Write Filter), and realizing the open system authentically, the stability of the system is enhanced. Multi-thread and multi-task programming techniques are adopted in the software programming process. Interconnecting with the remote fault diagnosis center via the net interface, cooperative diagnosis is conducted and the intelligent degree of the fault diagnosis is improved.

  19. Mobile Thread Task Manager

    NASA Technical Reports Server (NTRS)

    Clement, Bradley J.; Estlin, Tara A.; Bornstein, Benjamin J.

    2013-01-01

    The Mobile Thread Task Manager (MTTM) is being applied to parallelizing existing flight software to understand the benefits and to develop new techniques and architectural concepts for adapting software to multicore architectures. It allocates and load-balances tasks for a group of threads that migrate across processors to improve cache performance. In order to balance-load across threads, the MTTM augments a basic map-reduce strategy to draw jobs from a global queue. In a multicore processor, memory may be "homed" to the cache of a specific processor and must be accessed from that processor. The MTTB architecture wraps access to data with thread management to move threads to the home processor for that data so that the computation follows the data in an attempt to avoid L2 cache misses. Cache homing is also handled by a memory manager that translates identifiers to processor IDs where the data will be homed (according to rules defined by the user). The user can also specify the number of threads and processors separately, which is important for tuning performance for different patterns of computation and memory access. MTTM efficiently processes tasks in parallel on a multiprocessor computer. It also provides an interface to make it easier to adapt existing software to a multiprocessor environment.

  20. 16 CFR 1632.6 - Ticking substitution procedure.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ..., density, and chemical composition; a specific thread; a specific method of quilting; and a specific... ticking is quilted, one cigarette shall be placed over the thread or in the depression created by the quilting process. Each cigarette must be positioned no less than two inches (5.08 cm) from any other...

  1. 16 CFR 1615.31 - Labeling, recordkeeping, advertising, retail display and guaranties.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... records required must establish a line of continuity through the process of manufacture of each production... content, and details of construction on all seams, fabrics, threads, stitches, and trims used in each..., seams, threads, stitches, and trims used in such prototype testing, relating such samples to the records...

  2. 16 CFR 1632.6 - Ticking substitution procedure.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ..., density, and chemical composition; a specific thread; a specific method of quilting; and a specific... ticking is quilted, one cigarette shall be placed over the thread or in the depression created by the quilting process. Each cigarette must be positioned no less than two inches (5.08 cm) from any other...

  3. 16 CFR 1615.31 - Labeling, recordkeeping, advertising, retail display and guaranties.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... records required must establish a line of continuity through the process of manufacture of each production... content, and details of construction on all seams, fabrics, threads, stitches, and trims used in each..., seams, threads, stitches, and trims used in such prototype testing, relating such samples to the records...

  4. 16 CFR 1632.6 - Ticking substitution procedure.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ..., density, and chemical composition; a specific thread; a specific method of quilting; and a specific... ticking is quilted, one cigarette shall be placed over the thread or in the depression created by the quilting process. Each cigarette must be positioned no less than two inches (5.08 cm) from any other...

  5. 49 CFR 178.44 - Specification 3HT seamless steel cylinders for aircraft use.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... cylinder. (d) Manufacture. Cylinders must be manufactured using equipment and processes adequate to ensure.... (h) Openings in cylinders and connections (valves, fuse plugs, etc.) for those openings. Threads conforming to the following are required on openings: (1) Threads must be clean cut, even, without cracks...

  6. 49 CFR 178.44 - Specification 3HT seamless steel cylinders for aircraft use.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... cylinder. (d) Manufacture. Cylinders must be manufactured using equipment and processes adequate to ensure.... (h) Openings in cylinders and connections (valves, fuse plugs, etc.) for those openings. Threads conforming to the following are required on openings: (1) Threads must be clean cut, even, without cracks...

  7. 16 CFR 1632.6 - Ticking substitution procedure.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ..., density, and chemical composition; a specific thread; a specific method of quilting; and a specific... ticking is quilted, one cigarette shall be placed over the thread or in the depression created by the quilting process. Each cigarette must be positioned no less than two inches (5.08 cm) from any other...

  8. 16 CFR § 1632.6 - Ticking substitution procedure.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... specific filling, thickness, density, and chemical composition; a specific thread; a specific method of... ticking is quilted, one cigarette shall be placed over the thread or in the depression created by the quilting process. Each cigarette must be positioned no less than two inches (5.08 cm) from any other...

  9. 16 CFR 1616.4 - Sampling and acceptance procedures.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... a suitable thread and stitch. The specimen shall include each of the components over its entire... fabric in Tightened Sampling must be discontinued until that part of the process or component which is... otherwise attaching the trim shall be done with thread or fastening material of the same composition and...

  10. 16 CFR 1615.31 - Labeling, recordkeeping, advertising, retail display and guaranties.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... records required must establish a line of continuity through the process of manufacture of each production... content, and details of construction on all seams, fabrics, threads, stitches, and trims used in each..., seams, threads, stitches, and trims used in such prototype testing, relating such samples to the records...

  11. 49 CFR 178.44 - Specification 3HT seamless steel cylinders for aircraft use.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... cylinder. (d) Manufacture. Cylinders must be manufactured using equipment and processes adequate to ensure.... (h) Openings in cylinders and connections (valves, fuse plugs, etc.) for those openings. Threads conforming to the following are required on openings: (1) Threads must be clean cut, even, without cracks...

  12. In-Process Quality Control in Apparel Production: Sewing Defects

    DTIC Science & Technology

    1991-10-01

    runout 2. unbalanced stitch, stitch length variation Operator: 1. Raw edge, ply misalignment, sewing off of garment Thread: 1. Thread damage, broken...sewing machine. Two measurements were made for each position of the handwheel. The method used for marking the wheel is described below. A special tape

  13. How to review 4 million lines of ATLAS code

    NASA Astrophysics Data System (ADS)

    Stewart, Graeme A.; Lampl, Walter; ATLAS Collaboration

    2017-10-01

    As the ATLAS Experiment prepares to move to a multi-threaded framework (AthenaMT) for Run3, we are faced with the problem of how to migrate 4 million lines of C++ source code. This code has been written over the past 15 years and has often been adapted, re-written or extended to the changing requirements and circumstances of LHC data taking. The code was developed by different authors, many of whom are no longer active, and under the deep assumption that processing ATLAS data would be done in a serial fashion. In order to understand the scale of the problem faced by the ATLAS software community, and to plan appropriately the significant efforts posed by the new AthenaMT framework, ATLAS embarked on a wide ranging review of our offline code, covering all areas of activity: event generation, simulation, trigger, reconstruction. We discuss the difficulties in even logistically organising such reviews in an already busy community, how to examine areas in sufficient depth to learn key areas in need of upgrade, yet also to finish the reviews in a timely fashion. We show how the reviews were organised and how the ouptuts were captured in a way that the sub-system communities could then tackle the problems uncovered on a realistic timeline. Further, we discuss how the review has inuenced the overall planning for the Run 3 ATLAS offline code.

  14. [Integrated Development of Full-automatic Fluorescence Analyzer].

    PubMed

    Zhang, Mei; Lin, Zhibo; Yuan, Peng; Yao, Zhifeng; Hu, Yueming

    2015-10-01

    In view of the fact that medical inspection equipment sold in the domestic market is mainly imported from abroad and very expensive, we developed a full-automatic fluorescence analyzer in our center, presented in this paper. The present paper introduces the hardware architecture design of FPGA/DSP motion controlling card+PC+ STM32 embedded micro processing unit, software system based on C# multi thread, design and implementation of double-unit communication in detail. By simplifying the hardware structure, selecting hardware legitimately and adopting control system software to object-oriented technology, we have improved the precision and velocity of the control system significantly. Finally, the performance test showed that the control system could meet the needs of automated fluorescence analyzer on the functionality, performance and cost.

  15. Distributed run of a one-dimensional model in a regional application using SOAP-based web services

    NASA Astrophysics Data System (ADS)

    Smiatek, Gerhard

    This article describes the setup of a distributed computing system in Perl. It facilitates the parallel run of a one-dimensional environmental model on a number of simple network PC hosts. The system uses Simple Object Access Protocol (SOAP) driven web services offering the model run on remote hosts and a multi-thread environment distributing the work and accessing the web services. Its application is demonstrated in a regional run of a process-oriented biogenic emission model for the area of Germany. Within a network consisting of up to seven web services implemented on Linux and MS-Windows hosts, a performance increase of approximately 400% has been reached compared to a model run on the fastest single host.

  16. Optimisation of multi-layer rotationally moulded foamed structures

    NASA Astrophysics Data System (ADS)

    Pritchard, A. J.; McCourt, M. P.; Kearns, M. P.; Martin, P. J.; Cunningham, E.

    2018-05-01

    Multi-layer skin-foam and skin-foam-skin sandwich constructions are of increasing interest in the rotational moulding process for two reasons. Firstly, multi-layer constructions can improve the thermal insulation properties of a part. Secondly, foamed polyethylene sandwiched between solid polyethylene skins can increase the mechanical properties of rotationally moulded structural components, in particular increasing flexural properties and impact strength (IS). The processing of multiple layers of polyethylene and polyethylene foam presents unique challenges such as the control of chemical blowing agent decomposition temperature, and the optimisation of cooling rates to prevent destruction of the foam core; therefore, precise temperature control is paramount to success. Long cooling cycle times are associated with the creation of multi-layer foam parts due to their insulative nature; consequently, often making the costs of production prohibitive. Devices such as Rotocooler®, a rapid internal mould water spray cooling system, have been shown to have the potential to significantly decrease cooling times in rotational moulding. It is essential to monitor and control such devices to minimise the warpage associated with the rapid cooling of a moulding from only one side. The work presented here demonstrates the use of threaded thermocouples to monitor the polymer melt in multi-layer sandwich constructions, in order to analyse the cooling cycle of multi-layer foamed structures. A series of polyethylene skin-foam test mouldings were produced, and the effect of cooling medium on foam characteristics, mechanical properties, and process cycle time were investigated. Cooling cycle time reductions of 45%, 26%, and 29% were found for increasing (1%, 2%, and 3%) chemical blowing agent (CBA) amount when using internal water cooling technology from ˜123°C compared with forced air cooling (FAC). Subsequently, a reduction of IS for the same skin-foam parts was found to be 1%, 4%, and 16% compared with FAC.

  17. Multi-Station Broad Regional Event Detection Using Waveform Correlation

    NASA Astrophysics Data System (ADS)

    Slinkard, M.; Stephen, H.; Young, C. J.; Eckert, R.; Schaff, D. P.; Richards, P. G.

    2013-12-01

    Previous waveform correlation studies have established the occurrence of repeating seismic events in various regions, and the utility of waveform-correlation event-detection on broad regional or even global scales to find events currently not included in traditionally-prepared bulletins. The computational burden, however, is high, limiting previous experiments to relatively modest template libraries and/or processing time periods. We have developed a distributed computing waveform correlation event detection utility that allows us to process years of continuous waveform data with template libraries numbering in the thousands. We have used this system to process several years of waveform data from IRIS stations in East Asia, using libraries of template events taken from global and regional bulletins. Detections at a given station are confirmed by 1) comparison with independent bulletins of seismicity, and 2) consistent detections at other stations. We find that many of the detected events are not in traditional catalogs, hence the multi-station comparison is essential. In addition to detecting the similar events, we also estimate magnitudes very precisely based on comparison with the template events (when magnitudes are available). We have investigated magnitude variation within detected families of similar events, false alarm rates, and the temporal and spatial reach of templates.

  18. Multigrid Equation Solvers for Large Scale Nonlinear Finite Element Simulations

    DTIC Science & Technology

    1999-01-01

    purpose of the second partitioning phase , on each SMP, is to minimize the communication within the SMP; even if a multi - threaded matrix vector product...8.7 Comparison of model with experimental data for send phase of matrix vector product on ne grid...140 8.4 Matrix vector product phase times : : : : : : : : : : : : : : : : : : : : : : : 145 9.1 Flat and

  19. Final report for the Tera Computer TTI CRADA

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Davidson, G.S.; Pavlakos, C.; Silva, C.

    1997-01-01

    Tera Computer and Sandia National Laboratories have completed a CRADA, which examined the Tera Multi-Threaded Architecture (MTA) for use with large codes of importance to industry and DOE. The MTA is an innovative architecture that uses parallelism to mask latency between memories and processors. The physical implementation is a parallel computer with high cross-section bandwidth and GaAs processors designed by Tera, which support many small computation threads and fast, lightweight context switches between them. When any thread blocks while waiting for memory accesses to complete, another thread immediately begins execution so that high CPU utilization is maintained. The Tera MTAmore » parallel computer has a single, global address space, which is appealing when porting existing applications to a parallel computer. This ease of porting is further enabled by compiler technology that helps break computations into parallel threads. DOE and Sandia National Laboratories were interested in working with Tera to further develop this computing concept. While Tera Computer would continue the hardware development and compiler research, Sandia National Laboratories would work with Tera to ensure that their compilers worked well with important Sandia codes, most particularly CTH, a shock physics code used for weapon safety computations. In addition to that important code, Sandia National Laboratories would complete research on a robotic path planning code, SANDROS, which is important in manufacturing applications, and would evaluate the MTA performance on this code. Finally, Sandia would work directly with Tera to develop 3D visualization codes, which would be appropriate for use with the MTA. Each of these tasks has been completed to the extent possible, given that Tera has just completed the MTA hardware. All of the CRADA work had to be done on simulators.« less

  20. The Two Micron All Sky Survey

    NASA Astrophysics Data System (ADS)

    Lonsdale, Carol

    The 2 Micron All Sky Survey (2MASS) project, a collaboration between the University of Massachusetts (Dr. Mike Skrutskie, PI) and the Infrared Processing and Analysis Center, JPL/Caltech funded primarily by NASA and the NSF, will scan the entire sky utilizing two new, highly automated 1.3m telescopes at Mt. Hopkins, AZ and at CTIO, Chile. Each telescope simultaneously scans the sky at J, H and Ks with a three channel camera using 256x256 arrays of HgCdTe detectors to detect point sources brighter than about 1 mJy (to SNR=10), with a pixel size of 2.0 arcseconds. The data rate is $\\sim 19$ Gbyte per night, with a total processed data volume of 13 Tbytes of images and 0.5 Tbyte of tabular data. The 2MASS data is archived nightly into the Infrared Science Information System at IPAC, which is based on an Informix database engine, judged at the time of purchase to have the best commercially available indexing and parallelization flexibility, and a 5 Tbyte-capacity RAID multi-threaded disk system with multi-server shared disk architecture. I will discuss the challenges of processing and archiving the 2MASS data, and of supporting intelligent query access to them by the astronomical community across the net, including possibilities for cross-correlation with other remote data sets.

  1. Reactor shroud joint

    DOEpatents

    Ballas, Gary J.; Fife, Alex Blair; Ganz, Israel

    1998-01-01

    A shroud for a nuclear reactor is described. In one embodiment, the shroud includes first and second shroud sections, and each shroud section includes a substantially cylindrical main body having a first end and a second end. With respect to each shroud section, a flange is located at the main body first end, and the flange has a plurality of bolt openings therein and a plurality of scalloped regions. The first shroud section is welded to the second shroud section, and at least some of the bolt openings in the first shroud section flange align with respective bolt openings in the second shroud section flange. In the event that the onset of inter-granular stress corrosion cracking is ever detected in the weld between the shroud section, bolts are inserted through bolt openings in the first shroud section flange and through aligned bolt openings the second shroud section flange. Each bolt, in one embodiment, has a shank section and first and second threaded end sections. Nuts are threadedly engaged to the threaded end sections and tightened against the respective flanges.

  2. Robotic technology results in faster and more robust surgical skill acquisition than traditional laparoscopy.

    PubMed

    Moore, Lee J; Wilson, Mark R; Waine, Elizabeth; Masters, Rich S W; McGrath, John S; Vine, Samuel J

    2015-03-01

    Technical surgical skills are said to be acquired quicker on a robotic rather than laparoscopic platform. However, research examining this proposition is scarce. Thus, this study aimed to compare the performance and learning curves of novices acquiring skills using a robotic or laparoscopic system, and to examine if any learning advantages were maintained over time and transferred to more difficult and stressful tasks. Forty novice participants were randomly assigned to either a robotic- or laparoscopic-trained group. Following one baseline trial on a ball pick-and-drop task, participants performed 50 learning trials. Participants then completed an immediate retention trial and a transfer trial on a two-instrument rope-threading task. One month later, participants performed a delayed retention trial and a stressful multi-tasking trial. The results revealed that the robotic-trained group completed the ball pick-and-drop task more quickly and accurately than the laparoscopic-trained group across baseline, immediate retention, and delayed retention trials. Furthermore, the robotic-trained group displayed a shorter learning curve for accuracy. The robotic-trained group also performed the more complex rope-threading and stressful multi-tasking transfer trials better. Finally, in the multi-tasking trial, the robotic-trained group made fewer tone counting errors. The results highlight the benefits of using robotic technology for the acquisition of technical surgical skills.

  3. Benchmarking high performance computing architectures with CMS’ skeleton framework

    NASA Astrophysics Data System (ADS)

    Sexton-Kennedy, E.; Gartung, P.; Jones, C. D.

    2017-10-01

    In 2012 CMS evaluated which underlying concurrency technology would be the best to use for its multi-threaded framework. The available technologies were evaluated on the high throughput computing systems dominating the resources in use at that time. A skeleton framework benchmarking suite that emulates the tasks performed within a CMSSW application was used to select Intel’s Thread Building Block library, based on the measured overheads in both memory and CPU on the different technologies benchmarked. In 2016 CMS will get access to high performance computing resources that use new many core architectures; machines such as Cori Phase 1&2, Theta, Mira. Because of this we have revived the 2012 benchmark to test it’s performance and conclusions on these new architectures. This talk will discuss the results of this exercise.

  4. WOMBAT: A Scalable and High-performance Astrophysical Magnetohydrodynamics Code

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mendygral, P. J.; Radcliffe, N.; Kandalla, K.

    2017-02-01

    We present a new code for astrophysical magnetohydrodynamics specifically designed and optimized for high performance and scaling on modern and future supercomputers. We describe a novel hybrid OpenMP/MPI programming model that emerged from a collaboration between Cray, Inc. and the University of Minnesota. This design utilizes MPI-RMA optimized for thread scaling, which allows the code to run extremely efficiently at very high thread counts ideal for the latest generation of multi-core and many-core architectures. Such performance characteristics are needed in the era of “exascale” computing. We describe and demonstrate our high-performance design in detail with the intent that it maymore » be used as a model for other, future astrophysical codes intended for applications demanding exceptional performance.« less

  5. Linking consistency with object/thread semantics - An approach to robust computation

    NASA Technical Reports Server (NTRS)

    Chen, Raymond C.; Dasgupta, Partha

    1989-01-01

    This paper presents an object/thread based paradigm that links data consistency with object/thread semantics. The paradigm can be used to achieve a wide range of consistency semantics from strict atomic transactions to standard process semantics. The paradigm supports three types of data consistency. Object programmers indicate the type of consistency desired on a per-operation basis and the system performs automatic concurrency control and recovery management to ensure that those consistency requirements are met. This allows programmers to customize consistency and recovery on a per-application basis without having to supply complicated, custom recovery management schemes. The paradigm allows robust and nonrobust computation to operate concurrently on the same data in a well defined manner. The operating system needs to support only one vehicle of computation - the thread.

  6. Computer simulations and real-time control of ELT AO systems using graphical processing units

    NASA Astrophysics Data System (ADS)

    Wang, Lianqi; Ellerbroek, Brent

    2012-07-01

    The adaptive optics (AO) simulations at the Thirty Meter Telescope (TMT) have been carried out using the efficient, C based multi-threaded adaptive optics simulator (MAOS, http://github.com/lianqiw/maos). By porting time-critical parts of MAOS to graphical processing units (GPU) using NVIDIA CUDA technology, we achieved a 10 fold speed up for each GTX 580 GPU used compared to a modern quad core CPU. Each time step of full scale end to end simulation for the TMT narrow field infrared AO system (NFIRAOS) takes only 0.11 second in a desktop with two GTX 580s. We also demonstrate that the TMT minimum variance reconstructor can be assembled in matrix vector multiply (MVM) format in 8 seconds with 8 GTX 580 GPUs, meeting the TMT requirement for updating the reconstructor. Analysis show that it is also possible to apply the MVM using 8 GTX 580s within the required latency.

  7. Genetic algorithms for protein threading.

    PubMed

    Yadgari, J; Amir, A; Unger, R

    1998-01-01

    Despite many years of efforts, a direct prediction of protein structure from sequence is still not possible. As a result, in the last few years researchers have started to address the "inverse folding problem": Identifying and aligning a sequence to the fold with which it is most compatible, a process known as "threading". In two meetings in which protein folding predictions were objectively evaluated, it became clear that threading as a concept promises a real breakthrough, but that much improvement is still needed in the technique itself. Threading is a NP-hard problem, and thus no general polynomial solution can be expected. Still a practical approach with demonstrated ability to find optimal solutions in many cases, and acceptable solutions in other cases, is needed. We applied the technique of Genetic Algorithms in order to significantly improve the ability of threading algorithms to find the optimal alignment of a sequence to a structure, i.e. the alignment with the minimum free energy. A major progress reported here is the design of a representation of the threading alignment as a string of fixed length. With this representation validation of alignments and genetic operators are effectively implemented. Appropriate data structure and parameters have been selected. It is shown that Genetic Algorithm threading is effective and is able to find the optimal alignment in a few test cases. Furthermore, the described algorithm is shown to perform well even without pre-definition of core elements. Existing threading methods are dependent on such constraints to make their calculations feasible. But the concept of core elements is inherently arbitrary and should be avoided if possible. While a rigorous proof is hard to submit yet an, we present indications that indeed Genetic Algorithm threading is capable of finding consistently good solutions of full alignments in search spaces of size up to 10(70).

  8. Optical method for measuring the surface area of a threaded fastener

    Treesearch

    Douglas Rammer; Samuel Zelinka

    2010-01-01

    This article highlights major aspects of a new optical technique to determine the surface area of a threaded fastener; the theoretical framework has been reported elsewhere. Specifically, this article describes general surface area expressions used in the analysis, details of image acquisition system, and major image processing steps contained within the measurement...

  9. A MICRO DISSECTION OF THE PACHYTENE THREADS OF TRADESCANTIA VIRGINICA L. WITH OBSERVATIONS ON SOME ASPECTS OF MITOSIS.

    PubMed

    Sands, H C

    1925-11-20

    A micro dissection of the pachytene threads of Tradescantia virginica L. shows that the relation of the chromosomes is a matter of continuous linkage in a chain and that, undoubtedly, division and segregation are everywhere processes of abstriction with subsequent mechanical distribution of the elements.

  10. 16 CFR § 1615.31 - Labeling, recordkeeping, advertising, retail display and guaranties.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... records required must establish a line of continuity through the process of manufacture of each production... content, and details of construction on all seams, fabrics, threads, stitches, and trims used in each..., seams, threads, stitches, and trims used in such prototype testing, relating such samples to the records...

  11. 16 CFR 1616.31 - Labeling, recordkeeping, retail display and guaranties.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... process of manufacture of each production unit of articles of children's sleepwear, or fabrics or related... content, and details of construction on all seams, fabrics, threads, stitches, and trims used in each... sufficient to repeat the prototype tests required by § 1616.4 of the Standard for all fabrics, seams, threads...

  12. 16 CFR § 1616.31 - Labeling, recordkeeping, retail display and guaranties.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... process of manufacture of each production unit of articles of children's sleepwear, or fabrics or related... content, and details of construction on all seams, fabrics, threads, stitches, and trims used in each... sufficient to repeat the prototype tests required by § 1616.4 of the Standard for all fabrics, seams, threads...

  13. 75 FR 7236 - Narrow Woven Ribbons with Woven Selvedge from Taiwan: Preliminary Determination of Sales at Less...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-02-18

    ... their manufacturing processes, as well as their purchases of ribbons from unaffiliated suppliers. Also..., polypropylene, and polyethylene teraphthalate), metal threads and/or metalized yarns, or any combination thereof... in the Harmonized Tariff Schedule of the United States (HTSUS), Section XI, Note 13) or rubber thread...

  14. 40 CFR 63.4281 - Am I subject to this subpart?

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ..., sheets, tents, threads and V-belts. The coating and printing subcategory includes any fabric or other... subcategory includes any operation with slashing operations as defined in § 63.4371. In the slashing process... threads. (b) You are subject to this subpart if you own or operate a new, reconstructed, or existing...

  15. 40 CFR 63.4281 - Am I subject to this subpart?

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... fabric, rainwear, sheets, tents, threads and V-belts. The coating and printing subcategory includes any... slashing process, sizing compounds are applied to warp yarn to bind the fiber together and stiffen the yarn..., sheets, towels, and threads. (b) You are subject to this subpart if you own or operate a new...

  16. 16 CFR 1616.31 - Labeling, recordkeeping, retail display and guaranties.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... process of manufacture of each production unit of articles of children's sleepwear, or fabrics or related... content, and details of construction on all seams, fabrics, threads, stitches, and trims used in each... sufficient to repeat the prototype tests required by § 1616.4 of the Standard for all fabrics, seams, threads...

  17. 40 CFR 63.4281 - Am I subject to this subpart?

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ..., sheets, tents, threads and V-belts. The coating and printing subcategory includes any fabric or other... subcategory includes any operation with slashing operations as defined in § 63.4371. In the slashing process... threads. (b) You are subject to this subpart if you own or operate a new, reconstructed, or existing...

  18. 40 CFR 63.4281 - Am I subject to this subpart?

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... fabric, rainwear, sheets, tents, threads and V-belts. The coating and printing subcategory includes any... slashing process, sizing compounds are applied to warp yarn to bind the fiber together and stiffen the yarn..., sheets, towels, and threads. (b) You are subject to this subpart if you own or operate a new...

  19. 40 CFR 63.4281 - Am I subject to this subpart?

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... fabric, rainwear, sheets, tents, threads and V-belts. The coating and printing subcategory includes any... slashing process, sizing compounds are applied to warp yarn to bind the fiber together and stiffen the yarn..., sheets, towels, and threads. (b) You are subject to this subpart if you own or operate a new...

  20. 16 CFR 1616.31 - Labeling, recordkeeping, retail display and guaranties.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... process of manufacture of each production unit of articles of children's sleepwear, or fabrics or related... content, and details of construction on all seams, fabrics, threads, stitches, and trims used in each... sufficient to repeat the prototype tests required by § 1616.4 of the Standard for all fabrics, seams, threads...

  1. Thread-Like CMOS Logic Circuits Enabled by Reel-Processed Single-Walled Carbon Nanotube Transistors via Selective Doping.

    PubMed

    Heo, Jae Sang; Kim, Taehoon; Ban, Seok-Gyu; Kim, Daesik; Lee, Jun Ho; Jur, Jesse S; Kim, Myung-Gil; Kim, Yong-Hoon; Hong, Yongtaek; Park, Sung Kyu

    2017-08-01

    The realization of large-area electronics with full integration of 1D thread-like devices may open up a new era for ultraflexible and human adaptable electronic systems because of their potential advantages in demonstrating scalable complex circuitry by a simply integrated weaving technology. More importantly, the thread-like fiber electronic devices can be achieved using a simple reel-to-reel process, which is strongly required for low-cost and scalable manufacturing technology. Here, high-performance reel-processed complementary metal-oxide-semiconductor (CMOS) integrated circuits are reported on 1D fiber substrates by using selectively chemical-doped single-walled carbon nanotube (SWCNT) transistors. With the introduction of selective n-type doping and a nonrelief photochemical patterning process, p- and n-type SWCNT transistors are successfully implemented on cylindrical fiber substrates under air ambient, enabling high-performance and reliable thread-like CMOS inverter circuits. In addition, it is noteworthy that the optimized reel-coating process can facilitate improvement in the arrangement of SWCNTs, building uniformly well-aligned SWCNT channels, and enhancement of the electrical performance of the devices. The p- and n-type SWCNT transistors exhibit field-effect mobility of 4.03 and 2.15 cm 2 V -1 s -1 , respectively, with relatively narrow distribution. Moreover, the SWCNT CMOS inverter circuits demonstrate a gain of 6.76 and relatively good dynamic operation at a supply voltage of 5.0 V. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. The CUBLAS and CULA based GPU acceleration of adaptive finite element framework for bioluminescence tomography.

    PubMed

    Zhang, Bo; Yang, Xiang; Yang, Fei; Yang, Xin; Qin, Chenghu; Han, Dong; Ma, Xibo; Liu, Kai; Tian, Jie

    2010-09-13

    In molecular imaging (MI), especially the optical molecular imaging, bioluminescence tomography (BLT) emerges as an effective imaging modality for small animal imaging. The finite element methods (FEMs), especially the adaptive finite element (AFE) framework, play an important role in BLT. The processing speed of the FEMs and the AFE framework still needs to be improved, although the multi-thread CPU technology and the multi CPU technology have already been applied. In this paper, we for the first time introduce a new kind of acceleration technology to accelerate the AFE framework for BLT, using the graphics processing unit (GPU). Besides the processing speed, the GPU technology can get a balance between the cost and performance. The CUBLAS and CULA are two main important and powerful libraries for programming on NVIDIA GPUs. With the help of CUBLAS and CULA, it is easy to code on NVIDIA GPU and there is no need to worry about the details about the hardware environment of a specific GPU. The numerical experiments are designed to show the necessity, effect and application of the proposed CUBLAS and CULA based GPU acceleration. From the results of the experiments, we can reach the conclusion that the proposed CUBLAS and CULA based GPU acceleration method can improve the processing speed of the AFE framework very much while getting a balance between cost and performance.

  3. Multiprocessor switch with selective pairing

    DOEpatents

    Gara, Alan; Gschwind, Michael K; Salapura, Valentina

    2014-03-11

    System, method and computer program product for a multiprocessing system to offer selective pairing of processor cores for increased processing reliability. A selective pairing facility is provided that selectively connects, i.e., pairs, multiple microprocessor or processor cores to provide one highly reliable thread (or thread group). Each paired microprocessor or processor cores that provide one highly reliable thread for high-reliability connect with a system components such as a memory "nest" (or memory hierarchy), an optional system controller, and optional interrupt controller, optional I/O or peripheral devices, etc. The memory nest is attached to a selective pairing facility via a switch or a bus

  4. Earth Science Data Fusion with Event Building Approach

    NASA Technical Reports Server (NTRS)

    Lukashin, C.; Bartle, Ar.; Callaway, E.; Gyurjyan, V.; Mancilla, S.; Oyarzun, R.; Vakhnin, A.

    2015-01-01

    Objectives of the NASA Information And Data System (NAIADS) project are to develop a prototype of a conceptually new middleware framework to modernize and significantly improve efficiency of the Earth Science data fusion, big data processing and analytics. The key components of the NAIADS include: Service Oriented Architecture (SOA) multi-lingual framework, multi-sensor coincident data Predictor, fast into-memory data Staging, multi-sensor data-Event Builder, complete data-Event streaming (a work flow with minimized IO), on-line data processing control and analytics services. The NAIADS project is leveraging CLARA framework, developed in Jefferson Lab, and integrated with the ZeroMQ messaging library. The science services are prototyped and incorporated into the system. Merging the SCIAMACHY Level-1 observations and MODIS/Terra Level-2 (Clouds and Aerosols) data products, and ECMWF re- analysis will be used for NAIADS demonstration and performance tests in compute Cloud and Cluster environments.

  5. Defect structure of high temperature hydride vapor phase epitaxy-grown epitaxial (0 0 0 1) AlN/sapphire using growth mode modification process

    NASA Astrophysics Data System (ADS)

    Su, Xujun; Zhang, Jicai; Huang, Jun; Zhang, Jinping; Wang, Jianfeng; Xu, Ke

    2017-06-01

    Defect structures were investigated by transmission electron microscopy for AlN/sapphire (0 0 0 1) epilayers grown by high temperature hydride vapor phase epitaxy using a growth mode modification process. The defect structures, including threading dislocations, inversion domains, and voids, were analyzed by diffraction contrast, high-resolution imaging, and convergent beam diffraction. AlN film growth was initiated at 1450 °C with high V/III ratio for 8 min. This was followed by low V/III ratio growth for 12 min. The near-interfacial region shows a high density of threading dislocations and inversion domains. Most of these dislocations have Burgers vector b = 1/3〈1 1 2 0〉 and were reduced with the formation of dislocation loops. In the middle range 400 nm < h < 2 μm, dislocations gradually aggregated and reduced to ∼109 cm-2. The inversion domains have a shuttle-like shape with staggered boundaries that deviate by ∼ ±5° from the c axis. Above 2 μm thickness, the film consists of isolated threading dislocations with a total density of 8 × 108 cm-2. Most of threading dislocations are either pure edge or mixed dislocations. The threading dislocation reduction in these films is associated with dislocation loops formation and dislocation aggregation-interaction during island growth with high V/III ratio.

  6. Scaling-up NLP Pipelines to Process Large Corpora of Clinical Notes.

    PubMed

    Divita, G; Carter, M; Redd, A; Zeng, Q; Gupta, K; Trautner, B; Samore, M; Gundlapalli, A

    2015-01-01

    This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare". This paper describes the scale-up efforts at the VA Salt Lake City Health Care System to address processing large corpora of clinical notes through a natural language processing (NLP) pipeline. The use case described is a current project focused on detecting the presence of an indwelling urinary catheter in hospitalized patients and subsequent catheter-associated urinary tract infections. An NLP algorithm using v3NLP was developed to detect the presence of an indwelling urinary catheter in hospitalized patients. The algorithm was tested on a small corpus of notes on patients for whom the presence or absence of a catheter was already known (reference standard). In planning for a scale-up, we estimated that the original algorithm would have taken 2.4 days to run on a larger corpus of notes for this project (550,000 notes), and 27 days for a corpus of 6 million records representative of a national sample of notes. We approached scaling-up NLP pipelines through three techniques: pipeline replication via multi-threading, intra-annotator threading for tasks that can be further decomposed, and remote annotator services which enable annotator scale-out. The scale-up resulted in reducing the average time to process a record from 206 milliseconds to 17 milliseconds or a 12- fold increase in performance when applied to a corpus of 550,000 notes. Purposely simplistic in nature, these scale-up efforts are the straight forward evolution from small scale NLP processing to larger scale extraction without incurring associated complexities that are inherited by the use of the underlying UIMA framework. These efforts represent generalizable and widely applicable techniques that will aid other computationally complex NLP pipelines that are of need to be scaled out for processing and analyzing big data.

  7. The effects of wildfire on native tree species in the Middle Rio Grande bosques of New Mexico

    Treesearch

    Brad Johnson; David Merritt

    2009-01-01

    The cottonwood bosques along the Middle Fork of the Rio Grande (MRG) form a ribbon of surviving habitat in this once vast ecosystem. Historically, the channel had a multi-threaded and braided configuration that created a rich mosaic of habitats, including mixed-aged cottonwood forests, meadows, and willow-dominated riparian wetlands and backwaters (...

  8. The Reconstruction Toolkit (RTK), an open-source cone-beam CT reconstruction toolkit based on the Insight Toolkit (ITK)

    NASA Astrophysics Data System (ADS)

    Rit, S.; Vila Oliva, M.; Brousmiche, S.; Labarbe, R.; Sarrut, D.; Sharp, G. C.

    2014-03-01

    We propose the Reconstruction Toolkit (RTK, http://www.openrtk.org), an open-source toolkit for fast cone-beam CT reconstruction, based on the Insight Toolkit (ITK) and using GPU code extracted from Plastimatch. RTK is developed by an open consortium (see affiliations) under the non-contaminating Apache 2.0 license. The quality of the platform is daily checked with regression tests in partnership with Kitware, the company supporting ITK. Several features are already available: Elekta, Varian and IBA inputs, multi-threaded Feldkamp-David-Kress reconstruction on CPU and GPU, Parker short scan weighting, multi-threaded CPU and GPU forward projectors, etc. Each feature is either accessible through command line tools or C++ classes that can be included in independent software. A MIDAS community has been opened to share CatPhan datasets of several vendors (Elekta, Varian and IBA). RTK will be used in the upcoming cone-beam CT scanner developed by IBA for proton therapy rooms. Many features are under development: new input format support, iterative reconstruction, hybrid Monte Carlo / deterministic CBCT simulation, etc. RTK has been built to freely share tomographic reconstruction developments between researchers and is open for new contributions.

  9. 78 FR 45505 - Certain Oil Country Tubular Goods from India, the Republic of Korea, the Republic of the...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-07-29

    ... process based on the average import value of other ferrous waste and scrap using HTSUS subheadings 7204.41... products) or unfinished (including green tubes and limited service OCTG products), whether or not thread...; drill pipe; unattached couplings; and unattached thread protectors. The merchandise subject to the...

  10. Weaving the Cloth of Literacy: The Relationship between Braille and Reading.

    ERIC Educational Resources Information Center

    Zago, P. A.

    1996-01-01

    This brief article uses a weaving metaphor to address the relationship between braille and reading, with prior knowledge (the warp threads) interacting with reading content (the weft threads) by means of braille (the shuttle). Reading is seen to be an interactive process between reader, text, and the context of the reading situation. (DB)

  11. 75 FR 75664 - Determination Under the Textile and Apparel Commercial Availability Provision of the Dominican...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-12-06

    .../or staple fiber; textured and/or non- textured). Yarn Size(s): Various. Thread Count (warp): 43 to 56 ends per cm. Thread Count (weft): 29 to 38 filling pics per cm. Weave Type: Woven twill. Fabric Weight...-dyed. Finishing Processes: Napped on both sides. Janet E. Heinzen, Acting Chairman, Committee for the...

  12. 77 FR 34013 - Certain Oil Country Tubular Goods From the People's Republic of China: Preliminary Results of the...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-06-08

    ... (including green tubes and limited service OCTG products), whether or not thread protectors are attached. The... thread protectors. The merchandise covered by the order is currently classified in the Harmonized Tariff....1: Non-Market Economy Surrogate Country Selection Process (March 1, 2004). \\21\\ See Factor Valuation...

  13. 78 FR 57808 - Rules and Regulations Under the Wool Products Labeling Act of 1939

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-09-20

    ... decorative thread in a garment or . . . where the presence of wool is insignificant.'' \\19\\ \\17\\ AAFA (5... the manufacturing process in the foreign country and in the United States.\\56\\ \\56\\ This provision... need for declaring the wool content when we find wool in a decorative thread in a garment or similar...

  14. 76 FR 21815 - Airworthiness Directives; The Boeing Company Model 737 Airplanes

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-04-19

    ...)(c) of this service bulletin provides instructions to check for obvious differences in thread shape between thread grooves ``as given in CMM 27-41-01.'' Air Tran noted that CMM 27-41-01 does not provide any... have revised this AD to refer to the new service information. We agree that processes referred to by...

  15. Space and Missile Systems Center Standard: Technical Requirements for Electronic Parts, Materials, and Processes used in Space Vehicles

    DTIC Science & Technology

    2013-04-12

    DTL-38999 Connector, Electrical, Circular, Miniature, High Density, Quick Disconnect (Bayonet, Threaded , and Breach Coupling), Environment Resistant ...186 Table 1160-1. Resistance Tolerance and Required Derating...For MIL-DTL-5015 Connector, Electrical, Circular Threaded , AN Type, General Specification for MIL-H-6088G(1) Heat Treatment of Aluminum Alloys

  16. Convergent Synthesis and Photoinduced Processes in Multi-Chromophoric Rotaxanes1

    PubMed Central

    Megiatto, Jackson D.; Li, Ke; Schuster, David I.; Palkar, Amit; Herranz, M. Ángeles; Echegoyen, Luis; Abwandner, Silke; de Miguel, Gustavo; Guldi, Dirk M.

    2010-01-01

    A series of [2]rotaxane materials, in which [60]fullerene is linked to a macrocycle and ferrocene (Fc) moieties are placed at the termini of a thread, both of which possess a central Cu(I)-1,10-phenanthroline [Cu(phen)2]+ complex, were synthesized by self-assembly using Sauvage metal template methodology. Two types of threads were constructed, one with terminal ester linkages, and a second with terminal 1,2,3-triazole linkages derived from Cu(I)-catalyzed “click” 1,3-cycloaddition reactions. Model compounds lacking the fullerene moiety were prepared in an analogous manner. The ability of the interlocked Fc-[Cu(phen)2]+-C60 hybrids to undergo electron transfer upon photoexcitation was investigated by means of time-resolved fluorescence and transient absorption spectroscopy, using excitation wavelengths directed at the fullerene and [Cu(phen)2]+ subunits. The energies of the electronic excited states and charge separated (CS) states that might be formed upon photoexcitation were determined from spectroscopic and electrochemical data. These studies showed that MLCT excited states of the copper complex in the fullerenerotaxanes were quenched by electron transfer to the fullerene, resulting in charge separated states with oxidized copper and reduced fullerene moieties, (Fc)2-[Cu(phen)2]2+-C60•−. Even though electron transfer from Fc to the oxidized copper complex is predicted to be exergonic by 0.18 eV, no unequivocal evidence in support of such a process was obtained. The conclusion that Fc plays no role in the photoinduced processes in our systems rests on the lack of enhancement of the lifetime of the charge separated state as measured by decay of C60•− at ~ 1000 nm, since one-electron oxidized Fc is very difficult to detect spectroscopically in the 500–800 nm spectral region. PMID:20518479

  17. Orthorectification by Using Gpgpu Method

    NASA Astrophysics Data System (ADS)

    Sahin, H.; Kulur, S.

    2012-07-01

    Thanks to the nature of the graphics processing, the newly released products offer highly parallel processing units with high-memory bandwidth and computational power of more than teraflops per second. The modern GPUs are not only powerful graphic engines but also they are high level parallel programmable processors with very fast computing capabilities and high-memory bandwidth speed compared to central processing units (CPU). Data-parallel computations can be shortly described as mapping data elements to parallel processing threads. The rapid development of GPUs programmability and capabilities attracted the attentions of researchers dealing with complex problems which need high level calculations. This interest has revealed the concepts of "General Purpose Computation on Graphics Processing Units (GPGPU)" and "stream processing". The graphic processors are powerful hardware which is really cheap and affordable. So the graphic processors became an alternative to computer processors. The graphic chips which were standard application hardware have been transformed into modern, powerful and programmable processors to meet the overall needs. Especially in recent years, the phenomenon of the usage of graphics processing units in general purpose computation has led the researchers and developers to this point. The biggest problem is that the graphics processing units use different programming models unlike current programming methods. Therefore, an efficient GPU programming requires re-coding of the current program algorithm by considering the limitations and the structure of the graphics hardware. Currently, multi-core processors can not be programmed by using traditional programming methods. Event procedure programming method can not be used for programming the multi-core processors. GPUs are especially effective in finding solution for repetition of the computing steps for many data elements when high accuracy is needed. Thus, it provides the computing process more quickly and accurately. Compared to the GPUs, CPUs which perform just one computing in a time according to the flow control are slower in performance. This structure can be evaluated for various applications of computer technology. In this study covers how general purpose parallel programming and computational power of the GPUs can be used in photogrammetric applications especially direct georeferencing. The direct georeferencing algorithm is coded by using GPGPU method and CUDA (Compute Unified Device Architecture) programming language. Results provided by this method were compared with the traditional CPU programming. In the other application the projective rectification is coded by using GPGPU method and CUDA programming language. Sample images of various sizes, as compared to the results of the program were evaluated. GPGPU method can be used especially in repetition of same computations on highly dense data, thus finding the solution quickly.

  18. How Does the Current Generation of Medical Students View the Radiology Match?: An Analysis of the AuntMinnie and Student Doctor Network Online Forums.

    PubMed

    Yi, Paul H; Novin, Sherwin; Vander Plas, Taylor L; Huh, Eric; Magid, Donna

    2018-06-01

    The AuntMinnie (AM) and the Student Doctor Network (SDN) online forums are popular resources for medical students applying for residency. The purpose of this study was to describe medical student radiology-related posts on AM and SDN to better understand the medical student perspective on the application and Match process. We reviewed all posts made on the AM and SDN online forums over 5 consecutive academic years from July 2012 to July 2017. Each thread was organized into one of six major categories. We quantified forum utilization over the past 5 years by the total number of and the most frequently posted and viewed thread topics. We reviewed 2683 total threads with 5,723,909 views. Total number of threads posted and viewed fell by 46% and 63%, respectively, from 2013-2014 to 2014-2015, after which they returned near baseline by 2016-2017, along with an increase in interventional radiology-related posts between 2012-2013 (13%) and 2016-2017 (32%) (P < .001). The most common application-related topics were preapplication and program ranking advice (20% of all threads and views). Many posts were related to postinterview communication with residency programs (2% of all threads and views). After a drop in 2013-2014, utilization of AM and SDN increased in 2016-2017, along with increased interest in interventional radiology. Addressing the student concerns identified in our study, especially in preparing residency applications, ranking programs, and navigating difficult situations, such as postinterview program communication, may improve the radiology application process for future medical students and their advisors. Copyright © 2018 The Association of University Radiologists. Published by Elsevier Inc. All rights reserved.

  19. VoiceThread as a Peer Review and Dissemination Tool for Undergraduate Research

    NASA Astrophysics Data System (ADS)

    Guertin, L. A.

    2012-12-01

    VoiceThread has been utilized in an undergraduate research methods course for peer review and final research project dissemination. VoiceThread (http://www.voicethread.com) can be considered a social media tool, as it is a web-based technology with the capacity to enable interactive dialogue. VoiceThread is an application that allows a user to place a media collection online containing images, audio, videos, documents, and/or presentations in an interface that facilitates asynchronous communication. Participants in a VoiceThread can be passive viewers of the online content or engaged commenters via text, audio, video, with slide annotations via a doodle tool. The VoiceThread, which runs across browsers and operating systems, can be public or private for viewing and commenting and can be embedded into any website. Although few university students are aware of the VoiceThread platform (only 10% of the students surveyed by Ng (2012)), the 2009 K-12 edition of The Horizon Report (Johnson et al., 2009) lists VoiceThread as a tool to watch because of the opportunities it provides as a collaborative learning environment. In Fall 2011, eleven students enrolled in an undergraduate research methods course at Penn State Brandywine each conducted their own small-scale research project. Upon conclusion of the projects, students were required to create a poster summarizing their work for peer review. To facilitate the peer review process outside of class, each student-created PowerPoint file was placed in a VoiceThread with private access to only the class members and instructor. Each student was assigned to peer review five different student posters (i.e., VoiceThread images) with the audio and doodle tools to comment on formatting, clarity of content, etc. After the peer reviews were complete, the students were allowed to edit their PowerPoint poster files for a new VoiceThread. In the new VoiceThread, students were required to video record themselves describing their research and taking the viewer through their poster in the VoiceThread. This new VoiceThread with their final presentations was open for public viewing but not public commenting. A formal assessment was not conducted on the student impact of using VoiceThread for peer review and final research presentations. From an instructional standpoint, requiring students to use audio for the peer review commenting seemed to result in lengthier and more detailed reviews, connected with specific poster features when the doodle tool was utilized. By recording themselves as a "talking head" for the final product, students were required to be comfortable and confident with presenting their research, similar to what would be expected at a conference presentation. VoiceThread is currently being tested in general education Earth science courses at Penn State Brandywine as a dissemination tool for classroom-based inquiry projects and recruitment tool for Earth & Mineral Science majors.

  20. Benchmarking high performance computing architectures with CMS’ skeleton framework

    DOE PAGES

    Sexton-Kennedy, E.; Gartung, P.; Jones, C. D.

    2017-11-23

    Here, in 2012 CMS evaluated which underlying concurrency technology would be the best to use for its multi-threaded framework. The available technologies were evaluated on the high throughput computing systems dominating the resources in use at that time. A skeleton framework benchmarking suite that emulates the tasks performed within a CMSSW application was used to select Intel’s Thread Building Block library, based on the measured overheads in both memory and CPU on the different technologies benchmarked. In 2016 CMS will get access to high performance computing resources that use new many core architectures; machines such as Cori Phase 1&2, Theta,more » Mira. Because of this we have revived the 2012 benchmark to test it’s performance and conclusions on these new architectures. This talk will discuss the results of this exercise.« less

  1. Benchmarking high performance computing architectures with CMS’ skeleton framework

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sexton-Kennedy, E.; Gartung, P.; Jones, C. D.

    Here, in 2012 CMS evaluated which underlying concurrency technology would be the best to use for its multi-threaded framework. The available technologies were evaluated on the high throughput computing systems dominating the resources in use at that time. A skeleton framework benchmarking suite that emulates the tasks performed within a CMSSW application was used to select Intel’s Thread Building Block library, based on the measured overheads in both memory and CPU on the different technologies benchmarked. In 2016 CMS will get access to high performance computing resources that use new many core architectures; machines such as Cori Phase 1&2, Theta,more » Mira. Because of this we have revived the 2012 benchmark to test it’s performance and conclusions on these new architectures. This talk will discuss the results of this exercise.« less

  2. GPU and APU computations of Finite Time Lyapunov Exponent fields

    NASA Astrophysics Data System (ADS)

    Conti, Christian; Rossinelli, Diego; Koumoutsakos, Petros

    2012-03-01

    We present GPU and APU accelerated computations of Finite-Time Lyapunov Exponent (FTLE) fields. The calculation of FTLEs is a computationally intensive process, as in order to obtain the sharp ridges associated with the Lagrangian Coherent Structures an extensive resampling of the flow field is required. The computational performance of this resampling is limited by the memory bandwidth of the underlying computer architecture. The present technique harnesses data-parallel execution of many-core architectures and relies on fast and accurate evaluations of moment conserving functions for the mesh to particle interpolations. We demonstrate how the computation of FTLEs can be efficiently performed on a GPU and on an APU through OpenCL and we report over one order of magnitude improvements over multi-threaded executions in FTLE computations of bluff body flows.

  3. Research on TCP/IP network communication based on Node.js

    NASA Astrophysics Data System (ADS)

    Huang, Jing; Cai, Lixiong

    2018-04-01

    In the face of big data, long connection and high synchronization, TCP/IP network communication will cause performance bottlenecks due to its blocking multi-threading service model. This paper presents a method of TCP/IP network communication protocol based on Node.js. On the basis of analyzing the characteristics of Node.js architecture and asynchronous non-blocking I/O model, the principle of its efficiency is discussed, and then compare and analyze the network communication model of TCP/IP protocol to expound the reasons why TCP/IP protocol stack is widely used in network communication. Finally, according to the large data and high concurrency in the large-scale grape growing environment monitoring process, a TCP server design based on Node.js is completed. The results show that the example runs stably and efficiently.

  4. The pleural curtain of the camel (Camelus dromedarius).

    PubMed

    Buzzell, Gerald R; Kinne, Joerg; Tariq, Saeed; Wernery, Ulrich

    2010-10-01

    The visceral pleura of the camel (Camelus dromedarius) possesses a fibrous curtain of pleural threads or extensions along its basal margins, which extends into the pleural cavity of the costophrenic recesses. These threads are lined by mesothelium and have a core or stroma, which is largely collagenous. Small threads are avascular and nearly acellular. In larger proximal threads, blood vessels in the stroma are often arranged in a branching network, with irregular endothelia surrounded by several incomplete basal laminae. Lymphocytes and other inflammatory cell types aggregate in the stroma near blood vessels. The threads are lined by typical mesothelium except in patches close to the main pleural surface. These patches consist of layers of loosely applied cells with numerous cellular processes and features suggestive of phagocytosis. The position of the pleural curtain in the costophrenic recess and the presence of possibly phagocytotic cells suggest that the pleural curtain stirs, samples, and cleans the pleural fluid. The pleural curtain appears to be a feature of camelids and has also been seen in giraffes. Copyright © 2010 Wiley-Liss, Inc.

  5. Experimental study of canvas characterization for paintings

    NASA Astrophysics Data System (ADS)

    Cornelis, Bruno; Dooms, Ann; Munteanu, Adrian; Cornelis, Jan; Schelkens, Peter

    2010-02-01

    The work described here fits in the context of a larger project on the objective and relevant characterization of paintings and painting canvas through the analysis of multimodal digital images. We captured, amongst others, X-ray images of different canvas types, characterized by a variety of textures and weave patterns (fine and rougher texture; single thread and multiple threads per weave), including raw canvas as well as canvas processed with different primers. In this paper, we study how to characterize the canvas by extracting global features such as average thread width, average distance between successive threads (i.e. thread density) and the spatial distribution of primers. These features are then used to construct a generic model of the canvas structure. Secondly, we investigate whether we can identify different pieces of canvas coming from the same bolt. This is an important element for dating, authentication and identification of restorations. Both the global characteristics mentioned earlier and some local properties (such as deviations from the average pattern model) are used to compare the "fingerprint" of different pieces of cloth coming from the same or different bolts.

  6. Research of thread rolling on difficult-to-cut material workpieces

    NASA Astrophysics Data System (ADS)

    Popov, A. Yu; Bugay, I. A.; Nazarov, P. V.; Evdokimova, O. P.; Popov, P. E.; Vasilyev, E. V.

    2018-01-01

    In medicine production Ti-6Al-4V Grade 5 alloys are used. One of the most important tasks is to increase the strength of the products and decrease in value. The possibility to roll special thread on Ti-6Al-4V Grade 5 alloy workpiece on 2-roller thread rolling machine has been studied. This is wrought alloy, treatment of which in cold condition causes difficulties due to low plasticity. To obtain Ti-6Al-4V Grade 5 alloy product with thread by rolling is rather difficult. This is due to large axial workpiece displacements resulting from large alloy resistance to cold plastic deformation. The provision of adequate kinematics requires experimental researches and the selection of modes - speed of rolling and pressure on the movable roller. The purpose of the work is to determine the optimal modes for rolling thread on titanium alloy workpiece. It has been stated that, after rolling, the product strength has increased up to 30%. As a result of the work, the unit has been made and recommendations to choose the optimal rolling process modes have been offered.

  7. International Space Station Powered Bolt Nut Anomaly and Failure Analysis Summary

    NASA Technical Reports Server (NTRS)

    Sievers, Daniel E.; Warden, Harry K.

    2010-01-01

    A key mechanism used in the on-orbit assembly of the International Space Station (ISS) pressurized elements is the Common Berthing Mechanism. The mechanism that effects the structural connection of the Common Berthing Mechanism halves is the Powered Bolt Assembly. There are sixteen Powered Bolt Assemblies per Common Berthing Mechanism. The Common Berthing Mechanism has a bolt which engages a self aligning Powered Bolt Nut (PBN) on the mating interface (Figure 1). The Powered Bolt Assemblies are preloaded to approximately 84.5 kN (19000 lb) prior to pressurization of the CBM. The PBNs mentioned below, manufactured in 2009, will be used on ISS future missions. An on orbit functional failure of this hardware would be unacceptable and in some instances catastrophic due to the failure of modules to mate and seal the atmosphere, risking loss of crew and ISS functions. The manufacturing processes that create the PBNs need to be strictly controlled. Functional (torque vs. tension) acceptance test failures will be the result of processes not being strictly followed. Without the proper knowledge of thread tolerances, fabrication techniques, and dry film lubricant application processes, PBNs will be, and have been manufactured improperly. The knowledge gained from acceptance test failures and the resolution of those failures, thread fabrication techniques and thread dry film lubrication processes can be applied to many aerospace mechanisms to enhance their performance. Test data and manufactured PBN thread geometry will be discussed for both failed and successfully accepted PBNs.

  8. International Space Station Powered Bolt Nut Anomaly and Failure Analysis Summary

    NASA Technical Reports Server (NTRS)

    Sievers, Daniel E.; Warden, Harry K.

    2010-01-01

    A key mechanism used in the on-orbit assembly of the International Space Station (ISS) pressurized elements is the Common Berthing Mechanism (CBM). The mechanism that effects the structural connection of the CBM halves is the Powered Bolt Assembly. There are sixteen Powered Bolt Assemblies per CBM. The CBM has a bolt which engages a self aligning Powered Bolt Nut (PBN) on the mating interface; see Figure 1. The Powered Bolt Assemblies are preloaded to approximately 19 kilo pounds (KIPs) prior to pressurization of the CBM. The PBNs mentioned below, manufactured in 2009, will be used on ISS future missions. An on orbit functional failure of this hardware would be unacceptable and in some instances catastrophic due to the failure of modules to mate and seal the atmosphere, risking loss of crew and ISS functions. The manufacturing processes which create the PBNs need to be strictly controlled. Functional (torque vs. tension) acceptance test failures will be the result of processes not being strictly followed. Without the proper knowledge of thread tolerances, fabrication techniques, and dry film lubricant application processes, PBNs will be, and have been manufactured improperly. The knowledge gained from acceptance test failures and the resolution of those failures, thread fabrication techniques and thread dry film lubrication processes can be applied to many aerospace mechanisms to enhance their performance. Test data and manufactured PBN thread geometry will be discussed for both failed and successfully accepted PBNs.

  9. Structure and Dynamics of Quiescent Prominence Eruptions

    NASA Astrophysics Data System (ADS)

    Lu, Muzhou; Su, Y.; Adriaan van Ballegooijen, A.

    2012-05-01

    We present a survey on the fine structure and dynamics of quiescent prominence eruptions observed both on the disk and at the limb. We have identified 45 quiescent prominence eruptions by looking at the SDO (Solar Dynamics Observatory)/AIA (Atmospheric Imaging Assembly) daily movies from April to June in 2011. Among these events, there are 24 symmetric eruptions (coherent loop-like eruptions) and 21 asymmetric eruptions (one footpoint lifts off) as shown by AIA and STEREO/EUVI observations. Vertical filament threads are identified in 10 out of the 45 events, while horizontal threads are observed in almost all eruptions. We find 23 events with twisting/untwisting motions. For 14 selected limb events, we carry out a detailed study of the eruption dynamics using AIA observations at 304 Å. We find that the initial heights of these erupting prominences are located around 50-110 Mm above the limb. The eruptions start from a speed of less than 5 km/s, then increase to several tens km/s in the AIA field of view. The maximum speed of these events is 50 km/s. The acceleration plots show a positive acceleration in the range of 0 to 20 m/s^2. No significant difference is identified in the dynamics of the symmetric and asymmetric eruptions. Acknowledgments. This project is supported by the NASA contract SP02H1701R from LMSAL to Smithsonian Astrophysical Observatory (SAO). M. Lu is supported under the NSF-REU solar physics program at SAO, grant number ATM-0851866.

  10. Scalable Algorithms for Parallel Discrete Event Simulation Systems in Multicore Environments

    DTIC Science & Technology

    2013-05-01

    consolidated at the sender side. At the receiver side, the messages are deconsolidated and delivered to the appropriate thread. This approach bears some...Jiang, S. Kini, W. Yu, D. Buntinas, P. Wyckoff, and D. Panda . Performance comparison of mpi implementations over infiniband, myrinet and quadrics

  11. 40 CFR Appendix H to Subpart G of... - Substitutes Subject to Use Restrictions and Unacceptable Substitutes, Effective May 28, 1999

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... inch or the thread direction must be reversed (i.e. right-handed vs. left-handed). Simply changing the thread pitch is not sufficient. For quick-connect fittings, “differ” means that a person using normal... Acceptable subject to narrowed use limits. Acceptable when manufactured using any process that does not...

  12. 40 CFR Appendix H to Subpart G of... - Substitutes Subject to Use Restrictions and Unacceptable Substitutes, Effective May 28, 1999

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... inch or the thread direction must be reversed (i.e. right-handed vs. left-handed). Simply changing the thread pitch is not sufficient. For quick-connect fittings, “differ” means that a person using normal... Acceptable subject to narrowed use limits. Acceptable when manufactured using any process that does not...

  13. 40 CFR Appendix H to Subpart G of... - Substitutes Subject to Use Restrictions and Unacceptable Substitutes, Effective May 28, 1999

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... inch or the thread direction must be reversed (i.e. right-handed vs. left-handed). Simply changing the thread pitch is not sufficient. For quick-connect fittings, “differ” means that a person using normal... Acceptable subject to narrowed use limits. Acceptable when manufactured using any process that does not...

  14. 16 CFR § 1632.4 - Mattress test procedure.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... threads per square inch and fabric weight of 3.7±0.8 oz/yd2 (125±28 gm/m2). The size of the sheet or.... The cigarettes shall be positioned directly over the thread or in the depression created by the quilting process on the half of the test surface reserved for bare mattress tests. If the quilt design is...

  15. Thread-like supercapacitors based on one-step spun nanocomposite yarns.

    PubMed

    Meng, Qinghai; Wang, Kai; Guo, Wei; Fang, Jin; Wei, Zhixiang; She, Xilin

    2014-08-13

    Thread-like electronic devices have attracted great interest because of their potential applications in wearable electronics. To produce high-performance, thread-like supercapacitors, a mixture of stable dispersions of single-walled carbon nanotubes and conducting polyaniline nanowires are prepared. Then, the mixture is spun into flexible yarns with a polyvinyl alcohol outer sheath by a one-step spinning process. The composite yarns show excellent mechanical properties and high electrical conductivities after sufficient washing to remove surfactants. After applying a further coating layer of gel electrolyte, two flexible yarns are twisted together to form a thread-like supercapacitor. The supercapacitor based on these two yarns (SWCNTs and PAniNWs) possesses a much higher specific capacitance than that based only on pure SWCNTs yarns, making it an ideal energy-storage device for wearable electronics. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering.

    PubMed

    Suzuki, Shuji; Kakuta, Masanori; Ishida, Takashi; Akiyama, Yutaka

    2016-01-01

    Sequence homology searches are used in various fields and require large amounts of computation time, especially for metagenomic analysis, owing to the large number of queries and the database size. To accelerate computing analyses, graphics processing units (GPUs) are widely used as a low-cost, high-performance computing platform. Therefore, we mapped the time-consuming steps involved in GHOSTZ, which is a state-of-the-art homology search algorithm for protein sequences, onto a GPU and implemented it as GHOSTZ-GPU. In addition, we optimized memory access for GPU calculations and for communication between the CPU and GPU. As per results of the evaluation test involving metagenomic data, GHOSTZ-GPU with 12 CPU threads and 1 GPU was approximately 3.0- to 4.1-fold faster than GHOSTZ with 12 CPU threads. Moreover, GHOSTZ-GPU with 12 CPU threads and 3 GPUs was approximately 5.8- to 7.7-fold faster than GHOSTZ with 12 CPU threads.

  17. Parallelization and checkpointing of GPU applications through program transformation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Solano-Quinde, Lizandro Damian

    2012-01-01

    GPUs have emerged as a powerful tool for accelerating general-purpose applications. The availability of programming languages that makes writing general-purpose applications for running on GPUs tractable have consolidated GPUs as an alternative for accelerating general purpose applications. Among the areas that have benefited from GPU acceleration are: signal and image processing, computational fluid dynamics, quantum chemistry, and, in general, the High Performance Computing (HPC) Industry. In order to continue to exploit higher levels of parallelism with GPUs, multi-GPU systems are gaining popularity. In this context, single-GPU applications are parallelized for running in multi-GPU systems. Furthermore, multi-GPU systems help to solvemore » the GPU memory limitation for applications with large application memory footprint. Parallelizing single-GPU applications has been approached by libraries that distribute the workload at runtime, however, they impose execution overhead and are not portable. On the other hand, on traditional CPU systems, parallelization has been approached through application transformation at pre-compile time, which enhances the application to distribute the workload at application level and does not have the issues of library-based approaches. Hence, a parallelization scheme for GPU systems based on application transformation is needed. Like any computing engine of today, reliability is also a concern in GPUs. GPUs are vulnerable to transient and permanent failures. Current checkpoint/restart techniques are not suitable for systems with GPUs. Checkpointing for GPU systems present new and interesting challenges, primarily due to the natural differences imposed by the hardware design, the memory subsystem architecture, the massive number of threads, and the limited amount of synchronization among threads. Therefore, a checkpoint/restart technique suitable for GPU systems is needed. The goal of this work is to exploit higher levels of parallelism and to develop support for application-level fault tolerance in applications using multiple GPUs. Our techniques reduce the burden of enhancing single-GPU applications to support these features. To achieve our goal, this work designs and implements a framework for enhancing a single-GPU OpenCL application through application transformation.« less

  18. Distributed Emulation in Support of Large Networks

    DTIC Science & Technology

    2016-06-01

    Provider LTE Long Term Evolution MB Megabyte MIPS Microprocessor without Interlocked Pipeline Stages MRT Multi-Threaded Routing Toolkit NPS Naval...environment, modifications to a network, protocol, or model can be executed – and the effects measured – without affecting real-world users or services...produce their results when analyzing performance of Long Term Evolution ( LTE ) gateways [3]. Many research scenarios allow problems to be represented

  19. StackSplit - a plugin for multi-event shear wave splitting analyses in SplitLab

    NASA Astrophysics Data System (ADS)

    Grund, Michael

    2017-04-01

    The SplitLab package (Wüstefeld et al., Computers and Geosciences, 2008), written in MATLAB, is a powerful and widely used tool for analysing seismological shear wave splitting of single event measurements. However, in many cases, especially temporary station deployments close to seaside or for recordings affected by strong anthropogenic noise, only multi-event approaches provide stable and reliable splitting results. In order to extend the original SplitLab environment for such analyses, I present the StackSplit plugin that can easily be implemented within the well accepted main program. StackSplit grants easy access to several different analysis approaches within SplitLab, including a new multiple waveform based inversion method as well as the most established standard stacking procedures. The possibility to switch between different analysis approaches at any time allows the user for the most flexible processing of individual multi-event splitting measurements for a single recording station. Besides the provided functions of the plugin, no other external program is needed for the multi-event analyses since StackSplit performs within the available SplitLab structure.

  20. Constant time worker thread allocation via configuration caching

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Eichenberger, Alexandre E; O'Brien, John K. P.

    Mechanisms are provided for allocating threads for execution of a parallel region of code. A request for allocation of worker threads to execute the parallel region of code is received from a master thread. Cached thread allocation information identifying prior thread allocations that have been performed for the master thread are accessed. Worker threads are allocated to the master thread based on the cached thread allocation information. The parallel region of code is executed using the allocated worker threads.

  1. Model Checking Real Time Java Using Java PathFinder

    NASA Technical Reports Server (NTRS)

    Lindstrom, Gary; Mehlitz, Peter C.; Visser, Willem

    2005-01-01

    The Real Time Specification for Java (RTSJ) is an augmentation of Java for real time applications of various degrees of hardness. The central features of RTSJ are real time threads; user defined schedulers; asynchronous events, handlers, and control transfers; a priority inheritance based default scheduler; non-heap memory areas such as immortal and scoped, and non-heap real time threads whose execution is not impeded by garbage collection. The Robust Software Systems group at NASA Ames Research Center has JAVA PATHFINDER (JPF) under development, a Java model checker. JPF at its core is a state exploring JVM which can examine alternative paths in a Java program (e.g., via backtracking) by trying all nondeterministic choices, including thread scheduling order. This paper describes our implementation of an RTSJ profile (subset) in JPF, including requirements, design decisions, and current implementation status. Two examples are analyzed: jobs on a multiprogramming operating system, and a complex resource contention example involving autonomous vehicles crossing an intersection. The utility of JPF in finding logic and timing errors is illustrated, and the remaining challenges in supporting all of RTSJ are assessed.

  2. Reactor shroud joint

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ballas, G.J.; Fife, A.B.; Ganz, I.

    1998-04-07

    A shroud for a nuclear reactor is described. In one embodiment, the shroud includes first and second shroud sections, and each shroud section includes a substantially cylindrical main body having a first end and a second end. With respect to each shroud section, a flange is located at the main body first end, and the flange has a plurality of bolt openings therein and a plurality of scalloped regions. The first shroud section is welded to the second shroud section, and at least some of the bolt openings in the first shroud section flange align with respective bolt openings inmore » the second shroud section flange. In the event that the onset of inter-granular stress corrosion cracking is ever detected in the weld between the shroud section, bolts are inserted through bolt openings in the first shroud section flange and through aligned bolt openings the second shroud section flange. Each bolt, in one embodiment, has a shank section and first and second threaded end sections. Nuts are threadedly engaged to the threaded end sections and tightened against the respective flanges. 4 figs.« less

  3. Reactor shroud joint

    DOEpatents

    Ballas, G.J.; Fife, A.B.; Ganz, I.

    1998-04-07

    A shroud for a nuclear reactor is described. In one embodiment, the shroud includes first and second shroud sections, and each shroud section includes a substantially cylindrical main body having a first end and a second end. With respect to each shroud section, a flange is located at the main body first end, and the flange has a plurality of bolt openings therein and a plurality of scalloped regions. The first shroud section is welded to the second shroud section, and at least some of the bolt openings in the first shroud section flange align with respective bolt openings in the second shroud section flange. In the event that the onset of inter-granular stress corrosion cracking is ever detected in the weld between the shroud section, bolts are inserted through bolt openings in the first shroud section flange and through aligned bolt openings the second shroud section flange. Each bolt, in one embodiment, has a shank section and first and second threaded end sections. Nuts are threadedly engaged to the threaded end sections and tightened against the respective flanges. 4 figs.

  4. KENNEDY SPACE CENTER, FLA. - In the Orbiter Processing Facility, while Greg Harlow, with United Space Alliance (USA) (above) threads a camera under the tiles of the orbiter Endeavour, Peggy Ritchie, USA, (behind the stand) and NASA’s Richard Parker (seated) watch the images on a monitor to inspect for corrosion.

    NASA Image and Video Library

    2003-09-04

    KENNEDY SPACE CENTER, FLA. - In the Orbiter Processing Facility, while Greg Harlow, with United Space Alliance (USA) (above) threads a camera under the tiles of the orbiter Endeavour, Peggy Ritchie, USA, (behind the stand) and NASA’s Richard Parker (seated) watch the images on a monitor to inspect for corrosion.

  5. KENNEDY SPACE CENTER, FLA. - In the Orbiter Processing Facility, while Greg Harlow, with United Space Alliance (USA), (above) threads a camera under the tiles of the orbiter Endeavour, NASA’s Richard Parker (below left) and Peggy Ritchie, with USA, (at right) watch the images on a monitor to inspect for corrosion.

    NASA Image and Video Library

    2003-09-04

    KENNEDY SPACE CENTER, FLA. - In the Orbiter Processing Facility, while Greg Harlow, with United Space Alliance (USA), (above) threads a camera under the tiles of the orbiter Endeavour, NASA’s Richard Parker (below left) and Peggy Ritchie, with USA, (at right) watch the images on a monitor to inspect for corrosion.

  6. KENNEDY SPACE CENTER, FLA. - In the Orbiter Processing Facility, while Greg Harlow, with United Space Alliance (USA), (above) threads a camera under the tiles of the orbiter Endeavour, Peggy Ritchie, with USA, (behind the stand) and NASA’s Richard Parker watch the images on a monitor to inspect for corrosion.

    NASA Image and Video Library

    2003-09-04

    KENNEDY SPACE CENTER, FLA. - In the Orbiter Processing Facility, while Greg Harlow, with United Space Alliance (USA), (above) threads a camera under the tiles of the orbiter Endeavour, Peggy Ritchie, with USA, (behind the stand) and NASA’s Richard Parker watch the images on a monitor to inspect for corrosion.

  7. A generic multi-hazard and multi-risk framework and its application illustrated in a virtual city

    NASA Astrophysics Data System (ADS)

    Mignan, Arnaud; Euchner, Fabian; Wiemer, Stefan

    2013-04-01

    We present a generic framework to implement hazard correlations in multi-risk assessment strategies. We consider hazard interactions (process I), time-dependent vulnerability (process II) and time-dependent exposure (process III). Our approach is based on the Monte Carlo method to simulate a complex system, which is defined from assets exposed to a hazardous region. We generate 1-year time series, sampling from a stochastic set of events. Each time series corresponds to one risk scenario and the analysis of multiple time series allows for the probabilistic assessment of losses and for the recognition of more or less probable risk paths. Each sampled event is associated to a time of occurrence, a damage footprint and a loss footprint. The occurrence of an event depends on its rate, which is conditional on the occurrence of past events (process I, concept of correlation matrix). Damage depends on the hazard intensity and on the vulnerability of the asset, which is conditional on previous damage on that asset (process II). Losses are the product of damage and exposure value, this value being the original exposure minus previous losses (process III, no reconstruction considered). The Monte Carlo method allows for a straightforward implementation of uncertainties and for implementation of numerous interactions, which is otherwise challenging in an analytical multi-risk approach. We apply our framework to a synthetic data set, defined by a virtual city within a virtual region. This approach gives the opportunity to perform multi-risk analyses in a controlled environment while not requiring real data, which may be difficultly accessible or simply unavailable to the public. Based on the heuristic approach, we define a 100 by 100 km region where earthquakes, volcanic eruptions, fluvial floods, hurricanes and coastal floods can occur. All hazards are harmonized to a common format. We define a 20 by 20 km city, composed of 50,000 identical buildings with a fixed economic value. Vulnerability curves are defined in terms of mean damage ratio as a function of hazard intensity. All data are based on simple equations found in the literature and on other simplifications. We show the impact of earthquake-earthquake interaction and hurricane-storm surge coupling, as well as of time-dependent vulnerability and exposure, on aggregated loss curves. One main result is the emergence of low probability-high consequences (extreme) events when correlations are implemented. While the concept of virtual city can suggest the theoretical benefits of multi-risk assessment for decision support, identifying their real-world practicality will require the study of real test sites.

  8. ALFA: The new ALICE-FAIR software framework

    NASA Astrophysics Data System (ADS)

    Al-Turany, M.; Buncic, P.; Hristov, P.; Kollegger, T.; Kouzinopoulos, C.; Lebedev, A.; Lindenstruth, V.; Manafov, A.; Richter, M.; Rybalchenko, A.; Vande Vyvre, P.; Winckler, N.

    2015-12-01

    The commonalities between the ALICE and FAIR experiments and their computing requirements led to the development of large parts of a common software framework in an experiment independent way. The FairRoot project has already shown the feasibility of such an approach for the FAIR experiments and extending it beyond FAIR to experiments at other facilities[1, 2]. The ALFA framework is a joint development between ALICE Online- Offline (O2) and FairRoot teams. ALFA is designed as a flexible, elastic system, which balances reliability and ease of development with performance using multi-processing and multithreading. A message- based approach has been adopted; such an approach will support the use of the software on different hardware platforms, including heterogeneous systems. Each process in ALFA assumes limited communication and reliance on other processes. Such a design will add horizontal scaling (multiple processes) to vertical scaling provided by multiple threads to meet computing and throughput demands. ALFA does not dictate any application protocols. Potentially, any content-based processor or any source can change the application protocol. The framework supports different serialization standards for data exchange between different hardware and software languages.

  9. Screw-Thread Standards for Federal Services, 1957. Handbook H28 (1957), Part 3

    DTIC Science & Technology

    1957-09-01

    MOUNTING THREADS PHOTOGRAPHIC EQUIPMENT THREADS ISO METRIC THREADS; MISCELLANEOUS THREADS CLASS 5 INTERFERENCE-FIT THREADS, TRIAL STANDARD WRENCH...Bibliography on measurement of pitch diameter by means of wires 60 Appendix 14. Metric screw-thread standards 61 1. ISO thread profiles...61 2. Standard series for ISO metric threads 62 3. Designations for ISO metric threads 62 Tables Page Table XII. 1.—Basic

  10. Sociolinguistically Informed Natural Language Processing: Automating Irony Detection

    DTIC Science & Technology

    2017-10-23

    ML and NLP technologies fail to detect ironic intent empirically. We specifically proposed to assess quantitatively (using the collected dataset...Aim 2. To analyze when existing ML and NLP technologies fail to detect ironic intent empirically. We specifically proposed to assess quantitatively ...of the embedding reddit thread, and the other comments in this thread) constitute 4 sub-reddit (URL) description number of labeled comments politics

  11. Parallel Agent-Based Simulations on Clusters of GPUs and Multi-Core Processors

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aaby, Brandon G; Perumalla, Kalyan S; Seal, Sudip K

    2010-01-01

    An effective latency-hiding mechanism is presented in the parallelization of agent-based model simulations (ABMS) with millions of agents. The mechanism is designed to accommodate the hierarchical organization as well as heterogeneity of current state-of-the-art parallel computing platforms. We use it to explore the computation vs. communication trade-off continuum available with the deep computational and memory hierarchies of extant platforms and present a novel analytical model of the tradeoff. We describe our implementation and report preliminary performance results on two distinct parallel platforms suitable for ABMS: CUDA threads on multiple, networked graphical processing units (GPUs), and pthreads on multi-core processors. Messagemore » Passing Interface (MPI) is used for inter-GPU as well as inter-socket communication on a cluster of multiple GPUs and multi-core processors. Results indicate the benefits of our latency-hiding scheme, delivering as much as over 100-fold improvement in runtime for certain benchmark ABMS application scenarios with several million agents. This speed improvement is obtained on our system that is already two to three orders of magnitude faster on one GPU than an equivalent CPU-based execution in a popular simulator in Java. Thus, the overall execution of our current work is over four orders of magnitude faster when executed on multiple GPUs.« less

  12. Dedicated memory structure holding data for detecting available worker thread(s) and informing available worker thread(s) of task(s) to execute

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chiu, George L.; Eichenberger, Alexandre E.; O'Brien, John K. P.

    The present disclosure relates generally to a dedicated memory structure (that is, hardware device) holding data for detecting available worker thread(s) and informing available worker thread(s) of task(s) to execute.

  13. Automatic building of a web-like structure based on thermoplastic adhesive.

    PubMed

    Leach, Derek; Wang, Liyu; Reusser, Dorothea; Iida, Fumiya

    2014-09-01

    Animals build structures to extend their control over certain aspects of the environment; e.g., orb-weaver spiders build webs to capture prey, etc. Inspired by this behaviour of animals, we attempt to develop robotics technology that allows a robot to automatically builds structures to help it accomplish certain tasks. In this paper we show automatic building of a web-like structure with a robot arm based on thermoplastic adhesive (TPA) material. The material properties of TPA, such as elasticity, adhesiveness, and low melting temperature, make it possible for a robot to form threads across an open space by an extrusion-drawing process and then combine several of these threads into a web-like structure. The problems addressed here are discovering which parameters determine the thickness of a thread and determining how web-like structures may be used for certain tasks. We first present a model for the extrusion and the drawing of TPA threads which also includes the temperature-dependent material properties. The model verification result shows that the increasing relative surface area of the TPA thread as it is drawn thinner increases the heat loss of the thread, and that by controlling how quickly the thread is drawn, a range of diameters can be achieved from 0.2-0.75 mm. We then present a method based on a generalized nonlinear finite element truss model. The model was validated and could predict the deformation of various web-like structures when payloads are added. At the end, we demonstrate automatic building of a web-like structure for payload bearing.

  14. Efficient accesses of data structures using processing near memory

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jayasena, Nuwan S.; Zhang, Dong Ping; Diez, Paula Aguilera

    Systems, apparatuses, and methods for implementing efficient queues and other data structures. A queue may be shared among multiple processors and/or threads without using explicit software atomic instructions to coordinate access to the queue. System software may allocate an atomic queue and corresponding queue metadata in system memory and return, to the requesting thread, a handle referencing the queue metadata. Any number of threads may utilize the handle for accessing the atomic queue. The logic for ensuring the atomicity of accesses to the atomic queue may reside in a management unit in the memory controller coupled to the memory wheremore » the atomic queue is allocated.« less

  15. Exploring Manycore Multinode Systems for Irregular Applications with FPGA Prototyping

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ceriani, Marco; Palermo, Gianluca; Secchi, Simone

    We present a prototype of a multi-core architecture implemented on FPGA, designed to enable efficient execution of irregular applications on distributed shared memory machines, while maintaining high performance on regular workloads. The architecture is composed of off-the-shelf soft-core cores, local interconnection and memory interface, integrated with custom components that optimize it for irregular applications. It relies on three key elements: a global address space, multithreading, and fine-grained synchronization. Global addresses are scrambled to reduce the formation of network hot-spots, while the latency of the transactions is covered by integrating an hardware scheduler within the custom load/store buffers to take advantagemore » from the availability of multiple executions threads, increasing the efficiency in a transparent way to the application. We evaluated a dual node system irregular kernels showing scalability in the number of cores and threads.« less

  16. LEAKAGE CHARACTERISTICS OF MULTI-CONDUCTOR CABLES AND CONDUIT SEALS

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nelson, C.; Becker, S.

    1962-12-12

    Pipe threads in conduit seal-offs can be made air tight by use of a two- part thiokol-epoxy sealant such as Sika.'' This material bonds to metal but does not harden; thus, threaded parts can be separated. Gas seals in conduit sealoffs can be made by use of Chico, Type A'' sealant. This material is hard and can withstand high pressure differentials. However, there is a detectable leakage through Chico, Type A.'' Sika'' can be used to make a suitable gas- tight seal. However, this material is flexible and will not support long cable lengths. A dual pour method is suggestedmore » of first casting Chico'' around the connectors to obtain strength in the seal and then using either Sika'' or Micro-Preg'' to produce a tight seal. Leakage through the cable, between strands of conductor, can be reduced by either soldering the ends or dipping the ends in conductive epoxy paint. (auth)« less

  17. Multicore Challenges and Benefits for High Performance Scientific Computing

    DOE PAGES

    Nielsen, Ida M. B.; Janssen, Curtis L.

    2008-01-01

    Until recently, performance gains in processors were achieved largely by improvements in clock speeds and instruction level parallelism. Thus, applications could obtain performance increases with relatively minor changes by upgrading to the latest generation of computing hardware. Currently, however, processor performance improvements are realized by using multicore technology and hardware support for multiple threads within each core, and taking full advantage of this technology to improve the performance of applications requires exposure of extreme levels of software parallelism. We will here discuss the architecture of parallel computers constructed from many multicore chips as well as techniques for managing the complexitymore » of programming such computers, including the hybrid message-passing/multi-threading programming model. We will illustrate these ideas with a hybrid distributed memory matrix multiply and a quantum chemistry algorithm for energy computation using Møller–Plesset perturbation theory.« less

  18. CONTROL ROD ROTATING MECHANISM

    DOEpatents

    Baumgarten, A.; Karalis, A.J.

    1961-11-28

    A threaded rotatable shaft is provided which rotates in response to linear movement of a nut, the shaft being surrounded by a pair of bellows members connected to either side of the nut to effectively seal the reactor from leakage and also to store up energy to shut down the reactor in the event of a power failure. (AEC)

  19. Magnesium Object Manager Sandbox, A More Effective Sandbox Method for Windows 7

    DTIC Science & Technology

    2012-03-01

    keys, synchronization primitives , etc.). The object body is specific to, and the same for, each object type; it contains information common to each...the Object Directory Specific Rights Synchronization Object (Event, Mutex, Semaphore , Timer) Synchronization objects allow multiple threads to... Synchronization Object Specific Rights . . . . . . . . . . . . . . . . . . . . . 19 2.5 File Object Specific Rights

  20. Developing Internal Controls through Activities

    ERIC Educational Resources Information Center

    Barnes, F. Herbert

    2009-01-01

    Life events can include the Tuesday afternoon cooking class with the group worker or the Saturday afternoon football game, but in the sense that Fritz Redl thought of them, these activities are only threads in a fabric of living that includes all the elements of daily life: playing, working, school-based learning, learning through activities,…

  1. LibIsopach

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Brunhart-Lupo, Nicholas

    2016-12-06

    LibIsopach is a toolkit for high performance distributed immersive visualization, leveraging modern OpenGL. It features a multi-process scenegraph, explicit instance rendering, mesh generation, and three-dimensional user interaction event processing.

  2. ROOT 6 and beyond: TObject, C++14 and many cores

    DOE PAGES

    Bellenot, B.; Canal, Ph; Couet, O.; ...

    2015-12-23

    Following the release of version 6, ROOT has entered a new area of development. It will leverage the industrial strength compiler library shipping in ROOT 6 and its support of the C++11/14 standard, to significantly simplify and harden ROOT's interfaces and to clarify and substantially improve ROOT's support for multi-threaded environments. Furthermore, this talk will also recap the most important new features and enhancements in ROOT in general, focusing on those allowed by the improved interpreter and better compiler support, including I/O for smart pointers, easier type safe access to the content of TTrees and enhanced multi processor support.

  3. Deployment of 802.15.4 Sensor Networks for C4ISR Operations

    DTIC Science & Technology

    2006-06-01

    43 Figure 20.MSP410CA Dense Grid Monitoring (Crossbow User’s Manual, 2005). ....................................44 Figure 21.(a)MICA2 without...Deployment of Sensor Grid (COASTS OPORD, 2006). ...56 Figure 27.Topology View of Two Nodes and Base Station .......57 Figure 28.Nodes Employing Multi...Random Access Memory TCP/IP Transmission Control Protocol/Internet Protocol TinyOS Tiny Micro Threading Operating System UARTs Universal

  4. Multistate photo-induced relaxation and photoisomerization ability of fumaramide threads: a computational and experimental study.

    PubMed

    Altoè, Piero; Haraszkiewicz, Natalia; Gatti, Francesco G; Wiering, Piet G; Frochot, Céline; Brouwer, Albert M; Balkowski, Grzegorz; Shaw, Daniel; Woutersen, Sander; Buma, Wybren Jan; Zerbetto, Francesco; Orlandi, Giorgio; Leigh, David A; Garavelli, Marco

    2009-01-14

    Fumaric and maleic amides are the photoactive units of an important and widely investigated class of photocontrollable rotaxanes as they trigger ring shuttling via a cis-trans photoisomerization. Here, ultrafast decay and photoinduced isomerization in isolated fumaramide and solvated nitrogen-substituted fumaramides (that are employed as threads in those rotaxanes) have been investigated by means of CASPT2//CASSCF computational and time-resolved spectroscopic techniques, respectively. A complex multistate network of competitive deactivation channels, involving both internal conversion and intersystem crossing (ISC) processes, has been detected and characterized that accounts for the picosecond decay and photochemical/photophysical properties observed in the singlet as well as triplet (photosensitized) photochemistry of fumaramides threads. Interestingly, singlet photochemistry appears to follow a non-Kasha rule model, where nonequilibrium dynamical factors control the outcome of the photochemical process: accessible high energy portions of extended crossing seams turn out to drive the deactivation process and ground-state recovery. Concurrently, extended singlet/triplet degenerate regions of twisted molecular structures with significant spin-orbit-coupling values account for ultrafast (picosecond time scale) ISC processes that lead to higher photoisomerization efficiencies. This model discloses the principles behind the intrinsic photochemical reactivity of fumaramide and its control.

  5. Three dimensional simulations of viscous folding in diverging microchannels

    NASA Astrophysics Data System (ADS)

    Xu, Bingrui; Chergui, Jalel; Shin, Seungwon; Juric, Damir

    2016-11-01

    Three dimensional simulations on the viscous folding in diverging microchannels reported by Cubaud and Mason are performed using the parallel code BLUE for multi-phase flows. The more viscous liquid L1 is injected into the channel from the center inlet, and the less viscous liquid L2 from two side inlets. Liquid L1 takes the form of a thin filament due to hydrodynamic focusing in the long channel that leads to the diverging region. The thread then becomes unstable to a folding instability, due to the longitudinal compressive stress applied to it by the diverging flow of liquid L2. We performed a parameter study in which the flow rate ratio, the viscosity ratio, the Reynolds number, and the shape of the channel were varied relative to a reference model. In our simulations, the cross section of the thread produced by focusing is elliptical rather than circular. The initial folding axis can be either parallel or perpendicular to the narrow dimension of the chamber. In the former case, the folding slowly transforms via twisting to perpendicular folding, or it may remain parallel. The direction of folding onset is determined by the velocity profile and the elliptical shape of the thread cross section in the channel that feeds the diverging part of the cell.

  6. MESSENGER observations of magnetic reconnection in Mercury's magnetosphere.

    PubMed

    Slavin, James A; Acuña, Mario H; Anderson, Brian J; Baker, Daniel N; Benna, Mehdi; Boardsen, Scott A; Gloeckler, George; Gold, Robert E; Ho, George C; Korth, Haje; Krimigis, Stamatios M; McNutt, Ralph L; Raines, Jim M; Sarantos, Menelaos; Schriver, David; Solomon, Sean C; Trávnícek, Pavel; Zurbuchen, Thomas H

    2009-05-01

    Solar wind energy transfer to planetary magnetospheres and ionospheres is controlled by magnetic reconnection, a process that determines the degree of connectivity between the interplanetary magnetic field (IMF) and a planet's magnetic field. During MESSENGER's second flyby of Mercury, a steady southward IMF was observed and the magnetopause was threaded by a strong magnetic field, indicating a reconnection rate ~10 times that typical at Earth. Moreover, a large flux transfer event was observed in the magnetosheath, and a plasmoid and multiple traveling compression regions were observed in Mercury's magnetotail, all products of reconnection. These observations indicate that Mercury's magnetosphere is much more responsive to IMF direction and dominated by the effects of reconnection than that of Earth or the other magnetized planets.

  7. Integrated Hardware and Software for No-Loss Computing

    NASA Technical Reports Server (NTRS)

    James, Mark

    2007-01-01

    When an algorithm is distributed across multiple threads executing on many distinct processors, a loss of one of those threads or processors can potentially result in the total loss of all the incremental results up to that point. When implementation is massively hardware distributed, then the probability of a hardware failure during the course of a long execution is potentially high. Traditionally, this problem has been addressed by establishing checkpoints where the current state of some or part of the execution is saved. Then in the event of a failure, this state information can be used to recompute that point in the execution and resume the computation from that point. A serious problem arises when one distributes a problem across multiple threads and physical processors is that one increases the likelihood of the algorithm failing due to no fault of the scientist but as a result of hardware faults coupled with operating system problems. With good reason, scientists expect their computing tools to serve them and not the other way around. What is novel here is a unique combination of hardware and software that reformulates an application into monolithic structure that can be monitored in real-time and dynamically reconfigured in the event of a failure. This unique reformulation of hardware and software will provide advanced aeronautical technologies to meet the challenges of next-generation systems in aviation, for civilian and scientific purposes, in our atmosphere and in atmospheres of other worlds. In particular, with respect to NASA s manned flight to Mars, this technology addresses the critical requirements for improving safety and increasing reliability of manned spacecraft.

  8. Examining Literacy Teachers' Perceptions of the Use of VoiceThread in an Elementary, Middle School, and a High School Classroom for Enhancing Instructional Goals

    ERIC Educational Resources Information Center

    Stover, Katie; Kissel, Brian; Wood, Karen; Putman, Michael

    2015-01-01

    In today's digital age, Web 2.0 tools such as VoiceThread allow users to integrate images, voices, and responses within one digital platform, providing students with the opportunity to add another layer of meaning to their texts. We conducted this research to expand our understanding of the processes necessary for integrating digital tools into…

  9. [Time and use of discussion forums in type 1 diabetes: contribution to patient education].

    PubMed

    Harry, Isabelle; Gagnayre, Rémi

    2013-01-01

    The purpose of this study was to elucidate the concept of temporality in discussions on forums used by individuals concerned by type 1 diabetes: adults and parents of children. The contents of messages were first converted into skills, and their temporality was then analysed, particularly in terms of the duration of active threads. Two types of temporality are involved in the use of forums: prescribed time governed by the therapeutic requirements related to a chronic disease and the decisions to be taken, and open-ended social time available on the Internet and the resulting reflexive processes. Our results show that topics relating to self-care and adaptation skills are often discussed and new threads on the topic are frequently introduced. Considerable diversity in the activity level associated with the various threads was observed, as most threads were only active for short periods. Following this study, our research perspectives concern: (i) the ways in which patients and their families reconcile the temporality dictated by a chronic disease (prescribed time) with the open-ended social time available on the Internet; and (ii) the ways in which this temporality is characteristic of patient learning processes via discussion forums. Future research will focus on the concept of rythmo-apprenance (rhythmic learning) in therapeutic patient education.

  10. pFUnit 3.0 Tutorial Advanced

    NASA Technical Reports Server (NTRS)

    Clune, Tom

    2014-01-01

    This tutorial will introduce Fortran developers to unit-testing and test-driven development (TDD) using pFUnit. As with other unit-testing frameworks, pFUnit, simplifies the process of writing, collecting, and executing tests while providing clear diagnostic messages for failing tests. pFUnit specifically targets the development of scientific-technical software written in Fortran and includes customized features such as: assertions for multi-dimensional arrays, distributed (MPI) and thread-based (OpenMP) parallellism, and flexible parameterized tests.These sessions will include numerous examples and hands-on exercises that gradually build in complexity. Attendees are expected to have working knowledge of F90, but familiarity with object-oriented syntax in F2003 and MPI will be of benefit for the more advanced examples. By the end of the tutorial the audience should feel comfortable in applying pFUnit within their own development environment.

  11. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tomopy is a Python toolbox to perform x-ray data processing, image reconstruction and data exchange tasks at synchrotron facilities. The dependencies of the software are currently as follows: -Python related python standard library (http://docs.python.org/2/library/) numpy (http://www.numpy.org/) scipy (http://scipy.org/) matplotlib (http://matplotlip.org/) sphinx (http://sphinx-doc.org) pil (http://www.pythonware.com/products/pil/) pyhdf (http://pysclint.sourceforge.net/pyhdf/) h5py (http://www.h5py.org) pywt (http://www.pybytes.com/pywavelets/) file.py (https://pyspec.svn.sourceforge.net/svnroot/pyspec/trunk/pyspec/ccd/files.py) -C/C++ related: gridec (anonymous?? C-code written back in 1997 that uses standard C library) fftw (http://www.fftw.org/) tomoRecon (multi-threaded C++ verion of gridrec. Author: Mark Rivers from APS. http://cars9.uchicago.edu/software/epics/tomoRecon.html) epics (http://www.aps.anl.gov/epics/)

  12. Framework for Development of Object-Oriented Software

    NASA Technical Reports Server (NTRS)

    Perez-Poveda, Gus; Ciavarella, Tony; Nieten, Dan

    2004-01-01

    The Real-Time Control (RTC) Application Framework is a high-level software framework written in C++ that supports the rapid design and implementation of object-oriented application programs. This framework provides built-in functionality that solves common software development problems within distributed client-server, multi-threaded, and embedded programming environments. When using the RTC Framework to develop software for a specific domain, designers and implementers can focus entirely on the details of the domain-specific software rather than on creating custom solutions, utilities, and frameworks for the complexities of the programming environment. The RTC Framework was originally developed as part of a Space Shuttle Launch Processing System (LPS) replacement project called Checkout and Launch Control System (CLCS). As a result of the framework s development, CLCS software development time was reduced by 66 percent. The framework is generic enough for developing applications outside of the launch-processing system domain. Other applicable high-level domains include command and control systems and simulation/ training systems.

  13. Facilitating Co-Design for Extreme-Scale Systems Through Lightweight Simulation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Engelmann, Christian; Lauer, Frank

    This work focuses on tools for investigating algorithm performance at extreme scale with millions of concurrent threads and for evaluating the impact of future architecture choices to facilitate the co-design of high-performance computing (HPC) architectures and applications. The approach focuses on lightweight simulation of extreme-scale HPC systems with the needed amount of accuracy. The prototype presented in this paper is able to provide this capability using a parallel discrete event simulation (PDES), such that a Message Passing Interface (MPI) application can be executed at extreme scale, and its performance properties can be evaluated. The results of an initial prototype aremore » encouraging as a simple 'hello world' MPI program could be scaled up to 1,048,576 virtual MPI processes on a four-node cluster, and the performance properties of two MPI programs could be evaluated at up to 16,384 virtual MPI processes on the same system.« less

  14. Interactions among forest age, valley and channel morphology, and log jams regulate animal production in mountain streams

    NASA Astrophysics Data System (ADS)

    Walters, D. M.; Venarsky, M. P.; Hall, R. O., Jr.; Herdrich, A.; Livers, B.; Winkelman, D.; Wohl, E.

    2014-12-01

    Forest age and local valley morphometry strongly influence the form and function of mountain streams in Colorado. Streams in valleys with old growth forest (>350 years) have extensive log jam complexes that create multi-thread channel reaches with extensive pool habitat and large depositional areas. Streams in younger unmanaged forests (e.g., 120 years old) and intensively managed forests have much fewer log jams and lower wood loads. These are single-thread streams dominated by riffles and with little depositional habitat. We hypothesized that log jam streams would retain more organic matter and have higher metabolism, leading to greater production of stream macroinvertebrates and trout. Log jam reaches should also have greater emergence of adult aquatic insects, and consequently have higher densities of riparian spiders taking advantage of these prey. Surficial organic matter was 3-fold higher in old-growth streams, and these streams had much higher ecosystem respiration. Insect production (g m2 y-1) was similar among forest types, but fish density was four times higher in old-growth streams with copious log jams. However, at the valley scale, insect production (g m-1 valley-1) and trout density (number m-1 valley-1) was 2-fold and 10-fold higher, respectively, in old growth streams. This finding is because multi-thread reaches created by log jams have much greater stream area and stream length per meter of valley than single-thread channels. The more limited response of macroinvertebrates may be related to fish predation. Trout in old growth streams had similar growth rates and higher fat content than fish in other streams in spite of occurring at higher densities and higher elevation/colder temperatures. This suggests that the positive fish effect observed in old growth streams is related to greater availability of invertebrate prey, which is consistent with our original hypothesis. Preliminary analyses suggest that spider densities do not respond strongly to differences in stream morphology, but rather to changes in elevation and associated air temperatures. These results demonstrate strong indirect effects of forest age and valley morphometry on organic matter storage and animal secondary production in streams that is mediated by direct effects associated with the presence or absence of logjams.

  15. StackSplit - a plugin for multi-event shear wave splitting analyses in SplitLab

    NASA Astrophysics Data System (ADS)

    Grund, Michael

    2017-08-01

    SplitLab is a powerful and widely used tool for analysing seismological shear wave splitting of single event measurements. However, in many cases, especially temporary station deployments close to the noisy seaside, ocean bottom or for recordings affected by strong anthropogenic noise, only multi-event approaches provide stable and reliable splitting results. In order to extend the original SplitLab environment for such analyses, I present the StackSplit plugin that can easily be implemented within the well accepted main program. StackSplit grants easy access to several different analysis approaches within SplitLab, including a new multiple waveform based inversion method as well as the most established standard stacking procedures. The possibility to switch between different analysis approaches at any time allows the user for the most flexible processing of individual multi-event splitting measurements for a single recording station. Besides the provided functions of the plugin, no other external program is needed for the multi-event analyses since StackSplit performs within the available SplitLab structure which is based on MATLAB. The effectiveness and use of this plugin is demonstrated with data examples of a long running seismological recording station in Finland.

  16. Visualization of Spatio-Temporal Relations in Movement Event Using Multi-View

    NASA Astrophysics Data System (ADS)

    Zheng, K.; Gu, D.; Fang, F.; Wang, Y.; Liu, H.; Zhao, W.; Zhang, M.; Li, Q.

    2017-09-01

    Spatio-temporal relations among movement events extracted from temporally varying trajectory data can provide useful information about the evolution of individual or collective movers, as well as their interactions with their spatial and temporal contexts. However, the pure statistical tools commonly used by analysts pose many difficulties, due to the large number of attributes embedded in multi-scale and multi-semantic trajectory data. The need for models that operate at multiple scales to search for relations at different locations within time and space, as well as intuitively interpret what these relations mean, also presents challenges. Since analysts do not know where or when these relevant spatio-temporal relations might emerge, these models must compute statistical summaries of multiple attributes at different granularities. In this paper, we propose a multi-view approach to visualize the spatio-temporal relations among movement events. We describe a method for visualizing movement events and spatio-temporal relations that uses multiple displays. A visual interface is presented, and the user can interactively select or filter spatial and temporal extents to guide the knowledge discovery process. We also demonstrate how this approach can help analysts to derive and explain the spatio-temporal relations of movement events from taxi trajectory data.

  17. GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering

    PubMed Central

    Suzuki, Shuji; Kakuta, Masanori; Ishida, Takashi; Akiyama, Yutaka

    2016-01-01

    Sequence homology searches are used in various fields and require large amounts of computation time, especially for metagenomic analysis, owing to the large number of queries and the database size. To accelerate computing analyses, graphics processing units (GPUs) are widely used as a low-cost, high-performance computing platform. Therefore, we mapped the time-consuming steps involved in GHOSTZ, which is a state-of-the-art homology search algorithm for protein sequences, onto a GPU and implemented it as GHOSTZ-GPU. In addition, we optimized memory access for GPU calculations and for communication between the CPU and GPU. As per results of the evaluation test involving metagenomic data, GHOSTZ-GPU with 12 CPU threads and 1 GPU was approximately 3.0- to 4.1-fold faster than GHOSTZ with 12 CPU threads. Moreover, GHOSTZ-GPU with 12 CPU threads and 3 GPUs was approximately 5.8- to 7.7-fold faster than GHOSTZ with 12 CPU threads. PMID:27482905

  18. Effects of mechanical repetitive load on bone quality around implants in rat maxillae

    PubMed Central

    Uto, Yusuke; Nakano, Takayoshi; Ishimoto, Takuya; Inaba, Nao; Uchida, Yusuke; Sawase, Takashi

    2017-01-01

    Greater understanding and acceptance of the new concept “bone quality”, which was proposed by the National Institutes of Health and is based on bone cells and collagen fibers, are required. The novel protein Semaphorin3A (Sema3A) is associated with osteoprotection by regulating bone cells. The aims of this study were to investigate the effects of mechanical loads on Sema3A production and bone quality based on bone cells and collagen fibers around implants in rat maxillae. Grade IV-titanium threaded implants were placed at 4 weeks post-extraction in maxillary first molars. Implants received mechanical loads (10 N, 3 Hz for 1800 cycles, 2 days/week) for 5 weeks from 3 weeks post-implant placement to minimize the effects of wound healing processes by implant placement. Bone structures, bone mineral density (BMD), Sema3A production and bone quality based on bone cells and collagen fibers were analyzed using microcomputed tomography, histomorphometry, immunohistomorphometry, polarized light microscopy and birefringence measurement system inside of the first and second thread (designated as thread A and B, respectively), as mechanical stresses are concentrated and differently distributed on the first two threads from the implant neck. Mechanical load significantly increased BMD, but not bone volume around implants. Inside thread B, but not thread A, mechanical load significantly accelerated Sema3A production with increased number of osteoblasts and osteocytes, and enhanced production of both type I and III collagen. Moreover, mechanical load also significantly induced preferential alignment of collagen fibers in the lower flank of thread B. These data demonstrate that mechanical load has different effects on Sema3A production and bone quality based on bone cells and collagen fibers between the inside threads of A and B. Mechanical load-induced Sema3A production may be differentially regulated by the type of bone structure or distinct stress distribution, resulting in control of bone quality around implants in jaw bones. PMID:29244883

  19. Effects of mechanical repetitive load on bone quality around implants in rat maxillae.

    PubMed

    Uto, Yusuke; Kuroshima, Shinichiro; Nakano, Takayoshi; Ishimoto, Takuya; Inaba, Nao; Uchida, Yusuke; Sawase, Takashi

    2017-01-01

    Greater understanding and acceptance of the new concept "bone quality", which was proposed by the National Institutes of Health and is based on bone cells and collagen fibers, are required. The novel protein Semaphorin3A (Sema3A) is associated with osteoprotection by regulating bone cells. The aims of this study were to investigate the effects of mechanical loads on Sema3A production and bone quality based on bone cells and collagen fibers around implants in rat maxillae. Grade IV-titanium threaded implants were placed at 4 weeks post-extraction in maxillary first molars. Implants received mechanical loads (10 N, 3 Hz for 1800 cycles, 2 days/week) for 5 weeks from 3 weeks post-implant placement to minimize the effects of wound healing processes by implant placement. Bone structures, bone mineral density (BMD), Sema3A production and bone quality based on bone cells and collagen fibers were analyzed using microcomputed tomography, histomorphometry, immunohistomorphometry, polarized light microscopy and birefringence measurement system inside of the first and second thread (designated as thread A and B, respectively), as mechanical stresses are concentrated and differently distributed on the first two threads from the implant neck. Mechanical load significantly increased BMD, but not bone volume around implants. Inside thread B, but not thread A, mechanical load significantly accelerated Sema3A production with increased number of osteoblasts and osteocytes, and enhanced production of both type I and III collagen. Moreover, mechanical load also significantly induced preferential alignment of collagen fibers in the lower flank of thread B. These data demonstrate that mechanical load has different effects on Sema3A production and bone quality based on bone cells and collagen fibers between the inside threads of A and B. Mechanical load-induced Sema3A production may be differentially regulated by the type of bone structure or distinct stress distribution, resulting in control of bone quality around implants in jaw bones.

  20. Fragmentation mechanisms of confined co-flowing capillary threads revealed by active flow focusing

    NASA Astrophysics Data System (ADS)

    Robert de Saint Vincent, Matthieu; Delville, Jean-Pierre

    2016-08-01

    The control over stationary liquid thread fragmentation in confined co-flows is a key issue for the processing and transport of fluids in (micro-)ducts. Confinement indeed strongly enhances the stability of capillary threads, and also induces steric and hydrodynamic feedback effects on diphasic flows. We investigate the thread-to-droplet transition within the confined environment of a microchannel by using optocapillarity, i.e., interface stresses driven by light, as a wall-free constriction to locally flow focus stable threads in a tunable way, pinch them, and force their fragmentation. Above some flow-dependent onset in optical forcing, we observe a dynamic transition alternating between continuous (thread) and fragmented (droplets) states and show a surprisingly gradual thread-to-droplet transition when increasing the amplitude of the thread constriction. This transition is interpreted as an evolution from a convective to an absolute instability. Depending on the forcing amplitude, we then identify and characterize several stable fragmented regimes of single and multiple droplet periodicity (up to period-8). These droplet regimes build a robust flow-independent bifurcation diagram that eventually closes up, due to the flow confinement, to a monodisperse droplet size, independent of the forcing and close to the most unstable mode expected from the Rayleigh-Plateau instability. This fixed monodispersity can be circumvented by temporally modulating the optocapillary coupling, as we show that fragmentation can then occur either by triggering again the Rayleigh-Plateau instability when the largest excitable wavelength is larger than that of the most unstable mode, or as a pure consequence of a sufficiently strong optocapillary pinching. When properly adjusted, this modulation allows us to avoid the transient reforming and multidisperse regimes, and thereby to reversibly produce stable monodisperse droplet trains of controlled size. By actuating local flow focusing in time and amplitude, optocapillarity thus proves to be an efficient way to characterize and understand the thread-to-droplet transition in microchannels and to advance channel constriction strategies for the production of tunable monodisperse droplets when the overall confinement is important.

  1. Thread selection according to power characteristics during context switching on compute nodes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Archer, Charles J.; Blocksome, Michael A.; Randles, Amanda E.

    Methods, apparatus, and products are disclosed for thread selection during context switching on a plurality of compute nodes that includes: executing, by a compute node, an application using a plurality of threads of execution, including executing one or more of the threads of execution; selecting, by the compute node from a plurality of available threads of execution for the application, a next thread of execution in dependence upon power characteristics for each of the available threads; determining, by the compute node, whether criteria for a thread context switch are satisfied; and performing, by the compute node, the thread context switchmore » if the criteria for a thread context switch are satisfied, including executing the next thread of execution.« less

  2. Thread selection according to predefined power characteristics during context switching on compute nodes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    None, None

    Methods, apparatus, and products are disclosed for thread selection during context switching on a plurality of compute nodes that includes: executing, by a compute node, an application using a plurality of threads of execution, including executing one or more of the threads of execution; selecting, by the compute node from a plurality of available threads of execution for the application, a next thread of execution in dependence upon power characteristics for each of the available threads; determining, by the compute node, whether criteria for a thread context switch are satisfied; and performing, by the compute node, the thread context switchmore » if the criteria for a thread context switch are satisfied, including executing the next thread of execution.« less

  3. Succinoglycan Is Required for Initiation and Elongation of Infection Threads during Nodulation of Alfalfa by Rhizobium meliloti

    PubMed Central

    Cheng, Hai-Ping; Walker, Graham C.

    1998-01-01

    Rhizobium meliloti Rm1021 must be able to synthesize succinoglycan in order to invade successfully the nodules which it elicits on alfalfa and to establish an effective nitrogen-fixing symbiosis. Using R. meliloti cells that express green fluorescent protein (GFP), we have examined the nature of the symbiotic deficiency of exo mutants that are defective or altered in succinoglycan production. Our observations indicate that an exoY mutant, which does not produce succinoglycan, is symbiotically defective because it cannot initiate the formation of infection threads. An exoZ mutant, which produces succinoglycan without the acetyl modification, forms nitrogen-fixing nodules on plants, but it exhibits a reduced efficiency in the initiation and elongation of infection threads. An exoH mutant, which produces symbiotically nonfunctional high-molecular-weight succinoglycan that lacks the succinyl modification, cannot form extended infection threads. Infection threads initiate at a reduced rate and then abort before they reach the base of the root hairs. Overproduction of succinoglycan by the exoS96::Tn5 mutant does not reduce the efficiency of infection thread initiation and elongation, but it does significantly reduce the ability of this mutant to colonize the curled root hairs, which is the first step of the invasion process. The exoR95::Tn5 mutant, which overproduces succinoglycan to an even greater extent than the exoS96::Tn5 mutant, has completely lost its ability to colonize the curled root hairs. These new observations lead us to propose that succinoglycan is required for both the initiation and elongation of infection threads during nodule invasion and that excess production of succinoglycan interferes with the ability of the rhizobia to colonize curled root hairs. PMID:9748453

  4. Contention Modeling for Multithreaded Distributed Shared Memory Machines: The Cray XMT

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Secchi, Simone; Tumeo, Antonino; Villa, Oreste

    Distributed Shared Memory (DSM) machines are a wide class of multi-processor computing systems where a large virtually-shared address space is mapped on a network of physically distributed memories. High memory latency and network contention are two of the main factors that limit performance scaling of such architectures. Modern high-performance computing DSM systems have evolved toward exploitation of massive hardware multi-threading and fine-grained memory hashing to tolerate irregular latencies, avoid network hot-spots and enable high scaling. In order to model the performance of such large-scale machines, parallel simulation has been proved to be a promising approach to achieve good accuracy inmore » reasonable times. One of the most critical factors in solving the simulation speed-accuracy trade-off is network modeling. The Cray XMT is a massively multi-threaded supercomputing architecture that belongs to the DSM class, since it implements a globally-shared address space abstraction on top of a physically distributed memory substrate. In this paper, we discuss the development of a contention-aware network model intended to be integrated in a full-system XMT simulator. We start by measuring the effects of network contention in a 128-processor XMT machine and then investigate the trade-off that exists between simulation accuracy and speed, by comparing three network models which operate at different levels of accuracy. The comparison and model validation is performed by executing a string-matching algorithm on the full-system simulator and on the XMT, using three datasets that generate noticeably different contention patterns.« less

  5. Modified locking thread form for fastener

    NASA Technical Reports Server (NTRS)

    Roopnarine, (Inventor); Vranish, John D. (Inventor)

    1998-01-01

    A threaded fastener has a standard part with a standard thread form characterized by thread walls with a standard included angle, and a modified part complementary to the standard part having a modified thread form characterized by thread walls which are symmetrically inclined with a modified included angle that is different from the standard included angle of the standard part's thread walls, such that the threads of one part make pre-loaded edge contact with the thread walls of the other part. The thread form of the modified part can have an included angle that is greater, less, or compound as compared to the included angle of the standard part. The standard part may be a bolt and the modified part a nut, or vice versa. The modified thread form holds securely even under large vibrational forces, it permits bi-directional use of standard mating threads, is impervious to the build up of tolerances and can be manufactured with a wider range of tolerances without loss of functionality, and distributes loading stresses (per thread) in a manner that decreases the possibility of single thread failure.

  6. Life's origin: the cosmic, planetary and biological processes

    NASA Technical Reports Server (NTRS)

    Scattergood, T.; Des Marais, D.; Jahnke, L.

    1987-01-01

    From elements formed in interstellar furnaces to humans peering back at the stars, the evolution of life has been a long, intricate and perhaps inevitable process. Life as we know it requires a planet orbiting a star at just the right distance so that water can exist in liquid form. It needs a rich supply of chemicals and energy sources. On Earth, the combination of chemistry and energy generated molecules that evolved ways of replicating themselves and of passing information from one generation to the next. Thus, the thread of life began. This chart traces the thread, maintained by DNA molecules for much of its history, as it weaves its way through the primitive oceans, gaining strength and diversity along the way. Organisms eventually moved onto the land, where advanced forms, including humans, ultimately arose. Finally, assisted by a technology of its own making, life has reached back out into space to understand its own origins, to expand into new realms, and to seek other living threads in the cosmos.

  7. Block-Parallel Data Analysis with DIY2

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Morozov, Dmitriy; Peterka, Tom

    DIY2 is a programming model and runtime for block-parallel analytics on distributed-memory machines. Its main abstraction is block-structured data parallelism: data are decomposed into blocks; blocks are assigned to processing elements (processes or threads); computation is described as iterations over these blocks, and communication between blocks is defined by reusable patterns. By expressing computation in this general form, the DIY2 runtime is free to optimize the movement of blocks between slow and fast memories (disk and flash vs. DRAM) and to concurrently execute blocks residing in memory with multiple threads. This enables the same program to execute in-core, out-of-core, serial,more » parallel, single-threaded, multithreaded, or combinations thereof. This paper describes the implementation of the main features of the DIY2 programming model and optimizations to improve performance. DIY2 is evaluated on benchmark test cases to establish baseline performance for several common patterns and on larger complete analysis codes running on large-scale HPC machines.« less

  8. Observations of the Growth of an Active Region Filament

    NASA Astrophysics Data System (ADS)

    Yang, Bo

    2017-04-01

    We present observations of the growth of an active region filament caused by magnetic interactions among the filament and its adjacent superpenumbral filament (SF) and dark thread-like structures (T). Multistep reconnections are identified during the whole growing process. Magnetic flux convergence and cancellation occurring at the positive footpoint region of the filament is the first step reconnection, which resulted in the filament bifurcating into two sets of intertwined threads. One set anchored in situ, while the other set moved toward and interacted with the SF and part of T. This indicates the second step reconnection, which gave rise to the disappearance of the SF and the formation of a long thread-like structure that connects the far ends of the filament and T. The long thread-like structure further interacted with the T and then separated into two parts, representing the third step reconnection. Finally, another similar long thread-like structure, which intertwined with the fixed filament threads, appeared. Hαobservations show that this twisted structure is a longer sinistral filament. Based on the observed photospheric vector magnetograms, we performed a non-linear force-free field extrapolation to reconstruct the magnetic fields above the photosphere and found that the coronal magnetic field lines associated with the filament consists of two twisted flux ropes winding around each other. These results suggest that magnetic interactions among filaments and their adjacent SFs and T could lead to the growth of the filaments, and the filament is probably supported in a flux rope.

  9. Inverted Control Rod Lock-In Device

    DOEpatents

    Brussalis, W. G.; Bost, G. E.

    1962-12-01

    A mechanism which prevents control rods from dropping out of the reactor core in the event the vessel in which the reactor is mounted should capsize is described. The mechanism includes a pivoted toothed armature which engages the threaded control rod lead screw and prevents removal of the rod whenever the armature is not attracted by the provided electromagnetic means. (AEC)

  10. Trigger Event Meets Culture Shock: Linking the Literature of Transformative Learning Theory and Cross-Cultural Adaptation.

    ERIC Educational Resources Information Center

    Lyon, Carol R.

    The literature on transformative learning theory and the literature on cross-cultural adaptation were analyzed to identify links between both bodies of literature. The notion of an unexpected phenomenon that influences individuals residing in an unfamiliar culture was shown to be a common thread linking the two bodies of literature. Transformative…

  11. "If You Brave Enough to Live It, the Least I Can Do Is Listen": Overcoming the Consequences of Complex Trauma

    ERIC Educational Resources Information Center

    Hudson, Lucy; Beilke, Sarah; Many, Michele

    2016-01-01

    Too many parents who find themselves involved with child welfare agencies have had lives threaded with deeply traumatic events. As adults, their childhood histories manifest themselves in substance abuse, domestic violence, relational problems, risk-taking behaviors, emotional lability, self-harming, anxiety, and depression. To successfully…

  12. 2,3,7,8-TETRACHLORODIBENZO-P-DIOXIN (TCDD) DISRUPTS EARLY MORPHOGENETIC EVENTS THAT FORM THE LOWER REPRODUCTIVE TRACT IN FEMALE RAT FETUSES

    EPA Science Inventory

    In female rats, in utero exposure to 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) during critical periods of organogenesis causes a permanent thread of tissue across the vaginal opening, which consists of a core of mesenchyme surrounded by keratinized epithelia. The objective of t...

  13. 40 CFR Appendix R to Subpart G of... - Substitutes Subject to Use Restrictions Listed in the December 20, 2011 Final Rule, Effective...

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    .... “Differ” means that either the diameter differs by at least 1/16 inch or the thread direction is reversed... (e.g., process tubes). The color mark must extend at least 2.5 centimeters (1 inch) from the... diameter differs by at least 1/16 inch or the thread direction is reversed (i.e., right-handed vs. left...

  14. 40 CFR Appendix R to Subpart G of... - Substitutes Subject to Use Restrictions Listed in the December 20, 2011 Final Rule, Effective...

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ...” means that either the diameter differs by at least 1/16 inch or the thread direction is reversed (i.e...., process tubes). The color mark must extend at least 2.5 centimeters (1 inch) from the compressor and must... diameter differs by at least 1/16 inch or the thread direction is reversed (i.e., right-handed vs. left...

  15. 40 CFR Appendix R to Subpart G of... - Substitutes Subject to Use Restrictions Listed in the December 20, 2011 Final Rule, Effective...

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    .... “Differ” means that either the diameter differs by at least 1/16 inch or the thread direction is reversed... (e.g., process tubes). The color mark must extend at least 2.5 centimeters (1 inch) from the... diameter differs by at least 1/16 inch or the thread direction is reversed (i.e., right-handed vs. left...

  16. Multi-Sensor Data Fusion Project

    DTIC Science & Technology

    2000-02-28

    seismic network by detecting T phases generated by underground events ( generally earthquakes ) and associating these phases to seismic events. The...between underwater explosions (H), underground sources, mostly earthquake - generated (7), and noise detections (N). The phases classified as H are the only...processing for infrasound sensors is most similar to seismic array processing with the exception that the detections are based on a more sophisticated

  17. Transformation Systems at NASA Ames

    NASA Technical Reports Server (NTRS)

    Buntine, Wray; Fischer, Bernd; Havelund, Klaus; Lowry, Michael; Pressburger, TOm; Roach, Steve; Robinson, Peter; VanBaalen, Jeffrey

    1999-01-01

    In this paper, we describe the experiences of the Automated Software Engineering Group at the NASA Ames Research Center in the development and application of three different transformation systems. The systems span the entire technology range, from deductive synthesis, to logic-based transformation, to almost compiler-like source-to-source transformation. These systems also span a range of NASA applications, including solving solar system geometry problems, generating data analysis software, and analyzing multi-threaded Java code.

  18. Multi-spacecraft solar energetic particle analysis of FERMI gamma-ray flare events within the HESPERIA H2020 project

    NASA Astrophysics Data System (ADS)

    Tziotziou, Kostas; Malandraki, Olga; Valtonen, Eino; Heber, Bernd; Zucca, Pietro; Klein, Karl-Ludwig; Vainio, Rami; Tsiropoula, Georgia; Share, Gerald

    2017-04-01

    Multi-spacecraft observations of solar energetic particle (SEP) events are important for understanding the acceleration processes and the interplanetary propagation of particles released during eruptive events. In this work, we have carefully studied 25 gamma-ray flare events observed by FERMI and investigated possible associations with SEP-related events observed with STEREO and L1 spacecraft in the heliosphere. A data-driven velocity dispersion analysis (VDA) and Time-Shifting Analysis (TSA) are used for deriving the release times of protons and electrons at the Sun and for comparing them with the respective times stemming from the gamma-ray event analysis and their X-ray signatures, in an attempt to interconnect the SEPs and Fermi events and better understand the physics involved. Acknowledgements: This project has received funding from the European Union's Horizon 2020 research and innovation program under grant agreement No 637324.

  19. Interplay of catalysis, fidelity, threading, and processivity in the exo- and endonucleolytic reactions of human exonuclease I

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shi, Yuqian; Hellinga, Homme W.; Beese, Lorena S.

    Human exonuclease 1 (hExo1) is a member of the RAD2/XPG structure-specific 5'-nuclease superfamily. Its dominant, processive 5'–3' exonuclease and secondary 5'-flap endonuclease activities participate in various DNA repair, recombination, and replication processes. A single active site processes both recessed ends and 5'-flap substrates. By initiating enzyme reactions in crystals, we have trapped hExo1 reaction intermediates that reveal structures of these substrates before and after their exo- and endonucleolytic cleavage, as well as structures of uncleaved, unthreaded, and partially threaded 5' flaps. Their distinctive 5' ends are accommodated by a small, mobile arch in the active site that binds recessed endsmore » at its base and threads 5' flaps through a narrow aperture within its interior. A sequence of successive, interlocking conformational changes guides the two substrate types into a shared reaction mechanism that catalyzes their cleavage by an elaborated variant of the two-metal, in-line hydrolysis mechanism. Coupling of substrate-dependent arch motions to transition-state stabilization suppresses inappropriate or premature cleavage, enhancing processing fidelity. The striking reduction in flap conformational entropy is catalyzed, in part, by arch motions and transient binding interactions between the flap and unprocessed DNA strand. At the end of the observed reaction sequence, hExo1 resets without relinquishing DNA binding, suggesting a structural basis for its processivity.« less

  20. Interplay of catalysis, fidelity, threading, and processivity in the exo- and endonucleolytic reactions of human exonuclease I.

    PubMed

    Shi, Yuqian; Hellinga, Homme W; Beese, Lorena S

    2017-06-06

    Human exonuclease 1 (hExo1) is a member of the RAD2/XPG structure-specific 5'-nuclease superfamily. Its dominant, processive 5'-3' exonuclease and secondary 5'-flap endonuclease activities participate in various DNA repair, recombination, and replication processes. A single active site processes both recessed ends and 5'-flap substrates. By initiating enzyme reactions in crystals, we have trapped hExo1 reaction intermediates that reveal structures of these substrates before and after their exo- and endonucleolytic cleavage, as well as structures of uncleaved, unthreaded, and partially threaded 5' flaps. Their distinctive 5' ends are accommodated by a small, mobile arch in the active site that binds recessed ends at its base and threads 5' flaps through a narrow aperture within its interior. A sequence of successive, interlocking conformational changes guides the two substrate types into a shared reaction mechanism that catalyzes their cleavage by an elaborated variant of the two-metal, in-line hydrolysis mechanism. Coupling of substrate-dependent arch motions to transition-state stabilization suppresses inappropriate or premature cleavage, enhancing processing fidelity. The striking reduction in flap conformational entropy is catalyzed, in part, by arch motions and transient binding interactions between the flap and unprocessed DNA strand. At the end of the observed reaction sequence, hExo1 resets without relinquishing DNA binding, suggesting a structural basis for its processivity.

  1. Twisting solar coronal jet launched at the boundary of an active region

    NASA Astrophysics Data System (ADS)

    Schmieder, B.; Guo, Y.; Moreno-Insertis, F.; Aulanier, G.; Yelles Chaouche, L.; Nishizuka, N.; Harra, L. K.; Thalmann, J. K.; Vargas Dominguez, S.; Liu, Y.

    2013-11-01

    Aims: A broad jet was observed in a weak magnetic field area at the edge of active region NOAA 11106 that also produced other nearby recurring and narrow jets. The peculiar shape and magnetic environment of the broad jet raised the question of whether it was created by the same physical processes of previously studied jets with reconnection occurring high in the corona. Methods: We carried out a multi-wavelength analysis using the EUV images from the Atmospheric Imaging Assembly (AIA) and magnetic fields from the Helioseismic and Magnetic Imager (HMI) both on-board the Solar Dynamics Observatory, which we coupled to a high-resolution, nonlinear force-free field extrapolation. Local correlation tracking was used to identify the photospheric motions that triggered the jet, and time-slices were extracted along and across the jet to unveil its complex nature. A topological analysis of the extrapolated field was performed and was related to the observed features. Results: The jet consisted of many different threads that expanded in around 10 minutes to about 100 Mm in length, with the bright features in later threads moving faster than in the early ones, reaching a maximum speed of about 200 km s-1. Time-slice analysis revealed a striped pattern of dark and bright strands propagating along the jet, along with apparent damped oscillations across the jet. This is suggestive of a (un)twisting motion in the jet, possibly an Alfvén wave. Bald patches in field lines, low-altitude flux ropes, diverging flow patterns, and a null point were identified at the basis of the jet. Conclusions: Unlike classical λ or Eiffel-tower-shaped jets that appear to be caused by reconnection in current sheets containing null points, reconnection in regions containing bald patches seems to be crucial in triggering the present jet. There is no observational evidence that the flux ropes detected in the topological analysis were actually being ejected themselves, as occurs in the violent phase of blowout jets; instead, the jet itself may have gained the twist of the flux rope(s) through reconnection. This event may represent a class of jets different from the classical quiescent or blowout jets, but to reach that conclusion, more observational and theoretical work is necessary.

  2. Cutting thread at flexible endoscopy.

    PubMed

    Gong, F; Swain, P; Kadirkamanathan, S; Hepworth, C; Laufer, J; Shelton, J; Mills, T

    1996-12-01

    New thread-cutting techniques were developed for use at flexible endoscopy. A guillotine was designed to follow and cut thread at the endoscope tip. A new method was developed for guiding suture cutters. Efficacy of Nd: YAG laser cutting of threads was studied. Experimental and clinical experience with thread-cutting methods is presented. A 2.4 mm diameter flexible thread-cutting guillotine was constructed featuring two lateral holes with sharp edges through which sutures to be cut are passed. Standard suture cutters were guided by backloading thread through the cutters extracorporeally. A snare cutter was constructed to retrieve objects sewn to tissue. Efficacy and speed of Nd: YAG laser in cutting twelve different threads were studied. The guillotine cut thread faster (p < 0.05) than standard suture cutters. Backloading thread shortened time taken to cut thread (p < 0.001) compared with free-hand cutting. Nd: YAG laser was ineffective in cutting uncolored threads and slower than mechanical cutters. Results of thread cutting in clinical studies using sewing machine (n = 77 cutting episodes in 21 patients), in-vivo experiments (n = 156), and postsurgical cases (n = 15 over 15 years) are presented. New thread-cutting methods are described and their efficacy demonstrated in experimental and clinical studies.

  3. Electron Microscopy of the Infection and Subsequent Development of Soybean Nodule Cells

    PubMed Central

    Goodchild, D. J.; Bergersen, F. J.

    1966-01-01

    Goodchild, D. J. (Commonwealth Scientific and Industrial Research Organization, Canberra, Australia), and F. J. Bergersen. Electron microscopy of the infection and subsequent development of soybean nodule cells. J. Bacteriol. 92:204–213. 1966—Electron microscopy of thin sections of the developing central tissue cells of young soybean root nodules has shown that infection is initiated by a few infection threads which penetrate cells of the young central tissue. Extension growth of the threads may be a result of pressure developed from the growth of the bacteria within the threads. Release of bacteria from a thread is preceded by the development on an infection thread of a bulge with a cellulose-free membrane-bounded extension; bacteria move from this into the host cells by an endocytotic process and remain enclosed in an infection vacuole which is bounded by a membrane of host-cell origin. Multiplication of the intracellular bacteria takes place within these vacuoles. Until the host cell becomes filled with bacteria, the vacuoles separate into discrete units at each division. Later, division of the bacteria occurs within each vacuole, thus leading to the mature structure of the central tissue cells in which several bacteria are enclosed within each membrane-bounded unit. Images PMID:5949564

  4. Competition between Visual Events Modulates the Influence of Salience during Free-Viewing of Naturalistic Videos

    PubMed Central

    Nardo, Davide; Console, Paola; Reverberi, Carlo; Macaluso, Emiliano

    2016-01-01

    In daily life the brain is exposed to a large amount of external signals that compete for processing resources. The attentional system can select relevant information based on many possible combinations of goal-directed and stimulus-driven control signals. Here, we investigate the behavioral and physiological effects of competition between distinctive visual events during free-viewing of naturalistic videos. Nineteen healthy subjects underwent functional magnetic resonance imaging (fMRI) while viewing short video-clips of everyday life situations, without any explicit goal-directed task. Each video contained either a single semantically-relevant event on the left or right side (Lat-trials), or multiple distinctive events in both hemifields (Multi-trials). For each video, we computed a salience index to quantify the lateralization bias due to stimulus-driven signals, and a gaze index (based on eye-tracking data) to quantify the efficacy of the stimuli in capturing attention to either side. Behaviorally, our results showed that stimulus-driven salience influenced spatial orienting only in presence of multiple competing events (Multi-trials). fMRI results showed that the processing of competing events engaged the ventral attention network, including the right temporoparietal junction (R TPJ) and the right inferior frontal cortex. Salience was found to modulate activity in the visual cortex, but only in the presence of competing events; while the orienting efficacy of Multi-trials affected activity in both the visual cortex and posterior parietal cortex (PPC). We conclude that in presence of multiple competing events, the ventral attention system detects semantically-relevant events, while regions of the dorsal system make use of saliency signals to select relevant locations and guide spatial orienting. PMID:27445760

  5. A novel polymer extrusion micropelletization process

    NASA Astrophysics Data System (ADS)

    Aquite, William

    Polymer micropellets provide a variety of potential applications for different processes in the polymer industry. Conventional pellets are in the size range of 2.5 mm to 5 mm, while micropellets are at least ten times smaller, in the size range of 50 μm to 1000 μm. The potential benefits to a processor using micropellets include: high surface to volume ratio, high bulk density, fast and even melting rates in extrusion, improved dry flow properties, faster injection molding cycles, and consequently lower energy consumption during processing. More specialized sintering processes that require polymer powders, such as selective sintering techniques, microporous plastics parts manufacturing, and other powder sintering methods would benefit from the production of polymer micropellets since these exhibit the advantages of pellets yet have a lower average size. This work focuses on the study of a technique developed at the Polymer Engineering Center. The technique uses a microcapillary die for the production of micropellets by causing instabilities in extruded polymer threads deformed using an air stream. Tuning of process conditions allow the development of surface disturbances that promote breakup of the threads into pellets, which are subsequently cooled and collected. Although micropellets with high sphericity and a narrow size distribution can be produced using this technique, minimal changes in process conditions also lead to the production of lenticular pellets as well as pellets, fibers and threads with a wide range of size and shape distributions. This work shows how changing processing conditions achieve a variety of shapes and sizes of micropellets, broadening its application for the production of powders from a variety of polymer resins. Different approaches were used, including dimensional analysis and numerical simulation of the micropelletization process. This research reveals the influence of non-linear viscoelastic effects on the dispersion of a polymer thread through surface disturbances. Furthermore, this research reveals how processing parameters can influence the quality of the produced micropellet. Through this work, an economically feasible technique was developed that can produce the raw material for processors that depend on polymer powders that will deliver ideally shaped and distributed micropellets.

  6. Thread gauge for measuring thread pitch diameters

    DOEpatents

    Brewster, A.L.

    1985-11-19

    A thread gauge which attaches to a vernier caliper to measure the thread pitch diameter of both externally threaded and internally threaded parts is disclosed. A pair of anvils are externally threaded with threads having the same pitch as those of the threaded part. Each anvil is mounted on a stem having a ball on which the anvil can rotate to properly mate with the parts to which the anvils are applied. The stems are detachably secured to the caliper blades by attachment collars having keyhole openings for receiving the stems and caliper blades. A set screw is used to secure each collar on its caliper blade. 2 figs.

  7. Thread gauge for measuring thread pitch diameters

    DOEpatents

    Brewster, Albert L.

    1985-01-01

    A thread gauge which attaches to a vernier caliper to measure the thread pitch diameter of both externally threaded and internally threaded parts. A pair of anvils are externally threaded with threads having the same pitch as those of the threaded part. Each anvil is mounted on a stem having a ball on which the anvil can rotate to properly mate with the parts to which the anvils are applied. The stems are detachably secured to the caliper blades by attachment collars having keyhole openings for receiving the stems and caliper blades. A set screw is used to secure each collar on its caliper blade.

  8. Analysis of an Online Match Discussion Board: Improving the Otolaryngology—Head and Neck Surgery Match

    PubMed Central

    Kozin, Elliott D.; Sethi, Rosh; Lehmann, Ashton; Remenschneider, Aaron K.; Golub, Justin S.; Reyes, Samuel A.; Emerick, Kevin; Lee, Daniel J.; Gray, Stacey T.

    2015-01-01

    Introduction “The Match” has become the accepted selection process for graduate medical education. Otomatch.com has provided an online forum for Otolaryngology-Head and Neck Surgery (OHNS) Match-related questions for over a decade. Herein, we aim to 1) delineate the types of posts on Otomatch to better understand the perspective of medical students applying for residency and 2) provide recommendations to potentially improve the Match process. Methods Discussion forum posts on Otomatch between December 2001 and April 2014 were reviewed. The title of each thread and total number of views were recorded for quantitative analysis. Each thread was organized into one of six major categories and one of eighteen subcategories, based on chronology within the application cycle and topic. National Resident Matching Program (NRMP) data were utilized for comparison. Results We identified 1,921 threads corresponding to over 2 million page views. Over 40% of threads related to questions about specific programs, and 27% were discussions about interviews. Views, a surrogate measure for popularity, reflected different trends. The majority of individuals viewed posts on interviews (42%), program specific questions (20%) and how to rank programs (11%). Increase in viewership tracked with a rise in applicant numbers based on NRMP data. Conclusions Our study provides an in depth analysis of a popular discussion forum for medical students interested in the OHNS Match. The most viewed posts are about interview dates and questions regarding specific programs. We provide suggestions to address unmet needs for medical students and potentially improve the Match process. PMID:25550223

  9. Jet evolution in a dense medium: event-by-event fluctuations and multi-particle correlations

    NASA Astrophysics Data System (ADS)

    Escobedo, Miguel A.; Iancu, Edmond

    2017-11-01

    We study the gluon distribution produced via successive medium-induced branchings by an energetic jet propagating through a weakly-coupled quark-gluon plasma. We show that under suitable approximations, the jet evolution is a Markovian stochastic process, which is exactly solvable. For this process, we construct exact analytic solutions for all the n-point correlation functions describing the gluon distribution in the space of energy [M. A. Escobedo, E. Iancu, Event-by-event fluctuations in the medium-induced jet evolution, JHEP 05 (2016) 008. arXiv:arxiv:arXiv:1601.03629, doi:http://dx.doi.org/10.1007/JHEP05(2016)008, M. A. Escobedo, E. Iancu, Multi-particle correlations and KNO scaling in the medium-induced jet evolution, JHEP 12 (2016) 104. arXiv:arxiv:arXiv:1609.06104, doi:http://dx.doi.org/10.1007/JHEP12(2016)104]. Using these results, we study the event-by-event distribution of the energy lost by the jet at large angles and of the multiplicities of the soft particles which carry this energy. We find that the event-by-event fluctuations are huge: the standard deviation in the energy loss is parametrically as large as its mean value [M. A. Escobedo, E. Iancu, Event-by-event fluctuations in the medium-induced jet evolution, JHEP 05 (2016) 008. arXiv:arxiv:arXiv:1601.03629, doi:http://dx.doi.org/10.1007/JHEP05(2016)008]. This has important consequences for the phenomenology of di-jet asymmetry in Pb+Pb collisions at the LHC: it implies that the fluctuations in the branching process can contribute to the measured asymmetry on an equal footing with the geometry of the di-jet event (i.e. as the difference between the in-medium path lengths of the two jets). We compute the higher moments of the multiplicity distribution and identify a remarkable regularity known as Koba-Nielsen-Olesen (KNO) scaling [M. A. Escobedo, E. Iancu, Multi-particle correlations and KNO scaling in the medium-induced jet evolution, JHEP 12 (2016) 104. arXiv:arxiv:arXiv:1609.06104, doi:http://dx.doi.org/10.1007/JHEP12(2016)104

  10. Toward automated interpretation of integrated information: Managing "big data" for NDE

    NASA Astrophysics Data System (ADS)

    Gregory, Elizabeth; Lesthaeghe, Tyler; Holland, Stephen

    2015-03-01

    Large scale automation of NDE processes is rapidly maturing, thanks to recent improvements in robotics and the rapid growth of computer power over the last twenty years. It is fairly straightforward to automate NDE data collection itself, but the process of NDE remains largely manual. We will discuss three threads of technological needs that must be addressed before we are able to perform automated NDE. Spatial context, the first thread, means that each NDE measurement taken is accompanied by metadata that locates the measurement with respect to the 3D physical geometry of the specimen. In this way, the geometry of the specimen acts as a database key. Data context, the second thread, means that we record why the data was taken and how it was measured in addition to the NDE data itself. We will present our software tool that helps users interact with data in context, Databrowse. Condition estimation, the third thread, is maintaining the best possible knowledge of the condition (serviceability, degradation, etc.) of an object or part. In the NDE context, we can prospectively use Bayes' Theorem to integrate the data from each new NDE measurement with prior knowledge. These tools, combined with robotic measurements and automated defect analysis, will provide the information needed to make high-level life predictions and focus NDE measurements where they are needed most.

  11. Dislocation reduction in heteroepitaxial Ge on Si using SiO{sub 2} lined etch pits and epitaxial lateral overgrowth

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Leonhardt, Darin; Han, Sang M.

    2011-09-12

    We report a technique that significantly reduces threading dislocations in Ge on Si heteroepitaxy. Germanium is first grown on Si and etched to produce pits in the surface where threading dislocations terminate. Further processing leaves a layer of SiO{sub 2} only within etch pits. Subsequent selective epitaxial Ge growth results in coalescence above the SiO{sub 2}. The SiO{sub 2} blocks the threading dislocations from propagating into the upper Ge epilayer. With annealed Ge films grown on Si, the said method reduces the defect density from 2.6 x 10{sup 8} to 1.7 x 10{sup 6} cm{sup -2}, potentially making the layermore » suitable for electronic and photovoltaic devices.« less

  12. 78 FR 76815 - Steel Threaded Rod From India: Preliminary Affirmative Countervailing Duty Determination and...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-12-19

    ... DEPARTMENT OF COMMERCE International Trade Administration [C-533-856] Steel Threaded Rod From... exporters of steel threaded rod from India. The period of investigation (``POI'') is January 1, 2012... this investigation is steel threaded rod. Steel threaded rod is certain threaded rod, bar, or studs, of...

  13. Convergent synthesis and photoinduced processes in multi-chromophoric rotaxanes.

    PubMed

    Megiatto, Jackson D; Li, Ke; Schuster, David I; Palkar, Amit; Herranz, M Ángeles; Echegoyen, Luis; Abwandner, Silke; de Miguel, Gustavo; Guldi, Dirk M

    2010-11-18

    A series of [2]rotaxane materials, in which [60]fullerene is linked to a macrocycle and ferrocene (Fc) moieties are placed at the termini of a thread, both of which possess a central Cu(I)-1,10-phenanthroline [Cu(phen)(2)](+) complex, were synthesized by self-assembly using Sauvage metal template methodology. Two types of threads were constructed, one with terminal ester linkages, and a second with terminal 1,2,3-triazole linkages derived from Cu(I)-catalyzed "click" 1,3-cycloaddition reactions. Model compounds lacking the fullerene moiety were prepared in an analogous manner. The ability of the interlocked Fc-[Cu(phen)(2)](+)-C(60) hybrids to undergo electron transfer upon photoexcitation in benzonitrile, dichloromethane, and ortho-dichlorobenzene was investigated by means of time-resolved fluorescence and transient absorption spectroscopy, using excitation wavelengths directed at the fullerene and [Cu(phen)(2)](+) subunits. The energies of the electronic excited states and charge separated (CS) states that might be formed upon photoexcitation were determined from spectroscopic and electrochemical data. These studies showed that MLCT excited states of the copper complex in the fullerenerotaxanes were quenched by electron transfer to the fullerene in benzonitrile, resulting in charge separated states with oxidized copper and reduced fullerene moieties, (Fc)(2)-[Cu(phen)(2)](2+)-C(60)(•-). Even though electron transfer from Fc to the oxidized copper complex is predicted to be exergonic by 0.16 to 0.20 eV, no unequivocal evidence in support of such a process was obtained. The conclusion that Fc plays no role in the photoinduced processes in our systems rests on the lack of enhancement of the lifetime of the charge separated state, as measured by decay of C(60)(•-) at ∼1000 nm, since one-electron oxidized Fc is very difficult to detect spectroscopically in the 500-800 nm spectral region.

  14. Real-time video compressing under DSP/BIOS

    NASA Astrophysics Data System (ADS)

    Chen, Qiu-ping; Li, Gui-ju

    2009-10-01

    This paper presents real-time MPEG-4 Simple Profile video compressing based on the DSP processor. The programming framework of video compressing is constructed using TMS320C6416 Microprocessor, TDS510 simulator and PC. It uses embedded real-time operating system DSP/BIOS and the API functions to build periodic function, tasks and interruptions etcs. Realize real-time video compressing. To the questions of data transferring among the system. Based on the architecture of the C64x DSP, utilized double buffer switched and EDMA data transfer controller to transit data from external memory to internal, and realize data transition and processing at the same time; the architecture level optimizations are used to improve software pipeline. The system used DSP/BIOS to realize multi-thread scheduling. The whole system realizes high speed transition of a great deal of data. Experimental results show the encoder can realize real-time encoding of 768*576, 25 frame/s video images.

  15. Multithreaded hybrid feature tracking for markerless augmented reality.

    PubMed

    Lee, Taehee; Höllerer, Tobias

    2009-01-01

    We describe a novel markerless camera tracking approach and user interaction methodology for augmented reality (AR) on unprepared tabletop environments. We propose a real-time system architecture that combines two types of feature tracking. Distinctive image features of the scene are detected and tracked frame-to-frame by computing optical flow. In order to achieve real-time performance, multiple operations are processed in a synchronized multi-threaded manner: capturing a video frame, tracking features using optical flow, detecting distinctive invariant features, and rendering an output frame. We also introduce user interaction methodology for establishing a global coordinate system and for placing virtual objects in the AR environment by tracking a user's outstretched hand and estimating a camera pose relative to it. We evaluate the speed and accuracy of our hybrid feature tracking approach, and demonstrate a proof-of-concept application for enabling AR in unprepared tabletop environments, using bare hands for interaction.

  16. Ariadne's Thread: Using Social Presence Indices to Distinguish Learning Events in Face-to-Face and ICT-Rich Settings

    ERIC Educational Resources Information Center

    Baskin, Colin; Henderson, Michael

    2005-01-01

    Drawing on ancient Greek mythology, this article traces the learning experiences of 164 pre-service education students as they make the transition from a conventional face-to-face (f-2-f) learning environment to an Information and Communication Technology (ICT) rich setting. Influenced by Social Presence Theory (Short, Williams & Christie,…

  17. OBSERVATIONS OF THE GROWTH OF AN ACTIVE REGION FILAMENT

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Bo; Jiang, Yunchun; Yang, Jiayan

    We present observations of the growth of an active region filament caused by magnetic interactions among the filament and its adjacent superpenumbral filament (SF) and dark thread-like structures (T). Multistep reconnections are identified during the whole growing process. Magnetic flux convergence and cancellation occurring at the positive footpoint region of the filament is the first step reconnection, which resulted in the filament bifurcating into two sets of intertwined threads. One set anchored in situ, while the other set moved toward and interacted with the SF and part of T. This indicates the second step reconnection, which gave rise to themore » disappearance of the SF and the formation of a long thread-like structure that connects the far ends of the filament and T. The long thread-like structure further interacted with the T and then separated into two parts, representing the third step reconnection. Finally, another similar long thread-like structure, which intertwined with the fixed filament threads, appeared. H {sub α} observations show that this twisted structure is a longer sinistral filament. Based on the observed photospheric vector magnetograms, we performed a non-linear force-free field extrapolation to reconstruct the magnetic fields above the photosphere and found that the coronal magnetic field lines associated with the filament consists of two twisted flux ropes winding around each other. These results suggest that magnetic interactions among filaments and their adjacent SFs and T could lead to the growth of the filaments, and the filament is probably supported in a flux rope.« less

  18. Stream Splitting in Support of Intrusion Detection

    DTIC Science & Technology

    2003-06-01

    increased. Every computer on the Internet has no need to see the traffic of every other computer on the Internet. Indeed if this was so, nothing would get ...distinguishes the stream splitter from other network analysis tools. B. HIGH LEVEL DESIGN To get the desired level of performance, a multi-threaded...of greater concern than added accuracy of a Bayesian model. This is a case where close is good enough . b. PassiveSensors Though similar to active

  19. An adaptive transmission protocol for managing dynamic shared states in collaborative surgical simulation.

    PubMed

    Qin, J; Choi, K S; Ho, Simon S M; Heng, P A

    2008-01-01

    A force prediction algorithm is proposed to facilitate virtual-reality (VR) based collaborative surgical simulation by reducing the effect of network latencies. State regeneration is used to correct the estimated prediction. This algorithm is incorporated into an adaptive transmission protocol in which auxiliary features such as view synchronization and coupling control are equipped to ensure the system consistency. We implemented this protocol using multi-threaded technique on a cluster-based network architecture.

  20. Parallel satellite orbital situational problems solver for space missions design and control

    NASA Astrophysics Data System (ADS)

    Atanassov, Atanas Marinov

    2016-11-01

    Solving different scientific problems for space applications demands implementation of observations, measurements or realization of active experiments during time intervals in which specific geometric and physical conditions are fulfilled. The solving of situational problems for determination of these time intervals when the satellite instruments work optimally is a very important part of all activities on every stage of preparation and realization of space missions. The elaboration of universal, flexible and robust approach for situation analysis, which is easily portable toward new satellite missions, is significant for reduction of missions' preparation times and costs. Every situation problem could be based on one or more situation conditions. Simultaneously solving different kinds of situation problems based on different number and types of situational conditions, each one of them satisfied on different segments of satellite orbit requires irregular calculations. Three formal approaches are presented. First one is related to situation problems description that allows achieving flexibility in situation problem assembling and presentation in computer memory. The second formal approach is connected with developing of situation problem solver organized as processor that executes specific code for every particular situational condition. The third formal approach is related to solver parallelization utilizing threads and dynamic scheduling based on "pool of threads" abstraction and ensures a good load balance. The developed situation problems solver is intended for incorporation in the frames of multi-physics multi-satellite space mission's design and simulation tools.

  1. Online discussion groups for bulimia nervosa: an inductive approach to Internet-based communication between patients.

    PubMed

    Wesemann, Dorette; Grunwald, Martin

    2008-09-01

    Online discussion forums are often used by people with eating disorders. This study analyses 2,072 threads containing a total of 14,903 postings from an unmoderated German "prorecovery" forum for persons suffering from bulimia nervosa (www.ab-server.de) during the period from October 2004 to May 2006. The threads were inductively analyzed for underlying structural types, and the various types found were then analyzed for differences in temporal and quantitative parameters. Communication in the online discussion forum occurred in three types of thread: (1) problem-oriented threads (78.8% of threads), (2) communication-oriented threads (15.3% of threads), and (3) metacommunication threads (2.6% of threads). Metacommunication threads contained significantly more postings than problem-oriented and communication-oriented threads, and they were viewed significantly more often. Moreover, there are temporal differences between the structural types. Topics relating to active management of the disorder receive great attention in prorecovery forums. (c) 2008 by Wiley Periodicals, Inc.

  2. Methods for improved forewarning of critical events across multiple data channels

    DOEpatents

    Hively, Lee M [Philadelphia, TN

    2007-04-24

    This disclosed invention concerns improvements in forewarning of critical events via phase-space dissimilarity analysis of data from mechanical devices, electrical devices, biomedical data, and other physical processes. First, a single channel of process-indicative data is selected that can be used in place of multiple data channels without sacrificing consistent forewarning of critical events. Second, the method discards data of inadequate quality via statistical analysis of the raw data, because the analysis of poor quality data always yields inferior results. Third, two separate filtering operations are used in sequence to remove both high-frequency and low-frequency artifacts using a zero-phase quadratic filter. Fourth, the method constructs phase-space dissimilarity measures (PSDM) by combining of multi-channel time-serial data into a multi-channel time-delay phase-space reconstruction. Fifth, the method uses a composite measure of dissimilarity (C.sub.i) to provide a forewarning of failure and an indicator of failure onset.

  3. Cell-laden composite suture threads for repairing damaged tendons.

    PubMed

    Costa-Almeida, Raquel; Domingues, Rui M A; Fallahi, Afsoon; Avci, Huseyin; Yazdi, Iman K; Akbari, Mohsen; Reis, Rui L; Tamayol, Ali; Gomes, Manuela E; Khademhosseini, Ali

    2018-04-01

    Tendons have limited regenerative capacity due to their low cellularity and hypovascular nature, which results in poor clinical outcomes of presently used therapies. As tendon injuries are often observed in active adults, it poses an increasing socio-economic burden on healthcare systems. Currently, suture threads are used during surgical repair to anchor the tissue graft or to connect injured ends. Here, we created composite suture threads coated with a layer of cell-laden hydrogel that can be used for bridging the injured tissue aiming at tendon regeneration. In addition, the fibres can be used to engineer 3-dimensional constructs through textile processes mimicking the architecture and mechanical properties of soft tissues, including tendons and ligaments. Encapsulated human tendon-derived cells migrated within the hydrogel and aligned at the surface of the core thread. An up-regulation of tendon-related genes (scleraxis and tenascin C) and genes involved in matrix remodelling (matrix metalloproteinases 1, matrix metalloproteinases 2) was observed. Cells were able to produce a collagen-rich matrix, remodelling their micro-environment, which is structurally comparable to native tendon tissue. Copyright © 2017 John Wiley & Sons, Ltd.

  4. Manufactured Textile Fibers

    NASA Astrophysics Data System (ADS)

    Gupta, Bhupender S.

    The first conversion of naturally occurring fibers into threads strong enough to be looped into snares, knit to form nets, or woven into fabrics is lost in prehistory. Unlike stone weapons, such threads, cords, and fabrics—being organic in nature—have in most part disappeared, although in some dry caves traces remain. There is ample evidence to indicate that spindles used to assist in the twisting of fibers together had been developed long before the dawn of recorded history. In that spinning process, fibers such as wool were drawn out of a loose mass, perhaps held in a distaff, and made parallel by human fingers. (A maidservant so spins in Giotto's The Annunciation to Anne, ca. A.D. 1306, Arena Chapel, Padua, Italy.1) A rod (spindle), hooked to the lengthening thread, was rotated so that the fibers while so held were twisted together to form additional thread. The finished length then was wound by hand around the spindle, which, in becoming the core on which the finished product was accumulated, served the dual role of twisting and storing, and, in so doing, established a principle still in use today.

  5. Center for Technology for Advanced Scientific Componet Software (TASCS)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Govindaraju, Madhusudhan

    Advanced Scientific Computing Research Computer Science FY 2010Report Center for Technology for Advanced Scientific Component Software: Distributed CCA State University of New York, Binghamton, NY, 13902 Summary The overall objective of Binghamton's involvement is to work on enhancements of the CCA environment, motivated by the applications and research initiatives discussed in the proposal. This year we are working on re-focusing our design and development efforts to develop proof-of-concept implementations that have the potential to significantly impact scientific components. We worked on developing parallel implementations for non-hydrostatic code and worked on a model coupling interface for biogeochemical computations coded in MATLAB.more » We also worked on the design and implementation modules that will be required for the emerging MapReduce model to be effective for scientific applications. Finally, we focused on optimizing the processing of scientific datasets on multi-core processors. Research Details We worked on the following research projects that we are working on applying to CCA-based scientific applications. 1. Non-Hydrostatic Hydrodynamics: Non-static hydrodynamics are significantly more accurate at modeling internal waves that may be important in lake ecosystems. Non-hydrostatic codes, however, are significantly more computationally expensive, often prohibitively so. We have worked with Chin Wu at the University of Wisconsin to parallelize non-hydrostatic code. We have obtained a speed up of about 26 times maximum. Although this is significant progress, we hope to improve the performance further, such that it becomes a practical alternative to hydrostatic codes. 2. Model-coupling for water-based ecosystems: To answer pressing questions about water resources requires that physical models (hydrodynamics) be coupled with biological and chemical models. Most hydrodynamics codes are written in Fortran, however, while most ecologists work in MATLAB. This disconnect creates a great barrier. To address this, we are working on a model coupling interface that will allow biogeochemical computations written in MATLAB to couple with Fortran codes. This will greatly improve the productivity of ecosystem scientists. 2. Low overhead and Elastic MapReduce Implementation Optimized for Memory and CPU-Intensive Applications: Since its inception, MapReduce has frequently been associated with Hadoop and large-scale datasets. Its deployment at Amazon in the cloud, and its applications at Yahoo! for large-scale distributed document indexing and database building, among other tasks, have thrust MapReduce to the forefront of the data processing application domain. The applicability of the paradigm however extends far beyond its use with data intensive applications and diskbased systems, and can also be brought to bear in processing small but CPU intensive distributed applications. MapReduce however carries its own burdens. Through experiments using Hadoop in the context of diverse applications, we uncovered latencies and delay conditions potentially inhibiting the expected performance of a parallel execution in CPU-intensive applications. Furthermore, as it currently stands, MapReduce is favored for data-centric applications, and as such tends to be solely applied to disk-based applications. The paradigm, falls short in bringing its novelty to diskless systems dedicated to in-memory applications, and compute intensive programs processing much smaller data, but requiring intensive computations. In this project, we focused both on the performance of processing large-scale hierarchical data in distributed scientific applications, as well as the processing of smaller but demanding input sizes primarily used in diskless, and memory resident I/O systems. We designed LEMO-MR [1], a Low overhead, elastic, configurable for in- memory applications, and on-demand fault tolerance, an optimized implementation of MapReduce, for both on disk and in memory applications. We conducted experiments to identify not only the necessary components of this model, but also trade offs and factors to be considered. We have initial results to show the efficacy of our implementation in terms of potential speedup that can be achieved for representative data sets used by cloud applications. We have quantified the performance gains exhibited by our MapReduce implementation over Apache Hadoop in a compute intensive environment. 3. Cache Performance Optimization for Processing XML and HDF-based Application Data on Multi-core Processors: It is important to design and develop scientific middleware libraries to harness the opportunities presented by emerging multi-core processors. Implementations of scientific middleware and applications that do not adapt to the programming paradigm when executing on emerging processors can severely impact the overall performance. In this project, we focused on the utilization of the L2 cache, which is a critical shared resource on chip multiprocessors (CMP). The access pattern of the shared L2 cache, which is dependent on how the application schedules and assigns processing work to each thread, can either enhance or hurt the ability to hide memory latency on a multi-core processor. Therefore, while processing scientific datasets such as HDF5, it is essential to conduct fine-grained analysis of cache utilization, to inform scheduling decisions in multi-threaded programming. In this project, using the TAU toolkit for performance feedback from dual- and quad-core machines, we conducted performance analysis and recommendations on how processing threads can be scheduled on multi-core nodes to enhance the performance of a class of scientific applications that requires processing of HDF5 data. In particular, we quantified the gains associated with the use of the adaptations we have made to the Cache-Affinity and Balanced-Set scheduling algorithms to improve L2 cache performance, and hence the overall application execution time [2]. References: 1. Zacharia Fadika, Madhusudhan Govindaraju, ``MapReduce Implementation for Memory-Based and Processing Intensive Applications'', accepted in 2nd IEEE International Conference on Cloud Computing Technology and Science, Indianapolis, USA, Nov 30 - Dec 3, 2010. 2. Rajdeep Bhowmik, Madhusudhan Govindaraju, ``Cache Performance Optimization for Processing XML-based Application Data on Multi-core Processors'', in proceedings of The 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, May 17-20, 2010, Melbourne, Victoria, Australia. Contact Information: Madhusudhan Govindaraju Binghamton University State University of New York (SUNY) mgovinda@cs.binghamton.edu Phone: 607-777-4904« less

  6. Sediment Core Extrusion Method at Millimeter Resolution Using a Calibrated, Threaded-rod.

    PubMed

    Schwing, Patrick T; Romero, Isabel C; Larson, Rebekka A; O'Malley, Bryan J; Fridrik, Erika E; Goddard, Ethan A; Brooks, Gregg R; Hastings, David W; Rosenheim, Brad E; Hollander, David J; Grant, Guy; Mulhollan, Jim

    2016-08-17

    Aquatic sediment core subsampling is commonly performed at cm or half-cm resolution. Depending on the sedimentation rate and depositional environment, this resolution provides records at the annual to decadal scale, at best. An extrusion method, using a calibrated, threaded-rod is presented here, which allows for millimeter-scale subsampling of aquatic sediment cores of varying diameters. Millimeter scale subsampling allows for sub-annual to monthly analysis of the sedimentary record, an order of magnitude higher than typical sampling schemes. The extruder consists of a 2 m aluminum frame and base, two core tube clamps, a threaded-rod, and a 1 m piston. The sediment core is placed above the piston and clamped to the frame. An acrylic sampling collar is affixed to the upper 5 cm of the core tube and provides a platform from which to extract sub-samples. The piston is rotated around the threaded-rod at calibrated intervals and gently pushes the sediment out the top of the core tube. The sediment is then isolated into the sampling collar and placed into an appropriate sampling vessel (e.g., jar or bag). This method also preserves the unconsolidated samples (i.e., high pore water content) at the surface, providing a consistent sampling volume. This mm scale extrusion method was applied to cores collected in the northern Gulf of Mexico following the Deepwater Horizon submarine oil release. Evidence suggests that it is necessary to sample at the mm scale to fully characterize events that occur on the monthly time-scale for continental slope sediments.

  7. Sediment Core Extrusion Method at Millimeter Resolution Using a Calibrated, Threaded-rod

    PubMed Central

    Schwing, Patrick T.; Romero, Isabel C.; Larson, Rebekka A.; O'Malley, Bryan J.; Fridrik, Erika E.; Goddard, Ethan A.; Brooks, Gregg R.; Hastings, David W.; Rosenheim, Brad E.; Hollander, David J.; Grant, Guy; Mulhollan, Jim

    2016-01-01

    Aquatic sediment core subsampling is commonly performed at cm or half-cm resolution. Depending on the sedimentation rate and depositional environment, this resolution provides records at the annual to decadal scale, at best. An extrusion method, using a calibrated, threaded-rod is presented here, which allows for millimeter-scale subsampling of aquatic sediment cores of varying diameters. Millimeter scale subsampling allows for sub-annual to monthly analysis of the sedimentary record, an order of magnitude higher than typical sampling schemes. The extruder consists of a 2 m aluminum frame and base, two core tube clamps, a threaded-rod, and a 1 m piston. The sediment core is placed above the piston and clamped to the frame. An acrylic sampling collar is affixed to the upper 5 cm of the core tube and provides a platform from which to extract sub-samples. The piston is rotated around the threaded-rod at calibrated intervals and gently pushes the sediment out the top of the core tube. The sediment is then isolated into the sampling collar and placed into an appropriate sampling vessel (e.g., jar or bag). This method also preserves the unconsolidated samples (i.e., high pore water content) at the surface, providing a consistent sampling volume. This mm scale extrusion method was applied to cores collected in the northern Gulf of Mexico following the Deepwater Horizon submarine oil release. Evidence suggests that it is necessary to sample at the mm scale to fully characterize events that occur on the monthly time-scale for continental slope sediments. PMID:27585268

  8. CUDAMPF: a multi-tiered parallel framework for accelerating protein sequence search in HMMER on CUDA-enabled GPU.

    PubMed

    Jiang, Hanyu; Ganesan, Narayan

    2016-02-27

    HMMER software suite is widely used for analysis of homologous protein and nucleotide sequences with high sensitivity. The latest version of hmmsearch in HMMER 3.x, utilizes heuristic-pipeline which consists of MSV/SSV (Multiple/Single ungapped Segment Viterbi) stage, P7Viterbi stage and the Forward scoring stage to accelerate homology detection. Since the latest version is highly optimized for performance on modern multi-core CPUs with SSE capabilities, only a few acceleration attempts report speedup. However, the most compute intensive tasks within the pipeline (viz., MSV/SSV and P7Viterbi stages) still stand to benefit from the computational capabilities of massively parallel processors. A Multi-Tiered Parallel Framework (CUDAMPF) implemented on CUDA-enabled GPUs presented here, offers a finer-grained parallelism for MSV/SSV and Viterbi algorithms. We couple SIMT (Single Instruction Multiple Threads) mechanism with SIMD (Single Instructions Multiple Data) video instructions with warp-synchronism to achieve high-throughput processing and eliminate thread idling. We also propose a hardware-aware optimal allocation scheme of scarce resources like on-chip memory and caches in order to boost performance and scalability of CUDAMPF. In addition, runtime compilation via NVRTC available with CUDA 7.0 is incorporated into the presented framework that not only helps unroll innermost loop to yield upto 2 to 3-fold speedup than static compilation but also enables dynamic loading and switching of kernels depending on the query model size, in order to achieve optimal performance. CUDAMPF is designed as a hardware-aware parallel framework for accelerating computational hotspots within the hmmsearch pipeline as well as other sequence alignment applications. It achieves significant speedup by exploiting hierarchical parallelism on single GPU and takes full advantage of limited resources based on their own performance features. In addition to exceeding performance of other acceleration attempts, comprehensive evaluations against high-end CPUs (Intel i5, i7 and Xeon) shows that CUDAMPF yields upto 440 GCUPS for SSV, 277 GCUPS for MSV and 14.3 GCUPS for P7Viterbi all with 100 % accuracy, which translates to a maximum speedup of 37.5, 23.1 and 11.6-fold for MSV, SSV and P7Viterbi respectively. The source code is available at https://github.com/Super-Hippo/CUDAMPF.

  9. Manyscale Computing for Sensor Processing in Support of Space Situational Awareness

    NASA Astrophysics Data System (ADS)

    Schmalz, M.; Chapman, W.; Hayden, E.; Sahni, S.; Ranka, S.

    2014-09-01

    Increasing image and signal data burden associated with sensor data processing in support of space situational awareness implies continuing computational throughput growth beyond the petascale regime. In addition to growing applications data burden and diversity, the breadth, diversity and scalability of high performance computing architectures and their various organizations challenge the development of a single, unifying, practicable model of parallel computation. Therefore, models for scalable parallel processing have exploited architectural and structural idiosyncrasies, yielding potential misapplications when legacy programs are ported among such architectures. In response to this challenge, we have developed a concise, efficient computational paradigm and software called Manyscale Computing to facilitate efficient mapping of annotated application codes to heterogeneous parallel architectures. Our theory, algorithms, software, and experimental results support partitioning and scheduling of application codes for envisioned parallel architectures, in terms of work atoms that are mapped (for example) to threads or thread blocks on computational hardware. Because of the rigor, completeness, conciseness, and layered design of our manyscale approach, application-to-architecture mapping is feasible and scalable for architectures at petascales, exascales, and above. Further, our methodology is simple, relying primarily on a small set of primitive mapping operations and support routines that are readily implemented on modern parallel processors such as graphics processing units (GPUs) and hybrid multi-processors (HMPs). In this paper, we overview the opportunities and challenges of manyscale computing for image and signal processing in support of space situational awareness applications. We discuss applications in terms of a layered hardware architecture (laboratory > supercomputer > rack > processor > component hierarchy). Demonstration applications include performance analysis and results in terms of execution time as well as storage, power, and energy consumption for bus-connected and/or networked architectures. The feasibility of the manyscale paradigm is demonstrated by addressing four principal challenges: (1) architectural/structural diversity, parallelism, and locality, (2) masking of I/O and memory latencies, (3) scalability of design as well as implementation, and (4) efficient representation/expression of parallel applications. Examples will demonstrate how manyscale computing helps solve these challenges efficiently on real-world computing systems.

  10. Two years' outcome of thread lifting with absorbable barbed PDO threads: Innovative score for objective and subjective assessment.

    PubMed

    Ali, Yasser Helmy

    2018-02-01

    Thread-lifting rejuvenation procedures have evolved again, with the development of absorbable threads. Although they have gained popularity among plastic surgeons and dermatologists, very few articles have been written in literature about absorbable threads. This study aims to evaluate two years' outcome of thread lifting using absorbable barbed threads for facial rejuvenation. Prospective comparative stud both objectively and subjectively and follow-up assessment for 24 months. Thread lifting for face rejuvenation has significant long-lasting effects that include skin lifting from 3-10 mm and high degree of patients' satisfaction with less incidence rate of complications, about 4.8%. Augmented results are obtained when thread lifting is combined with other lifting and rejuvenation modalities. Significant facial rejuvenation is achieved by thread lifting and highly augmented results are observed when they are combined with Botox, fillers, and/or platelet rich plasma (PRP) rejuvenations.

  11. Thread gauge for tapered threads

    DOEpatents

    Brewster, Albert L.

    1994-01-11

    The thread gauge permits the user to determine the pitch diameter of tapered threads at the intersection of the pitch cone and the end face of the object being measured. A pair of opposed anvils having lines of threads which match the configuration and taper of the threads on the part being measured are brought into meshing engagement with the threads on opposite sides of the part. The anvils are located linearly into their proper positions by stop fingers on the anvils that are brought into abutting engagement with the end face of the part. This places predetermined reference points of the pitch cone of the thread anvils in registration with corresponding points on the end face of the part being measured, resulting in an accurate determination of the pitch diameter at that location. The thread anvils can be arranged for measuring either internal or external threads.

  12. Thread gauge for tapered threads

    DOEpatents

    Brewster, A.L.

    1994-01-11

    The thread gauge permits the user to determine the pitch diameter of tapered threads at the intersection of the pitch cone and the end face of the object being measured. A pair of opposed anvils having lines of threads which match the configuration and taper of the threads on the part being measured are brought into meshing engagement with the threads on opposite sides of the part. The anvils are located linearly into their proper positions by stop fingers on the anvils that are brought into abutting engagement with the end face of the part. This places predetermined reference points of the pitch cone of the thread anvils in registration with corresponding points on the end face of the part being measured, resulting in an accurate determination of the pitch diameter at that location. The thread anvils can be arranged for measuring either internal or external threads. 13 figures.

  13. Growth and Optimization of 2 Micrometers InGaSb/AlGaSb Quantum-Well-Based VECSELs on GaAs/AlGaAs DBRs

    DTIC Science & Technology

    2013-08-01

    overwhelming nonradiative recombination losses in the antimonide active region. Furthermore, if the growth of the antimonide active region is done on a GaAs...This is important as threading dislocations would introduce a strong nonradiative recombination process in the QWs and relaxation that is not 100...These defects can act as nonradiative recombination centers. Thus, the source of the threading dislocations and their density in the active region

  14. 2005 USSOCOM Chemical, Biological, Radiological Conference and Exhibition

    DTIC Science & Technology

    2005-12-08

    Cree, Inc. 22 Signal respose to releases 12:42 12:48 12:54 13:0 13:6 13:12 13:18 -2 0 2 4 6 8 10 12 x 10-3 Processed Data for AS-B2-009-09-03-05...acrid smell and onset of nerve agent symptoms Confidential USSOCOM Scenario 3: Chemical Agent Attack • First responders don their Self -Contained...Nuclear (CBRN) detectors Smart Threads is a dynamic, easily expandable, self - configuring platform Smart Threads Integrated Radiation Sensors (STIRS

  15. A model of human event detection in multiple process monitoring situations

    NASA Technical Reports Server (NTRS)

    Greenstein, J. S.; Rouse, W. B.

    1978-01-01

    It is proposed that human decision making in many multi-task situations might be modeled in terms of the manner in which the human detects events related to his tasks and the manner in which he allocates his attention among his tasks once he feels events have occurred. A model of human event detection performance in such a situation is presented. An assumption of the model is that, in attempting to detect events, the human generates the probability that events have occurred. Discriminant analysis is used to model the human's generation of these probabilities. An experimental study of human event detection performance in a multiple process monitoring situation is described and the application of the event detection model to this situation is addressed. The experimental study employed a situation in which subjects simulataneously monitored several dynamic processes for the occurrence of events and made yes/no decisions on the presence of events in each process. Input to the event detection model of the information displayed to the experimental subjects allows comparison of the model's performance with the performance of the subjects.

  16. Process Specification for Eddy Current Inspection

    NASA Technical Reports Server (NTRS)

    Koshti, Ajay

    2011-01-01

    This process specification establishes the minimum requirements for eddy current inspection of flat surfaces, fastener holes, threaded fasteners and seamless and welded tubular products made from nonmagnetic alloys such as aluminum and stainless steel.

  17. Interplay of catalysis, fidelity, threading, and processivity in the exo- and endonucleolytic reactions of human exonuclease I

    PubMed Central

    Shi, Yuqian; Hellinga, Homme W.; Beese, Lorena S.

    2017-01-01

    Human exonuclease 1 (hExo1) is a member of the RAD2/XPG structure-specific 5′-nuclease superfamily. Its dominant, processive 5′–3′ exonuclease and secondary 5′-flap endonuclease activities participate in various DNA repair, recombination, and replication processes. A single active site processes both recessed ends and 5′-flap substrates. By initiating enzyme reactions in crystals, we have trapped hExo1 reaction intermediates that reveal structures of these substrates before and after their exo- and endonucleolytic cleavage, as well as structures of uncleaved, unthreaded, and partially threaded 5′ flaps. Their distinctive 5′ ends are accommodated by a small, mobile arch in the active site that binds recessed ends at its base and threads 5′ flaps through a narrow aperture within its interior. A sequence of successive, interlocking conformational changes guides the two substrate types into a shared reaction mechanism that catalyzes their cleavage by an elaborated variant of the two-metal, in-line hydrolysis mechanism. Coupling of substrate-dependent arch motions to transition-state stabilization suppresses inappropriate or premature cleavage, enhancing processing fidelity. The striking reduction in flap conformational entropy is catalyzed, in part, by arch motions and transient binding interactions between the flap and unprocessed DNA strand. At the end of the observed reaction sequence, hExo1 resets without relinquishing DNA binding, suggesting a structural basis for its processivity. PMID:28533382

  18. Design of internal screw thread measuring device based on the Three-Line method principle

    NASA Astrophysics Data System (ADS)

    Hu, Dachao; Chen, Jianguo

    2010-08-01

    In accordance with the principle of Three-Line, this paper analyze the correlation of every main parameter of internal screw thread, and then designed a device to measure the main parameters of internal screw thread. Internal thread parameters, such as the pitch diameter, thread angle and screw-pitch of common screw thread, terraced screw thread, zigzag screw thread were obtained through calculation and measurement. The practical applications have proved that this device is convenience to use, and the measurements have a high accuracy. Meanwhile, the application for the patent of invention has been accepted by the Patent Office (Filing number: 200710044081.5).

  19. Low-frequency seismic events in a wider volcanological context

    NASA Astrophysics Data System (ADS)

    Neuberg, J. W.; Collombet, M.

    2006-12-01

    Low-frequency seismic events have been in the centre of attention for several years, particularly on volcanoes with highly viscous magmas. The ultimate aim is to detect changes in volcanic activity by identifying changes in the seismic behaviour in order to forecast an eruption, or in case of an ongoing eruption, forecast the short and longterm behaviour of the volcanic system. A major boost in recent years arose through several attempts of multi-parameter volcanic monitoring and modelling programs, which allowed multi-disciplinary groups of volcanologists to interpret seismic signals together with, e.g. ground deformation, stress field analysis and petrological information. This talk will give several examples of such multi-disciplinary projects, focussing on the joint modelling of seismic source processes for low-frequency events together with advanced magma flow models, and the signs of magma movement in the deformation and stress field at the surface.

  20. Deviation of the typical AAA substrate-threading pore prevents fatal protein degradation in yeast Cdc48.

    PubMed

    Esaki, Masatoshi; Islam, Md Tanvir; Tani, Naoki; Ogura, Teru

    2017-07-14

    Yeast Cdc48 is a well-conserved, essential chaperone of ATPases associated with diverse cellular activity (AAA) proteins, which recognizes substrate proteins and modulates their conformations to carry out many cellular processes. However, the fundamental mechanisms underlying the diverse pivotal roles of Cdc48 remain unknown. Almost all AAA proteins form a ring-shaped structure with a conserved aromatic amino acid residue that is essential for proper function. The threading mechanism hypothesis suggests that this residue guides the intrusion of substrate proteins into a narrow pore of the AAA ring, thereby becoming unfolded. By contrast, the aromatic residue in one of the two AAA rings of Cdc48 has been eliminated through evolution. Here, we show that artificial retrieval of this aromatic residue in Cdc48 is lethal, and essential features to support the threading mechanism are required to exhibit the lethal phenotype. In particular, genetic and biochemical analyses of the Cdc48 lethal mutant strongly suggested that when in complex with the 20S proteasome, essential proteins are abnormally forced to thread through the Cdc48 pore to become degraded, which was not detected in wild-type Cdc48. Thus, the widely applicable threading model is less effective for wild-type Cdc48; rather, Cdc48 might function predominantly through an as-yet-undetermined mechanism.

Top